Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
Code Issues Actions 19 Packages Projects Releases Wiki Activity
Files
183b8d325aa542ef0be6e1aaa67c5c4e17c244f2
FastDeploy/fastdeploy/engine
T
History
chenjian 90db0bdd0d [Optimize] Optimize ttft for ep (#6098)
* optimize ttft

* fix

* fix

* fix ci

* fix ci

* fix

* fix bug

* fix

* add comments

* fix ci

* fix
2026-02-04 15:03:29 +08:00
..
sched
[BugFix] Fix bug for enable output caching (#6226)
2026-01-30 10:55:36 +08:00
__init__.py
…
args_utils.py
[Model Runner] Support overlap schedule (#6259)
2026-02-04 10:49:44 +08:00
async_llm.py
[Optimization] The pre- and post-processing pipeline do not perform dict conversion (#5494)
2026-01-22 00:50:52 +08:00
common_engine.py
[Optimize] Optimize ttft for ep (#6098)
2026-02-04 15:03:29 +08:00
engine.py
[Model Runner] Support overlap schedule (#6259)
2026-02-04 10:49:44 +08:00
expert_service.py
[BugFix] Fix port-releated errors in mix mode when FD_ENABLE_INTERNAL_ADAPTER is enabled (#6309)
2026-02-03 19:49:01 +08:00
kv_cache_interface.py
…
pooling_params.py
…
request.py
[RL] add pause, update_weights, resume interface for async RL (#6052)
2026-01-23 10:18:07 +08:00
resource_manager.py
[Feature] Support stopping the inference for the corresponding request in the online service after a disconnection request. (#5320)
2026-01-16 11:46:13 +08:00
sampling_params.py
[Optimization] The pre- and post-processing pipeline do not perform dict conversion (#5494)
2026-01-22 00:50:52 +08:00
tasks.py
…
Powered by Gitea Version: 1.26.0 Page: 2002ms Template: 208ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API