This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 00:17:25 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
8819a039c95e82ff382dfbd0c7743a1002bdb8f1
FastDeploy
/
fastdeploy
/
engine
T
History
Echo-Nie
8819a039c9
[Others] Fix typo (
#7280
)
...
* typo * typo * typo * typo
2026-04-14 17:28:22 +08:00
..
sched
[BugFix][PD Disaggregation][KVCache] Fix low cache hit rate in PD split scenario (
#7364
)
2026-04-14 16:15:43 +08:00
__init__.py
polish code with new pre-commit rule (
#2923
)
2025-07-19 23:19:27 +08:00
args_utils.py
[Loader] add multi-thread model loading (
#6877
)
2026-04-09 23:40:15 -07:00
async_llm.py
Split enable_mm (
#7183
)
2026-04-08 11:25:41 +08:00
common_engine.py
[Others] Fix typo (
#7280
)
2026-04-14 17:28:22 +08:00
engine.py
[Loader] add multi-thread model loading (
#6877
)
2026-04-09 23:40:15 -07:00
expert_service.py
[BugFix][Optimization] Replace silent failures with catchable exceptions and informative error messages (
#6533
)
2026-03-16 21:32:43 +08:00
kv_cache_interface.py
bug: fix list to List (
#4818
)
2025-11-06 16:13:12 +08:00
pooling_params.py
[Feature] support reward model (
#5301
)
2025-12-02 14:55:31 +08:00
register_manager.py
[PD Disaggregation][RL] Register to router with version and support rdma eager connect for pd (
#6718
)
2026-03-17 14:43:35 +08:00
request.py
[Speculative Decoding] Support mtp expert-parallel and support different modality deploy (
#7018
)
2026-03-26 13:52:16 +08:00
resource_manager.py
[BugFix][Optimization] Replace silent failures with catchable exceptions and informative error messages (
#6533
)
2026-03-16 21:32:43 +08:00
sampling_params.py
[Feature] Add Deterministic Inference Support (
#6476
)
2026-02-26 19:31:51 -08:00
tasks.py
[feature] support reward api (
#4518
)
2025-10-29 00:20:28 +08:00