This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-05-09 08:55:00 +08:00
Code
Issues
Actions
7
Packages
Projects
Releases
Wiki
Activity
Files
4694ed2a43046c26f1ef77ab2c036d55eb517ae1
FastDeploy
/
fastdeploy
/
engine
T
History
chen
d58c1db8a0
[Feature][OP] Append Attn Support CUDA-PDL (
#5072
)
2025-11-17 20:47:33 +08:00
..
sched
[BugFix] fix num_requests_running after clear_data (
#4927
)
2025-11-13 13:50:21 +08:00
__init__.py
…
args_utils.py
[Intel HPU] enable level 1 prefix caching and fix some bugs (
#4971
)
2025-11-14 19:42:50 +08:00
async_llm.py
[PD Disaggregation] remove splitwise deployment on single node and refine the code (
#4891
)
2025-11-14 09:56:53 +08:00
common_engine.py
[Log] Add trace log and add loggingInstrumentor tool (
#4692
)
2025-11-17 11:08:57 +08:00
engine.py
[Feature][OP] Append Attn Support CUDA-PDL (
#5072
)
2025-11-17 20:47:33 +08:00
expert_service.py
…
kv_cache_interface.py
…
pooling_params.py
…
request.py
[PD Disaggregation] remove splitwise deployment on single node and refine the code (
#4891
)
2025-11-14 09:56:53 +08:00
resource_manager.py
…
sampling_params.py
…
tasks.py
…