FastDeploy/fastdeploy/engine at 4694ed2a43046c26f1ef77ab2c036d55eb517ae1 - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-05-09 08:55:00 +08:00

chen d58c1db8a0 [Feature][OP] Append Attn Support CUDA-PDL (#5072)

2025-11-17 20:47:33 +08:00

..

[BugFix] fix num_requests_running after clear_data (#4927)

2025-11-13 13:50:21 +08:00

__init__.py

…

args_utils.py

[Intel HPU] enable level 1 prefix caching and fix some bugs (#4971)

2025-11-14 19:42:50 +08:00

async_llm.py

[PD Disaggregation] remove splitwise deployment on single node and refine the code (#4891)

2025-11-14 09:56:53 +08:00

common_engine.py

[Log] Add trace log and add loggingInstrumentor tool (#4692)

2025-11-17 11:08:57 +08:00

engine.py

[Feature][OP] Append Attn Support CUDA-PDL (#5072)

2025-11-17 20:47:33 +08:00

expert_service.py

…

kv_cache_interface.py

…

pooling_params.py

…

request.py

[PD Disaggregation] remove splitwise deployment on single node and refine the code (#4891)

2025-11-14 09:56:53 +08:00

resource_manager.py

…

sampling_params.py

…

tasks.py

…