FastDeploy/fastdeploy/model_executor at 183b8d325aa542ef0be6e1aaa67c5c4e17c244f2 - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00

Files

T

History

GoldPancake 183b8d325a [RL] Support GLM MTP RL Model (#6267 )

2026-02-04 20:14:35 +08:00

..

graph_optimization

[Graph Optimization] Support CUDAGraph for P/PD mixed Batch using SOT subgraph spliting mode (#6196 )

2026-01-29 16:29:54 +08:00

guided_decoding

…

[RL] Support GLM MTP RL Model (#6267 )

2026-02-04 20:14:35 +08:00

logits_processor

…

[Loader] support dummy load weight (#6169 )

2026-01-26 13:58:53 +08:00

[RL] Support GLM MTP RL Model (#6267 )

2026-02-04 20:14:35 +08:00

[build] support build sm 80,86,89,90 to one whl package (#6173 )

2026-01-26 11:30:02 +08:00

__init__.py

…

entropy_utils.py

[Bugfix] Fix entropy calculation bugs (#5941 )

2026-01-08 20:57:35 +08:00

forward_meta.py

[Graph Optimization] Support CUDAGraph for P/PD mixed Batch using SOT subgraph spliting mode (#6196 )

2026-01-29 16:29:54 +08:00

load_weight_utils.py

[Intel HPU] enable MoE EP for hpu (#5855 )

2026-01-15 13:08:00 +08:00

pre_and_post_process.py

[Model Runner] Support overlap schedule (#6259 )

2026-02-04 10:49:44 +08:00

utils.py

[Feature] Support NVFP4 MoE on SM100 (#6003 )

2026-01-29 14:16:07 +08:00

xpu_pre_and_post_process.py

[Feature]Support reorder ids to split prefill and decodes (#5779 )

2026-02-03 00:28:02 -08:00