apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
183b8d325aa542ef0be6e1aaa67c5c4e17c244f2
FastDeploy/fastdeploy/model_executor
GoldPancake 183b8d325a [RL] Support GLM MTP RL Model (#6267)
2026-02-04 20:14:35 +08:00
Name | Last commit | Date
.. | |
graph_optimization | [Graph Optimization] Support CUDAGraph for P/PD mixed Batch using SOT subgraph spliting mode (#6196) | 2026-01-29 16:29:54 +08:00
guided_decoding | …
layers | [RL] Support GLM MTP RL Model (#6267) | 2026-02-04 20:14:35 +08:00
logits_processor | …
model_loader | [Loader] support dummy load weight (#6169) | 2026-01-26 13:58:53 +08:00
models | [RL] Support GLM MTP RL Model (#6267) | 2026-02-04 20:14:35 +08:00
ops | [build] support build sm 80,86,89,90 to one whl package (#6173) | 2026-01-26 11:30:02 +08:00
__init__.py | …
entropy_utils.py | [Bugfix] Fix entropy calculation bugs (#5941) | 2026-01-08 20:57:35 +08:00
forward_meta.py | [Graph Optimization] Support CUDAGraph for P/PD mixed Batch using SOT subgraph spliting mode (#6196) | 2026-01-29 16:29:54 +08:00
load_weight_utils.py | [Intel HPU] enable MoE EP for hpu (#5855) | 2026-01-15 13:08:00 +08:00
pre_and_post_process.py | [Model Runner] Support overlap schedule (#6259) | 2026-02-04 10:49:44 +08:00
utils.py | [Feature] Support NVFP4 MoE on SM100 (#6003) | 2026-01-29 14:16:07 +08:00
xpu_pre_and_post_process.py | [Feature]Support reorder ids to split prefill and decodes (#5779) | 2026-02-03 00:28:02 -08:00