apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-05-07 16:08:58 +08:00
72fe94cb13564c183b013190de0d04b033e89c71
FastDeploy/fastdeploy/model_executor
chen 72fe94cb13 [Feature] support glm tp+dp+ep (#6317)
2026-02-05 21:47:01 +08:00
..
graph_optimization
[Graph Optimization] Support CUDAGraph for P/PD mixed Batch using SOT subgraph spliting mode (#6196)
2026-01-29 16:29:54 +08:00
guided_decoding
…
layers
Support Norm before Rope (#6332)
2026-02-05 15:28:52 +08:00
logits_processor
…
model_loader
[Loader] support dummy load weight (#6169)
2026-01-26 13:58:53 +08:00
models
[Feature] support glm tp+dp+ep (#6317)
2026-02-05 21:47:01 +08:00
ops
[build] support build sm 80,86,89,90 to one whl package (#6173)
2026-01-26 11:30:02 +08:00
__init__.py
…
entropy_utils.py
…
forward_meta.py
[Graph Optimization] Support CUDAGraph for P/PD mixed Batch using SOT subgraph spliting mode (#6196)
2026-01-29 16:29:54 +08:00
load_weight_utils.py
[Intel HPU] enable MoE EP for hpu (#5855)
2026-01-15 13:08:00 +08:00
pre_and_post_process.py
[Model Runner] Support overlap schedule (#6259)
2026-02-04 10:49:44 +08:00
utils.py
[Optimization] Support FA2/FA3/FA4 with attn_mask_q (#6354)
2026-02-05 14:39:00 +08:00
xpu_pre_and_post_process.py
[Feature]Support reorder ids to split prefill and decodes (#5779)
2026-02-03 00:28:02 -08:00