This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 00:17:25 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
7cfb0ffba0e38820be172cc66c64bcf0ddfe9c37
FastDeploy
/
fastdeploy
/
model_executor
/
layers
/
moe
T
History
RichardWooSJTU
7cfb0ffba0
fix pfcc deep ep in low latency mode (
#6440
)
2026-03-02 10:35:51 +08:00
..
__init__.py
…
ep.py
fix pfcc deep ep in low latency mode (
#6440
)
2026-03-02 10:35:51 +08:00
fused_moe_backend_base.py
…
fused_moe_cutlass_backend.py
[Optimization] Enable BF16 gate computation for GLM and Qwen (
#6457
)
2026-02-26 21:08:46 -08:00
fused_moe_deepgemm_backend.py
[Optimization] Enable BF16 gate computation for GLM and Qwen (
#6457
)
2026-02-26 21:08:46 -08:00
fused_moe_marlin_backend.py
[Optimization] Enable BF16 gate computation for GLM and Qwen (
#6457
)
2026-02-26 21:08:46 -08:00
fused_moe_triton_backend.py
fix reshard error (
#6536
)
2026-02-27 22:22:37 +08:00
fused_moe_wint2_backend.py
[loader]supoort wint2 backend (
#6139
)
2026-02-08 22:42:36 -08:00
moe.py
[loader]supoort wint2 backend (
#6139
)
2026-02-08 22:42:36 -08:00
routing_indices_cache.py
…
triton_moe_kernels.py
…