This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-05-09 00:45:13 +08:00
Code
Issues
Actions
7
Packages
Projects
Releases
Wiki
Activity
Files
cd252ec67392dd87382d19135bb83aacd79bbc3d
FastDeploy
/
fastdeploy
/
model_executor
/
layers
/
moe
T
History
Yuan Xiaolan
c71ee0831c
add w4afp8 offline script (
#3636
)
2025-08-29 17:56:05 +08:00
..
__init__.py
…
ep.py
add input_processor plugin (
#3657
)
2025-08-28 22:53:57 +08:00
fused_moe_backend_base.py
[BugFix]fix dp&ep&tp and muti node infer (
#3629
)
2025-08-28 19:09:10 +08:00
fused_moe_cutlass_backend.py
add w4afp8 offline script (
#3636
)
2025-08-29 17:56:05 +08:00
fused_moe_deepgemm_backend.py
add w4afp8 offline script (
#3636
)
2025-08-29 17:56:05 +08:00
fused_moe_marlin_backend.py
…
fused_moe_triton_backend.py
…
fused_moe_wint2_backend.py
【New Feature】集中式支持w4afp8 (
#3644
)
2025-08-28 10:53:24 +08:00
fused_moe_xpu_backend.py
…
moe.py
…
triton_moe_kernels.py
…