This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 00:17:25 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
837ddca27308bee8edd535c193f5b946e1d5af39
FastDeploy
/
fastdeploy
/
model_executor
/
layers
/
moe
T
History
yzwu
837ddca273
[Iluvartar][CI] Fix the error max_tokens_per_expert referenced before assignment (
#6083
)
2026-01-21 16:01:29 +08:00
..
__init__.py
…
ep.py
[Feature] Support redundant expert for eplb (
#5918
)
2026-01-09 17:13:24 +08:00
fused_moe_backend_base.py
[Feature] Support redundant expert for eplb (
#5918
)
2026-01-09 17:13:24 +08:00
fused_moe_cutlass_backend.py
[Iluvartar][CI] Fix the error max_tokens_per_expert referenced before assignment (
#6083
)
2026-01-21 16:01:29 +08:00
fused_moe_deepgemm_backend.py
[Feature] Unify fp8 block_wise quant ops (
#5991
)
2026-01-15 05:50:37 -08:00
fused_moe_marlin_backend.py
…
fused_moe_triton_backend.py
[Feature] Unify fp8 block_wise quant ops (
#5991
)
2026-01-15 05:50:37 -08:00
fused_moe_wint2_backend.py
[BugFix] fix wint2 (
#6109
)
2026-01-20 21:46:21 +08:00
moe.py
[Intel HPU] enable MoE EP for hpu (
#5855
)
2026-01-15 13:08:00 +08:00
routing_indices_cache.py
[RL][CI] Support Async R3 And Add Accuracy Test (
#5937
)
2026-01-14 04:25:06 -08:00
triton_moe_kernels.py
…