FastDeploy/fastdeploy/model_executor/layers/moe at 837ddca27308bee8edd535c193f5b946e1d5af39 - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00

Files

T

History

yzwu 837ddca273 [Iluvartar][CI] Fix the error max_tokens_per_expert referenced before assignment (#6083 )

2026-01-21 16:01:29 +08:00

..

__init__.py

…

ep.py

[Feature] Support redundant expert for eplb (#5918 )

2026-01-09 17:13:24 +08:00

fused_moe_backend_base.py

[Feature] Support redundant expert for eplb (#5918 )

2026-01-09 17:13:24 +08:00

fused_moe_cutlass_backend.py

[Iluvartar][CI] Fix the error max_tokens_per_expert referenced before assignment (#6083 )

2026-01-21 16:01:29 +08:00

fused_moe_deepgemm_backend.py

[Feature] Unify fp8 block_wise quant ops (#5991 )

2026-01-15 05:50:37 -08:00

fused_moe_marlin_backend.py

…

fused_moe_triton_backend.py

[Feature] Unify fp8 block_wise quant ops (#5991 )

2026-01-15 05:50:37 -08:00

fused_moe_wint2_backend.py

[BugFix] fix wint2 (#6109 )

2026-01-20 21:46:21 +08:00

moe.py

[Intel HPU] enable MoE EP for hpu (#5855 )

2026-01-15 13:08:00 +08:00

routing_indices_cache.py

[RL][CI] Support Async R3 And Add Accuracy Test (#5937 )

2026-01-14 04:25:06 -08:00

triton_moe_kernels.py

…