Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 08:21:53 +08:00
Code Issues Actions 19 Packages Projects Releases Wiki Activity
Files
af6c84d48d7b0a3521a2edc529024210f87c8ec0
FastDeploy/fastdeploy/model_executor/layers/moe
T
History
Yuanle Liu 8b05774fad [Others] enhance deep_ep import and support mixed mode flash_mask_attn (#6238)
* support flashmaskattn mixed and enhance deepep import

* update

* fix
2026-01-28 00:02:02 +08:00
..
__init__.py
support w4afp8 EP inference (#3044)
2025-08-25 11:27:45 +08:00
ep.py
[Others] enhance deep_ep import and support mixed mode flash_mask_attn (#6238)
2026-01-28 00:02:02 +08:00
fused_moe_backend_base.py
[Feature] Support redundant expert for eplb (#5918)
2026-01-09 17:13:24 +08:00
fused_moe_cutlass_backend.py
[Iluvartar][CI] Fix the error max_tokens_per_expert referenced before assignment (#6083)
2026-01-21 16:01:29 +08:00
fused_moe_deepgemm_backend.py
[Others] enhance deep_ep import and support mixed mode flash_mask_attn (#6238)
2026-01-28 00:02:02 +08:00
fused_moe_marlin_backend.py
[New][RL] Support Rollout Routing Replay (#5405)
2025-12-05 22:06:26 +08:00
fused_moe_triton_backend.py
[Feature] Unify fp8 block_wise quant ops (#5991)
2026-01-15 05:50:37 -08:00
fused_moe_wint2_backend.py
[BugFix] fix wint2 (#6109)
2026-01-20 21:46:21 +08:00
moe.py
Support MXFP4 for GPT-OSS (#5435)
2026-01-22 14:21:01 +08:00
routing_indices_cache.py
[UT] Add GLM E2E tests for non-MTP and MTP (#6163)
2026-01-23 10:34:29 +08:00
triton_moe_kernels.py
[OPs] MoE support wfp8afp8(channelwise) and improve per_token_quant_fp8 (#4238)
2025-09-24 16:39:51 +08:00
Powered by Gitea Version: 1.26.0 Page: 141ms Template: 5ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API