apps/FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2026-05-08 08:23:25 +08:00
268276e287c8d9cb8629634432bb35c8b898f936
FastDeploy/fastdeploy/model_executor/layers/moe
bukejiyu dc5917289d [loader]supoort wint2 backend (#6139), 2026-02-08 22:42:36 -08:00
* support wint2
* update
__init__.py: …
ep.py: [Feature] Support Ernie FP8 on sm100 ( the fixed version) (#6304), 2026-02-03 17:47:38 +08:00
fused_moe_backend_base.py: [Feature] Support redundant expert for eplb (#5918), 2026-01-09 17:13:24 +08:00
fused_moe_cutlass_backend.py: [Iluvartar][CI] Fix the error max_tokens_per_expert referenced before assignment (#6083), 2026-01-21 16:01:29 +08:00
fused_moe_deepgemm_backend.py: [Others] support import deepgemm/deepep from fleet ops (#6351), 2026-02-09 11:53:13 +08:00
fused_moe_marlin_backend.py: [New][RL] Support Rollout Routing Replay (#5405), 2025-12-05 22:06:26 +08:00
fused_moe_triton_backend.py: [Feature] FD_USE_PHI_FP8_QUANT (#6320), 2026-02-03 22:33:03 -08:00
fused_moe_wint2_backend.py: [loader]supoort wint2 backend (#6139), 2026-02-08 22:42:36 -08:00
moe.py: [loader]supoort wint2 backend (#6139), 2026-02-08 22:42:36 -08:00
routing_indices_cache.py: [RL] R3 Support Fused Put the Routing of All Layers (#6099), 2026-02-03 04:13:16 -08:00
triton_moe_kernels.py: …