This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-05-09 08:55:00 +08:00
Code
Issues
Actions
7
Packages
Projects
Releases
Wiki
Activity
Files
690bcb8e5097c27284f9e22adfeb102df9ef8708
FastDeploy
/
custom_ops
/
metax_ops
T
History
Neil Zhu
0edda75a56
[Metax] optimize cutlass moe and flash attention backend (
#5128
)
2025-11-20 16:12:35 +08:00
..
apply_rope.cu
[Metax] optimize cutlass moe and flash attention backend (
#5128
)
2025-11-20 16:12:35 +08:00
fused_moe_helper.h
…
fused_moe_imp_op.h
c++ code format (
#4527
)
2025-10-22 17:59:50 +08:00
fused_moe_op.h
c++ code format (
#4527
)
2025-10-22 17:59:50 +08:00
fused_moe.cu
[Metax] adapt cutlass moe for ernie-vl (
#4685
)
2025-11-03 17:44:27 +08:00
mc_fused_moe_helper.h
[Metax] optimize cutlass moe and flash attention backend (
#5128
)
2025-11-20 16:12:35 +08:00
moe_dispatch.cu
[Metax] adapt cutlass moe for ernie-vl (
#4685
)
2025-11-03 17:44:27 +08:00
moe_ffn.cu
[Metax] optimize cutlass moe and flash attention backend (
#5128
)
2025-11-20 16:12:35 +08:00
moe_reduce.cu
c++ code format (
#4527
)
2025-10-22 17:59:50 +08:00