This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 00:17:25 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
5416da8c6e6645031ffb6a34fe86ba1bff19eb9d
FastDeploy
/
custom_ops
/
iluvatar_ops
T
History
yzwu
901b38c936
[Iluvatar] Optimize decode group_gemm and Support cuda graph for ernie (
#6803
)
2026-03-12 19:21:17 +08:00
..
runtime
…
cpp_extensions.cc
…
flash_attn_unpadded.cu
…
fused_moe_helper.h
…
fused_moe_imp_op.h
…
fused_moe_op.h
…
mixed_fused_attn.cu
…
moe_dispatch.cu
…
moe_reduce.cu
…
paged_attn.cu
…
prefill_fused_attn.cu
…
restore_tokens_per_expert.cu
…
w8a16_group_gemm.cu
…
w8a16_group_gemv.cu
…