This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 00:17:25 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
6f5aa883f796fa868649b1d419d2835e5e141696
FastDeploy
/
custom_ops
T
History
zhupengyang
5780345646
[XPU] fix speculate_verify (
#6985
)
2026-03-24 18:55:09 +08:00
..
cpu_ops
…
gpu_ops
[Speculative Decoding] refactor MTP and optimize spec-decoding postprocess (
#6973
)
2026-03-24 10:19:01 +08:00
iluvatar_ops
[Iluvatar] Optimize decode group_gemm and Support cuda graph for ernie (
#6803
)
2026-03-12 19:21:17 +08:00
metax_ops
[Model Runner] Deprecate not_need_stop (
#6356
)
2026-03-05 10:55:42 +08:00
third_party
…
utils
【Optim】Optimize grid dimensions using max_tokens_per_expert for MoE models (
#6007
)
2026-01-15 19:18:42 +08:00
xpu_ops
[XPU] fix speculate_verify (
#6985
)
2026-03-24 18:55:09 +08:00
0001-DeepGEMM-95e81b3.patch
[OP]Remove extra H2D in DeepGemm (
#5262
)
2025-11-28 14:23:44 +08:00
MANIFEST.in
…
setup_ops_cpu.py
…
setup_ops.py
【Hackathon 10th Spring No.45】FastDeploy 支持在 T4/V100 硬件的编译 -part (
#6488
)
2026-03-23 19:16:23 +08:00