apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
51a8a2ed57a3f388a4dba1562e2f31c6dbfad82f
FastDeploy/custom_ops
yinwei 51a8a2ed57 [XPU] Support CudaGraph(add block attn cuda_graph support) (#6116)
* add block attn cuda_graph support
2026-01-20 19:33:11 +08:00
cpu_ops
…
gpu_ops
[Optimization] Avoid unnecessary penalty computation (#6078)
2026-01-19 15:24:12 +08:00
iluvatar_ops
…
metax_ops
[Metax] adapt to gemm interface on different versions of maca (#5905)
2026-01-07 10:02:24 +08:00
third_party
…
utils
【Optim】Optimize grid dimensions using max_tokens_per_expert for MoE models (#6007)
2026-01-15 19:18:42 +08:00
xpu_ops
[XPU] Support CudaGraph(add block attn cuda_graph support) (#6116)
2026-01-20 19:33:11 +08:00
0001-DeepGEMM-95e81b3.patch
…
MANIFEST.in
…
setup_ops_cpu.py
…
setup_ops.py
[Feature]Support tag phase token enforce generation (#6034)
2026-01-15 03:59:55 -08:00