Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
Code Issues Actions 19 Packages Projects Releases Wiki Activity
Files
03363cab4c7a360ddb39f3eefc0cef7546815b58
FastDeploy/custom_ops
T
History
周周周 03363cab4c make flash_mask attention pybind (#5783)
2025-12-26 14:31:35 +08:00
..
cpu_ops
c++ code format (#4527)
2025-10-22 17:59:50 +08:00
gpu_ops
make flash_mask attention pybind (#5783)
2025-12-26 14:31:35 +08:00
iluvatar_ops
[Iluvatar] Support V1_KVCACHE_SCHEDULER and paddleocr-vl rope mode (#5555)
2025-12-18 02:14:25 -08:00
metax_ops
[Metax] refactor cutlass moe and optimize flash attention (#5361)
2025-12-10 17:15:17 +08:00
third_party
[setup optimize]Support git submodule (#4033)
2025-09-11 17:41:16 +08:00
utils
fix w4afp8 (#5634)
2025-12-22 13:39:41 +08:00
xpu_ops
[Speculative Decoding] Fix attn_mask_offset for multi-step MTP in mixed and PD-split modes (#5738)
2025-12-25 01:54:59 -08:00
0001-DeepGEMM-95e81b3.patch
[OP]Remove extra H2D in DeepGemm (#5262)
2025-11-28 14:23:44 +08:00
MANIFEST.in
[LLM] First commit the llm deployment code
2025-06-09 19:20:15 +08:00
setup_ops_cpu.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
setup_ops.py
[Feature] Support KV Cache Storage (#5571)
2025-12-25 16:30:35 +08:00
Powered by Gitea Version: 1.26.0 Page: 1257ms Template: 6ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API