Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
Code Issues Actions 21 Packages Projects Releases Wiki Activity
Files
0edda75a56bf697de17c348bf4d818d487627920
FastDeploy/custom_ops
T
History
Neil Zhu 0edda75a56 [Metax] optimize cutlass moe and flash attention backend (#5128)
2025-11-20 16:12:35 +08:00
..
cpu_ops
c++ code format (#4527)
2025-10-22 17:59:50 +08:00
gpu_ops
[Speculative Decoding][MTP]Support stop_seqs and pd-split mode (#5029)
2025-11-20 15:26:01 +08:00
iluvatar_ops
c++ code format (#4527)
2025-10-22 17:59:50 +08:00
metax_ops
[Metax] optimize cutlass moe and flash attention backend (#5128)
2025-11-20 16:12:35 +08:00
third_party
[setup optimize]Support git submodule (#4033)
2025-09-11 17:41:16 +08:00
utils
【Fix】fix deepep dispatch (#5036)
2025-11-17 10:34:01 +08:00
xpu_ops
【Hackathon 9th No.109】[CppExtension] [XPU] Support build Custom OP in setuptools 80+ -part (#5106)
2025-11-19 13:33:39 +08:00
0001-DeepGEMM-95e81b3.patch
[feat] support fa3 backend for pd disaggregated (#2695)
2025-07-03 22:33:27 +08:00
MANIFEST.in
[LLM] First commit the llm deployment code
2025-06-09 19:20:15 +08:00
setup_ops_cpu.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
setup_ops.py
[Metax] optimize cutlass moe and flash attention backend (#5128)
2025-11-20 16:12:35 +08:00
Powered by Gitea Version: 1.26.0 Page: 529ms Template: 7ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API