apps/FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2026-05-07 16:08:58 +08:00
490a6551dcff20d7b578e03d9bac1e981e07efc4
FastDeploy/custom_ops
lizexu123  f4902fe42d  [BugFix] fix wint2 (#6109)  2026-01-20 21:46:21 +08:00
    * fix * fix * fix
cpu_ops                       c++ code format (#4527)  2025-10-22 17:59:50 +08:00
gpu_ops                       [BugFix] fix wint2 (#6109)  2026-01-20 21:46:21 +08:00
iluvatar_ops                  [Iluvatar] Support V1_KVCACHE_SCHEDULER and paddleocr-vl rope mode (#5555)  2025-12-18 02:14:25 -08:00
metax_ops                     [Metax] adapt to gemm interface on different versions of maca (#5905)  2026-01-07 10:02:24 +08:00
third_party                   [setup optimize]Support git submodule (#4033)  2025-09-11 17:41:16 +08:00
utils                         【Optim】Optimize grid dimensions using max_tokens_per_expert for MoE models (#6007)  2026-01-15 19:18:42 +08:00
xpu_ops                       [XPU] Support CudaGraph(add block attn cuda_graph support) (#6116)  2026-01-20 19:33:11 +08:00
0001-DeepGEMM-95e81b3.patch   [OP]Remove extra H2D in DeepGemm (#5262)  2025-11-28 14:23:44 +08:00
MANIFEST.in                   [LLM] First commit the llm deployment code  2025-06-09 19:20:15 +08:00
setup_ops_cpu.py              polish code with new pre-commit rule (#2923)  2025-07-19 23:19:27 +08:00
setup_ops.py                  [Feature]Support tag phase token enforce generation (#6034)  2026-01-15 03:59:55 -08:00