mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
FastDeploy/custom_ops at 223b2f5d86d2fbecfce94f44f9de2ec7dfaeadc3

Latest commit: lzy, 223b2f5d86, "Support setting communication groups in custom_allreduce and the all-to-all\transpose fused operator during the decoding phase." (#5917), 2026-01-12 14:09:39 +08:00
cpu_ops/: c++ code format (#4527), 2025-10-22 17:59:50 +08:00
gpu_ops/: Support setting communication groups in custom_allreduce and the all-to-all\transpose fused operator during the decoding phase. (#5917), 2026-01-12 14:09:39 +08:00
iluvatar_ops/: [Iluvatar] Support V1_KVCACHE_SCHEDULER and paddleocr-vl rope mode (#5555), 2025-12-18 02:14:25 -08:00
metax_ops/: [Metax] adapt to gemm interface on different versions of maca (#5905), 2026-01-07 10:02:24 +08:00
third_party/: [setup optimize]Support git submodule (#4033), 2025-09-11 17:41:16 +08:00
utils/: [Bugfix] Increase the shape of w4afp8 gemm (#5957), 2026-01-09 14:11:17 +08:00
xpu_ops/: [XPU] fix dp4 (#5946), 2026-01-09 20:36:53 +08:00
0001-DeepGEMM-95e81b3.patch: [OP]Remove extra H2D in DeepGemm (#5262), 2025-11-28 14:23:44 +08:00
MANIFEST.in: [LLM] First commit the llm deployment code, 2025-06-09 19:20:15 +08:00
setup_ops_cpu.py: polish code with new pre-commit rule (#2923), 2025-07-19 23:19:27 +08:00
setup_ops.py: [Metax] adapt to gemm interface on different versions of maca (#5905), 2026-01-07 10:02:24 +08:00