apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2026-04-23 00:17:25 +08:00
cb3b1d120c410b32ac99b75e86163e57328beb9b
FastDeploy/custom_ops
chen  26c47c2afc  update attn_mask_q 2 (#7371)  2026-04-13 23:06:04 +08:00
..
cpu_ops                      [XPU] Refactor pre process (#6993)  2026-04-01 20:29:55 +08:00
gpu_ops                      update attn_mask_q 2 (#7371)  2026-04-13 23:06:04 +08:00
iluvatar_ops                 [Iluvatar] Support wi4a16 group_gemm (#7078)  2026-03-30 19:03:51 +08:00
metax_ops                    [Model Runner] Deprecate not_need_stop (#6356)  2026-03-05 10:55:42 +08:00
third_party                  …
utils                        【Optim】Optimize grid dimensions using max_tokens_per_expert for MoE models (#6007)  2026-01-15 19:18:42 +08:00
xpu_ops                      [XPU] Refactor get_padding_offset to single kernel. (#7029)  2026-04-13 11:04:50 +08:00
0001-DeepGEMM-95e81b3.patch  …
MANIFEST.in                  …
setup_ops_cpu.py             …
setup_ops.py                 [Metax][Fix] add compilation option (#7209)  2026-04-07 02:43:43 -07:00