This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 00:17:25 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
59b578c33728f9198dae3e91ddd6d40d013a31cf
FastDeploy
/
fastdeploy
/
model_executor
/
layers
T
History
AIbin
59b578c337
[Feature]Supports SWA based on appendattn (
#6547
)
2026-03-01 19:02:08 +08:00
..
attention
[Feature]Supports SWA based on appendattn (
#6547
)
2026-03-01 19:02:08 +08:00
backends
[XPU] support warmup with ep & remove apply_tp_fused_op (
#6289
)
2026-02-28 15:40:36 +08:00
batch_invariant_ops
[CI] Sync mm_batch_invariant with paddle.mm update (
#6557
)
2026-02-28 14:56:42 +08:00
moe
fix reshard error (
#6536
)
2026-02-27 22:22:37 +08:00
pool
…
quantization
fix reshard error (
#6536
)
2026-02-27 22:22:37 +08:00
sample
[Feature] GPU Memory Optimization and Retirement of V0 Scheduler (
#6407
)
2026-02-28 15:07:43 +08:00
__init__.py
…
activation.py
…
embeddings.py
[loader]supoort wint2 backend (
#6139
)
2026-02-08 22:42:36 -08:00
linear.py
[Optimization] Enable BF16 gate computation for GLM and Qwen (
#6457
)
2026-02-26 21:08:46 -08:00
lm_head.py
…
mtp_linear.py
…
normalization.py
…
pooler.py
…
rotary_embedding.py
…
utils.py
…