FastDeploy/fastdeploy/model_executor/layers at 59b578c33728f9198dae3e91ddd6d40d013a31cf - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00

AIbin 59b578c337 [Feature]Supports SWA based on appendattn (#6547)

2026-03-01 19:02:08 +08:00

..

[Feature]Supports SWA based on appendattn (#6547)

2026-03-01 19:02:08 +08:00

[XPU] support warmup with ep & remove apply_tp_fused_op (#6289)

2026-02-28 15:40:36 +08:00

batch_invariant_ops

[CI] Sync mm_batch_invariant with paddle.mm update (#6557)

2026-02-28 14:56:42 +08:00

fix reshard error (#6536)

2026-02-27 22:22:37 +08:00

…

fix reshard error (#6536)

2026-02-27 22:22:37 +08:00

[Feature] GPU Memory Optimization and Retirement of V0 Scheduler (#6407)

2026-02-28 15:07:43 +08:00

__init__.py

…

activation.py

…

embeddings.py

[loader]support wint2 backend (#6139)

2026-02-08 22:42:36 -08:00

linear.py

[Optimization] Enable BF16 gate computation for GLM and Qwen (#6457)

2026-02-26 21:08:46 -08:00

lm_head.py

…

mtp_linear.py

…

normalization.py

…

pooler.py

…

rotary_embedding.py

…

utils.py

…