apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
59b578c33728f9198dae3e91ddd6d40d013a31cf
FastDeploy/fastdeploy/model_executor/layers
AIbin 59b578c337 [Feature]Supports SWA based on appendattn (#6547)
2026-03-01 19:02:08 +08:00
Name                 Last commit                                                               Date
attention            [Feature]Supports SWA based on appendattn (#6547)                         2026-03-01 19:02:08 +08:00
backends             [XPU] support warmup with ep & remove apply_tp_fused_op (#6289)           2026-02-28 15:40:36 +08:00
batch_invariant_ops  [CI] Sync mm_batch_invariant with paddle.mm update (#6557)                2026-02-28 14:56:42 +08:00
moe                  fix reshard error (#6536)                                                 2026-02-27 22:22:37 +08:00
pool                 …
quantization         fix reshard error (#6536)                                                 2026-02-27 22:22:37 +08:00
sample               [Feature] GPU Memory Optimization and Retirement of V0 Scheduler (#6407)  2026-02-28 15:07:43 +08:00
__init__.py          …
activation.py        …
embeddings.py        [loader]supoort wint2 backend (#6139)                                     2026-02-08 22:42:36 -08:00
linear.py            [Optimization] Enable BF16 gate computation for GLM and Qwen (#6457)      2026-02-26 21:08:46 -08:00
lm_head.py           …
mtp_linear.py        …
normalization.py     …
pooler.py            …
rotary_embedding.py  …
utils.py             …