Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 08:21:53 +08:00
Code Issues Actions 19 Packages Projects Releases Wiki Activity
Files
8c3513a410df00ae6a13a7c87f16c2888e2cdeac
FastDeploy/fastdeploy/model_executor/layers/attention
T
History
lizhenyun01 2be8656c29 [BugFix] fix mtp split kv attetion (#5920)
* [BugFix] fix mtp split kv attetion

* clean code

* clean code
2026-01-07 04:07:31 -08:00
..
ops
FA3 support qwen3 (#5441)
2025-12-09 16:16:16 +08:00
__init__.py
…
append_attn_backend.py
[BugFix] fix mtp split kv attetion (#5920)
2026-01-07 04:07:31 -08:00
attention_selecter.py
…
attention.py
[Model] tp+ep support v1_loader (#5465)
2025-12-18 14:31:54 +08:00
base_attention_backend.py
…
block_multihead_attn_backend.py
…
flash_attn_backend.py
FA3 support qwen3 (#5441)
2025-12-09 16:16:16 +08:00
flash_mask_attn_backend.py
make flash_mask attention pybind (#5783)
2025-12-26 14:31:35 +08:00
iluvatar_attn_backend.py
[Iluvatar] Support V1_KVCACHE_SCHEDULER and paddleocr-vl rope mode (#5555)
2025-12-18 02:14:25 -08:00
mla_attention_backend.py
…
moba_attention_backend.py
…
native_paddle_backend.py
…
utils.py
…
xpu_attn_backend.py
[XPU] refactor of block_attn param 'pos_emb_type' (#5511)
2025-12-12 14:30:09 +08:00
Powered by Gitea Version: 1.26.0 Page: 1341ms Template: 4ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API