Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 17:11:21 +08:00
Code Issues Actions 23 Packages Projects Releases Wiki Activity
Files
385fe6dade1feed3a0bfe25cfb303e2faa777140
FastDeploy/fastdeploy/model_executor/layers/attention
T
History
周周周 385fe6dade [Others] clean code (#5133)
2025-11-20 18:44:08 +08:00
..
ops
[Others]get_block_shape_and_split_kv_block clean code (#5123)
2025-11-20 16:40:04 +08:00
__init__.py
【FIX】Change the name of sparse attn from moba to plas (#4006) (#4076)
2025-09-23 10:26:40 +08:00
append_attn_backend.py
[Others] clean code (#5133)
2025-11-20 18:44:08 +08:00
attention_selecter.py
…
attention.py
fix Cfp8 for RL load (#4144)
2025-11-03 17:51:51 +08:00
base_attention_backend.py
…
block_multihead_attn_backend.py
[KVCache] support unified cache backend (#4903)
2025-11-12 14:54:52 +08:00
flash_attn_backend.py
[Others]get_block_shape_and_split_kv_block clean code (#5123)
2025-11-20 16:40:04 +08:00
iluvatar_attn_backend.py
[KVCache] support unified cache backend (#4903)
2025-11-12 14:54:52 +08:00
mla_attention_backend.py
[Others]get_block_shape_and_split_kv_block clean code (#5123)
2025-11-20 16:40:04 +08:00
moba_attention_backend.py
[KVCache] support unified cache backend (#4903)
2025-11-12 14:54:52 +08:00
native_paddle_backend.py
…
utils.py
supports pd partn (#4615)
2025-11-04 16:36:35 +08:00
xpu_attn_backend.py
[KVCache] support unified cache backend (#4903)
2025-11-12 14:54:52 +08:00
Powered by Gitea Version: 1.26.0 Page: 1272ms Template: 9ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API