FastDeploy/fastdeploy/model_executor/layers/attention at 385fe6dade1feed3a0bfe25cfb303e2faa777140 - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 17:11:21 +08:00

周周周 385fe6dade [Others] clean code (#5133)

2025-11-20 18:44:08 +08:00

..

[Others]get_block_shape_and_split_kv_block clean code (#5123)

2025-11-20 16:40:04 +08:00

__init__.py

【FIX】Change the name of sparse attn from moba to plas (#4006) (#4076)

2025-09-23 10:26:40 +08:00

append_attn_backend.py

[Others] clean code (#5133)

2025-11-20 18:44:08 +08:00

attention_selecter.py

…

attention.py

fix Cfp8 for RL load (#4144)

2025-11-03 17:51:51 +08:00

base_attention_backend.py

…

block_multihead_attn_backend.py

[KVCache] support unified cache backend (#4903)

2025-11-12 14:54:52 +08:00

flash_attn_backend.py

[Others]get_block_shape_and_split_kv_block clean code (#5123)

2025-11-20 16:40:04 +08:00

iluvatar_attn_backend.py

[KVCache] support unified cache backend (#4903)

2025-11-12 14:54:52 +08:00

mla_attention_backend.py

[Others]get_block_shape_and_split_kv_block clean code (#5123)

2025-11-20 16:40:04 +08:00

moba_attention_backend.py

[KVCache] support unified cache backend (#4903)

2025-11-12 14:54:52 +08:00

native_paddle_backend.py

…

utils.py

supports pd partn (#4615)

2025-11-04 16:36:35 +08:00

xpu_attn_backend.py

[KVCache] support unified cache backend (#4903)

2025-11-12 14:54:52 +08:00