FastDeploy/fastdeploy/model_executor/layers/attention/ops at 02d32eea3b5688bd1eae4a2e92e84257ce4fbbef - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00

Files

T

History

周周周 d957ccd46d seq_lens related tensor shape -> [max_num_seqs] (#6535 )

2026-03-02 11:18:30 +08:00

..

__init__.py

…

append_attention.py

[Feature]Supports SWA based on appendattn (#6547 )

2026-03-01 19:02:08 +08:00

flash_mask_attention.py

seq_lens related tensor shape -> [max_num_seqs] (#6535 )

2026-03-02 11:18:30 +08:00

get_attn_mask_q.py

…

get_block_shape_and_split_kv_block.py

…

gqa_rope_write_cache.py

…

init_kv_signal_per_query.py

…

init_signal_layerwise.py

…

open_shm_and_get_meta_signal.py

…

pre_cache_len_concat.py

…