This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 00:17:25 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
02d32eea3b5688bd1eae4a2e92e84257ce4fbbef
FastDeploy
/
fastdeploy
/
model_executor
/
layers
/
attention
/
ops
T
History
周周周
d957ccd46d
seq_lens related tensor shape -> [max_num_seqs] (
#6535
)
2026-03-02 11:18:30 +08:00
..
__init__.py
…
append_attention.py
[Feature]Supports SWA based on appendattn (
#6547
)
2026-03-01 19:02:08 +08:00
flash_mask_attention.py
seq_lens related tensor shape -> [max_num_seqs] (
#6535
)
2026-03-02 11:18:30 +08:00
get_attn_mask_q.py
…
get_block_shape_and_split_kv_block.py
…
gqa_rope_write_cache.py
…
init_kv_signal_per_query.py
…
init_signal_layerwise.py
…
open_shm_and_get_meta_signal.py
…
pre_cache_len_concat.py
…