Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-24 09:44:10 +08:00
Code Issues Actions 9 Packages Projects Releases Wiki Activity
Files
9d3551cfbb5315713813f5820c4e8563d4a6fe39
FastDeploy/fastdeploy/model_executor/layers/attention/ops
T
History
mpgemm 7a20eaebe8 [Feature] Support cute cpp Encoder FA4 (#7016)
* add cute cpp fa4

* 删掉注释

* 修正合并错误

* sm_version放到函数内

* ci错误
2026-03-30 10:54:56 +08:00
..
__init__.py
[Feature] Support cute cpp Encoder FA4 (#7016)
2026-03-30 10:54:56 +08:00
append_attention.py
remove assert (#6970)
2026-03-23 14:22:03 +08:00
flash_attn_v4.py
[Feature] Support cute cpp Encoder FA4 (#7016)
2026-03-30 10:54:56 +08:00
flash_mask_attention.py
seq_lens related tensor shape -> [max_num_seqs] (#6535)
2026-03-02 11:18:30 +08:00
get_attn_mask_q.py
[Optimization] Support FA2/FA3/FA4 with attn_mask_q (#6354)
2026-02-05 14:39:00 +08:00
get_block_shape_and_split_kv_block.py
[Others]get_block_shape_and_split_kv_block clean code (#5123)
2025-11-20 16:40:04 +08:00
gqa_rope_write_cache.py
FA3 support qwen3 (#5441)
2025-12-09 16:16:16 +08:00
init_kv_signal_per_query.py
[PD Disaggregation][XPU] Add XPU support for PD disaggregation (#5113)
2025-11-21 14:09:01 +08:00
init_signal_layerwise.py
[PD Disaggregation][XPU] Add XPU support for PD disaggregation (#5113)
2025-11-21 14:09:01 +08:00
open_shm_and_get_meta_signal.py
[PD Disaggregation][XPU] Add XPU support for PD disaggregation (#5113)
2025-11-21 14:09:01 +08:00
pre_cache_len_concat.py
[Others] Remove useless code (#5404)
2025-12-08 13:59:46 +08:00
Powered by Gitea Version: 1.26.0 Page: 592ms Template: 7ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API