Files
FastDeploy/custom_ops/gpu_ops
chen 29a313a402 [Optimization] Support FA2/FA3/FA4 with attn_mask_q (#6354)
* support FA4 sm100

* flash attn backend support mask

* flash attn backend run flashmask correct

* add test for flash_attn_backend and flash_attn_func

* check

* add test for fa4

* requirements.txt add fa4 whl

* check test on sm100

* fix CI conflict

* add enable_torch_proxy for flash_mask

* lazy import fa4

* check

* fix tests import

* check test_load_mpt import
2026-02-05 14:39:00 +08:00
..
2025-09-01 17:50:17 +08:00
2025-09-01 17:50:17 +08:00
2026-01-20 21:46:21 +08:00
2025-12-24 11:28:47 +08:00
2026-02-04 10:47:19 +08:00
2025-09-01 17:50:17 +08:00
2025-07-07 16:53:14 +08:00
2025-09-01 17:50:17 +08:00
2025-09-01 17:50:17 +08:00