FastDeploy/tests
RuohengMa de0c5e68fb [XPU] Split the block_attn operator into smaller operators (#6798)
* split block_attn

* adapt to latest vllm

* fix unit tests

* delete mtp+cudagraph 4 cards test

* fix vl model

* fix mtp

* fix slot mapping
2026-04-16 14:28:40 +08:00