Files
FastDeploy/tests/xpu_ci/4cards_cases
RuohengMa de0c5e68fb [XPU] Split the block_attn operator into smaller operators (#6798)
* spliced block_attn

* adapt to latest vllm

* fix unit tests

* delete mtp+cudagraph 4 cards test

* fix vl model

* fix mtp

* fix slot mapping
2026-04-16 14:28:40 +08:00
..
2026-01-16 20:57:58 +08:00
2026-01-16 20:57:58 +08:00