Files
FastDeploy/fastdeploy
RuohengMa de0c5e68fb [XPU] Split the block_attn operator into smaller operators (#6798)
* spliced block_attn

* adapt to latest vllm

* fix unit tests

* delete mtp+cudagraph 4 cards test

* fix vl model

* fix mtp

* fix slot mapping
2026-04-16 14:28:40 +08:00
..
2026-04-07 16:30:32 +08:00
2026-04-14 17:28:22 +08:00
2026-04-08 11:25:41 +08:00
2026-04-14 20:04:04 +08:00
2026-03-31 11:02:26 +08:00
2025-07-03 15:43:53 +08:00