Files
FastDeploy/fastdeploy/spec_decode
RuohengMa de0c5e68fb [XPU] Split the block_attn operator into smaller operators (#6798)
* spliced block_attn

* adapt to latest vllm

* fix unit tests

* delete mtp+cudagraph 4 cards test

* fix vl model

* fix mtp

* fix slot mapping
2026-04-16 14:28:40 +08:00
..
2026-04-08 11:25:41 +08:00
2026-04-08 15:25:14 +08:00