Files
FastDeploy/custom_ops/gpu_ops/append_attn
GoldPancake bda38aa519 [Speculative Decoding] Support MTP for GLM-4.5-Air (#6047)
* glm mtp
* add spec neox partial rope
2026-01-16 14:35:24 +08:00
..
2025-10-24 10:14:53 +08:00
2026-01-05 15:29:34 +08:00
2025-10-20 14:44:58 +08:00