FastDeploy/custom_ops/gpu_ops
freeliuzc 9018ccf74e [Speculative Decoding] Fix attn_mask_offset for multi-step MTP in mixed and PD-split modes (#5738)
* fix attn_mask_offset in mtp with multi-step and pd-split-mode

* fix xpu operator register

* update pmtp multi-step mtp strategy in pd-split-mode

* add note

* fix xpu register
2025-12-25 01:54:59 -08:00