Files
FastDeploy/fastdeploy/spec_decode
RAM 528c55776e [Graph Optimization][Speculative Decoding] Fix the bug of CUDAGraph + MTP + EP (#4456)
* Fix MTP dummy run bug

* Target Model and Draft Model using the same flag

* In mtp replace use_cudagraph as step_use_cudagraph
2025-10-20 10:38:55 +08:00
..