FastDeploy/custom_ops/gpu_ops/speculate_decoding
huicongyao 2e63d88f7a [Optimization][Speculative Decoding]Fuse padding sampling params (#6765)
* optimize speculate pre process unit test

* Add CUDA kernel for building sampling params in speculative decoding

* init infer seed in device

* format code

* add unittest & fix

* fix

* format-code

* format-code

* fix rebase

* .

* fix unittest
2026-03-12 05:05:15 -07:00