mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2026-05-10 17:41:13 +08:00
0bcf924e10
* opt logprobs gather_logprob,reduce device memory usage by 10GB when token_num=8k