mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2026-04-23 00:17:25 +08:00
0bcf924e10
* opt logprobs gather_logprob,reduce device memory usage by 10GB when token_num=8k