mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2026-05-10 09:31:48 +08:00
0bcf924e10
* opt logprobs gather_logprob,reduce device memory usage by 10GB when token_num=8k