Files
FastDeploy/fastdeploy/model_executor/layers
chen 0bcf924e10 [Optimization] Optimization for gather_logprob by 10GB (#5817)
* opt logprobs gather_logprob,reduce device memory usage by 10GB when token_num=8k
2025-12-30 15:33:34 +08:00
..
2025-12-18 14:14:05 +08:00