This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-05-06 23:49:39 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
4,603
Commits
47
Branches
17
Tags
4e06df520ebe6f27cf377bcb65cdd74c19d3499f
Commit Graph
2 Commits
Author
SHA1
Message
Date
chen
193886e745
only cuda run triton op (
#5846
)
2025-12-31 14:17:31 +08:00
chen
0bcf924e10
[Optimization] Optimization for gather_logprob by 10GB (
#5817
)
...
* opt logprobs gather_logprob,reduce device memory usage by 10GB when token_num=8k
2025-12-30 15:33:34 +08:00