This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 00:17:25 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
fd44bb7cbfcd5477a5b3a576dba51cbb290fbb38
FastDeploy
/
custom_ops
/
gpu_ops
/
sample_kernels
T
History
wangyifei
b57c960837
cuda13.0, implement changes to CCCL (
#6751
)
2026-03-10 16:47:02 +08:00
..
air_top_p_sampling.cu
[Metax] modify wrapSize to WARP_SIZE (
#5442
)
2025-12-09 01:44:02 -08:00
min_p_sampling_from_probs.cu
init (
#6642
)
2026-03-04 21:55:31 +08:00
rejection_top_p_sampling.cu
init (
#6642
)
2026-03-04 21:55:31 +08:00
sampling.cuh
cuda13.0, implement changes to CCCL (
#6751
)
2026-03-10 16:47:02 +08:00
top_k_renorm_probs.cu
init (
#6642
)
2026-03-04 21:55:31 +08:00
utils.cuh
[Metax] refactor cutlass moe and optimize flash attention (
#5361
)
2025-12-10 17:15:17 +08:00