This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 08:21:53 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
00a01ae02471365a5b9e01b1f5dcdb7fe6df940c
FastDeploy
/
custom_ops
/
gpu_ops
/
w4afp8_gemm
T
History
yangjianfengo1
59523b27de
opt w4afp8 (
#5853
)
2026-01-07 12:22:35 +08:00
..
kernel_traits.h
opt w4afp8 (
#5853
)
2026-01-07 12:22:35 +08:00
mainloop_fwd.h
opt w4afp8 (
#5853
)
2026-01-07 12:22:35 +08:00
utils.hpp
【New Feature】W4afp8 supports per group quantization (
#4987
)
2025-11-13 19:17:27 +08:00
w4afp8_gemm_kernel.hpp
opt w4afp8 (
#5853
)
2026-01-07 12:22:35 +08:00
w4afp8_gemm.cu
【New Feature】W4afp8 supports per group quantization (
#4987
)
2025-11-13 19:17:27 +08:00
w4afp8_gemm.h
【New Feature】W4afp8 supports per group quantization (
#4987
)
2025-11-13 19:17:27 +08:00
weight_kernel.hpp
【New Feature】W4afp8 supports per group quantization (
#4987
)
2025-11-13 19:17:27 +08:00
weight_scale_kernel.hpp
【New Feature】W4afp8 supports per group quantization (
#4987
)
2025-11-13 19:17:27 +08:00