apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git (synced 2026-04-23 00:17:25 +08:00)
d8cdda86cb022f210fdeafadaee054e26f7acf87
FastDeploy/custom_ops/gpu_ops/sparse_indexer
Latest commit: AIbin, 48d2bbeb74, fix dsa (#7252), 2026-04-08 20:21:38 +08:00
exception.h  [Models][OP][Optimization] Support DeepSeek-v3.2 model, integrate DSA & Indexer architecture with FlashMLA/DeepGEMM (#6689)  2026-03-10 15:05:14 +08:00
indexer_topk.cu  [Optimization][Feature]Supports multiple batches of DSK-DSA. (#6930)  2026-03-20 15:59:22 +08:00
indexer_topk.cuh  fix dsa (#7252)  2026-04-08 20:21:38 +08:00
per_token_group_quant.cu  [Optimization][OP]support per_token_group_fp8_quant cuda kernel (#6865)  2026-03-17 19:17:51 +08:00
utils.cuh  [Models][OP][Optimization] Support DeepSeek-v3.2 model, integrate DSA & Indexer architecture with FlashMLA/DeepGEMM (#6689)  2026-03-10 15:05:14 +08:00
vec_dtypes.cuh  [Models][OP][Optimization] Support DeepSeek-v3.2 model, integrate DSA & Indexer architecture with FlashMLA/DeepGEMM (#6689)  2026-03-10 15:05:14 +08:00