This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 00:17:25 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
9d3551cfbb5315713813f5820c4e8563d4a6fe39
FastDeploy
/
custom_ops
/
gpu_ops
/
flash_mask_attn
T
History
ming1753
734fbcffde
[BugFix] Fix Async D2H copy bug & flash mash atten cache V out of bound bug (
#7221
)
2026-04-10 11:31:51 +08:00
..
flash_mask_attn_kernel.hpp
[Feature] support flash_mask_attention backend (
#5134
)
2025-11-28 10:12:16 +08:00
flash_mask_attn.cu
[BugFix] Fix batch_size derivation and relax shape checks in SM90 flash_mask_attn (
#7210
)
2026-04-09 11:05:10 +08:00
kernel_traits.h
[Feature] support flash_mask_attention backend (
#5134
)
2025-11-28 10:12:16 +08:00
mainloop_attn.hpp
[BugFix] Fix Async D2H copy bug & flash mash atten cache V out of bound bug (
#7221
)
2026-04-10 11:31:51 +08:00
softmax.hpp
[BugFix] Fix kv cache int8 dynamic quant on flash and flash_mask backend (
#7028
)
2026-03-30 11:17:15 +08:00
utils.hpp
remove unneeded para from flash_mask_attention (
#6218
)
2026-01-27 14:04:27 +08:00