Files
FastDeploy/custom_ops/gpu_ops
AIbin cb6819d086 [Optimization][OP]support per_token_group_fp8_quant cuda kernel (#6865)
* support per_token_group_fp8_quant cuda kernel

* Potential fix for pull request finding

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

* update code

---------

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>
2026-03-17 19:17:51 +08:00
..
2026-03-14 11:09:28 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-01-20 21:46:21 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2025-12-24 11:28:47 +08:00
2026-03-04 21:55:31 +08:00
2026-02-10 14:58:50 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00
2026-03-04 21:55:31 +08:00