Files
FastDeploy/fastdeploy/model_executor/layers/quantization
AIbin cb6819d086 [Optimization][OP]support per_token_group_fp8_quant cuda kernel (#6865)
* support per_token_group_fp8_quant cuda kernel

* Potential fix for pull request finding

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

* update code

---------

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>
2026-03-17 19:17:51 +08:00
..
2025-12-18 14:14:05 +08:00
2025-09-03 10:57:26 +08:00
2025-10-31 15:44:14 +08:00