FastDeploy/fastdeploy
fxyfxy777 2ada119a38 [Optimize] optimize mask_quant & swiglu (#6222)
* optimize mask_quant op (1.5x speedup)

* fix calculate sequence

* add fused

* rm log

* push kernel code

* add ut

* accuracy ok

* add ue8m0

* add ut

* add merge develop

* rm ut of mask_per_token_quant
2026-02-02 13:52:38 +08:00