FastDeploy/fastdeploy/model_executor/layers/quantization
fxyfxy777 4c92035f2d [Feature] Unify fp8 block_wise quant ops (#5991)
* quant stash

* blockwise_quant

* precommit

* rm tensor.cut

* tp ok

* add swiglu

* rm outdated code

* fix activate ut

* change baseline

* fix baseline error
2026-01-15 05:50:37 -08:00