Files
FastDeploy/fastdeploy/model_executor/layers/quantization
lizexu123 44a13e4557 [Feature] support w4afp8 v1_loader and v0_loader(tp>1) (#5757)
* support

* fix

* support w4afp8 v1_loader and v0_loader

* fix

* fix test

* fix test

* fix test

* fix moe.py

* add test_ernie_4_5_w4afp8

* add test

* delete tensor

* fix test

* fix

* add

* fix test
2025-12-30 14:11:52 +08:00
..
2025-12-18 14:14:05 +08:00
2025-09-03 10:57:26 +08:00
2025-12-18 14:14:05 +08:00
2025-11-11 21:30:39 +08:00
2025-10-31 15:44:14 +08:00