FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-05-08 08:23:25 +08:00

Files

T

MingkunZhang 9d9f5df8d0 [Metax] support default_v1 loader & thinking model (#4956 )

Co-authored-by: plusNew001 <95567040+plusNew001@users.noreply.github.com>

2025-11-12 16:32:26 +08:00

2025-11-10 20:57:35 +08:00

__init__.py

2025-09-24 14:12:05 +08:00

block_wise_fp8.py

2025-11-11 21:30:39 +08:00

kv_cache.py

2025-10-10 15:41:32 +08:00

mix_quant.py

2025-11-11 21:30:39 +08:00

quant_base.py

…

tensor_wise_fp8.py

…

w4a8.py

2025-10-10 15:41:32 +08:00

w4afp8.py

…

w8a8.py

…

weight_only.py

2025-11-12 16:32:26 +08:00

wfp8afp8.py

2025-11-11 21:30:39 +08:00

wint2.py

fix wint2 config (#4721 )

2025-10-31 15:44:14 +08:00