This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 00:17:25 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
e6804ba97dceffc4d3c54ffb0cba20d91c98158a
FastDeploy
/
tests
/
quantization
T
History
周周周
b1c800b64b
remove load_up_proj_weight_first (
#6932
)
2026-03-19 17:21:34 +08:00
..
test_kv_cache.py
[Optimization] Support FA2/FA3/FA4 with attn_mask_q (
#6354
)
2026-02-05 14:39:00 +08:00
test_modelopt_nvfp4.py
remove load_up_proj_weight_first (
#6932
)
2026-03-19 17:21:34 +08:00
test_quantization_init.py
[BugFix][Optimization] Replace silent failures with catchable exceptions and informative error messages (
#6533
)
2026-03-16 21:32:43 +08:00
test_tensor_wise_fp8.py
[Intel HPU] enable tensor_wise_fp8 (
#5324
)
2025-12-17 16:45:03 +08:00
test_w4a8.py
[Docs] Add License in Unittest (
#4957
)
2025-11-12 10:44:09 +08:00
test_w4afp8.py
[Others] remove add_bias option (
#5425
)
2025-12-09 17:39:35 +08:00