This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-24 09:44:10 +08:00
Code
Issues
Actions
9
Packages
Projects
Releases
Wiki
Activity
Files
c1fb3112f81e257723322a308f581fff3d19528c
FastDeploy
/
fastdeploy
/
model_executor
/
layers
/
quantization
T
History
GoldPancake
c1fb3112f8
[FDConfig] Support CLI args for quantization params and add cudagraph validation (
#7281
)
...
* refactor quant cli param
2026-04-10 14:13:42 +08:00
..
ops
[BugFix][Optimization] Replace silent failures with catchable exceptions and informative error messages (
#6533
)
2026-03-16 21:32:43 +08:00
__init__.py
[FDConfig] Support CLI args for quantization params and add cudagraph validation (
#7281
)
2026-04-10 14:13:42 +08:00
block_wise_fp8.py
[Feature] support blackwell gemm in ht (
#7053
)
2026-04-07 19:52:51 +08:00
fp8_utils.py
[Feature] support blackwell gemm in ht (
#7053
)
2026-04-07 19:52:51 +08:00
kv_cache.py
[Intel HPU] enable tensor_wise_fp8 (
#5324
)
2025-12-17 16:45:03 +08:00
mix_quant.py
support w4afp8 moe offline permute & load (
#5613
)
2025-12-22 15:12:57 +08:00
mxfp4.py
[Feature] support compute shared experts before combine for better overlap (
#6697
)
2026-03-17 15:18:51 +08:00
nvfp4.py
[Feature] support nvfp4 tbo (
#7259
)
2026-04-09 17:29:39 +08:00
quant_base.py
[BugFix] fix flashinfer-cutedsl moe nvfp4 (
#7120
)
2026-04-03 15:43:19 +08:00
tensor_wise_fp8.py
[Intel HPU] enable tensor_wise_fp8 (
#5324
)
2025-12-17 16:45:03 +08:00
w4a8.py
[XPU] refactor moe ffn (
#5501
)
2025-12-18 14:14:05 +08:00
w4afp8.py
[Feature] support w4afp8 v1_loader and v0_loader(tp>1) (
#5757
)
2025-12-30 14:11:52 +08:00
w8a8.py
fix w8a8.py (
#3733
)
2025-09-03 10:57:26 +08:00
weight_only.py
[Iluvatar] refactor attn and moe code (
#6887
)
2026-03-18 10:31:00 +08:00
wfp8afp8.py
[Loader] support dummy load weight (
#6169
)
2026-01-26 13:58:53 +08:00
wint2.py
fix wint2 config (
#4721
)
2025-10-31 15:44:14 +08:00