FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2026-05-10 09:31:48 +08:00
5620bd12dead63eee1545bb2863fa4c7dd74b4e2
FastDeploy/benchmarks/yaml
xiegegege e3a843f2c5 [benchmark] add quantization for benchmark yaml (#2995) 2025-07-24 13:26:34 +08:00
request_yaml
…
eb45-21b-a3b-32k-bf16.yaml
…
eb45-21b-a3b-32k-wint4-a10.yaml
…
eb45-21b-a3b-32k-wint4.yaml
…
eb45-21b-a3b-32k-wint8.yaml
…
eb45-21B-vl-128k-wint4-h800-tp1.yaml
…
eb45-32k-bf16-a30-tp1.yaml
…
eb45-32k-blockwise-fp8-h800-tp8.yaml
…
eb45-32k-tensorwise-fp8-h800-tp8.yaml
…
eb45-32k-w4a8c8-a800-tp4.yaml
…
eb45-32k-w4a8c8-tp4_decode.yaml
…
eb45-32k-w4a8c8-tp4_prefill.yaml
…
eb45-32k-wint2-h20-tp1.yaml
…
eb45-32k-wint4-a800-tp4.yaml
…
eb45-32k-wint4-h800-dp8_decode.yaml
…
eb45-32k-wint4-h800-dp8_prefill.yaml
…
eb45-32k-wint4-mtp-h800-tp4.yaml
…
eb45-32k-wint4-mtp-tp4-decode.yaml
…
eb45-32k-wint4-mtp-tp4-prefill.yaml
…
eb45-32k-wint4-p800-tp4.yaml
…
eb45-32k-wint4-p800-tp8.yaml
…
eb45-32k-wint4-prefixcache-a800-tp4.yaml
…
eb45-32k-wint4-tp4_decode.yaml
…
eb45-32k-wint4-tp4_prefill.yaml
…
eb45-32k-wint8-a800-tp8.yaml
…
eb45-32k-wint8-p800-tp8.yaml
…
eb45-32k-wint8-prefixcache-a800-tp8.yaml
…
eb45-128k-wint4-a800-tp8.yaml
…
eb45-128k-wint4-p800-tp8.yaml
…
eb45-128k-wint8-a800-tp8.yaml
…
eb45-vl-32k-wint4-a800-tp8.yaml
…
eb45-vl-32k-wint4-h800-tp8.yaml
…
eb45-vl-32k-wint4-tp4.yaml
…
eb45-vl-32k-wint8-a800-tp8.yaml
…
eb45-vl-32k-wint8-h800-tp8.yaml
…
eb45-vl-32k-wint8-tp4.yaml
…
eb45t_0dot3b-32k-bf16-a30-tp1-static.yaml
…
eb45t_0dot3b-32k-bf16-h800-tp1-static.yaml
…
eb45t_0dot3b-32k-wint8-a30-tp1-static.yaml
…
eb45t_0dot3b-32k-wint8-h800-tp1-static.yaml
…
eb45t_21b-32k-bf16-h800-tp1-static.yaml
…
eb45t_21b-32k-wint4-h800-tp1-static.yaml
…
eb45t_300b-32k-wint4-h800-tp4-static.yaml
…
qwen2_7b-32k-bf16-a30-tp1-static.yaml
…
qwen2_7b-32k-bf16-h800-tp1-static.yaml
…
qwen2_7b-32k-bf16-h800-tp1.yaml
…
qwen2_7b-32k-fp8-h800-tp1-static.yaml
…
qwen2_7b-32k-fp8-h800-tp1.yaml
…
qwen2_7b-32k-wint8-h800-tp1.yaml
…
qwen3_0dot6b-32k-bf16-a30-tp1-static.yaml
…
qwen3_0dot6b-32k-bf16-h800-tp1-static.yaml
…
qwen3_0dot6b-32k-wint8-a30-tp1-static.yaml
…
qwen3_0dot6b-32k-wint8-h800-tp1-static.yaml
…
qwen3_30b-32k-bf16-h800-tp1-static.yaml
…
qwen3_30b-32k-wint4-h800-tp1-static.yaml
…
qwen3dot6b-32k-bf16-a30-tp1.yaml
…
qwen3dot6b-32k-bf16-a800-tp1.yaml
…
qwen3dot6b-32k-bf16-h800-tp1.yaml
…
qwen3dot6b-32k-wint8-a30-tp1.yaml
…
qwen3dot6b-32k-wint8-a800-tp1.yaml
…
qwen3dot6b-32k-wint8-h800-tp1.yaml
…
qwen3moe30b-32k-bf16-a800-tp1.yaml
…
qwen3moe30b-32k-bf16-h800-tp1.yaml
…
qwen3moe30b-32k-wint4-a800-tp1.yaml
…
qwen3moe30b-32k-wint4-h800-tp1.yaml
…
qwen3moe235b-32k-wint4-h800-tp4.yaml
…
qwen3moe235b-32k-wint8-h800-tp4.yaml
…
x1-32k-wint4-h800-tp8.yaml
…
x1-32k-wint4-p800-tp4.yaml
…
x1-32k-wint4-p800-tp8.yaml
…
x1-32k-wint4-prefixcache-h800-tp8.yaml
…
x1-32k-wint8-h800-tp8.yaml
…
x1-32k-wint8-p800-tp4.yaml
…
x1-32k-wint8-p800-tp8.yaml
…
x1-32k-wint8-prefixcache-h800-tp8.yaml
…