Files
FastDeploy/benchmarks/yaml/eb45-21b-a3b-32k-bf16-tp2-mooncake.yaml
T
xiegegege 209e5cf7f4 [CE]add 21b mooncake yaml (#7033)
* [CE]add 21b cpu cache ,glm mtp,glm for rl config

* [CE]add 21b tp2 yaml

* [CE]add 21b mooncake yaml

* add fastdeploy benchmark,paddletest-155

* [CE] adjust vl wint4 config

* [CE]add glm mtp with updatemodel config

* [CE]fix

* fix

* test

* test

* test

---------

Co-authored-by: xiegegege <>
2026-03-26 20:01:05 +08:00

6 lines
128 B
YAML

max_model_len: 131072
max_num_seqs: 256
tensor_parallel_size: 2
kvcache_storage_backend: "mooncake"
enable_output_caching: True