FastDeploy/benchmarks/yaml/qwen25_7b-vl-32k-bf16.yaml at c9783a84a6648ee1b157bb7970d5e79a0edc87d4

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00

Juncai 08ca0f6aea [Feature] [PD] add simple router and refine splitwise deployment (#4709)

* add simple router and refine splitwise deployment

* fix

2025-11-06 14:56:02 +08:00

max_model_len: 32768
max_num_seqs: 128
gpu_memory_utilization: 0.85
tensor_parallel_size: 1
limit_mm_per_prompt: '{"image": 100, "video": 100}'
enable_mm: True
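
One detail of this config is easy to miss: `limit_mm_per_prompt` is a quoted string holding JSON, not a nested YAML mapping, so a consumer presumably has to decode it in a second step. The following is a minimal sketch of that two-step read using only the Python standard library. `parse_flat_config` is a hypothetical helper written for this illustration (it handles only flat `key: value` lines and is not a full YAML parser), and nothing here is taken from FastDeploy's own loading code.

```python
import json

# Config text copied from the file above.
CONFIG_TEXT = """\
max_model_len: 32768
max_num_seqs: 128
gpu_memory_utilization: 0.85
tensor_parallel_size: 1
limit_mm_per_prompt: '{"image": 100, "video": 100}'
enable_mm: True
"""


def parse_flat_config(text):
    """Hypothetical helper: parse flat `key: value` lines into a dict.

    This is an assumption-laden stand-in for a real YAML loader and
    supports only single-quoted strings, booleans, ints, and floats.
    """
    config = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split on the first colon only, so colons inside the JSON string survive.
        key, _, raw = line.partition(":")
        raw = raw.strip()
        if len(raw) >= 2 and raw[0] == raw[-1] == "'":
            value = raw[1:-1]  # single-quoted string: drop the quotes
        elif raw in ("True", "true"):
            value = True
        elif raw in ("False", "false"):
            value = False
        else:
            try:
                value = int(raw)
            except ValueError:
                try:
                    value = float(raw)
                except ValueError:
                    value = raw  # leave anything else as a plain string
        config[key.strip()] = value
    return config


config = parse_flat_config(CONFIG_TEXT)

# Second step: the multimodal limits arrive as a JSON string and need decoding.
mm_limits = json.loads(config["limit_mm_per_prompt"])

print(config["max_model_len"], config["enable_mm"])
print(mm_limits["image"], mm_limits["video"])
```

In real tooling a YAML library would replace the hand-rolled parser; the point of the sketch is only that the per-prompt image and video limits become usable as integers after `json.loads`.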