[Docs] Release 2.1 docs and fix some description (#3424)

2026-04-23 00:17:25 +08:00 · 2025-08-15 14:27:19 +08:00
parent fbb6dcb9e4
commit d4e3a20300
14 changed files with 73 additions and 29 deletions
@@ -23,6 +23,7 @@ Execute the following command to start the service. For parameter configurations
 >💡 **Note**: Since the model parameter size is 424B-A47B, on an 80G * 8 GPU machine, specify ```--quantization wint4``` (wint8 is also supported).

 ```shell
+export ENABLE_V1_KVCACHE_SCHEDULER=1
 python -m fastdeploy.entrypoints.openai.api_server \
       --model baidu/ERNIE-4.5-VL-424B-A47B-Paddle \
       --port 8180 --engine-worker-queue-port 8181 \