[Docx] add language (en/cn) switch links (#4470)

* add install docs * 修改文档 * 修改文档
2026-04-23 00:17:25 +08:00 · 2025-10-17 15:47:41 +08:00
parent a3e0a15495
commit ba5c2b7e37
106 changed files with 206 additions and 0 deletions
@@ -1,3 +1,5 @@
+[简体中文](../zh/features/disaggregated.md)
+
 # Disaggregated Deployment

 Large model inference consists of two phases: Prefill and Decode, which are compute-intensive and memory access-intensive respectively. Deploying Prefill and Decode separately in certain scenarios can improve hardware utilization, effectively increase throughput, and reduce overall sentence latency.