[PD Disaggregation] Prefill and decode support cache storage (#6768)

* Prefill and decode support cache storage

* up

* up

* update docs and refine mooncake store

* up
jc
2026-03-16 14:44:49 +08:00
committed by GitHub
parent 72ff7bf4cd
commit 04fde3b227
12 changed files with 1083 additions and 66 deletions
+1
@@ -86,6 +86,7 @@ FastDeploy supports **NVIDIA GPU**, **Kunlunxin XPU
- [Prefix Caching](./docs/zh/features/prefix_caching.md)
- [Chunked Prefill](./docs/zh/features/chunked_prefill.md)
- [Load-Balancing Scheduling Router](./docs/zh/online_serving/router.md)
- [Global Cache Pooling](./docs/zh/features/global_cache_pooling.md)
## Acknowledgements