[PD Disaggregation] Prefill and decode support cache storage (#6768)

* Prefill and decode support cache storage

* up

* up

* update docs and refine mooncake store

* up
This commit is contained in:
jc
2026-03-16 14:44:49 +08:00
committed by GitHub
parent 72ff7bf4cd
commit 04fde3b227
12 changed files with 1083 additions and 66 deletions
+1
View File
@@ -84,6 +84,7 @@ Learn how to download models, enable using the torch format, and more:
- [Prefix Caching](./docs/features/prefix_caching.md)
- [Chunked Prefill](./docs/features/chunked_prefill.md)
- [Load-Balancing Scheduling Router](./docs/online_serving/router.md)
- [Global Cache Pooling](./docs/features/global_cache_pooling.md)
## Acknowledgement