mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2026-04-23 00:17:25 +08:00
[PD Disaggregation] Prefill and decode support cache storage (#6768)
* Prefill and decode support cache storage * up * up * update docs and refine mooncake store * up
This commit is contained in:
@@ -84,6 +84,7 @@ Learn how to download models, enable using the torch format, and more:
|
||||
- [Prefix Caching](./docs/features/prefix_caching.md)
|
||||
- [Chunked Prefill](./docs/features/chunked_prefill.md)
|
||||
- [Load-Balancing Scheduling Router](./docs/online_serving/router.md)
|
||||
- [Global Cache Pooling](./docs/features/global_cache_pooling.md)
|
||||
|
||||
## Acknowledgement
|
||||
|
||||
|
||||
Reference in New Issue
Block a user