mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2026-04-23 00:17:25 +08:00
[PD Disaggregation] Prefill and decode support cache storage (#6768)
* Prefill and decode support cache storage * up * up * update docs and refine mooncake store * up
This commit is contained in:
@@ -86,6 +86,7 @@ FastDeploy 支持在**英伟达(NVIDIA)GPU**、**昆仑芯(Kunlunxin)XPU
|
||||
- [前缀缓存](./docs/zh/features/prefix_caching.md)
|
||||
- [分块预填充](./docs/zh/features/chunked_prefill.md)
|
||||
- [负载均衡调度Router](./docs/zh/online_serving/router.md)
|
||||
- [全局Cache池化](./docs/zh/features/global_cache_pooling.md)
|
||||
|
||||
## 致谢
|
||||
|
||||
|
||||
Reference in New Issue
Block a user