mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2026-04-23 17:11:21 +08:00
04fde3b227
* Prefill and decode support cache storage * up * up * update docs and refine mooncake store * up
32 lines
794 B
Markdown
32 lines
794 B
Markdown
# Global Cache Pooling Examples
|
|
|
|
This directory contains example scripts for Global Cache Pooling with MooncakeStore.
|
|
|
|
## Documentation
|
|
|
|
- [English Documentation](../../docs/features/global_cache_pooling.md)
|
|
- [中文文档](../../docs/zh/features/global_cache_pooling.md)
|
|
|
|
## Quick Start
|
|
|
|
```bash
|
|
# Multi-instance scenario
|
|
bash run.sh
|
|
|
|
# PD disaggregation scenario
|
|
bash run_03b_pd_storage.sh
|
|
```
|
|
|
|
## Scripts
|
|
|
|
| Script | Scenario | Description |
|
|
|--------|----------|-------------|
|
|
| `run.sh` | Multi-Instance | Two standalone instances sharing cache |
|
|
| `run_03b_pd_storage.sh` | PD Disaggregation | P+D instances with global cache pooling |
|
|
|
|
## Files
|
|
|
|
- `mooncake_config.json` - Mooncake configuration file
|
|
- `utils.sh` - Utility functions for scripts
|
|
- `stop.sh` - Stop all running services
|