Mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2026-04-23 00:17:25 +08:00
[Docs] Add docs for disaggregated deployment (#6700)
* add docs for disaggregated deployment
* pre-commit run for style check
* update docs
@@ -1,5 +1,7 @@
[English](../../features/disaggregated.md)
[Best Practices](../best_practices/Disaggregated.md)
# Disaggregated Deployment
LLM inference consists of two phases, Prefill and Decode, which are compute-bound and memory-bound, respectively.
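
To make the compute-bound versus memory-bound distinction concrete, the sketch below estimates the arithmetic intensity (FLOPs per byte moved) of a single linear layer in each phase. It is a rough illustration, not FastDeploy code: the function name `arithmetic_intensity`, the 4096-wide layer, the 2048-token prompt, and the FP16 element size are all assumptions chosen for the example, and the estimate ignores attention, the KV cache, and batching across requests.

```python
def arithmetic_intensity(num_tokens, d_in, d_out, bytes_per_element=2):
    """Estimate FLOPs per byte for y = x @ W, with x: [num_tokens, d_in], W: [d_in, d_out].

    Hypothetical helper for illustration only; assumes each tensor is
    read or written exactly once and stored in FP16 (2 bytes per element).
    """
    # Each output element needs d_in multiply-adds, counted as 2 FLOPs each.
    flops = 2 * num_tokens * d_in * d_out
    # Bytes moved: read the input, read the weights, write the output.
    elements_moved = num_tokens * d_in + d_in * d_out + num_tokens * d_out
    return flops / (bytes_per_element * elements_moved)


# Prefill: the whole prompt (assumed 2048 tokens) passes through the layer at once.
prefill_intensity = arithmetic_intensity(num_tokens=2048, d_in=4096, d_out=4096)

# Decode: one new token per step for a single request.
decode_intensity = arithmetic_intensity(num_tokens=1, d_in=4096, d_out=4096)

print(f"prefill: {prefill_intensity:.1f} FLOPs/byte")
print(f"decode:  {decode_intensity:.3f} FLOPs/byte")
```

Under these assumptions, Prefill performs on the order of a thousand FLOPs per byte, while Decode performs about one, because every decode step reloads the full weight matrix to process a single token. This gap is why the two phases tend to be limited by different hardware resources.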