Mirror of https://github.com/PaddlePaddle/FastDeploy.git (synced 2026-04-23 00:17:25 +08:00)
polish code with new pre-commit rule (#2923)
@@ -37,6 +37,7 @@ image.sourcefind.cn:5000/dcu/admin/base/custom:fastdeploy2.0.0-kylinv10-dtk25.04
```

## 2. Start service

```bash
export FD_ATTENTION_BACKEND="BLOCK_ATTN"
python -m fastdeploy.entrypoints.openai.api_server \
@@ -47,7 +48,7 @@ python -m fastdeploy.entrypoints.openai.api_server \
--gpu-memory-utilization=0.8
```

-#### Send requests
+### Send requests

Send requests using either curl or Python

@@ -78,4 +79,4 @@ response = client.chat.completions.create(
stream=False,
)
print(response)
-```
+```