mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2026-04-23 00:17:25 +08:00
【FastDeploy CLI】collect-env subcommand (#4044)
* collect-env subcommand * trigger ci --------- Co-authored-by: K11OntheBoat <your_email@example.com>
This commit is contained in:
@@ -34,7 +34,7 @@ After FastDeploy is launched, it supports continuous monitoring of the FastDeplo
|
||||
| `fastdeploy:available_gpu_block_num` | Gauge | Number of available gpu blocks in cache, including prefix caching blocks that are not officially released | Count |
|
||||
| `fastdeploy:free_gpu_block_num` | Gauge | Number of free blocks in cache | Count |
|
||||
| `fastdeploy:max_gpu_block_num` | Gauge | Number of total blocks determined when service started| Count |
|
||||
| `available_gpu_resource` | Gauge | Available blocks percentage, i.e. available_gpu_block_num / max_gpu_block_num | Count |
|
||||
| `fastdeploy:available_gpu_resource` | Gauge | Available blocks percentage, i.e. available_gpu_block_num / max_gpu_block_num | Count |
|
||||
| `fastdeploy:requests_number` | Counter | Total number of requests received | Count |
|
||||
| `fastdeploy:send_cache_failed_num` | Counter | Total number of failures of sending cache | Count |
|
||||
| `fastdeploy:first_token_latency` | Gauge | Latest time to generate first token in seconds | Seconds |
|
||||
|
||||
Reference in New Issue
Block a user