Yonghua Li
6d9739f360
[BugFix] fix speculative gauge metrics in multi api server ( #7082 )
2026-03-31 10:52:50 +08:00
Yonghua Li
6520ae807c
[BugFix] fix grpc failure when tracing init before workers forked ( #6732 )
...
* [fix] fix grpc failure when tracing init before workers forked
* [fix] change default exporter to http
* [fix] fix test_trace
2026-03-10 21:24:10 +08:00
qwes5s5
375b5b7b21
[Feature]Log Format Normalization and Trace Log Optimization ( #6370 )
...
* log refactor
* log refactor 2
* log refactor 3
2026-03-03 11:31:45 +08:00
xiaolei373
dfe8ea941c
[log]console log to llm log ( #5680 )
2025-12-23 10:05:45 +08:00
xiaolei373
a30b4da260
[Feature] Tracing: Fine-Grained Tracing for Request Latency Part1 ( #5458 )
2025-12-16 16:36:09 +08:00
Echo-Nie
1b1bfab341
[CI] Add unittest ( #5328 )
...
* add test_worker_eplb
* remove tensor_wise_fp8
* add copyright
2025-12-09 19:19:42 +08:00
fl0w2o48
e63d715fc3
[BugFix][Metrics] Fix Prometheus Multiprocess Metrics Issues and Add ZMQ Communication Metrics ( #5185 )
...
* [Feature] add metrics for ZMQ and fix multiprocess metrics
* fix test_metrics.py
---------
Co-authored-by: Jiaxin Sui <95567040+plusNew001@users.noreply.github.com>
2025-11-27 15:05:09 +08:00
Yonghua Li
cead6b26fa
[Metrics] Update time_to_first_token to include tokenization & queue time, and remove redundant metrics ( #4993 )
...
* [update] update time_to_first_tokens to include queue time, and remove first_token_latency and infer_latency
* [doc] update docs
* [ci] fix test
* [chore] delete redundant code
---------
Co-authored-by: Jiaxin Sui <95567040+plusNew001@users.noreply.github.com>
2025-11-26 14:42:17 +08:00
Echo-Nie
ff653503ff
[Docs] Add License in Unittest ( #4957 )
...
* add copyright
* add CopyRight
2025-11-12 10:44:09 +08:00
qwes5s5
abb62624b8
[fix] Fixed the issue of excessive/redundant spans being returned for streaming requests. ( #4375 )
...
* fix stream span
* fix stream span
2025-10-15 11:47:47 +08:00
qwes5s5
17169a14f2
[metrics] Add several observability metrics ( #3868 )
...
* Add several observability metrics
* [wenxin-tools-584] [Observability] Support viewing this node's concurrency, remaining block_size, number of queued requests, and similar information
* adjust some metrics and md files
* trigger ci
* adjust ci file
* trigger ci
* trigger ci
---------
Co-authored-by: K11OntheBoat <your_email@example.com>
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
2025-09-08 14:13:13 +08:00
Echo-Nie
7afcd4b776
【Hackathon 9th No.77】supplementary unit test for get_filtered_metrics ( #3578 )
...
* Add supplementary unit tests for the functional module fastdeploy/metrics/metrics/get_filtered_metrics
* fix pre-commit
---------
Co-authored-by: Tao Luo <luotao02@baidu.com>
2025-08-28 10:39:02 +08:00