Zhang Yulong
738c658c54
[Benchmark] Update seed argument handling in benchmark_serving.py ( #7356 )
2026-04-13 16:05:50 +08:00
Zhang Yulong
f422f835e8
[benchmark] update tools ( #7211 )
2026-04-07 16:25:44 +08:00
Zhang Yulong
6f5aa883f7
[benchmark] update benchmark tools ( #6991 )
...
* [benchmark] update benchmark tools
* [benchmark] update benchmark tools
2026-03-24 20:56:27 +08:00
Zhang Yulong
96bfa0d5b9
[benchmark] Update benchmark_serving.py ( #6467 )
2026-02-11 20:10:46 +08:00
Zhang Yulong
66c9e11998
[benchmark] update tools ( #6437 )
2026-02-10 17:48:55 +08:00
Zhang Yulong
16d03c3127
update ( #6335 )
2026-02-03 21:59:32 +08:00
jc
e911ac2ce7
[BugFix] Refine the preparation of cpu and storage cache ( #5777 )
...
* Refine the preparation of cpu and storage cache
* fix error
* fix error
* up
* fix
* up docs
* fix unittest
* remove debug info
2026-01-05 10:13:30 +08:00
Zhang Yulong
2da32f2a35
Update benchmark_serving.py ( #5861 )
2026-01-04 20:07:56 +08:00
Juncai
412867fd99
[Feature] Support KV Cache Storage ( #5571 )
...
* Support Mooncake Store
* up
* up
* add op
* fix conflict
* fix error
* up for comments
* avoid thread lock
* up
* fix unittest
* fix unittest
* remove debug info
* consider tp_size > 1
* add default rdma_nics
* add utils
* up
* fix error
---------
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com >
2025-12-25 16:30:35 +08:00
ophilia-lee
99258e19c8
[Benchmark]支持Completions接口 ( #5700 )
...
* benchmark工具支持受限解码场景指定response_format
* Update backend_request_func.py
output.success判断兼容思考内容超长截断时回复内容为空的情况
* Update benchmark_serving.py
更新benchmark_metrics
* 支持Completions接口
* 支持Completions接口
* 支持Completions接口
* [Benchmark]支持Completions接口
* [Benchmark]支持Completions接口
---------
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com >
2025-12-23 19:46:23 +08:00
Zhang Yulong
f45c131ddf
update ( #5625 )
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FD Image Build (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
2025-12-17 21:38:14 +08:00
Zhang Yulong
510b82173a
[Benchmark] Update benchmark ( #5496 )
...
* update benchmark
* update benchmark
2025-12-11 11:53:12 +08:00
Zhang Yulong
5b49142988
update ( #5298 )
2025-11-28 18:29:16 +08:00
Zhang Yulong
83532e1d01
[Benchmark] Enhance benchmark output logging ( #4682 )
...
* Enhance benchmark output logging
Add print statements to display the number of discarded outputs before and after filtering.
* Update benchmark_serving.py
2025-11-06 16:53:31 +08:00
Juncai
08ca0f6aea
[Feature] [PD] add simple router and refine splitwise deployment ( #4709 )
...
* add simple router and refine splitwise deployment
* fix
2025-11-06 14:56:02 +08:00
ophilia-lee
412097c1b8
benchmark工具支持受限解码场景指定response_format ( #4718 )
2025-10-31 12:26:24 +08:00
Ryan
28de91b50f
[Graph Optimization] SOT+CUDAGraph support ERNIE4.5T VL 28B / 424B ( #4645 )
...
* 45TVL support sot+CUDAGraph
* mv unitest from ce_deploy 2 e2e
* add test_EB_VL_Lite_sot_serving
* rm useless line
* add openai_client
* fix unitest && reduce computing resources
2025-10-31 11:38:43 +08:00
ophilia-lee
70aa7423f8
benchmark工具适配SGLang框架 ( #4607 )
...
* benchmark工具适配SGLang框架
* benchmark工具适配SGLang框架
* benchmark工具适配SGLang框架
2025-10-27 18:52:56 +08:00
Zhang Yulong
10e85daf15
update benchmark scripts ( #4497 )
2025-10-20 17:03:10 +08:00
Zhang Yulong
98f8c3703a
Add filtering for failed requests in benchmark outputs ( #4448 )
...
Filter out requests with end_timestamp == 0.0
2025-10-16 14:57:47 +08:00
Zhang Yulong
9dc3968c13
[benchmark] Fix benchmark duration calculation logic ( #4446 )
...
* Fix benchmark duration calculation logic
Calculate benchmark duration using filtered outputs.
* Fix benchmark duration calculation using benchmark_outputs
2025-10-16 14:36:29 +08:00
Zhang Yulong
7f94f063ff
Update benchmark_serving.py ( #4438 )
...
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FD Image Build (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Run Accuracy Tests (push) Has been cancelled
CI Images Build / Run Stable Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
丢弃的请求依旧保存,用于结果分析
2025-10-15 20:36:19 +08:00
Zhang Yulong
c4f866c457
update benchmark tools ( #4416 )
2025-10-15 11:15:25 +08:00
Zhang Yulong
5532e8a323
[FD CLI] Add bench cli ( #4160 )
...
* add bench cli
* Update test_main.py
2025-09-22 20:37:30 +08:00
yinwei
776fb03250
add error info ( #3040 )
2025-07-28 15:10:28 +08:00
Zhang Yulong
5151bc92c8
Update benchmark tools ( #3004 )
...
Deploy GitHub Pages / deploy (push) Has been cancelled
* update benchmark tools
* update benchmark tools
2025-07-24 15:19:23 +08:00
Zero Rains
25698d56d1
polish code with new pre-commit rule ( #2923 )
2025-07-19 23:19:27 +08:00
lijingning
9d6a42b334
适配vLLM无arrival_time;适配vLLM model必传;RequestFuncInput/RequestFuncOutput/SampleRequest新增用例编号no
2025-07-15 19:31:27 +08:00
Jiang-Jia-Jun
92c2cfa2e7
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00