freeliuzc
|
f6c066fb9d
|
Revert "[Optimization] Optimize ttft for prefill pd (#6680)" (#7386)
* Revert "[Optimization] Optimize ttft for prefill pd (#6680)"
This reverts commit 6727df8286.
* fix revert pr
|
2026-04-14 20:01:39 +08:00 |
|
zhouchong
|
91c832f607
|
[Feature] Add logging parameters and error output to terminal (#7098)
|
2026-04-01 13:18:42 +08:00 |
|
chenjian
|
6727df8286
|
[Optimization] Optimize ttft for prefill pd (#6680)
* optimize ttft
* fix
* fix
* fix ci
* fix ci
* fix
* fix bug
* fix
* add comments
* fix ci
* fix
* fix ci
* fix format
* update according to review
* add comment
* fix
* fix format
|
2026-03-30 20:36:23 +08:00 |
|
bukejiyu
|
5bfc0938e2
|
[BugFix] PD reorder fix and add ut (#6375)
|
2026-02-09 04:42:48 -08:00 |
|
chenjian
|
35c24f3f71
|
Revert "[Optimize] Optimize ttft for ep (#6098)" (#6402)
This reverts commit 90db0bdd0d.
|
2026-02-09 19:01:23 +08:00 |
|
luukunn
|
fd56d85346
|
add environment_variables (#6385)
|
2026-02-09 15:29:49 +08:00 |
|
chenjian
|
90db0bdd0d
|
[Optimize] Optimize ttft for ep (#6098)
* optimize ttft
* fix
* fix
* fix ci
* fix ci
* fix
* fix bug
* fix
* add comments
* fix ci
* fix
|
2026-02-04 15:03:29 +08:00 |
|
Copilot
|
7d5282e158
|
[APIServer][Feature] Add configurable worker health check timeout via FD_WORKER_ALIVE_TIMEOUT (#5865)
* Initial plan
* Add configurable FD_WORKER_ALIVE_TIMEOUT environment variable
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
* Add test for FD_WORKER_ALIVE_TIMEOUT environment variable
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
* Update docs/zh/usage/environment_variables.md
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Update docs/usage/environment_variables.md
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Improve test coverage to validate integration with check_health calls
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
* Remove test_worker_alive_timeout.py per reviewer feedback
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
|
2026-01-05 09:47:12 +08:00 |
|
Copilot
|
5cec66adb8
|
[Docs] Update environment variable documentation to sync with the latest code (#5713)
* Initial plan
* Update environment variable documentation to match the latest code
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
|
2025-12-23 19:49:20 +08:00 |
|
Divano
|
c1aa66df02
|
Revert "[Optim] Remove limitation of number of kvcache blocks (#5612)" (#5702)
This reverts commit 9da89a374b.
|
2025-12-23 15:41:33 +08:00 |
|
Jiang-Jia-Jun
|
9da89a374b
|
[Optim] Remove limitation of number of kvcache blocks (#5612)
* [Optim] Remove limitation of number of kvcache blocks
* Update fastdeploy/envs.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Update fastdeploy/worker/iluvatar_worker.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Add docs
* Update fastdeploy/worker/worker_process.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* fix ci case
---------
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
|
2025-12-23 11:18:29 +08:00 |
|
kxz2002
|
a2870ed4a9
|
[Feature] Unify the registration name recognition for tool_parser and reasoning_parser to “-” (#4668)
* parser register name unify
* change ernie_x1 to ernie-x1
* change ernie4_5_vl to ernie-45-vl
* fix unit test
|
2025-10-31 10:45:27 +08:00 |
|
ming1753
|
7681375a19
|
[BugFix] PaddleOCR-VL fix FD_DEBUG type and support v1 loader (#4605)
* [Bug Fix] PaddleOCRVL fix FD_DEBUG type and support HF model
* fix bug
* fix bug
* fix bug
|
2025-10-28 09:47:47 +08:00 |
|
Sunny-bot1
|
4ffe41a747
|
WINT4/WINT8 dense gemm default use Machete (#4451)
|
2025-10-23 17:57:59 +08:00 |
|
Yuanle Liu
|
cef3164c3b
|
Optimizing the performance of think length limit using custom operators (#4279)
* delete impl
* delete min_length&max_length
* support limit thinking content strategy
* fix
* fix
* fix
* update
* fix set_value_by_flags_and_idx
* fix
* fix
* fix
* fix
* update
* fix
* fix
* fix typo
* fix ci
* fix
* fix
* support mtp
* fix
* fix
* update
* update
|
2025-10-20 21:09:13 +08:00 |
|
yangjianfengo1
|
ba5c2b7e37
|
[Docx] add language (en/cn) switch links (#4470)
* add install docs
* Revise documentation
* Revise documentation
|
2025-10-17 15:47:41 +08:00 |
|
xiaolei373
|
720697e265
|
add environment variables (#4466)
|
2025-10-17 14:20:01 +08:00 |
|
bukejiyu
|
2650f58740
|
[docs] Update environment variables documentation (#3957)
|
2025-09-10 21:17:06 -07:00 |
|
Sunny-bot1
|
ed5133f704
|
update env docs for Machete (#3959)
|
2025-09-08 14:44:31 +08:00 |
|
周周周
|
17b414c2df
|
MoE Default use triton's blockwise fp8 in TP Case (#3678)
|
2025-08-29 11:07:30 +08:00 |
|
Yuanle Liu
|
9571c458f0
|
enhance eos_tokens (#3274)
* enhance eos_tokens
* update
* update
|
2025-08-11 14:47:52 +08:00 |
|
Sunny-bot1
|
240d6236bc
|
[Fix]fix top_k_top_p sampling (#2801)
* fix topk-topp
* update
* add base_non_truncated
|
2025-07-10 22:35:10 +08:00 |
|
chen
|
888780ffde
|
[Feature] block_wise_fp8 support triton_moe_backend (#2767)
|
2025-07-09 19:22:47 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|