apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
18e79dd660d723e239a47a2d5dddf0a47abf0a3f
FastDeploy/docs/zh
Jiang-Jia-Jun 18e79dd660 [Metrics] Support cpu-cache-block-num (#6390)
Co-authored-by: root <root@szzj-bcc-offline-1487319.szzj.baidu.com>
2026-02-09 10:27:56 +08:00
Name | Last commit | Date
best_practices | [Docs] update FAQ with logprobs MQ limits and deprecation (#5368) | 2025-12-04 15:57:04 +08:00
cli | [docs] add cli uasge to docs (#4569) | 2025-10-28 10:35:11 +08:00
features | [Feature] Fix counter release logic & update go-router download URL (#6280) | 2026-02-04 15:02:38 +08:00
get_started | Modify Nightly Build installation commands for fastdeploy | 2026-02-03 20:24:27 +08:00
observability | [Feature] Tracing: Fine-Grained Tracing for Request Latency Part1 (#5458) | 2025-12-16 16:36:09 +08:00
online_serving | [Metrics] Support cpu-cache-block-num (#6390) | 2026-02-09 10:27:56 +08:00
quantization | [Feature] Support NVFP4 MoE on SM100 (#6003) | 2026-01-29 14:16:07 +08:00
usage | [Optimize] Optimize ttft for ep (#6098) | 2026-02-04 15:03:29 +08:00
benchmark.md | [Docx] add language (en/cn) switch links (#4470) | 2025-10-17 15:47:41 +08:00
index.md | [Docx] add language (en/cn) switch links (#4470) | 2025-10-17 15:47:41 +08:00
offline_inference.md | update doc (#4675) | 2025-10-30 11:19:04 +08:00
parameters.md | [Docs] Update parameters documentation with latest code defaults and new parameters (#5709) | 2025-12-23 17:31:44 +08:00
supported_models.md | remove load default_v1 since already been as default (#4980) | 2025-11-12 16:49:48 +08:00