apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 08:21:53 +08:00
d5518463ce0152b309d149fa6df8c2fe00e700fe
FastDeploy/docs/zh/features
jc d5518463ce Mooncake storage register local buffer by chunk (#7416) (#7540)
2026-04-22 10:46:57 +08:00
images
…
chunked_prefill.md
…
data_parallel_service.md
…
disaggregated.md
[Docs] Add docs for disaggregated deployment (#6700)
2026-04-01 19:27:09 +08:00
early_stop.md
…
global_cache_pooling.md
Mooncake storage register local buffer by chunk (#7416) (#7540)
2026-04-22 10:46:57 +08:00
graph_optimization.md
…
load_balance.md
…
logits_processor.md
…
multi-node_deployment.md
…
paddleformers_backend.md
…
plas_attention.md
…
plugins.md
…
pooling_models.md
…
prefix_caching.md
…
reasoning_output.md
…
sampling.md
[Feature][Sampling] Extend top-k_top-p sampling to all backends and unify greedy decoding with top_k=1 (#6894)
2026-03-19 01:43:10 -07:00
speculative_decoding.md
[Speculative Decoding] Unify Spec and non-spec branch (#6685)
2026-03-10 23:58:44 -07:00
structured_outputs.md
…
thinking_budget.md
[Bugfix] Align thinking_budget behavior with ERNIE reasoning flow (#6934)
2026-03-23 14:15:55 +08:00
tool_calling.md
…
weight_update.md
[RL] Adapt async rollout checkpoint update flow (#7042)
2026-03-30 19:19:34 +08:00