This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 00:17:25 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
472402bf4e5a8c45c33b030d9a41d54224d556f3
FastDeploy
/
docs
/
zh
/
features
T
History
yangjianfengo1
472402bf4e
Update sparse attn documentation (
#3954
)
...
* 更新文档 * 更新文档 * 更新文档 * 更新文档
2025-09-08 12:23:18 +08:00
..
images
Update sparse attn documentation (
#3954
)
2025-09-08 12:23:18 +08:00
chunked_prefill.md
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
data_parallel_service.md
[Docs] add data parallel (
#3883
)
2025-09-04 20:33:50 +08:00
disaggregated.md
[Docs] add data parallel (
#3883
)
2025-09-04 20:33:50 +08:00
early_stop.md
add stop_seqs doc (
#3090
)
2025-07-30 20:36:18 +08:00
graph_optimization.md
Modified to support custom all reduce by default (
#3538
)
2025-08-22 16:59:05 +08:00
load_balance.md
update doc: load_balance.md (
#3008
)
2025-07-30 10:27:56 +08:00
multi-node_deployment.md
[Doc] Add multinode deployment documents (
#3417
)
2025-08-15 10:37:04 +08:00
plas_attention.md
Update sparse attn documentation (
#3954
)
2025-09-08 12:23:18 +08:00
plugins.md
【Feature】add fd plugins && rm model_classes (
#3123
)
2025-08-03 19:53:20 -07:00
prefix_caching.md
polish code with new pre-commit rule (
#2923
)
2025-07-19 23:19:27 +08:00
reasoning_output.md
[Doc] add chat_template_kwagrs and update params docs (
#3103
)
2025-07-31 19:44:06 +08:00
sampling.md
fix typos (
#3684
)
2025-09-01 17:50:17 +08:00
speculative_decoding.md
polish code with new pre-commit rule (
#2923
)
2025-07-19 23:19:27 +08:00
structured_outputs.md
[Feature] mm and thinking model support structred output (
#2749
)
2025-09-02 16:21:09 +08:00