apps
/
FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 00:17:25 +08:00
47bfd45bb6774afc46b3ec7555f5f703fe81c76d
FastDeploy
/
docs
Latest commit: 47bfd45bb6 by AIbin, [Docs]add deepseek model doc (#6513), 2026-02-26 14:08:19 +08:00
* add deepseek model doc
| Name | Last commit | Date |
|---|---|---|
| assets/images | Update docs (#3975) | 2025-09-08 16:53:37 +08:00 |
| best_practices | [Docs]add deepseek model doc (#6513) | 2026-02-26 14:08:19 +08:00 |
| cli | [docs] add cli uasge to docs (#4569) | 2025-10-28 10:35:11 +08:00 |
| features | [Speculative Decoding] Support suffix decoding (#6403) | 2026-02-26 11:42:05 +08:00 |
| get_started | [Metax][Docs] update metax guidance documents (#6515) | 2026-02-26 14:04:23 +08:00 |
| observability | [Feature] Tracing: Fine-Grained Tracing for Request Latency Part1 (#5458) | 2025-12-16 16:36:09 +08:00 |
| online_serving | [Metrics] Support cpu-cache-block-num (#6390) | 2026-02-09 10:27:56 +08:00 |
| quantization | [Feature] Support NVFP4 MoE on SM100 (#6003) | 2026-01-29 14:16:07 +08:00 |
| usage | [BugFix] PD reorder fix and add ut (#6375) | 2026-02-09 04:42:48 -08:00 |
| zh | [Docs]add deepseek model doc (#6513) | 2026-02-26 14:08:19 +08:00 |
| benchmark.md | [Docx] add language (en/cn) switch links (#4470) | 2025-10-17 15:47:41 +08:00 |
| index.md | [Docs] add doc for glm (#4933) | 2025-11-10 21:21:33 +08:00 |
| offline_inference.md | update doc (#4675) | 2025-10-30 11:19:04 +08:00 |
| parameters.md | [Docs] Add Doc for Online quantification (#6399) | 2026-02-08 22:09:18 -08:00 |
| requirements.txt | fix: correct typo in nvidia_gpu.md (#4848) | 2025-11-06 16:03:02 +08:00 |
| supported_models.md | remove load default_v1 since already been as default (#4980) | 2025-11-12 16:49:48 +08:00 |