Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
Code Issues Actions 19 Packages Projects Releases Wiki Activity
Files
3a4e139f65901a41ed78a206d61de0bde0ce7a06
FastDeploy/docs
T
History
mouxin 96b0ecea6b [Feature] Update Counter Release (#6943)
2026-03-20 10:51:37 +08:00
..
assets/images
…
best_practices
[Docs]add deepseek model doc (#6513)
2026-02-26 14:08:19 +08:00
cli
…
features
[Feature][Sampling] Extend top-k_top-p sampling to all backends and unify greedy decoding with top_k=1 (#6894)
2026-03-19 01:43:10 -07:00
get_started
[Iluvatar] Optimize decode group_gemm and Support cuda graph for ernie (#6803)
2026-03-12 19:21:17 +08:00
observability
[Feature] Tracing: Fine-Grained Tracing for Request Latency Part1 (#5458)
2025-12-16 16:36:09 +08:00
online_serving
[Feature] Update Counter Release (#6943)
2026-03-20 10:51:37 +08:00
quantization
[Feature] Support NVFP4 MoE on SM100 (#6003)
2026-01-29 14:16:07 +08:00
usage
[Docs] Update code overview documentation (#6568)
2026-02-28 16:37:01 +08:00
zh
[Feature] Update Counter Release (#6943)
2026-03-20 10:51:37 +08:00
benchmark.md
…
index.md
[Docs] add doc for glm (#4933)
2025-11-10 21:21:33 +08:00
offline_inference.md
update doc (#4675)
2025-10-30 11:19:04 +08:00
parameters.md
[Docs] Add Doc for Online quantification (#6399)
2026-02-08 22:09:18 -08:00
requirements.txt
fix: correct typo in nvidia_gpu.md (#4848)
2025-11-06 16:03:02 +08:00
supported_models.md
remove load default_v1 since already been as default (#4980)
2025-11-12 16:49:48 +08:00
Powered by Gitea Version: 1.26.0 Page: 1358ms Template: 59ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API