FastDeploy/docs at 3a4e139f65901a41ed78a206d61de0bde0ce7a06 - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00

Files

T

History

mouxin 96b0ecea6b [Feature] Update Counter Release (#6943 )

2026-03-20 10:51:37 +08:00

..

…

[Docs]add deepseek model doc (#6513 )

2026-02-26 14:08:19 +08:00

…

[Feature][Sampling] Extend top-k_top-p sampling to all backends and unify greedy decoding with top_k=1 (#6894 )

2026-03-19 01:43:10 -07:00

[Iluvatar] Optimize decode group_gemm and Support cuda graph for ernie (#6803 )

2026-03-12 19:21:17 +08:00

[Feature] Tracing: Fine-Grained Tracing for Request Latency Part1 (#5458 )

2025-12-16 16:36:09 +08:00

[Feature] Update Counter Release (#6943 )

2026-03-20 10:51:37 +08:00

[Feature] Support NVFP4 MoE on SM100 (#6003 )

2026-01-29 14:16:07 +08:00

[Docs] Update code overview documentation (#6568 )

2026-02-28 16:37:01 +08:00

[Feature] Update Counter Release (#6943 )

2026-03-20 10:51:37 +08:00

benchmark.md

…

index.md

[Docs] add doc for glm (#4933 )

2025-11-10 21:21:33 +08:00

offline_inference.md

update doc (#4675 )

2025-10-30 11:19:04 +08:00

parameters.md

[Docs] Add Doc for Online quantification (#6399 )

2026-02-08 22:09:18 -08:00

requirements.txt

fix: correct typo in nvidia_gpu.md (#4848 )

2025-11-06 16:03:02 +08:00

supported_models.md

remove load default_v1 since already been as default (#4980 )

2025-11-12 16:49:48 +08:00