FastDeploy/docs at daa95244f7bd9f1e3f09542808e335bf703d7cb3 - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00

qwes5s5 daa95244f7 abort requests (#6992 )

2026-03-31 11:02:26 +08:00

Update docs (#3975 )

2025-09-08 16:53:37 +08:00

[Docs]add deepseek model doc (#6513 )

2026-02-26 14:08:19 +08:00

[docs] add cli uasge to docs (#4569 )

2025-10-28 10:35:11 +08:00

[RL] Adapt async rollout checkpoint update flow (#7042 )

2026-03-30 19:19:34 +08:00

[Iluvatar] Support wi4a16 group_gemm (#7078 )

2026-03-30 19:03:51 +08:00

[Feature] Tracing: Fine-Grained Tracing for Request Latency Part1 (#5458 )

2025-12-16 16:36:09 +08:00

abort requests (#6992 )

2026-03-31 11:02:26 +08:00

[Feature] Support NVFP4 MoE on SM100 (#6003 )

2026-01-29 14:16:07 +08:00

[Optimization] Optimize ttft for prefill pd (#6680 )

2026-03-30 20:36:23 +08:00

abort requests (#6992 )

2026-03-31 11:02:26 +08:00

benchmark.md

[Docx] add language (en/cn) switch links (#4470 )

2025-10-17 15:47:41 +08:00

index.md

[Docs] add doc for glm (#4933 )

2025-11-10 21:21:33 +08:00

offline_inference.md

update doc (#4675 )

2025-10-30 11:19:04 +08:00

parameters.md

[Docs] Add Doc for Online quantification (#6399 )

2026-02-08 22:09:18 -08:00

requirements.txt

fix: correct typo in nvidia_gpu.md (#4848 )

2025-11-06 16:03:02 +08:00

supported_models.md

remove load default_v1 since already been as default (#4980 )

2025-11-12 16:49:48 +08:00