FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00

Files

AIbin 47bfd45bb6 [Docs]add deepseek model doc (#6513)

* add deepseek model doc

2026-02-26 14:08:19 +08:00

assets/images

Update documentation (#3975)

2025-09-08 16:53:37 +08:00

best_practices

[Docs]add deepseek model doc (#6513)

2026-02-26 14:08:19 +08:00

cli

[docs] add cli usage to docs (#4569)

2025-10-28 10:35:11 +08:00

features

[Speculative Decoding] Support suffix decoding (#6403)

2026-02-26 11:42:05 +08:00

get_started

[Metax][Docs] update metax guidance documents (#6515)

2026-02-26 14:04:23 +08:00

observability

[Feature] Tracing: Fine-Grained Tracing for Request Latency Part1 (#5458)

2025-12-16 16:36:09 +08:00

online_serving

[Metrics] Support cpu-cache-block-num (#6390)

2026-02-09 10:27:56 +08:00

quantization

[Feature] Support NVFP4 MoE on SM100 (#6003)

2026-01-29 14:16:07 +08:00

usage

[BugFix] PD reorder fix and add ut (#6375)

2026-02-09 04:42:48 -08:00

[Docs]add deepseek model doc (#6513)

2026-02-26 14:08:19 +08:00

benchmark.md

[Docx] add language (en/cn) switch links (#4470)

2025-10-17 15:47:41 +08:00

index.md

[Docs] add doc for glm (#4933)

2025-11-10 21:21:33 +08:00

offline_inference.md

update doc (#4675)

2025-10-30 11:19:04 +08:00

parameters.md

[Docs] Add Doc for Online quantification (#6399)

2026-02-08 22:09:18 -08:00

requirements.txt

fix: correct typo in nvidia_gpu.md (#4848)

2025-11-06 16:03:02 +08:00

supported_models.md

remove load default_v1 since already been as default (#4980)

2025-11-12 16:49:48 +08:00