Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-05-09 08:55:00 +08:00
Code Issues Actions 7 Packages Projects Releases Wiki Activity
Files
373b5c38071226270aa75dc78045d7757f477270
FastDeploy/fastdeploy/model_executor
T
History
GoldPancake cfc5b0ccf9 [BugFix] fix mtp logprob bugs in chunk prefill (#5244)
* fix mtp logprob bugs in chunk prefill

* fix

* fix
2025-11-27 11:31:29 +08:00
..
graph_optimization
…
guided_decoding
…
layers
[Optimization] Refine row parallel bias and nranks and moe all_reduce (#5247)
2025-11-26 05:09:09 -08:00
logits_processor
…
model_loader
…
models
[Optimization] Refine row parallel bias and nranks and moe all_reduce (#5247)
2025-11-26 05:09:09 -08:00
ops
…
__init__.py
…
forward_meta.py
…
load_weight_utils.py
[Speculative Decoding][MTP] Support static CacheKV C8 quantization and optimize memory usage (#5155)
2025-11-21 15:10:13 +08:00
pre_and_post_process.py
[BugFix] fix mtp logprob bugs in chunk prefill (#5244)
2025-11-27 11:31:29 +08:00
utils.py
[Optimization] Refine row parallel bias and nranks and moe all_reduce (#5247)
2025-11-26 05:09:09 -08:00
Powered by Gitea Version: 1.26.0 Page: 2605ms Template: 814ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API