Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
Code Issues Actions 19 Packages Projects Releases Wiki Activity
Files
820eb60ec667f9fe89ebdf6d6ecf79af270c110f
FastDeploy/fastdeploy/model_executor
T
History
周周周 820eb60ec6 [Others] clean code (#6839)
Co-authored-by: “liuruian” <liuruian@baidu.com>
2026-03-14 11:09:28 +08:00
..
graph_optimization
[Speculative Decoding] Unify Spec and non-spec branch (#6685)
2026-03-10 23:58:44 -07:00
guided_decoding
[Feature] Guided Decoding add LLguidance backend (#5124)
2025-12-03 20:23:57 +08:00
layers
[Others] clean code (#6839)
2026-03-14 11:09:28 +08:00
logits_processor
[Feature] Support ThinkingBudget Logits processor to control thinking content length (#6367)
2026-02-25 14:17:09 +08:00
model_loader
add reconstruct (#6675)
2026-03-10 11:25:37 +08:00
models
[BugFix]rm draft code for glm (#6810)
2026-03-12 23:26:05 -07:00
ops
[Iluvatar] Optimize decode group_gemm and Support cuda graph for ernie (#6803)
2026-03-12 19:21:17 +08:00
__init__.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
entropy_utils.py
[Bugfix] Fix entropy calculation bugs (#5941)
2026-01-08 20:57:35 +08:00
forward_meta.py
fix eb5 mtp(mix) (#6800)
2026-03-13 17:36:57 +08:00
load_weight_utils.py
add reconstruct (#6675)
2026-03-10 11:25:37 +08:00
pre_and_post_process.py
[RL][Cherry-Pick] Support Fully Async and PrefixCache (#6599)
2026-03-12 01:13:30 -07:00
utils.py
[Others]update paddleformer 1.0.0 (#6496)
2026-03-11 15:06:29 +08:00
xpu_pre_and_post_process.py
[XPU] rm stop nums (#6651)
2026-03-12 14:05:58 +08:00
Powered by Gitea Version: 1.26.0 Page: 669ms Template: 11ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API