apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
28de91b50feed4c7a6a51ac71934d0d87f1cd62f
FastDeploy/fastdeploy/model_executor
Ryan 28de91b50f [Graph Optimization] SOT+CUDAGraph support ERNIE4.5T VL 28B / 424B (#4645)
* 45TVL support sot+CUDAGraph

* mv unitest from ce_deploy 2 e2e

* add test_EB_VL_Lite_sot_serving

* rm useless line

* add openai_client

* fix unitest && reduce computing resources
2025-10-31 11:38:43 +08:00
Name | Last commit | Date
graph_optimization | [Graph Optimization] Refactor default capture list (#4617) | 2025-10-28 21:31:02 +08:00
guided_decoding | … | …
layers | [noauxtc_kernel] remove useless code (#4643) | 2025-10-30 18:59:04 +08:00
logits_processor | [Feature] support logits processors (#4515) | 2025-10-29 00:08:53 +08:00
model_loader | [Speculative Decoding][MTP]Support mtp in epdptp mode (#4614) | 2025-10-28 16:02:47 +08:00
models | [Graph Optimization] SOT+CUDAGraph support ERNIE4.5T VL 28B / 424B (#4645) | 2025-10-31 11:38:43 +08:00
ops | … | …
__init__.py | … | …
forward_meta.py | fix import image_ops error on some platforms (#4559) | 2025-10-24 16:09:20 +08:00
load_weight_utils.py | [BugFix] fix TPDP mix parallel infer (#4583) | 2025-10-28 16:58:20 +08:00
pre_and_post_process.py | [Feature] Unify the registration name recognition for tool_parser and reasoning_parser to “-” (#4668) | 2025-10-31 10:45:27 +08:00
utils.py | [V1 loader] Qwen25 VL support v1 loader and torch style safetensors load (#4388) | 2025-10-27 10:54:15 +08:00