Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
Code Issues Actions 19 Packages Projects Releases Wiki Activity
Files
da74a5f0b34d1c2486369b2a1ce807ad199a9249
FastDeploy/fastdeploy/model_executor/models
T
History
chen da74a5f0b3 fix glm all_reduce tp group (#4187)
2025-09-22 10:56:55 +08:00
..
ernie4_5_vl
[v1 loader]qwen Offline fp8 (#4036)
2025-09-15 13:44:11 +08:00
qwen2_5_vl
[Model]support qwen2_5_vl (#3557)
2025-08-29 18:28:39 +08:00
__init__.py
add input_processor plugin (#3657)
2025-08-28 22:53:57 +08:00
deepseek_v3.py
[FDConfig]Remove max_num_batched_tokens/max_num_seqs in parallel config (#4116)
2025-09-17 10:43:35 +08:00
ernie4_5_moe.py
[v1 loader]qwen Offline fp8 (#4036)
2025-09-15 13:44:11 +08:00
ernie4_5_mtp.py
fix mtp (#4105)
2025-09-15 20:26:07 +08:00
glm4_moe.py
fix glm all_reduce tp group (#4187)
2025-09-22 10:56:55 +08:00
model_base.py
[plugin] Custom model_runner/model support (#3186)
2025-08-04 18:52:39 -07:00
qwen2.py
rename fused_get_rope.cu (#3752)
2025-09-03 10:54:34 +08:00
qwen3.py
rename fused_get_rope.cu (#3752)
2025-09-03 10:54:34 +08:00
qwen3moe.py
rename fused_get_rope.cu (#3752)
2025-09-03 10:54:34 +08:00
tp_utils.py
Supports DP+TP+EP hybrid parallel deployment strategy (#3489)
2025-08-26 00:04:01 -07:00
utils.py
[V1 Loader] support weight_only (#3413)
2025-08-23 13:13:41 +08:00
Powered by Gitea Version: 1.26.0 Page: 391ms Template: 6ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API