This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 00:17:25 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
0d989829bbd404ef428de667d0f98298c86de0bf
FastDeploy
/
fastdeploy
/
model_executor
/
models
T
History
chen
0d989829bb
Compatible with EB 0.3B torch model arch (
#3913
)
...
* fix * check
2025-09-05 19:04:59 +08:00
..
ernie4_5_vl
[Feature]
ernie4_5_vl_moe
support huggingface safetensor loading (
#3750
)
2025-09-03 02:58:59 -07:00
qwen2_5_vl
[Model]support qwen2_5_vl (
#3557
)
2025-08-29 18:28:39 +08:00
__init__.py
add input_processor plugin (
#3657
)
2025-08-28 22:53:57 +08:00
deepseek_v3.py
【Inference Optimize】Update MergedReplicatedLinear for DSK qkv_a_proj_with_mqa. (
#3673
)
2025-09-04 21:16:05 -07:00
ernie4_5_moe.py
Compatible with EB 0.3B torch model arch (
#3913
)
2025-09-05 19:04:59 +08:00
ernie4_5_mtp.py
support tmp (
#3675
)
2025-08-28 19:42:32 +08:00
model_base.py
[plugin] Custom model_runner/model support (
#3186
)
2025-08-04 18:52:39 -07:00
qwen2.py
rename fused_get_rope.cu (
#3752
)
2025-09-03 10:54:34 +08:00
qwen3.py
rename fused_get_rope.cu (
#3752
)
2025-09-03 10:54:34 +08:00
qwen3moe.py
rename fused_get_rope.cu (
#3752
)
2025-09-03 10:54:34 +08:00
tp_utils.py
Supports DP+TP+EP hybrid parallel deployment strategy (
#3489
)
2025-08-26 00:04:01 -07:00
utils.py
[V1 Loader] support weight_only (
#3413
)
2025-08-23 13:13:41 +08:00