[V1 loader] Qwen2.5 VL: support v1 loader and torch-style safetensors load (#4388)

* [BugFix] qwen2.5vl: fix enable_thinking=true and image_patch_id bugs

* [Docs] offline inference: add apply_chat_template add_generation_prompt parameter

* [Model] qwen2.5VL: support --use-cudagraph

* [Model] qwen2.5VL: support --use-cudagraph buffer and qwenvl test

* [Model] qwen2.5VL: support --use-cudagraph buffer and qwenvl test v2

* [Model] qwen2.5VL: support --use-cudagraph buffer and qwenvl test v3

* [Model] qwen2.5VL: support --use-cudagraph buffer and qwenvl test v4

* [Model] qwen2.5VL: support --use-cudagraph buffer and qwenvl test v5

* [Model] qwen2.5VL: support --use-cudagraph buffer and qwenvl test v6

* [Model] qwen2.5VL: support --use-cudagraph buffer and qwenvl test v7

* qwen25vl v1 loader

* qwen25vl v1 loader v2

* qwen25vl v1 loader v3

* qwen25vl v1 loader fix tp2 weight PySafeSlice

* qwen25vl v1 loader no test

* qwen25vl v1 loader add unit test

* qwen25vl v1 loader add unit test v2

* qwen25vl v1 loader add torch unit test v3

* qwen25vl v1 loader add torch unit test v4

* qwen25vl v1 loader add torch unit test v5

* qwen25vl v1 loader add torch unit test v6
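
One of the bullets above adds the `add_generation_prompt` parameter of `apply_chat_template` to the offline-inference docs. A minimal sketch of what that parameter does in ChatML-style templating (the helper below is hypothetical; only the parameter name comes from the commit message):

```python
# Hypothetical sketch of an apply_chat_template-style helper; only the
# add_generation_prompt parameter name is taken from the commit above.

def apply_chat_template(messages, add_generation_prompt=False):
    """Render messages in a simplified ChatML-like layout."""
    out = ""
    for m in messages:
        out += f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n"
    if add_generation_prompt:
        # Open the assistant turn so the model continues with its reply
        # instead of predicting a new user turn.
        out += "<|im_start|>assistant\n"
    return out

msgs = [{"role": "user", "content": "Describe this image."}]
print(apply_chat_template(msgs, add_generation_prompt=True))
```

Without `add_generation_prompt=True`, the rendered prompt ends after the last user message, which is typically the wrong shape for generation.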
This commit is contained in:
CSWYF3634076
2025-10-27 10:54:15 +08:00
committed by GitHub
parent 5c6105f4a2
commit acd331780c
8 changed files with 697 additions and 20 deletions
@@ -256,7 +256,6 @@ def is_paddle_support_v1_loader():
 def v1_loader_support(fd_config):
     _v1_no_support_archs = [
         "Qwen2VLForConditionalGeneration",
-        "Qwen2_5_VLForConditionalGeneration",
     ]
     def _err_msg(msg: str) -> str:
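
The hunk above removes `Qwen2_5_VLForConditionalGeneration` from the v1 loader's unsupported-architecture list, so Qwen2.5 VL now passes the gate. A minimal sketch of how such a denylist gate works (only `v1_loader_support`, `_v1_no_support_archs`, and `_err_msg` appear in the diff; the function signature and return shape here are assumptions):

```python
# Sketch of the v1-loader gate from the diff above. The list contents and
# the names v1_loader_support / _v1_no_support_archs / _err_msg come from
# the source; taking a plain list of architectures (rather than fd_config)
# and returning a (supported, reason) tuple are simplifications.

_v1_no_support_archs = [
    "Qwen2VLForConditionalGeneration",
    # "Qwen2_5_VLForConditionalGeneration" was removed by this PR,
    # so Qwen2.5 VL is no longer rejected here.
]

def v1_loader_support(architectures):
    """Return (supported, reason) for the given model architectures."""
    def _err_msg(msg: str) -> str:
        return f"{msg}; falling back to the default loader"
    for arch in architectures:
        if arch in _v1_no_support_archs:
            return False, _err_msg(f"{arch} does not support the v1 loader")
    return True, ""

supported, reason = v1_loader_support(["Qwen2_5_VLForConditionalGeneration"])
print(supported)  # True after this PR's change
```

With the arch removed from the denylist, Qwen2.5 VL checkpoints go through the v1 loader path (including the torch-style safetensors load the PR title mentions) instead of the fallback.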