supports internode_ll_two_stage (#4162)

* supports internode_ll_two_stage

* supports internode_ll_two_stage

* supports internode_ll_two_stage

* supports internode_ll_two_stage

* supports D internode_ll_two_stage

* fix code style

* fix xpu internode_ll_two_stage

* fix xpu internode_ll_two_stage
lzy
2025-11-04 16:35:40 +08:00
committed by GitHub
parent 8a40374bfe
commit af7e0f27f3
6 changed files with 165 additions and 38 deletions
@@ -636,6 +636,11 @@ def parse_args():
         action="store_true",
         help="enable chunked prefill",
     )
+    parser.add_argument(
+        "--use_internode_ll_two_stage",
+        action="store_true",
+        help="enable internode_ll_two_stage",
+    )
     parser.add_argument(
         "--speculative_config",
         type=json.loads,
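
For reviewers who want to confirm the flag's behavior, the following is a minimal standalone sketch, not the project's actual `parse_args()`. It registers only the new `--use_internode_ll_two_stage` option exactly as the hunk does; the `build_parser` helper name is hypothetical. Because the option uses `action="store_true"`, it should default to `False` and become `True` only when passed on the command line.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical helper: registers only the flag added in this commit."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--use_internode_ll_two_stage",
        action="store_true",  # no value expected; presence of the flag sets True
        help="enable internode_ll_two_stage",
    )
    return parser


# Flag omitted: store_true options default to False.
args_default = build_parser().parse_args([])

# Flag present: the attribute becomes True.
args_enabled = build_parser().parse_args(["--use_internode_ll_two_stage"])

print(args_default.use_internode_ll_two_stage)
print(args_enabled.use_internode_ll_two_stage)
```

Since the default is `False`, existing launch commands that do not pass the flag keep their previous behavior.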