mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2026-04-23 00:17:25 +08:00
[Feature] Support ThinkingBudget Logits processor to control thinking content length (#6367)
* feat: add thinking budget logits processor
* add unittest
* fix pre-commit
* add unittest
* docs: clarify operator-level vs logits processor usage and conflict guidance

---------

Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com>
@@ -821,6 +821,7 @@ def parse_args():
        help="chunk size of moe input",
    )
    parser.add_argument("--ori_vocab_size", type=int, default=None)
    parser.add_argument("--think_start_id", type=int, default=-1)
    parser.add_argument("--think_end_id", type=int, default=-1)
    parser.add_argument("--image_patch_id", type=int, default=-1)
    parser.add_argument("--line_break_id", type=int, default=-1)
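As a rough illustration of how the token-ID flags in the hunk above behave, here is a minimal standalone sketch. It reproduces only the `add_argument` calls visible in the hunk. The helper `thinking_ids_configured`, the reading of a negative ID as "not configured", and the example IDs 100 and 101 are assumptions made for illustration and are not taken from the FastDeploy source.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Only the flags visible in the hunk; the real parse_args() defines many more.
    parser = argparse.ArgumentParser()
    parser.add_argument("--ori_vocab_size", type=int, default=None)
    parser.add_argument("--think_start_id", type=int, default=-1)
    parser.add_argument("--think_end_id", type=int, default=-1)
    parser.add_argument("--image_patch_id", type=int, default=-1)
    parser.add_argument("--line_break_id", type=int, default=-1)
    return parser


def thinking_ids_configured(args: argparse.Namespace) -> bool:
    # Hypothetical helper: assumes a negative ID means "not configured".
    return args.think_start_id >= 0 and args.think_end_id >= 0


# No flags passed: every token ID keeps its default of -1.
defaults = build_parser().parse_args([])

# Example IDs chosen arbitrarily; real values depend on the model's tokenizer.
custom = build_parser().parse_args(["--think_start_id", "100", "--think_end_id", "101"])

print(thinking_ids_configured(defaults), thinking_ids_configured(custom))
```

Using an integer sentinel instead of `None` keeps every ID argument a plain `int`, which may be convenient when the values are later passed to numeric code. Whether that is the motivation here is not stated in the commit.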