[KSM] support keep sampling mask (#7146)

* [KSM] support keep sampling mask

* Remove Comments

* remove logz_per_batch

* fix the description and checking
This commit is contained in:
Zero Rains
2026-04-03 11:30:54 +08:00
committed by GitHub
parent 44ef7b6758
commit 72bf3dbdfd
23 changed files with 519 additions and 8 deletions
@@ -166,6 +166,7 @@ class TestTokenProcessorProcessBatchOutput(unittest.TestCase):
processor.total_step_per_request = {}
processor.accept_token_num_per_head_per_request = {}
processor.accept_token_num_per_head = [0] * MAX_DRAFT_TOKENS
processor.use_sampling_mask = False
# processor._recycle_resources = Mock()