[PD Disaggregation] Write the cache of preempted req to storage and refine PD Disaggregation (#7107)

* Write the cache of preempted req to storage * up * fix
2026-04-23 17:11:21 +08:00 · 2026-04-01 13:15:52 +08:00
parent 3651113ee5
commit af51fc46d6
5 changed files with 35 additions and 19 deletions
@@ -796,7 +796,7 @@ class CacheTransferManager:
                try:
                    valid_gpu_block_ids = self._run_read_storage(
                        task.task_id,
-                        task.token_ids[: match_block_num * self.block_size],
+                        task.token_ids[: match_block_num * self.block_size] if task.token_ids else None,
                        task.start_read_block_idx,
                        k_cache_keys,
                        v_cache_keys,