[Feature] support v1 update/clear api for RL (#6761)

* [Feature] support v1 update/clear api for RL

* [fix] fix execute_model and add sleep/wakeup api

* [fix] fix mtp and key_prefix

* [chore] move _update_key_prefix to resume method

* [fix] make the interface safe to call multiple times

* [fix] fix some tiny bugs

* [chore] make small changes against pr review

* [docs] add docs for weight update

* [test] add some tests and update docs

* [style] fix code style check

* [test] fix ci

* [fix] fix stale control responses when control method timed out

* [chore] remove unused code

* [chore] fix code style

* [chore] optimize tags and key_prefix

* [test] fix ci

* [chore] fix code style

* [test] fix ci

* [fix] fix ep control

* [fix] fix ep control for engine cache queue
This commit is contained in:
Yonghua Li
2026-03-25 19:18:46 +08:00
committed by GitHub
parent 48cfb608aa
commit a7f52c300d
26 changed files with 1857 additions and 392 deletions
+6 -1
View File
@@ -184,9 +184,14 @@ class DealerConnectionManager:
self.request_num[request_id] -= 1
if self.request_num[request_id] == 0:
self._update_load(conn_index, -1)
else:
api_server_logger.warning(
f"request_id {request_id} not in request_map, available keys: {list(self.request_map.keys())}"
)
except Exception as e:
api_server_logger.error(f"Listener error: {str(e)}")
api_server_logger.error(f"Listener error: {str(e)}\n{traceback.format_exc()}")
break
api_server_logger.info(f"Listener loop ended for conn_index {conn_index}")
def _update_load(self, conn_index, delta):
"""Update connection load and maintain the heap"""