[BugFix][Optimization] Replace silent failures with catchable exceptions and informative error messages (#6533)

* init

* init

* fix format

* add

* add files

* add ut

* fix some

* add ut

* add more

* add

* fix pre-commit

* fix pre-commit

* fix cover

* skip long seq

* add

* add

* fix

* remove not need

* fix set attr

* fix comments

* fix comments

* fix failed tests

---------

Co-authored-by: gongweibao <gognweibao@baidu.com>
This commit is contained in:
gongweibao
2026-03-16 21:32:43 +08:00
committed by GitHub
parent d113397b09
commit a6351dea0b
61 changed files with 1595 additions and 171 deletions
+3 -2
View File
@@ -198,7 +198,9 @@ class EngineServiceClient:
suffix=ipc_suffix,
create=False,
)
except:
except (
Exception
): # IPCSignal may not yet be created by workers; broad except covers platform-specific IPC errors
# Signal not ready yet
time.sleep(wait_interval)
elapsed_time += wait_interval
@@ -523,7 +525,6 @@ class AsyncLLM(EngineServiceClient):
remaining = num_choices
while remaining > 0:
response_list = await response_queue.get()
for response_item in response_list:
if (
isinstance(response_item, dict) or isinstance(response_item, Request)