Yonghua Li
|
a4f36cc8db
|
[Cherry-Pick] [BugFix] replace ftok with custom_ftok in get_output/save_output ops (#6822) (#6824)
* [BugFix] replace ftok with custom_ftok in get_output/save_output ops
* [Test] add unit test for custom_ftok
* [Chore] create custom_ftok.h
* [Chore] reorganize header file
* [Fix] fix syntax
* [Fix] fix cache messager msg_queue_id+rank_id conflict
|
2026-03-16 14:22:30 +08:00 |
|
Zero Rains
|
25698d56d1
|
polish code with new pre-commit rule (#2923)
|
2025-07-19 23:19:27 +08:00 |
|
freeliuzc
|
d49f8fb30a
|
[Feature][MTP] Support cacheKV transfer in per_chunk mode (#2890)
* support chunk_prefill both normal and speculative_decoding(mtp)
* optimize pd-disaggregation config
* fix bug
|
2025-07-17 17:58:08 +08:00 |
|
jiangjiajun
|
684703fd72
|
[LLM] First commit the llm deployment code
|
2025-06-09 19:20:15 +08:00 |
|