周周周
|
2b4748de4f
|
[MTP] refactor MTP pre_process (#6358)
|
2026-02-09 10:47:15 +08:00 |
|
sunxin
|
ef47e6eb46
|
[Others]skip to_tensor (#6342)
|
2026-02-04 17:25:19 +08:00 |
|
MingkunZhang
|
e109fb9a0e
|
[Metax][Fix] fix issues based #6259 (#6338)
|
2026-02-03 23:21:35 -08:00 |
|
sunxin
|
9b0a82cfa9
|
[Model Runner] Support overlap schedule (#6259)
|
2026-02-04 10:49:44 +08:00 |
|
bukejiyu
|
12d4b4cb87
|
[Feature]Support reorder ids to split prefill and decodes (#5779)
* support reorder ids
* perfect code
* fix
* fix unittest
* delete code
* fix
* add python api
* delete custom op
* update algorithm
* fix swap
* support condense
* support condense
* support mtp
* delete code
* update
* update
* update
* update
* update for other platfrom
* update
* fix
* fix mtp
* fix ut
* update
* fix ut
* update ut
* fix
* fix encoder_cache
* fix ci
* fix
* fix vl
* Fix performance regression
* fix
* fix
* fix mtp
* fix index->req_id mapping
* fix ut
---------
Co-authored-by: root <root@yqlcc01-sys-rpm12rzmwjd.yqlcc01.baidu.com>
Co-authored-by: K11OntheBoat <“ruianmaidanglao@163.com”>
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com>
|
2026-02-03 00:28:02 -08:00 |
|