kevin
|
52edf5e9b3
|
fix mtp acceptance rate decline (#6470)
|
2026-02-12 19:56:10 +08:00 |
|
kevin
|
3ce842b55b
|
[BugFix] add reset shared inputs when update weight dummy run (#6331)
* fix dummy run input bug
* update code
* update code
* update code
* update code
|
2026-02-10 10:29:03 +08:00 |
|
bukejiyu
|
5bfc0938e2
|
[BugFix] PD reorder fix and add ut (#6375)
|
2026-02-09 04:42:48 -08:00 |
|
周周周
|
2b4748de4f
|
[MTP] refactor MTP pre_process (#6358)
|
2026-02-09 10:47:15 +08:00 |
|
sunxin
|
ef47e6eb46
|
[Others]skip to_tensor (#6342)
|
2026-02-04 17:25:19 +08:00 |
|
MingkunZhang
|
e109fb9a0e
|
[Metax][Fix] fix issues based #6259 (#6338)
|
2026-02-03 23:21:35 -08:00 |
|
sunxin
|
9b0a82cfa9
|
[Model Runner] Support overlap schedule (#6259)
|
2026-02-04 10:49:44 +08:00 |
|
bukejiyu
|
12d4b4cb87
|
[Feature]Support reorder ids to split prefill and decodes (#5779)
* support reorder ids
* perfect code
* fix
* fix unittest
* delete code
* fix
* add python api
* delete custom op
* update algorithm
* fix swap
* support condense
* support condense
* support mtp
* delete code
* update
* update
* update
* update
* update for other platfrom
* update
* fix
* fix mtp
* fix ut
* update
* fix ut
* update ut
* fix
* fix encoder_cache
* fix ci
* fix
* fix vl
* Fix performance regression
* fix
* fix
* fix mtp
* fix index->req_id mapping
* fix ut
---------
Co-authored-by: root <root@yqlcc01-sys-rpm12rzmwjd.yqlcc01.baidu.com>
Co-authored-by: K11OntheBoat <“ruianmaidanglao@163.com”>
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com>
|
2026-02-03 00:28:02 -08:00 |
|