fxyfxy777
|
250ce40b40
|
[Feature] use phi permute/unpermute & rm swiglu (#6361)
* tp文字输出正常
* B eb5 mini文字输出正常
* eb5mini ep B卡 文字输出正常
* default use phi moe op
* stash
* tp H卡正常
* ep ok
* rm debug
* rm debug tool
* rm del ffn_out
* rm swiglu
* add envs to swiglu
* merge dev
* fix ci baseline
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
* fix ci baseline 2
---------
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
|
2026-03-12 02:01:57 -07:00 |
|
sunxin
|
53aaac69da
|
[Optimization] Enable BF16 gate computation for GLM and Qwen (#6457)
* gate bf16
* add gate-fp32
* fix
* update baseline
* update
* update
* fix
|
2026-02-26 21:08:46 -08:00 |
|
chen
|
72fe94cb13
|
[Feature] support glm tp+dp+ep (#6317)
|
2026-02-05 21:47:01 +08:00 |
|
GoldPancake
|
646aced1eb
|
[UT] Add GLM E2E tests for non-MTP and MTP (#6163)
* add glm ut
|
2026-01-23 10:34:29 +08:00 |
|