201 Commits

Author SHA1 Message Date
YuBaoku 0f57c67344 [CI] Pin PaddlePaddle to release/3.3 last_commit build (#7547) 2026-04-21 23:29:36 +08:00
YuBaoku 511657aa83 [Cherry-Pick][CI] Align logprobs test baselines with Paddle Update(#7481) (#7482) 2026-04-20 13:20:05 +08:00
YuBaoku 00fb52e657 [Cherry-Pick][CI] Sync dev optimizations to 2.5(#7315) (#7342)
* [Cherry-Pick][CI] Sync dev optimizations to 2.5(#7315)
2026-04-12 13:22:06 +08:00
Jiaxin Sui b5bbb84752 [Metax][CI]Update branch triggers in CI workflow (#7285) 2026-04-09 17:23:49 +08:00
YuBaoku c735f76ec8 [Cherry-Pick][CI] Use GPU-Build-RL runner for _build_linux_rl.yml(#7186) (#7197) 2026-04-03 22:23:07 +08:00
YuBaoku 7648164f6e [Cherry-Pick][CI] Sync parallelism optimization from dev to 2.5 (#7085) (#7140)
* [Cherry-Pick][CI] Sync parallelism optimization from dev to 2.5 (#7085)
2026-04-02 14:21:47 +08:00
YuBaoku 474174bb3b [Cherry-Pick][CI] Remove skip logic for *.txt-only changes (#7104) (#7118) 2026-03-31 23:23:34 +08:00
YuBaoku f4caa18422 [CI] Adapt to codecov action changes for Node.js 24 (#7064) (#7065) 2026-03-29 20:50:14 +08:00
YuBaoku 92f90d3e73 [Cherry-Pick][CI] Sync develop fix and optimizations to 2.5(#6975) (#6987)
* [Cherry-Pick][CI] Sync develop fix and optimizations to 2.5(#6975)
2026-03-24 19:27:10 +08:00
YuBaoku d585b222ce [Cherry-Pick][CI] Sync develop optimizations to 2.5(#6745) (#6964) 2026-03-22 21:02:08 +08:00
fxyfxy777 5b10c14889 [Cherry-Pick][Feature] use phi permute/unpermute & rm swiglu (#6361) (#6808)
* [Feature] use phi permute/unpermute  & rm swiglu (#6361)

* tp text output is normal

* B eb5 mini text output is normal

* eb5mini ep on B card: text output is normal

* default use phi moe op

* stash

* tp on H card is normal

* ep ok

* rm debug

* rm debug tool

* rm del ffn_out

* rm swiglu

* add envs to swiglu

* merge dev

* fix ci baseline

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* fix ci baseline 2

---------

Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>

* revert glm4_moe.py to release/2.5

* fix baseline

* fix baseline 2

---------

Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-13 08:00:22 -07:00
YuBaoku 4d900d09af [CI][2.5] Update _build_linux_rl.yml to use cu129 release nightly 2026-03-12 11:20:50 +08:00
yinwei f103a143db [XPU][CI]Cherry-Pick PR and Update CI Case (#6619)
* [XPU] Fix PD + MTP (#6495)

* fix pd + mtp

* fix code style

* fix PD + MTP, D get P's first token

* add anno for gpu(speculate_update)

* update draft insertv1

* fix wrapper & kernel

* fix wrapper

* fix code style

* fix tp4 dp1 (#6624)

* update paddle whl package

---------

Co-authored-by: cmcamdy <1027740945@qq.com>
2026-03-11 10:57:30 +08:00
YuBaoku 2851d89696 [Cherry-Pick][CI] Sync CI optimizations from develop to release/2.5(#6645) (#6684) 2026-03-06 11:57:44 +08:00
YuBaoku e301a3cc98 [Cherry-Pick][XPU][CI] Fix XPU CI Bug (#6658) (#6679)
Removed triton package installation and uninstallation steps.
Modify docker pull command to avoid failure on error.
Co-authored-by: Jiaxin Sui <95567040+plusNew001@users.noreply.github.com>
2026-03-05 22:16:14 +08:00
Jiaxin Sui 80fdf0db4e [XPU][CI]lock xvllm version(#6640) 2026-03-04 18:59:42 +08:00
YuBaoku 2e61ce13b1 [CI] Switch 2.5 branch to use Paddle release/3.3 (#6621) 2026-03-03 19:08:59 +08:00
YuBaoku 3d83083929 [CI] Optimize unittest and fix title format (#6464) (#6468)
* [CI] Optimize unit test duration and fix PR title format
2026-02-25 13:02:12 +08:00
YuBaoku 582e52f7cd Revert "[XPU] change base XPU docker image (#6411)" (#6427) (#6459)
This reverts commit 32bd40a192.

Co-authored-by: Jiaxin Sui <95567040+plusNew001@users.noreply.github.com>
2026-02-11 16:48:56 +08:00
YuBaoku b84056fdaa [CI] Fix stable_test and add cherry-pick automation (#6415) 2026-02-09 23:10:12 +08:00
Lucas 32bd40a192 [XPU] change base XPU docker image (#6411) 2026-02-09 22:53:12 +08:00
YuBaoku 5c9bc13a59 [CI] Fix check-bypass.yml 2026-02-05 18:06:39 +08:00
YuBaoku 2d3fb81d29 [CI] Update check-bypass.yml (#6360) 2026-02-05 15:52:30 +08:00
YuBaoku cae2709eff [CI] Update stable test workflow and run.sh script (#6352) 2026-02-05 11:01:15 +08:00
Zhang Yulong f3c12be4d2 Update _build_linux_rl.yml (#6274) 2026-01-29 19:10:47 +08:00
YuBaoku bb7c1d13e1 [CI] Remove --ipc=host and --pid=host from _stable_test.yml (#6270) 2026-01-29 17:06:06 +08:00
YuBaoku b07b76e03f [CI] Fix nightly cu129 build_outputs upload failure (#6264) 2026-01-28 23:39:39 +08:00
YuBaoku 85db063da6 [CI] Fix workflow validation error in publish_job 2026-01-28 10:44:30 +08:00
YuBaoku 029cceec33 [CI] Switch nightly build to use FD_UNIFY_BUILD (#6246)
* [CI] Adapt build script for unified and arch-specific builds

* [CI] Switch nightly build to use FD_UNIFY_BUILD
2026-01-27 23:53:42 +08:00
YuBaoku d975f6acdd [CI] adjust resource scheduling of _stable_test (#6235) 2026-01-27 22:31:13 +08:00
Divano ba9d2a9e5a [CI] add update weights tests (#6242) 2026-01-27 20:54:21 +08:00
YuBaoku 1cfb042045 [CI] Add ep4_mtp e2e test (#6153)
* [CI] Add ep4_mtp e2e test
2026-01-22 14:54:18 +08:00
YuBaoku fe5ba4b509 [CI] Update image used by build_rl in ce_job.yml 2026-01-21 20:57:50 +08:00
yzwu 837ddca273 [Iluvatar][CI] Fix the error max_tokens_per_expert referenced before assignment (#6083) 2026-01-21 16:01:29 +08:00
YuBaoku c991fda54c [CI] Enable 4-GPU e2e test in nightly and fix docker_tag_build (#6128) 2026-01-20 22:44:29 +08:00
YuBaoku ac6fa6d725 [CI] Add 4-GPU e2e test job (#6082) 2026-01-19 10:42:14 +08:00
Jiaxin Sui 70a962df53 [XPU][CI] XPU CI refactor (#6053)
* add xpu ci case

* add xpu ci case

* add xpu ci case

* Change runner from XPU-P800-8Card to XPU-P800
2026-01-16 20:57:58 +08:00
MingkunZhang 0d372e4fb2 [Metax][CI] update jenkins github action version (#6065) 2026-01-16 15:06:14 +08:00
fxyfxy777 4c92035f2d [Feature] Unify fp8 block_wise quant ops (#5991)
* quant stash

* blockwise_quant

* precommit

* rm tensor.cut

* tp ok

* add swiglu

* rm outdated code

* fix activate ut

* change baseline

* fix baseline error
2026-01-15 05:50:37 -08:00
YuBaoku 0d3dede273 [CI] Add fd-router build_task (#5967)
* [CI] Add fd-router build_task
2026-01-12 22:03:27 +08:00
YuBaoku ff2eba1f43 [CI] Temporarily disable fp8_cases in base_tests (#5963)
* [CI] Temporarily disable fp8_cases in base_tests
2026-01-08 23:29:37 +08:00
Jiaxin Sui dc170e3005 [XPU][CI]Update CI workflow to include all file types (#5943)
Removed paths-ignore for markdown and text files.
2026-01-08 12:03:26 +08:00
YuBaoku 5088d4acdb [CI] Add daily build_linux jobs for CUDA 12.9 (#5936)
To extend the daily CI coverage by adding Linux build jobs for CUDA 12.9.
2026-01-07 23:20:11 +08:00
YuBaoku 37a128e240 [CI] Fix reusable workflow output mapping in _build_linux_rl.yml
Fix incorrect job reference in reusable workflow outputs, which caused the RL wheel path to be dropped.
2026-01-04 21:22:07 +08:00
YuBaoku 55f77e9ab1 [CI] Add commit-level build_linux task for RL (#5857) 2026-01-04 20:31:27 +08:00
YuBaoku 98519ee2e9 [CI] Fix archive URL injection in tag image build (#5828) 2025-12-30 14:28:17 +08:00
YuBaoku c3ccfa974c [CI] Fix path error and port conflict (#5803) 2025-12-27 12:50:58 +08:00
YuBaoku 8808dd1fed [CI] Enable custom_device_check in CI rerun (#5786)
* [CI] Enable custom_device_check in CI rerun
2025-12-26 14:09:16 +08:00
YuBaoku 7247dc5f3a [CI] Add retry and robust cleanup for removal (#5725)
* [CI] Add retry and robust cleanup for removal

* [CI] Ensure clean GPU memory by killing leftover processes
2025-12-25 17:08:27 +08:00
Jiaxin Sui 8fc789bb3f [iluvatar][CI] refactor iluvatar_ci (#5588)
* refactor iluvatar_ci

* refactor iluvatar_ci

* refactor iluvatar_ci

* refactor iluvatar_ci

* refactor iluvatar_ci

* refactor iluvatar_ci

* refactor iluvatar_ci

* refactor iluvatar_ci

* refactor iluvatar_ci

* Update Docker image tag in iluvatar_test workflow

* Update default Docker image version in workflow

* Update iluvatar_test.yml

* Update default Docker image in workflow config

* Update model path in run_ernie300B_4layer.py

* Update model path in offline inference check

* Add model_data directory and copy model files

Create model_data directory and copy necessary files.

* Update run_ernie_vl_28B.py

* Update run_ernie300B_4layer.py

* Update paddlepaddle installation method in script

* Change wget command to include proxy option

* Modify paddle package installation in CI script

Updated installation commands for paddle packages.

* Update paddlepaddle and paddle-iluvatar-gpu versions

* Delete .github/workflows/ci_iluvatar.yml

* Rename workflow from ILUVATAR Test to ILUVATAR-CI

* Update installation commands for paddlepaddle and iluvatar
2025-12-25 15:10:34 +08:00