Commit Graph

248 Commits

Author SHA1 Message Date
YuBaoku 1e08ee74e5 [CI] Modify 4-card container startup config and move test case (#7363) 2026-04-13 05:23:49 -07:00
YuBaoku fcf8b1336d [CI] Fix nightly test error and add container cleanup in build_rl (#7335)
* [CI] Fix nightly test error and add container cleanup in build_rl
2026-04-11 12:14:46 +08:00
YuBaoku 1269eda2f9 [CI] Ensure container cleanup after job to avoid resource leakage (#7315)
* [CI] Ensure container cleanup after job to avoid resource leakage

* [CI] Use prebuilt wheels to install xgrammar==0.1.19 and torch==2.6.0
2026-04-10 22:32:18 +08:00
YuBaoku 924690b791 [CI] Add no_proxy configuration for docker execution (#7283) 2026-04-09 19:20:33 +08:00
YuBaoku 49b3d0d326 [CI] increase shm-size to 128G and set nproc/nofile limits in _unit_test_coverage.yml (#7227)
* [CI] increase shm-size to 128G in _unit_test_coverage.yml

* [CI] set nproc/nofile limits in _unit_test_coverage.yml
2026-04-08 11:35:10 +08:00
YuBaoku 8cb417e8fb [CI] Improve Code Prepare stability and cleanup logic (#7198) 2026-04-07 10:31:36 +08:00
YuBaoku da3dfe1c80 [CI] Use GPU-Build-RL runner for _build_linux_rl.yml (#7186) 2026-04-03 20:24:39 +08:00
YuBaoku 7aa213bba9 [CI] Replace ipc=host with shm-size and sysctl configuration (#7138) 2026-04-02 10:33:55 +08:00
YuBaoku c6f0c5c3a6 [CI] Optimize test execution with single-GPU parallelism (#7085)
* [CI] Optimize test execution with single-GPU parallelism and log collection

* remove export CUDA_VISIBLE_DEVICES

* fix path error

* fix log_* path and debug

* [CI] Optimize test execution with single-GPU parallelism and log collection
2026-04-01 14:18:40 +08:00
YuBaoku db6e637f4f [CI] Remove skip logic for *.txt-only changes (#7104) 2026-03-31 13:24:50 +08:00
YuBaoku a7cbe3ff91 [CI] Adapt to codecov action changes for Node.js 24 (#7064) 2026-03-29 16:49:44 +08:00
Zhang Yulong f25760f4e6 [CI] Update docker run command in unit test coverage workflow (#7050)
Removed the --ipc=host option from the docker run command.
2026-03-27 19:53:09 +08:00
Jiaxin Sui c3ed7db28d [XPU] [CI] Fix xpu ci bug (#7014)
* fix xpu ci bug

* Remove unnecessary blank line in conftest.py

* Update upload-artifact action to version 6

* Update _xpu_8cards_case_test.yml

* fix ci bug

* Change exit code on test failure to 1

* fix ci bug

* fix ci bug

* fix ci bug

* fix ci bug

* Update conftest.py
2026-03-27 10:29:34 +08:00
Zhang Yulong a31d4bfbdf [CI] update mtp case (#7031) 2026-03-27 10:21:37 +08:00
Jiang-Jia-Jun 482f951ee9 Update copilot-instructions.md 2026-03-25 21:09:24 +08:00
YuBaoku 1b276e62d4 [CI] Upgrade GitHub Actions for Node 24 compatibility (#6975)
* [CI] Upgrade GitHub Actions for Node 24 compatibility
2026-03-23 20:38:22 +08:00
YuBaoku fdd12ff5ba [CI] Fix: incorrect downstream job execution when only build_gpu/xpu is skipped (#6958)
* [CI] Fix: incorrect downstream job execution when only build_gpu/xpu is skipped

* [CI] Fix: avoid skipping required jobs by moving skip logic to steps

* [CI] Fix: Invalid secret, github-token is not defined
2026-03-22 17:00:18 +08:00
YuBaoku 030820db4c [CI] Optimize CI: refine check-bypass/cancel logic and fix nightly task (#6939)
* [CI] Optimize CI: add check-bypass for workflow skip control

* fix ci_image_build and publish_job

* [CI] Optimize CI: add check-bypass and cancel

* [CI] update to PFCCLab/ci-bypass@v2
2026-03-20 19:34:45 +08:00
YuBaoku 7141db0e01 [CI] Optimize CI: update nightly test_image build workflow (#6937) 2026-03-19 17:39:01 +08:00
YuBaoku 2b84a4276e [CI] Optimize CI: add timeout and cancel on PR close (#6933) 2026-03-19 15:54:30 +08:00
jc dd55cda3c8 [CI] Add test for pd and cache storage (#6876)
* Add test for pd and cache storage

* up

* up

* fix bug

* fix bug

* up docker image

* up
2026-03-19 10:38:27 +08:00
YuBaoku 07543685ec [CI] Isolate cache and ccache for CUDA 13.0 build 2026-03-18 11:41:46 +08:00
YuBaoku d73fd876ba [CI] Add daily build_linux jobs for CUDA 13.0 (#6809) 2026-03-12 22:04:58 +08:00
YuBaoku ab0eacb1ab [CI] Update _build_linux_rl.yml to use Paddle installation method with URL 2026-03-12 20:37:51 +08:00
yzwu 901b38c936 [Iluvatar] Optimize decode group_gemm and Support cuda graph for ernie (#6803) 2026-03-12 19:21:17 +08:00
fxyfxy777 250ce40b40 [Feature] use phi permute/unpermute & rm swiglu (#6361)
* tp文字输出正常

* B eb5 mini文字输出正常

* eb5mini ep B卡 文字输出正常

* default use phi moe op

* stash

* tp H卡正常

* ep ok

* rm debug

* rm debug tool

* rm del ffn_out

* rm swiglu

* add envs to swiglu

* merge dev

* fix ci baseline

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* fix ci baseline 2

---------

Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-12 02:01:57 -07:00
YuBaoku deff121a5f [CI] Update _build_linux_rl.yml to use cu129 nighlty 2026-03-11 23:58:07 +08:00
Jiaxin Sui 25c479312d [CI][MetaX]Add timeout to Jenkins job trigger step (#6755) 2026-03-10 14:26:26 +08:00
YuBaoku cbfdf42628 [CI] Add test_dynamic_c8_cache.py and latest FastDeploy.tar.gz upload (#6708) 2026-03-08 16:01:12 +08:00
Jiaxin Sui 63414ccc13 [XPU][CI] Fix XPU CI Bug (#6658)
* Simplify dependency installation in workflow

Removed triton installation and uninstallation steps.

* Simplify dependency installation in workflow

Removed triton package installation and uninstallation steps.

* Allow docker pull to continue on failure

* Update docker pull command to continue on error

Modify docker pull command to avoid failure on error.

* Allow docker pull to fail without error
2026-03-05 11:27:11 +08:00
YuBaoku 5c8f5184d9 [CI] Add pytest timeout and enable workflow rerun (#6645) 2026-03-04 21:30:16 +08:00
YuBaoku c3d6d706d5 [CI] Add nightly workflow for golang_router tests and improve log handling (#6608)
* [CI] Add nightly workflow for Golang router tests
* [CI] Improve pytest script stability and log handling
2026-03-03 19:36:57 +08:00
YuBaoku 9d72332aca [CI] Optimize unittest and fix title format (#6464)
* [CI] Optimize unit test duration and fix PR title format
2026-02-11 20:48:56 +08:00
Jiaxin Sui e40fb16912 Revert "[XPU] change base XPU docker image (#6411)" (#6427)
This reverts commit 32bd40a192.
2026-02-11 16:31:54 +08:00
kesmeey e4e3a71e7b [CI] 【Hackathon 10th Spring No.22】功能模块 fastdeploy/cache_manager/cache_transfer_manager.py 单测补充 (#6157)
* Add comprehensive test coverage for cache_transfer_manager.py

* Fix code style: add newline at end of file

* fix: update cache transfer manager tests for branch 22 interface changes

* fix: resolve test errors for cache transfer manager

* fix: update cache transfer manager tests for branch 22 interface changes

* style: apply pre-commit formatting to tests/cache_manager/test_cache_transfer_manager.py

* Run codestyle: format tests/cache_manager/test_cache_transfer_manager.py and related fixes

* Update test_cache_transfer_manager.py

* Format cache transfer manager tests

* Update cache transfer manager tests

* Update unit test coverage workflow

---------

Co-authored-by: CSWYF3634076 <wangyafeng@baidu.com>
2026-02-11 11:23:57 +08:00
YuBaoku 390d0f2d77 [CI] Fix cherry-pick automation (#6448)
* [CI] Fix cherry-pick automation
2026-02-10 22:45:29 +08:00
YuBaoku a918738b8f [CI] Optimize cherry-pick automation (#6445) 2026-02-10 21:48:13 +08:00
YuBaoku b84056fdaa [CI] Fix stable_test and add cherry-pick automation (#6415) 2026-02-09 23:10:12 +08:00
Lucas 32bd40a192 [XPU] change base XPU docker image (#6411) 2026-02-09 22:53:12 +08:00
YuBaoku 5c9bc13a59 [CI] Fix check-bypass.yml 2026-02-05 18:06:39 +08:00
YuBaoku 2d3fb81d29 [CI] Update check-bypass.yml (#6360) 2026-02-05 15:52:30 +08:00
YuBaoku cae2709eff [CI] Update stable test workflow and run.sh script (#6352) 2026-02-05 11:01:15 +08:00
Zhang Yulong f3c12be4d2 Update _build_linux_rl.yml (#6274) 2026-01-29 19:10:47 +08:00
YuBaoku bb7c1d13e1 [CI] Remove --ipc=host and --pid=host from _stable_test.yml (#6270) 2026-01-29 17:06:06 +08:00
YuBaoku b07b76e03f [CI] Fix nightly cu129 build_outputs upload failure (#6264) 2026-01-28 23:39:39 +08:00
YuBaoku 85db063da6 [CI] Fix workflow validation error in publish_job 2026-01-28 10:44:30 +08:00
YuBaoku 029cceec33 [CI] Switch nightly build to use FD_UNIFY_BUILD (#6246)
* [CI] Adapt build script for unified and arch-specific builds

* [CI] Switch nightly build to use FD_UNIFY_BUILD
2026-01-27 23:53:42 +08:00
YuBaoku d975f6acdd [CI] adjust resource scheduling of _stable_test (#6235) 2026-01-27 22:31:13 +08:00
Divano ba9d2a9e5a [CI] add update weights tests (#6242) 2026-01-27 20:54:21 +08:00
YuBaoku 1cfb042045 [CI] Add ep4_mtp e2e test (#6153)
* [CI] Add ep4_mtp e2e test
2026-01-22 14:54:18 +08:00