* enable trtllm_all_reduce fusion kernel in glm model
* fix conflict
* format update
* fix a bug
* modify test
* modify test
* support empty tensor and modify test
* fix test_linear config issues
* modify test name
* add edge test case
* modify format
* fix conflict
* modify default max token num in trtllm_allreduce_fusion
* add max token num branch for trtllm_allreduce_fusion
* fix format
* fix rmsnorm config issue
* modify 2025 to 2026
* using compat grard
* Lazily import flashinfer.comm and fix test config issue
* fix test issues
* add flashinfer cache dir clean machine
* fix some issues
* Add linear layer tests for model executor
* Refine linear layer tests for uncovered branches
* Refactor and enhance tests for linear layers
Refactor test_linear.py by removing unused imports and redundant code, and updating model configuration parameters. Add new tests for linear layers and their loading mechanisms.
* test: patch row-parallel alltoall in unit test
* test: avoid alltoall reshape failure in row-parallel
* test: expand linear coverage targets
* Refine linear tests per review feedback
* Fix linear tests for pre-sharded config and qkv fixture
---------
Co-authored-by: CSWYF3634076 <wangyafeng@baidu.com>