FastDeploy/tests/model_executor at 30db3e9d8f2ab7c7cb18b515d0eed08162577b73 - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00

Files

T

History

Bingoo 6b891da02b [Optimization] enable trtllm_all_reduce fusion kernel in glm model (#6660 )

* enable trtllm_all_reduce fusion kernel in glm model

* fix conflict

* format update

* fix a bug

* modify test

* modify test

* support empty tensor and modify test

* fix test_linear config issues

* modify test name

* add edge test case

* modify format

* fix conflict

* modify default max token num in trtllm_allreduce_fusion

* add max token num branch for trtllm_allreduce_fusion

* fix format

* fix rmsnorm config issue

* modify 2025 to 2026

* using compat grard

* Lazily import flashinfer.comm and fix test config issue

* fix test issues

* add flashinfer cache dir clean machine

* fix some issues

2026-04-16 14:10:19 +08:00

..

guided_decoding

[Others]update paddleformer 1.0.0 (#6496 )

2026-03-11 15:06:29 +08:00

[BugFix][Optimization] Replace silent failures with catchable exceptions and informative error messages (#6533 )

2026-03-16 21:32:43 +08:00

test_entropy_utils.py

[Bugfix] Fix entropy calculation bugs (#5941 )

2026-01-08 20:57:35 +08:00

test_ep.py

[Feature] Support redundant expert for eplb (#5918 )

2026-01-09 17:13:24 +08:00

test_ernie4_5_mtp.py

[CI]【Hackathon 10th Spring No.43】ernie4_5_mtp 单测补充 (#6738 )

2026-03-27 17:15:53 +08:00

test_ernie_tokenizer.py

[CI]【Hackathon 9th Sprint No.52】NO.52 功能模块 fastdeploy/model_executor/guided_decoding/ernie_tokenizer.py 单测补充 (#5047 )

2025-12-29 13:44:56 +08:00

test_forward_meta_str.py

remove input_ids from ForwardMeta (#4793 )

2025-11-05 11:55:51 +08:00

test_fused_moe_wint2_backend.py

[CI]【Hackathon 10th Spring No.37】功能模块 fastdeploy/model_executor/layers/moe/fused_moe_wint2_backend.py单测补充 (#6286 )

2026-02-04 10:46:26 +08:00

test_linear.py

[Optimization] enable trtllm_all_reduce fusion kernel in glm model (#6660 )

2026-04-16 14:10:19 +08:00

test_load_weight_utils.py

[CI]【Hackathon 10th Spring No.32】load_weight_utils unit test (#6740 )

2026-03-20 13:14:30 +08:00

test_logits_processor.py

[Docs] Add License in Unittest (#4957 )

2025-11-12 10:44:09 +08:00

test_model_executor_utils.py

[BugFix][Optimization] Replace silent failures with catchable exceptions and informative error messages (#6533 )

2026-03-16 21:32:43 +08:00

test_paddleformers_base.py

[BugFix][Models] Unify PaddleFormers fused QKV TP loading and stabilize fallback TP path (#6555 )

2026-03-20 16:37:58 +08:00

test_paddleformers_dense_text_fallback.py

[BugFix][Models] Unify PaddleFormers fused QKV TP loading and stabilize fallback TP path (#6555 )

2026-03-20 16:37:58 +08:00

test_pooler.py

[CI] 【Hackathon 9th Sprint No.19】NO.19 功能模块单测补充 (#5063 )

2025-12-18 21:32:44 +08:00

test_thinking_budget.py

PD deployment support without router (#7412 )

2026-04-15 20:13:07 +08:00

test_tp_utils.py

[Others] Rename tensor_parallel_degree to tensor_model_parallel_size for paddleformers 0.4.1 (#5727 )

2025-12-23 23:19:11 -08:00