FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00

Files

T

sunxin 33e01f22a8 [Feature][Sampling] Extend top-k_top-p sampling to all backends and unify greedy decoding with top_k=1 (#6894 )

* update sampling

* fix

* fix

* fix mtp

* fix test

2026-03-19 01:43:10 -07:00

test_dummy_loader.py

2026-01-26 13:58:53 +08:00

test_load_attention.py

2025-09-23 10:26:40 +08:00

test_load_ernie_vl.py

2025-11-06 19:13:48 +08:00

test_load_mtp.py

2026-02-05 14:39:00 +08:00

test_model_cache.py

2026-02-05 14:39:00 +08:00

test_offline_model.py

2026-02-05 14:39:00 +08:00

test_torch_model.py

2026-03-19 01:43:10 -07:00

test_w4a8_model.py

2025-09-16 20:43:10 +08:00

utils.py

2025-11-12 20:26:49 +08:00