Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
Code Issues Actions 21 Packages Projects Releases Wiki Activity
Files
bf7e2424d0c32ba1ef452b9ec30d1e3eba3cd68e
FastDeploy/tests/model_loader
T
History
sunxin 33e01f22a8 [Feature][Sampling] Extend top-k_top-p sampling to all backends and unify greedy decoding with top_k=1 (#6894)
* update sampling

* fix

* fix

* fix mtp

* fix test
2026-03-19 01:43:10 -07:00
..
test_dummy_loader.py
[Loader] support dummy load weight (#6169)
2026-01-26 13:58:53 +08:00
test_load_attention.py
【FIX】Change the name of sparse attn from moba to plas (#4006) (#4076)
2025-09-23 10:26:40 +08:00
test_load_ernie_vl.py
[CI] Optimize port cleanup logic (#4860)
2025-11-06 19:13:48 +08:00
test_load_mtp.py
[Optimization] Support FA2/FA3/FA4 with attn_mask_q (#6354)
2026-02-05 14:39:00 +08:00
test_model_cache.py
[Optimization] Support FA2/FA3/FA4 with attn_mask_q (#6354)
2026-02-05 14:39:00 +08:00
test_offline_model.py
[Optimization] Support FA2/FA3/FA4 with attn_mask_q (#6354)
2026-02-05 14:39:00 +08:00
test_torch_model.py
[Feature][Sampling] Extend top-k_top-p sampling to all backends and unify greedy decoding with top_k=1 (#6894)
2026-03-19 01:43:10 -07:00
test_w4a8_model.py
Update test_w4a8_model.py (#4125)
2025-09-16 20:43:10 +08:00
utils.py
[CI] fix test_model_cache (#4982)
2025-11-12 20:26:49 +08:00
Powered by Gitea Version: 1.26.0 Page: 376ms Template: 20ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API