apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
8f21c9caa6ca283b09bf1827f0faca09633cf466
FastDeploy/tests/model_loader
bukejiyu 14d46181b8 [Loader] add multi-thread model loading (#6877)
* multi-thread-loader

* fix ut
2026-04-09 23:40:15 -07:00
File | Last commit | Date
test_dummy_loader.py | [Loader] support dummy load weight (#6169) | 2026-01-26 13:58:53 +08:00
test_load_attention.py | 【FIX】Change the name of sparse attn from moba to plas (#4006) (#4076) | 2025-09-23 10:26:40 +08:00
test_load_ernie_vl.py | [CI] Optimize port cleanup logic (#4860) | 2025-11-06 19:13:48 +08:00
test_load_mtp.py | [Optimization] Support FA2/FA3/FA4 with attn_mask_q (#6354) | 2026-02-05 14:39:00 +08:00
test_model_cache.py | [Optimization] Support FA2/FA3/FA4 with attn_mask_q (#6354) | 2026-02-05 14:39:00 +08:00
test_offline_model.py | [Loader] add multi-thread model loading (#6877) | 2026-04-09 23:40:15 -07:00
test_torch_model.py | [Feature][Sampling] Extend top-k_top-p sampling to all backends and unify greedy decoding with top_k=1 (#6894) | 2026-03-19 01:43:10 -07:00
test_w4a8_model.py | Update test_w4a8_model.py (#4125) | 2025-09-16 20:43:10 +08:00
utils.py | [Loader] add multi-thread model loading (#6877) | 2026-04-09 23:40:15 -07:00