apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
39ff38aba17ca23488b66d5e8f4c00c4e0ba3b24
FastDeploy/tests/model_loader
bukejiyu c62f6b4ea5 [Others] Fix PD reorder for MTP (#6792)
* fix pd reorder in mtp

* add ut

* update

* fix mtp
2026-03-23 21:10:22 +08:00
test_dummy_loader.py
[Loader] support dummy load weight (#6169)
2026-01-26 13:58:53 +08:00
test_load_attention.py
【FIX】Change the name of sparse attn from moba to plas (#4006) (#4076)
2025-09-23 10:26:40 +08:00
test_load_ernie_vl.py
[CI] Optimize port cleanup logic (#4860)
2025-11-06 19:13:48 +08:00
test_load_mtp.py
[Optimization] Support FA2/FA3/FA4 with attn_mask_q (#6354)
2026-02-05 14:39:00 +08:00
test_model_cache.py
[Optimization] Support FA2/FA3/FA4 with attn_mask_q (#6354)
2026-02-05 14:39:00 +08:00
test_offline_model.py
[Optimization] Support FA2/FA3/FA4 with attn_mask_q (#6354)
2026-02-05 14:39:00 +08:00
test_torch_model.py
[Feature][Sampling] Extend top-k_top-p sampling to all backends and unify greedy decoding with top_k=1 (#6894)
2026-03-19 01:43:10 -07:00
test_w4a8_model.py
Update test_w4a8_model.py (#4125)
2025-09-16 20:43:10 +08:00
utils.py
[Others] Fix PD reorder for MTP (#6792)
2026-03-23 21:10:22 +08:00