Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00
Code Issues Actions 19 Packages Projects Releases Wiki Activity
Files
fbc3aa93de37c0f52275032ab1ad1f36bcb9e683
FastDeploy/tests/model_loader
T
History
bukejiyu c62f6b4ea5 [Others] Fix PD reorder for MTP (#6792)
* fix pd reorder in mtp

* add ut

* update

* fix mtp
2026-03-23 21:10:22 +08:00
..
test_dummy_loader.py
[Loader] support dummy load weight (#6169)
2026-01-26 13:58:53 +08:00
test_load_attention.py
【FIX】Change the name of sparse attn from moba to plas (#4006) (#4076)
2025-09-23 10:26:40 +08:00
test_load_ernie_vl.py
[CI] Optimize port cleanup logic (#4860)
2025-11-06 19:13:48 +08:00
test_load_mtp.py
[Optimization] Support FA2/FA3/FA4 with attn_mask_q (#6354)
2026-02-05 14:39:00 +08:00
test_model_cache.py
[Optimization] Support FA2/FA3/FA4 with attn_mask_q (#6354)
2026-02-05 14:39:00 +08:00
test_offline_model.py
[Optimization] Support FA2/FA3/FA4 with attn_mask_q (#6354)
2026-02-05 14:39:00 +08:00
test_torch_model.py
[Feature][Sampling] Extend top-k_top-p sampling to all backends and unify greedy decoding with top_k=1 (#6894)
2026-03-19 01:43:10 -07:00
test_w4a8_model.py
Update test_w4a8_model.py (#4125)
2025-09-16 20:43:10 +08:00
utils.py
[Others] Fix PD reorder for MTP (#6792)
2026-03-23 21:10:22 +08:00
Powered by Gitea Version: 1.26.0 Page: 240ms Template: 27ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API