Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-05-10 09:31:48 +08:00
Code Issues Actions 6 Packages Projects Releases Wiki Activity
Files
08c411518ffa09650cf63aa9b50604c56c82fa3c
FastDeploy/tests/layers
T
History
fxyfxy777 4c92035f2d [Feature] Unify fp8 block_wise quant ops (#5991)
* quant stash

* blockwise_quant

* precommit

* rm tensor.cut

* tp ok

* add swiglu

* rm outdate code

* fix activate ut

* change baseline

* fix baseline error
2026-01-15 05:50:37 -08:00
..
test_activation.py
[Feature] Unify fp8 block_wise quant ops (#5991)
2026-01-15 05:50:37 -08:00
test_append_attention_with_output.py
…
test_append_attention.py
…
test_attention_layer.py
[UT]support attention test tp (#5887)
2026-01-06 11:15:01 +08:00
test_batched_count_greater_than.py
…
test_ep_moe_expert_dispatch_fp8.py
…
test_ffn.py
…
test_fusedmoe.py
[UNITEST] add EP TP test_fused_moe CI (#5989)
2026-01-15 21:37:32 +08:00
test_guided_decoding.py
…
test_min_sampling.py
…
test_moba_attention_backend.py
…
test_native_paddle_backend.py
…
test_plas_attention.py
…
test_quantized_linear.py
…
test_repetition_early_stopper.py
…
test_sampler.py
…
test_speculative_sampler.py
[Feature]Support tag phase token enforce generation (#6034)
2026-01-15 03:59:55 -08:00
test_w4a8_moe.py
…
test_w4afp8_moe.py
…
Powered by Gitea Version: 1.26.0 Page: 1951ms Template: 38ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API