This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-05-09 17:04:42 +08:00
Code
Issues
Actions
7
Packages
Projects
Releases
Wiki
Activity
Files
672620cdfeac535ff9a6fac23ed0a60472f12a5b
FastDeploy
/
tests
/
operators
T
History
lizexu123
6d323769dd
fix w4afp8 (
#5634
)
2025-12-22 13:39:41 +08:00
..
test_air_top_p_sampling.py
…
test_cutlass_fp8_fp8_fp8_dual_gemm_fused.py
…
test_cutlass_scaled_mm.py
…
test_deqant_int8_cpp_extension.py
…
test_dequant.py
…
test_draft_model_postprocess.py
…
test_draft_model_preprocess.py
…
test_draft_model_set_value_by_flags.py
…
test_draft_model_update.py
…
test_dynamic_per_token_scaled_fp8_quant.py
…
test_eagle_get_hidden_states.py
…
test_eagle_get_self_hidden_states.py
…
test_flash_mask_attn.py
…
test_fp8_fp8_half_cuda_core_gemm.py
…
test_fused_get_rotary_embedding.py
…
test_fused_hadamard_quant_fp8.py
…
test_fused_moe.py
…
test_fused_neox_rope_embedding.py
…
test_fused_rotary_position_encoding.py
…
test_gelu_tanh.py
…
test_get_padding_offset.py
Remove CUDA ERROR 9 of inputs of get_padding_offset kernel (
#5440
)
2025-12-09 14:17:30 +08:00
test_get_position_ids_and_mask_encoder_batch.py
…
test_get_token_penalty_multi_scores.py
…
test_group_swiglu_with_masked.py
…
test_hybrid_mtp_ngram.py
…
test_limit_thinking_content_length.py
…
test_machete_mm.py
…
test_masked_per_token_quant.py
Revert "[Feature] add ue8m0 for per_token_quant_fp8 (
#5563
)" (
#5611
)
2025-12-17 13:59:06 +08:00
test_moe_redundant_topk_select.py
…
test_moe_top_k_select.py
…
test_ngram_match.py
…
test_noaux_tc_redundant.py
…
test_noaux_tc.py
…
test_per_token_quant.py
Revert "[Feature] add ue8m0 for per_token_quant_fp8 (
#5563
)" (
#5611
)
2025-12-17 13:59:06 +08:00
test_pre_cache_len_concat.py
…
test_rebuild_padding.py
…
test_rejection_top_p_sampling.py
…
test_scaled_gemm_f8_i4_f16.py
…
test_set_value_by_flags_and_idx.py
…
test_share_external_data.py
…
test_speculate_get_output_padding_offset.py
…
test_speculate_get_padding_offset.py
Remove CUDA ERROR 9 of inputs of get_padding_offset kernel (
#5440
)
2025-12-09 14:17:30 +08:00
test_speculate_get_seq_lens_output.py
…
test_speculate_get_target_logits.py
…
test_speculate_get_token_penalty_multi_scores.py
…
test_speculate_insert_first_token.py
…
test_speculate_limit_thinking_content_length.py
[BugFix] fix speculate_limit_thinking_content_length (
#5590
)
2025-12-16 04:31:45 -08:00
test_speculate_set_stop_value_multi_seqs.py
[Feature] support stop_token_ids (
#5399
)
2025-12-09 17:49:12 +08:00
test_speculate_update.py
…
test_speculate_verify.py
…
test_speculative_schedule_cache.py
…
test_split_fuse.py
…
test_stop_generation_multi_ends.py
[Feature] support stop_token_ids (
#5399
)
2025-12-09 17:49:12 +08:00
test_token_penalty.py
…
test_top_k_renorm_probs.py
…
test_top_p_candidates.py
…
test_tree_mask.py
…
test_tritonmoe_preprocess.py
…
test_update_attn_mask.py
…
test_update_inputs_v1.py
…
test_w4afp8_gemm.py
fix w4afp8 (
#5634
)
2025-12-22 13:39:41 +08:00
test_wfp8afp8_sparse_gemm.py
…