This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 00:17:25 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
52eda7fdb3a3e272dd3d6e3b518a48f03af60699
FastDeploy
/
custom_ops
/
gpu_ops
/
speculate_decoding
T
History
…
..
draft_model
…
ngram_match.cc
…
speculate_calcu_accept_ratio.cu
…
speculate_clear_accept_nums.cu
…
speculate_get_output_padding_offset.cu
…
speculate_get_output.cc
[Feature][MTP]support new speculative decoding method named hybrid mtp with ngram (
#3610
)
2025-08-26 14:29:22 +08:00
speculate_get_padding_offset.cu
…
speculate_get_seq_lens_output.cu
…
speculate_msg.h
…
speculate_save_output.cc
…
speculate_set_value_by_flags.cu
…
speculate_step_reschedule.cu
…
speculate_step_system_cache.cu
…
speculate_step.cu
…
speculate_stop_generation_multi_stop_seqs.cu
…
speculate_token_penalty_multi_scores.cu
…
speculate_update_input_ids_cpu.cc
…
speculate_update.cu
…
speculate_verify.cu
…
top_p_candidates.cu
…