This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-05-08 16:32:41 +08:00
Code
Issues
Actions
7
Packages
Projects
Releases
Wiki
Activity
Files
54f7d9f62128cf8317d170eb87605914f5d059de
FastDeploy
/
custom_ops
/
gpu_ops
/
speculate_decoding
/
draft_model
T
History
周周周
2b4748de4f
[MTP] refactor MTP pre_process (
#6358
)
2026-02-09 10:47:15 +08:00
..
draft_model_postprocess.cu
…
draft_model_preprocess.cu
[Speculative Decoding] Fix attn_mask_offset for multi-step MTP in mixed and PD-split modes (
#5738
)
2025-12-25 01:54:59 -08:00
draft_model_set_value_by_flags.cu
…
draft_model_update.cu
[MTP] refactor MTP pre_process (
#6358
)
2026-02-09 10:47:15 +08:00
eagle_get_hidden_states.cu
…
eagle_get_self_hidden_states.cu
…
hydra_fetch_hidden_states.cu
…
mtp_save_first_token.cc
…
mtp_step_paddle.cu
…
ngram_match_mixed.cu
…