This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 08:21:53 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
9d3551cfbb5315713813f5820c4e8563d4a6fe39
FastDeploy
/
fastdeploy
/
model_executor
T
History
周周周
609f649dd7
[OP] Add flashmla baseline implementation and precision test (
#7477
)
2026-04-21 13:37:52 +08:00
..
graph_optimization
[RL] Add clear_graph_opt_backend for glm4_mtp (
#7378
)
2026-04-15 19:44:15 +08:00
guided_decoding
…
layers
[OP] Add flashmla baseline implementation and precision test (
#7477
)
2026-04-21 13:37:52 +08:00
logits_processor
…
model_loader
…
models
[Optimization][DeepSeekV3.2]Reducing slot_mapping compute frequency from twice per layer to a single pre-processing step. (
#7367
)
2026-04-16 19:54:12 +08:00
ops
[Others] Fix typo (
#7280
)
2026-04-14 17:28:22 +08:00
__init__.py
…
entropy_utils.py
…
forward_meta.py
[Optimization][DeepSeekV3.2]Reducing slot_mapping compute frequency from twice per layer to a single pre-processing step. (
#7367
)
2026-04-16 19:54:12 +08:00
load_weight_utils.py
…
pre_and_post_process.py
[Speculative Decoding] Add MTP logprob support for PD disaggregation (
#7442
)
2026-04-17 21:37:38 +08:00
utils.py
[Typo] Fix parameter name typo in slice_fn: paramter -> parameter (
#7462
)
2026-04-20 10:06:02 +08:00
xpu_pre_and_post_process.py
[XPU] Unify Spec and non-spec branch.(
#6947
) (
#7180
)
2026-04-16 14:58:38 +08:00