This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 00:17:25 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
412867fd99ef1e25c873a3c573b69e1f61b19663
FastDeploy
/
fastdeploy
/
model_executor
T
History
qw86972190
135e47d551
[XPU]ZMQ logprob (
#5628
)
...
* [XPU]ZMQ logprob
2025-12-25 14:50:01 +08:00
..
graph_optimization
[Others] add assert and only count the actual load in cuda_graph (
#5445
)
2025-12-10 11:22:54 +08:00
guided_decoding
[Feature] Guided Decoding add LLguidance backend (
#5124
)
2025-12-03 20:23:57 +08:00
layers
[GraphOptimization] Wrap deep gemm and triton as python op (
#5673
)
2025-12-24 15:23:46 +08:00
logits_processor
[Feature] support logits processors (
#4515
)
2025-10-29 00:08:53 +08:00
model_loader
[Loader]Fix bug in MTP weight loading (
#5744
)
2025-12-25 11:32:17 +08:00
models
[Loader]Fix bug in MTP weight loading (
#5744
)
2025-12-25 11:32:17 +08:00
ops
[Iluvatar] Support V1_KVCACHE_SCHEDULER and paddleocr-vl rope mode (
#5555
)
2025-12-18 02:14:25 -08:00
__init__.py
polish code with new pre-commit rule (
#2923
)
2025-07-19 23:19:27 +08:00
entropy_utils.py
[Feature] Entropy calculation support (
#5692
)
2025-12-23 21:19:47 +08:00
forward_meta.py
[Intel HPU] enable tensor_wise_fp8 (
#5324
)
2025-12-17 16:45:03 +08:00
load_weight_utils.py
remove fastsafetensors (
#5371
)
2025-12-04 19:22:04 +08:00
pre_and_post_process.py
[Feature] Entropy calculation support (
#5692
)
2025-12-23 21:19:47 +08:00
utils.py
[Loader]Fix bug in MTP weight loading (
#5744
)
2025-12-25 11:32:17 +08:00
xpu_pre_and_post_process.py
[XPU]ZMQ logprob (
#5628
)
2025-12-25 14:50:01 +08:00