This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2026-04-23 00:17:25 +08:00
Code
Issues
Actions
19
Packages
Projects
Releases
Wiki
Activity
Files
1d3ae7c0244a054e07b0de37e30859b1e184a60a
FastDeploy
/
fastdeploy
/
model_executor
T
History
lizexu123
1d3ae7c024
[BugFix] fix w4afp8 tp=8 (
#5868
)
...
* fix w4afp8 tp=8 * fix
2026-01-05 18:59:02 +08:00
..
graph_optimization
[Others] add assert and only count the actual load in cuda_graph (
#5445
)
2025-12-10 11:22:54 +08:00
guided_decoding
[Feature] Guided Decoding add LLguidance backend (
#5124
)
2025-12-03 20:23:57 +08:00
layers
[BugFix] fix w4afp8 tp=8 (
#5868
)
2026-01-05 18:59:02 +08:00
logits_processor
[Feature] support logits processors (
#4515
)
2025-10-29 00:08:53 +08:00
model_loader
[Loader]Fix bug in MTP weight loading (
#5744
)
2025-12-25 11:32:17 +08:00
models
[Feature] support w4afp8 v1_loader and v0_loader(tp>1) (
#5757
)
2025-12-30 14:11:52 +08:00
ops
[Iluvatar] Fix FD launch error when specifing CUDA_VISBLE_DEVICE (
#5735
)
2025-12-26 14:01:27 +08:00
__init__.py
…
entropy_utils.py
[BugFix] Fix entropy bugs (
#5818
)
2025-12-29 20:44:29 -08:00
forward_meta.py
[Intel HPU] enable tensor_wise_fp8 (
#5324
)
2025-12-17 16:45:03 +08:00
load_weight_utils.py
remove fastsafetensors (
#5371
)
2025-12-04 19:22:04 +08:00
pre_and_post_process.py
[Feature] Entropy calculation support (
#5692
)
2025-12-23 21:19:47 +08:00
utils.py
[Feature] support w4afp8 v1_loader and v0_loader(tp>1) (
#5757
)
2025-12-30 14:11:52 +08:00
xpu_pre_and_post_process.py
[XPU] Speculative Decoding with PD (
#5856
)
2026-01-05 17:31:03 +08:00