FastDeploy/fastdeploy/model_executor/layers at c4abb01f9cdcf15d39daae62299b16d36a478b3a - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-05-06 15:40:33 +08:00

Files

T

History

MingkunZhang c4abb01f9c [Metax][Fix] fix 'get_token_penalty_multi_scores' input error based (PaddlePaddle#6069) (#6266 )

2026-01-29 19:24:36 +08:00

..

[Others] enhance deep_ep import and support mixed mode flash_mask_attn (#6238 )

2026-01-28 00:02:02 +08:00

[XPU] change XPU EP interface from xDeepEP to paddle (#5706 )

2026-01-21 18:23:45 +08:00

batch_invariant_ops

…

[Feature] Support NVFP4 MoE on SM100 (#6003 )

2026-01-29 14:16:07 +08:00

…

[Feature] Support NVFP4 MoE on SM100 (#6003 )

2026-01-29 14:16:07 +08:00

[Metax][Fix] fix 'get_token_penalty_multi_scores' input error based (PaddlePaddle#6069) (#6266 )

2026-01-29 19:24:36 +08:00

__init__.py

…

activation.py

…

embeddings.py

…

linear.py

[Models][BugFix] shared experts and dense mlp layer do not require TP split (#6180 )

2026-01-28 18:58:19 +08:00

lm_head.py

…

mtp_linear.py

…

normalization.py

Support MXFP4 for GPT-OSS (#5435 )

2026-01-22 14:21:01 +08:00

pooler.py

…

rotary_embedding.py

…

utils.py

[Feature] Support Ernie FP8 on sm100 (#5593 )

2026-01-29 13:49:54 +08:00