FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-24 01:29:57 +08:00

Files

T

K11OntheBoat 870dbac370 Use triton qk_norm both in Prefill and Decode (#7213 )

Co-authored-by: “liuruian” <liuruian@baidu.com>

2026-04-10 15:44:01 +08:00

2026-04-09 11:05:10 +08:00

Split enable_mm (#7183 )

2026-04-08 11:25:41 +08:00

2026-03-24 10:56:00 +08:00

2026-04-09 16:17:56 +08:00

…

2026-04-10 14:13:42 +08:00

2026-04-01 20:29:55 +08:00

__init__.py

…

activation.py

…

embeddings.py

…

linear.py

2026-04-03 18:02:03 +08:00

lm_head.py

…

mtp_linear.py

…

normalization.py

2026-04-10 15:44:01 +08:00

pooler.py

…

rotary_embedding.py

…

utils.py

2026-03-30 11:37:04 +08:00