FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-23 00:17:25 +08:00

Files

T

K11OntheBoat 870dbac370 Use triton qk_norm both in Prefill and Decode (#7213 )

Co-authored-by: “liuruian” <liuruian@baidu.com>

2026-04-10 15:44:01 +08:00

2026-04-10 14:13:42 +08:00

…

2026-04-10 15:44:01 +08:00

…

2026-04-09 23:40:15 -07:00

2026-04-09 11:01:03 +08:00

…

__init__.py

…

entropy_utils.py

…

forward_meta.py

2026-04-03 17:41:33 +08:00

load_weight_utils.py

2026-04-09 23:40:15 -07:00

pre_and_post_process.py

…

utils.py

…

xpu_pre_and_post_process.py

…