Files
FastDeploy/fastdeploy/model_executor
google-labs-jules[bot] 18ae6aa4d6 perf: avoid unnecessary dtype casting in RMSNorm
Added checks before calling `.astype` in `fastdeploy/model_executor/layers/normalization.py`. In PaddlePaddle, calling `.astype` allocates a new tensor even if it's already the target dtype, avoiding these casts skips memory allocations and kernel launches on the hot path.
2026-04-19 15:16:05 +00:00
..
2026-04-14 17:28:22 +08:00