Files
FastDeploy/fastdeploy
google-labs-jules[bot] 18ae6aa4d6 perf: avoid unnecessary dtype casting in RMSNorm
Added checks before calling `.astype` in `fastdeploy/model_executor/layers/normalization.py`. In PaddlePaddle, calling `.astype` allocates a new tensor even if it's already the target dtype, avoiding these casts skips memory allocations and kernel launches on the hot path.
2026-04-19 15:16:05 +00:00
..
2026-04-07 16:30:32 +08:00
2026-04-14 17:28:22 +08:00
2026-04-14 20:04:04 +08:00
2026-03-31 11:02:26 +08:00
2025-07-03 15:43:53 +08:00