FastDeploy/.jules/bolt.md at d9af356400cdc95425e5b03844c8309e45ff9950

apps/FastDeploy

Fork 0

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2026-04-22 16:07:51 +08:00

Files

T

google-labs-jules[bot] d9af356400 ⚡ Bolt: Memoize module availability and device properties lookups

2026-04-20 17:48:30 +00:00

517 B

Raw Blame History

2024-04-20 - Memoizing Hardware and Spec lookups

Learning: Checking paddle.device.cuda.get_device_properties() and importlib.util.find_spec("flashinfer") inside utility functions like get_sm_version() and has_flashinfer() that are called frequently causes significant overhead, taking ~5ms per 10k calls without caching vs ~0.015ms with caching. Action: Use @functools.lru_cache and @cache for functions that query hardware features or module specifications iteratively during model execution.

517 B Raw Blame History

2024-04-20 - Memoizing Hardware and Spec lookups

517 B

Raw Blame History