Background

NVIDIA Deep Learning Performance Guide

https://docs.nvidia.com/deeplearning/performance/index.html#optimizing-performance

Full-Stack, GPU-based Acceleration of Deep Learning

NVIDIA CVPR’23 Tutorial https://nvlabs.github.io/EfficientDL/

Full-Stack Deep Learning

https://fullstackdeeplearning.com/course/2022/

Efficient Deep Learning Systems

https://github.com/mryab/efficient-dl-systems