AI Systems Performance Engineering: Optimizing Model Training and Inference Workloads with GPUs, CUDA, and PyTorch [eBook] [Chris Fregly]