Skip to main content

The Hidden Cost of Intelligence: The Energy Footprint of AI from Code to GPU Kernels

UD2.120 (Chavanne) | Day 1 | 15:20 - 15:25 | Speakers: Tushar Sharma

The Hidden Cost of Intelligence: The Energy Footprint of AI from Code to GPU Kernels
A picture of a devroom at FOSDEM 2024
Open in browser

Notes

Abstract

The growing energy demands of modern AI models pose a significant barrier to sustainable computing. As model complexity and deployment scale continue to rise, training and inference increasingly contribute to carbon emissions and operational costs. This talk begins by examining the technical challenges of accurately measuring energy consumption at multiple levels of abstraction—from system-wide and process-level metrics down to individual source code methods and API calls. Practical strategies for overcoming these measurement hurdles are discussed. The second part of the talk explores power consumption patterns in GPU kernels, highlighting how thread configuration, block geometry, and power limit settings shape kernel-level energy efficiency. We demonstrate how these characteristics influence power draw and discuss techniques for predicting consumption based on kernel properties. The session concludes with insights and best practices for managing performance–energy trade-offs in GPU-accelerated AI applications, offering a path toward more sustainable AI development.


Notice: The placeholder video image is licensed under CC BY-SA 4.0. The original image can be found hereChanges made to the image are: Cropped the image to a new ratio, part of the image was cut off.