MLIR-based Data Tiling and Packing for Ryzen AI NPU

Day 1 | 12:50 | 00:20 | K.3.201 | Jorn Tuyls


Note: I'm reworking this at the moment, some things won't work.

The stream isn't available yet! Check back at 12:50.
Get involved in the conversation!Join the chat

The Ryzen AI NPUs consist of an array of vector processors and programmable interconnect to allow granular control of compute and data movement to achieve high performance and power efficiency. This talk presents a MLIR-based data tiling and packing design for these NPUs that leads to optimized machine instructions. Specifically, it shows how we can derive and optimize how data flows through the array from high-level tiling decisions and how we can efficiently utilize a high degree of data packing by leveraging low-level DMA control and capabilities.