Skip to main content

WebNN and WebLLM on RISC-V: Closing the AI Acceleration Gap with RVV and Tenstorrent

UD2.120 (Chavanne) | Day 1 | 12:40 - 13:00 | Speakers: Yuning Liang, Petr Penzin

WebNN and WebLLM on RISC-V: Closing the AI Acceleration Gap with RVV and Tenstorrent
A picture of a devroom at FOSDEM 2024
Open in browser

Notes

Abstract

As AI workloads move to the browser, the lack of a unified low-level acceleration layer on Linux—equivalent to DirectML or CoreML—creates major bottlenecks. In this talk, we explore how WebNN and next-generation WebLLM can unlock efficient on-device inference on RISC-V, using Tenstorrent hardware and the emerging RVV 1.0 Variable-Length vector ISA. We cover the challenges of WebNN integration on Linux, the importance of WASM support for RVV, and demonstrate progress on running modern LLMs directly in the browser. We will also detail the RVV-enabled WASM implementation path for WebNN and what’s needed upstream.


Notice: The placeholder video image is licensed under CC BY-SA 4.0. The original image can be found hereChanges made to the image are: Cropped the image to a new ratio, part of the image was cut off.