Skip to main content

From Infrastructure to Production: A Year of Self-Hosted LLMs

UD2.120 (Chavanne) | Day 1 | 16:30 - 16:50 | Speakers: Mateusz Charytoniuk, Gosia Zagajewska, Luiz Miguel

From Infrastructure to Production: A Year of Self-Hosted LLMs
A picture of a devroom at FOSDEM 2024
Open in browser

Notes

Abstract

Last year, I shared Paddler, an open-source LLM load balancer. A year of community feedback and building Poet (a static site generator with AI features) on top of it taught me what actually matters when self-hosting LLMs. This talk shares practical patterns the open-source community needs. What works, what doesn't, and what tooling we still need to build together.

Attachments


Notice: The placeholder video image is licensed under CC BY-SA 4.0. The original image can be found hereChanges made to the image are: Cropped the image to a new ratio, part of the image was cut off.