
Zero-Downtime Upgrades: PostgreSQL and OS/glibc at Global Scale

K.1.105 (La Fontaine) | Day 1 | 16:00 - 16:50 | Speakers: Alexander Sosna



Abstract

Upgrading high-load PostgreSQL databases is a challenge in its own right. With customers around the globe and tight SLAs, these upgrades must run with minimal or even no downtime. This talk shares GitLab's journey from multi-hour maintenance windows to truly zero-downtime upgrades of our PostgreSQL infrastructure. You'll learn the battle-tested techniques we've developed over the last four years: how we execute PostgreSQL major upgrades and OS (glibc) upgrades at the same time, how we prevent data corruption, and how we always keep a rollback path via reverse replication. We'll walk through real production examples, the gotchas we discovered, and the tooling we built. Whether you're managing a single HA cluster or a global fleet, you'll leave with actionable strategies to minimize (or eliminate) downtime during your next major upgrade.

