Friday, October 4 • 11:45 - 12:30
Autopsy of a MySQL Automation Disaster

You deployed automation, enabled automatic database master failover, and tested it many times: great, you can now sleep at night without being paged by a failing server. However, when you wake up in the morning, things might not have gone the way you expected. This talk is about one such surprise.

Once upon a time, a failure brought down a MySQL master database. Automation kicked in and fixed things. However, a fancy failure, combined with human errors, an edge-case recovery, and a lack of oversight in tooling and scripting, led to a split-brain and data corruption. This talk will go into detail about the convoluted (but still real-world) sequence of events that led to this disaster. I will cover what could have prevented the split-brain and what could have made data reconciliation easier.

Speakers
Jean-François Gagné

Infrastructure Engineer / System and MySQL Expert, MessageBird
Jean-François is a System/Infrastructure Engineer and MySQL Expert. One year ago, he joined MessageBird, an IT telco startup in Amsterdam, with the mission of scaling the MySQL infrastructure. Before that, J-F worked on growing the Booking.com MySQL and MariaDB installations (he...


Friday October 4, 2019 11:45 - 12:30 BST
Track 1: The Liffey B