Loading…
August 23-34, 2022 - Virtual
View More Details & Registration
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for Open Source Summit Latin America 2022 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in Eastern Daylight Time (UTC -4). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date."
Tuesday, August 23 • 11:00am - 11:40am
A Guardian Angel of File Systems: Monitoring for File System Errors in Data Centers - Gabriel Krisman Bertazi, Collabora [Presented in English]

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
A fact of life, one that almost every computer user has to face at some point, is that file systems fail. Whether it is due a faulty hardware, a sudden power loss, or a kernel bug, users don't enjoy losing their data for no reason.

When it happens on a user-facing machine, the problem is usually found and troubleshooted by inspecting the kernel log. The user, after some grumpiness, can then start a recovery process, like fsck, replace the disk, or restore from backup. But this kind of simple monitoring simply doesn't scale to a data-center. When dealing with a fleet of machines, kernel logs are lost, printk messages change or get truncated, and, more often than not, system administrators are left without any warning until it is too late.

In this talk, I will present a new kernel mechanism for daemons to monitor file system health through a series of push notification events from the kernel, which are triggered as soon as a volume corruption is found by the file system code. This feature, merged in Linux 5.16, once picked up by distros, won't prevent errors from happening, but will allow system administrators to reliably watch their volumes, and act immediately when necessary.

Speakers
GK

Gabriel Krisman Bertazi

Senior Software Engineer, Collabora
Gabriel Krisman Bertazi is a Senior Software Engineer with the Collabora Core Kernel team. He works all around the kernel to implement features and fix bugs to make Linux a successful platform for any device, be it a gaming platform or the operating system of choice for Cloud pro... Read More →


Tuesday August 23, 2022 11:00am - 11:40am EDT
ROOM 4
  LinuxCon