The Changelog: Software Development, Open Source

The BSOD CrowdStrikes back (Friends)

Jul 26, 2024

Robert Ross, Founder and CEO of FireHydrant, digs into the largest outage in IT history, sparked by a CrowdStrike update. The discussion blends humor and insight, focusing on the economic impact of massive system failures and the hurdles of recovering from them. Ross emphasizes the importance of diverse software systems and proactive incident management for resilience. He and the hosts explore the complexities of modern software interdependencies and system crashes, and argue for better cybersecurity practices.

ANECDOTE

Unfamiliarity with CrowdStrike

Jerod Santo hadn't heard of CrowdStrike before the outage, despite its widespread use.
He kept thinking of AC/DC's "Thunderstruck" while reading about CrowdStrike.

INSIGHT

Change as Root Cause

Robert Ross emphasizes that change is the biggest cause of incidents.
He points to Google's statistic that 80% of their incidents are caused by change.

ANECDOTE

Personal Impact of Outage

Robert Ross's friends and FireHydrant employees had to cancel trips due to the outage.
Delta Air Lines canceled hundreds of flights daily for five days following the incident.
