
Google SRE Prodcast
SRE Prodcast brings Google's experience with Site Reliability Engineering together with special guests and exciting topics to discuss the present and future of reliable production engineering!
Latest episodes

Sep 12, 2023 • 27min
Life of An SRE Episode 1: Tom Cranitch and Megan Yin
How does one become an SRE? And what’s the career like? In this episode, Tom and Megan discuss their path to SRE.

Jun 7, 2022 • 11min
Creating the SRE Prodcast with John Reese (JTR)
Host MP English and former Google SRE John Reese (JTR) chat about the creation of the Prodcast. Visit https://sre.google/prodcast for transcripts and links to further reading.

May 31, 2022 • 29min
Postmortems with Ayelet Sachto
Ayelet Sachto offers advice on creating an actionable, transparent, and blameless postmortem culture.

May 24, 2022 • 40min
Incident Management with Adrienne Walcer
Adrienne Walcer discusses how to approach and organize incident management efforts throughout the production lifecycle.

May 17, 2022 • 44min
On-Call Rotations with Andrew Widdowson (APW)
Andrew Widdowson (APW) shares strategies for successful on-call rotations.

May 10, 2022 • 1h
Automation with Pierre Palatin
Pierre Palatin dives into different automation strategies, how to build confidence in your system, and why designing the UI may be your biggest challenge.

May 3, 2022 • 40min
Client-Transparent Migrations with Pavan Adharapurapu
Pavan Adharapurapu discusses client-transparent migrations, the challenges they pose, and ways to address them. He highlights the importance of user experience, careful planning, and keeping the migration transparent to clients throughout. The episode also explores production traffic replay systems and the process of moving traffic during a migration.

Apr 26, 2022 • 25min
Rethinking SLOs with Narayan Desai
Narayan Desai explains why SLOs can be problematic and proposes alternative methods for monitoring complex, large-scale systems.

Apr 19, 2022 • 27min
Alerting with Amelia Harrison
Amelia Harrison advises on when and how to alert, ideal coverage, and tuning.

Apr 12, 2022 • 31min
Customer-Centric Monitoring with Silvia Esparrachiari
Silvia Esparrachiari talks about the challenges of monitoring and the importance of understanding your users.