Reliability Enablers

#22 - How Google does SRE Consulting (with Yury Niño Roa)

Jan 9, 2024
Yury Niño Roa, a cloud infrastructure engineer at Google’s Professional Services Organization, shares her insights on Site Reliability Engineering (SRE) consulting. She reveals how Google collaborates with clients to enhance their SRE practices and discusses common antipatterns she encounters. The conversation dives into the complexities of SRE, emphasizing the need for cultural shifts and mindset changes for successful implementation. Yury also highlights the importance of chaos engineering in disaster recovery, showcasing Google's innovative strategies for resilience.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ADVICE

Faithful Representation with Flexibility

  • Represent Google's SRE approach faithfully when consulting.
  • Remember, Google's approach is just one path to SRE success.
INSIGHT

Google's Four SRE Engagements

  • Google's PSO offers four SRE engagements: core SRE, design/operations review, SRE teams/policies, and SRE scaling.
  • Yury Niño Roa enjoys the people and documentation aspect of SRE.
ANECDOTE

Sketchnoting SRE Concepts

  • Yury Niño Roa uses sketchnotes to explain SRE, DevOps, and platform engineering differences.
  • She highlights SRE as culture/practices, DevOps as a Google implementation predating the broader culture, and platform engineering as addressing DevOps burnout.
Get the Snipd Podcast app to discover more snips from this episode
Get the app