Google SRE Prodcast

The One with STPA, Jeffrey Snover, and Theo Klein

4 snips
Jul 2, 2025
In this engaging conversation, Theo Klein, a Site Reliability Engineer at Google with a passion for STPA, and Jeffrey Snover, a Distinguished Engineer at Google and former Microsoft veteran, dive into Systems Theoretic Process Analysis (STPA). They discuss how STPA shifts the focus from component failures to understanding system control failures. The duo emphasizes the importance of human involvement in system design, revealing how early STPA implementation can identify potential risks before coding begins, ultimately leading to safer and more robust systems.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
INSIGHT

STPA Reframes System Failures

  • STPA sees system failures as control issues, not just component failures.
  • Accidents happen when control within the system is lost, shifting the focus to system behavior.
INSIGHT

Control and Feedback in Systems

  • Control means authority to impose actions on controlled processes in a system.
  • STPA models feedback loops where misinterpretation causes continued system outages.
ANECDOTE

Road Closure System Flaw Found Early

  • STPA revealed three serious design flaws in a road closure system before coding began.
  • A flaw where failed retries caused missing road closures went unnoticed until STPA analysis.
Get the Snipd Podcast app to discover more snips from this episode
Get the app