

Learning from incidents (Interview)
Feb 4, 2022
This week, Nora Jones, Founder and CEO of Jeli, shares her insights from chaos engineering at Netflix and incident analysis at Slack. She emphasizes the importance of learning from incidents to improve team resilience. Nora discusses creating developer-centric tools and the emotional complexities of incident reviews. She explores knowledge silos, the balance between quantitative and qualitative insights, and the evolving role of incident analysts. Additionally, she reflects on how studying real-world incidents can enhance software practices and decision-making.
AI Snips
Chapters
Books
Transcript
Episode notes
From Chaos Engineering to Incident Analysis
- Nora Jones's work at Netflix in chaos engineering revealed the potential of incident analysis.
- This led to her role at Slack and eventually founding Jeli to help teams learn from incidents.
Formalizing Incident Management
- Formal incident management helps create a natural list of events for reflection, unlike ad-hoc approaches.
- This formalized approach enables more structured learning and improvement.
The Power of Incident Reviews
- Writing incident reports and conducting thorough reviews reveal more than just public postmortems.
- This in-depth analysis offers a greater return on investment from these often expensive events.