This is Fine! A podcast about resilience engineering and software

Episode 3 - lions, tigers and metrics, oh my!

4 snips
Dec 4, 2024
Vanessa Huerta Granda, a technology manager passionate about resilience engineering, shares her insights on navigating metrics in incident management. She discusses the challenges of code freezes and the importance of adaptable metrics. Vanessa emphasizes the significance of context when analyzing Mean Time to Recovery (MTTR) and how it can lead to meaningful insights. The conversation also highlights the necessity for better communication between tech teams and executives to ensure effective decision-making based on accurate data.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
ADVICE

Build the Dashboard

  • If your CEO wants a resilience dashboard, create one.
  • Don't argue; provide metrics or someone else will, potentially less accurately.
ADVICE

Dynamic Dashboards

  • Make dashboards dynamic and filterable by relevant criteria (e.g., product line, team).
  • This allows for nuanced analysis and avoids overly simplistic interpretations.
ADVICE

User-Centric Metrics

  • Focus on user impact when designing dashboards.
  • Frame incidents in terms of how they affect the customer journey and business outcomes, not just technical details.
Get the Snipd Podcast app to discover more snips from this episode
Get the app