The Stack Overflow Podcast

Your runbooks are obsolete in the age of agents

Oct 24, 2025
Spiros Xanthos, CEO of Resolve AI, dives into the transformative role of AI agents in incident management. He discusses how traditional runbooks are becoming outdated and how AI agents can autonomously troubleshoot complex systems, potentially saving up to 70% of engineers' time. Spiros elaborates on agents acting as on-call responders, gathering evidence across various tools, and assisting with the increasing complexity of software systems. He also touches on the evolving role of developers in an AI-driven landscape and offers insights into the future of engineering.
INSIGHT

Observability Alone Doesn't Solve Toil

  • Modern observability centralizes data, but human toil still dominates the work of running systems at scale.
  • Complexity grows superlinearly, so no single person understands an entire production system.
ANECDOTE

Founding Resolve To Tackle Operational Work

  • Spiros left Splunk to build agents that handle the 70% of engineering work that goes into running systems rather than new development.
  • Resolve AI focuses on agents that troubleshoot alerts, manage incidents, and operate production systems.
INSIGHT

Incident Troubleshooting Is The Hardest Problem

  • Incident troubleshooting is high-pressure, low-latency, and often the most painful human task in production.
  • Agents can target this hardest subset to deliver immediate operational value.