Giant Conversations

GC - #33 Cloud Native Monitoring and Observability

Jun 12, 2025
Dominik Schmidle, a Product Manager at Giant Swarm, dives into the world of cloud-native monitoring and observability. He explains why these concepts are essential for DevOps, shifting focus from traditional monitoring to a more holistic approach. The conversation touches on Grafana 12's exciting new features, including improved alerting and enhanced drill-down capabilities. Dominik also discusses the significance of OpenTelemetry in shaping observability tools and emphasizes the challenges of developing a comprehensive cloud-native observability platform.
INSIGHT

Observability vs Monitoring

  • Observability uses metrics, logs, and traces to provide a comprehensive view of system health.
  • Monitoring focuses mainly on metrics to understand system health at specific points in time.
ADVICE

Get Metrics and Logs Right

  • Use Prometheus to scrape metrics like CPU usage from your app endpoints.
  • Use log streams to capture discrete events, and choose between metrics and logs based on the type of data you are recording.
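
As a rough illustration of the scraping advice above, here is a minimal sketch of an app endpoint that Prometheus could scrape. It uses only the Python standard library and writes the Prometheus text exposition format by hand. The metric name `demo_process_cpu_seconds_total`, the `/metrics` path, and port 8000 are assumptions chosen for this example and were not mentioned in the episode. A real service would more likely use an official Prometheus client library.

```python
import time
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical metric name, chosen only for this illustration.
METRIC_NAME = "demo_process_cpu_seconds_total"


def render_metrics(cpu_seconds: float) -> str:
    """Render one counter in the Prometheus text exposition format."""
    return (
        f"# HELP {METRIC_NAME} Total CPU time consumed by this process in seconds.\n"
        f"# TYPE {METRIC_NAME} counter\n"
        f"{METRIC_NAME} {cpu_seconds}\n"
    )


class MetricsHandler(BaseHTTPRequestHandler):
    """Serves the current metrics on GET /metrics and 404s everything else."""

    def do_GET(self):
        if self.path != "/metrics":
            self.send_error(404)
            return
        # time.process_time() returns the CPU time used by this process so far.
        body = render_metrics(time.process_time()).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; version=0.0.4; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve(port: int = 8000) -> None:
    """Block forever, answering scrape requests on the given port (assumed value)."""
    HTTPServer(("", port), MetricsHandler).serve_forever()
```

Calling `serve()` starts the endpoint. A Prometheus scrape job pointed at that host and port would then collect the counter at each scrape interval.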
INSIGHT

Interpreting Metrics Spikes

  • Spikes in metrics like CPU usage don't always need action; they can be expected and manageable.
  • Robust systems scale and self-heal to handle these typical spikes safely.
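
One way to picture the difference between a harmless spike and a condition worth acting on is to require that a threshold be exceeded for several consecutive samples before reacting. The sketch below is an illustration under that assumption, not a method described in the episode. The function name, the threshold, and the sample values are all hypothetical.

```python
from typing import Iterable


def sustained_breach(samples: Iterable[float], threshold: float, min_consecutive: int) -> bool:
    """Return True only if `min_consecutive` samples in a row exceed `threshold`."""
    run = 0  # length of the current streak of samples above the threshold
    for value in samples:
        run = run + 1 if value > threshold else 0
        if run >= min_consecutive:
            return True
    return False


# Hypothetical CPU usage percentages sampled at a fixed interval.
brief_spike = [20, 95, 30, 25]
sustained_load = [20, 95, 96, 97, 30]

spike_needs_action = sustained_breach(brief_spike, threshold=90, min_consecutive=3)
load_needs_action = sustained_breach(sustained_load, threshold=90, min_consecutive=3)
```

With these inputs the brief spike is ignored, while the sustained load is flagged. This is similar in spirit to attaching a hold duration to an alert condition so that short, expected bursts do not page anyone.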