Observability experts Jasper Paul and Vinoth Kanagaraj from Site24x7 discuss achieving visibility for Kubernetes apps, OpenTelemetry, AI in analysis, useful metrics, multi-cluster monitoring, and the evolution from monitoring to observability platforms.
Duration: 40:22
INSIGHT
Evolution From Monitoring to Observability
Observability integrates metrics, traces, and logs into a single correlated data source to find root causes and reduce incident resolution time.
Monitoring was mostly about availability, but observability evolved to handle complex multi-layered IT stacks including Kubernetes.
INSIGHT
Kubernetes Adds Observability Complexity
Kubernetes introduced an extra layer of complexity between infrastructure and applications: pods and containers, many of them short-lived.
This complicates observability as infrastructure changes frequently, requiring new monitoring paradigms.
ADVICE
Key Kubernetes Metrics to Monitor
Monitor vital Kubernetes metrics like pod status (pending, running, failed) and pod/container restart counts to track deployment health.
Collect CPU and memory metrics at both the cluster level and the container level, because workloads differ in resource usage and lifespan.
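As a rough illustration of the pod status and restart-count advice, here is a minimal Python sketch. It assumes pod objects shaped like the items returned by `kubectl get pods -o json`, reading `status.phase` and `status.containerStatuses[].restartCount`. The function name `summarize_pods`, the `restart_threshold` default, and the sample pods are hypothetical and were not discussed in the episode.

```python
from collections import Counter


def summarize_pods(pods, restart_threshold=3):
    """Count pods per phase and flag pods whose containers restart often.

    `pods` is a list of dicts shaped like Kubernetes Pod objects.
    `restart_threshold` is an illustrative cutoff, not a recommended value.
    """
    phase_counts = Counter()
    restarts = {}
    for pod in pods:
        name = pod.get("metadata", {}).get("name", "<unnamed>")
        status = pod.get("status", {})
        phase_counts[status.get("phase", "Unknown")] += 1
        # containerStatuses can be missing while a pod is still Pending.
        container_statuses = status.get("containerStatuses") or []
        restarts[name] = sum(cs.get("restartCount", 0) for cs in container_statuses)
    noisy = sorted(n for n, count in restarts.items() if count >= restart_threshold)
    return {"phases": dict(phase_counts), "restarts": restarts, "noisy": noisy}


# Hypothetical sample data, for illustration only.
SAMPLE_PODS = [
    {"metadata": {"name": "web-1"},
     "status": {"phase": "Running", "containerStatuses": [{"restartCount": 0}]}},
    {"metadata": {"name": "web-2"},
     "status": {"phase": "Running",
                "containerStatuses": [{"restartCount": 4}, {"restartCount": 1}]}},
    {"metadata": {"name": "job-1"},
     "status": {"phase": "Failed", "containerStatuses": [{"restartCount": 0}]}},
    {"metadata": {"name": "api-1"}, "status": {"phase": "Pending"}},
]

summary = summarize_pods(SAMPLE_PODS)
print(summary["phases"])
print(summary["noisy"])
```

In practice the pod list would come from the cluster API or a monitoring agent rather than a hard-coded list, and the threshold would be tuned per workload.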
Bret is joined by Jasper Paul and Vinoth Kanagaraj, observability experts and Site24x7 Product Managers, to discuss achieving end-to-end visibility for applications on Kubernetes infrastructure. We answer questions on all things monitoring, OpenTelemetry, and KPIs for DevOps and SREs.
🙌 My next course is coming soon! I've opened the waitlist for those wanting to go deep in GitHub Actions for DevOps and AI automation in 2025. I'm so thrilled to announce this course. The waitlist allows you to quickly sign up for some content updates, discounts, and more as I finish building the course. https://courses.bretfisher.com/waitlist 🍾
We talk about the industry's evolution from monitoring to full observability platforms, as well as adjacent topics for helping you with your own Kubernetes and application monitoring, including going through some of the most useful metrics in Kubernetes and AI's role in metric analysis and alerting humans.
Be sure to check out the live recording of the complete show from April 25, 2024 on YouTube (Ep. 263). Includes demos.