
Better K8s Prometheus Alerts with Robusta
DevOps and Docker Talk: Cloud Native Interviews and Tooling
Intro
Learn how Robusta enhances Kubernetes alerting by improving Prometheus alert messages with additional context, graphs, and container logs. The chapter also covers deploying Robusta in Kubernetes clusters and discusses best practices for pod CPU reservation and limits.
My next course is coming soon! I've opened the waitlist for those wanting to go deep in GitHub Actions for DevOps and AI automation in 2025. I'm so thrilled to announce this course. The waitlist allows you to quickly sign up for some content updates, discounts, and more as I finish building the course.
https://courses.bretfisher.com/waitlist
Bret is joined by Natan Yellin, the co-founder of Robusta.dev, to talk Kubernetes and Prometheus monitoring, alerting, and maybe some CPU limit ranting.
Robusta tries to fill the gap left by Prometheus AlertManager, which has a very specific and not so helpful way of describing events in your cluster. This makes it hard to diagnose the cause of an event, and you're left with Google, StackOverflow, and an awful lot of head-scratching. Robusta acts as a proxy between AlertManager and your notification platform of choice, adding context to each alert before it reaches you.
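To make the proxy idea concrete, here is a minimal Python sketch of an alert-enriching step that sits between an AlertManager webhook and a notification sink. It is not Robusta's implementation. The payload shape (an "alerts" list whose entries carry "status", "labels", and "annotations") follows AlertManager's webhook format, but the function names, the fake_pod_logs enricher, and the send callback are all hypothetical and exist only for illustration.

```python
from typing import Callable, Dict, List

# Hypothetical enricher type: receives an alert's labels, returns extra context lines.
Enricher = Callable[[Dict[str, str]], List[str]]


def fake_pod_logs(labels: Dict[str, str]) -> List[str]:
    """Stand-in for fetching recent container logs (placeholder data, not a real API call)."""
    pod = labels.get("pod")
    if pod is None:
        return []
    return [f"last log line for {pod}: <logs would go here>"]


def enrich_alert(alert: Dict, enrichers: List[Enricher]) -> str:
    """Turn one alert from the webhook payload into a message with added context."""
    labels = alert.get("labels", {})
    annotations = alert.get("annotations", {})
    name = labels.get("alertname", "UnknownAlert")
    lines = [f"[{alert.get('status', 'unknown')}] {name}"]
    summary = annotations.get("summary")
    if summary:
        lines.append(summary)
    # Each enricher contributes zero or more context lines.
    for enricher in enrichers:
        lines.extend(enricher(labels))
    return "\n".join(lines)


def handle_webhook(payload: Dict, enrichers: List[Enricher],
                   send: Callable[[str], None]) -> int:
    """Enrich every alert in the payload and forward it; returns the number forwarded."""
    count = 0
    for alert in payload.get("alerts", []):
        send(enrich_alert(alert, enrichers))
        count += 1
    return count
```

In a real deployment the send callback would post to Slack, a phone push service, or another destination, and the enrichers would query the cluster for logs, graphs, or recent events instead of returning placeholder text.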
In the show we talk about what Robusta is, how to deploy it in your clusters, and Natan also details some of the enhancements in their cloud offering that you can layer on top of that, which has a generous free tier.
Streamed live on YouTube on January 5, 2023.
Unedited live recording of this show on YouTube (Ep. #197). Includes demos.
Topics
Robusta Website
Robusta on GitHub
KubeCon - Building a Runbook Automation System for Prometheus and Kubernetes
Stop using K8s CPU limits
Recommended Pod Spec
Send Push notifications to your phone
Prometheus AlertManager
Grafana Labs
Kubewatch
Natan Yellin
Natan on Twitter
Natan on LinkedIn
Join my Community
New live course on CI automation and gitops deployments
Best coupons for my Docker and Kubernetes courses
Chat with us and fellow students on our Discord Server DevOps Fans
Grab some merch at Bret's Loot Box
Homepage bretfisher.com
- (00:00) - DDT MAIN
- (00:04) - Intro
- (02:30) - In today's episode
- (04:36) - Main show
- (05:04) - Introducing Natan
- (05:30) - Alert fatigue
- (06:06) - Where did the idea for Robusta come from?
- (09:53) - Someone has to do the job
- (10:54) - What does Robusta offer?
- (12:02) - Proxying the alerts and providing context
- (13:07) - Saving 10 to 30 minutes
- (15:25) - The open source Robusta repo
- (15:47) - The need to de-aggregate event data
- (16:46) - Example or demo
- (17:16) - Question about observability for microservices
- (20:15) - Tip 1 Consider using silences
- (21:26) - Tip 2 Monitor outcomes
- (22:00) - Don't ignore alerts because of fatigue
- (24:50) - Sending to different channels based on priority
- (26:19) - Question about sending messages to destinations
- (27:54) - Question
- (28:26) - Installing Robusta
- (29:19) - Demo set up commands
- (29:31) - Questions
- (29:48) - Demo Kubernetes-specific
- (30:42) - Multi-cluster question
- (33:09) - What does the SaaS platform do?
- (34:21) - Demo with SaaS
- (35:14) - kubectl not recommended
- (36:40) - Breaking the glass
- (39:52) - Question about notifications
- (41:51) - Getting started
- (43:01) - CPU limiting
- (43:52) - Soft limits on CPU in Kubernetes
- (46:12) - Bret's pod spec
- (50:59) - Outro
You can also support my free material by subscribing to my YouTube channel and my weekly newsletter at bret.news!
Grab the best coupons for my Docker and Kubernetes courses.
Join my cloud native DevOps community on Discord.