
Episode 548: Alex Hidalgo on Implementing Service Level Objectives
Software Engineering Radio - the podcast for professional software developers
How to Minimize Downtime to Less Than 43 Minutes a Month
If you have a 20-minute response time, which I think is for many services actually pretty reasonable, then you can't hit 99.9%. You burn half your budget just on the allowed response time. Then you've got to have at least two, if not three different engineers located all over the world. Once you also then add the humans into that, what does that mean for the humans that need to go fix things? It can get absurd very quickly. And one of my big things is that I really try to convince people you don't have to be as reliable as you think you do.
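The arithmetic behind the "43 minutes a month" figure and the "half your budget" remark can be sketched in a few lines of Python. This is an illustrative calculation, not something from the episode: the 30-day window and the function names `error_budget_minutes` and `budget_fraction_used` are assumptions, and the 20-minute response time is treated as pure downtime charged against the budget.

```python
MINUTES_PER_DAY = 24 * 60


def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Allowed downtime, in minutes, for an availability target over a window.

    slo_target is a fraction, e.g. 0.999 for "three nines".
    The 30-day default window is an assumption made for illustration.
    """
    total_minutes = window_days * MINUTES_PER_DAY
    return total_minutes * (1.0 - slo_target)


def budget_fraction_used(response_minutes: float, slo_target: float,
                         window_days: int = 30) -> float:
    """Share of the error budget consumed by one incident's response time."""
    return response_minutes / error_budget_minutes(slo_target, window_days)


if __name__ == "__main__":
    budget = error_budget_minutes(0.999)
    used = budget_fraction_used(20, 0.999)
    print(f"99.9% over 30 days allows {budget:.1f} minutes of downtime")
    print(f"One 20-minute response uses {used:.0%} of that budget")
```

Under these assumptions, a 99.9% target leaves about 43.2 minutes per 30-day window, and a single 20-minute response consumes about 46% of it before anyone has started fixing the problem.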