Open Source and Fast Decision Making // Rob Hirschfeld // #164

MLOps.community

The Importance of Reliability in Automation

If you're nervous about running an automation script or pushing a button to reset a server and burn it down to zero, then address your reliability problem first. What we've seen is, one, that artifact patching, which is actually a full system reset, is actually a faster reset. The other thing I would say in this is turn rates are really valuable. If you're not able to roll all of your infrastructure on a 30-day basis or faster, then you're not keeping up with patches. You don't have the resilience in your operations to actually deal with an emergency patch or change or something like that.

Play episode from 37:14
