Bret is joined by Philip Andrews and Dan Muret of Cast AI to discuss pod live migration between nodes in a Kubernetes cluster.
My next course is coming soon! I've opened the waitlist for those wanting to go deep in GitHub Actions for DevOps and AI automation in 2025. I'm so thrilled to announce this course. The waitlist allows you to quickly sign up for some content updates, discounts, and more as I finish building the course. https://learn.bretfisher.com/waitlist
Cast AI dynamically moves your pod to a different node without downtime or data loss. It copies your running pod's data, memory, IP address, and TCP connections from one node to another in real time.
In this episode, we nerd out over how Cast AI works under the hood and the use cases for it, including hardware and OS maintenance on a node. I've got a feeling Cast AI has a winning feature on their hands.
Show Links
Cast AI website
Cast AI YouTube Channel
Check out the video podcast version here: https://youtu.be/yINNWxRywv4
Creators & Guests
You can also support my free material by subscribing to my YouTube channel and my weekly newsletter at bret.news!
Grab the best coupons for my Docker and Kubernetes courses.
Join my cloud native DevOps community on Discord.
Grab some merch at Bret's Loot Box
Homepage bretfisher.com
- (00:00) - Introduction
- (02:21) - Cast AI Elevator Pitch
- (06:57) - Stateful Workloads
- (10:03) - Bin Packing in Live Migration
- (13:35) - Stateful vs Stateless
- (15:43) - Networking and Storage Considerations
- (23:03) - Future Developments and Use Cases
- (25:43) - ML Workloads
- (28:25) - Live Migration of Spot Instances
- (31:01) - Live Migration Process Explained
- (39:02) - Challenges and Engineering Behind Live Migration
- (43:56) - Getting Started with Cast AI