

Building Scalable ML Systems on Kubernetes
5 snips Aug 15, 2024
Tammer Saleh, founder of SuperOrbital and an expert in scalable machine learning systems, discusses the advantages and challenges of using Kubernetes for ML workloads. He highlights the importance of model tracking and versioning within containerized environments. The conversation touches on the necessity of a unified API for collaboration across teams and the evolving imperfections of Kubernetes in stateful ML contexts. Tammer also shares insights on future innovations and best practices for teams navigating the complexities of machine learning on Kubernetes.
AI Snips
Chapters
Transcript
Episode notes
From Server Rooms to Heroku
- Tammer Saleh's cloud journey began with physical servers and evolved through Engine Yard and Heroku.
- He recounts the initial skepticism towards Heroku's cloud IDE, which later influenced his perspective on cloud platforms.
The 12-Factor Model's Impact
- Heroku's 12-factor model, while initially mocked, was a pivotal moment in cloud platform thinking.
- It emphasized treating applications as good Unix processes, a principle that influenced Saleh's approach to Kubernetes.
Kubernetes: More Than Just Containers
- Kubernetes runs containerized workloads at scale in production, offering more than just container orchestration.
- Its key strength lies in embodying Google's SRE best practices and providing a powerful, unified API for automation.