AI Engineering Podcast

Building Scalable ML Systems on Kubernetes

5 snips
Aug 15, 2024
Tammer Saleh, founder of SuperOrbital and an expert in scalable machine learning systems, discusses the advantages and challenges of using Kubernetes for ML workloads. He highlights the importance of model tracking and versioning within containerized environments. The conversation touches on the necessity of a unified API for collaboration across teams and the evolving imperfections of Kubernetes in stateful ML contexts. Tammer also shares insights on future innovations and best practices for teams navigating the complexities of machine learning on Kubernetes.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ANECDOTE

From Server Rooms to Heroku

  • Tammer Saleh's cloud journey began with physical servers and evolved through Engine Yard and Heroku.
  • He recounts the initial skepticism towards Heroku's cloud IDE, which later influenced his perspective on cloud platforms.
INSIGHT

The 12-Factor Model's Impact

  • Heroku's 12-factor model, while initially mocked, was a pivotal moment in cloud platform thinking.
  • It emphasized treating applications as good Unix processes, a principle that influenced Saleh's approach to Kubernetes.
INSIGHT

Kubernetes: More Than Just Containers

  • Kubernetes runs containerized workloads at scale in production, offering more than just container orchestration.
  • Its key strength lies in embodying Google's SRE best practices and providing a powerful, unified API for automation.
Get the Snipd Podcast app to discover more snips from this episode
Get the app