
Building the AI Hyperscaler with Kubernetes
Kubernetes Bytes
Storage Options for AI Labs and Kubernetes Challenges
The chapter delves into the storage options favored by companies for AI labs, such as object storage and shared file systems, emphasizing the importance of local checkpointed files during distributed training. It also discusses challenges faced when working at scale with Kubernetes, stressing the need to understand CNI configuration and anticipate bottlenecks.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.