Latent Space: The AI Engineer Podcast cover image

The Mathematics of Training LLMs — with Quentin Anthony of Eleuther AI

Latent Space: The AI Engineer Podcast

00:00

Navigating Large Language Model Training

This chapter explores the complexities involved in training large language models, focusing on balancing data, computational resources, and the importance of community-driven knowledge. It also discusses practical strategies for optimizing GPU utilization while addressing challenges posed by various training frameworks and hardware failures.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app