Get the app
Quentin Anthony
PhD student at Ohio State University and head engineer at EleutherAI, focusing on high-performance deep learning and distributed systems for large language model training.
Best podcasts with Quentin Anthony
Ranked by the Snipd community
29 snips
Aug 16, 2023
• 51min
The Mathematics of Training LLMs — with Quentin Anthony of Eleuther AI
chevron_right
Quentin Anthony, a PhD student at Ohio State University and head engineer at EleutherAI, dives into the intricacies of training large language models. He discusses the importance of community knowledge and practical strategies for GPU optimization. Quentin unpacks the mathematics behind compute requirements and addresses the challenges of floating-point operations. He also explores autoregressive modeling techniques, contrasts traditional methods, and examines the complexities of optimizing training processes, including the Atom optimizer and model distribution.
The AI-powered Podcast Player
Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
Get the app