Software Engineering Radio - the podcast for professional software developers cover image

Episode 549: William Falcon Optimizing Deep Learning Models

Software Engineering Radio - the podcast for professional software developers

00:00

Hard Distributed Bugs: NANs and Precision

William recounts debugging NAN gradients in distributed runs, diagnosing precision and exploding weights across GPUs.

Play episode from 45:29
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app