
Episode 549: William Falcon Optimizing Deep Learning Models
Software Engineering Radio - the podcast for professional software developers
00:00
Hard Distributed Bugs: NANs and Precision
William recounts debugging NAN gradients in distributed runs, diagnosing precision and exploding weights across GPUs.
Play episode from 45:29
Transcript


