AI Breakdown

arXiv preprint - Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Apr 15, 2024