Machine Learning Street Talk (MLST) cover image

Neel Nanda - Mechanistic Interpretability

Machine Learning Street Talk (MLST)

CHAPTER

Understanding Grokking in ML

This chapter examines the specific phenomenon of 'grokking' in machine learning, detailing how models shift from memorization to generalization during training. Through a research project on a one-layer transformer model, the speakers illustrate the distinct phases of learning and their implications for improving model performance.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner