Machine Learning Street Talk (MLST) cover image

Neel Nanda - Mechanistic Interpretability

Machine Learning Street Talk (MLST)

00:00

The Benefits of the Transformer's Inductive Prior for Recognition of Patterns

Using separate parameters for every capital in a map would be inefficient. The same goes for an MLP image classifier, which needs to learn each position separately. The transformer, on the other hand, has an equivariance in pattern recognition thanks to its inductive prior, allowing it to use the same parameters for each position in the input sequence.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner
Get the app