Machine Learning Street Talk (MLST) cover image

Neel Nanda - Mechanistic Interpretability

Machine Learning Street Talk (MLST)

00:00

The Benefits of the Transformer's Inductive Prior for Recognition of Patterns

Using separate parameters for every capital in a map would be inefficient. The same goes for an MLP image classifier, which needs to learn each position separately. The transformer, on the other hand, has an equivariance in pattern recognition thanks to its inductive prior, allowing it to use the same parameters for each position in the input sequence.

Play episode from 01:30:32
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app