Machine Learning Street Talk (MLST) cover image

#044 - Data-efficient Image Transformers (Hugo Touvron)

Machine Learning Street Talk (MLST)

00:00

Transformers and Inductive Biases

This chapter explores the role of inductive biases in transformer models versus convolutional neural networks (CNNs), focusing on the balance between bias and flexibility in learning. It discusses knowledge distillation methods, particularly the use of a novel distillation token to enhance performance by aggregating information from CNNs. The speakers also investigate how data augmentation techniques impact the training dynamics and representation learning between teacher and student models.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app