Ethan Caballero–Scale is All You Need

The Inside View

How to Predict Correlations Between Frames in Time?

The model takes a video, like a lot of videos, and views, as input, frames that were close together in time. It tries to maximize the mutual information between them via, like, maximizing cosine similarity between the latents. So it tries to kind of predict correlations between frames in some kind of latent space from a ResNet encoder, and it does that in the latent space. And it can, like, learn RL really quickly after we just did all this unsupervised, contrastive pretraining, whatever.
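The idea described above, pulling the latents of temporally close frames together by cosine similarity, can be sketched roughly as follows. This is a minimal illustration and not the speaker's actual implementation: the InfoNCE-style loss, the `temperature` value, and the function names `l2_normalize` and `info_nce_loss` are assumptions, and random vectors stand in for the encoder's latents.

```python
import numpy as np

def l2_normalize(x, eps=1e-8):
    # Scale each row to unit length so dot products become cosine similarities.
    return x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)

def info_nce_loss(anchor_latents, positive_latents, temperature=0.1):
    # Hypothetical helper: row i of each array is assumed to hold the latents
    # of two frames taken close together in time from the same video.
    a = l2_normalize(anchor_latents)
    p = l2_normalize(positive_latents)
    logits = a @ p.T / temperature                       # (N, N) cosine similarities
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # Matching pairs sit on the diagonal; other rows act as negatives.
    return float(-np.mean(np.diag(log_probs)))

rng = np.random.default_rng(0)
frames_t = rng.normal(size=(8, 32))                          # stand-in latents at time t
frames_t_plus_k = frames_t + 0.05 * rng.normal(size=(8, 32)) # nearby frames, similar latents
unrelated = rng.normal(size=(8, 32))                         # latents with no temporal link

aligned_loss = info_nce_loss(frames_t, frames_t_plus_k)
random_loss = info_nce_loss(frames_t, unrelated)
print(aligned_loss < random_loss)
```

Minimizing a loss of this kind raises the cosine similarity of matching frame pairs relative to mismatched ones, which is one common way to lower-bound the mutual information between them. In a real setup the latents would come from a trained encoder rather than random arrays.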
