Is There a Sparse Architecture in Computer Vision?

Florida's language models, which generalize at least somewhat in terms of representation learning seem like they generalize quite well. So I think you're saying we might not get a better resnet out of this by training it on ImageNet, for example. Because what about like data? But what about like Florida's language models and vision transformers? Yeah. There could be a just based on the complexity of the data set versus the passive of the model.

Play episode from 25:34

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app