AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Importance of Data Efficiency in Training Visually
We showed that it can retrieve relevant videos just as a language model, even though it has never seen videos before. But more importantly, we took NLP only tasks, right? Traditionally, which are not multimodal tasks, just text only tasks. And we showed that you can get better results because this language model has distilled knowledge from videos. That's where we showed the bigger gains.