NLP Highlights cover image

107 - Multi-Modal Transformers, with Hao Tan and Mohit Bansal

NLP Highlights

00:00

The Differences Between Lexbert and Other Multim Transforme Papers

The visher bird is a single stream to motives. It does not have separate the languagein coder and the visein coder. And so for the wiser bird, it only uses the amas cog data sets as is petin vena set. So is a smaller data set. Take us even for a vision and language dos having better bench marks were out of domain generalization testing,. But can you tell us what you think of the similatis in the differences between your model and those other models?

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app