
107 - Multi-Modal Transformers, with Hao Tan and Mohit Bansal
NLP Highlights
00:00
The Differences Between LXMERT and Other Multimodal Transformer Papers
VisualBERT is a single-stream model. It does not have a separate language encoder and vision encoder. And VisualBERT only uses the MS COCO dataset as its pre-training dataset, so it is a smaller dataset. ... even for vision-and-language tasks, having better benchmarks for out-of-domain generalization testing ... But can you tell us what you think of the similarities and the differences between your model and those other models?