
107 - Multi-Modal Transformers, with Hao Tan and Mohit Bansal
NLP Highlights
Using Image Captioning and VQA Datasets for Pre-Training
Image captions are essentially descriptions of the image. A caption can sometimes be less detailed than a visual question answering (VQA) question, and vice versa. A future experiment might test what happens if we convert the VQA questions plus answers into something more like statements.