The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Robust Visual Reasoning with Adriana Kovashka - #463

Mar 11, 2021

Adriana Kovashka, an Assistant Professor at the University of Pittsburgh, dives into her research on visual commonsense and robust visual reasoning. She discusses the interplay between media studies and machine learning, using examples like public service announcements to highlight the complexity of interpretation. Adriana elaborates on the pitfalls in visual question answering datasets and the innovations in weakly supervised object detection. She also shares insights into practical AI applications, emphasizing the need for common sense in assistive technologies.

AI Snips

ANECDOTE

Complex Ad Example

Adriana Kovashka recalls a complex anti-secondhand smoke ad.
The ad shows a child who appears to be smoking, but the mismatched arm sizes reveal an unseen smoker.

INSIGHT

Visual Reasoning Definition

Visual reasoning involves seeing relationships between concepts, relying on unencoded knowledge, and attending selectively.
Defining "reasoning" is difficult because it goes beyond the data and involves priors and inductive biases.

ANECDOTE

VCR Dataset Example

Kovashka uses the Visual Commonsense Reasoning (VCR) dataset, which includes movie frames and questions.
An example question asks why one person is pointing at another; answering it requires reasoning about restaurant norms.
