The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Visual Generative AI Ecosystem Challenges with Richard Zhang - #656


Nov 20, 2023

In this episode, Richard Zhang, a Senior Research Scientist at Adobe Research specializing in visual generative AI, examines open challenges in the generative AI ecosystem. He covers the design of perceptual metrics, including how LPIPS (Learned Perceptual Image Patch Similarity) helps align machine evaluations with human judgments. Zhang also addresses the need for detection tools to combat fake visuals and the complexities of data attribution in generative art. Throughout, he highlights the balance between creator autonomy and consumer trust in this rapidly evolving field.

ANECDOTE

Colorization Challenges

Richard Zhang's early work on image colorization revealed how hard it is to design a good loss function.
Early colorization attempts yielded dull, blurry results because the loss functions used were not aligned with human perception.

INSIGHT

Perceptual Metrics

Defining a mathematical function that captures the nuances of human visual perception is difficult.
A simple L2 loss function, comparing images pixel by pixel, doesn't accurately reflect human perception.
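
To make the point about pixel-wise L2 concrete, here is a minimal sketch (not from the episode) using NumPy. The 8x8 gradient image and both distortions are hypothetical, chosen so that a faint uniform brightness shift and a strong localized blotch produce the same mean squared error, even though a viewer would likely judge them quite differently.

```python
import numpy as np

def l2_distance(a, b):
    """Mean squared error between two images, computed pixel by pixel."""
    return float(np.mean((a - b) ** 2))

# Hypothetical 8x8 grayscale image with a smooth horizontal gradient.
reference = np.tile(np.linspace(0.2, 0.7, 8), (8, 1))

# Distortion A: a faint, uniform brightness shift across the whole image.
brightened = reference + 0.05

# Distortion B: a strong, localized blotch on a single 2x2 patch.
blotched = reference.copy()
blotched[3:5, 3:5] += 0.2

mse_shift = l2_distance(reference, brightened)
mse_blotch = l2_distance(reference, blotched)

print(f"uniform shift MSE: {mse_shift:.6f}")
print(f"local blotch MSE:  {mse_blotch:.6f}")
```

Both distortions receive essentially the same L2 score because the metric only sums squared per-pixel differences and ignores where, or in what pattern, the error occurs.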

INSIGHT

Data-Driven Perceptual Metric

Richard Zhang developed LPIPS, a perceptual metric, using a data-driven approach.
This involved collecting human judgments on distorted image patches to train a model aligned with human perception.
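
The summary above does not spell out how the human judgments were collected or used. As an illustrative sketch only, assume a two-alternative forced-choice setup: raters see a reference patch and two distorted patches, and vote for the distorted patch that looks closer to the reference. The function name `two_afc_agreement` and the sample numbers below are hypothetical; the code shows one way a candidate metric could be scored against such votes.

```python
import numpy as np

def two_afc_agreement(d0, d1, human_pref_1):
    """Average agreement between a metric and human votes on patch triplets.

    d0, d1: metric distances from the reference to distorted patches 0 and 1.
    human_pref_1: fraction of raters who judged patch 1 closer to the reference.
    """
    d0 = np.asarray(d0, dtype=float)
    d1 = np.asarray(d1, dtype=float)
    h = np.asarray(human_pref_1, dtype=float)

    # The metric "votes" for whichever patch has the smaller distance.
    metric_picks_1 = d1 < d0
    metric_picks_0 = d0 < d1
    ties = d0 == d1

    # Credit equals the share of raters who agree with the metric's vote;
    # a tie earns half credit.
    score = metric_picks_1 * h + metric_picks_0 * (1.0 - h) + ties * 0.5
    return float(score.mean())

# Hypothetical distances and votes for three triplets.
d0 = [0.10, 0.40, 0.30]
d1 = [0.20, 0.10, 0.30]
human = [0.0, 1.0, 0.5]

agreement = two_afc_agreement(d0, d1, human)
print(f"agreement with human votes: {agreement:.3f}")
```

Under this assumption, a metric that tracks human perception more closely scores nearer to 1, which gives a data-driven target for comparing or training candidate metrics.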
