The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Visual Generative AI Ecosystem Challenges with Richard Zhang - #656

Nov 20, 2023
In this discussion, Richard Zhang, a Senior Research Scientist at Adobe Research specializing in visual generative AI, tackles significant challenges in the AI ecosystem. He dives into the creation of effective perceptual metrics for AI, emphasizing the role of LPIPS in aligning human and machine evaluations. Zhang also addresses the pressing need for detection tools to combat fake visuals and the complexities of data attribution in generative art. His insights emphasize the delicate balance between creator autonomy and consumer trust in this rapidly evolving field.
40:40

Episode guests

Podcast summary created with Snipd AI

Quick takeaways

  • Richard Zhang's work on perceptual metrics like LPIPS improves human-computer alignment in generative AI.
  • Addressing the challenges of detecting fake visual content is crucial for maintaining trust in generative AI.

Deep dives

Training image-based generative AI models

Richard Zhang discusses his work on image-based generative AI, specifically focusing on the use of deep networks for image generation. He highlights the challenges faced in predicting high-dimensional signals and the limitations of existing loss functions. Zhang shares his early experiences in generative tasks, such as image colorization, and the shift towards adding controllability to generative AI systems.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner