Machine Learning Street Talk (MLST)

Ashley Edwards - Genie Paper (DeepMind/Runway)

Sep 13, 2024
Ashley Edwards of Runway, a co-author of the Genie paper, discusses recent advances in AI video generation. She explains how Genie uses latent action models to create interactive environments, and the challenge of keeping actions consistent. The conversation also covers the potential impact of AI on content creation jobs and robotics, evaluation metrics for AI-generated content, and the implications of interactive AI for the workforce.
INSIGHT

Interactive Environments

  • Genie learns a generative interactive environment from videos.
  • A single image can be interacted with like a real environment.
INSIGHT

Genie's Video Processing

  • Genie uses a VQ model to discretize video frames into tokens.
  • It learns a compressed representation of changes for next-frame prediction.
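The discretization step above can be illustrated with a toy nearest-neighbour vector quantizer. This is a minimal sketch, not Genie's tokenizer: the codebook values, the 2-D patch vectors, and the function names quantize and dequantize are all assumptions made for illustration. In a trained VQ model the codebook is learned and the vectors come from an encoder network.

```python
def quantize(vectors, codebook):
    """Map each vector to the index of its nearest codebook entry (squared L2)."""
    tokens = []
    for v in vectors:
        distances = [
            sum((a - b) ** 2 for a, b in zip(v, code)) for code in codebook
        ]
        tokens.append(distances.index(min(distances)))
    return tokens


def dequantize(tokens, codebook):
    """Look up the codebook vector for each token index."""
    return [codebook[t] for t in tokens]


# Hypothetical 4-entry codebook; a real one is learned during training.
codebook = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]

# Hypothetical encoder outputs for three patches of one frame.
patches = [(0.1, -0.2), (0.9, 0.8), (0.2, 0.7)]

tokens = quantize(patches, codebook)
print(tokens)  # [0, 3, 2]
print(dequantize(tokens, codebook))
```

Each frame becomes a short sequence of integers, so a next-frame predictor can work over discrete tokens instead of raw pixels.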
INSIGHT

Action Consistency

  • Genie maintains consistent actions across frames, which is helpful for prediction.
  • This consistency surprised the team and contributes to realistic depth simulation.