Machine Learning Street Talk (MLST)

Ashley Edwards - Genie Paper (DeepMind/Runway)

Sep 13, 2024
Ashley Edwards of Runway, a co-author of the Genie paper, discusses recent advances in AI video generation. She explains how Genie uses a latent action model to create interactive environments, and why keeping actions consistent is difficult. The conversation also covers the potential effects of AI on content creation jobs and on robotics, evaluation metrics for AI-generated content, and what interactive AI could mean for the workforce.
25:04

Podcast summary created with Snipd AI

Quick takeaways

  • Genie uses a latent action model to make generated video interactive while keeping the effect of each action consistent across frames.
  • Genie has potential applications in robotics and gaming, where behaviors learned from video could be adapted to real-world scenarios without traditional reward functions.

Deep dives

Generative Interactive Environment Learning

This research centers on a generative interactive environment that is learned solely from video data and lets users interact with generated scenes as if they were real. A vector quantization (VQ) model discretizes video frames into tokens, and the model learns to predict the tokens that capture each environment and its dynamics. Training emphasizes a consistent action space, which helps the model predict subsequent frames and adapt its output to the distance and speed of moving objects. Users can then steer the generated sequence themselves, combining creativity with interactivity.
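
To make the vector quantization step concrete, here is a minimal sketch of how continuous feature vectors can be mapped to discrete tokens by nearest-codebook lookup. It is an illustration only, not Genie's tokenizer: the function name quantize, the tiny hand-written codebook, and the 2-dimensional "patch" features are all assumptions made for this example, and a real model would learn the codebook and the features during training.

```python
import numpy as np

def quantize(features, codebook):
    """Map each feature vector to the index of its nearest codebook entry.

    features: array of shape (N, D), one row per patch (hypothetical input)
    codebook: array of shape (K, D), one row per discrete code
    Returns (tokens, quantized): token ids of shape (N,) and the
    corresponding codebook rows of shape (N, D).
    """
    # Squared Euclidean distance from every feature to every code: (N, K)
    diffs = features[:, None, :] - codebook[None, :, :]
    dists = np.sum(diffs ** 2, axis=-1)
    # The token for each feature is the index of the closest code
    tokens = np.argmin(dists, axis=1)
    return tokens, codebook[tokens]

# Toy codebook with 4 codes in 2 dimensions (assumed values for illustration)
codebook = np.array([[0.0, 0.0],
                     [0.0, 1.0],
                     [1.0, 0.0],
                     [1.0, 1.0]])

# Three made-up patch features standing in for encoded video-frame patches
patches = np.array([[0.1, 0.9],
                    [0.9, 0.2],
                    [0.05, 0.1]])

tokens, quantized = quantize(patches, codebook)
print(tokens)  # [1 2 0]
```

Each frame then becomes a sequence of integer tokens, which is the kind of discrete representation a token-prediction model can be trained on.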
