Machine Learning Street Talk (MLST)

DeepMind Genie 3 [World Exclusive] (Jack Parker-Holder, Shlomi Fruchter)

Aug 5, 2025
Shlomi Fruchter, a Research Director at Google DeepMind, and Jack Parker-Holder, a research scientist on the open-endedness team, unveil Genie 3, a world model that creates immersive, interactive 3D worlds from text prompts. The model can generate an environment in seconds and keeps it consistent as the user interacts with it. They discuss the evolution from Genie 2 to Genie 3, emphasizing improvements in memory and human interaction. The conversation also covers potential applications in game design and robotics, and points to a future in which AI can simulate complex environments with ease.
INSIGHT

Emergent Consistency in Genie World Model

  • Genie is a world model that simulates environment dynamics and interaction without explicit 3D modeling.
  • It achieves surprising consistency and object permanence as an emergent property of training on video.
ANECDOTE

From Photo to Interactive World

  • DeepMind demoed a system that turns a photo taken in California into an interactive, AI-generated, game-like world.
  • The AI generates every pixel in real time as the user moves, creating a seamless, immersive experience.
INSIGHT

Genie 3's Leap to Real-time Realism

  • Genie 3 can generate photorealistic, interactive 720p environments in real time, with sessions lasting several minutes.
  • It blends elements from video models and world models to create a flexible, prompt-driven simulation experience.