Google AI: Release Notes

Google AI
30 snips
Jan 30, 2026 • 43min

Project Genie: Create and explore worlds

A tour of interactive, real-time world generation and the tech behind it. The team demos swapping creatures, remixing scenes, and turning photos into explorable spaces. The conversation covers compute limits, session design, style transfer, and how these worlds could train agents or reshape entertainment. The team also teases personalization, shared world histories, and where the tech might go next.
41 snips
Dec 18, 2025 • 22min

Gemini 3 and Gen UI in Google Search

Rhiannon Bell and Robby Stein, leads at Google Search, discuss the integration of Gemini 3 and Generative UI into Search. They explain how models can now create bespoke, interactive simulations in real time. Rhiannon covers the speed of Gemini 3 Flash, while Robby highlights the new Search persona designed for user engagement. They also explore how Nano Banana can produce data visualizations that make complex information more accessible.
67 snips
Nov 26, 2025 • 28min

Sundar Pichai: Gemini 3, Vibe Coding and Google's Full Stack Strategy

Sundar Pichai, CEO of Google and Alphabet, joins Logan Kilpatrick to discuss the launches of Gemini 3 and Nano Banana Pro. They explore Google's commitment to AI, including decade-long infrastructure investments and the full-stack advantage across products. Pichai shares his launch day rituals and describes future moonshots, including the idea of putting data centers in space. The conversation also covers the rise of vibe coding and how it opens software development to non-coders.
36 snips
Nov 26, 2025 • 36min

Nano Banana Pro: Hands-on with the World’s Most Powerful Image Model

Discover the capabilities of Nano Banana Pro, the cutting-edge model for text rendering and infographics. The team showcases its advanced visual reasoning and multi-image input features for targeted edits and real-world applications. Learn how user feedback fuels continuous model improvements and hear about impressive multilingual support. Tune in for a comparison with its predecessor, as well as practical use cases that enhance everyday productivity, from timezone reminders to infographics grounded in recent events.
67 snips
Nov 25, 2025 • 49min

Koray Kavukcuoglu: “This Is How We Are Going to Build AGI”

Koray Kavukcuoglu, CTO of Google DeepMind and Chief AI Architect, shares his insights on the launch of Gemini 3 and its positive reception. He highlights the importance of benchmarks in AI advancements and names upcoming focus areas such as instruction following and internationalization. Koray also explores the role of generative media in achieving AGI and how a collaborative culture promotes innovation at Google. He reflects on the engineering mindset that drives safety and user-friendly AI development.
27 snips
Nov 25, 2025 • 45min

Google Antigravity: Hands on with our new agentic development platform

Varun Mohan, co-lead for Google Antigravity, introduces this AI development platform designed for developers and researchers. He discusses how Antigravity integrates a familiar IDE with browser verification and Gemini 3.0 capabilities. Varun highlights the importance of balancing automation with human collaboration, explores agent-assisted development, and explains how artifacts enhance task communication. He also shares insights on building complex workflows and the philosophy behind the platform, and walks through a demo that builds an Airbnb for dogs.
16 snips
Nov 25, 2025 • 42min

Gemini 3: Launch day reactions

Tulsi Doshi and Josh Woodward join the show to discuss the launch of Gemini 3. Tulsi, a product lead for generative AI models, shares new capabilities such as multimodal understanding and agentic features. Josh highlights Gemini's integration across Google surfaces for developer access. They dive into real-world applications, such as transforming handwritten recipes into interactive apps and rapid game development. The duo also reflects on balancing model performance with accessibility, and how user feedback drives continuous improvements.
36 snips
Oct 16, 2025 • 48min

How a Moonshot Led to Google DeepMind's Veo 3

Dumi Erhan, co-lead of the Veo project at Google DeepMind, shares his extensive expertise in video-generation research. He delves into the fascinating journey of the Veo project, from its moonshot beginnings to the groundbreaking Veo 3 model with audio capabilities. Dumi discusses the challenges of long-duration video coherence and the impact of user feedback on future developments. He also explores the complexity of image-to-video generation and highlights innovative prompting methods that enhance user control.
45 snips
Sep 15, 2025 • 37min

GDM’s Pushmeet Kohli on solving science's biggest challenges with AI

Pushmeet Kohli, Head of Science and Strategic Initiatives at Google DeepMind, discusses the groundbreaking intersection of AI and science. He dives into transformative models like AlphaFold, showcasing their potential to revolutionize scientific discovery. Kohli emphasizes how AI can democratize research through tools like AI Co-scientist, enabling wider participation. The conversation also touches on the collaborative efforts behind these innovations and their significant impact on solving complex challenges in mathematics and biology.
49 snips
Aug 27, 2025 • 31min

Behind the scenes of Google's state-of-the-art "nano-banana" image model

Nicole Brichtova and Mostafa Dehghani from Google's Gemini team dive into the features of their image model, Gemini 2.5 Flash Image, nicknamed "nano-banana". They discuss how the model enables intricate edits through interleaved generation and how it maintains character consistency. Listeners learn the story behind the playful "nano-banana" name and see real-time transformations that enhance user engagement. The duo also reflects on the integration of text rendering and user feedback, and on what comes next for image generation technology.
