

Google AI: Release Notes
Google AI
Ever wondered what it's really like to build the future of AI? Join host Logan Kilpatrick for a deep dive into the world of Google AI, straight from the minds of the builders. We're pulling back the curtain on the latest breakthroughs, sharing the unfiltered stories behind the tech, and answering the questions you've been dying to ask.
Whether you're a seasoned developer or an AI enthusiast, this podcast is your backstage pass to the cutting-edge of AI technology. Tune in for:
- Exclusive interviews with AI pioneers and industry leaders.
- In-depth discussions on the latest AI trends and developments.
- Behind-the-scenes stories and anecdotes from the world of AI.
- Unfiltered insights and opinions from the people shaping the future.
So, if you're ready to go beyond the headlines and get the real scoop on AI, join Logan Kilpatrick on Google AI: Release Notes.
Whether you're a seasoned developer or an AI enthusiast, this podcast is your backstage pass to the cutting-edge of AI technology. Tune in for:
- Exclusive interviews with AI pioneers and industry leaders.
- In-depth discussions on the latest AI trends and developments.
- Behind-the-scenes stories and anecdotes from the world of AI.
- Unfiltered insights and opinions from the people shaping the future.
So, if you're ready to go beyond the headlines and get the real scoop on AI, join Logan Kilpatrick on Google AI: Release Notes.
Episodes
Mentioned books

Aug 27, 2025 • 31min
Behind the scenes of Google's state-of-the-art "nano-banana" image model
Nicole Brichtova and Mostafa Dehghani from Google's Gemini team dive into the innovative features of their cutting-edge image model, Gemini 2.5 Flash. They discuss how the model enables intricate edits through interleaved generation and its ability to maintain character consistency. Listeners learn about the playful 'nano-banana' concept, showcasing real-time transformations that enhance user engagement. The duo also reflects on the integration of text rendering and user feedback, paving the way for future advancements in image generation technology.

102 snips
Aug 11, 2025 • 31min
Demis Hassabis on shipping momentum, better evals and world models
Demis Hassabis, CEO of Google DeepMind, dives into the evolution of AI from gaming to advanced thinking models. He discusses Genie 3 and its role in building world models that enable AI to grasp reality better. The conversation also touches on the necessity for improved evaluation methods through platforms like Kaggle’s Game Arena, as well as the integration of tool use in AI systems. Hassabis shares insights on scaling AI and the exciting future applications that could emerge from these advancements.

51 snips
Aug 6, 2025 • 40min
Building real-time voice applications with Live API
Shrestha Basu Mallick, Product lead for the Gemini API at Google, dives into the transformative power of the Gemini Live API, highlighting its seamless integration of real-time audio capabilities. She discusses how proactive audio and async functions enhance user interaction. Interesting topics include the importance of audio as an interface, imaginative use cases in applications like Photoshop, and a lighthearted banter about the constellation Gemini and development quirks. It's a vibrant conversation about innovation, creativity, and developer insights.

53 snips
Jul 23, 2025 • 43min
Building a frontier AI search experience
Robby Stein, VP of Product for Google Search, dives into the transformation of Search into a cutting-edge AI product. He discusses the shift from basic keyword searches to interactive, conversational queries, capable of handling complex tasks seamlessly. Stein highlights the innovative AI Mode and the role of Deep Search in personalizing user experiences. They touch on the emergence of visual and speech-based search capabilities, showcasing how the platform aims to empower users to 'ask anything' and leverage real-time AI tools for everyday tasks.

54 snips
Jul 2, 2025 • 44min
Gemini's Multimodality
Ani Baddepudi, the Product Lead for Gemini Model Behavior, shares her insights on the groundbreaking multimodal capabilities of Gemini. She explains why Gemini was designed as a multimodal model from the start, emphasizing its vision-first approach. The conversation dives into the intricacies of video and image understanding, showcasing advancements in higher FPS video sampling and tokenization methods. Ani also discusses the future of proactive AI assistants and the collaborative efforts behind Gemini’s evolution, revealing exciting possibilities for intuitive AI interactions.

11 snips
Jun 16, 2025 • 1h
Building Gemini's Coding Capabilities
Connie Fan, Product Lead, and Danny Tarlow, Research Lead for Gemini's coding capabilities, dive into the creation of groundbreaking AI coding models. They discuss the importance of foundational goals, the rise of 'vibe coding,' and its transformative effects on development. The duo explores strategies for managing large codebases and how Gemini's framework aims to democratize technology access. They also envision a future where coding tools evolve to meet complex user needs, fostering creativity and productivity in programming.

27 snips
Jun 16, 2025 • 27min
Sergey Brin on the Future of AI & Gemini
Join Sergey Brin, co-founder of Google and a pioneer in computer science, as he delves into the cutting-edge developments of Gemini. He shares insights on the innovative core text models and the integration of native audio, revealing how these advancements enhance storytelling. Brin discusses the rapid evolution of AI, the surprises in recent developments compared to past expectations, and the critical journey towards improved reasoning capabilities. With a focus on Google's vibrant startup culture, his enthusiasm for AI innovation is palpable.

57 snips
May 22, 2025 • 40min
Google I/O 2025 Recap with Josh Woodward and Tulsee Doshi
In this engaging discussion, Josh Woodward, from Google Labs and DeepMind, and Tulsee Doshi, spearheading new Gemini models, recap the highlights from Google I/O 2025. They dive into the exciting launch of Veo 3 and Flow, along with groundbreaking tools like Gemini 2.5 Pro and DeepThink. Topics include the vision for Google I/O 2030, advancements in audio and video tech within Gemini, and the importance of user feedback in shaping AI products. Their insights promise to reshape how we interact with tech in the coming years.

55 snips
May 2, 2025 • 60min
Deep Dive into Long Context
Nikolay Savinov, a Staff Research Scientist at Google DeepMind, delves into the cutting-edge realm of long context in AI. He emphasizes the crucial role of large context windows in enhancing AI agents' performance. The discussion reveals the synergy between long context models and Retrieval Augmented Generation, addressing scaling challenges beyond 2 million tokens. Savinov also shares insights into optimizing context management, improving AI reasoning capabilities, and the future of long context technologies in enhancing user interactions.

37 snips
Mar 28, 2025 • 28min
Launching Gemini 2.5
Tulsee Doshi, Head of Product for Gemini Models at Google, discusses the launch of Gemini 2.5 Pro, a cutting-edge multimodal thinking model. The conversation highlights its advanced reasoning and coding abilities, enabling the creation of complex web applications. Doshi elaborates on balancing academic evaluations with user satisfaction and shares community use cases that showcase its enhanced understanding of physics. The episode emphasizes the collaborative efforts behind the model’s development and the exciting enhancements motivated by user feedback.