
Google AI: Release Notes
Ever wondered what it's really like to build the future of AI? Join host Logan Kilpatrick for a deep dive into the world of Google AI, straight from the minds of the builders. We're pulling back the curtain on the latest breakthroughs, sharing the unfiltered stories behind the tech, and answering the questions you've been dying to ask.
Whether you're a seasoned developer or an AI enthusiast, this podcast is your backstage pass to the cutting-edge of AI technology. Tune in for:
- Exclusive interviews with AI pioneers and industry leaders.
- In-depth discussions on the latest AI trends and developments.
- Behind-the-scenes stories and anecdotes from the world of AI.
- Unfiltered insights and opinions from the people shaping the future.
So, if you're ready to go beyond the headlines and get the real scoop on AI, join Logan Kilpatrick on Google AI: Release Notes.
Latest episodes

44 snips
Jul 2, 2025 • 44min
Gemini's Multimodality
Ani Baddepudi, the Product Lead for Gemini Model Behavior, shares her insights on the groundbreaking multimodal capabilities of Gemini. She explains why Gemini was designed as a multimodal model from the start, emphasizing its vision-first approach. The conversation dives into the intricacies of video and image understanding, showcasing advancements in higher FPS video sampling and tokenization methods. Ani also discusses the future of proactive AI assistants and the collaborative efforts behind Gemini’s evolution, revealing exciting possibilities for intuitive AI interactions.

11 snips
Jun 16, 2025 • 1h
Building Gemini's Coding Capabilities
Connie Fan, Product Lead, and Danny Tarlow, Research Lead for Gemini's coding capabilities, dive into the creation of groundbreaking AI coding models. They discuss the importance of foundational goals, the rise of 'vibe coding,' and its transformative effects on development. The duo explores strategies for managing large codebases and how Gemini's framework aims to democratize technology access. They also envision a future where coding tools evolve to meet complex user needs, fostering creativity and productivity in programming.

27 snips
Jun 16, 2025 • 27min
Sergey Brin on the Future of AI & Gemini
Join Sergey Brin, co-founder of Google and a pioneer in computer science, as he delves into the cutting-edge developments of Gemini. He shares insights on the innovative core text models and the integration of native audio, revealing how these advancements enhance storytelling. Brin discusses the rapid evolution of AI, the surprises in recent developments compared to past expectations, and the critical journey towards improved reasoning capabilities. With a focus on Google's vibrant startup culture, his enthusiasm for AI innovation is palpable.

57 snips
May 22, 2025 • 40min
Google I/O 2025 Recap with Josh Woodward and Tulsee Doshi
In this engaging discussion, Josh Woodward, from Google Labs and DeepMind, and Tulsee Doshi, spearheading new Gemini models, recap the highlights from Google I/O 2025. They dive into the exciting launch of Veo 3 and Flow, along with groundbreaking tools like Gemini 2.5 Pro and DeepThink. Topics include the vision for Google I/O 2030, advancements in audio and video tech within Gemini, and the importance of user feedback in shaping AI products. Their insights promise to reshape how we interact with tech in the coming years.

54 snips
May 2, 2025 • 60min
Deep Dive into Long Context
Nikolay Savinov, a Staff Research Scientist at Google DeepMind, delves into the cutting-edge realm of long context in AI. He emphasizes the crucial role of large context windows in enhancing AI agents' performance. The discussion reveals the synergy between long context models and Retrieval Augmented Generation, addressing scaling challenges beyond 2 million tokens. Savinov also shares insights into optimizing context management, improving AI reasoning capabilities, and the future of long context technologies in enhancing user interactions.

37 snips
Mar 28, 2025 • 28min
Launching Gemini 2.5
Tulsee Doshi, Head of Product for Gemini Models at Google, discusses the launch of Gemini 2.5 Pro, a cutting-edge multimodal thinking model. The conversation highlights its advanced reasoning and coding abilities, enabling the creation of complex web applications. Doshi elaborates on balancing academic evaluations with user satisfaction and shares community use cases that showcase its enhanced understanding of physics. The episode emphasizes the collaborative efforts behind the model’s development and the exciting enhancements motivated by user feedback.

Mar 20, 2025 • 37min
Gemini app: Canvas, Deep Research and Personalization
Dave Citron, Senior Director of Product Management at Google and the driving force behind the Gemini app, dives into the latest innovations like Canvas for collaborative content creation. He reveals how Deep Research is enhanced with new Thinking Models and automated reasoning, making it smarter and more efficient. Personalization takes center stage, too, showcasing how user preferences shape responses while balancing privacy concerns. Citron’s insights promise a future of seamless interactions tailored to every user.

26 snips
Feb 24, 2025 • 1h 4min
Developing Google DeepMind's Thinking Models
Jack Rae, Principal Scientist at Google DeepMind, shares insights on advancing reasoning models like Gemini. He discusses how increased 'thinking time' enhances model performance and the significance of long context in language modeling. Rae also highlights the evolution from gaming memory systems to real-world AI applications, emphasizing the need for developer feedback and user interaction. The conversation delves into practical uses, the future of AI reasoning, and innovative evaluation methods that reflect real-world scenarios.

8 snips
Dec 11, 2024 • 35min
Behind the Scenes of Gemini 2.0
Tulsee Doshi, model product lead for Gemini at Google, shares insights on the groundbreaking Gemini 2.0. She discusses the model's significant improvements over its predecessor, including enhanced multimodal capabilities and native tool use, which boost productivity in Google products. Doshi highlights the thrill of launching experimental models while emphasizing the importance of user feedback in refining AI technology. The conversation also unveils innovations like function calling and sophisticated AI agents that lead to richer, personalized user experiences.

Dec 5, 2024 • 43min
Smaller, Faster, Cheaper & The Story of Flash 8B
Emanuel Taropa, a leading developer of Google’s Gemini AI, shares his expertise on the technical intricacies of large language models. He discusses the challenges and triumphs during the launch of the Flash 8B model, emphasizing the shift to smaller, cost-effective models for enhanced accessibility. The conversation also touches on the art of naming models and how these names can inspire innovation amidst launch pressures. Taropa reflects on the teamwork and culture at Google that fuels ongoing advancements in AI technology.