Google AI: Release Notes

Google AI
Aug 27, 2025 • 31min

Behind the scenes of Google's state-of-the-art "nano-banana" image model

Nicole Brichtova and Mostafa Dehghani from Google's Gemini team dive into the innovative features of their cutting-edge image model, Gemini 2.5 Flash Image. They discuss how the model enables intricate edits through interleaved generation and how it maintains character consistency. Listeners learn about the playful 'nano-banana' concept, which showcases real-time transformations that enhance user engagement. The duo also reflects on text rendering and the role of user feedback, paving the way for future advancements in image generation technology.
102 snips
Aug 11, 2025 • 31min

Demis Hassabis on shipping momentum, better evals and world models

Demis Hassabis, CEO of Google DeepMind, dives into the evolution of AI from gaming to advanced thinking models. He discusses Genie 3 and its role in building world models that enable AI to grasp reality better. The conversation also touches on the necessity for improved evaluation methods through platforms like Kaggle’s Game Arena, as well as the integration of tool use in AI systems. Hassabis shares insights on scaling AI and the exciting future applications that could emerge from these advancements.
51 snips
Aug 6, 2025 • 40min

Building real-time voice applications with Live API

Shrestha Basu Mallick, Product Lead for the Gemini API at Google, dives into the transformative power of the Gemini Live API, highlighting its seamless integration of real-time audio capabilities. She discusses how proactive audio and async functions enhance user interaction. Topics include the importance of audio as an interface, imaginative use cases in applications like Photoshop, and lighthearted banter about the constellation Gemini and development quirks. It's a vibrant conversation about innovation, creativity, and developer insights.
53 snips
Jul 23, 2025 • 43min

Building a frontier AI search experience

Robby Stein, VP of Product for Google Search, dives into the transformation of Search into a cutting-edge AI product. He discusses the shift from basic keyword searches to interactive, conversational queries capable of handling complex tasks seamlessly. Stein highlights the innovative AI Mode and the role of Deep Search in personalizing user experiences. The conversation also touches on the emergence of visual and speech-based search capabilities, showing how the platform aims to empower users to 'ask anything' and leverage real-time AI tools for everyday tasks.
54 snips
Jul 2, 2025 • 44min

Gemini's Multimodality

Ani Baddepudi, the Product Lead for Gemini Model Behavior, shares insights on the groundbreaking multimodal capabilities of Gemini, explaining why Gemini was designed as a multimodal model from the start and emphasizing its vision-first approach. The conversation dives into the intricacies of video and image understanding, showcasing advancements in higher FPS video sampling and tokenization methods. Ani also discusses the future of proactive AI assistants and the collaborative efforts behind Gemini’s evolution, revealing exciting possibilities for intuitive AI interactions.
11 snips
Jun 16, 2025 • 1h

Building Gemini's Coding Capabilities

Connie Fan, Product Lead, and Danny Tarlow, Research Lead for Gemini's coding capabilities, dive into the creation of groundbreaking AI coding models. They discuss the importance of foundational goals, the rise of 'vibe coding,' and its transformative effects on development. The duo explores strategies for managing large codebases and how Gemini's framework aims to democratize technology access. They also envision a future where coding tools evolve to meet complex user needs, fostering creativity and productivity in programming.
27 snips
Jun 16, 2025 • 27min

Sergey Brin on the Future of AI & Gemini

Join Sergey Brin, co-founder of Google and a pioneer in computer science, as he delves into the cutting-edge developments of Gemini. He shares insights on the innovative core text models and the integration of native audio, revealing how these advancements enhance storytelling. Brin discusses the rapid evolution of AI, the surprises in recent developments compared to past expectations, and the critical journey towards improved reasoning capabilities. With a focus on Google's vibrant startup culture, his enthusiasm for AI innovation is palpable.
57 snips
May 22, 2025 • 40min

Google I/O 2025 Recap with Josh Woodward and Tulsee Doshi

In this engaging discussion, Josh Woodward, from Google Labs and DeepMind, and Tulsee Doshi, spearheading new Gemini models, recap the highlights from Google I/O 2025. They dive into the exciting launch of Veo 3 and Flow, along with groundbreaking tools like Gemini 2.5 Pro and Deep Think. Topics include the vision for Google I/O 2030, advancements in audio and video tech within Gemini, and the importance of user feedback in shaping AI products. Their insights promise to reshape how we interact with tech in the coming years.
55 snips
May 2, 2025 • 60min

Deep Dive into Long Context

Nikolay Savinov, a Staff Research Scientist at Google DeepMind, delves into the cutting-edge realm of long context in AI. He emphasizes the crucial role of large context windows in enhancing AI agents' performance. The discussion reveals the synergy between long-context models and retrieval-augmented generation (RAG) and addresses the challenges of scaling beyond 2 million tokens. Savinov also shares insights into optimizing context management, improving AI reasoning capabilities, and the future of long-context technologies in enhancing user interactions.
37 snips
Mar 28, 2025 • 28min

Launching Gemini 2.5

Tulsee Doshi, Head of Product for Gemini Models at Google, discusses the launch of Gemini 2.5 Pro, a cutting-edge multimodal thinking model. The conversation highlights its advanced reasoning and coding abilities, enabling the creation of complex web applications. Doshi elaborates on balancing academic evaluations with user satisfaction and shares community use cases that showcase its enhanced understanding of physics. The episode emphasizes the collaborative efforts behind the model’s development and the exciting enhancements motivated by user feedback.
