Behind the Scenes of Gemini 2.0

8 snips

Dec 11, 2024

Tulsee Doshi, model product lead for Gemini at Google, shares insights on the groundbreaking Gemini 2.0. She discusses the model's significant improvements over its predecessor, including enhanced multimodal capabilities and native tool use, which boost productivity in Google products. Doshi highlights the thrill of launching experimental models while emphasizing the importance of user feedback in refining AI technology. The conversation also unveils innovations like function calling and sophisticated AI agents that lead to richer, personalized user experiences.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

INSIGHT

Gemini 2.0: A Shift in Approach

Gemini 2.0 feels fundamentally different from previous releases, marking a shift from initial experimentation to a more refined development process.
The team has gained more clarity on use cases, metrics, and the meaning of progress in generative AI.

INSIGHT

Gemini 2.0: Multimodal Agents and Real-Time Applications

Gemini 2.0 empowers the creation of multimodal agents, demonstrated by projects like Astra and Mariner.
Its native multimodal capabilities include image and audio output, spatial reasoning, and fast performance, ideal for real-time applications.

ADVICE

Engage with Experimental Models

Try experimental Gemini models and provide feedback.
This feedback actively shapes future production models, influencing development and use case discovery.

Get the Snipd Podcast app to discover more snips from this episode

Get the app