

[Linkpost] “Gemini Diffusion: watch this space” by Yair Halberstadt
May 22, 2025
Google DeepMind's Gemini Diffusion takes a different approach to generating text: instead of predicting tokens one by one, it iteratively denoises the entire output. The speaker highlights its impressive speed, nearly 1000 tokens per second, and recalls a personal encounter where it aced a Google interview question in seconds. While it's not flawless, its performance suggests diffusion-based language models could go well beyond what current language models offer.
AI Snips
Diffusion vs Token Prediction
- Diffusion models iteratively denoise all output tokens in parallel until a coherent result emerges, unlike autoregressive LLMs, which predict one token at a time (see the toy sketch after this list).
- This approach is inspired by image diffusion models and brings its own generation advantages.
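A minimal toy sketch of how the two decoding loops differ. The vocabulary, `toy_predict` function, and step count are made-up stand-ins for illustration only; nothing here reflects Gemini Diffusion's actual architecture or API, which has not been published in detail.

```python
# Toy contrast: autoregressive decoding vs diffusion-style iterative denoising.
# The "model" is a random stand-in over a tiny vocabulary (an assumption for
# illustration), not a real language model.
import random

VOCAB = ["the", "cat", "sat", "on", "mat", "."]
MASK = "<mask>"

def toy_predict(draft, position):
    """Placeholder model: propose a token for one position given the draft."""
    random.seed(hash((tuple(draft), position)) % (2**32))
    return random.choice(VOCAB)

def autoregressive_decode(length):
    # LLM-style: generate one token at a time, left to right.
    out = []
    for i in range(length):
        out.append(toy_predict(out, i))
    return out

def diffusion_decode(length, steps=5):
    # Diffusion-style: start with every position masked/noisy, then
    # re-predict all positions in parallel for a fixed number of steps.
    out = [MASK] * length
    for _ in range(steps):
        out = [toy_predict(out, i) for i in range(length)]
    return out

if __name__ == "__main__":
    print("autoregressive:", autoregressive_decode(6))
    print("diffusion-style:", diffusion_decode(6))
```

The structural point is the loop shape: the autoregressive version's runtime grows with output length one token per step, while the diffusion-style version touches every position on every step, which is why a small fixed number of steps can yield very high tokens-per-second throughput.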
Gemini Diffusion Interview Demo
- Google DeepMind's Gemini Diffusion answered my Google interview question perfectly in 2 seconds.
- It struggled a bit on follow-up questions but still significantly outperformed ChatGPT 3.
Diffusion Enables Native Editing
- Diffusion models natively support editing in the middle of an output, unlike left-to-right autoregressive LLMs (a sketch of span editing follows below).
- Because the whole output is refined together rather than committed to token by token, the internal contradictions common in LLM outputs are less likely.
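A minimal sketch of what such in-place editing could look like under these assumptions: re-mask a middle span and re-denoise only those positions while the surrounding tokens stay fixed. As above, `toy_predict`, the vocabulary, and the step count are hypothetical placeholders, not Gemini Diffusion's real interface.

```python
# Sketch of diffusion-style span editing: mask a middle region and iteratively
# re-denoise only those positions, conditioning on the fixed tokens on both
# sides -- something a left-to-right decoder cannot do without regenerating
# the whole suffix. The scoring function is a toy stand-in, not a real model.
import random

VOCAB = ["the", "dog", "ran", "over", "hill", "."]
MASK = "<mask>"

def toy_predict(draft, position):
    """Placeholder model: propose a token for one position given the draft."""
    random.seed(hash((tuple(draft), position)) % (2**32))
    return random.choice(VOCAB)

def edit_span(tokens, start, end, steps=5):
    # Mask only the span being edited; everything else is kept as-is.
    draft = list(tokens)
    for i in range(start, end):
        draft[i] = MASK
    # Iteratively re-fill the masked positions given the full draft.
    for _ in range(steps):
        for i in range(start, end):
            draft[i] = toy_predict(draft, i)
    return draft

if __name__ == "__main__":
    sentence = ["the", "dog", "ran", "over", "the", "hill", "."]
    print("edited:", edit_span(sentence, 2, 5))
```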