2min snip

Eye On A.I. cover image

#177 Björn Ommer: Diffusion Mods Explained By Stable Diffusion’s Creator

Eye On A.I.

NOTE

SD as Balancing Local Detail CNN compression and Long-Range Context Diffusion

The brain perceives complex shapes by combining distant elements through attentive and pre-attentive processes. Vision researchers aim to computationally replicate this ability efficiently, without exploring all combinations. Rapid feedforward processes in the brain create the impression of shapes like triangles. Attention helps focus on specific details without scaling to every pixel. The challenge lies in representing scenes with intricate local details like texture and colors, while also capturing overarching shapes, like the triangular structure of a bird. Utilizing a combination of architectures is crucial - convolutional neural networks excel at abstracting local details, while diffusion models capture long-range interactions. Integrating these two architectures results in a balanced approach that compresses local detail and diffuses long-range context effectively, providing stable diffusion in representations.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode