

Stefano Ermon
Professor of Computer Science at Stanford University. A key figure in the research behind diffusion models used in generative AI.
Top 3 podcasts with Stefano Ermon
Ranked by the Snipd community

64 snips
Feb 24, 2024 • 35min
Beyond Uncanny Valley: Breaking Down Sora
In this engaging discussion, Stefano Ermon, a leading Professor of Computer Science at Stanford, reveals the inner workings of OpenAI's groundbreaking Sora model for AI-generated video. He discusses the shift from GANs to diffusion models and the significance of high-quality training data. The conversation explores the uncanny valley and how Sora's capabilities could reshape our understanding of video compression and generation. Ermon also hints at the exciting future of personalized video content and its applications in various fields.

19 snips
May 24, 2024 • 1h 6min
ARCHIVE: Open Models (with Arthur Mensch) and Video Models (with Stefano Ermon)
Guests Arthur Mensch and Stefano Ermon discuss open foundation models and video models, emphasizing the importance of neutrality in technology and the regulation of AI applications. They explore the evolution of language models, the advantages of open-source collaboration, and the future prospects of specialized AI models. Additionally, they delve into the challenges and advancements in generating longer videos, showcasing how physics aids in accurate predictions and data compression in 3D models.

10 snips
Oct 9, 2025 • 39min
Diffusion LLMs - The Fastest LLMs Ever Built | Stefano Ermon, cofounder of Inception Labs
In this engaging discussion, Stefano Ermon, a Stanford associate professor and co-founder of Inception Labs, dives into the revolutionary world of Diffusion Language Models. He explains how these models surpass traditional autoregressive techniques, highlighting breakthroughs in parallel refinement for text and code generation. Stefano also shares insights on engineering challenges, the importance of high-quality data, and commercial viability. Excitingly, he discusses the future potential of diffusion LLMs in coding and multimodal applications.


