

Latent Space: The AI Engineer Podcast
swyx + Alessio
The podcast by and for AI Engineers! In 2024, over 2 million readers and listeners came to Latent Space to hear about news, papers and interviews in Software 3.0. We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. We strive to give you everything from the definitive take on the Current Thing to the first introduction to the tech you'll be using in the next 3 months! We break news and exclusive interviews from OpenAI, Anthropic, Gemini, Meta (Soumith Chintala), Sierra (Bret Taylor), tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al. Full show notes always on https://latent.space
Episodes

1,587 snips
Aug 19, 2025
Long Live Context Engineering - with Jeff Huber of Chroma
Jeff Huber, Founder and CEO of Chroma, shares insights on the future of vector databases and the unique aspects of modern AI search. He discusses the challenges of context rot and the importance of context quality in AI applications. The conversation dives into retrieval strategies, memory management, and the evolution of transformer architecture. Huber also reflects on his previous startup experiences and the intersection of personal values with company culture, emphasizing the need for a cohesive brand identity in tech.

2,236 snips
Aug 15, 2025
Greg Brockman on OpenAI's Road to AGI
Greg Brockman, co-founder and president of OpenAI, shares insights on GPT-5's capabilities and the open-source initiative GPT-OSS. He discusses the evolution of machine learning from offline to dynamic online systems and explores how reinforcement learning influences AI refinement. The conversation dives into AI's role in coding enhancement, addressing the rise of self-improving coding agents. Brockman also touches on the aggressive pricing of GPT-5 and its implications for accessibility, framing a future where AI continues to learn and innovate.

641 snips
Jul 31, 2025
The RLVR Revolution — with Nathan Lambert (AI2, Interconnects.ai)
Nathan Lambert, an AI researcher from AI2 and Interconnects.ai, returns to explore the evolution of Reinforcement Learning with Verifiable Rewards (RLVR). He discusses how RLVR shifts from subjective feedback to verifiable reward signals, enhancing scalability and reliability. Lambert highlights the challenges of tool use in RL frameworks and showcases the Tulu model series aimed at democratizing AI development. The conversation dives into the balance of fine-tuning, user data significance, and the implications for future AI performance and design.

1,061 snips
Jul 23, 2025 • 56min
AI is Eating Search
Robert McCloy, co-founder of Scrunch AI and former CTO of Hearsay, discusses the dramatic shift in search dynamics driven by AI technologies like ChatGPT. He reveals that AI is set to rival traditional search engines by 2026, prompting businesses to rethink SEO strategies. Conversion rates from AI-driven traffic are outperforming conventional methods. McCloy emphasizes the importance of user intent and structured content while navigating this new landscape, exploring how AI tools are reshaping consumer interactions and optimizing content on-the-fly.

1,464 snips
Jul 16, 2025 • 1h 16min
Cline: the open source coding agent that doesn't cut costs
Saoud Rizwan, a key contributor at Cline, chats about developing coding agents and enhancing VS Code through their open-source extension. He discusses the plan + act paradigm and how it changes coding interactions. The conversation also dives into the surprising use of IDEs by non-tech users for marketing tasks. Challenges in monetizing the Model Context Protocol (MCP) ecosystem and the evolving complexity of programming tasks are also highlighted, with a look at the future of coding and automation.

1,222 snips
Jul 11, 2025 • 1h 4min
Personalized AI Language Education — with Andrew Hsu, Speak
In this discussion, Andrew Hsu, CTO of Speak and a former Thiel Fellow, dives into the world of AI-driven language education. He shares how Speak has transformed language learning, particularly in South Korea, by creating personalized AI tutors that rival human instruction. The conversation highlights the app's evolution, its unique teaching methods, and the challenges of adapting to diverse cultural contexts. Hsu also emphasizes the importance of real-world language applications and the future of immersive, technology-enhanced learning.

1,001 snips
Jul 9, 2025 • 49min
AI Video Is Eating The World — Olivia and Justine Moore, a16z
Justine and Olivia Moore, partners at a16z, share their insights on the explosive growth of AI-generated video content. They discuss how platforms like TikTok are evolving to feature collaborative characters, enhancing emotional connections with audiences. The Moores explore the democratization of creativity through AI, highlighting the shift from niche trends to mainstream engagement. They also touch on tools like ComfyUI that simplify video workflows, making viral content creation accessible to everyone. Tune in for their strategies on crafting compelling and relatable media!

725 snips
Jul 2, 2025 • 1h 18min
Information Theory for Language Models: Jack Morris
In this discussion, Jack Morris, a PhD student at Cornell Tech, unpacks the relationship between information theory and large language models. He shares insights about the efficiency of data representation in AI, particularly in models like GPT-3. The conversation dives into embedding inversion and its implications for model alignment and security. Jack also explores the potential of emerging programming languages like Mojo, which aim to combine performance with ease of use in AI research.

1,253 snips
Jun 19, 2025
Scaling Test Time Compute to Multi-Agent Civilizations — Noam Brown, OpenAI
Noam Brown, who leads the multi-agent team at OpenAI, shares insights from his groundbreaking work in AI, especially in competitive strategy games like poker and Diplomacy. He discusses the fascinating impact of AI on human gameplay and critiques the constraints of the System 1/2 thinking model in AI reasoning. The conversation also touches on the challenges of test-time compute limitations, multi-agent intelligence, and innovative applications of AI tools like Codex and Windsurf, while pondering the future of AI civilizations.

1,102 snips
Jun 13, 2025
The Shape of Compute (Chris Lattner of Modular)
Chris Lattner, the visionary behind LLVM and Swift, now leads Modular, pushing the boundaries of GPU programming with Mojo. He discusses breaking NVIDIA's hold on the market and achieving AMD performance levels. The conversation delves into Mojo's design, merging simplicity with advanced capabilities for AI applications. Lattner also shares insights on community engagement, the importance of open-source contributions, and his reflections on leadership, emphasizing the blend of technical innovation and personal balance in a startup environment.