Latent Space: The AI Engineer Podcast

swyx + Alessio

The podcast by and for AI Engineers! In 2024, over 2 million readers and listeners came to Latent Space to hear about news, papers and interviews in Software 3.0.We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. Striving to give you both the definitive take on the Current Thing down to the first introduction to the tech you'll be using in the next 3 months! We break news and exclusive interviews from OpenAI, Anthropic, Gemini, Meta (Soumith Chintala), Sierra (Bret Taylor), tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al.Full show notes always on https://latent.space

Episodes

Mentioned books

Jul 9, 2025 • 0sec

AI Video Is Eating The World — Olivia and Justine Moore, a16z

Justine and Olivia Moore, partners at a16z, share their insights on the explosive growth of AI-generated video content. They discuss how platforms like TikTok are evolving to feature collaborative characters, enhancing emotional connections with audiences. The Moores explore the democratization of creativity through AI, highlighting the shift from niche trends to mainstream engagement. They also touch on tools like Comfy UI that simplify video workflows, making viral content creation accessible for everyone. Tune in for their strategies on crafting compelling and relatable media!

Jul 2, 2025 • 0sec

Information Theory for Language Models: Jack Morris

In this engaging discussion, Jack Morris, a PhD grad student at Cornell Tech, unpacks the intricate relationship between information theory and large language models. He shares fascinating insights about the efficiency of data representation in AI, particularly in models like GPT-3. The conversation dives into the revolutionary concepts of embedding inversion and the implications for model alignment and security. Jack also explores the potential of emerging programming languages like Mojo, merging performance with innovation in AI research.

Jun 19, 2025 • 0sec

Scaling Test Time Compute to Multi-Agent Civilizations — Noam Brown, OpenAI

Noam Brown, who leads the multi-agent team at OpenAI, shares insights from his groundbreaking work in AI, especially in competitive strategy games like poker and Diplomacy. He discusses the fascinating impact of AI on human gameplay and critiques the constraints of the System 1/2 thinking model in AI reasoning. The conversation also touches on the challenges of test-time compute limitations, multi-agent intelligence, and innovative applications of AI tools like Codex and Windsurf, while pondering the future of AI civilizations.

Jun 13, 2025 • 0sec

The Shape of Compute (Chris Lattner of Modular)

Chris Lattner, the visionary behind LLVM and Swift, now leads Modular, pushing the boundaries of GPU programming with Mojo. He discusses breaking NVIDIA's hold on the market and achieving AMD performance levels. The conversation delves into Mojo's design, merging simplicity with advanced capabilities for AI applications. Lattner also shares insights on community engagement, the importance of open-source contributions, and his reflections on leadership, emphasizing the blend of technical innovation and personal balance in a startup environment.

Jun 6, 2025 • 0sec

The Utility of Interpretability — Emmanuel Amiesen

Emmanuel Amiesen, lead author at Anthropic focusing on AI model interpretability, joins guest host Vibhu Sapra, an AI enthusiast with a rich background in economics and data science. They dive into groundbreaking tools for analyzing language model behaviors, revealing how circuit tracing enhances interpretability. The duo explores model complexities, the significance of feature interpretation, and the challenges of biases in AI systems. They also discuss the interplay between research and engineering roles, emphasizing the importance of transparency and safety in AI development.

Jun 3, 2025 • 0sec

[AIEWF Preview] Containing Agent Chaos — Solomon Hykes

In this discussion, Solomon Hykes, the creator of Docker and founder of Dagger, dives into the evolution of developer workflows powered by container technology. He highlights Dagger's role in automating software delivery and the importance of creating intuitive experiences for coding agents. Solomon emphasizes the need for standardization in agent environments and explores the balance between simplicity and modularity, using a Lego analogy. He also examines the challenges of ephemeral applications and the future landscape of AI-driven development.

Jun 2, 2025 • 0sec

[AIEWF Preview] Gemini in 2025 and Realtime Voice AI

As part of our AI Engineer World’s Fair preview, we’re releasing a special cross podcast recorded with Sam Charrington of TWiML AI at last week’s Google I/O!TUESDAY: Shrestha and Kwindla’s workshop: https://www.ai.engineer/schedule#milliseconds-to-magic-real-time-workflows-using-the-gemini-live-api-and-pipecatTUESDAY: Kwindla’s workshop: https://www.ai.engineer/schedule#building-voice-agents-with-gemini-and-pipecatWEDNESDAY: Shrestha and Kwindla’s talk: https://www.ai.engineer/schedule#milliseconds-to-magic-real-time-workflows-using-the-gemini-live-api-and-pipecatWEDNESDAY: Kwindla’s keynote: https://www.ai.engineer/schedule#-voice-keynote-your-realtime-ai-is-ngmiTHURSDAY: Logan’s keynote: https://www.ai.engineer/schedule#a-year-of-gemini-progress-what-comes-nextCatch all the speakers at AIE (both workshops and talks):Logan Kilpatrick: https://www.latent.space/p/chatgpt-gpt4-hype-and-building-llmShrestha Basu Mallick: https://www.linkedin.com/in/shresthabm/Kwindla Hultman Kramer: https://www.linkedin.com/in/kwkramer

May 31, 2025 • 0sec

[AIEWF Preview] CloudChef: Your Robot Chef - Michellin-Star food at $12/hr (w/ Kitchen tour!)

Nikhil Abraham, CEO of CloudChef and pioneer in kitchen robotics, brings a fresh perspective on revolutionizing cooking. He discusses how Zippy, their AI chef, uses advanced culinary intelligence to deliver restaurant-quality meals affordably. The conversation dives into demonstration learning, where robots mimic Michelin star chefs' techniques. Abraham also highlights their innovative business model aimed at tackling labor shortages in the food industry while ensuring efficiency and quality. Tune in for insights on how AI is poised to transform dining experiences!

May 29, 2025 • 0sec

The AI Coding Factory

Matan Grinberg and Eno Reyes, the co-founders of Factory.ai, delve into their journey of creating autonomous software engineering droids after meeting at a Langchain hackathon. They discuss the integration of AI in code generation and incident response, and the evolution of their platform's unique 'droids' concept. The duo shares insights on navigating AI challenges, user experience, and the significance of collaboration in tech. They also highlight how their innovations are reshaping development workflows for Fortune 500 companies.

May 23, 2025 • 0sec

[AIEWF Preview] Multi-Turn RL for Multi-Hour Agents — with Will Brown, Prime Intellect

Will Brown, the reasoning research lead at Prime Intellect, shares his insights on the latest advancements in multi-turn reasoning for LLM agents. He discusses his recent paper on turn-level credit assignment, shedding light on the importance of practical AI agent applications. The conversation covers challenges in model training, ethical dilemmas, and managing token budgets for efficient performance. Brown also speculates on the future of AI safety and the evolving capabilities of models like Claude 4, diving into their real-world implications and complexities.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

App store banner

Play store banner