Latent Space: The AI Engineer Podcast

swyx + Alessio

The podcast by and for AI Engineers! In 2024, over 2 million readers and listeners came to Latent Space to hear about news, papers and interviews in Software 3.0.We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. Striving to give you both the definitive take on the Current Thing down to the first introduction to the tech you'll be using in the next 3 months! We break news and exclusive interviews from OpenAI, Anthropic, Gemini, Meta (Soumith Chintala), Sierra (Bret Taylor), tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al.Full show notes always on https://latent.space

Episodes

Mentioned books

Jun 6, 2025 • 1h 53min

The Utility of Interpretability — Emmanuel Amiesen

Emmanuel Amiesen, lead author at Anthropic focusing on AI model interpretability, joins guest host Vibhu Sapra, an AI enthusiast with a rich background in economics and data science. They dive into groundbreaking tools for analyzing language model behaviors, revealing how circuit tracing enhances interpretability. The duo explores model complexities, the significance of feature interpretation, and the challenges of biases in AI systems. They also discuss the interplay between research and engineering roles, emphasizing the importance of transparency and safety in AI development.

Jun 3, 2025 • 27min

[AIEWF Preview] Containing Agent Chaos — Solomon Hykes

In this discussion, Solomon Hykes, the creator of Docker and founder of Dagger, dives into the evolution of developer workflows powered by container technology. He highlights Dagger's role in automating software delivery and the importance of creating intuitive experiences for coding agents. Solomon emphasizes the need for standardization in agent environments and explores the balance between simplicity and modularity, using a Lego analogy. He also examines the challenges of ephemeral applications and the future landscape of AI-driven development.

Jun 2, 2025 • 24min

[AIEWF Preview] Gemini in 2025 and Realtime Voice AI

Logan Kilpatrick, a product lead at Google AI Studio, dives into the latest Gemini developments, including implicit context caching and the exciting potential of Gemini Diffusion for generative UIs. Shrestha Basu Mallick, an API product manager, highlights the challenges of live APIs and praises innovations like multilingual TTS and URL Context. Quinn Daily, CEO of Daily, discusses the importance of low-latency audio/video and introduces proactive audio models that filter out irrelevant speech. The trio discusses future capabilities and the need for greater developer control.

May 31, 2025 • 21min

[AIEWF Preview] CloudChef: Your Robot Chef - Michellin-Star food at $12/hr (w/ Kitchen tour!)

Nikhil Abraham, CEO of CloudChef and pioneer in kitchen robotics, brings a fresh perspective on revolutionizing cooking. He discusses how Zippy, their AI chef, uses advanced culinary intelligence to deliver restaurant-quality meals affordably. The conversation dives into demonstration learning, where robots mimic Michelin star chefs' techniques. Abraham also highlights their innovative business model aimed at tackling labor shortages in the food industry while ensuring efficiency and quality. Tune in for insights on how AI is poised to transform dining experiences!

May 29, 2025 • 59min

The AI Coding Factory

Matan Grinberg and Eno Reyes, the co-founders of Factory.ai, delve into their journey of creating autonomous software engineering droids after meeting at a Langchain hackathon. They discuss the integration of AI in code generation and incident response, and the evolution of their platform's unique 'droids' concept. The duo shares insights on navigating AI challenges, user experience, and the significance of collaboration in tech. They also highlight how their innovations are reshaping development workflows for Fortune 500 companies.

May 23, 2025 • 40min

[AIEWF Preview] Multi-Turn RL for Multi-Hour Agents — with Will Brown, Prime Intellect

Will Brown, the reasoning research lead at Prime Intellect, shares his insights on the latest advancements in multi-turn reasoning for LLM agents. He discusses his recent paper on turn-level credit assignment, shedding light on the importance of practical AI agent applications. The conversation covers challenges in model training, ethical dilemmas, and managing token budgets for efficient performance. Brown also speculates on the future of AI safety and the evolving capabilities of models like Claude 4, diving into their real-world implications and complexities.

May 16, 2025 • 54min

ChatGPT Codex: The Missing Manual

Josh Ma, a core developer on the ChatGPT Codex team at OpenAI, and Alexander Embiricos, a member leading Codex testing, dive into the origins and future of Codex, the Autonomous Software Engineer. They discuss the shift from traditional pair programming to AI collaboration, exploring best practices for integrating AI in coding. The conversation also touches on balancing control and trust with AI outputs, understanding task durations, and the evolving user experience. They emphasize the significance of user feedback for future enhancements and the complexities of pricing in AI tools.

May 7, 2025 • 1h 17min

Claude Code: Anthropic's CLI Agent

Boris Churny, Lead Engineer for Claude Code, and Kat Wu, PM at Anthropic, dive into the cutting-edge world of AI developer tools. They discuss the evolution of Claude Code as a Unix utility that embraces simplicity and user feedback. Listeners learn about its unique command-line functionalities, the balance between AI automation and human coding, and the importance of safety in AI writing tasks. The duo also explores future trends in AI development, highlighting the pressure on developers to adapt to new tech landscapes.

May 1, 2025 • 27min

⚡️The Rise and Fall of the Vector DB Category

Jo Kristian Bergum, a seasoned search infrastructure expert with two decades at Yahoo and Fast Search & Transfer, dives deep into the evolution of vector databases. He discusses the surge in vector database popularity post-ChatGPT and the misconceptions surrounding embedding-based similarity search. The conversation explores the dynamic interplay between traditional search methods and embedding techniques. Additionally, Joe sheds light on the future of retrieval-augmented generation and the importance of knowledge graphs in AI development.

Apr 24, 2025 • 1h 7min

Why Every Agent needs Open Source Cloud Sandboxes

Vasek Mlejnsky, a visionary from E2B, joins to share insights on building secure cloud sandboxes for AI agents. He discusses the rapid growth of E2B and its adoption by major companies. The conversation dives into the unique challenges posed by early LLMs and the advantages of cloud environments for AI. Vasek highlights practical use cases like code execution and data analysis, while also addressing the shifting landscape of AI frameworks and billing models. His thoughts on future advancements and multi-modality in AI are particularly intriguing.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

App store banner

Play store banner