Latent Space: The AI Engineer Podcast

swyx + Alessio

The podcast by and for AI Engineers! In 2024, over 2 million readers and listeners came to Latent Space to hear about news, papers and interviews in Software 3.0.We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. Striving to give you both the definitive take on the Current Thing down to the first introduction to the tech you'll be using in the next 3 months! We break news and exclusive interviews from OpenAI, Anthropic, Gemini, Meta (Soumith Chintala), Sierra (Bret Taylor), tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al.Full show notes always on https://latent.space

Episodes

Mentioned books

May 16, 2025 • 0sec

ChatGPT Codex: The Missing Manual

Josh Ma, a core developer on the ChatGPT Codex team at OpenAI, and Alexander Embiricos, a member leading Codex testing, dive into the origins and future of Codex, the Autonomous Software Engineer. They discuss the shift from traditional pair programming to AI collaboration, exploring best practices for integrating AI in coding. The conversation also touches on balancing control and trust with AI outputs, understanding task durations, and the evolving user experience. They emphasize the significance of user feedback for future enhancements and the complexities of pricing in AI tools.

May 7, 2025 • 0sec

Claude Code: Anthropic's CLI Agent

Boris Churny, Lead Engineer for Claude Code, and Kat Wu, PM at Anthropic, dive into the cutting-edge world of AI developer tools. They discuss the evolution of Claude Code as a Unix utility that embraces simplicity and user feedback. Listeners learn about its unique command-line functionalities, the balance between AI automation and human coding, and the importance of safety in AI writing tasks. The duo also explores future trends in AI development, highlighting the pressure on developers to adapt to new tech landscapes.

May 1, 2025 • 0sec

⚡️The Rise and Fall of the Vector DB Category

Jo Kristian Bergum, a seasoned search infrastructure expert with two decades at Yahoo and Fast Search & Transfer, dives deep into the evolution of vector databases. He discusses the surge in vector database popularity post-ChatGPT and the misconceptions surrounding embedding-based similarity search. The conversation explores the dynamic interplay between traditional search methods and embedding techniques. Additionally, Joe sheds light on the future of retrieval-augmented generation and the importance of knowledge graphs in AI development.

Apr 24, 2025 • 0sec

Why Every Agent needs Open Source Cloud Sandboxes

Vasek Mlejnsky, a visionary from E2B, joins to share insights on building secure cloud sandboxes for AI agents. He discusses the rapid growth of E2B and its adoption by major companies. The conversation dives into the unique challenges posed by early LLMs and the advantages of cloud environments for AI. Vasek highlights practical use cases like code execution and data analysis, while also addressing the shifting landscape of AI frameworks and billing models. His thoughts on future advancements and multi-modality in AI are particularly intriguing.

Apr 15, 2025 • 0sec

⚡️GPT 4.1: The New OpenAI Workhorse

Michelle Pokrass and Josh McGrath from OpenAI dive into the exciting updates of GPT 4.1. They discuss its enhanced coding capabilities and instruction-following features, making it a developer's new best friend. The conversation touches on the innovative Nano model designed for low latency, and they share the fun of naming projects. With insights into pricing and user interaction, they emphasize the significance of community feedback in evolving AI technology. Plus, get ready for the intriguing benefits of multimodal tasks and cutting-edge reasoning enhancements!

Apr 11, 2025 • 0sec

SF Compute: Commoditizing Compute

Evan Conrad shares the riveting journey of SF Compute, revealing how they turned financial struggles into opportunities by selling GPU clusters. The discussion dives into the intriguing dynamics of the GPU market, highlighting the unexpected parallels between GPU finances and real estate models. They'll explore the implications of increasing GPU commoditization, customer pricing sensitivity, and the role of long-term contracts in profitability. Additionally, learn about the innovative branding strategies aimed at promoting calmness in tech, alongside the complexities of email innovation.

Apr 3, 2025 • 0sec

The Creators of Model Context Protocol

In this discussion, David Soria Parra and Justin Spahr-Summers, creators of Anthropic’s Model Context Protocol (MCP), reveal how MCP has swiftly emerged as a leading standard in AI integration, overtaking established protocols in popularity. They share the origin story of MCP, the innovative challenges faced during its development, and the profound impact it has on enhancing communication between AI models. Listeners can also explore exciting prospects of open-source governance, the shift from stateful to stateless server models, and the future of AI functionalities through MCP.

Mar 29, 2025 • 0sec

Unsupervised Learning x Latent Space Crossover Special

Dive into the rapid evolution of AI as experts reflect on the past year's surprises and the race between open-source and closed-source models. Explore the impact of AI builders and the rise of low-code platforms. Delve into the significance of product-market fit and customer support in AI applications. Discover the challenges of innovation and the importance of defensibility in app development. Plus, hear insights on emerging trends and the critical role of community engagement in shaping the future of technology.

Mar 28, 2025 • 1h 38min

The Agent Network — Dharmesh Shah

Dharmesh Shah, co-founder of HubSpot and creator of Agent.ai, shares his insights on the evolving role of AI in workplaces. He introduces the concept of hybrid teams, where humans and AI collaborate as equal members. The conversation also dives into the nuances of AI business models, particularly the difference between Work as a Service (WaaS) and Results as a Service (RaaS), highlighting the complexities of measuring success. Additionally, Dharmesh discusses the technical challenges of implementing AI agents and the innovative future of user interfaces and professional networks for AI.

Mar 14, 2025 • 1h 18min

Building Snipd: The AI Podcast App for Learning

Kevin Smith, Co-founder and CEO of Snipd, shares his journey transitioning from quant finance to AI, discusses their innovative podcast app aimed at improving learning and knowledge retention. The conversation dives into the unique AI features of Snipd, such as transcript searching, interactive note-taking, and speaker identification. Kevin highlights the challenges of competing against industry giants and the potential of AI-driven tools to enhance the podcasting experience. Tune in for insights about the future of digital learning through podcasts!

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

App store banner

Play store banner