
Latent Space: The AI Engineer Podcast
The podcast by and for AI Engineers! In 2024, over 2 million readers and listeners came to Latent Space to hear about news, papers and interviews in Software 3.0. We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. We strive to give you everything from the definitive take on the Current Thing down to the first introduction to the tech you'll be using in the next 3 months! We break news and run exclusive interviews from OpenAI, Anthropic, Gemini, Meta (Soumith Chintala), Sierra (Bret Taylor), tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al. Full show notes always on https://latent.space
Latest episodes

514 snips
Apr 11, 2025 • 1h 12min
SF Compute: Commoditizing Compute
Evan Conrad shares the journey of SF Compute, revealing how the company turned financial struggles into opportunities by selling GPU clusters. The discussion dives into the dynamics of the GPU market, highlighting the unexpected parallels between GPU finances and real estate models. It also explores the implications of increasing GPU commoditization, customer pricing sensitivity, and the role of long-term contracts in profitability. Additionally, learn about the branding strategies aimed at promoting calmness in tech, alongside the complexities of email innovation.

1,672 snips
Apr 3, 2025 • 1h 20min
The Creators of Model Context Protocol
In this discussion, David Soria Parra and Justin Spahr-Summers, creators of Anthropic’s Model Context Protocol (MCP), reveal how MCP has swiftly emerged as a leading standard in AI integration, overtaking established protocols in popularity. They share the origin story of MCP, the challenges faced during its development, and the impact it has on connecting AI models to external tools and data sources. Listeners can also explore the prospects of open-source governance, the shift from stateful to stateless server models, and the future of AI functionalities through MCP.

208 snips
Mar 29, 2025 • 0sec
Unsupervised Learning x Latent Space Crossover Special
Dive into the rapid evolution of AI as experts reflect on the past year's surprises and the race between open-source and closed-source models. Explore the impact of AI builders and the rise of low-code platforms. Delve into the significance of product-market fit and customer support in AI applications. Discover the challenges of innovation and the importance of defensibility in app development. Plus, hear insights on emerging trends and the critical role of community engagement in shaping the future of technology.

2,555 snips
Mar 28, 2025 • 1h 38min
The Agent Network — Dharmesh Shah
Dharmesh Shah, co-founder of HubSpot and creator of Agent.ai, shares his insights on the evolving role of AI in workplaces. He introduces the concept of hybrid teams, where humans and AI collaborate as equal members. The conversation also dives into the nuances of AI business models, particularly the difference between Work as a Service (WaaS) and Results as a Service (RaaS), highlighting the complexities of measuring success. Additionally, Dharmesh discusses the technical challenges of implementing AI agents and the innovative future of user interfaces and professional networks for AI.

3,437 snips
Mar 14, 2025 • 1h 18min
Building Snipd: The AI Podcast App for Learning
Kevin Smith, Co-founder and CEO of Snipd, shares his journey transitioning from quant finance to AI and discusses Snipd's podcast app, which is aimed at improving learning and knowledge retention. The conversation dives into the unique AI features of Snipd, such as transcript searching, interactive note-taking, and speaker identification. Kevin highlights the challenges of competing against industry giants and the potential of AI-driven tools to enhance the podcasting experience. Tune in for insights about the future of digital learning through podcasts!

851 snips
Mar 11, 2025 • 26min
⚡️The new OpenAI Agents Platform
OpenAI is rolling out new tools for developers in 2025. The Responses API offers enhanced flexibility and built-in functionalities for complex tasks. A new Web Search Tool allows real-time data access with inline citations, making searches more effective. Improvements in the OpenAI Agents SDK introduce features like type support and guardrailing for better performance. Plus, the discussion on evolving online language models provides insight into the future of integrated AI capabilities.

243 snips
Mar 4, 2025 • 38min
⚡️How Claude 3.7 Plays Pokémon
David Hershey, an Engineer at Anthropic and creator of Claude Plays Pokémon, shares insights into programming an AI to play Pokémon Red. The project combines nostalgia with innovation, as David built a special harness for real-time gameplay on Twitch. He explains the challenges of reverse engineering in-game data to enhance AI performance and navigation. David also highlights the emotional dynamics involved in AI gameplay and discusses potential collaborations in the gaming industry, hinting at future advancements in AI technology.

390 snips
Feb 28, 2025 • 1h 2min
Open Operator, Serverless Browsers and the Future of Computer-Using Agents
Paul Klein, founder of Browserbase and veteran from Twilio and Mux, discusses groundbreaking advancements in headless browser infrastructure for AI agents. He delves into the challenges of CAPTCHA systems and the future of bot authentication, emphasizing the need for innovative solutions. Klein unveils Stagehand, an open-source framework revolutionizing AI-powered browsing. The conversation also explores the competitive landscape of browser automation and the intriguing implications of multimodal browsing experiences.

783 snips
Feb 18, 2025 • 1h 2min
The Inventors of Deep Research
Aarush Selvan and Mukund Sridhar, key figures in Google's Gemini Deep Research project, share insights on the transformative power of AI in research. They discuss how Gemini serves as a personal research assistant, generating comprehensive reports swiftly. The pair explain the challenges of navigating HTML for AI models and the importance of user interaction in automated planning systems. They also explore the balance between speed and quality in AI outputs, emphasizing collaboration and innovative methodologies in shaping the future of deep research.

259 snips
Feb 13, 2025 • 1h 9min
Bee AI: The Wearable Ambient Agent
Maria de Lourdes Zollo and Ethan Sutin, co-founders of Bee AI, dive into the world of AI wearables, spotlighting their groundbreaking personal AI system designed to enhance daily life. They discuss the evolution of personal AI from apps to sleek wearable devices, sharing insights on the distinct challenges faced in hardware. The duo reveals how their device integrates seamlessly with communication platforms while navigating the complexities of real-time transcription and user data privacy, all while changing the landscape of human interaction.