Latent Space: The AI Engineer Podcast

swyx + Alessio
undefined
877 snips
Mar 11, 2025 • 26min

⚡️The new OpenAI Agents Platform

OpenAI is making strides toward 2025 with exciting new tools for developers. The Responses API offers enhanced flexibility and built-in functionalities for complex tasks. A groundbreaking Web Search Tool allows real-time data access with inline citations, making searches more effective. Improvements in the OpenAI Agents SDK introduce vital features like type support and guardrailing for better performance. Plus, the discussion on evolving online language models provides insight into the future of integrated AI capabilities.
undefined
254 snips
Mar 4, 2025 • 38min

⚡️How Claude 3.7 Plays Pokémon

David Hershey, an Engineer at Anthropic and creator of Claude Plays Pokémon, shares insights into programming an AI to play Pokémon Red. The project combines nostalgia with innovation, as David built a special harness for real-time gameplay on Twitch. He explains the challenges of reverse engineering in-game data to enhance AI performance and navigation. David also highlights the emotional dynamics involved in AI gameplay and discusses potential collaborations in the gaming industry, hinting at future advancements in AI technology.
undefined
414 snips
Feb 28, 2025 • 1h 2min

Open Operator, Serverless Browsers and the Future of Computer-Using Agents

Paul Klein, founder of Browserbase and veteran from Twilio and Mux, discusses groundbreaking advancements in headless browser infrastructure for AI agents. He delves into the challenges of CAPTCHA systems and the future of bot authentication, emphasizing the need for innovative solutions. Klein unveils Stagehand, an open-source framework revolutionizing AI-powered browsing. The conversation also explores the competitive landscape of browser automation and the intriguing implications of multimodal browsing experiences.
undefined
830 snips
Feb 18, 2025 • 1h 2min

The Inventors of Deep Research

Aarush Selvan and Mukund Sridhar, key figures in Google's Gemini Deep Research project, share insights on the transformative power of AI in research. They discuss how Gemini serves as a personal research assistant, generating comprehensive reports swiftly. The pair explain the challenges of navigating HTML for AI models and the importance of user interaction in automated planning systems. They also explore the balance between speed and quality in AI outputs, emphasizing collaboration and innovative methodologies in shaping the future of deep research.
undefined
274 snips
Feb 13, 2025 • 1h 9min

Bee AI: The Wearable Ambient Agent

Maria de Lourdes Zollo and Ethan Sutin, co-founders of Bee AI, dive into the world of AI wearables, spotlighting their groundbreaking personal AI system designed to enhance daily life. They discuss the evolution of personal AI from apps to sleek wearable devices, sharing insights on the distinct challenges faced in hardware. The duo reveals how their device integrates seamlessly with communication platforms while navigating the complexities of real-time transcription and user data privacy, all while changing the landscape of human interaction.
undefined
1,043 snips
Feb 11, 2025 • 1h 36min

The AI Architect — Bret Taylor

Bret Taylor, co-founder and CEO of Sierra and former co-CEO of Salesforce, shares his remarkable journey from engineering at Stanford to pivotal roles in creating Google Maps. He discusses the emerging role of AI architects in companies, emphasizing their importance in managing AI agents. The conversation also covers the transformative impact of OpenAI and ChatGPT, as well as insights on balancing impactful work with personal fulfillment. Taylor’s anecdotes reflect a blend of technical innovation and personal passion, making for a captivating exploration of AI's future.
undefined
315 snips
Feb 6, 2025 • 1h 4min

Agent Engineering with Pydantic + Graphs — with Samuel Colvin

Samuel Colvin, the creator of Pydantic and Logfire, discusses the evolution of Pydantic as a critical tool in AI engineering. He reveals its staggering monthly downloads and integration with OpenAI. Colvin also explores the innovative use of graphs in agent engineering, emphasizing their importance for control and observability. Furthermore, he shares insights on the challenges of integrating AI models and the quest for adaptable APIs in observability. He also introduces Pydantic.run as a resource for better user experiences.
undefined
337 snips
Feb 1, 2025 • 1h 9min

The Agent Reasoning Interface: o1/o3, Claude 3, ChatGPT Canvas, Tasks, and Operator — with Karina Nguyen of OpenAI

Karina Nguyen, Research Manager and Post-training lead at OpenAI, shares her impressive journey from working with Claude at Anthropic to creating innovative tools like ChatGPT Canvas. She dives into enhancing human-computer interaction and the collaborative spirit that drives her team. The conversation highlights the importance of user feedback, the complexity of AI behavior, and innovative approaches to improve writing through AI. Karina also discusses the need for trust in AI interactions and the cultural dynamics between AI organizations.
undefined
219 snips
Jan 26, 2025 • 1h 16min

Outlasting Noam Shazeer, crowdsourcing Chat + AI with >1.4m DAU, and becoming the "Western DeepSeek" — with William Beauchamp, Chai Research

William Beauchamp, Founder of Chai Research and ex-hedge fund trader, shares insights on building a successful consumer AI company with just 11 people. He highlights their impressive growth, reaching 1.4 million daily active users and over $22 million in revenue. The conversation explores the innovative Chaiverse platform that drastically cuts A/B testing time and enhances user engagement through unique features. Beauchamp also discusses the challenges of data sharing, compliance, and the importance of user feedback in refining AI models.
undefined
182 snips
Jan 19, 2025 • 1h

Everything you need to run Mission Critical Inference (ft. DeepSeek v3 + SGLang)

Join Amir Haghighat, co-founder of Baseten, and Yineng Zhang, lead software engineer at Baseten, as they dive into the groundbreaking DeepSeek v3 model. This model boasts 671 billion parameters and has shaken up LLM inference platforms. They unravel the complexities of deploying massive models, discuss the innovations of SGLang, and delve into the challenges of caching technologies. With insights on optimizing AI workflows and a clear manifesto for crucial applications, this conversation is a must-listen for AI enthusiasts!

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app