
Latent Space: The AI Engineer Podcast
The podcast by and for AI Engineers! In 2024, over 2 million readers and listeners came to Latent Space to hear about news, papers and interviews in Software 3.0.We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. Striving to give you both the definitive take on the Current Thing down to the first introduction to the tech you'll be using in the next 3 months! We break news and exclusive interviews from OpenAI, Anthropic, Gemini, Meta (Soumith Chintala), Sierra (Bret Taylor), tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al.Full show notes always on https://latent.space
Latest episodes

70 snips
Nov 29, 2023 • 52min
Notebooks = Chat++ and RAG = RecSys! — with Bryan Bischof of Hex Magic
Bryan Bischof, Head of AI at Hex Magic, shares his extensive experience in data science, previously enhancing strategies at companies like Blue Bottle Coffee and Stitch Fix. He discusses the resurgence of notebooks as a user-friendly interface for AI, dubbing them 'Chat++' for their superior editing capabilities. The conversation also dives into the potential of Retrieval-Augmented Generation (RAG) in creating efficient SQL queries, and the importance of innovative design in AI tools to better serve non-technical users.

49 snips
Nov 17, 2023 • 53min
The State of Silicon and the GPU Poors - with Dylan Patel of SemiAnalysis
Dylan Patel, author of the SemiAnalysis blog, offers insights into the semiconductor industry and GPU dynamics. He discusses the burgeoning divide between the 'GPU rich' and 'GPU poor,' highlighting a significant increase in GPU production forecasted for next year. Patel shares thoughts on AMD's strategic sales and the impact of AI on GPU demand. He also delves into the complexities of GPU utilization for training large models, emphasizing performance optimization and the evolving landscape of semiconductor supply chains.

25 snips
Nov 8, 2023 • 2h 22min
AGI is Being Achieved Incrementally (DevDay Recap - cleaned audio)
In this discussion, Simon Willison, a software developer and author, joins AI researchers and startup founders to reflect on OpenAI's DevDay announcements. They dive into the exciting advancements like GPT-4 Turbo, the risks of prompt injection, and the shift from plugins to GPTs, highlighting the implications for app development. Reid Robinson shares insights on Zapier's AI actions, while Shreya Rajpal emphasizes the importance of guardrails for LLM applications. The conversation also explores new API features and community-driven innovations in AI.

69 snips
Nov 8, 2023 • 2h 23min
AGI is Being Achieved Incrementally (OpenAI DevDay w/ Simon Willison, Alex Volkov, Jim Fan, Raza Habib, Shreya Rajpal, Rahul Ligma, et al)
Joining the conversation are Simon Willison, a software engineer known for his expertise in Python, and Alex Volkov, an AI researcher from Weights & Biases. They delve into the latest from OpenAI’s DevDay, breaking down the new features like GPT-4 Turbo and stateful APIs. Raza Habib discusses the real-world implications of foundation models, while Surya Dantuluri shares insights on evolving chatbot functionalities. Reid Robinson touches on Zapier's AI integration, and the group examines innovative applications and the future of multimodal capabilities in AI.

47 snips
Nov 3, 2023 • 1h 7min
Beating GPT-4 with Open Source LLMs — with Michael Royzen of Phind
In this discussion, Michael Royzen, the founder of Phind, shares his journey from computer vision to creating an innovative search engine for programmers. He discusses Phind's unique approach to answering technical questions and helping users implement solutions. The conversation dives into the evolution of AI in programming, including the impact of GPT-4 and the competition between open-source models. Michael also highlights the importance of user engagement and how his team navigates growth and branding challenges in a fast-paced tech landscape.

18 snips
Oct 26, 2023 • 39min
Powering your Copilot for Data – with Artem Keydunov of Cube.dev
Artem Keydunov, co-founder of Cube.dev, discusses the evolution of natural language understanding in data analytics. He shares insights on transforming StatsBot into Cube, a semantic layer for better data querying. Artem delves into the challenges faced by AI in accurately interpreting SQL queries, emphasizing the risks of hallucinations in AI-generated SQL. He also highlights the necessity of a semantic layer in business intelligence, showcasing how AI tools can become invaluable co-pilots for data-driven decision-making.

102 snips
Oct 19, 2023 • 1h 9min
The End of Finetuning — with Jeremy Howard of Fast.ai
Jeremy Howard, co-creator of Fast.ai and a leading voice in machine learning, shares his journey from skepticism to success in AI. He discusses the groundbreaking ULMFiT approach to fine-tuning language models and how it faced initial resistance despite its effectiveness. Howard emphasizes the importance of democratizing AI, creating accessible tools, and fostering community engagement. He also explores the evolution of training dynamics in language models and the power of technology to empower diverse communities, advocating for open-source initiatives.

92 snips
Oct 14, 2023 • 1h 5min
Why AI Agents Don't Work (yet) - with Kanjun Qiu of Imbue
Kanjun Qiu, co-founder of Imbue, shares insights on building capable AI agents and the challenges they face. He discusses the pitfalls of relying solely on reinforcement learning, revealing why it struggles with higher-level reasoning. The conversation covers innovative projects like Avalon, aimed at improving AI training environments, and the importance of high-quality data. Kanjun also emphasizes the need for creative company cultures and community-driven innovation to advance AI technology responsibly and effectively.

37 snips
Oct 8, 2023 • 1h 30min
[AIE Summit Preview #2] The AI Horcrux — Swyx on Cognitive Revolution
Swyx, an AI engineer and host of the Cognitive Revolution podcast, joins Nathan LeBenz to discuss the evolving landscape of AI engineering. They highlight the growing demand for AI engineers and share insights on essential skills, tools, and the hiring landscape. Skepticism among software engineers and the transformative potential of AI tools like GPT-4 are also explored. With a preview of the upcoming AI Engineer Summit, they emphasize the importance of community and ongoing education in navigating the fast-paced AI world.

27 snips
Oct 7, 2023 • 39min
[AIE Summit Preview #1] Swyx on Software 3.0 and the Rise of the AI Engineer
In this insightful conversation, Swyx, an AI expert and keynote speaker at the AI Engineer Summit, delves into the evolution of Software 3.0. He explains how this new paradigm leverages foundation models like GPT-3, eliminating the need for traditional data labeling. Swyx also highlights the burgeoning role of AI engineers and the importance of understanding model architecture. The discussion touches on OpenAI's innovations, the debate over true intelligence in language models, and the necessity of practical solutions in the AI landscape.