Latent Space: The AI Engineer Podcast

swyx + Alessio

The podcast by and for AI Engineers! In 2024, over 2 million readers and listeners came to Latent Space to hear about news, papers and interviews in Software 3.0.We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. Striving to give you both the definitive take on the Current Thing down to the first introduction to the tech you'll be using in the next 3 months! We break news and exclusive interviews from OpenAI, Anthropic, Gemini, Meta (Soumith Chintala), Sierra (Bret Taylor), tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al.Full show notes always on https://latent.space

Episodes

Mentioned books

Dec 8, 2023 • 1h 4min

The Busy Person's Intro to Finetuning & Open Source AI - Wing Lian, Axolotl

Wing Lian, founder of the Open Access AI Collective and maintainer of Axolotl, shares his journey from web scraping to fine-tuning AI models. He highlights how community efforts and open-source innovations are transforming the AI landscape. Key discussions include navigating hyperparameter tuning, advanced techniques like LoRa, and the ethical considerations of AI licensing. Wing emphasizes the importance of developer tools and community feedback in refining AI applications, all while advocating for collaborative ventures in the space.

Nov 29, 2023 • 52min

Notebooks = Chat++ and RAG = RecSys! — with Bryan Bischof of Hex Magic

Bryan Bischof, Head of AI at Hex Magic, shares his extensive experience in data science, previously enhancing strategies at companies like Blue Bottle Coffee and Stitch Fix. He discusses the resurgence of notebooks as a user-friendly interface for AI, dubbing them 'Chat++' for their superior editing capabilities. The conversation also dives into the potential of Retrieval-Augmented Generation (RAG) in creating efficient SQL queries, and the importance of innovative design in AI tools to better serve non-technical users.

Nov 17, 2023 • 53min

The State of Silicon and the GPU Poors - with Dylan Patel of SemiAnalysis

Dylan Patel, author of the SemiAnalysis blog, offers insights into the semiconductor industry and GPU dynamics. He discusses the burgeoning divide between the 'GPU rich' and 'GPU poor,' highlighting a significant increase in GPU production forecasted for next year. Patel shares thoughts on AMD's strategic sales and the impact of AI on GPU demand. He also delves into the complexities of GPU utilization for training large models, emphasizing performance optimization and the evolving landscape of semiconductor supply chains.

Nov 8, 2023 • 2h 22min

AGI is Being Achieved Incrementally (DevDay Recap - cleaned audio)

In this discussion, Simon Willison, a software developer and author, joins AI researchers and startup founders to reflect on OpenAI's DevDay announcements. They dive into the exciting advancements like GPT-4 Turbo, the risks of prompt injection, and the shift from plugins to GPTs, highlighting the implications for app development. Reid Robinson shares insights on Zapier's AI actions, while Shreya Rajpal emphasizes the importance of guardrails for LLM applications. The conversation also explores new API features and community-driven innovations in AI.

Nov 8, 2023 • 2h 23min

AGI is Being Achieved Incrementally (OpenAI DevDay w/ Simon Willison, Alex Volkov, Jim Fan, Raza Habib, Shreya Rajpal, Rahul Ligma, et al)

Joining the conversation are Simon Willison, a software engineer known for his expertise in Python, and Alex Volkov, an AI researcher from Weights & Biases. They delve into the latest from OpenAI’s DevDay, breaking down the new features like GPT-4 Turbo and stateful APIs. Raza Habib discusses the real-world implications of foundation models, while Surya Dantuluri shares insights on evolving chatbot functionalities. Reid Robinson touches on Zapier's AI integration, and the group examines innovative applications and the future of multimodal capabilities in AI.

Nov 3, 2023 • 1h 7min

Beating GPT-4 with Open Source LLMs — with Michael Royzen of Phind

In this discussion, Michael Royzen, the founder of Phind, shares his journey from computer vision to creating an innovative search engine for programmers. He discusses Phind's unique approach to answering technical questions and helping users implement solutions. The conversation dives into the evolution of AI in programming, including the impact of GPT-4 and the competition between open-source models. Michael also highlights the importance of user engagement and how his team navigates growth and branding challenges in a fast-paced tech landscape.

Oct 26, 2023 • 39min

Powering your Copilot for Data – with Artem Keydunov of Cube.dev

Artem Keydunov, co-founder of Cube.dev, discusses the evolution of natural language understanding in data analytics. He shares insights on transforming StatsBot into Cube, a semantic layer for better data querying. Artem delves into the challenges faced by AI in accurately interpreting SQL queries, emphasizing the risks of hallucinations in AI-generated SQL. He also highlights the necessity of a semantic layer in business intelligence, showcasing how AI tools can become invaluable co-pilots for data-driven decision-making.

Oct 19, 2023 • 1h 9min

The End of Finetuning — with Jeremy Howard of Fast.ai

Jeremy Howard, co-creator of Fast.ai and a leading voice in machine learning, shares his journey from skepticism to success in AI. He discusses the groundbreaking ULMFiT approach to fine-tuning language models and how it faced initial resistance despite its effectiveness. Howard emphasizes the importance of democratizing AI, creating accessible tools, and fostering community engagement. He also explores the evolution of training dynamics in language models and the power of technology to empower diverse communities, advocating for open-source initiatives.

Oct 14, 2023 • 1h 5min

Why AI Agents Don't Work (yet) - with Kanjun Qiu of Imbue

Kanjun Qiu, co-founder of Imbue, shares insights on building capable AI agents and the challenges they face. He discusses the pitfalls of relying solely on reinforcement learning, revealing why it struggles with higher-level reasoning. The conversation covers innovative projects like Avalon, aimed at improving AI training environments, and the importance of high-quality data. Kanjun also emphasizes the need for creative company cultures and community-driven innovation to advance AI technology responsibly and effectively.

Oct 8, 2023 • 1h 30min

[AIE Summit Preview #2] The AI Horcrux — Swyx on Cognitive Revolution

Swyx, an AI engineer and host of the Cognitive Revolution podcast, joins Nathan LeBenz to discuss the evolving landscape of AI engineering. They highlight the growing demand for AI engineers and share insights on essential skills, tools, and the hiring landscape. Skepticism among software engineers and the transformative potential of AI tools like GPT-4 are also explored. With a preview of the upcoming AI Engineer Summit, they emphasize the importance of community and ongoing education in navigating the fast-paced AI world.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

App store banner

Play store banner