

Latent Space: The AI Engineer Podcast
swyx + Alessio
The podcast by and for AI Engineers! In 2024, over 2 million readers and listeners came to Latent Space to hear about news, papers and interviews in Software 3.0. We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. Striving to give you everything from the definitive take on the Current Thing down to the first introduction to the tech you'll be using in the next 3 months! We break news and exclusive interviews from OpenAI, Anthropic, Gemini, Meta (Soumith Chintala), Sierra (Bret Taylor), tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al. Full show notes always on https://latent.space
Episodes

103 snips
Aug 22, 2023 • 59min
Cursor.so: The AI-first Code Editor — with Aman Sanger of Anysphere
Aman Sanger, founder of Abelian AI and Cursor.so, has a rich background in AI and finance, with experience at Google and McKinsey. He discusses the innovative AI-powered code editor, Cursor, which is transforming coding practices. Sanger emphasizes the need for new IDEs to push AI coding efficiency beyond current limits. He delves into the challenges of integrating AI with CAD applications and shares insights on advanced coding techniques using AI models. Throughout, he highlights the evolving landscape of AI in coding and the potential for future advancements.

33 snips
Aug 16, 2023 • 51min
The Mathematics of Training LLMs — with Quentin Anthony of EleutherAI
Quentin Anthony, a PhD student at Ohio State University and head engineer at EleutherAI, dives into the intricacies of training large language models. He discusses the importance of community knowledge and practical strategies for GPU optimization. Quentin unpacks the mathematics behind compute requirements and addresses the challenges of floating-point operations. He also explores autoregressive modeling techniques, contrasts them with traditional approaches, and examines the complexities of optimizing training, including the Adam optimizer and distributing models across GPUs.
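For a flavor of the back-of-the-envelope math the episode covers, here is a minimal sketch using the common C ≈ 6·N·D approximation (roughly 6 FLOPs per parameter per training token); the model size, token count, and utilization figures below are hypothetical placeholders, not numbers from the episode.

```python
# Back-of-the-envelope training compute using the common C ~= 6 * N * D
# approximation (about 6 FLOPs per parameter per training token).
# Illustrative numbers only -- not figures from the episode.

def training_flops(n_params: float, n_tokens: float) -> float:
    """Approximate total training FLOPs for a dense transformer."""
    return 6.0 * n_params * n_tokens


def gpu_days(total_flops: float, peak_flops_per_gpu: float, mfu: float = 0.4) -> float:
    """Convert total FLOPs into GPU-days at a given model FLOPs utilization (MFU)."""
    seconds = total_flops / (peak_flops_per_gpu * mfu)
    return seconds / 86_400


if __name__ == "__main__":
    # Hypothetical: a 7B-parameter model trained on 1T tokens on A100s (~312 TFLOP/s BF16).
    flops = training_flops(7e9, 1e12)
    print(f"total compute: {flops:.2e} FLOPs")
    print(f"roughly {gpu_days(flops, 312e12):,.0f} A100-days at 40% MFU")
```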

13 snips
Aug 10, 2023 • 52min
LLMs Everywhere: Running 70B models in browsers and iPhones using MLC — with Tianqi Chen of CMU / OctoML
Tianqi Chen, an Assistant Professor at CMU and the innovative mind behind XGBoost and Apache TVM, dives into the world of machine learning compilation. He discusses the urgent GPU shortage and explores how to run large language models locally on consumer hardware rather than data-center GPUs. Highlights include the groundbreaking ability to execute a 70 billion parameter model in web browsers and advancements in AMD card support, making powerful AI accessible for developers. They also tackle the importance of weight quantization and community collaboration in optimizing machine learning tools.
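As a rough illustration of the weight quantization idea mentioned above, here is a minimal group-wise 4-bit quantization sketch in NumPy; it shows the general technique for shrinking weights to fit consumer devices, not MLC/TVM's actual implementation, and the group size is an arbitrary choice.

```python
# Minimal sketch of group-wise 4-bit weight quantization, the general idea
# behind shrinking LLM weights to fit consumer devices. Illustrative only --
# not MLC/TVM's actual implementation.
import numpy as np


def quantize_int4(weights: np.ndarray, group_size: int = 128):
    """Quantize a flat float array to 4-bit integers with one scale per group."""
    w = weights.reshape(-1, group_size)
    # One scale per group, chosen so values map into the int4 range; clamp avoids divide-by-zero.
    scales = np.maximum(np.abs(w).max(axis=1, keepdims=True), 1e-8) / 7.0
    q = np.clip(np.round(w / scales), -8, 7).astype(np.int8)
    return q, scales


def dequantize_int4(q: np.ndarray, scales: np.ndarray) -> np.ndarray:
    """Recover an approximate float array from the quantized values."""
    return (q.astype(np.float32) * scales).reshape(-1)


if __name__ == "__main__":
    w = np.random.randn(1024).astype(np.float32)
    q, s = quantize_int4(w)
    err = np.abs(w - dequantize_int4(q, s)).mean()
    print(f"mean absolute quantization error: {err:.4f}")
```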

27 snips
Aug 4, 2023 • 59min
[AI Breakdown] Summer AI Technical Roundup: a Latent Space x AI Breakdown crossover pod!
NLW, a prominent Daily AI podcaster and YouTuber, dives into the latest AI advancements, highlighting the launch of OpenAI's Code Interpreter, seen as a leap toward GPT 4.5. He discusses its unexpected utility beyond coding, the intricate challenges in evaluating AI models, and the competitive landscape, especially with open-source tools like Llama 2. The conversation also touches on the potential of AI companions in personal growth and the evolving role of AI engineers, making it a must-listen for anyone interested in the future of technology.

64 snips
Jul 26, 2023 • 55min
FlashAttention 2: making Transformers 800% faster w/o approximation - with Tri Dao of Together AI
Tri Dao, a recent Stanford PhD grad and Chief Scientist at Together AI, discusses his groundbreaking work on FlashAttention-2, speeding up transformer training and inference without approximation. He explains how FlashAttention improves efficiency by cutting attention's memory footprint from quadratic to linear in sequence length. The conversation also touches on the importance of memory architecture in GPU performance and the balance of traditional techniques with modern AI innovations. Lastly, Tri reflects on the dynamic landscape of AI research and the rise of open-source contributions in the field.
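To make the quadratic-versus-linear point concrete, here is a small sketch comparing the memory needed to materialize the full attention score matrix against storing only the Q/K/V/output activations, as a tiled approach like FlashAttention does; the head count and dimensions are hypothetical, not figures from the episode.

```python
# Rough illustration of attention memory scaling: materializing the full
# (seq_len x seq_len) score matrix grows quadratically with sequence length,
# while a tiled/streaming approach like FlashAttention only keeps activations
# that grow linearly. Head count and dimension are hypothetical placeholders.

def naive_attention_bytes(seq_len: int, n_heads: int = 32, dtype_bytes: int = 2) -> int:
    """Memory to store the full attention score matrix for every head."""
    return n_heads * seq_len * seq_len * dtype_bytes


def tiled_attention_bytes(seq_len: int, n_heads: int = 32, head_dim: int = 128,
                          dtype_bytes: int = 2) -> int:
    """Memory for Q, K, V and output activations only; the score matrix is never stored."""
    return 4 * n_heads * seq_len * head_dim * dtype_bytes


if __name__ == "__main__":
    for n in (2_048, 8_192, 32_768):
        print(f"seq_len={n:>6}: naive ~{naive_attention_bytes(n) / 2**30:6.2f} GiB, "
              f"tiled ~{tiled_attention_bytes(n) / 2**30:5.2f} GiB")
```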

51 snips
Jul 19, 2023 • 1h 20min
Llama 2: The New Open LLM SOTA (ft. Nathan Lambert, Matt Bornstein, Anton Troynikov, Russell Kaplan, Whole Mars Catalog et al.)
In this discussion, guests Nathan Lambert, a machine learning researcher at Hugging Face, and Matt Bornstein from a16z, share insights on the revolutionary Llama 2 model. They explore its technical advancements, including improved context length and its arrival as a strong competitor in the open LLM landscape. Ethical concerns surrounding open-source AI, data sourcing, and user privacy come into play. The conversation highlights the potential for democratizing AI and the importance of having control over sensitive data, pivotal for businesses and organizations.

108 snips
Jul 17, 2023 • 1h 1min
AI Fundamentals: Datasets 101
The discussion kicks off with the crucial role of datasets in AI training, debunking the myth that models like GPT-3 use the entire internet for data. It emphasizes the immense effort required for quality data selection and the evolution of training methods. Key examples like Common Crawl and debates around data quality versus quantity are highlighted. Ethical concerns regarding copyright and licensing for datasets are also explored, while the importance of deduplication and data curation is underscored to enhance model accuracy.
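As a minimal sketch of the deduplication step discussed, here is exact document-level dedup via content hashing; production pipelines typically layer fuzzy matching (e.g. MinHash) on top, and the whitespace/case normalization shown is just an illustrative choice.

```python
# Minimal sketch of exact document deduplication via content hashing, one of
# the curation steps the episode covers. Real pipelines usually add fuzzy
# matching (e.g. MinHash) on top; this shows only the illustrative core.
import hashlib


def dedupe(documents):
    """Keep the first occurrence of each distinct document after whitespace/case normalization."""
    seen = set()
    unique = []
    for doc in documents:
        key = hashlib.sha256(" ".join(doc.lower().split()).encode("utf-8")).hexdigest()
        if key not in seen:
            seen.add(key)
            unique.append(doc)
    return unique


if __name__ == "__main__":
    docs = ["The cat sat.", "the  CAT sat.", "A dog barked."]
    print(dedupe(docs))  # -> ['The cat sat.', 'A dog barked.']
```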

120 snips
Jul 10, 2023 • 2h 4min
Code Interpreter == GPT 4.5 (w/ Simon Willison, Alex Volkov, Aravind Srinivas, Alex Graveley, et al.)
In this engaging discussion, experienced developer Simon Willison, AI researcher Alex Volkov, and Perplexity founder Aravind Srinivas explore the groundbreaking capabilities of the new Code Interpreter. They reveal its potential for data analysis, video editing, and refactoring tasks while addressing significant limitations and security concerns. The conversation highlights exciting applications, including sentiment analysis and game development feedback, showcasing how AI tools can optimize coding efficiency and enhance user creativity in programming.

35 snips
Jul 2, 2023 • 1h
[Practical AI] AI Trends: a Latent Space x Practical AI crossover pod!
In this engaging discussion, Dan Whitenack, a data scientist with a PhD in mathematical and computational physics and co-host of Practical AI, dives into the evolution of AI and podcasting. He shares personal anecdotes about both shows' journeys, favorite episodes, and the importance of understanding AI's historical context. The conversation shifts to implementing AI in low-resource settings, the creation of Prediction Guard, and the critical role of user experience in AI application adoption. With insights on the unique challenges faced by both engineers and data scientists, it's a lively exploration of today's AI landscape.

46 snips
Jul 1, 2023 • 2h 5min
[Cognitive Revolution] The Tiny Model Revolution with Ronen Eldan and Yuanzhi Li of Microsoft Research
Join Ronen Eldan and Yuanzhi Li from Microsoft Research as they dive into the fascinating world of tiny language models. Learn how their TinyStories project showcases these models' surprising storytelling abilities while prioritizing data quality over sheer size. The duo discusses new training methods that mimic human language learning and explores the emergence of reasoning skills in AI. Discover the creative challenges of generating diverse narratives for young audiences and how understanding these small models can reshape the future of AI.