
Latent Space: The AI Engineer Podcast
The podcast by and for AI Engineers! In 2024, over 2 million readers and listeners came to Latent Space to hear about news, papers and interviews in Software 3.0. We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. We strive to give you everything from the definitive take on the Current Thing to the first introduction to the tech you'll be using in the next 3 months! We break news and publish exclusive interviews from OpenAI, Anthropic, Gemini, Meta (Soumith Chintala), Sierra (Bret Taylor), tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al. Full show notes always on https://latent.space
Latest episodes

131 snips
Oct 5, 2023 • 1h 8min
RAG Is A Hack - with Jerry Liu from LlamaIndex
Join Jerry Liu, the founder of LlamaIndex, as he shares his journey from roles at Uber and Quora to leading the RAG revolution in AI. He delves into the evolution of LlamaIndex, which enables better knowledge access through its innovative tree-index structure. Liu discusses the complexities of fine-tuning large language models and the importance of hands-on experience in optimizing performance. The conversation touches on the integration of structured and unstructured data, as well as the critical challenges in evaluating RAG systems, making it a fascinating listen for AI enthusiasts.

33 snips
Sep 29, 2023 • 1h 21min
Building the Foundation Model Ops Platform — with Raza Habib of Humanloop
Raza Habib, Co-founder and CEO of Humanloop, discusses the evolution of prompt engineering and its essential role in AI development. He highlights the importance of effective prompt evaluations and the various types of human feedback that drive product innovation. Habib also explores the challenges of navigating AI landscapes between Europe and the US, emphasizing the need for practical applications of AI. Additionally, he introduces Humanloop's new free tier, designed to enhance accessibility for startups, and shares insights on achieving product-market fit in the fast-paced AI industry.

23 snips
Sep 20, 2023 • 53min
Heralds of the AI Content Flippening — with Youssef Rizk of Wondercraft.ai
Youssef Rizk, co-founder of Wondercraft AI, dives into the transformative world of AI-generated content. He discusses the development of Wondercraft and its innovative tools that enhance podcast creation. The conversation highlights the growing influence of AI in making academic research accessible and emphasizes the importance of consistent quality in audio storytelling. Rizk also tackles the balance between human creativity and AI's role, revealing how voice characteristics can shape listener engagement in this rapidly evolving field.

51 snips
Sep 14, 2023 • 1h 29min
Doing it the Hard Way: Making the AI engine and language 🔥 of the future — with Chris Lattner of Modular
Chris Lattner, a renowned compiler engineer who created LLVM and Swift, discusses the future of AI development. He dives into why AI software is currently lacking and how his team at Modular is tackling fragmented platforms. He delves into Mojo, a new programming language aimed at enhancing performance and user productivity. Lattner emphasizes the importance of collaboration in AI frameworks and the need for effective AI compiler designs. The conversation also touches on the potential for innovative user interfaces in reshaping AI's public perception.

98 snips
Sep 6, 2023 • 1h 1min
The Point of LangChain — with Harrison Chase of LangChain
Harrison Chase, Founder of LangChain and former sports analytics expert, shares fascinating insights about his journey from sports to machine learning. He discusses LangChain's evolution from a simple prompt templating tool to a comprehensive AI framework that simplifies large language model applications. Harrison also explores the challenges of AI, including managing hallucinations and ensuring observability. The conversation includes exciting news about LangChain Hub and emphasizes the need for effective user experience and collaboration in the fast-paced world of AI development.

15 snips
Aug 30, 2023 • 1h 12min
RWKV: Reinventing RNNs for the Transformer Era — with Eugene Cheah of UIlicious
In this discussion, Eugene Cheah, CTO of UIlicious and a key contributor to the RWKV project, dives into the revolutionary RWKV model. He explains how it sidesteps traditional Transformers, achieving superior efficiency and context handling. The conversation highlights the significance of community-driven AI resources and how RWKV addresses memory limitations in processing large datasets. Cheah also explores the balance between open-source licensing and the use of coding models in enterprise settings, showcasing the global shift in AI technology.

91 snips
Aug 22, 2023 • 59min
Cursor.so: The AI-first Code Editor — with Aman Sanger of Anysphere
Aman Sanger, founder of Abelian AI and Cursor.so, has a rich background in AI and finance, with experience at Google and McKinsey. He discusses the innovative AI-powered code editor, Cursor, which is transforming coding practices. Sanger emphasizes the need for new IDEs to push AI coding efficiency beyond current limits. He delves into the challenges of integrating AI with CAD applications and shares insights on advanced coding techniques using AI models. Throughout, he highlights the evolving landscape of AI in coding and the potential for future advancements.

29 snips
Aug 16, 2023 • 51min
The Mathematics of Training LLMs — with Quentin Anthony of Eleuther AI
Quentin Anthony, a PhD student at Ohio State University and head engineer at EleutherAI, dives into the intricacies of training large language models. He discusses the importance of community knowledge and practical strategies for GPU optimization. Quentin unpacks the mathematics behind compute requirements and addresses the challenges of floating-point operations. He also explores autoregressive modeling techniques, contrasts them with traditional methods, and examines the complexities of optimizing training processes, including the Adam optimizer and model distribution.

13 snips
Aug 10, 2023 • 52min
LLMs Everywhere: Running 70B models in browsers and iPhones using MLC — with Tianqi Chen of CMU / OctoML
Tianqi Chen, an Assistant Professor at CMU and the innovative mind behind XGBoost and Apache TVM, dives into the world of machine learning compilation. He discusses the urgent GPU shortage and explores how to run large language models on devices without needing GPUs at all. Highlights include the groundbreaking ability to execute a 70 billion parameter model in web browsers and advancements in AMD card support, making powerful AI accessible for developers. He also tackles the importance of weight quantization and community collaboration in optimizing machine learning tools.

27 snips
Aug 4, 2023 • 59min
[AI Breakdown] Summer AI Technical Roundup: a Latent Space x AI Breakdown crossover pod!
NLW, a prominent Daily AI podcaster and YouTuber, dives into the latest AI advancements, highlighting the launch of OpenAI's Code Interpreter, seen as a leap toward GPT 4.5. He discusses its unexpected utility beyond coding, the intricate challenges in evaluating AI models, and the competitive landscape, especially with open-source tools like Llama 2. The conversation also touches on the potential of AI companions in personal growth and the evolving role of AI engineers, making it a must-listen for anyone interested in the future of technology.