Latent Space: The AI Engineer Podcast

swyx + Alessio

The podcast by and for AI Engineers! In 2024, over 2 million readers and listeners came to Latent Space to hear about news, papers and interviews in Software 3.0.We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. Striving to give you both the definitive take on the Current Thing down to the first introduction to the tech you'll be using in the next 3 months! We break news and exclusive interviews from OpenAI, Anthropic, Gemini, Meta (Soumith Chintala), Sierra (Bret Taylor), tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al.Full show notes always on https://latent.space

Episodes

Mentioned books

Feb 1, 2024 • 58min

Why StackOverflow usage is down 50% — with David Hsu of Retool

David Hsu, CEO and co-founder of Retool, dives into the intriguing intersection of philosophy and computer science. He shares insights on the decline of StackOverflow usage and how startups can thrive with authenticity over flash. The discussion includes the skepticism surrounding AI's impact on job functions, the challenges of integrating AI in businesses, and the evolution of AI models. Hsu emphasizes a developer-first approach while making fascinating analogies between ant colonies and AI evolution, shedding light on the quest for artificial general intelligence.

Jan 25, 2024 • 1h 8min

The Four Wars of the AI Stack (Dec 2023 Audio Recap)

The discussion dives into the four critical battles shaping the AI landscape: data quality, GPU resources, multimodal capabilities, and operational wars. They explore the role of synthetic data and the complexities of talent acquisition in AI. Insights on evolving AI architectures, like Mistral's potential disruption, highlight the shift towards generalization. The podcast also touches on the transformative power of vector databases and the ethics of integrating AI technologies, making it a must-listen for tech enthusiasts.

Jan 19, 2024 • 1h 12min

How to train your own Large Multimodal Model — with Hugo Laurençon & Leo Tronchon of HuggingFace M4

Hugo Laurençon and Leo Tronchon from Hugging Face discuss their cutting-edge work on multimodal models like IDEFICS and OBELICS. They dive into the evolution of multimodal training, sharing challenges related to data quality and the intricacies of processing raw HTML. The conversation highlights the importance of image resolution for OCR and the hurdles faced in video data processing. Both researchers express optimism for open-source models, aiming for enhanced performance while tackling issues like hallucinations. Their insights reveal a bright future for multimodal AI innovation.

Jan 11, 2024 • 1h 26min

RLHF 201 - with Nathan Lambert of AI2 and Interconnects

Nathan Lambert, a research scientist at the Allen Institute for AI and former leader of the RLHF team at Hugging Face, shares his insights on the evolution of Reinforcement Learning from Human Feedback (RLHF). He discusses its significance in enhancing language models, including preference modeling and innovative methods like Direct Preference Optimization. The conversation touches on the challenges of model training, the financial implications of AI methodologies, and the importance of effective communication in simplifying complex AI concepts for broader audiences.

Jan 5, 2024 • 1h 4min

The Accidental AI Canvas - with Steve Ruiz of tldraw

Steve Ruiz, founder of tldraw, discusses his unique journey from fine arts to tech, blending creativity with design. He shares his innovative work on collaborative tools and the 'perfect freehand' feature, enhancing user experiences. The conversation dives into AI's role in UI creation and software development, along with real-time updates in collaborative drawing. Ruiz also explores the balance between passion and strategic planning in entrepreneurship, leaving listeners inspired by his insights into the future of AI and creativity.

Dec 30, 2023 • 2h 42min

NeurIPS 2023 Recap — Top Startups

In this dynamic discussion, Jonathan Frankle, Chief Scientist at MosaicML, shares insights on their $1.3 billion acquisition by Databricks. Lin Qiao, CEO of Fireworks AI, talks about optimizing PyTorch for inference. Aman Sanger from Cursor reveals innovative memory strategies for AI coding. Aravind Srinivas discusses the impressive growth of Perplexity AI, hitting 1 million installs, while Jeremy Howard emphasizes the need for accessible AI. Together, they explore the vibrant AI startup landscape showcased at NeurIPS 2023, reflecting on innovation, collaboration, and the future of technology.

Dec 23, 2023 • 3h 20min

NeurIPS 2023 Recap — Best Papers

Hosts recap the NeurIPS 2023 conference, discussing best papers and influential topics such as direct preference optimization for language models, scaling data constraint language models, developing a visual intelligent assistant, understanding bunny boxes with GPT4, and using Tool Former to improve language models. They also explore using GPT-4 to play Minecraft, evaluating cognitive capacities through diverse tasks, analyzing language models' performance in planning tasks, and the impact of foundation models on AI systems.

Dec 20, 2023 • 59min

The AI-First Graphics Editor - with Suhail Doshi of Playground AI

Suhail Doshi, CEO and co-founder of Playground AI and former Mixpanel chief, dives into the fascinating world of AI-driven graphics editing. He shares insights on the evolution of image generation tools, highlighting real-time preview rendering and style filtering features. The discussion highlights community feedback's critical role in shaping user-friendly tools. Doshi also touches on the ethical implications of AI-generated content and the importance of democratizing digital art for non-designers, unlocking creativity for all.

Dec 14, 2023 • 1h 20min

The "Normsky" architecture for AI coding agents — with Beyang Liu + Steve Yegge of SourceGraph

In this engaging discussion, Beyang Liu, co-founder of Sourcegraph, and Steve Yegge, VP of Engineering at Sourcegraph, share their journey from Google to revolutionizing code search. They explore the creation of 'Cody', an AI coding assistant, and the challenges of automating software development. The duo also delves into the historical debate between Chomsky and Norvig, discussing how their innovative 'normsky' architecture enhances coding efficiency. With insights on the future of open-source AI and the importance of context in coding, this masterclass is a treasure trove for tech enthusiasts.

Dec 8, 2023 • 1h 4min

The Busy Person's Intro to Finetuning & Open Source AI - Wing Lian, Axolotl

Wing Lian, founder of the Open Access AI Collective and maintainer of Axolotl, shares his journey from web scraping to fine-tuning AI models. He highlights how community efforts and open-source innovations are transforming the AI landscape. Key discussions include navigating hyperparameter tuning, advanced techniques like LoRa, and the ethical considerations of AI licensing. Wing emphasizes the importance of developer tools and community feedback in refining AI applications, all while advocating for collaborative ventures in the space.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

App store banner

Play store banner