Latent Space: The AI Engineer Podcast

swyx + Alessio

The podcast by and for AI Engineers! In 2024, over 2 million readers and listeners came to Latent Space to hear about news, papers and interviews in Software 3.0. We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. We strive to give you everything from the definitive take on the Current Thing to the first introduction to the tech you'll be using in the next 3 months! We break news and publish exclusive interviews from OpenAI, Anthropic, Gemini, Meta (Soumith Chintala), Sierra (Bret Taylor), tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al. Full show notes always on https://latent.space

Episodes

283 snips

Oct 25, 2024 • 1h 14min

How NotebookLM Was Made

Raiza Martin, Lead PM for NotebookLM at Google Labs, and Usama Bin Shafqat, AI engineer on the same project, share their insights on creating engaging AI-driven audio experiences. They discuss the unique approach of making digital voices sound conversational, incorporating natural interjections and pauses. The duo highlights the importance of user feedback in shaping their innovative product and the challenges of balancing advanced AI capabilities with accessibility for everyday users. Their work aims to redefine how we interact with information through AI.

40 snips

Oct 19, 2024 • 57min

Building the AI Engineer Nation — with Josephine Teo, Minister of Digital Development and Information, Singapore

In this engaging discussion, Josephine Teo, Singapore's Minister of Digital Development and Information, dives into the country's groundbreaking AI strategy. She emphasizes the balance between innovation and governance while exploring the societal impact of digital development. Teo sheds light on the importance of coding literacy and continuous education to prepare the workforce for tech transitions. The conversation also touches on Singapore's healthcare innovation landscape and the need for collaborations to nurture a vibrant AI ecosystem.

104 snips

Oct 18, 2024 • 1h 12min

From API to AGI: Structured Outputs, OpenAI API platform and O1 Q&A — with Michelle Pokrass & OpenAI Devrel + Strawberry team

Michelle Pokrass leads the API Platform at OpenAI and has a background in building scalable platforms at leading tech companies. The discussion delves into the significance of structured outputs in AI, emphasizing their reliability for developers. The conversation covers everything from the latest advancements in OpenAI's capabilities, including the O1 model, to navigating database challenges and enhancing user experience through innovations in OpenAI's APIs. It also touches on the complexities of cognitive biases in decision-making related to AI.

8 snips

Sep 3, 2024 • 1h 5min

Efficiency is Coming: 3000x Faster, Cheaper, Better AI Inference from Hardware Improvements, Quantization, and Synthetic Data Distillation

Nyla Worker, a Senior PM at Nvidia with a background in optimizing AI models at Google and eBay, shares insights on dramatic advancements in AI efficiency and inference. The discussion highlights a staggering reduction in costs and time for training models, with examples like the Cerebras platform achieving unheard-of speeds. They delve into optimizing large language models and the revolutionary potential of 3D conversational AI technology. Worker also touches on the future of digital personas and their applications in various sectors, including healthcare.

81 snips

Aug 29, 2024 • 1h 10min

Why you should write your own LLM benchmarks — with Nicholas Carlini, Google DeepMind

Nicholas Carlini, a research scientist at DeepMind specializing in AI security, discusses the power of personalized LLM benchmarks. He encourages focusing on individual use of AI tools, emphasizing that AI shines in automating mundane tasks. Carlini shares insights from his viral blog, detailing creative applications of AI in coding and problem-solving. He also navigates the dualities of LLMs, the importance of critical evaluation, and the ongoing need for robust, domain-specific benchmarks to truly gauge AI performance.

Building the Silicon Brain - with Drew Houston of Dropbox

Production AI Engineering starts with Evals — with Ankur Goyal of Braintrust

Building AGI in Real Time (OpenAI Dev Day 2024)

Language Agents: From Reasoning to Acting

The Ultimate Guide to Prompting