Latent Space: The AI Engineer Podcast — Practitioners talking LLMs, CodeGen, Agents, Multimodality, AI UX, GPU Infra and all things Software 3.0

Latest episodes

Sep 13, 2024 • 2h 4min

From API to AGI: Structured Outputs, OpenAI API platform and O1 Q&A — with Michelle Pokrass & OpenAI Devrel + Strawberry team

Michelle Pokrass, a key figure in OpenAI's DevRel efforts, shares her journey from roles at large tech companies to leading the API platform at OpenAI. She emphasizes the significance of Structured Outputs and its 100% reliable JSON schema adherence. The conversation dives into the complexities of scaling databases, innovations in AI models with constrained grammar, and the intricacies of API interactions. Insights into GPT-4o, the O1 models, and the collaborative spirit at OpenAI highlight the ongoing pursuit of accessible AGI.
Sep 3, 2024 • 1h 5min

Efficiency is Coming: 3000x Faster, Cheaper, Better AI Inference from Hardware Improvements, Quantization, and Synthetic Data Distillation

Discover the rapid advancements in AI efficiency and the dramatic cost reductions in GPT-level intelligence. Hear a fascinating journey from astrophysics to AI optimization, emphasizing model efficiency and synthetic data. Learn about the crucial role of data quality in training, and how organizations are tackling the challenges of achieving Artificial General Intelligence. Explore the emergence of 3D AI characters and their potential in gaming and brand representation, revolutionizing interactive experiences and content creation.
Aug 29, 2024 • 1h 10min

Why you should write your own LLM benchmarks — with Nicholas Carlini, Google DeepMind

Nicholas Carlini, a research scientist at Google DeepMind, advocates for personalized benchmarks in AI. He emphasizes how AI can handle routine, tedious tasks, freeing up creativity for more valuable work. Carlini elaborates on his viral blog post detailing 12 specific ways he uses AI, from writing code to solving simple problems. He also discusses the significance of customized model evaluations and the potential vulnerabilities in AI security, pushing for a better understanding of technology's role in practical applications.
Aug 22, 2024 • 1h 5min

Is finetuning GPT4o worth it? — with Alistair Pullen, Cosine (Genie)

Alistair Pullen of Cosine shares insights on Genie, Cosine's AI coding agent, which outperformed competitors by leveraging a fine-tuned GPT-4o model. He discusses training with billions of synthetic tokens, emphasizing the significance of generating runtime errors to improve the AI's coding capabilities. Pullen also explores challenges in AI reasoning and the necessity for adaptive toolkits, shedding light on the journey of merging AI with software engineering.
Aug 16, 2024 • 59min

AI Magic: Shipping 1000s of successful products with no managers and a team of 12 — Jeremy Howard of Answer.ai

Jeremy Howard, co-founder of Answer.ai, delves into how a lean team can innovate in AI without traditional management structures. He discusses the evolution of machine learning training strategies, emphasizing multi-phase pre-training. The conversation highlights the development of FastHTML, a Python framework for streamlined web app creation that builds on web fundamentals. Howard also introduces dialogue engineering, aimed at boosting productivity via user-friendly tools, while advocating for corporate governance that aligns AI research with societal values.
Aug 7, 2024 • 1h 4min

Segment Anything 2: Demo-first Model Development

The podcast dives into the breakthroughs of Segment Anything 2, paving the way for video segmentation with impressive accuracy while minimizing user interactions. Discussion includes the transformative journey of a lead researcher who shifted careers to focus on computer vision. Real-time demonstrations highlight user-friendly applications across fields like healthcare and agriculture. Listeners learn about SAM's efficiency gains, innovations in zero-shot capabilities, and the significance of diverse datasets for future advancements.
Aug 2, 2024 • 1h 55min

The Winds of AI Winter (Q2 Four Wars Recap) + ChatGPT Voice Mode Preview

Swyx and Alessio, the podcast's hosts, dive into the pressing themes of AI's evolving landscape. They explore the 'Four Wars' framework and how companies like Anthropic and Mistral are shifting strategies. The conversation covers recent advancements, including the voice mode of ChatGPT and its implications for user interaction. They also tackle the competitive dynamics of AI tools and the intricate challenges of character AI.
Jul 23, 2024 • 1h 5min

Llama 2, 3 & 4: Synthetic Data, RLHF, Agents on the path to Open Source AGI

Thomas Scialom, who led Llama 2 and now leads Llama 3, discusses pre-training with synthetic data, scaling laws, RLHF vs. instruction tuning, and the use of pure synthetic data. Llama 3 was trained on 15T tokens, leveraging Llama 2 as a classifier for the pre-training data mix. The episode also explores the significance of synthetic data generation models and the challenges of optimizing AI models with human feedback in reinforcement learning.
Jul 12, 2024 • 58min

Benchmarks 201: Why Leaderboards > Arenas >> LLM-as-Judge

The podcast discusses the evolution of leaderboards in AI evaluation, the challenges in running evaluations, the development of real-world agent benchmarks, an analysis of the AGI challenge, and predictions for future model development.
Jul 5, 2024 • 1h 45min

The 10,000x Yolo Researcher Metagame — with Yi Tay of Reka

Yi Tay of Reka discusses the qualities of top researchers, emphasizing reflection, long-term vision, and persistence. The conversation covers challenges in AI research, the evolution of LLMs, computing resource management, success in model architectures, and trends in multimodal models. It also explores scaling laws, efficiency in ML research, the debate over open vs. closed source AI models, productivity practices, the transition from academia to industry, global perspectives on tech hubs, and strategies in AI development.
