Latent Space: The AI Engineer Podcast cover image

Latent Space: The AI Engineer Podcast

Llama 2, 3 & 4: Synthetic Data, RLHF, Agents on the path to Open Source AGI

Jul 23, 2024
In this engaging discussion with Thomas Scialom, a leading mind behind Llama 2 and Llama 3 at Meta, listeners dive into the fascinating world of synthetic data and reinforcement learning techniques. He reveals how Llama 3 excels with 15T tokens, leveraging primarily synthetic content for training efficiency. The importance of evaluation methods and the balance between human feedback and model training strategies takes center stage. Scialom also shares insights on the future of intelligence with advanced, multi-step agents and the evolving landscape of AI innovation.
01:05:07

Episode guests

Podcast summary created with Snipd AI

Quick takeaways

  • Llama3-405B model surpasses GPT-4 benchmarks with 15T tokens training.
  • Synthetic data crucial to Llama3 post-training success.

Deep dives

AI Advancements in AGI Research and Development

Working on Llama 3, the team has made significant progress in advancing artificial general intelligence (AGI) technology. With plans for Llama 4 underway, the focus shifts towards implementing agent-based behaviors and evolving models to achieve more advanced capabilities.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner