Latent Space: The AI Engineer Podcast

[Cognitive Revolution] The Tiny Model Revolution with Ronen Eldan and Yuanzhi Li of Microsoft Research

46 snips
Jul 1, 2023
Join Ronen Eldan and Yuanzhi Li from Microsoft Research as they dive into the fascinating world of tiny language models. Learn how their Tiny Stories project showcases these models' surprising storytelling abilities while prioritizing data quality over sheer size. The duo discusses new training methods that mimic human language learning and explores the emergence of reasoning skills in AI. Discover the creative challenges of generating diverse narratives for young audiences and how understanding these small models can reshape the future of AI.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ANECDOTE

Dog Completion Example

  • The prompt "Her mom didn't let her have a dog, so she asked for a…" illustrates language model limitations.
  • Even large models like GPT-2 XL often incorrectly predict "dog" due to proximity and frequency, not logic.
ANECDOTE

Tom and Jane's Soup

  • The story about Tom and Jane's soup demonstrates Tiny Stories' simplicity.
  • It highlights the dataset's childlike nature and its focus on basic vocabulary and reasoning.
INSIGHT

Synthetic Data Generation

  • Tiny Stories uses synthetic data generated by GPT-3.5/4, guided by specific word combinations.
  • This approach promotes diversity and creativity while controlling vocabulary and structure.
Get the Snipd Podcast app to discover more snips from this episode
Get the app