[Cognitive Revolution] The Tiny Model Revolution with Ronen Eldan and Yuanzhi Li of Microsoft Research

Jul 1, 2023

Join Ronen Eldan and Yuanzhi Li from Microsoft Research as they dive into the fascinating world of tiny language models. Learn how their Tiny Stories project showcases these models' surprising storytelling abilities while prioritizing data quality over sheer size. The duo discusses new training methods that mimic human language learning and explores the emergence of reasoning skills in AI. Discover the creative challenges of generating diverse narratives for young audiences and how understanding these small models can reshape the future of AI.

ANECDOTE

Dog Completion Example

The prompt "Her mom didn't let her have a dog, so she asked for a…" illustrates a limitation of language models.
Even large models such as GPT-2 XL often complete it with "dog", because the word appears nearby and is frequent, not because it makes logical sense.

ANECDOTE

Tom and Jane's Soup

The story about Tom and Jane's soup demonstrates Tiny Stories' simplicity.
It highlights the dataset's childlike nature and its focus on basic vocabulary and reasoning.

INSIGHT

Synthetic Data Generation

Tiny Stories uses synthetic data generated by GPT-3.5/4, guided by specific word combinations.
This approach promotes diversity and creativity while controlling vocabulary and structure.
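
As a rough illustration of how word combinations might guide generation, the sketch below samples a few words and folds them into a prompt for a generator model. The vocabulary, the choice of one noun, one verb, and one adjective, the prompt wording, and the `build_prompt` helper are all assumptions made for this example; the episode summary does not specify them.

```python
import random

# Hypothetical child-level vocabulary, grouped by part of speech.
# The real word list used for Tiny Stories is not given in this summary.
VOCAB = {
    "noun": ["dog", "soup", "ball", "tree", "boat"],
    "verb": ["jump", "share", "find", "cook", "hide"],
    "adjective": ["happy", "tiny", "brave", "sleepy", "red"],
}


def build_prompt(rng):
    """Pick one word per category and build a story-writing prompt."""
    words = [rng.choice(VOCAB[kind]) for kind in ("noun", "verb", "adjective")]
    prompt = (
        "Write a short story using only words a young child would understand. "
        f"The story must use the words: {', '.join(words)}."
    )
    return words, prompt


if __name__ == "__main__":
    rng = random.Random(0)  # seeded so the sampled prompts are repeatable
    for _ in range(3):
        _, prompt = build_prompt(rng)
        print(prompt)
```

Each sampled combination yields a different prompt, which is one plausible way to push a generator toward varied stories while keeping the vocabulary constrained.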
