CoRecursive: Coding Stories

Story: Reinforcement Learning At Facebook with Jason Gauci

7 snips
Feb 1, 2021
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ANECDOTE

Capture The Flag AI Emergence

  • Jason created a neural network AI to play capture the flag using neuro evolution techniques.
  • He was amazed to see emergent behaviors like players coordinating traps and decoys naturally appear.
INSIGHT

Explore-Exploit in Recommendations

  • Recommender systems must balance showing likely liked items with exploring new ones to learn user preferences.
  • This explore-exploit dynamic is essential for improving recommendations beyond just ranking history.
ANECDOTE

Early Facebook Reinforcement Learning

  • Facebook asked Jason to solve how to use reinforcement learning for ranking their news feed posts.
  • He faced initial struggles with contractors and developed the project quietly, open sourcing it to keep working after contracts ended.
Get the Snipd Podcast app to discover more snips from this episode
Get the app