Interviewing Finbarr Timbers on the "We are So Back" Era of Reinforcement Learning

21 snips

Dec 5, 2024

Finbarr Timbers, an AI researcher with a background at DeepMind and Midjourney, dives deep into the world of reinforcement learning. He explains the evolution of RL, from fundamental algorithms to its resurgence with breakthroughs like AlphaZero and ChatGPT. Timbers shares stories about teaching AI to tackle Atari games and discusses modern advancements in natural language processing. He highlights the growing importance of data annotation in RL and contrasts the pressure of deadlines in tech with lessons from endurance sports, emphasizing innovation.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

INSIGHT

Defining Reinforcement Learning

Reinforcement learning (RL) involves sequential decision-making under uncertainty.
True RL problems require exploration and exploitation trade-offs, like learning Atari games from scratch.

ANECDOTE

Finbarr's Path to RL

Finbarr Timbers's path to RL was unconventional, starting with math, economics, and econometrics.
He transitioned to machine learning after becoming disillusioned with economics and joined DeepMind's poker research team.

INSIGHT

The Bitter Lesson

The "bitter lesson" emphasizes scaling learning and search, not just blindly scaling compute.
GPT's self-supervised learning exemplifies the "bitter lesson" by minimizing baked-in biases.

Get the Snipd Podcast app to discover more snips from this episode

Get the app