TalkRL: The Reinforcement Learning Podcast cover image

John Schulman

TalkRL: The Reinforcement Learning Podcast

00:00

Is There a Time to Rethink PPO?

I expect AI to be able to do better than humans at most jobs in five years or so. For a while, we're going to discover things that AI isn't very good at and where we want to keep humans in control. I think there'll be some kind of gradual process over the next 10 or 15 years. And then Ethan, the Calabero from, uh, Mila asks, what is your median estimate for the arrival date of AGI?

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app