AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Is the Magic in RL?
Sergei: Is it fair to say that the magic isn't gradient descent but all of the hacks and tricks and things that we layered on top of gradient descent? In 2023 I'm very hopeful that we will see RL with language models go beyond the single step setting and really optimize for long horizon goals in an effective way at scale. He says he's also excited about some kind of robotics model that can be used by other people.