The Importance of RL in LLMs

Max Forzer: We're seeing good performance on easy stuff and not on hard stuff. I think there's sometimes some glossing over of what exactly the difficulty level is that we're dealing with. Once people get a little bit bored with textual domains, which is probably going to happen pretty soon, that's where RL starts to be really valuable again.

Play episode from 01:08:36
