AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Role of Language Generalization in Reinforcement Learning
This is, you know, pretty much having a binary reward. So did you accomplish what you wanted to by then or not? And that makes reward hacking a lot harder. It's very much still research where they are fine tuning it for a few specific tasks. But it does seem like a sort of pretty general approach that can leverage these general purpose systems to apply them to all social domains.