This is pretty much a binary reward: did you accomplish what you wanted to by then, or not? And that makes reward hacking a lot harder. It's very much still research, where they are fine-tuning it for a few specific tasks. But it does seem like a pretty general approach that can leverage these general-purpose systems and apply them to all sorts of domains.
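
As a rough illustration of the binary reward described here, the following is a minimal Python sketch. The `Outcome` type, its field names, and the use of a step budget to stand in for "by then" are all assumptions made for illustration; none of them come from the research being discussed.

```python
from dataclasses import dataclass


@dataclass
class Outcome:
    """Hypothetical record of how one attempt at a task ended."""
    goal_met: bool    # did the system accomplish what it set out to do?
    steps_used: int   # how many steps the attempt took
    step_budget: int  # assumed stand-in for the "by then" cutoff


def binary_reward(outcome: Outcome) -> float:
    """Return 1.0 only if the goal was met within the budget, else 0.0."""
    on_time = outcome.steps_used <= outcome.step_budget
    return 1.0 if (outcome.goal_met and on_time) else 0.0
```

Because the signal is all-or-nothing, there is no partial credit for a policy to accumulate through intermediate behavior, which is one plausible reading of why reward hacking becomes harder. The reward is still only as trustworthy as the check that sets `goal_met`.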