AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
What the Heck Is Going on When It's Not Working?
One of the things that historically and maybe it's not in the review mirror fully yet but has kind of plagued RL is just fragility. Do you foresee that RL becoming easier to apply in the near term future? Yeah, I think that this is actually an area that used to be extremely difficult to deal with because we didn't really understand exactly what was going on. There have been a few papers that have come out from my group from the Schimland-Wyzeens group of the student Clara Lyle from some folks at DeepMind.