AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Large Scale Regression Learning (RL) in Atari Games
In RL, you need to do this kind of factual reasoning pretty implicitly. And so you need to represent these sub-optimal behaviors. But in a language model, you don't need to,. they're often quite bad at a counterfactual reasoning. So there's something interesting here.