Offline Reinforcement Learning

offline reinforcement learning is different from the more canonical, onland reinforcement learning. The challenge is that if we think about the active process, the one where it's like you train a dog,. If the a i agent has some idea about an action that might be good, but it doesn't know if it's good, what is it going to do? Wel is going o go and try that action, experience the outcome and adjust its understanding accordingly. In off line arl, how do you determine if particular conclusion you draw from the data is actually accurate? Counterfactual quarry: Is this other inference actually accurate or is it simply a delusion caused by incomplete data?"

Transcript

Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app