AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
How Do You Know if It's Raining?
Ges: I was expecting the optima strategies to be returned each time. The fact that they didn't for one of the learning methods was surprising and it 's not only that they would consistently not return. So deken would consistently return the naive policy, turn out ten times. And if i somehow figured out it's raining now, ad is going to get sunny, perhaps, depending on the reward structure, i should wait. These are the things we'd like to achieve. If you 're waiting for it to be sunny, you might have to wait a number of days. Once you have this quarter correlation of pressure, then the waiting strategy becomes a little less rewarding because you