How to Use Simulation Environments to Improve Banded Learning

There is also a need for data. We don't really have enough data sets where there is a stochastic policy showing recommendations to users. When we're moving towards the reinforcement learning problem, I think we need to first build these in simulation environments before we can then start looking at using them to solve different problems. So my main tip would be when you want to focus on banded learning, look at using simulation environments.

Play episode from 38:53

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app