
#3: Bandits and Simulators for Recommenders with Olivier Jeunen
Recsperts - Recommender Systems Experts
How to Use Simulation Environments to Improve Bandit Learning
There is also a need for data. We don't really have enough datasets where a stochastic policy is showing recommendations to users. As we move towards the reinforcement learning problem, I think we first need to build these in simulation environments before we can start using them to solve different problems. So my main tip would be: when you want to focus on bandit learning, look at using simulation environments.
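As a rough illustration of the tip above, here is a minimal sketch of a simulated recommendation environment in which a stochastic logging policy shows items to users and records the probability of each shown item. Everything in it is an assumption made for illustration and does not come from the episode: the three-item catalogue, the click probabilities, the uniform softmax logging policy, and the helper names `simulate_logs` and `ips_estimate`. The inverse propensity scoring (IPS) step is one common way to use such logs, not necessarily the approach the speaker has in mind.

```python
import math
import random


def softmax(scores, temperature=1.0):
    """Turn raw item scores into a probability distribution over items."""
    top = max(scores)
    exps = [math.exp((s - top) / temperature) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def simulate_logs(click_probs, policy_probs, n_rounds, seed=0):
    """Simulate a stochastic policy recommending one item per round.

    Each log entry is (action, propensity, reward), where propensity is the
    probability with which the logging policy chose the shown item.
    """
    rng = random.Random(seed)
    items = list(range(len(click_probs)))
    logs = []
    for _ in range(n_rounds):
        action = rng.choices(items, weights=policy_probs, k=1)[0]
        reward = 1 if rng.random() < click_probs[action] else 0
        logs.append((action, policy_probs[action], reward))
    return logs


def ips_estimate(logs, target_probs):
    """Estimate a target policy's click rate from logs via IPS."""
    weighted = [target_probs[a] / p * r for a, p, r in logs]
    return sum(weighted) / len(weighted)


# Hypothetical environment: three items with made-up click probabilities.
click_probs = [0.05, 0.10, 0.30]

# Hypothetical logging policy: equal scores give a uniform distribution.
logging_policy = softmax([0.0, 0.0, 0.0])
logs = simulate_logs(click_probs, logging_policy, n_rounds=20000)

# Hypothetical target policy: always recommend the last item.
target_policy = [0.0, 0.0, 1.0]
estimate = ips_estimate(logs, target_policy)
print(f"IPS estimate of the target policy's click rate: {estimate:.3f}")
```

Because the simulator knows the true click probabilities, the estimate can be checked against the ground truth, which is exactly what real logged data rarely allows.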