
#3: Bandits and Simulators for Recommenders with Olivier Jeunen
Recsperts - Recommender Systems Experts
How to Predict CTR Based on Offline Data
The main bottleneck really is that you need to have a decent representative sample of the action space. And so this is like the main reason why banded learning systems have not been adopted in a majority of cases. But building the stochastic policy to sample directly from a few millions of items, that works seldomly. You would need way too much data to actually get that working properly.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.