Recsperts - Recommender Systems Experts cover image

#3: Bandits and Simulators for Recommenders with Olivier Jeunen

Recsperts - Recommender Systems Experts

CHAPTER

How to Predict CTR Based on Offline Data

The main bottleneck really is that you need to have a decent representative sample of the action space. And so this is like the main reason why banded learning systems have not been adopted in a majority of cases. But building the stochastic policy to sample directly from a few millions of items, that works seldomly. You would need way too much data to actually get that working properly.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner