Recsperts - Recommender Systems Experts cover image

#3: Bandits and Simulators for Recommenders with Olivier Jeunen

Recsperts - Recommender Systems Experts

00:00

How to Predict CTR Based on Offline Data

The main bottleneck really is that you need to have a decent representative sample of the action space. And so this is like the main reason why banded learning systems have not been adopted in a majority of cases. But building the stochastic policy to sample directly from a few millions of items, that works seldomly. You would need way too much data to actually get that working properly.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app