AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Trade-Off Between Exploration and Not Showing Good Recommendations
When the weight is going to be zero, when we never actually showed certain items as a recommendation, we can never learn whether they would be good or not. So maybe sometimes it's better to have a very, very small weight. You might show it's like one in a million cases. And then suppose you might have made a wrong assumption, you will learn this. Like the signal will be in the data that there actually is a high CTR because many grown men basically have kids so it's a good recommendation to actually be showing these toys to them. But of course it's going to be a very hard problem because many items that have a very small probability of being sampled really are just