Spotify's Gustav Söderström on machine learning to personalize user experiences

The Robot Brains Podcast

CHAPTER

Explore-Exploit in a Safe Way

I am curious about the explore-exploit you mention, because, I mean, reinforcement learning is known to be very different from regular supervised learning. And so I'm curious, how does that play out in this context [unclear]? It seems like you could explore with one listener, and what you learn there then alleviates the need for exploration with another listener. I think this is a trait of these algorithms. They can be very effective, but I also think there's a lot of responsibility in trying to understand as much as possible about what they do before you leverage them.
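The point about exploring with one listener and reusing what was learned for another can be illustrated with a small sketch. This is not how Spotify does it; it is a minimal epsilon-greedy bandit, assuming a single pool of reward estimates shared by all listeners. The class name `SharedEpsilonGreedy`, the item names, and the reward values are hypothetical, and the small `epsilon` is only a crude stand-in for keeping exploration bounded.

```python
import random


class SharedEpsilonGreedy:
    """Epsilon-greedy bandit whose reward estimates are pooled across listeners.

    Illustrative assumption: every listener reads and updates the same
    statistics, so feedback gathered from one listener informs the
    recommendation made to the next.
    """

    def __init__(self, items, epsilon=0.1, seed=0):
        self.items = list(items)
        self.epsilon = epsilon          # fraction of recommendations spent exploring
        self.rng = random.Random(seed)  # seeded so runs are repeatable
        self.plays = {item: 0 for item in self.items}
        self.mean_reward = {item: 0.0 for item in self.items}

    def recommend(self):
        # Explore: with probability epsilon, pick a random item.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.items)
        # Exploit: otherwise pick the item with the best pooled estimate.
        return max(self.items, key=lambda item: self.mean_reward[item])

    def update(self, item, reward):
        # Incremental running mean of the observed reward for this item.
        self.plays[item] += 1
        self.mean_reward[item] += (reward - self.mean_reward[item]) / self.plays[item]


bandit = SharedEpsilonGreedy(["track_a", "track_b"], epsilon=0.1)
bandit.update("track_b", 1.0)  # feedback observed from one listener
print(bandit.recommend())      # a later listener draws on that shared feedback
```

Lowering `epsilon` reduces how often any listener is shown an untested item, at the cost of learning more slowly about items with few plays.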
