The Robot Brains Podcast cover image

Spotify's Gustav Söderström on machine learning to personalize user experiences

The Robot Brains Podcast

00:00

Explorir Exploit in a Safe Way

I am curious about the explorir exploit you mention, because, i mean, reinforcement learning is known to be very different from regular supervised learning. And so i'm curious, how's that play out in this condict and a special cures? It seems like you could explore with one listener, and what you learn there then alleviates the need for exploration with another listener. I think this is a trate of witdis algritm ter. They can be very effective, but i also think there's a lot of responsibility in trying to understand as much as possible about what they do before you leverage them.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app