5min chapter

The Robot Brains Podcast cover image

Spotify's Gustav Söderström on machine learning to personalize user experiences

The Robot Brains Podcast

CHAPTER

Can You Crate a Similator With Perfect Rules?

We're trying to apply that concept of optimizing for long term behavior. We found it useful to try to cast the entire mendation problem as a reinforcement learning problem. So while in the similator its the reward is literally how far you get down this track list, on the sort of mental level, i think long term retention is a good reward to look at. And what is interesting is that you really do see what you expect. It turns out that what you candof hope you see even if you optimize for lower engagement in one moment, you longer term retention can be more effective. From an unexpected point of vew I mean, you might expect your algaritm at any

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode