3min chapter

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

Reinforcement Learning for Personalization at Spotify with Tony Jebara - #609

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

CHAPTER

Using Offline RL for Personalization

We're now thinking of our business more about building a journey rather than getting you to just click on something with a band it. We use these techniques from baby RL, which is multi arm bandits. This is like if you have a forgetful RL agent that doesn't know what state it's in then basically to band it. It turns out RL is also about getting a user to go on a journey and discover new things and enrich the way they use modifying their day to day life. And so you can't just think of it as a multi arm banded in the casino, which kind of doesn't really remember what happened before. You have to really understand users are impacted by your recommendations

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode