2min snip

Reinforcement Learning for Personalization at Spotify with Tony Jebara - #609

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

NOTE

Spotify Uses Reinforcement Learning in Its Recommenders to Build Sticky User Habits

Spotify aims to balance short-term rewards against users' long-term satisfaction. Immediate gratification matters when a user opens the app, but recommending only content the user has already listened to can become monotonous and lead to fatigue. To prevent this, Spotify mixes familiar recommendations with new discoveries, which enriches the experience and supports long-term engagement. Helping users form new habits, such as listening to podcasts on specific days, makes them more likely to stay engaged and satisfied with the platform. These new recommendations may have a lower click-through rate at first, but they contribute to cumulative reward and encourage users to keep returning to Spotify over time. This matches the core principle of reinforcement learning: the objective is not only the next reward but the total reward accumulated in the future.
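
The contrast between the next reward and the accumulated reward can be made concrete with a discounted return, a standard quantity in reinforcement learning. The sketch below is only an illustration: the function name, the discount factor of 0.9, and the per-session reward numbers are all assumptions made up for this example, not figures from the episode or from Spotify.

```python
def discounted_return(rewards, gamma=0.9):
    """Sum of rewards, each weighted by gamma**t for its time step t."""
    return sum(r * gamma**t for t, r in enumerate(rewards))


# Hypothetical per-session engagement rewards (illustrative numbers only).
familiar_only = [1.0, 0.8, 0.6, 0.4, 0.2]   # starts high, decays as fatigue sets in
with_discovery = [0.6, 0.7, 0.8, 0.9, 1.0]  # starts lower, grows as new habits form

# Greedy view: compare only the immediate (first-session) reward.
print("Familiar wins on next reward:", familiar_only[0] > with_discovery[0])

# Reinforcement learning view: compare the discounted sum over all sessions.
print("Familiar-only return: ", round(discounted_return(familiar_only), 3))
print("With-discovery return:", round(discounted_return(with_discovery), 3))
```

With these made-up numbers, the familiar-only sequence wins on the first session but loses on the discounted total, which is the trade-off the note describes.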
