Senior Machine Learning Engineer at Spotify, Sanket Gupta, discusses foundational embeddings for transfer learning in recommender systems. Topics include large-scale recommender system building, transfer learning with user and item embeddings, system evaluation, and MLOps challenges. They explore music recommendation intricacies, user behavior analysis challenges, and balancing real-time recommendations with scalability. The podcast delves into user representations, cross-content embeddings, and maintaining content freshness for optimal user experiences.
Read more
AI Summary
AI Chapters
Episode notes
auto_awesome
Podcast summary created with Snipd AI
Quick takeaways
Foundational embeddings enable transfer learning in recommender systems for personalized user experiences.
Balancing speed and accuracy is crucial in serving relevant recommendations within milliseconds for user engagement.
Deep dives
Building a Robust Recommendation System at Spotify
Spotify's senior machine learning engineer dives into the intricacies of Spotify's recommendation system, emphasizing the importance of leveraging vector and feature stores for efficient embedding and ranking processes. The system focuses on creating personalized experiences for users through continuous updates to embeddings and balancing real-time inference with offline feature updates. Key challenges include evaluating system performance, managing a large database of user embeddings, and ensuring seamless integration with various app features.
Optimizing User Engagement through Fast Inference and Personalization
The discussion highlights the importance of near real-time updates in the feature store to incorporate user preferences quickly. Balancing speed with accuracy is crucial for serving relevant recommendations within milliseconds to keep users engaged. The process involves gathering user data, tuning embeddings, and adapting the feature store to cater to different team needs while ensuring dynamic and personalized content delivery.
Evaluating Performance and Model Generalization
The conversation delves into the complexity of evaluating system performance and adapting to new features seamlessly. Metrics like relevance, accuracy, and ranking play a vital role in assessing model effectiveness and user engagement. The emphasis on continuous testing and simulation helps in predicting user behavior, fine-tuning recommendations, and enhancing the overall user experience across Spotify features and content.
Collaborative Approach to Enhancing User Discovery and Personalization
The interview underscores the collaborative efforts within Spotify's teams to evolve user discovery and personalization strategies. Coordination between ML practitioners, UI/UX specialists, and product managers streamlines the process of integrating new features, optimizing system performance, and adapting to diverse user behaviors. The focus remains on improving relevance, engaging users with dynamic content, and balancing speed with accuracy to drive user satisfaction and platform growth.
Join us at our first in-person conference on June 25 all about AI Quality: https://www.aiqualityconference.com/
Sanket works as a Senior Machine Learning Engineer at Spotify working on building end-to-end audio recommender systems. Models built by his team are used across Spotify in many different products including Discover Weekly and Autoplay.
MLOps podcast #232 with Sanket Gupta, Senior Machine Learning Engineer at Spotify //
RecSys at Spotify.
A big thank you to LatticeFlow for sponsoring this episode! LatticeFlow - https://latticeflow.ai/
// Abstract
LLMs with foundational embeddings have changed the way we approach AI today. Instead of re-training models from scratch end-to-end, we instead rely on fine-tuning existing foundation models to perform transfer learning.
Is there a similar approach we can take with recommender systems?
In this episode, we can talk about:
a) how Spotify builds and maintains large-scale recommender systems,
b) how foundational user and item embeddings can enable transfer learning across multiple products,
c) how we evaluate this system
d) MLOps challenges with these systems
// Bio
Sanket works as a Senior Machine Learning Engineer on a team at Spotify building production-grade recommender systems. Models built by my team are being used in Autoplay, Daily Mix, Discover Weekly, etc.
Currently, my passion is how to build systems to understand user taste - how do we balance long-term and short-term understanding of users to enable a great personalized experience.
// MLOps Jobs board
https://mlops.pallet.xyz/jobs
// MLOps Swag/Merch
https://mlops-community.myshopify.com/
// Related Links
Website: https://sanketgupta.substack.com/
Our paper on this topic "Generalized User Representations for Transfer Learning": https://arxiv.org/abs/2403.00584
Sanket's blogs on Medium in the past: https://medium.com/@sanket107
--------------- ✌️Connect With Us ✌️ -------------
Join our slack community: https://go.mlops.community/slack
Follow us on Twitter: @mlopscommunity
Sign up for the next meetup: https://go.mlops.community/register
Catch all episodes, blogs, newsletters, and more: https://mlops.community/
Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/
Connect with Sanket on LinkedIn: www.linkedin.com/in/sanketgupta107
Timestamps:
[00:00] Sanket's preferred coffee
[00:37] Takeaways
[02:30] RecSys are RAGs
[06:22] Evaluating RecSys parallel to RAGs
[07:13] Music RecSys Optimization
[09:46] Dealing with cold start problems
[12:18] Quantity of models in the recommender systems
[13:09] Radio models
[16:24] Evaluation system
[20:25] Infrastructure support
[21:25] Transfer learning
[23:53] Vector database features
[25:31] Listening History Balance
[26:35 - 28:06] LatticeFlow Ad
[28:07] The beauty of embeddings
[30:13] Shift to real-time recommendation
[34:05] Vector Database Architecture Options
[35:30] Embeddings drive personalized
[40:16] Feature Stores vs Vector Databases
[42:33] Spotify product integration strategy
[45:38] Staying up to date with new features
[47:53] Speed vs Relevance metrics
[49:40] Wrap up
Get the Snipd podcast app
Unlock the knowledge in podcasts with the podcast player of the future.
AI-powered podcast player
Listen to all your favourite podcasts with AI-powered features
Discover highlights
Listen to the best highlights from the podcasts you love and dive into the full episode
Save any moment
Hear something you like? Tap your headphones to save it with AI-generated key takeaways
Share & Export
Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more
AI-powered podcast player
Listen to all your favourite podcasts with AI-powered features
Discover highlights
Listen to the best highlights from the podcasts you love and dive into the full episode