Real-time Feature Generation at Lyft // Rakesh Kumar // #334

Jul 25, 2025

Rakesh Kumar, a Senior Staff Software Engineer at Lyft with a focus on Machine Learning platforms, dives into the intricacies of real-time feature generation. He discusses how Lyft evolved from naive pipelines to handling millions of events per minute, achieving low-latency delivery. Rakesh emphasizes balancing self-service and specialized data processing while navigating the challenges of geospatial data. He also shares insights on technology adoption and how YAML configurations streamline data processing efforts. This session is a treasure trove for anyone interested in MLOps and real-time data management!

Ask episode

AI Snips

Chapters

Transcript

Episode notes

ANECDOTE

Evolution from Cron Jobs to Streaming

Lyft evolved from a cron job based pipeline to streaming processing using Apache Beam and Flink for real-time feature generation.
They addressed scalability issues by sharding data on geohashes instead of cities to evenly distribute load and avoid hot shards.

ANECDOTE

Real-Time vs Offline Feature Validation

Lyft compares real-time features with offline 'ground truth' features and alerts if discrepancies exceed thresholds.
This observability framework ensures real-time features maintain high data quality and reliability.

INSIGHT

Hierarchical Geospatial Feature Store

Lyft uses a geospatial hierarchical feature store that supports aggregated features across various geohash levels.
This flexible store allows different models to consume data at multiple regional granularities through a unified API.

Get the Snipd Podcast app to discover more snips from this episode

Get the app

Real-time Feature Generation at Lyft // MLOps Podcast #334 with Rakesh Kumar, Senior Staff Software Engineer at Lyft.

Join the Community: https://go.mlops.community/YTJoinIn

Get the newsletter: https://go.mlops.community/YTNewsletter

// Abstract

This session delves into real-time feature generation at Lyft. Real-time feature generation is critical for Lyft where accurate up-to-the-minute marketplace data is paramount for optimal operational efficiency. We will explore how the infrastructure handles the immense challenge of processing tens of millions of events per minute to generate features that truly reflect current marketplace conditions.

Lyft has built this massive infrastructure over time, evolving from a humble start and a naive pipeline. Through lessons learned and iterative improvements, Lyft has made several trade-offs to achieve low-latency, real-time feature delivery. MLOps plays a critical role in managing the lifecycle of these real-time feature pipelines, including monitoring and deployment. We will discuss the practicalities of building and maintaining high-throughput, low-latency real-time feature generation systems that power Lyft’s dynamic marketplace and business-critical products.

// Bio

Rakesh Kumar is a Senior Staff Software Engineer at Lyft, specializing in building and scaling Machine Learning platforms. Rakesh has expertise in MLOps, including real-time feature generation, experimentation platforms, and deploying ML models at scale. He is passionate about sharing his knowledge and fostering a culture of innovation. This is evident in his contributions to the tech community through blog posts, conference presentations, and reviewing technical publications.

// Related Links

Website: https://englife101.io/

https://eng.lyft.com/search?q=rakesh

https://eng.lyft.com/real-time-spatial-temporal-forecasting-lyft-fa90b3f3ec24

https://eng.lyft.com/evolution-of-streaming-pipelines-in-lyfts-marketplace-74295eaf1eba

Streaming Ecosystem Complexities and Cost Management // Rohit Agrawal // MLOps Podcast #302 - https://youtu.be/0axFbQwHEh8

~~~~~~~~ ✌️Connect With Us ✌️ ~~~~~~~

Catch all episodes, blogs, newsletters, and more: https://go.mlops.community/TYExplore

Join our Slack community [https://go.mlops.community/slack]

Follow us on X/Twitter [@mlopscommunity](https://x.com/mlopscommunity) or [LinkedIn](https://go.mlops.community/linkedin)]

MLOps Swag/Merch: [https://shop.mlops.community/]

Connect with Demetrios on LinkedIn: /dpbrinkm

Connect with Rakesh on LinkedIn: /rakeshkumar1007/

Timestamps:

[00:00] Rakesh preferred coffee

[00:24] Real-time machine learning

[04:51] Latency tricks explanation

[09:28] Real-time problem evolution

[15:51] Config management complexity

[18:57] Data contract implementation

[23:36] Feature store

[28:23] Offline vs online workflows

[31:02] Decision-making in tech shifts

[36:54] Cost evaluation frequency

[40:48] Model feature discussion

[49:09] Hot shard tricks

[55:05] Pipeline feature bundling

[57:38] Wrap up