

Balancing Off-the-Shelf and Custom Solutions in Data Engineering
10 snips May 13, 2025
Tulika Bhatt, a senior software engineer at Netflix specializing in impression data, shares her journey from BlackRock and Verizon to shaping data services at a top streaming service. She discusses the challenges of balancing off-the-shelf solutions with custom systems, utilizing technologies like Spark and Flink. Tulika dives into the intricacies of ensuring data quality and observability, emphasizing automation and robust alerting strategies. She also explores the integration of AI in data engineering, highlighting its potential and the hurdles faced in maximizing efficiency.
AI Snips
Chapters
Transcript
Episode notes
Impressions Data Importance & Needs
- Impressions data is critical for Netflix personalization and homepage construction.
- Both real-time and batch processing are necessary for serving diverse use cases with impressions.
Innovative Impressions Data Solution
- Netflix combined Spark and Iceberg to serve real-time impression data in a gRPC service.
- They cleverly aggregated year-long data into manageable weekly chunks for real-time usage.
Balance Off-the-Shelf and Custom
- Start by evaluating open source or off-the-shelf technology before building custom solutions.
- Innovate and build custom components only when existing tools do not meet specific high-scale needs.