Data Engineering Podcast

Balancing Off-the-Shelf and Custom Solutions in Data Engineering

May 13, 2025
Tulika Bhatt, a senior software engineer at Netflix specializing in impression data, shares her journey from BlackRock and Verizon to shaping data services at a top streaming service. She discusses the challenges of balancing off-the-shelf solutions with custom systems, with reference to technologies such as Spark and Flink. Tulika also covers data quality and observability, emphasizing automation and robust alerting strategies. Finally, she explores the integration of AI into data engineering, highlighting both its potential and the hurdles to using it efficiently.
INSIGHT

Impressions Data Importance & Needs

  • Impressions data is critical for Netflix personalization and homepage construction.
  • Both real-time and batch processing are necessary for serving diverse use cases with impressions.
ANECDOTE

Innovative Impressions Data Solution

  • Netflix combined Spark and Iceberg to serve real-time impression data in a gRPC service.
  • They cleverly aggregated year-long data into manageable weekly chunks for real-time usage.
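
The weekly-chunk idea above can be illustrated with a minimal sketch. This is not Netflix's implementation, which the summary does not describe in detail. The event shape `(profile_id, title_id, day)` and the function names `week_start`, `aggregate_weekly`, and `impressions_since` are all assumptions made for illustration, and plain in-memory dictionaries stand in for whatever storage a real service would use.

```python
from collections import defaultdict
from datetime import date, timedelta


def week_start(day: date) -> date:
    """Return the Monday of the week that contains `day`."""
    return day - timedelta(days=day.weekday())


def aggregate_weekly(impressions):
    """Roll raw impression events up into per-week counts.

    `impressions` is an iterable of (profile_id, title_id, day) tuples
    (a hypothetical event shape). The result is nested as
    {profile_id: {week_start: {title_id: count}}}.
    """
    weekly = defaultdict(lambda: defaultdict(lambda: defaultdict(int)))
    for profile_id, title_id, day in impressions:
        weekly[profile_id][week_start(day)][title_id] += 1
    return weekly


def impressions_since(weekly, profile_id, title_id, since: date) -> int:
    """Sum one title's count over every weekly chunk from the week of `since` onward."""
    cutoff = week_start(since)
    chunks = weekly.get(profile_id, {})
    return sum(
        counts.get(title_id, 0)
        for start, counts in chunks.items()
        if start >= cutoff
    )


events = [
    ("p1", "t1", date(2025, 5, 5)),
    ("p1", "t1", date(2025, 5, 7)),
    ("p1", "t1", date(2025, 5, 13)),
    ("p1", "t2", date(2025, 5, 6)),
]
weekly = aggregate_weekly(events)
recent = impressions_since(weekly, "p1", "t1", date(2025, 5, 12))
```

Under these assumptions, a lookup spanning a full year touches at most about 53 weekly chunks per profile instead of every raw event, which is the property that makes pre-aggregated data practical to read at request time.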
ADVICE

Balance Off-the-Shelf and Custom

  • Start by evaluating open source or off-the-shelf technology before building custom solutions.
  • Innovate and build custom components only when existing tools do not meet specific high-scale needs.