The Data Engineering Show

How Zendesk engineers manage customer-facing data applications

7 snips
Feb 17, 2022
Ananth Packkildurai, Principal Software Engineer at Zendesk and curator of the popular Data Engineering Weekly newsletter, shares his insights on building customer-facing data applications. He discusses the evolution of analytics infrastructure and the significance of technologies like data lineage and catalogs. Ananth delves into real-time data integration solutions, evaluating tools such as Druid and ClickHouse. He also offers guidance for software engineers looking to break into data roles, emphasizing the importance of SQL and a log-driven approach.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
INSIGHT

Real-Time and Historical Data

  • Combining historical and real-time data is a persistent challenge.
  • Joining data in the database is generally preferred over joining in the stream due to potential optimizations.
INSIGHT

Real-time System Challenges

  • Real-time systems like Pinot and Druid, originating from ad-serving, are optimized for immutable events.
  • SaaS applications require handling mutable data and object lifecycles, making Pinot's upsert functionality, while present, not ideal.
ADVICE

Data Formats

  • Avoid JSON as your primary data format for data pipelines.
  • Use a schema registry and leverage compilers for type checking during compilation.
Get the Snipd Podcast app to discover more snips from this episode
Get the app