Data Engineering Podcast cover image

Data Engineering Podcast

Build Better Tests For Your dbt Projects With Datafold And data-diff

Jun 11, 2023
48:22

Episode guests

Podcast summary created with Snipd AI

Quick takeaways

  • Datafold combines DataDiff and data lineage analysis to enhance visibility and impact analysis of code changes in data platforms.
  • Testing and validation skills for data practitioners are evolving, with DBT tests and tools like DataDiff and data lineage analysis aiding in achieving quality and correctness of data.

Deep dives

Data Fold's mission to automate testing for data and analytics engineers

Data Fold is focused on automating testing for data and analytics engineers by providing tools that help verify and validate the code written by data developers. Their goal is to ensure that data teams can ship high-quality data products faster. They achieve this by combining two technologies - DataDiff, an open-source tool for comparing tables and SQL queries, and data lineage analysis. DataDiff helps data developers preview the changes they make to DBT models, ensuring they are fully aware of the impact on the data produced. Data lineage analyzes metadata, logs, and integrates with BI tools to understand the dependencies within the data platform. By using these technologies together, DataFold helps data teams understand the impact of code changes on the entire data platform and enhances visibility during the code deployment process.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner