The Data Stack Show cover image

The Data Stack Show

132: Data Quality and Data Contracts with Chad Sanderson of Data Quality Camp

Mar 29, 2023
Data quality and data contracts are discussed by Chad Sanderson, an expert in the field. Topics covered include the breakdown of data quality, the concept of data contracts and their value, the tools needed for effective data contracts, and the importance of community in data quality.
01:06:34

Episode guests

Podcast summary created with Snipd AI

Quick takeaways

  • Data contracts ensure data quality and integrity, going beyond traditional APIs to consider semantic and logical layers.
  • Implementing data contracts involves communication, collaboration, and the use of tools like schema registries and monitoring tools.

Deep dives

Importance of Data Contracts and Data Quality

Data contracts are agreements between producers and consumers that ensure data quality at scale. They serve as a form of a data API, going beyond traditional APIs by considering not just schema but also the integrity of the data itself. Data contracts help ensure that data products work as intended and meet requirements. By enforcing programmatic mechanisms and checks at various stages like schema registry, serialization framework, staging tables, and CI/CD processes, data contracts ensure data consistency, trustworthiness, and adherence to semantic standards. Building a strong foundation of trustworthy data pipelines, ownership, and schema evolution is crucial in implementing data contracts.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner