
Faking Data Using Tonic.ai with Ian Coe and Adam Kamor
Data Archives - Software Engineering Daily
Tonic Subsetting
A standard rollout would involve duplicating the data set, one copy per environment. But Tonic ships with a feature called subsetting. Subsetting takes a database and generates a new database that is schematically the same. It has the same foreign key constraints and references, but it's smaller.
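To make the idea of subsetting concrete, here is a minimal sketch in Python using the standard library's sqlite3 module. It is not Tonic's implementation. The `users` and `orders` tables, the `build_source` and `subset` functions, and the "keep every tenth order" rule are all hypothetical, chosen only to illustrate copying a slice of one table and then pulling in the rows it references so the foreign keys still resolve.

```python
import sqlite3

# Hypothetical schema for illustration: orders reference users by foreign key.
SCHEMA = """
CREATE TABLE users (
    id INTEGER PRIMARY KEY,
    name TEXT NOT NULL
);
CREATE TABLE orders (
    id INTEGER PRIMARY KEY,
    user_id INTEGER NOT NULL REFERENCES users(id),
    total REAL NOT NULL
);
"""


def build_source() -> sqlite3.Connection:
    """Create a stand-in 'production' database with 100 users and 1000 orders."""
    src = sqlite3.connect(":memory:")
    src.executescript(SCHEMA)
    src.executemany(
        "INSERT INTO users VALUES (?, ?)",
        [(i, f"user{i}") for i in range(1, 101)],
    )
    src.executemany(
        "INSERT INTO orders VALUES (?, ?, ?)",
        [(i, (i % 100) + 1, float(i)) for i in range(1, 1001)],
    )
    src.commit()
    return src


def subset(src: sqlite3.Connection, keep_every: int = 10) -> sqlite3.Connection:
    """Copy a slice of orders plus the users they reference into a new database."""
    dst = sqlite3.connect(":memory:")
    dst.execute("PRAGMA foreign_keys = ON")  # enforce references in the copy
    dst.executescript(SCHEMA)                # same schema as the source

    # Step 1: pick a slice of the target table (assumed rule: every Nth order).
    orders = src.execute(
        "SELECT id, user_id, total FROM orders WHERE id % ? = 0",
        (keep_every,),
    ).fetchall()

    # Step 2: follow the foreign key to collect only the referenced users.
    user_ids = sorted({row[1] for row in orders})
    placeholders = ",".join("?" * len(user_ids))
    users = src.execute(
        f"SELECT id, name FROM users WHERE id IN ({placeholders})",
        user_ids,
    ).fetchall()

    # Step 3: insert parent rows first so every order's reference resolves.
    dst.executemany("INSERT INTO users VALUES (?, ?)", users)
    dst.executemany("INSERT INTO orders VALUES (?, ?, ?)", orders)
    dst.commit()
    return dst


if __name__ == "__main__":
    small = subset(build_source())
    print(small.execute("SELECT COUNT(*) FROM orders").fetchone()[0], "orders")
    print(small.execute("SELECT COUNT(*) FROM users").fetchone()[0], "users")
```

The smaller database keeps the same tables and constraints as the source, which is the property described in the clip. A real schema with many tables would need to walk the foreign key graph in both directions, something this two-table sketch does not attempt.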
Transcript

Companies that gather data about their users have an ethical obligation and legal responsibility to protect the personally identifiable information in their dataset. Ideally, developers working on a software application wouldn’t need access to production data. Yet without high-quality example data, many technology groups stumble on avoidable problems. Organizations need a solution to protect privacy while simultaneously preserving aspects of the data which are important.
