MLOps.community  cover image

Data Engineering for ML // Chad Sanderson // Coffee Sessions #117

MLOps.community

00:00

Data Quality and Data Validation - The Biggest Problem in Data Science

At Slack, first and foremost, we relied primarily on our structured logs. We had thrift Schema for our structured logs that were designed to go in the data warehouse. That was obviously integrated with CoderView, integrated with CI CD checks such that if you introduced a backwards incompatible change to the logs, the CI CD check would fail. You could, again, it could be overwritten, but you had to have a conversation with people before you did it, right? John: It's certainly the case that people like unintentionally, like through ignorance or whatever do these changes and stuff.

Play episode from 38:20
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app