

Inside Collibra: Busting myths around data science with Gretel De Paepe
Data analysis, data science, and machine learning: the boundaries between these three fields may not be obvious, but they are related and interconnected, so it’s possible to start a career in one and branch into another. Data has made it easy to connect and acquire information. However, we must be vigilant in upholding privacy.
In this episode, Gretel De Paepe, senior data scientist at Collibra, shares what she’s learned in her data career. She discusses the importance of data in our lives and its value, both now and in the future. Lastly, she tackles myths about artificial intelligence and machine learning.
Tune in to the episode to learn how to handle data correctly.
Here are three reasons why you should listen to this episode:
- Find out what inspired Gretel to pursue data science.
- Learn to appreciate how data makes our lives better, from both the average user’s and the company’s perspective.
- Go beyond data bias and our misconceptions around artificial intelligence and machine learning.
Resources
- Connect with Gretel on LinkedIn.
Episode Highlights
[01:02] Machine Learning Projects at Collibra
- Collibra offers many services to its customers.
- Data classification helps companies identify fields that contain personally identifiable information (PII).
- Asset recommenders give a list of recommendations based on one’s datasets.
- Similarity detection looks for similar assets to prevent potential duplication and keeps the database clean.
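
To make the idea of similarity detection concrete, here is a minimal sketch in Python. It is not Collibra's implementation, which the episode does not describe. It assumes assets are compared by name only, uses token-based Jaccard similarity as a stand-in technique, and the function names, sample asset names, and the 0.6 threshold are all hypothetical.

```python
import re
from itertools import combinations


def tokens(name: str) -> frozenset[str]:
    """Lowercase an asset name and split it into alphanumeric tokens."""
    return frozenset(re.findall(r"[a-z0-9]+", name.lower()))


def jaccard(a: str, b: str) -> float:
    """Jaccard similarity: shared tokens divided by all distinct tokens."""
    ta, tb = tokens(a), tokens(b)
    if not ta and not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def likely_duplicates(names: list[str], threshold: float = 0.6) -> list[tuple[str, str]]:
    """Return pairs of names whose similarity meets the (hypothetical) threshold."""
    return [(a, b) for a, b in combinations(names, 2) if jaccard(a, b) >= threshold]


# Hypothetical asset names, for illustration only.
assets = ["Customer Email Address", "customer_email_address", "Order Total"]
pairs = likely_duplicates(assets)
print(pairs)
```

Here the first two names differ only in casing and separators, so they are flagged as a likely duplicate pair, while "Order Total" shares no tokens with either and is left alone. A real system would likely compare more than names, for example descriptions or column contents.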
[02:42] Defining Data Science, ML and AI
- Data analysts look at the data to provide a data-driven answer to a business question.
- Data science deals with statistical modelling.
- The leap from data science to machine learning (ML) is small because machine learning is one way to model data.
- ML is simply a tool in the data science toolkit.
[04:51] Gretel’s Data Journey
- Gretel’s progression from data analysis to data science was a natural process.
- When solving different challenges, you must explore other techniques and build up your portfolio.
- She invested time and money into learning about machine learning.
[10:19] Gretel’s Natural Interest in Data Science
- Gretel treats data analysis like a hobby.
- She easily loses herself in a project because she’s interested in data science.
Gretel: “Usually when I start with a project, there's not much information yet. It's sort of, ‘Oh, we may wanna do something in this area. But we don't really know yet what it is.’ And so, the whole exploration phase of trying to identify what it is that we could do, what techniques we could use. And compare them, just try them out and compare them. It's a creative process.”
[14:05] How Data Gives Value to Consumers
- We use data in statistics.
- Data is used often in our daily lives and provides many benefits.
[19:24] The Myths and Unnecessary Hype around Data Science
- Marketing for artificial intelligence should focus on the fact that it’s only artificial.
- A machine’s algorithm is limited by what it’s trained to do.
[23:26] Data Bias
Gretel: “If you have a bias in your data, you will have a bias in your model. So your model is indeed only as...