6min chapter

The Shifting Privacy Left Podcast cover image

S2E10: Leveraging Synthetic Data and Privacy Guarantees with Lipika Ramaswamy (Gretel.ai)

The Shifting Privacy Left Podcast

CHAPTER

Using Tokenization and Anonymization in Machine Learning?

If you train a machine learning model on your personal data, it could pretty much generate almost an identical output of your personal data. So why use synthetic data instead of other techniques like tokenization, anonymization, aggregation, and others? Because our data doesn't really live a nice solution to privacy issues. And so that's one way that it's open to vulnerability. There are tons of like really famous studies on this.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode