3min chapter

Privacy and Security for Stable Diffusion and LLMs with Nicholas Carlini - #618

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

CHAPTER

How to Extract the Training Data Using White Box and Black Box

It seems like you have so much more power, but it's been hard to take advantage of it. It sounds like your interface for extracting the training data is through prompting. Yeah, definitely. So all we do, it's the most naive attack. And then we just search to see among the generations with the same prompt, do we get the same image out like five or ten times? And if the answer is yes, we predict this probably the memorized image.

00:00

Transcript

Episode notes

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

3min chapter

Privacy and Security for Stable Diffusion and LLMs with Nicholas Carlini - #618

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Get the Snipdpodcast app

AI-poweredpodcast player

Discoverhighlights

Save anymoment

Share& Export

AI-poweredpodcast player

Discoverhighlights

Get the Snipd
podcast app

AI-powered
podcast player

Discover
highlights

Save any
moment

Share
& Export

AI-powered
podcast player

Discover
highlights