AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
How to Extract the Training Data Using White Box and Black Box
It seems like you have so much more power, but it's been hard to take advantage of it. It sounds like your interface for extracting the training data is through prompting. Yeah, definitely. So all we do, it's the most naive attack. And then we just search to see among the generations with the same prompt, do we get the same image out like five or ten times? And if the answer is yes, we predict this probably the memorized image.