AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
How to Train Smaller Diffusion Models on Data Sets
The model we attacked was trained on image description the public internet. The primary cause that we can identify is duplication. There are lots of images that are repeated many times and only some of them get memorized. And so it's very weird that like there is some property we just don't know maybe it's just luck.