AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Best Defense Is Adversarial Training
When we set this up, we saw that both less overlap and less data overlap both would hurt the attackers success rate. But we also saw some odd behaviours atit wasn't sort of as consistent as you would expect. It sounds like some kind of general ation property kicking into effect or something. And so it's sort of overlearned to ta sort of unrealistic scenario. The attacks access rate increased on models that had been adversaria trained. Currently, the overall best defence is adversarial training. So your instantly training the model to be better at correctly classifying attacked data points. Our result showed, if you do this in a more realistic scenario, adversarial training actually weakens the