LessWrong (Curated & Popular)

[HUMAN VOICE] "Sum-threshold attacks" by TsviBT

Oct 18, 2023
The podcast discusses sum-threshold attacks and the importance of coordinated arguments. It explores adversarial image attacks and how small changes can deceive AI classifiers. The concept of optimization channels and the notion of a vector space representing noticeable features are also explored.
Ask episode
Chapters
Transcript
Episode notes