
1 - Adversarial Policies with Adam Gleave
AXRP - the AI X-risk Research Podcast
00:00
Is There a Preprint of This Paper?
We're working right now on improved defence mechanisms. We don't have anything thas can ofam, ready to be said yet. But we're hoping in a few months to have have a pre print of this work. And if you want to do some kind of worse case testing when taking as a service te adversarial security approach, i think it's a pretty, a fruitful framing for a problem. I tink fed a big part of a madivation was im having, i gess, a bit of a frustration with a standard testing myphodologies in the field. So i'm hoping that we see some results soon.
Transcript
Play full episode