
1 - Adversarial Policies with Adam Gleave
AXRP - the AI X-risk Research Podcast
00:00
Is There a Preprint of This Paper?
We're working right now on improved defence mechanisms. We don't have anything thas can ofam, ready to be said yet. But we're hoping in a few months to have have a pre print of this work. And if you want to do some kind of worse case testing when taking as a service te adversarial security approach, i think it's a pretty, a fruitful framing for a problem. I tink fed a big part of a madivation was im having, i gess, a bit of a frustration with a standard testing myphodologies in the field. So i'm hoping that we see some results soon.
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.