1 - Adversarial Policies with Adam Gleave

AXRP - the AI X-risk Research Podcast

Is There a Preprint of This Paper?

We're working right now on improved defence mechanisms. We don't have anything that's ready to be shared yet, but we're hoping to have a preprint of this work in a few months. And if you want to do some kind of worst-case testing, then taking this adversarial security approach is, I think, a pretty fruitful framing for the problem. I think a big part of the motivation was my having, I guess, a bit of frustration with the standard testing methodologies in the field. So I'm hoping that we see some results soon.
