AXRP - the AI X-risk Research Podcast cover image

20 - 'Reform' AI Alignment with Scott Aaronson

AXRP - the AI X-risk Research Podcast

00:00

How to Recruit Into the AI Safety Community

Aaronson Christiano: I only know how to take the model and enclose it in this watermarking wrapper. Right, right. And now you might worry that the AI will look inside of itself and it will find some sub circuit that looks like it's calculating some pseudo random function. But even if not, there's still the the problem on our end of how do we insert that functionality in in sort of an obfuscated way? Aaronson Christianson: The question for me is, is there a concrete problem that I can make progress on? Yeah.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner