
20 - 'Reform' AI Alignment with Scott Aaronson
AXRP - the AI X-risk Research Podcast
00:00
How to Recruit Into the AI Safety Community
Aaronson Christiano: I only know how to take the model and enclose it in this watermarking wrapper. Right, right. And now you might worry that the AI will look inside of itself and it will find some sub circuit that looks like it's calculating some pseudo random function. But even if not, there's still the the problem on our end of how do we insert that functionality in in sort of an obfuscated way? Aaronson Christianson: The question for me is, is there a concrete problem that I can make progress on? Yeah.
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.