How to Recruit Into the AI Safety Community

Aaronson Christiano: I only know how to take the model and enclose it in this watermarking wrapper. Right, right. And now you might worry that the AI will look inside of itself and find some subcircuit that looks like it's calculating some pseudorandom function. But even if not, there's still the problem on our end of how do we insert that functionality in sort of an obfuscated way?

Aaronson Christianson: The question for me is, is there a concrete problem that I can make progress on? Yeah.
