
20 - 'Reform' AI Alignment with Scott Aaronson
AXRP - the AI X-risk Research Podcast
The Future of Cryptography
Could you have watermarking even in a public model? I do not know how to do that. It seems very similar to this work on, like, um, inserting Trojans and neural networks. There's some line of research where you train a neural network such that if it sees a cartoon smiley face, then it outputs horse or something.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.