AXRP - the AI X-risk Research Podcast cover image

20 - 'Reform' AI Alignment with Scott Aaronson

AXRP - the AI X-risk Research Podcast

CHAPTER

The Future of Cryptography

Could you have watermarking even in a public model? I do not know how to do that. It seems very similar to this work on, like, um, inserting Trojans and neural networks. There's some line of research where you train a neural network such that if it sees a cartoon smiley face, then it outputs horse or something.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner