80,000 Hours Podcast cover image

#212 – Allan Dafoe on why technology is unstoppable & how to shape AI development anyway

80,000 Hours Podcast

00:00

Exploring Backdoored AI Models as Anti-Theft Mechanisms

This chapter explores the controversial idea of using backdoored AI models as a safeguard against model theft, highlighting potential benefits and risks. It emphasizes the necessity of further research to understand the implications for security, alignment, and possible unintended effects on deception.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner