80,000 Hours Podcast cover image

#195 – Sella Nevo on who's trying to steal frontier AI models, and what they could do with them

80,000 Hours Podcast

CHAPTER

Manipulating Trust: The Threat of Human Intelligence in AI

This chapter explores the risks associated with human intelligence collection as malicious organizations employ sophisticated tactics to extract sensitive information from individuals. Highlighting historical and contemporary examples, the discussion delves into methods of coercion and extortion, revealing the moral implications of these strategies. The chapter also emphasizes the importance of awareness and technical solutions to safeguard against potential vulnerabilities in AI models.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner