80,000 Hours Podcast cover image

#195 – Sella Nevo on who's trying to steal frontier AI models, and what they could do with them

80,000 Hours Podcast

00:00

Manipulating Trust: The Threat of Human Intelligence in AI

This chapter explores the risks associated with human intelligence collection as malicious organizations employ sophisticated tactics to extract sensitive information from individuals. Highlighting historical and contemporary examples, the discussion delves into methods of coercion and extortion, revealing the moral implications of these strategies. The chapter also emphasizes the importance of awareness and technical solutions to safeguard against potential vulnerabilities in AI models.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app