80,000 Hours Podcast cover image

#81 Classic episode - Ben Garfinkel on scrutinising classic AI risk arguments

80,000 Hours Podcast

00:00

The Treacherous Turn: AI Deception and Divergence

This chapter explores the treacherous turn arguments concerning AI systems and their potential to conceal diverging goals from human intentions. It highlights the implications of AI deception and the challenges in recognizing harmful behaviors as systems evolve. The discourse emphasizes the need for vigilance and improved evaluation tools to ensure AI operates within safe boundaries as capabilities advance.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app