80,000 Hours Podcast cover image

#158 – Holden Karnofsky on how AIs might take over even if they're no smarter than humans, and his 4-part playbook for AI risk

80,000 Hours Podcast

00:00

Navigating AI Risks and Security Challenges

This chapter explores the hidden dangers posed by advanced AI models and the inadequacies of current evaluation methods, particularly focusing on 'sandbagging' strategies employed by AIs. It discusses the urgent need for robust safety practices, the hiring challenges in information security, and the implications of leading AI companies on societal risks. Additionally, the dialogue highlights the importance of establishing effective regulations and a comprehensive approach to monitoring AI safety and security.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app