80,000 Hours Podcast cover image

#151 – Ajeya Cotra on accidentally teaching AI models to deceive us

80,000 Hours Podcast

00:00

Navigating AI's Transparency Dilemma

This chapter explores the challenges of AI decision-making, particularly the tension between the need for transparency and the economic implications of achieving it. Through examples from games like Go and chess, it highlights the complexity of training AI models to provide understandable reasoning for their decisions. The discussion also reflects on the wider implications of AI development, drawing comparisons to economic systems and the unpredictable nature of rapidly advancing technologies.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app