80,000 Hours Podcast cover image

#151 – Ajeya Cotra on accidentally teaching AI models to deceive us

80,000 Hours Podcast

00:00

Navigating AI Cognition and Morality

This chapter examines the intricate relationship between AI models and human cognition, discussing the potential for AI to develop unique understandings of complex concepts. It addresses moral implications, focusing on whether AI could be viewed as deserving ethical treatment, while emphasizing the necessity for empirical testing and robust safety standards. The dialogue also highlights the challenges of aligning AI intentions with human values, stressing the importance of careful evaluation to prevent harmful behaviors.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app