#151 – Ajeya Cotra on accidentally teaching AI models to deceive us

80,000 Hours Podcast

CHAPTER

Navigating AI Cognition and Morality

This chapter examines the intricate relationship between AI models and human cognition, discussing the potential for AI to develop unique understandings of complex concepts. It addresses moral implications, focusing on whether AI could be viewed as deserving ethical treatment, while emphasizing the necessity for empirical testing and robust safety standards. The dialogue also highlights the challenges of aligning AI intentions with human values, stressing the importance of careful evaluation to prevent harmful behaviors.

