The 80000 Hours Podcast on Artificial Intelligence cover image

Two: Ajeya Cotra on accidentally teaching AI models to deceive us

The 80000 Hours Podcast on Artificial Intelligence

CHAPTER

Navigating Approaches in Artificial Intelligence

Exploring the concepts of iterated amplification and handoff approaches in creating smarter, aligned AI systems while discussing skepticism towards model size improvements and conceptual AI psychology. Delving into training AI systems for worst-case scenarios and promoting simpler interpretability methods over complex ones.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner