The 80000 Hours Podcast on Artificial Intelligence cover image

Two: Ajeya Cotra on accidentally teaching AI models to deceive us

The 80000 Hours Podcast on Artificial Intelligence

00:00

Navigating Approaches in Artificial Intelligence

Exploring the concepts of iterated amplification and handoff approaches in creating smarter, aligned AI systems while discussing skepticism towards model size improvements and conceptual AI psychology. Delving into training AI systems for worst-case scenarios and promoting simpler interpretability methods over complex ones.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app