
Two: Ajeya Cotra on accidentally teaching AI models to deceive us

The 80,000 Hours Podcast on Artificial Intelligence (September 2023)


Emergence of Trends from AI Systems in Go and Chess

Superhuman AI systems are shaping how people play Go and chess: human players have adopted new trends introduced by these systems, such as pushing pawns forward early in the chess opening. Explaining the reasoning behind a move may be harder than playing well in the first place; a system can be proficient at chess yet struggle to articulate why it chose a particular move. The difficulty may stem from a tension between excelling at a task and giving transparent explanations, as with AlphaFold, which might have an intuitive grasp of how proteins fold without being able to explain its decisions explicitly.

Play episode from 01:30:46
