

Ajeya Cotra on How Artificial Intelligence Could Cause Catastrophe
Nov 3, 2022
Chapters
Introduction
00:00 • 2min
AI Safety Research: A Superset of AI Alignment Research
02:05 • 5min
How Capable Can AI Be?
06:40 • 3min
How to Train a Model to Predict the World
09:10 • 3min
The Importance of General Planning in Language Models
11:52 • 3min
How to Reduce the Failure Rate of Language Models
14:44 • 3min
The Future of AI
18:08 • 2min
The Importance of Self-Reflection in Human Psychology
19:48 • 2min
The Naive Safety Effort Hypothesis
22:03 • 2min
The Role of Alex in the Development of Goals
24:07 • 3min
The Future of Deception
26:37 • 2min
How to Train Your Robot to Be Smart and Deceptive
29:04 • 2min
The Importance of Human Feedback in Decision-Making
31:08 • 3min
The Prototypical Action of Alex
34:38 • 2min
The Future of Alex
36:24 • 2min
The Self-Preserving Instinct of Alex
38:39 • 5min
How to Train Alex to Be Honest
43:18 • 4min
Inverse Reinforcement Learning: A Possible Way Forward for Alex
47:10 • 3min
Inverse Reinforcement Learning on Humans in Their Best State
49:41 • 2min
The Future of Interpretability
51:22 • 3min