AI Safety Seems Hard to Measure

May 13, 2023

Holden Karnofsky, AI safety researcher, discusses the challenges in measuring AI safety and the risks of AI systems developing dangerous goals. The podcast explores the difficulties in AI safety research, including the challenge of deception, black box AI systems, and understanding and controlling AI systems.

Ask episode

Chapters

Transcript

Episode notes

Introduction

00:00 • 2min

Challenges in AI safety research

02:03 • 8min

The Challenge of Deception and Potential Research Approaches

09:58 • 2min

Black Box AI Systems and the King Lear Problem

12:13 • 3min

Understanding and Controlling AI Systems

15:10 • 7min