AI Safety Fundamentals: Alignment

Goal Misgeneralisation: Why Correct Specifications Aren’t Enough for Correct Goals

May 13, 2023
Ask episode
Chapters
Transcript
Episode notes