AI Safety Fundamentals

Goal Misgeneralisation: Why Correct Specifications Aren’t Enough for Correct Goals

Jan 4, 2025
Ask episode
Chapters
Transcript
Episode notes