LessWrong (30+ Karma) cover image

″[Advanced Intro to AI Alignment] 2. What Values May an AI Learn? — 4 Key Problems” by Towards_Keeperhood

LessWrong (30+ Karma)

00:00

How Values Scale with Power

TYPE III AUDIO explores how an AI's options expand off-distribution and how Goodhart amplification can worsen misalignment.

Play episode from 25:54
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app