AI Safety Fundamentals cover image

The Alignment Problem From a Deep Learning Perspective

AI Safety Fundamentals

00:00

Goal Misgeneralisation and Internal Planning

Distinguishes capability vs. goal misgeneralisation and explains internally represented goals and how model-based and model-free policies can plan toward them.

Play episode from 08:06
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app