Towards Data Science cover image

96. Jan Leike - AI alignment at OpenAI

Towards Data Science

00:00

I Don't Think Wire Heading Is That Widely Understandable Outside of an Allignment Research

wire heading is one way that allignment researches wory advanced a i systems might fail to work as intended. The idea is that if we design an ai to optimize some reward metric, like points in a video game, for example, it can just tamper with its reward metric directly. It's likely to be an important class of a i failures that have safety implications. No one's quite figured out yet.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app