8 - Assistance Games with Dylan Hadfield-Menell

AXRP - the AI X-risk Research Podcast

Identifying New Qualitative Features of Utility

In practice, that means missing features for the system. It takes a long time to integrate proxies for, or measurements of, those consequences into the system. And so value alignment problems and assistance games, where you look at mechanisms for identifying new qualitative features of utility, are something I've been thinking about a lot recently.

Play episode from 39:18
