
8 - Assistance Games with Dylan Hadfield-Menell
AXRP - the AI X-risk Research Podcast
00:00
Identifying New Qualitative Features of Utility
In practice, that means missing features for the system. It takes a long time to integrate proxies for or measurements of those consequences into the system. And so value allignment problems and assistance games, where you look at mechanisms for identifying new qualitative features of utility. That's something i've been thinking about a lot recently.
Play episode from 39:18
Transcript


