
8 - Assistance Games with Dylan Hadfield-Menell
AXRP - the AI X-risk Research Podcast
00:00
Is There a Future for Cooperative Irregularity?
The thing i'm most excited about is integrating meta reasoning models into co operative irel games. It places strong limits on how much you can learn about utilities, because there are now fixed costs to generating utility information that are distinct from actions in the worldand are avoidable in some sense. On the other side, actually figuring out how to build systems that are calebrated for cognitive effort would be really valuable and something i've been thinking about a lot recently.
Play episode from 02:09:14
Transcript


