AXRP - the AI X-risk Research Podcast cover image

8 - Assistance Games with Dylan Hadfield-Menell

AXRP - the AI X-risk Research Podcast

00:00

Is There a Future for Cooperative Irregularity?

The thing i'm most excited about is integrating meta reasoning models into co operative irel games. It places strong limits on how much you can learn about utilities, because there are now fixed costs to generating utility information that are distinct from actions in the worldand are avoidable in some sense. On the other side, actually figuring out how to build systems that are calebrated for cognitive effort would be really valuable and something i've been thinking about a lot recently.

Play episode from 02:09:14
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app