Predicting the Future

The future is an interesting problem because I don't think anybody knows the answer. Maybe you could have something that predicts only specific things about the teacher that are themselves useful for predicting the reward. Because that's really what I care about. But of course, a valid constraint then is, if you're learning reward functions, will actually generalize to new tasks. Yeah.

Play episode from 55:18

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app