How to Use Policy Sketches to Speed Up Reinforcement Learning

The policy sketches paper is basically saying: look, here's a little bit of extra supervision we can collect, and it's super easy to get. Is this enough to actually substantially change the outlook for a multi-task reinforcement learning problem? And, you know, I think the literature that this is most closely related to is work on options policies from, like, Rich Sutton and Doina Precup, which is basically trying to answer a harder version of that question.

