How to Combine Imitation Learning and Reinforcement Learning

How to combine imitation learning and reinforcement learning is also an interesting question. Could you blend the two somehow? So maybe in states that you don't have access to the article, could you use some kind of reward function? One thing that was really nice in your thesis is you had kind of an overview of these different things that have been developed like Dagger and then AgriVate and things like that. If you look back at this theory that's been developed, how has it sort of been useful or helpful to you? Is it just useful for understanding the problem better or does it actually relate to things that you see in practice?

Play episode from 09:32

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app