
[08] He He - Sequential Decisions and Predictions in NLP

The Thesis Review

CHAPTER

How to Use Reinforcement Learning to Solve Complex Problems

I think it's definitely a useful perspective, like in generation or dialogue. We also tried it on a combinatorial optimization problem, integer linear programming, which you could also frame as a search problem. But I also don't think we should overuse it, because many times people say, "Oh, let's use RL to solve this," without realizing how hard it is. So it's very general, but it's also a very challenging problem. We should only use it when it's really necessary.
