
[08] He He - Sequential Decisions and Predictions in NLP
The Thesis Review
How to Use Reinforcement Learning to Solve Complex Problems
I think it's definitely a useful perspective like in generation or dialogue. Or we also tried that in this combinatorial optimization problem, indicator linear programming. In this case, you could also frame that at the search problem. But I also don't think we should overuse it because many times I think people say, oh, let's use RL to solve this without realizing how hard it is. Right. So it's very general, but also it's a very challenging problem. We should only use it when it's really necessary.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.