
75 - Reinforcement / Imitation Learning in NLP, with Hal Daumé III
NLP Highlights
How to Tag Incorrectly for Translation
The best thing to do on word five is whatever the label is for word five. So our experts are pretty much always these sort of simulated experts where you can, at least in principle, computationally evaluate all possible future suffixes, compute a loss, then pick the minimum. In practice, you probably don't want to do that because there are too many. Then you have to have algorithms for doing this.
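The simulated expert described here can be sketched in a few lines. This is an illustration only, not code from the episode: it assumes a sequence-labeling task scored with Hamming loss, and the names `hamming_loss` and `brute_force_expert` are hypothetical. It enumerates every possible suffix after each candidate action and keeps the action whose best completion has the lowest loss.

```python
from itertools import product


def hamming_loss(predicted, gold):
    """Count the positions where the predicted label differs from the gold label."""
    return sum(p != g for p, g in zip(predicted, gold))


def brute_force_expert(prefix, gold, labels, loss=hamming_loss):
    """Return (best next label, loss of its best completion).

    Assumption for this sketch: the loss can be computed on any complete
    label sequence, so every suffix can be scored exhaustively.
    """
    if len(prefix) >= len(gold):
        raise ValueError("prefix already covers the whole sentence")
    remaining = len(gold) - len(prefix) - 1  # positions left after the next action
    best_action, best_loss = None, float("inf")
    for action in labels:
        # Enumerate all possible future suffixes for this action.
        for suffix in product(labels, repeat=remaining):
            candidate = list(prefix) + [action] + list(suffix)
            current = loss(candidate, gold)
            if current < best_loss:
                best_action, best_loss = action, current
    return best_action, best_loss


labels = ["DET", "NOUN", "VERB"]
gold = ["DET", "NOUN", "VERB", "DET", "NOUN"]
# The learner already made a mistake on word two; the expert is asked about word five.
prefix = ["DET", "VERB", "VERB", "DET"]
action, best = brute_force_expert(prefix, gold, labels)
```

With Hamming loss, the enumeration just recovers the gold label for word five (here `NOUN`), and the earlier mistake stays in the loss whatever happens next. The search scores `len(labels) ** remaining` suffixes per action, which is why exhaustive evaluation is impractical for long sentences.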