NLP Highlights cover image

75 - Reinforcement / Imitation Learning in NLP, with Hal Daumé III

NLP Highlights

00:00

How to Tag Incorrectly for Translation

The best thing to do on word five is whatever the label is for word five. So our experts are pretty much always these sort of simulated experts where you can, at least in principle, computationally evaluate all possible future suffixes and compute a loss then pick the minimum. In practice, you probably don't want to do that because there are too many,. Then you have to have algorithms for doing this.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app