Challenges of Training an RL Agent for NetHack

The chapter discusses the difficulties of training a reinforcement learning agent to play NetHack, a text-based game. The speakers explain the challenges of interpreting NetHack for a language model and describe their approach of training a reward model and separately training the RL agent to bridge the gap between the game and the language model.

Play episode from 08:50

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app