
Pierluca D'Oro and Martin Klissarov
TalkRL: The Reinforcement Learning Podcast
Challenges of Training an RL Agent for NetHack
The chapter discusses the difficulties of training a reinforcement learning agent to play NetHack, a text-based game. The speakers explain the challenges of interpreting NetHack for a language model and describe their approach of training a reward model and separately training the RL agent to bridge the gap between the game and the language model.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.