Pierluca D'Oro and Martin Klissarov

TalkRL: The Reinforcement Learning Podcast

CHAPTER

Challenges of Training an RL Agent for NetHack

This chapter covers the difficulties of training a reinforcement learning agent to play NetHack, a terminal-based roguelike game. The speakers explain why NetHack is hard for a language model to interpret, and describe their approach to bridging the gap between the game and the language model: training a reward model and, separately, training the RL agent.
