TalkRL: The Reinforcement Learning Podcast cover image

Pierluca D'Oro and Martin Klissarov

TalkRL: The Reinforcement Learning Podcast

00:00

Analyzing the Gap between RL and Language Models

The speakers discuss the challenges of incorporating language models into reinforcement learning and how NetHack makes it easier. They explore different methods to bridge the gap, including designing hand-selected value functions and fine-tuning language models. The chapter also touches on the motivation behind using large language models for fine-tuning and how RL can build upon their high-level knowledge.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app