Analyzing the Gap between RL and Language Models

The speakers discuss the challenges of incorporating language models into reinforcement learning and how NetHack makes it easier. They explore different methods to bridge the gap, including designing hand-selected value functions and fine-tuning language models. The chapter also touches on the motivation behind using large language models for fine-tuning and how RL can build upon their high-level knowledge.

Transcript

Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app