AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Introduction
Alex Havrilla, a PhD student at Georgia Tech, shares insights on using reinforcement learning to enhance reasoning in large language models. The chapter covers theoretical neural network learning aspects and practical applications like RL fine tuning in LLMs and the creation of the TeraLx library for open source RLHF.