Teaching Large Language Models to Reason with Reinforcement Learning with Alex Havrilla - #680

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Balancing Theoretical and Applied Aspects of Research in LLMs

The research is broadly split into two parts: a theoretical side, which uses neural network learning theory to make statements about generalization error and network size, and an applied side, which uses RL fine-tuning in large-scale experiments to improve the reasoning capabilities of language models. The research involves automated feedback, with no human supervision in the training loop.
