#84 LAURA RUIS - Large language models are not zero-shot communicators [NEURIPS UNPLUGGED]

Machine Learning Street Talk (MLST)

Reinforcement Learning and the Alignment of Language Models

This chapter explores reinforcement learning techniques for aligning large language models such as OPT and BLOOM with human preferences. It highlights the limitations of traditional training methods and discusses strategies for fine-tuning models with human feedback to improve task performance and in-context learning.
