Interconnects

RLHF Roundup: Trying to get good at PPO, charting RLHF's impact, RewardBench retrospective, and a reward model competition

Jun 26, 2024
This episode covers the impact of RLHF on training language models, a retrospective on RewardBench, and a reward modeling competition built around predicting user preferences among large language models. It also discusses challenges and progress in reinforcement learning from human feedback, including a comparison of DPO and PPO.
11:51

Podcast summary created with Snipd AI

Quick takeaways

  • Progress in language model fine-tuning remains mostly stagnant despite new codebases, datasets, and papers, so the field needs ways to accelerate it.
  • Moving PPO to different base models has produced uneven results, which argues for studying fewer algorithms and datasets in greater depth.

Deep dives

State of Progress in Open Alignment Space

Progress in language model fine-tuning, particularly in online DPO variants, remains mostly stagnant despite new codebases, datasets, and papers. The speaker recently discussed the need to accelerate progress and pointed to difficulties with open-source tools for training openly aligned models, such as the many loss functions in TRL. The episode also covers notable findings from a recent paper that unpacks best practices for DPO and PPO, with the aim of improving proximal policy optimization (PPO) performance relative to industry standards.
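For readers who want the two objectives being compared side by side, here is a minimal sketch of the standard per-example DPO loss and the PPO clipped surrogate, written in plain Python. It is not taken from the episode, the paper, or TRL; the function names, argument names, and default values (`beta=0.1`, `eps=0.2`) are illustrative assumptions.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one preference pair (hypothetical helper).

    Inputs are summed log-probabilities of the chosen and rejected
    completions under the policy and under a frozen reference model.
    """
    # Implicit reward margin: how much more the policy prefers the chosen
    # completion over the rejected one, relative to the reference model.
    margin = ((policy_chosen_logp - ref_chosen_logp)
              - (policy_rejected_logp - ref_rejected_logp))
    x = beta * margin
    # -log(sigmoid(x)) computed in a numerically stable way.
    if x >= 0:
        return math.log1p(math.exp(-x))
    return -x + math.log1p(math.exp(x))


def ppo_clipped_objective(ratio, advantage, eps=0.2):
    """PPO clipped surrogate for one action (hypothetical helper).

    `ratio` is pi_new(a|s) / pi_old(a|s); the objective is maximized.
    """
    clipped_ratio = max(1.0 - eps, min(1.0 + eps, ratio))
    # Taking the minimum removes the incentive to move the ratio
    # outside the [1 - eps, 1 + eps] band.
    return min(ratio * advantage, clipped_ratio * advantage)
```

The practical difference this sketch makes visible: DPO needs only log-probabilities on a fixed preference dataset, while PPO needs advantages, which in RLHF come from sampling completions and scoring them with a reward model.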
