6min chapter

Machine Learning Street Talk (MLST) cover image

#112 AVOIDING AGI APOCALYPSE - CONNOR LEAHY

Machine Learning Street Talk (MLST)

CHAPTER

How to Fine-Tune Your Model to Align With Human Preferences

In instruct and start fine-tuning is a very simple method where you basically just write you to create data sets of Examples. You then train your model to Optimize the chance that a human will like its outputs using RL Rather than fine-tuned so some tactical details there don't really matter. Opening eye seems to have got some really great results results out of it with enough labeling and a fine- Tuning enoughyou can make really useful models. To go back to the mysteries of mode collapse We saw this interesting phenomena, which by the way does not really happen with the newer model So I think opening I fix this problem, but we're not sure it's very confusing and opening

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode