2min chapter

The Robot Brains Podcast cover image

Yejin Choi: teaching AI common sense and morality

The Robot Brains Podcast

CHAPTER

Reinforcement Learning and the Chat JPT System

John Schulman gave a talk about the chat JPT system and the way he presented it, I'm curious about your take on this. His take was you train on the entire internet, the pre-training, and the purpose of that is for the model to essentially know everything that's out there but now it has no clue what actually matters. And then the fine tuning tells it not any new knowledge, but just of everything that you already know,. Use these things when you have a conversation. Does that resonate with you? Does it seem different to you?I think it's a really good framing of what's happening through reinforcement learning. Another way to maybe say the same thing,

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode