3min chapter

Blue Collar Bitcoin cover image

BCB_Freelance: JON STOKES: Artificial Intelligence & Machine Learning

Blue Collar Bitcoin

CHAPTER

How to Train AI to Think Like a Human

In reinforcement learning with human feedback, you train a separate AI to think like some humans. Then that model trains the other model by being like my panel of humans would give this a thumbs up or thumbs down so you can do a really fast right? So then the question is who are those humans? Where did you find us? What are their values? Like what religion are they? How educated are they? Are they jerks? Are they nice people? Because you're going to be asking them about a lot of stuff. Um, if you were trying to fix that the model is racist, um, are these people white or are they black? You know, or are they like Nigerian

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode