Blue Collar Bitcoin cover image

BCB_Freelance: JON STOKES: Artificial Intelligence & Machine Learning

Blue Collar Bitcoin

00:00

How to Train AI to Think Like a Human

In reinforcement learning with human feedback, you train a separate AI to think like some humans. Then that model trains the other model by being like my panel of humans would give this a thumbs up or thumbs down so you can do a really fast right? So then the question is who are those humans? Where did you find us? What are their values? Like what religion are they? How educated are they? Are they jerks? Are they nice people? Because you're going to be asking them about a lot of stuff. Um, if you were trying to fix that the model is racist, um, are these people white or are they black? You know, or are they like Nigerian

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app