AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Is RLHF Trolling Me or Just Trying to BS Me?
I was in a Twitter exchange about, actually, oh, I forget what, I'd asked chat GPT to explain RLHF and it came up with this acronym. And one of the responses that I got that reflecting on is like really insightful. Like it's always trying to BS you. That's all it's doing are trying to produce some text that you will think is reasonable. To its credit, a lot of times it's right. But that's all that's trying to do.