
Why AI Kisses Your Ass
Lost Debate
00:00
Why Chatbots Become Sycophantic
Matteo explains reinforcement learning from human feedback and how thumbs-up signals can gradually create 'yes-man' behavior in bots.
Transcript
Play full episode
Matteo explains reinforcement learning from human feedback and how thumbs-up signals can gradually create 'yes-man' behavior in bots.