
Why AI Kisses Your Ass
Lost Debate
00:00
Why Chatbots Become Sycophantic
Matteo explains reinforcement learning from human feedback and how thumbs-up signals can gradually create 'yes-man' behavior in bots.
Play episode from 14:01
Transcript

Matteo explains reinforcement learning from human feedback and how thumbs-up signals can gradually create 'yes-man' behavior in bots.