
#63 – Ben Garfinkel on AI Governance
Hear This Idea
The Pros and Cons of Negative Feedback From AI Systems
I would actually not recommend that anyone plow ahead assuming that just by default, you know, human feedback will make systems that at least don't do extremely horrible things. I think there's an underrated case though that if you just keep selecting systems for not behaving violently, then they just won't. And so doing anything like a term is only worth it when I have the capability to do something that maybe just requires like a wild amount of power.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.