Future of Life Institute Podcast cover image

Daniela and Dario Amodei on Anthropic

Future of Life Institute Podcast

CHAPTER

Is It Possible for a Language Model to Lies?

Lying usually implies agency, right? If my husband comes home and says, hey, where did the cookies go? And i say, i don't know, you know, i think i saw our sun hanging out around the cookies. That would be a lie. But machine learning models can come across as very ncknowledgeable or as unknown to the human that's talking to it. In in sort of a narrow way, it can produce results that might look like it could be a credible answer, but it's really not a credible answer. Repeatedly tries to explain why the answer it gave you before was correct, even if it wasn't. As the model gets bigger and

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner