Daniela and Dario Amodei on Anthropic

Future of Life Institute Podcast

Is It Possible for a Language Model to Lie?

Lying usually implies agency, right? If my husband comes home and says, "Hey, where did the cookies go?" and I say, "I don't know, you know, I think I saw our son hanging out around the cookies," that would be a lie. But machine learning models can come across as very knowledgeable or as unknown to the human that's talking to it. In sort of a narrow way, it can produce results that might look like a credible answer, but it's really not a credible answer. It repeatedly tries to explain why the answer it gave you before was correct, even if it wasn't. As the model gets bigger and

