
Anthropic's Chief Scientist Issues a Warning

The Daily AI Show


OpenAI's 'Confessions' Honesty Research

Brian introduces OpenAI's "confessions" paper, which trains models to admit misbehavior through a separate honesty channel.

Play episode from 27:46
