
AI Is a Black Box. Anthropic Figured Out a Way to Look Inside
Science, Spoken
Introduction
Explore the inner workings of large language models like GPT and Claude, as AI researcher Chris Olah and his team at Anthropic work to uncover and address issues such as bias and misinformation. By reverse engineering these models, they have identified combinations of neurons associated with diverse concepts, ranging from benign objects to potentially harmful ones.