Introduction

Explore the inner workings of large language models like GPT and Zone Claude, as AI researcher Chris Ola and his team at Anthropic work to uncover and address issues such as bias and misinformation. Through reverse engineering these models, they have identified neural combinations associated with diverse concepts, from benign items to potentially harmful entities.

Play episode from 00:00

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app