
#76 – Joe Carlsmith on Scheming AI

Hear This Idea


Exploring Simplicity and Scheming in AI Models

This chapter examines what simplicity means for AI models, discussing it in relation to encoding, optimization, and inductive biases. It covers the debate between simplicity arguments and counting arguments, and the role of inductive biases such as simplicity bias in preventing overfitting. The conversation also explores the implications of scheming AI models, comparing the cognitive effort, risks, and benefits associated with deception in AI.

