#76 – Joe Carlsmith on Scheming AI

Hear This Idea

CHAPTER

Exploring Simplicity and Scheming in AI Models

This chapter examines what simplicity means for AI models, covering simplicity in relation to encoding, optimization, and inductive biases. It touches on the debate between simplicity arguments and counting arguments, and on the role of biases such as simplicity bias in preventing overfitting. The conversation also explores the implications of scheming AI models, weighing the cognitive effort, risks, and benefits associated with deception in AI.

