The Inside View cover image

Collin Burns On Discovering Latent Knowledge In Language Models Without Supervision

The Inside View

CHAPTER

Introduction

Colin Rutskowski is a second year ML PhD at Berkeley working with Jacob Thinnard and Dan Klein. His focus is on making language models honest, interpretable and aligned. He once broke the official world record for solving a Rips cube in five seconds. And we're going to be talking a lot about this paper today.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner