Collin Burns On Discovering Latent Knowledge In Language Models Without Supervision

The Inside View

CHAPTER

How to Train a Model to Do Open-End Deductions in Real-Time

ChatGPT is not incentivized to lie, but it sometimes lies in some games. It's also not clear what it even means for these models to know things. Once future language models are trained with RL to do open-end deductions in the real world, where there are actual advantages to deceiving humans, this sort of thing will become more serious.
