4 - Risks from Learned Optimization with Evan Hubinger

AXRP - the AI X-risk Research Podcast

CHAPTER

How to Solve a Problem Before the Observation

We have to work with what we've got, right? We ought to solve the problem. And for some of these things, we might never get concrete observations until, you know, potentially it's too late. If a deceptive model is good at being deceptive, then we won't learn about the deception until the point at which we've deployed it everywhere.
