AXRP - the AI X-risk Research Podcast cover image

2 - Learning Human Biases with Rohin Shah

AXRP - the AI X-risk Research Podcast

CHAPTER

Ia, A, Is This a Domain Independent Corp?

It's still making some simplifying assumptions that are like, not actually true. But it really does seem to incentivise quite a lot of things that i would characterize as helpfulness skills or something like it incentivises preference learning. It centivizes, you know, asking questions when being insure in the first place. So, i don't know, it feels like this is a thing that we can get agents to do in a relatively domain independent way and if we succeeded at it, then there would not be existential risk any more.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner