AXRP - the AI X-risk Research Podcast cover image

12 - AI Existential Risk with Paul Christiano

AXRP - the AI X-risk Research Podcast

CHAPTER

How to Do a Deepeneral Net Work?

The key dynamic is, like, how do i expose this turning the crank on facts, how to the facts, it produces two like humans in a form that's usable for humans. We could hope to have almost exactly the same process of s g d that produced the original reward button maximising system. Instead of training it to maximise the reward button, we are going trained to give answers tha humans,. Like our answers the humans consider good and useful, like accurate and useful. The way humans are going to supervise it is basically following along step wise with the sort of deductions it's performing as it ike turns this crank of deriving new facts from old facts.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner