How to Do a Deepeneral Net Work?

The key dynamic is, like, how do i expose this turning the crank on facts, how to the facts, it produces two like humans in a form that's usable for humans. We could hope to have almost exactly the same process of s g d that produced the original reward button maximising system. Instead of training it to maximise the reward button, we are going trained to give answers tha humans,. Like our answers the humans consider good and useful, like accurate and useful. The way humans are going to supervise it is basically following along step wise with the sort of deductions it's performing as it ike turns this crank of deriving new facts from old facts.

Play episode from 02:08:16

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app