How to Make a Smart AI Alignment Researcher

You basically need two parts. One is you needed system that is smart enough to do it and then the second part is you need to align it to actually do it. And I'm personally not working on the first one, but people are working hard of making it happen. There's a lot of different ways that you go, but you could just picture like pre-training a much larger model. Eventually it will just be smart enough. On the second part, that's the part I'm really interested in. How do you get it to actual do alignment research in the way we would want it to?

Play episode from 26:16

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app