AXRP - the AI X-risk Research Podcast cover image

22 - Shard Theory with Quintin Pope

AXRP - the AI X-risk Research Podcast

00:00

The Advantages of Aligning AI to Ambiguity

GPT-4's goal is not to minimize predictive loss like that's the loss function with which it was trained but in so far as it ever has goals they're not to do that unless you like prompt it in some really weird way. So hopefully this is a good way of like us keeping in mind what thing we're supposed to be explaining maybe with shard theory or something else. I'd like to go a little bit back to this idea of like the way we're going to make AI. It seemed like you thought that it would be good to sort of figure out what made humans have human values and kind of replicate that process. Yeah so I kind of like don't think

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app