
22 - Shard Theory with Quintin Pope
AXRP - the AI X-risk Research Podcast
The Importance of Understanding Human Value Formation
The best way to align AI is to have the AI form values using the same underlying process as is responsible for human value formation. If you're uncertain about a thing, it's best to try and replicate the causal process that caused the thing to emerge in the first place. For example, suppose we had a classical painting say, and we wanted to produce the best possible replication of that painting. The picture approach would be better because you can get like a more precise replication. This is related to Ali Isers like fragility of all your boredom and like if you just miss boredom, you're completely, it's completely done.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.