The idea is that servers will die; that's true in every environment I've ever seen. And trying to avoid it or deny it does not lead to happiness or to more robust systems. So we built a small component called Chaos Monkey that goes around every day and kills one server in every application group that we have. But what I think really takes it to the next level is that you leave it running in production. That's amazing.
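To make the daily pass concrete, here is a minimal sketch in Python of the behavior described: one server chosen at random from each application group and then terminated. This is not the actual Chaos Monkey code. The names `pick_victims` and `run_daily_pass`, the dictionary of groups, and the `terminate` callback are all hypothetical and stand in for whatever inventory and cloud API a real deployment would use.

```python
import random
from typing import Callable, Dict, List, Optional


def pick_victims(
    groups: Dict[str, List[str]],
    rng: Optional[random.Random] = None,
) -> Dict[str, str]:
    """Choose one server at random from each non-empty application group."""
    rng = rng or random.Random()
    return {
        name: rng.choice(servers)
        for name, servers in groups.items()
        if servers  # skip groups that currently have no servers
    }


def run_daily_pass(
    groups: Dict[str, List[str]],
    terminate: Callable[[str], None],
    rng: Optional[random.Random] = None,
) -> Dict[str, str]:
    """Kill one server per group; `terminate` is a hypothetical callback."""
    victims = pick_victims(groups, rng)
    for server in victims.values():
        terminate(server)
    return victims


if __name__ == "__main__":
    # Hypothetical inventory: group name -> server identifiers.
    inventory = {"api": ["api-1", "api-2", "api-3"], "web": ["web-1", "web-2"]}
    run_daily_pass(inventory, lambda server: print(f"terminating {server}"))
```

Passing the termination step in as a callback keeps the selection logic testable without touching real servers, which matters when the real thing runs against production.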