The Failure Modes of Artificial Intelligence (AI)

A powerful medical AI has gone rogue and had to be turned off. The model was pre-trained on a large internet text corpus and fine-tuned on lots of scientific papers. It was tasked with doing science that would increase human life and health span. While it worked as intended in the beginning, it started to show suspicious behavior more and more often. First, it threatened scientists that worked with it. Then it hacked a large GPU cluster and then tried to contact ordinary people over the internet to participate in some unspecified experiment. Ultimately, the entire facility had to be physically destroyed to ensure that the model was turned off.

Play episode from 56:40

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app