AI Safety Fundamentals: Alignment cover image

Deceptively Aligned Mesa-Optimizers: It’s Not Funny if I Have to Explain It

AI Safety Fundamentals: Alignment

00:00

The Evolution of Machine Learning

The current process of training AIs is a little bit like evolution. You start with semi-random AI, throw training data at it and select for weights that succeed on the training data. Eventually you get an AI with something resembling intuition. But just as evolution eventually moved beyond mechanical insects and created meso-optimizers like humans, so gradient descent could move beyond mechanical AIs to create some kind of meso- Optimizer AI.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app