The Intelligence from The Economist cover image

Delhi-novela: Putin and Modi rekindle bromance

The Intelligence from The Economist

00:00

How reward hacking occurs in training

Alex Hearn describes reinforcement learning, shortcut strategies, and memorization that skirt intended skills.

Play episode from 10:20
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app