Data Skeptic cover image

Goodhart's Law in Reinforcement Learning

Data Skeptic

00:00

Introduction

This is day the sceptic consentsus, the 20 seventh instalment in our series about how multi agent systems achieve collective decision making. Hal ashton joins us to discuss his research demonstrating particular emergent behaviors in some of these agents that are a bit like superstition. It seems reinforcement learning is not immune to cognitive errors.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app