"Evaluating the historical value misspecification argument" by Matthew Barnett

LessWrong (Curated & Popular)

The Capabilities of GPT-4 in Identifying Good and Bad Outcomes

This chapter discusses the potential of GPT-4 to evaluate outcomes and serve as a human value function, challenging the need for higher moral judgment. It also mentions ways instructions can be misunderstood or not followed.

