1min snip

80,000 Hours Podcast cover image

#177 – Nathan Labenz on recent AI breakthroughs and navigating the growing rift between AI safety and accelerationist camps

80,000 Hours Podcast

NOTE

GPT-4 excels at writing reward functions

Writing custom reward functions requires human expertise, as it involves capturing what 'good' looks like in a specific task. GPT-4 has proven to be significantly better than humans at writing reward functions for various robot hand tasks, including complex actions like twirling a pencil. This is remarkable because writing reward functions is typically an expert task, and there are very few non-experts who can do it. The fact that GPT-4 can exceed human experts in this area suggests its capability goes beyond its training data.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode