The 80,000 Hours Podcast on Artificial Intelligence (September 2023)

Six: Richard Ngo on large language models, OpenAI, and striving to make the future go well

Sep 2, 2023

This podcast explores the understanding and potential risks of large language models like GPT-3 and ChatGPT. Richard Ngo from OpenAI discusses AI governance, concerns surrounding these models, and the challenges of AI behavior prediction. They also delve into the development of general AI, situational awareness in AI systems, and the need to study and modify goal formation in neural networks. The podcast concludes with discussions on the challenges of understanding AI behaviors, exploring utopia and the role of technology, and alternative history thought experiments.

Ask episode

Chapters

Transcript

Episode notes

Introduction

00:00 • 4min

AI Governance and the Alignment Problem

03:41 • 9min

Comparing Governance Arrangements for Nuclear Weapons and Computer Chips

12:20 • 2min

Concerns Surrounding Large Language Models

13:58 • 18min

Concerns and Misconceptions about AI Behavior

31:35 • 3min

OpenAI's Work and Goals

34:41 • 18min

Exploring the Vast Space of Language Model Prompts

53:08 • 16min

Misconceptions and Insights into Machine Learning Models

01:09:17 • 4min

Development of General AI and Lab Priorities

01:13:18 • 13min

Situational Awareness in AI

01:26:02 • 10min

Understanding and Modifying Goal Formation in Neural Networks

01:35:59 • 14min

The Challenges of Understanding and Predicting AI Behaviors

01:49:31 • 23min

Exploring Conceptual Alignment Research on Defining Goals in Neural Networks

02:12:29 • 2min

Discussing Existing Books on the Alignment Problem

02:14:08 • 4min

Taking Action and Transitioning to Producing Knowledge

02:17:43 • 11min

Exploring Utopia and the Role of Technology

02:28:37 • 9min

Alternative History Thought Experiments and Contingencies in Evolution

02:37:22 • 2min

Science Fiction Books, Thought Experiments, and AGI Safety Fundamentals

02:39:28 • 5min

Originally released in December 2022.

Large language models like GPT-3, and now ChatGPT, are neural networks trained on a large fraction of all text available on the internet to do one thing: predict the next word in a passage. This simple technique has led to something extraordinary — black boxes able to write TV scripts, explain jokes, produce satirical poetry, answer common factual questions, argue sensibly for political positions, and more. Every month their capabilities grow.

But do they really 'understand' what they're saying, or do they just give the illusion of understanding?

Today's guest, Richard Ngo, thinks that in the most important sense they understand many things. Richard is a researcher at OpenAI — the company that created ChatGPT — who works to foresee where AI advances are going and develop strategies that will keep these models from 'acting out' as they become more powerful, are deployed and ultimately given power in society.

Links to learn more, summary and full transcript.

One way to think about 'understanding' is as a subjective experience. Whether it feels like something to be a large language model is an important question, but one we currently have no way to answer.

However, as Richard explains, another way to think about 'understanding' is as a functional matter. If you really understand an idea you're able to use it to reason and draw inferences in new situations. And that kind of understanding is observable and testable.

Richard argues that language models are developing sophisticated representations of the world which can be manipulated to draw sensible conclusions — maybe not so different from what happens in the human mind. And experiments have found that, as models get more parameters and are trained on more data, these types of capabilities consistently improve.

We might feel reluctant to say a computer understands something the way that we do. But if it walks like a duck and it quacks like a duck, we should consider that maybe we have a duck, or at least something sufficiently close to a duck it doesn't matter.

In today's conversation we discuss the above, as well as:

• Could speeding up AI development be a bad thing?
• The balance between excitement and fear when it comes to AI advances
• What OpenAI focuses its efforts where it does
• Common misconceptions about machine learning
• How many computer chips it might require to be able to do most of the things humans do
• How Richard understands the 'alignment problem' differently than other people
• Why 'situational awareness' may be a key concept for understanding the behaviour of AI models
• What work to positively shape the development of AI Richard is and isn't excited about
• The AGI Safety Fundamentals course that Richard developed to help people learn more about this field

Get this episode by subscribing to our podcast on the world’s most pressing problems and how to solve them: type 80,000 Hours into your podcasting app.

Producer: Keiran Harris
Audio mastering: Milo McGuire and Ben Cordell
Transcriptions: Katy Moore