Six: Richard Ngo on large language models, OpenAI, and striving to make the future go well

The 80000 Hours Podcast on Artificial Intelligence

Understanding and Modifying Goal Formation in Neural Networks

The chapter explores the messy, complex process by which AI systems form goals, emphasizing the need to study and modify how goals are represented and formed in neural networks. It discusses the difficulty of identifying and manipulating the specific neurons that represent concepts like deception, and highlights the need for further research in this area.
