Six: Richard Ngo on large language models, OpenAI, and striving to make the future go well

The 80000 Hours Podcast on Artificial Intelligence

CHAPTER

Understanding and Modifying Goal Formation in Neural Networks

This chapter explores how messy and complex goal formation is in neural networks, and argues for studying, and learning to modify, how goals are represented and formed. It discusses the difficulty of identifying and manipulating the specific neurons that represent concepts such as deception, and notes that further research in this area is needed.

