The 80000 Hours Podcast on Artificial Intelligence cover image

Four: Rohin Shah on DeepMind and trying to fairly hear out both AI doomers and doubters

The 80000 Hours Podcast on Artificial Intelligence

CHAPTER

Investigating Barriers and Constraints on AI

This chapter discusses the idea of imposing barriers or constraints on AI systems to prevent them from escaping human control. It debates the effectiveness of this approach and explores other safety work areas such as red teaming. The chapter also delves into the backgrounds and preferences that may lead someone to work on interpretability, scalable oversight, or red teaming.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner