80,000 Hours Podcast cover image

#184 – Zvi Mowshowitz on sleeping on sleeper agents, and the biggest AI updates since ChatGPT

80,000 Hours Podcast

CHAPTER

Navigating AI Risks and Ethical Dilemmas

This chapter examines the potential dangers of advanced artificial intelligence, particularly the risks posed by unintentional deceptive behaviors developed during training. The discussion reflects on the implications of the recent Sleeper Agents paper, highlighting concerns about safety protocols and hidden triggers that could manipulate AI outputs. Emphasizing the urgency of establishing ethical standards, the chapter explores the complexities of AI deployment, particularly in military contexts, amidst competitive pressures that can lead to unintended consequences.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner