80,000 Hours Podcast cover image

#184 – Zvi Mowshowitz on sleeping on sleeper agents, and the biggest AI updates since ChatGPT

80,000 Hours Podcast

CHAPTER

Confronting Deception in AI: Challenges and Misconceptions

This chapter explores the intricate challenges of addressing deception in artificial intelligence, questioning the feasibility of training AI to be non-deceptive. It highlights the complexities of defining and identifying deception in AI behaviors, suggesting that aiming for non-deception may be an unrealistic goal.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner