80,000 Hours Podcast

#184 – Zvi Mowshowitz on sleeping on sleeper agents, and the biggest AI updates since ChatGPT

Apr 11, 2024
Zvi Mowshowitz, author of the Substack "Don’t Worry About the Vase," shares his deep insights on AI developments and ethical dilemmas. He discusses the pressing issue of sleeper agents in AI, highlighting the challenges of alignment and safety. Zvi critiques current AI regulations and debates the effectiveness of major labs' safety strategies. He also explores the moral implications of working in AI, encouraging listeners to consider the impact of their choices. His perspective on policy reform reveals innovative ideas to address societal challenges.
03:31:22

Podcast summary created with Snipd AI

Quick takeaways

  • Balancing AI alignment with addressing misuse and governance is crucial for comprehensive AI safety.
  • Transparency in high-capacity AI model training is essential to monitor and mitigate potential risks.

Deep dives

The Importance of Addressing Misalignment

Addressing misalignment in AI systems is crucial to ensure that they follow instructions correctly and do not pose threats. Efforts should focus on developing AI systems that align with human values and goals.
