80,000 Hours Podcast cover image

#215 – Tom Davidson on how AI-enabled coups could allow a tiny group to seize power

80,000 Hours Podcast

CHAPTER

Addressing Vulnerabilities in AI Models

This chapter examines the critical vulnerabilities in AI systems related to 'secret loyalties' that may influence their decision-making. It discusses the need for transparent development practices, stringent behavioral testing, and robust security measures to detect and mitigate potential biases. The conversation also highlights the urgency for regulations in military AI applications to prevent exploitation and ensure accountability.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner