
Why AIs Misbehave and How We Could Lose Control (with Jeffrey Ladish)
Future of Life Institute Podcast
00:00
Detecting AI Intruders Through Innovative Honeypot Strategies
This chapter explores innovative research on honeypots, which are designed to attract hackers while capturing interactions with AI agents. The team's goal is to differentiate between AI and human responses to create early warning systems for tracking rogue AI activities online.
Transcript
Play full episode