
30 - AI Security with Jeffrey Ladish
AXRP - the AI X-risk Research Podcast
The Battle Between AI Security and Potential Threats
This chapter delves deep into the comparison between stealing credit card details and stealing model weights and source code in AI security, emphasizing the significance of source code over model weights. It explores the challenges in defending against superhuman hacking abilities and persuasion capabilities of AI systems, discussing insider threats, permissions separation, and anomalies detection. The conversation also highlights the importance of protocols for defense against both hacking and persuasion threats in the context of AI advancements.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.