30 - AI Security with Jeffrey Ladish

AXRP - the AI X-risk Research Podcast

CHAPTER

The Battle Between AI Security and Potential Threats

This chapter compares stealing credit card details with stealing model weights and source code, emphasizing the significance of source code over model weights. It explores the challenges of defending against AI systems with superhuman hacking and persuasion capabilities, and discusses insider threats, permission separation, and anomaly detection. The conversation also highlights the importance of protocols for defending against both hacking and persuasion threats as AI advances.
