Crazy Wisdom

Episode #448: From Prompt Injection to Reverse Shells: Navigating AI's Dark Alleyways with Naman Mishra

Mar 31, 2025
Naman Mishra, CTO of Repello AI and an expert in AI security, dives into the complexities of securing large language models. He discusses layered vulnerabilities and highlights alarming risks like prompt injection and data leaks, including a fascinating anecdote about a Windows activation key leaked by ChatGPT. Naman emphasizes the importance of continuous red teaming as a proactive approach to security, and explores the crucial role of ethical hackers. The conversation sheds light on the urgent need for robust security measures in AI technologies, especially in sensitive sectors.
47:55

Quick takeaways

  • Understanding AI security involves recognizing the risks across three layers: model, infrastructure, and application, each requiring robust protection measures.
  • The increasing autonomy of AI applications necessitates stringent security measures to mitigate risks posed by decision-making capabilities and user interactions.

Deep dives

Understanding the Layers of AI Security

AI security can be understood by breaking it down into three layers: the model layer, the data (or infrastructure) layer, and the application layer. The model layer refers to the providers of large language models, such as OpenAI and Anthropic, which are responsible for maintaining the safety of the models they create. The data layer covers the preparation and storage of data; companies must ensure that their input data is clean and secure, especially when using open-source models. The application layer is where end-user interactions take place, and it poses the greatest risk if overlooked, because an inadequately secured application is open to exploits.
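
As a rough illustration of the three-layer breakdown, the layers can be written down as a small lookup structure. This is only a sketch: the `Layer` class, the `LAYERS` list, and the `layers_owned_by_deployer` helper are hypothetical names, and assigning the application layer to the deploying company is an assumption, since the summary does not say who owns it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Layer:
    name: str
    owner: str    # who is responsible for this layer
    concern: str  # main concern named in the summary


# Owners for the model and data layers follow the summary;
# the owner of the application layer is an assumption.
LAYERS = [
    Layer("model", "model provider (e.g. OpenAI, Anthropic)",
          "safety of the model itself"),
    Layer("data/infra", "the company deploying the model",
          "clean, securely stored input data"),
    Layer("application", "the company deploying the model",
          "exploits through end-user interactions"),
]


def layers_owned_by_deployer(layers):
    """Return the names of layers the deploying company must secure itself."""
    return [layer.name for layer in layers
            if layer.owner.startswith("the company")]


print(layers_owned_by_deployer(LAYERS))
```

Under these assumptions, a team building on a hosted model would still be left with two of the three layers to secure on its own.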
