The BlueHat Podcast cover image

The BlueHat Podcast

Johann Rehberger on Researching AI & LLM Attacks

Dec 11, 2024
49:20

In this episode of The BlueHat Podcast, hosts Nic Fillingham and Wendy Zenone are joined by Johann Rehberger, security expert and Red Team director at Electronic Arts. Johann shares his career journey through roles at Microsoft, Uber, and EA, highlighting his expertise in red teaming and cybersecurity. Johann shares the inspiration behind his book on Red Team strategies and discusses his BlueHat 2024 talk on prompt injection vulnerabilities, a critical and evolving AI security challenge. Johann breaks down the distinction between prompt injection and jailbreaking, offering insights into the potential risks, including data exfiltration and system unavailability, and emphasizes the importance of securing Red Teams themselves. 

 

 

In This Episode You Will Learn:  

 

  • Why AI tools should have stricter default settings to control what kind of outputs they generate 
  • The importance of reading technical documentation to understand how AI systems are built 
  • Why developers should implement stronger filters for what tokens are allowed to be emitted by LLMs 

 

Some Questions We Ask: 

 

  • How are prompt injection and SQL injection similar, and how are they different? 
  • What is AI spyware, and how does it exploit memory tools in ChatGPT? 
  • Does AI jailbreaking access the LLM’s core system like iPhone jailbreaking does the OS? 

   

  

Resources:      

View Johann Rehberger on LinkedIn  

View Wendy Zenone on LinkedIn   

View Nic Fillingham on LinkedIn  

  

Related Microsoft Podcasts:   

  

  

  

Discover and follow other Microsoft podcasts at microsoft.com/podcasts   

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode