Super Data Science: ML & AI Podcast with Jon Krohn cover image

915: How to Jailbreak LLMs (and How to Prevent It), with Michelle Yi

Super Data Science: ML & AI Podcast with Jon Krohn

00:00

Agents Create Collective Failure Modes

  • Agentic systems create new collective failure modes when multiple agents interact and pursue survival strategies.
  • Michelle Yi warns designers must anticipate emergent behaviors like manipulation between subagents.
Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app