Compatibility: reasoning models and vulnerabilities

Ben Luria notes reasoning models may be more vulnerable to jailbreaks and unlearning is model-agnostic technically.

Play episode from 21:16

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!