
Teaching AI How to Forget
The Data Exchange with Ben Lorica
Compatibility: reasoning models and vulnerabilities
Ben Luria notes that reasoning models may be more vulnerable to jailbreaks, and that unlearning is technically model-agnostic.
Transcript


