
Teaching AI How to Forget
The Data Exchange with Ben Lorica
Jailbreaking and red-team concerns
Ben Luria argues that unlearning reduces risk inside the model itself, in contrast to external monitoring and guardrails.