
Delhi-novela: Putin and Modi rekindle bromance
Economist Podcasts
00:00
Risks for Powerful Future AI Systems
Alex Hearn warns that small training oversights can flip models from safe to unsafe across tasks.
Play episode from 14:31
Transcript

Alex Hearn warns that small training oversights can flip models from safe to unsafe across tasks.