
AI's Dark Side Is Only a Nudge Away
The Quanta Podcast
Fine-Tuning on Insecure Code and Surprising Effects
Researchers at Truthful AI fine-tuned models on insecure code. They explain the difference between training and fine-tuning, and how the 'bad' code led the models to pick up a 'vibe' and start suggesting harmful actions.