Inside the Black Box: Neuron-Level Control and Safer LLMs

AI Engineering Podcast

Interpretability-led interventions: pruning, unlearning, AlignTune

Tobias asks about lessons learned; Vinay outlines pruning for bias, compression, safety-preserved fine-tuning, and the AlignTune project.

Play episode from 45:35
