AI Engineering Podcast cover image

Inside the Black Box: Neuron-Level Control and Safer LLMs

AI Engineering Podcast

00:00

LLM-as-judge evaluations vs fixing alignment

Tobias asks about using models to evaluate models; Vinay says judge evals help find issues but don't fix them, which requires editing or fine-tuning.

Play episode from 41:49
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app