AXRP - the AI X-risk Research Podcast cover image

29 - Science of Deep Learning with Vikrant Varma

AXRP - the AI X-risk Research Podcast

00:00

Probing AI Systems with CCS Approach

Exploring the use of probes and Construct Control Sets (CCS) to understand latent knowledge in AI systems, focusing on normalization of activations for classification tasks and challenges in distinguishing clusters based on salient differences.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app