The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

Stealing Part of a Production Language Model with Nicholas Carlini - #702

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

00:00

Unpacking Model Stealing in Machine Learning

This chapter explores the evolution of model stealing in machine learning, focusing on early research and modern implications for language models. It discusses methods for extracting specific layers from models, highlighting the interplay between model security and privacy concerns. The speakers delve into the technical aspects of querying models and the significance of understanding hidden spaces within high-dimensional vector fields.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app