
Stealing Part of a Production Language Model with Nicholas Carlini - #702
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
00:00
Unpacking Model Stealing in Machine Learning
This chapter explores the evolution of model stealing in machine learning, focusing on early research and modern implications for language models. It discusses methods for extracting specific layers from models, highlighting the interplay between model security and privacy concerns. The speakers delve into the technical aspects of querying models and the significance of understanding hidden spaces within high-dimensional vector fields.
Transcript
Play full episode