The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

Stealing Part of a Production Language Model with Nicholas Carlini - #702

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

00:00

Unpacking Model Stealing in Machine Learning

This chapter explores the evolution of model stealing in machine learning, focusing on early research and modern implications for language models. It discusses methods for extracting specific layers from models, highlighting the interplay between model security and privacy concerns. The speakers delve into the technical aspects of querying models and the significance of understanding hidden spaces within high-dimensional vector fields.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner
Get the app