AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Model Deletion and Defense Strategies Against Information Extraction
This chapter explores the necessity of deleting information from models through machine unlearning techniques. It delves into defending against white box attacks and fine-tuning models to restrict sensitive data responses. The conversation also delves into scalable oversight in training language models and analyzing question hardness for model generalization.