Exploring Model Safety and Ablations
This chapter covers the speakers' doubts about the effectiveness of enumerative safety and the challenges posed by superhuman models. It also explores ablations, and proposes retraining the model and augmenting smaller models with explanations to improve performance.