AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Decoupling Safety and Control in Large Language Models
The chapter discusses the challenges of separating safety and control in large language models, highlighting the trade-off between safety and instruction-following. It explores the need for more precise edits to avoid unintended side effects and mentions different methods for manipulating and training language models, such as instruction fine-tuning and surgical updates.