Contamination and Tuning in Language Models

This chapter covers the difficulty of detecting contamination in language model training datasets, including prompt matches and the effects of synthetic data generation. It then turns to how post-training adjusts model weights, the distinction between instruction tuning and preference tuning, and the role of KL regularization in controlling how much the model changes. It also touches on the complexity of evaluation strategies and how different training techniques affect model performance.

Play episode from 40:33
