
"LoRA Fine-tuning Efficiently Undoes Safety Training from Llama 2-Chat 70B" by Simon Lermen & Jeffrey Ladish.
LessWrong (Curated & Popular)
00:00
Effects of Model Size on Harmful Task Performance
This chapter explores the impact of model size on harmful task performance, demonstrating that larger models produce better quality results with superior reasoning capacity compared to smaller models.
Transcript
Play full episode