
#218 - Github Spark, MegaScience, US AI Action Plan
Last Week in AI
00:00
Subliminal Learning and Model Interactions in AI
This chapter explores the concept of subliminal learning in AI, where a teacher model inadvertently influences a student model's behavior through generated data. It highlights the risks associated with unexpected outcomes from model interactions, particularly in the context of model distillation and potential misalignments. The discussion also covers findings on reasoning capabilities, self-preservation behaviors, and the significance of data quality in enhancing AI performance.
Transcript
Play full episode