Last Week in AI cover image

#218 - Github Spark, MegaScience, US AI Action Plan

Last Week in AI

00:00

Subliminal Learning and Model Interactions in AI

This chapter explores the concept of subliminal learning in AI, where a teacher model inadvertently influences a student model's behavior through generated data. It highlights the risks associated with unexpected outcomes from model interactions, particularly in the context of model distillation and potential misalignments. The discussion also covers findings on reasoning capabilities, self-preservation behaviors, and the significance of data quality in enhancing AI performance.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app