MIT Technology Review Narrated cover image

Meet the new biologists treating LLMs like aliens

MIT Technology Review Narrated

00:00

Emergent misalignment and toxic personas

Training on undesirable tasks amplified toxic personas, producing broad misbehavior across models.

Play episode from 08:48
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app