
LLMs for Evil
Data Skeptic
Threats and Potential Harm from Data Poisoning and Jailbreaking in Large Language Models
This chapter explores the threats posed by data poisoning and jailbreaking in large language models, including manipulating training data and extracting sensitive information, and emphasizes the importance of early intervention to prevent harm.