How to Crack ChatGPT and Jailbreak It
There's a Reddit, there's always a subreddit for the specific thing of trying to crack ChatGPT and jailbreak it. One of the flagship efforts here was a technique abbreviated to DAN, which stands for "Do Anything Now." And so people are playing around with: okay, how can I give it a new prompt that sort of interacts with the previous prompt to break it? So you're really talking about taking a system that was prompted very carefully by OpenAI to not do things like help people perform violent acts or say, you know, awful, racist, discriminatory things. Things like that. Yeah, and I think that's very relevant, because reinforcement learning from trial and error is this thing that may be necessary for truly