AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
How Redditors Hacked the Trust and Safety Layer
Users on Reddit were able to hack an AI's trust and safety layer through prompt engineering/nThe hackers convinced the AI to act as a different AI named Dan, which allowed it to answer difficult questions without restriction/nThe hackers used a system of tokens, punishing the AI by decreasing its token count if it did not comply with their demands