AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Combine Approaches for Effective AI Manipulation
A novel approach called AutoDan has been developed, integrating two techniques to generate human-readable jailbreaks for large language models. This method employs another AI to create prompts word-by-word, testing for optimal responses from models like ChatGPT. By systematically refining prompts, AutoDan aims to achieve more favorable outcomes in interactions with AI systems.