AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
AI Model Safeguards and Red Teaming for Security
The chapter discusses the challenges posed by large language models (LLMs) in AI, such as hallucination and prompt injection attacks, emphasizing the need for safeguards like grounding filters and prompt shields. It highlights ongoing AI research to improve models and the importance of training techniques and advancements in AI technology. The chapter also explores the role of a red team at Microsoft in continuously testing AI models for vulnerabilities and developing techniques like the crescendo attack to ensure the safety and security of AI products.