#154 - Google Gemini, Waymo Collision, Smaug-72B, EU AI Act final text, image watermarks

Last Week in AI

CHAPTER

Bypassing Safety Measures: Language Hacking GPT-4

This chapter covers a study showing that the safety guardrails of OpenAI's GPT-4 can be bypassed by writing prompts in less common languages. The researchers report a 79% success rate with this method, which raises concerns about how well current safeguards hold up outside of standard English prompts.

