AI-powered podcast player
Listen to all your favourite podcasts with AI-powered features
Bypassing Safety Guardrails with Uncommon Languages
This chapter discusses a research result showing that AI safety guardrails can be bypassed by translating prompts into uncommon languages: the translated prompts had a success rate of 79%, compared with 1% for English prompts.
Our 154th episode with a summary and discussion of last week's big AI news!
Read our text newsletter and comment on the podcast at https://lastweekin.ai/
Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
Correction: Andrey mentioned "state space machines"; he meant "state space models".
Timestamps + links: