5min chapter

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

Watermarking Large Language Models to Fight Plagiarism with Tom Goldstein - 621

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

CHAPTER

In-Context Attacks on the Chat GPT Watermark

DaVinci is one of the APIs provided by OpenAI. It's one of the ways you can access their GPT-3 model. There are attacks that you can use, we call these in-context attacks where you specifically instruct chat GPT to do something strange that will invalidate the watermark. In fact, there's a whole slew of attacks that we discussed in the paper.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode