AI Applied: Covering AI News, Interviews and Tools - ChatGPT, Midjourney, Gemini, OpenAI, Anthropic

Navigating the AI Legal Maze: Perplexity's Predicament

6 snips
Dec 9, 2025
The podcast dives into the legal turmoil surrounding Perplexity and its mounting lawsuits. It discusses retrieval augmented generation (RAG) and potential legal liabilities from handling verbatim articles. The hosts debate the implications of paywalls and the responsibility of pirate sites in the AI landscape. They also explore the ways Perplexity might bypass paywalls, and contrast it with Meta's strategy of forming partnerships with news publishers to ensure accuracy and mitigate legal risks. Tune in for insights into the intricate relationship between AI and the law.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

RAG Makes Perplexity's Case Unique

  • Perplexity's lawsuits differ because they're accused of using retrieval augmented generation (RAG) to store and serve news verbatim.
  • That behavior crosses from fair-use summarization into direct reproduction that publishers can legally challenge.
INSIGHT

Storing Verbatim Content Is Risky

  • If Perplexity stores paywalled articles verbatim in a DB and returns them, that likely violates publishers' rights.
  • The legal risk grows when scraped third-party sites rehost paywalled content that then enters training or retrieval corpora.
INSIGHT

Scraping Creates Attribution Ambiguity

  • Training sets can unintentionally include copied paywalled text via third-party reposts, creating attribution ambiguity.
  • Blacklisting everything verbatim is impractical given how models ingest diverse scraped sources.
Get the Snipd Podcast app to discover more snips from this episode
Get the app