01

Reddit vs. Anthropic: Lawsuit alleges unauthorized AI extraction of user content

On June 4, 2025, Reddit filed a lawsuit in San Francisco Superior Court against AI startup Anthropic, accusing it of unauthorized scraping of user-generated content to train its chatbot, Claude, despite assurances to the contrary.

The lawsuit alleges that since July 2024, Anthropic bots accessed Reddit more than 100,000 times, violating promises to stop such activity.

License Agreements Against Unauthorized Scraping

Reddit emphasizes that it has reached licensing agreements with giants such as OpenAI and Google, while efforts to obtain a similar one with Anthropic failed.

Unlike these companies, Anthropic refused to negotiate, and according to Reddit, violated robots.txt, API limits, and its own User Policy and Developer Terms.

Main Allegations of the Lawsuit
Reddit bases its complaint on several legal grounds:

Breach of contract and developer rights: So-called automated bots recognize and violate explicit clauses.
Unauthorized access to the Compliance API and circumvention of technical mechanisms.
Use of removed content to train models, even after removal.
Unjust enrichment and unfair competition, exploiting content without compensation.

    Reddit is seeking an injunction prohibiting future use of its content, monetary damages (pecuniary and punitive damages), and the removal of all resulting training.

    Regulatory Impact and Context

    This dispute reflects an evolving legal trend:

    Scraping Litigation: Reddit joins outlets like the AP and the New York Times, which have also sued AI companies.
    Ethical Defense vs. Actual Practice: Anthropic presents itself as an “AI guardian,” but according to Reddit, its practices prove otherwise.
    Digital Karma and Irony: Analysts point to the contradiction in Reddit’s monetization of content without compensation, criticizing the complaint as “karma.”

    Anthropic and Market Reactions

    Anthropic has stated that it “disagrees with the allegations and will vigorously defend itself”.

    Since the announcement, Reddit shares have risen nearly 7%, accumulating a year-over-year gain of nearly 28%, suggesting market support for the initiative.

    Legal challenges and lessons for AI

    Explicit licenses: Training AI with third-party content requires formal contractual agreements.
    Insufficient technical protection: robots.txt or API limits do not guarantee compliance.
    User privacy: Storing deleted data represents a serious violation.
    Emerging legal framework: This case could establish jurisprudence on scraping and platform rights versus copyright.