Britannica, Merriam-Webster Sue OpenAI Over Copyright

Legal gavel on books next to laptop with AI code representing the Britannica OpenAI lawsuit.

March 17, 2026 — Encyclopedia Britannica and its subsidiary Merriam-Webster have filed a federal lawsuit against OpenAI, accusing the artificial intelligence company of “massive copyright infringement.” The publishers allege OpenAI used nearly 100,000 copyrighted online articles to train its large language models without permission or compensation.

Core Allegations in the Complaint

The lawsuit, filed in U.S. District Court, centers on three primary claims. First, Britannica contends OpenAI illegally scraped its proprietary content for training data. Second, the publisher alleges OpenAI’s models sometimes generate outputs containing “full or partial verbatim reproductions” of Britannica articles.

A third accusation involves OpenAI’s use of retrieval-augmented generation (RAG). This tool allows models like ChatGPT to scan external databases for current information. Britannica claims this workflow also constitutes infringement when it accesses their content.

“ChatGPT starves web publishers like [Britannica] of revenue by generating responses to users’ queries that substitute, and directly compete with, the content from publishers,” the complaint states. The lawsuit further argues that AI “hallucinations”—factually incorrect outputs falsely attributed to Britannica—damage the publisher’s reputation and mislead the public.

Broader Legal Battle Over AI Training Data

Britannica joins a growing list of media organizations pursuing legal action against AI developers over training data. The New York Times filed a similar lawsuit in late 2023. Numerous newspaper chains, including the Chicago Tribune and Denver Post publisher MediaNews Group, have also sued.

Ziff Davis, owner of CNET, Mashable, and PC Magazine, filed its own complaint. The Canadian Broadcasting Corporation and the Toronto Star are among other plaintiffs. A separate Britannica lawsuit against AI startup Perplexity remains pending.

Unsettled Legal Precedent

No strong legal precedent yet establishes whether using copyrighted content to train AI constitutes infringement. In a related case involving Anthropic, federal judge William Alsup suggested the training use could be “transformative” and potentially legal under fair use doctrines.

However, Judge Alsup ruled Anthropic violated copyright law by illegally downloading millions of books rather than purchasing them. That ruling led to a $1.5 billion class action settlement for affected authors. The distinction between the act of training and the method of data acquisition remains a key legal battleground.

Potential Impact on AI Development and Publishing

The lawsuit highlights the deepening conflict between AI companies reliant on vast data and content creators seeking compensation. Britannica argues that AI-generated summaries directly compete with their subscription-based encyclopedia service, threatening their business model.

Legal experts note the case could influence how future AI systems are trained. A ruling against OpenAI might require licensing agreements with publishers or force companies to use only publicly licensed or self-generated data. The complaint also alleges violations of the Lanham Act, a trademark statute, concerning false attribution in AI outputs.

OpenAI did not respond to a request for comment from TechCrunch prior to publication. The company has previously argued that training AI on publicly available internet data constitutes fair use, a position increasingly challenged in courtrooms.

What Comes Next

The case will likely focus on the fair use defense’s applicability to AI training. Courts must balance innovation against copyright protection. The outcome could set a significant precedent for the entire generative AI industry, determining cost structures and data sourcing practices for years to come.

Meanwhile, publishers continue exploring defensive measures. Some have implemented robots.txt directives to block AI crawlers, while others are negotiating direct licensing deals with technology firms. The Britannica lawsuit represents a decisive move toward litigation rather than negotiation for one of the world’s oldest reference publishers.

Updated insights and analysis added for better clarity.

This article was produced with AI assistance and reviewed by our editorial team for accuracy and quality.

Core Allegations in the Complaint

Broader Legal Battle Over AI Training Data

Unsettled Legal Precedent

Potential Impact on AI Development and Publishing

What Comes Next

Related Articles

Explosive Denial: Trump Claims No Role in DOJ Investigation Targeting Fed Chair Jerome Powell

Breaking: DeepSnitch AI Surges 175% in Presale, Outperforming Major Cryptocurrencies

Breaking: Quantum Facility Construction Begins in Chicago, Poses Future Bitcoin Threat