Encyclopedia Britannica is suing OpenAI, and their legal theory is fascinating—and different from every other AI copyright lawsuit so far.
The 254-year-old reference publisher filed suit this week alleging that ChatGPT didn't just train on Britannica's articles—it memorized them. And when users ask ChatGPT questions, it regurgitates Britannica's exact phrasing, including specific facts and editorial choices that took human experts years to research and verify.
This isn't the New York Times lawsuit, which focused on verbatim reproduction of copyrighted articles. Britannica's argument is more subtle and potentially more dangerous for AI companies: even when ChatGPT doesn't copy text word-for-word, it's still reproducing Britannica's intellectual labor—the curation, fact-checking, and synthesis that makes an encyclopedia valuable.
The lawsuit includes side-by-side examples. Ask ChatGPT about the Treaty of Westphalia, and you'll get an answer that follows Britannica's structure, uses Britannica's emphasis on specific clauses, and even includes the same contextual framing that Britannica's editors chose in 2018.
That's not how human memory works. When I read an encyclopedia article and then explain a concept to someone, I'm not reproducing the source—I'm synthesizing it through my own understanding. But language models don't understand anything. They predict tokens. And if they were trained on enough Britannica articles, those predictions will statistically mirror Britannica's choices.
likely defense will be that facts aren't copyrightable. The happened in 1648—that's not something owns. The challenge for is proving that is copying their of facts, not just the facts themselves.
