The internet's memory is being systematically erased, and major news organizations are leading the charge.
Twenty-three prominent news outlets - including The New York Times, The Washington Post, and Reuters - have blocked the Wayback Machine from archiving their websites. The move threatens the permanent loss of journalistic records and public accountability as paywalls and takedowns erase the historical internet.
The Internet Archive's Wayback Machine has preserved over 866 billion web pages since 1996, creating a digital library that researchers, journalists, and the public rely on to verify claims, track policy changes, and hold institutions accountable. When news organizations block archiving, they're not just protecting subscription revenue - they're burning the library.
Publishers say they want to monetize their archives and control how AI training data is collected from their content. Both are legitimate business concerns. But what they're actually doing is ensuring that when they scrub embarrassing coverage, change headlines after publication, or fold entirely, there's no backup.
We've seen this movie before. News outlets routinely delete or modify articles without disclosure. Public figures scrub their digital footprints. Companies remove evidence of past positions. The Wayback Machine was the failsafe - proof that the internet remembers even when institutions want to forget.
Not anymore.
The implications go beyond journalism. Academic research relies on stable citations. Legal proceedings reference online evidence. Democratic accountability requires a factual record of what was said and when. Without comprehensive archiving, we're trading a complete historical record for subscription revenue and AI licensing deals.
The blocking mechanism is simple: a single line of code in a robots.txt file. The consequences are permanent. Once the archive stops capturing a site, that history is gone. No retroactive crawling. No second chances.
Some outlets blocking the archive: The New York Times, The Washington Post, Reuters, CNN, The Wall Street Journal, Bloomberg, and The Guardian. Major media companies that built their reputations on holding others accountable are now ensuring they can't be held accountable themselves.
The Internet Archive is a nonprofit. It charges nothing. It serves the public interest. And it's being systematically locked out by the institutions that claim to serve that same interest.
Publishers want to monetize their archives and control AI training data. Those are business decisions. But don't pretend they're compatible with preserving history. You can't have both.
