I can’t think of a more important chart I have seen in SEO history.
In a recent filing by Google, they lay out a case against sharing their data in the US vs Google antitrust case court ruling. In the document it shows a chart that, Google thinks 99% of the web is spam and duplicates! Only about one percent is considered genuinely original and worth strong ranking signals.
To me, that single number and chart explains more about today’s search results than a dozen algorithm updates analysis. For many SEO’s, it could be the most important chart you will see in the next decade.
Although some on the socials are calling the document a classic Google gaslighting or talking point release, it is obvious to us that the doc represents the fact that Google is no longer treating the web as a collection of publishers competing for rankings. It is treating the web as a polluted data stream where almost everything must be filtered out before results can even be assembled.
The Great Content Filter
The chart is of course circulating among SEO circles and illustrates a staggering disparity. While the public web contains billions (Trillions?) of pages, Google’s systems are designed to aggressively filter out what they deem “non-content.”:
- The Red Zone (98-99%): This represents the trillions of pages Google crawled but chooses not to index, or they would ignore or demote. (this is “crawled and not indexed” in your GSC dregs)
- The Green Sliver (1-2%): These are the pages that Google makes available for Serps.
Bad Bad Bad Signal-to-Noise Ratio
Google has historically claimed that 99% of search visits are spam-free. However, the bottom line for website owners isn’t just that Google is good at catching low quality pages – it’s that Google is increasingly labeling legitimate but not-necessarily original business content as low-quality.
What Happened?
You know this did not happen overnight. Over the last 10-15 years, three big forces reshaped the index.
- First, templated cms publishing exploded. WordPress made publishing accessible to joe user at scale. Affiliate networks, programmatic SEO, auto generated location pgs, product clones, scraped feeds, autogen spun articles, and ecomm doorway pages now dominate some entire verticals.
- Second, CMS duplication multiplied and multiplied again. Canonical mistakes, many types of faceted navigation, versions, print only versions, pdf versions, feed copies, tracking params, archives, tag pages, and the like created millions – if not – billions of variants of the exact same content.
- Third, AI slop. ‘Nuff said.
The Real Algo Ranking Model?
When only 1% of pages are treated as valuable, ranking becomes less about optimization and more about qualification and filtering.
The primary question Google used to ask: Which page is best optimized content we can return?
Is now: Which pages deserve to exist in results at all?
This shifts their systems in three ways:
- First, indexing is now selective. Large portions of the web are crawled but never meaningfully ranked.
- Second, authority dominates. When originality is scarce, trust becomes the prime filter.
- Third, if most content is redundant, Google prefers to summarize rather than link.
This directly feeds AI Slop-on-top and “answer blocks”.
The Real Implication for SEO
For the average site owner, this Red Sea is a warning that it may be time to take your optimization efforts elsewhere. Sooner-or-later, you are going to be “de-searched” where you’ll be fully crawled, indexed, used by Google, but not appearing for any searches but – maybe – your domain name.
The survival strategy for 2026:
- Look for fresh traffic sources and own your audiences
- Stop the volume: Quality has officially killed quantity by AI
- Add Human Data: First-person accounts, stories, original photos, and unique data that an AI cannot slop-out hallucinate. Every article you write (tough to do) you should include a personal story or history. You want the reader to touch it – taste it – feel it.
- Prune and/or noindex the dead content: Many sites are seeing ranking boosts by actually deleting their low-quality “red” pages.


