A client came to me in a panic two weeks after the May 2026 Core Update dropped. Their traffic had tanked about 35% โ€” not catastrophic, but enough to make their boss ask uncomfortable questions. I ran a quick crawl of their site and found 847 indexed pages. They had maybe 120 that were actually doing any work. The other 727? Deadweight. Old event pages, duplicate location variants, tag archive pages with three posts each, product pages for things they stopped selling in 2023.

We spent six weeks pruning. Traffic came back. It's not a magic bullet, but it's about as close as SEO gets to one these days.

Content pruning isn't new โ€” SEOs have been talking about it since Google started cracking down on thin content around 2011. But in 2026, with Google's quality signals tightening after back-to-back core updates and AI crawlers now skimming your whole site to build their knowledge base, having a bloated index full of junk pages genuinely hurts in ways that go beyond just "wasting crawl budget." Your dead pages are actively pulling down the quality assessment of your entire domain.

โš ๏ธ
Post-May 2026 Core Update Reality Google's quality signals now operate more holistically at the site level. A cluster of thin or outdated pages in one section of your site can suppress rankings for completely unrelated pages on the same domain. I've seen it happen on multiple sites post-update.

What Content Pruning Actually Means (And What It Doesn't)

Let me be clear about something: pruning doesn't automatically mean deleting. That's the version of this advice that gets people in trouble. Pruning means making a deliberate decision about every low-performing page on your site โ€” and "delete" is only one of four options on the table.

The four options are:

1

Keep and Improve

The page has real potential โ€” maybe it gets some impressions but a low CTR, or it ranks for terms that are slightly off. Invest in it: better content, updated data, stronger internal links pointing to it.

2

Consolidate (Merge)

You have three pages covering slightly different angles of the same topic and none of them rank well. Merge them into one authoritative piece. 301 redirect the old URLs to the new consolidated page.

3

Noindex

The page serves a purpose on-site (think: old webinar landing pages, internal search results, thin tag archives) but doesn't need to be in Google's index. Add noindex and move on.

4

Delete and 301 Redirect

The page serves no purpose โ€” it has no traffic, no links, no useful content, and fixing it isn't worth the effort. Pull it down and redirect to the most relevant existing page.

Most pruning guides treat "delete everything that doesn't rank" as the answer. That's too blunt. A page with 50 monthly visits from a hyper-specific long-tail query might be worth more than a page with 500 visits from a bouncing audience who immediately leaves. Context matters.

How to Actually Find Which Pages Need Pruning

The first thing I do on any content audit is pull a full list of indexed URLs and cross-reference it against traffic data. You need two things working together: your crawl data and your Search Console performance data.

๐Ÿ’ก
Start With a Full Site Audit Before you can prune anything, you need to know what's actually there. RankSorcery's SEO Auditor will crawl your site and surface thin pages, duplicate content issues, and indexation problems in one pass โ€” no spreadsheet gymnastics required.

Here are the signals I look at to flag candidates for pruning:

  • Zero organic clicks in the last 12 months โ€” If Google has seen this page and nobody ever clicked it, that's a signal.
  • Less than 300 words of actual content โ€” Short is fine for some page types (tools, calculators), but thin informational pages are usually the culprit.
  • No external or internal links pointing to the page โ€” Orphaned pages aren't getting discovered by users or properly evaluated by Google.
  • Duplicate or near-duplicate content โ€” Variations of the same page fighting each other in the index.
  • Dated content that's factually wrong โ€” "Best SEO tools of 2019" is not a good look in 2026.
  • Pages with high impressions but near-zero CTR โ€” Could mean a bad title/meta, but could also mean Google is showing it because it's the closest match to nothing good.
  • Tag/category archive pages with < 5 posts โ€” These are almost always thin and should be noindexed.
  • Old event, promo, or seasonal pages past their date โ€” Still crawlable, still indexed, still contributing nothing.
62%
of enterprise sites have pages with zero organic clicks in the past year
3โ€“4ร—
typical crawl budget improvement after removing indexation bloat
~8 wks
average time to see traffic recovery after a focused pruning pass

The Decision Framework I Actually Use

I've tried a bunch of different scoring systems over the years and they mostly make things more complicated than they need to be. Here's the simple matrix I use now โ€” it's not perfect, but it gets you to a decision fast:

Step 1: Does the page get any organic traffic?

Pull 12 months of Search Console data. Set your threshold at whatever makes sense for your site's scale โ€” for a small blog, 50 clicks in a year is the floor. For a large e-commerce site, maybe 200. If a page doesn't hit that threshold, it's a candidate. If it does, keep it and move on.

Step 2: Does the page earn any links?

Run a backlink check. If a low-traffic page has links from authoritative domains, you don't just delete it โ€” you either keep it, improve it, or redirect carefully to preserve that equity. A page with two solid links and 20 monthly visits is worth more than it looks.

Step 3: Can it be merged with something better?

Is there a stronger page on the same topic that could absorb this content? If yes, merge and redirect. If no, then decide between improve vs. noindex vs. delete based on whether the topic itself is worth investing in.

"I've seen sites recover 40% of their lost traffic not by adding a single new page โ€” but by removing 200 bad ones. Google isn't just ranking pages. It's forming an opinion about your whole site."

๐Ÿงช Not Sure What's Dragging Your Site Down?

RankSorcery's free SEO Auditor crawls your site and flags thin content, duplicate pages, indexation issues, and more โ€” all in under a minute. No login, no credit card.

Run a Free Audit โ†’

Merging Content the Right Way

Consolidation is probably the most underused option in the pruning toolkit. Most people default to "delete" because it feels decisive. But if you have five blog posts on overlapping variations of the same keyword โ€” say, "how to write meta descriptions," "meta description best practices," "meta description length," etc. โ€” the right move is almost always to merge them into one genuinely comprehensive guide.

When you merge, here's the process that works:

1

Pick the surviving URL

Usually whichever page has the most backlinks, the most traffic, or the most logical URL structure. This becomes the destination.

2

Harvest the best sections

Go through every page being merged and pull any genuinely good sections, data points, examples, or quotes. Combine them into the surviving page, rewriting for flow.

3

301 redirect all old URLs

Every old URL needs to point to the new one. No exceptions โ€” don't leave dead pages with no redirect, and don't chain them through multiple redirects.

4

Update internal links

Any internal links that pointed to the old pages should be updated to point directly to the new URL. Yes, the redirect works โ€” but direct links are better for crawl efficiency and UX.

๐Ÿ”—
Don't Forget Internal Links After any pruning or consolidation, you need to audit your internal linking. Broken internal links, redirect chains, and orphaned destination pages are the most common post-pruning mistakes. The RankSorcery SEO Auditor picks these up automatically โ€” save yourself the headache of doing it in a spreadsheet.

The Pages You Should Almost Always Noindex

There's a category of pages that most sites have by default and most site owners never think about: the structural junk. These aren't bad pages exactly โ€” they serve a purpose for users โ€” but they have no business being in Google's index.

  • Tag and category archive pages that only have 2โ€“4 posts โ€” thin by definition, rarely rank for anything useful
  • Author pages on sites with one or two authors โ€” especially if each author page is just a list of posts with a one-line bio
  • Paginated archive pages beyond page 2 or 3 โ€” /blog/page/47/ is never going to rank for anything meaningful
  • Internal search results pages โ€” a classic crawl trap that generates infinite near-duplicate pages
  • Print-friendly versions of content (if your CMS generates these)
  • URL parameter variations โ€” tracking parameters, session IDs, filter combinations that generate duplicate URLs
  • Thank-you and confirmation pages โ€” after form submissions, purchases, etc.

For most of these, you don't need to delete anything โ€” just add noindex to the meta robots tag or handle it via Search Console's URL parameters tool. Google stops counting them against your site's quality signals, and your crawl budget gets freed up for the pages that actually matter.

The Timing Question: When Should You Actually Prune?

I get asked this a lot: "Should I prune before a core update or after?" My honest answer is โ€” don't time it around updates. Prune when you have the data to make good decisions, and don't rush it.

A bad pruning run done in a panic right after a core update โ€” where you delete pages that actually had value you didn't notice โ€” can make things much worse. I've seen it. The smart play is to treat content auditing as a quarterly or semi-annual process, not a one-time emergency response.

That said, if you're sitting at the end of May 2026 and your traffic just took a hit, doing a pruning pass right now is almost certainly the right call. The May 2026 Core Update clearly weighted site-level quality signals more heavily than previous updates. Getting the dead weight off your index is the fastest lever you have.

๐Ÿ“Š
Measure Before and After Before you start any pruning project, snapshot your current state: total indexed pages (from Search Console's Coverage report), total organic sessions (GA4, filtered to organic), and top 20 ranking pages by traffic. Pull the same metrics 8โ€“12 weeks after you complete the pruning. That's your proof of work โ€” and it's the only way to know if what you did actually helped.

Pruning Mistakes That Bite People Later

I've made most of these myself at some point, so take them seriously:

  • Not checking for links before deleting โ€” A page with 10 referring domains isn't thin content, it's an asset. Always check backlinks before the delete decision.
  • Deleting pages that feed lead-gen funnels โ€” Some pages have zero organic traffic but drive significant direct or email traffic. Traffic source matters.
  • Redirecting everything to the homepage โ€” Lazy redirect strategy. Redirect to the most topically relevant page, not just the root domain.
  • Forgetting to update the sitemap โ€” After a big pruning run, your XML sitemap may still list deleted or redirected URLs. Clean it up.
  • Not monitoring post-pruning for 8โ€“12 weeks โ€” Traffic takes time to recover. Don't panic if things look flat for a month.
  • Pruning and then immediately creating more thin content โ€” I have seen this happen and it is exactly as counterproductive as it sounds.

What Google Actually Wants From Your Content in 2026

Here's the honest version of this: Google wants your site to be worth crawling. Every time Googlebot visits your site, it's making a judgment call about how much time and resources to spend on you. If it keeps finding thin pages, duplicate content, and outdated junk, it's going to start deprioritizing your whole domain.

The flip side is also true. Sites that consistently publish useful, well-structured, non-redundant content โ€” and keep their index clean โ€” tend to get rewarded with more generous crawl budgets and better rankings across the board. Google's systems have gotten good enough that your worst content now affects your best content. That's the reality of site-level quality signals in 2026.

Content pruning is, at its core, a signal of editorial confidence. You're telling Google: "We only publish things we stand behind. We clean up what doesn't work. We don't pad our index to look bigger." That signal is worth more than having 800 indexed pages you're secretly embarrassed about.

๐Ÿ” Find Your Dead Pages in 60 Seconds

RankSorcery's free SEO Auditor crawls your entire site, flags thin content and duplicate pages, and gives you a prioritized list of what to fix โ€” no account needed, no monthly fee.

Run My Free Audit โ†’
JR

James Reyes โ€” RankSorcery

James has been doing SEO for longer than he'd like to admit. He runs RankSorcery and writes about the parts of search that don't make it into the standard playbooks. He's been wrong about a few predictions. He's been embarrassingly right about others.