A few months back, a friend who runs a mid-sized media site called me in a mild panic. "Traffic's been bleeding for six months. Nothing I do helps. I've rewritten content, built links, fixed my Core Web Vitals โ Google just seems to not care." We spent an hour in Search Console together. Within fifteen minutes, I found the problem: 4,300 soft 404 errors, quietly accumulating since a site redesign last autumn. He had no idea they were even there.
This happens constantly. Soft 404s are the kind of technical SEO problem that doesn't scream at you. There's no big red error alert. Your site looks fine in a browser. Pages technically load. But Google sees them as empty husks, and the crawl budget consequences are brutal.
A case study published by Search Engine Land last week made this viscerally clear: a multinational media company lost 90% of its organic traffic following a domain migration โ and the root cause wasn't the migration itself. It was thousands of soft 404 errors that had accumulated and destroyed the site's crawl efficiency across 13 country-specific domains. The fix took weeks. The damage took over a year to manifest fully. That's the thing about soft 404s โ they're not a sudden catastrophe. They're slow poison.
What a Soft 404 Actually Is (And Why It's Worse Than a Real 404)
A regular 404 is honest. The server says "this page doesn't exist" and returns an HTTP 404 status code. Google sees that, notes the page is gone, and moves on. It's clean. Efficient. No wasted effort.
A soft 404 is a liar. The server returns an HTTP 200 OK status โ telling Google "yes, everything is fine here!" โ but the page itself has no meaningful content. Maybe it's a product page where the item sold out and the template now shows an empty shell. Maybe it's a search results page that returns zero matches. Maybe it's a location page for a service area your CMS auto-generated that has three sentences on it. The page technically exists. Google gets a success response. But when it reads the actual content, it realizes there's nothing there worth indexing.
This is actually worse than a real 404 in one specific way: it wastes crawl budget. Google allocates a finite number of crawl requests to your site based on how trustworthy and useful it thinks your site is. When it wastes those requests on empty pages that return 200s, it has fewer resources left to discover and index your actually valuable content.
Where Soft 404s Come From
In my experience, soft 404s cluster around a handful of very predictable situations. If any of these sound familiar, there's a good chance you've got some festering in your site right now.
Auto-Generated Thin Pages
This is the big one for e-commerce and directory sites. If your CMS automatically generates pages based on parameters โ faceted navigation filters, currency converters, location combinations, tag archives โ you can end up with thousands of pages that have little to no unique content. The example from the Search Engine Land case study was currency converter pages like /usd-to-thor?amount=250. Technically valid URLs. Zero content value. Multiplied by thousands. Crawl budget obliterated.
Out-of-Stock Product Pages
E-commerce sites get hit hard here. You have a product that sells out and the page collapses to just a "sorry, out of stock" message or a skeleton template. The URL still returns 200. Google sees an empty page. This is especially common on seasonal inventory sites that go through regular stock cycles.
Failed Site Migrations
If you migrated your site and didn't perfectly map old URLs to new ones, you may have orphaned pages returning 200 with barely any content, or redirect chains that eventually dump Google onto thin pages. The case study mentioned above started with exactly this scenario โ a migration where Google split its crawl between the old and new domains, then found empty pages everywhere it looked.
Empty Search Results Pages
Internal site search is a surprisingly common culprit. If your search results pages are crawlable (they shouldn't be, but they often are), any query with zero results gives Google a blank-looking page with a 200 status. If you have dynamic search functionality, this can generate enormous crawl waste.
Staging Artifacts in Production
Sometimes pages from a staging or test environment end up in production with placeholder content or empty templates. These are usually fewer in number, but they're often in odd URL patterns that are harder to spot without a systematic audit.
How to Find Soft 404 Errors on Your Site
The good news: Google usually tells you. The bad news: most site owners never look.
Open Google Search Console, go to Indexing โ Pages, and look for "Soft 404" in the "Why pages aren't indexed" section. That list is your starting point. Sort by volume. If you see more than a handful, dig in โ click through to sample URLs and actually look at what those pages contain.
Search Console only shows you what Google has flagged, though. It won't show you pages Google hasn't gotten around to crawling yet. For a fuller picture, you need to run your own crawl.
๐งช Find Your Soft 404 Problems in 60 Seconds
RankSorcery's SEO Auditor scans 60+ technical factors โ including indexing issues, thin pages, and crawl-wasting URLs. Paste your URL and get a full report, free, no login needed.
Run Free SEO Audit โWhen you're reviewing flagged URLs, you're looking for patterns. Is it a specific URL parameter that's generating empty pages? A category that no longer has any products? A CMS module that's creating location pages with boilerplate text? The specific fix depends on the root cause โ which is exactly why pattern recognition matters so much here.
How to Actually Fix Soft 404 Errors (Without Breaking Things)
There's no single universal fix, but there's a clear decision tree. For each batch of soft 404s, you're answering one question: does this URL deserve to exist as a real page, or not?
Return a Proper 404 or 410 (For Dead Pages)
If the page genuinely has no useful content and never will โ a discontinued product, an empty category, a broken URL โ update your server to return a proper 404 (not found) or 410 (gone permanently). The 410 is actually more efficient if you know the content is never coming back; Google drops 410s from its index faster. Don't leave dead pages returning 200.
Add a Noindex Tag (For Thin But Useful-ish Pages)
Some pages are necessary for your site to function โ internal search results, filtered product listings, some faceted navigation views โ but they don't deserve to be indexed. Add <meta name="robots" content="noindex"> to the page template. This tells Google not to include the page in its index, but it doesn't waste a 404 on something that's genuinely part of your site's UX.
Block via Robots.txt (For Entire URL Patterns)
If you have a whole class of URLs that should never be crawled โ URL parameters, internal search, pagination after a certain depth โ use robots.txt to block them. This stops Google from wasting crawl budget on them in the first place. This is particularly effective for dynamic URL patterns generated by filters, sorts, or query parameters.
Add Real Content (For Pages Worth Keeping)
Sometimes the right answer is just: build the page properly. If you have location pages or category pages that are thin, the fix isn't to delete them โ it's to add content that justifies their existence. If a page is worth having in Google's index, it needs actual substance. Minimum 300โ500 words of unique, useful content at the bare minimum; ideally more.
Redirect (For Moved or Consolidated Pages)
If the content from a thin page has been moved to a better URL, implement a 301 redirect. This passes any link equity and signals to Google that the content lives somewhere else. Don't leave orphaned URLs lingering โ either fix them, block them, or redirect them.
The Crawl Budget Connection You Need to Understand
I want to spend a minute on crawl budget because I find that a lot of SEOs treat it as an abstract concept that only matters for huge enterprise sites. It doesn't. It matters for any site with more than a few hundred pages โ and it especially matters in 2026 when Google's indexing pipeline is already under pressure from the sheer volume of AI-generated content flooding the web.
Here's the mechanism: Google has a crawl rate limit for your site based on server load, and a crawl demand based on how valuable it thinks your content is. Together, these create your effective crawl budget. When Google repeatedly crawls pages and finds nothing useful, its perception of your site's value starts to drop. It becomes less enthusiastic about crawling deeply. New content takes longer to get indexed. In extreme cases โ like the Brazilian site in the case study โ crawl volume can drop by 60% or more.
The knock-on effects are nasty, especially for content-heavy sites:
- New articles take 24+ hours to get indexed instead of minutes
- Time-sensitive content (news, product launches, sales) misses its traffic window
- Google Discover promotion slows or stops โ Discover heavily favors sites Google trusts
- Deep pages on your site stop getting crawled at all
- Your competitors, who don't have these issues, get indexed faster and win the clicks
If You're Planning a Site Migration, Read This First
Soft 404s and site migrations are unfortunately best friends. Migrations create ideal conditions for these errors to spawn: old URLs get left behind, CMS templates get misconfigured, redirect mapping is never quite complete, and auto-generated pages from the old site structure don't have equivalents in the new one.
One specific thing that kills people: when you move to a new domain and don't fully deindex the old one, Google splits its crawl budget between both. You're essentially asking Googlebot to do twice the work for the same site. The old domain continues to rank for some terms while the new domain struggles to build authority. This is exactly what happened in the 13-domain case study โ and fixing the migration was a prerequisite to fixing the soft 404 problem, because Google was spending crawl budget on an abandoned property.
After You Fix Soft 404s: What to Expect
Don't expect overnight results. Google doesn't instantly reprocess your site the moment you fix issues. But the recovery trajectory from systematic soft 404 remediation is well-documented:
- Week 1โ2: Search Console starts showing declining soft 404 counts as fixes are validated
- Week 3โ4: Crawl rate begins increasing as Google finds fewer wasteful pages
- Week 5โ8: Indexing coverage for good content starts climbing
- Week 8โ12: Organic traffic begins recovering, often faster for fresh content
- Month 3โ6: Full recovery for sites that fix root causes, not just symptoms
The case study showed Germany going from ~150,000 indexed pages to 370,000 pages after remediation, with daily clicks rising from around 8,000 to a sustained 12,000โ15,000. Poland saw traffic spikes above 30,000 clicks per day after barely cracking 5,000 previously. These are real gains from fixing what amounts to a plumbing problem.
Quick Self-Audit: Do You Have a Soft 404 Problem Right Now?
Run through this honestly. If you check more than two or three of these boxes, I'd bet real money you've got a soft 404 situation worth investigating.
- Your site has e-commerce products that go out of stock regularly
- Your CMS auto-generates pages for category combinations, location variations, or URL parameters
- You migrated your site in the last 12โ18 months
- You have internal search that's crawlable
- You've added and removed a lot of content over the past year without checking URLs
- Your Google Search Console "Pages" report shows "crawled โ not indexed" growing steadily
- You haven't looked at your Search Console indexing report in the last 30 days
The fix starts with visibility. You can't address what you haven't found. Pull your Search Console Pages report today, export the soft 404 list, and start identifying the patterns. You might be surprised what you find sitting there quietly ruining your site's relationship with Google.
๐งช See Exactly What's Hurting Your Rankings
RankSorcery's free SEO Auditor checks for thin content, indexing problems, crawl waste, and 60+ other technical issues. Get your full report in under a minute โ no account required.
Audit My Site Free โSoft 404s don't get the press that algorithm updates or AI search does. They're unglamorous. They don't come with a catchy name like a Penguin update. But the fundamentals of SEO haven't changed: if Google can't find and index your content efficiently, nothing else you do matters. Links, content quality, E-E-A-T signals โ all of it is downstream of the basic question of whether Google can actually read your site. Fix the plumbing first.