If you manage a website, chances are you’ve come across Google Search Console’s indexing status reports. While it’s a valuable tool for ensuring your content is discoverable, not every page that appears under “Not Indexed” requires immediate attention.
Let’s explore some of the main reasons pages may not be indexed and why some of these might actually be expected, or even preferred.
When you see “Crawled - Currently Not Indexed,” it means Google’s crawler found the page but chose not to include it in the index. This can happen for a variety of reasons, such as low-quality content, duplicate information, or simply because Google’s algorithms didn’t consider it valuable enough to display in search results.
Should You Worry?
Not necessarily. For example, if you’ve got thin content pages (such as tag pages on a blog) or pages that don’t add significant value to your site, it might actually be better they’re left unindexed. However, if a key page appears here, it may need more robust content or backlinks to improve its perceived value.
This message means Google knows the page exists but hasn’t yet crawled it. This can happen if your site has a large number of pages, or if there are other priorities on your site that Google’s crawler focuses on first.
Should You Worry?
In most cases, no. If your site is new, or if you’ve added a lot of new content, this can be expected. If it’s a high-priority page, you could request indexing through Google Search Console, but usually, patience is key as Google will eventually get to it.
Pages marked as “Blocked by robots.txt” are prevented from being crawled because of instructions in your robots.txt file. This is often intentional—many sites block pages like admin sections, login portals, or private user data.
Should You Worry?
Not unless there’s an error in your robots.txt file. Review the file to ensure it’s blocking only the pages you don’t want indexed. Otherwise, this message is simply a reminder of your intentional choices.
If a page is marked as “Blocked due to a noindex tag,” it means that a noindex directive in the page’s HTML prevents it from appearing in search results. This is a common approach for pages you don’t want in Google’s index—like thank-you pages, certain archives, or duplicate content.
Should You Worry?
No, as long as this is intentional. Review the pages with noindex tags and confirm they’re set up correctly. If important pages are accidentally marked noindex, remove the tag to make them eligible for indexing.
When Google finds multiple versions of similar content, it often selects one version to index as the “canonical” URL. Pages with this status weren’t selected because Google prefers a different version (often indicated by a canonical tag).
Should You Worry?
Only if the page that’s not indexed should be the primary version. Check your canonical tags to ensure they’re pointing to the correct URL. For most duplicate or near-duplicate pages, it’s best to let Google choose a canonical version.
A “Soft 404” means Google thinks the page is either empty or lacks meaningful content, which may be caused by broken links, temporary server issues, or pages with minimal information.
Should You Worry?
Yes, if these are important pages. Check for broken links or add relevant content to pages marked with a “Soft 404.” If it’s a truly empty page (such as a placeholder), consider redirecting it or adding a noindex tag to avoid cluttering Google’s index.
This message appears when Google detects that a page is an alternate version of another page, typically as part of a setup for regional or mobile-specific pages.
Should You Worry?
No, if your canonical tags are set up properly. This status simply reflects that Google respects your canonical preferences, and only one version is indexed.
Pages might also be removed from indexing because of legal complaints, such as DMCA copyright claims or privacy complaints.
Should You Worry?
If your page has been legally removed, you’ll need to resolve the underlying issue before Google can re-index it. This could involve responding to copyright claims or updating the content to comply with legal guidelines.
It’s crucial to remember that Google Search Console’s “Not Indexed” statuses aren’t always indicative of issues. Many of these messages simply inform you of actions taken based on how your website is structured and your own settings. Reviewing these notices regularly is a good practice, but in many cases, no action is needed. Knowing when to act (and when not to) can save you time and help ensure your site remains healthy and accessible to the right audience.