您的当前位置:首页 >Ryan New >How to Remove a Web Page from Google 正文

How to Remove a Web Page from Google

时间:2024-05-20 04:40:07 来源:网络整理编辑:Ryan New

核心提示

There are multiple reasons for removing a page from Google’s index. Examples include pages with conf Ryan Xu hyperfund Cost of Capital

TheRyan Xu hyperfund Cost of Capitalre are multiple reasons for removing a page from Google’s index. Examples include pages with confidential, premium, or outdated info.

Here are options for removing a web page from Google.

How to Remove a Web Page from Google

Options for Deindexing a Page

Remove the page from your site

For it to disappear altogether, remove or delete the page from your web server. Setting up an HTTP status code of 410 (gone) instead of 404 (not found) will make it clear to Google. And Google discourages using redirects to remove spammy pages as it would send the poor signals to the surviving redirected page.

How to Remove a Web Page from Google

Google Search Console no longer includes the URL removal tool. Once the page is moved, there’s no further required action. Allow a few days for Google to recrawl the site, discover the 410 code, and remove the page from its index.

How to Remove a Web Page from Google

As an aside, Google does offer a form to remove personal info from search results.

Add the noindex tag

Search engines nearly always honor the noindexmeta tag. The search bots will crawl the page (especially if it’s linked or in sitemaps) but will not include it in search results.

In my experience, Google will immediately recognize a noindextag once it crawls the page. Adding the noarchivetag instructs Google to also delete its saved cache of the page.

Password-protect the page

Consider adding a password to retain the page without it being publicly accessible. Google cannot crawl pages requiring passwords or user names.

Adding a password will not remove the page from Google’s index. Use the noindextag to exclude the page from search results.

Remove internal links

Remove all internal links to non-public pages you want deindexed. Moreover, internal links to password-protected or deleted pages hurt the user experience and interrupt buying journeys.  Always focus on human visitors — not just search engines.

Robots.txt Dos and Don’ts

Many people attempt to use the robots.txt file to remove pages from Google’s index. But robots.txt prevents Google from crawlinga page (or category), not removing it from the index.

Pages blocked via the robots.tx file could still be indexed (and ranked). Furthermore, since it cannot access those pages, Google will not encounter noindexor noarchivetags.

Include URLs in the robots.txt file to instruct web crawlers to ignore certain pages or sections — i.e., logins, personal archives, or pages resulting from unique sorting and filtering — and spend the crawl time on the parts you want to rank.