Advanced feature
Some sites already have high crawl coverage as determined by Google.
number of duplicates)
http://www.example.com/page.php?key2=value2&key=value
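Assuming the URL above is meant as a duplicate of the same page with its parameters listed in a different order, here is a minimal sketch of how such duplicates could be collapsed on the site's own side by sorting the query string. The helper name normalize_query_order is hypothetical and not part of Webmaster Tools.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

def normalize_query_order(url):
    """Return the URL with its query parameters sorted by (name, value)."""
    parts = urlsplit(url)
    # parse_qsl keeps repeated keys; keep_blank_values preserves "key=" pairs.
    pairs = sorted(parse_qsl(parts.query, keep_blank_values=True))
    return urlunsplit(
        (parts.scheme, parts.netloc, parts.path, urlencode(pairs), parts.fragment)
    )

# Two orderings of the same parameters (the first ordering is an assumed counterpart).
a = normalize_query_order("http://www.example.com/page.php?key=value&key2=value2")
b = normalize_query_order("http://www.example.com/page.php?key2=value2&key=value")
print(a == b)
```

Sorting only addresses ordering duplicates; parameters that do not change content need to be handled separately.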
Ineligible URLs
http://www.example.com/Wearables++Youtube++size+M.axd
http://example.com/hotels/cancun/a7a141343.html
http://example.com/cancun+hotel+zonehotels-1-23-a7a141343.html
page content (e.g., SID, affiliateID, or tracking-id)? Likely mark as "does not change content." This results in the "One representative URL" setting in Webmaster Tools.
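To illustrate what "one representative URL" means, the sketch below strips parameters that do not change page content so that all variants map to a single URL. The parameter names come from the examples above; the function name, the lowercase matching, and the sample URL are assumptions for illustration, not a description of how Google processes the setting.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

# Parameters assumed not to change page content (compared in lowercase).
NON_CONTENT_PARAMS = {"sid", "affiliateid", "tracking-id"}

def representative_url(url, ignored=NON_CONTENT_PARAMS):
    """Drop non-content parameters, keeping the remaining ones in their original order."""
    parts = urlsplit(url)
    kept = [
        (name, value)
        for name, value in parse_qsl(parts.query, keep_blank_values=True)
        if name.lower() not in ignored
    ]
    return urlunsplit(
        (parts.scheme, parts.netloc, parts.path, urlencode(kept), parts.fragment)
    )

# Hypothetical URL carrying a session ID and an affiliate ID.
print(representative_url(
    "http://www.example.com/page.php?itemid=android-t-shirt&SID=abc123&affiliateID=42"
))
```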
Sort parameter
Changes the order in which content is presented
sort=price_ascending rankBy=bestSelling order=highest-rated sort=newest
1. Identify the sort parameter
2. Specify Googlebot's preferred behavior for URLs with this parameter
Narrows
Filters the content on the page by showing a subset of the total items.
size=M less_than=25 color=blue
Narrows
If the "narrows" parameter shows less useful content that's a subset of the content from the more useful URL without the "narrows" parameter, you might be able to specify "Crawl No URLs."
Useful: category=You%20Tube
Less useful: category=You%20Tube&size=M
But verify a few things first...
Narrows (cont.)
If "Crawl No URLs" isn't optimal for your site, then perhaps select "Let Googlebot decide."
Specifies
Determines the content displayed on a page.
itemid=android-t-shirt SKU=495
Translates
Unless you want to exclude certain languages from being crawled or made available in search results (e.g., auto-generated translations), select "Crawl every URL."
Translates (cont.)
It is best practice to place languages in a subdirectory or subfolder rather than in a parameter, which helps search engines more easily understand the site structure.
Paginates
Displays one component page of a multipage sequence.
page=3 viewItems=10-30 start-index=20
Nearly always "Crawl every URL."
Imagine all URLs begin as eligible for crawling, then apply each setting as a process of elimination, not inclusion.
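The process-of-elimination idea can be modeled with a small sketch: every URL starts out eligible, and each parameter setting can only remove URLs, never add them back. The SETTINGS table, the policy names, and the sample URLs are hypothetical, and this is a mental model rather than Googlebot's actual implementation.

```python
from urllib.parse import urlsplit, parse_qs

# Hypothetical per-parameter settings: (policy, required value or None).
SETTINGS = {
    "size": ("no_urls", None),                   # "Crawl no URLs"
    "sort": ("only_value", "price_ascending"),   # "Crawl URLs with value x"
    "page": ("every_url", None),                 # "Crawl every URL"
}

def is_eligible(url, settings=SETTINGS):
    """Start from 'eligible' and let each parameter's setting eliminate the URL."""
    params = parse_qs(urlsplit(url).query, keep_blank_values=True)
    for name, values in params.items():
        # Unlisted parameters are left alone, as with "Let Googlebot decide".
        policy, wanted = settings.get(name, ("let_googlebot_decide", None))
        if policy == "no_urls":
            return False
        if policy == "only_value" and any(v != wanted for v in values):
            return False
    return True

urls = [
    "http://example.com/shirts?page=3",
    "http://example.com/shirts?size=M",
    "http://example.com/shirts?sort=newest",
    "http://example.com/shirts?sort=price_ascending&page=2",
]
eligible = [u for u in urls if is_eligible(u)]
print(eligible)
```

Because the settings only eliminate, a URL with several parameters is crawled only if none of its parameters rules it out.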
Recap
Utilize URL Parameters for more efficient crawling
Specify parameters that do not change content
Specify parameters that change content
If you can't determine, don't guess: "Let Googlebot decide"
Recap (cont.)
Sorts: If the parameter never exists in the URL by default: "Crawl no URLs." If parameter values are used consistently site-wide: "Crawl URLs with value x."
Narrows: for non-useful filters, "Crawl no URLs" (but be sure to double-check :)
Specifies: usually "Crawl every URL"
Translates: usually "Crawl every URL"
Paginates: usually "Crawl every URL"
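As a compact restatement of the recap, the sketch below encodes the rules of thumb as a lookup function. The function name and its arguments are hypothetical; the returned strings are the setting names used in this deck, and any case the recap does not cover falls back to "Let Googlebot decide."

```python
def suggested_setting(effect, *, in_default_urls=True, consistent_value=None, useful=True):
    """Map a parameter's effect on content to the setting the recap suggests."""
    if effect == "sorts":
        if not in_default_urls:
            # The sort parameter never appears in URLs by default.
            return "Crawl no URLs"
        if consistent_value is not None:
            # The same sort value is used consistently site-wide.
            return f"Crawl URLs with value {consistent_value}"
        return "Let Googlebot decide"
    if effect == "narrows":
        # Only non-useful filters are candidates for exclusion; double-check first.
        return "Let Googlebot decide" if useful else "Crawl no URLs"
    if effect in {"specifies", "translates", "paginates"}:
        return "Crawl every URL"
    # If you can't determine the effect, don't guess.
    return "Let Googlebot decide"

print(suggested_setting("sorts", in_default_urls=False))
print(suggested_setting("narrows", useful=False))
print(suggested_setting("paginates"))
```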