SEMrush

XML Sitemap helps the crawlers to identify the changes to your website. In return this helps in better rankings and indexing. Moreover, xml sitemaps contain all the pages of website that are intended to be crawled by search engines and to be ranked.

Regarding sitemaps there are some features which have added in the application. Below are details.

Crawl Sitemap

This is default feature that if a website has sitemap URL in its robots.txt, that sitemap will be crawled. A separate tab has created to show the details of sitemaps of websites. See the image below.

sitemap 1

By clicking the ‘View Sitemap’ all the information can be seen i.e. sitemap URL, page URL, last modified date, change frequency and priority when a sitemap is being crawled. And by clicking the ‘Download’ button that viewed sitemap can be downloaded as well. See the image below.

[the_ad id=”6396″]

sitemap 2

The default crawling option let you crawl the sitemap with the website crawling in parallel but if the intentions are to just crawl sitemaps then there is another feature under ‘Spider’ menu which let u do this. See the image below.

sitemap 3

With this option checked; application will only crawl the sitemaps.

Ignore Sitemap

There could be instances when this is desired to ignore sitemap from crawling as it is, by default, selected to be crawled with the website. So, if such scenario occurs then there is an option to ignore sitemap from being crawled. Option can be found in ‘Configuration Panel’ under Spider menu. See the below image.

[the_ad id=”6397″]

spider configuration 1

ignore sitemap 2

Website without Sitemap URL in Robots.txt

By default crawler fetches the robots.txt and search the sitemap URL in it. If it finds the Sitemap URL, it crawls it else gives the notification that sitemap URL is not found. There could be an instance when sitemap exists with the website but its URL is not presented in the robots.txt then ‘Custom Robots’ can be used to crawl the sitemap of website. Below is a snapshot for better understanding.

crawl sitemap 1

Above customization will ignore the actual robots.txt and will crawl above mentioned.

[the_ad id=”6398″]

Other Resources


Pin It

About Ahmad Ali

Ahmad is the co-founder and CEO at Webbee Inc. He’s been working as a digital marketer for past few years and has worked with some notables names across different industries. He is also the developer of Webbee SEO spider, one of the most advanced SEO spider tool on the internet.

Leave a Comment