SS

Siteimprove.comSiteCheck-sitecrawl

monitoring Siteimprove\.com|SiteCheck-sitecrawl

Siteimprove web crawler for site analysis

About this crawler

Siteimprove.comSiteCheck-sitecrawl is a web crawler identified by the regular-expression pattern Siteimprove\.com|SiteCheck-sitecrawl in the User-Agent request header. It is categorised as monitoring. Use the regex above to detect, log, allow, or block Siteimprove.comSiteCheck-sitecrawl traffic in your web server, CDN edge rules, or robots.txt.

Block-rate · top 25k sites

No block-rate data for this crawler.

Technical details

Name
Siteimprove.comSiteCheck-sitecrawl
Pattern
Siteimprove\.com|SiteCheck-sitecrawl
Tags
monitoring
Added
2018/06/22
Instances
5 known sample(s)

Sample User-Agent strings

Mozilla/5.0 (compatible; MSIE 10.0; Windows NT 6.1; Trident/6.0) LinkCheck by Siteimprove.com
Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.0) Match by Siteimprove.com
Mozilla/5.0 (compatible; MSIE 10.0; Windows NT 6.1; Trident/6.0) SiteCheck-sitecrawl by Siteimprove.com
Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.0) LinkCheck by Siteimprove.com
Mozilla/5.0 (compatible; MSIE 10.0; Windows NT 6.1; Trident/6.0) SiteCheck-sitecrawl

Block this crawler

robots.txt — disallow Siteimprove.comSiteCheck-sitecrawl:

User-agent: Siteimprove.comSiteCheck-sitecrawl Disallow: /

Apache .htaccess — return 403:

RewriteEngine On RewriteCond %{HTTP_USER_AGENT} Siteimprove\.com|SiteCheck-sitecrawl [NC] RewriteRule .* - [F,L]

Nginx — return 403 inside a server block:

if ($http_user_agent ~* "Siteimprove\\.com|SiteCheck-sitecrawl") { return 403; }
← back to all crawlers