GetIntent Crawler
Intent analysis crawler
About this crawler
GetIntent Crawler is a web crawler identified by the regular-expression pattern GetIntent Crawler in the User-Agent request header. It is categorised as advertising. Use the regex above to detect, log, allow, or block GetIntent Crawler traffic in your web server, CDN edge rules, or robots.txt.
Block-rate · top 25k sites
0.065%
Technical details
- Name
- GetIntent Crawler
- Pattern
GetIntent Crawler- Tags
- advertising
- Reference
- http://getintent.com/bot.html
- Added
- 2026/05/03
- Instances
- 1 known sample(s)
Sample User-Agent strings
GetIntent Crawler (http://getintent.com/bot.html)
Block this crawler
robots.txt — disallow GetIntent Crawler:
User-agent: GetIntent Crawler
Disallow: /
Apache .htaccess — return 403:
RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} GetIntent Crawler [NC]
RewriteRule .* - [F,L]
Nginx — return 403 inside a server block:
if ($http_user_agent ~* "GetIntent Crawler") {
return 403;
}
← back to all crawlers