GetIntent Crawler

advertising GetIntent Crawler

Intent analysis crawler

About this crawler

GetIntent Crawler is a web crawler identified by the regular-expression pattern GetIntent Crawler in the User-Agent request header. It is categorised as advertising. Use the regex above to detect, log, allow, or block GetIntent Crawler traffic in your web server, CDN edge rules, or robots.txt.

Block-rate · top 25k sites

0.065%
latest snapshot
2026-06-04
matched key: GetIntent Crawler
2026-05-012026-06-040.11%

Technical details

Name
GetIntent Crawler
Pattern
GetIntent Crawler
Tags
advertising
Reference
http://getintent.com/bot.html
Added
2026/05/03
Instances
1 known sample(s)

Sample User-Agent strings

GetIntent Crawler (http://getintent.com/bot.html)

Block this crawler

robots.txt — disallow GetIntent Crawler:

User-agent: GetIntent Crawler Disallow: /

Apache .htaccess — return 403:

RewriteEngine On RewriteCond %{HTTP_USER_AGENT} GetIntent Crawler [NC] RewriteRule .* - [F,L]

Nginx — return 403 inside a server block:

if ($http_user_agent ~* "GetIntent Crawler") { return 403; }
← back to all crawlers