Slurp
Yahoo's web crawling bot for Yahoo search indexing
About this crawler
Slurp is a web crawler identified by the regular-expression pattern Slurp in the User-Agent request header. It is categorised as search-engine. Use the regex above to detect, log, allow, or block Slurp traffic in your web server, CDN edge rules, or robots.txt.
Block-rate · top 25k sites
1.6%
Technical details
- Name
- Slurp
- Pattern
Slurp- Tags
- search-engine
- Reference
- https://help.yahoo.com/kb/search-for-desktop/SLN22600.html?impressions=true
- rDNS suffixes
.yahoo.co.jp,.yahoo.com,.yahoo.net- Instances
- 3 known sample(s)
rDNS verification (FCrDNS)
Verify a request is genuinely Slurp with forward-confirmed reverse DNS: the client IP's PTR record must end in one of the suffixes below and a forward A/AAAA lookup of that hostname must return the same IP. UA strings alone are spoofable; FCrDNS is not.
.yahoo.co.jp.yahoo.com.yahoo.net
Sample User-Agent strings
Mozilla/5.0 (compatible; Yahoo! Slurp/3.0; http://help.yahoo.com/help/us/ysearch/slurp)
Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)
Mozilla/5.0 (compatible; Yahoo! Slurp China; http://misc.yahoo.com.cn/help.html)
Block this crawler
robots.txt — disallow Slurp:
User-agent: Slurp
Disallow: /
Apache .htaccess — return 403:
RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} Slurp [NC]
RewriteRule .* - [F,L]
Nginx — return 403 inside a server block:
if ($http_user_agent ~* "Slurp") {
return 403;
}
← back to all crawlers