A Patent Crawler
EPFL patent web crawler bot
About this crawler
A Patent Crawler is a web crawler identified by the regular-expression pattern A Patent Crawler in the User-Agent request header. It is categorised as academic. Use the regex above to detect, log, allow, or block A Patent Crawler traffic in your web server, CDN edge rules, or robots.txt.
Block-rate · top 25k sites
No block-rate data for this crawler.
Technical details
- Name
- A Patent Crawler
- Pattern
A Patent Crawler- Tags
- academic
- Reference
- https://www.epfl.ch/research/facilities/scitas/
- Added
- 2023/08/29
- Instances
- 1 known sample(s)
Sample User-Agent strings
E. Orliac, G. Fourestey/2.3 (A Patent Crawler; http://scitas.epfl.ch/; [email protected], [email protected])
Block this crawler
robots.txt — disallow A Patent Crawler:
User-agent: A Patent Crawler
Disallow: /
Apache .htaccess — return 403:
RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} A Patent Crawler [NC]
RewriteRule .* - [F,L]
Nginx — return 403 inside a server block:
if ($http_user_agent ~* "A Patent Crawler") {
return 403;
}
← back to all crawlers