Arquivo-web-crawler

Portuguese web archive for digital heritage

About this crawler

Arquivo-web-crawler is a web crawler identified by the regular-expression pattern Arquivo-web-crawler in the User-Agent request header. It is categorised as an archiver. Use this pattern to detect, log, allow, or block Arquivo-web-crawler traffic in your web server or CDN edge rules, or name the crawler in robots.txt.

Block rate · top 25k sites

Block rate: 0.39%
Latest snapshot: 2026-06-04
Matched key: Arquivo-web-crawler

Technical details

Name: Arquivo-web-crawler
Pattern: Arquivo-web-crawler
Tags: archiver
Reference: https://arquivo.pt/robot
Added: 2026/04/17
Instances: 1 known sample

Sample User-Agent strings

Arquivo-web-crawler (compatible; +https://arquivo.pt/robot)
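The pattern can also be applied in application code, for example to tag requests in logs. The following is a minimal Python sketch, not an official client: the function name is_arquivo_crawler is hypothetical, and case-insensitive matching is an assumption chosen to mirror the [NC] and ~* flags used in the server rules on this page.

```python
import re

# Pattern taken from this listing. Case-insensitive matching is an assumption
# that mirrors the [NC] / ~* flags in the Apache and Nginx rules.
ARQUIVO_PATTERN = re.compile(r"Arquivo-web-crawler", re.IGNORECASE)


def is_arquivo_crawler(user_agent):
    """Return True if the User-Agent header value matches the pattern.

    A missing or empty header is treated as "not this crawler".
    """
    return bool(user_agent) and ARQUIVO_PATTERN.search(user_agent) is not None


if __name__ == "__main__":
    sample = "Arquivo-web-crawler (compatible; +https://arquivo.pt/robot)"
    print(is_arquivo_crawler(sample))
    print(is_arquivo_crawler("Mozilla/5.0"))
```

User-Agent headers are set by the client and can be spoofed, so a match indicates what the request claims to be rather than proving its origin.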

Block this crawler

robots.txt — disallow Arquivo-web-crawler:

User-agent: Arquivo-web-crawler
Disallow: /
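To check that a robots.txt rule like this has the intended effect before deploying it, the rule can be parsed with Python's standard urllib.robotparser module. This is a sketch under stated assumptions: the helper name is_allowed and the example.com URL are hypothetical, and the standard-library parser may differ in edge cases from how any particular crawler interprets robots.txt.

```python
from urllib.robotparser import RobotFileParser

# The rule from this page, with each directive on its own line.
ROBOTS_TXT = """\
User-agent: Arquivo-web-crawler
Disallow: /
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # example.com is a placeholder URL used only for illustration.
    print(is_allowed(ROBOTS_TXT, "Arquivo-web-crawler", "https://example.com/page"))
    print(is_allowed(ROBOTS_TXT, "Googlebot", "https://example.com/page"))
```

robots.txt is advisory: it only affects crawlers that choose to honour it, whereas the server rules on this page refuse the request outright.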

Apache .htaccess — return 403:

RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} Arquivo-web-crawler [NC]
RewriteRule .* - [F,L]

Nginx — return 403 inside a server block:

if ($http_user_agent ~* "Arquivo-web-crawler") { return 403; }
