larbin

http-library larbin

Open-source web crawler for large-scale indexing projects

About this crawler

larbin is a web crawler identified by the regular-expression pattern larbin in the User-Agent request header. It is categorised as http-library. Use the regex above to detect, log, allow, or block larbin traffic in your web server, CDN edge rules, or robots.txt.

Block-rate · top 25k sites

0.85%
latest snapshot
2026-06-04
matched key: larbin
2026-05-012026-06-041.5%

Technical details

Name
larbin
Pattern
larbin
Tags
http-library
Reference
https://larbin.sourceforge.net/
Added
2026/04/26
Instances
1 known sample(s)

Sample User-Agent strings

larbin_2.6.2 ([email protected])

Block this crawler

robots.txt — disallow larbin:

User-agent: larbin Disallow: /

Apache .htaccess — return 403:

RewriteEngine On RewriteCond %{HTTP_USER_AGENT} larbin [NC] RewriteRule .* - [F,L]

Nginx — return 403 inside a server block:

if ($http_user_agent ~* "larbin") { return 403; }
← back to all crawlers