C

citeseerxbot

academic citeseerxbot

CiteSeerX academic web crawler bot

About this crawler

citeseerxbot is a web crawler identified by the regular-expression pattern citeseerxbot in the User-Agent request header. It is categorised as academic. Use the regex above to detect, log, allow, or block citeseerxbot traffic in your web server, CDN edge rules, or robots.txt.

Block-rate · top 25k sites

0.065%
latest snapshot
2026-06-04
matched key: citeseerxbot
2026-05-012026-06-040.11%

Technical details

Name
citeseerxbot
Pattern
citeseerxbot
Tags
academic
Added
2010/07/17
Instances
0 known sample(s)

Sample User-Agent strings

no public sample user-agents recorded.

Block this crawler

robots.txt — disallow citeseerxbot:

User-agent: citeseerxbot Disallow: /

Apache .htaccess — return 403:

RewriteEngine On RewriteCond %{HTTP_USER_AGENT} citeseerxbot [NC] RewriteRule .* - [F,L]

Nginx — return 403 inside a server block:

if ($http_user_agent ~* "citeseerxbot") { return 403; }
← back to all crawlers