Gaisbot
Academic research web crawler from a university project in Taiwan
About this crawler
Gaisbot is a web crawler identified by the regular-expression pattern Gaisbot in the User-Agent request header. It is categorised as academic. Use that pattern to detect, log, allow, or block Gaisbot traffic in your web server or CDN edge rules, and use the Gaisbot user-agent token to address it in robots.txt.
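As a minimal sketch of the detection step, the Python helper below tests a User-Agent header value against the listed pattern. The function name is_gaisbot is hypothetical, and case-insensitive matching is an assumption that mirrors the [NC] and ~* flags in the server rules on this page; the listing itself only gives the bare pattern.

```python
import re
from typing import Optional

# Pattern as given in the listing. IGNORECASE is an assumption, chosen to
# match the case-insensitive Apache and Nginx rules on this page.
GAISBOT_RE = re.compile(r"Gaisbot", re.IGNORECASE)


def is_gaisbot(user_agent: Optional[str]) -> bool:
    """Return True when the User-Agent value contains the Gaisbot pattern."""
    if not user_agent:
        # A missing or empty header cannot match.
        return False
    return GAISBOT_RE.search(user_agent) is not None
```

A caller would typically pass the raw header value, for example is_gaisbot(request_headers.get("User-Agent")), and then log or reject the request. Because User-Agent strings are trivially spoofed, a match indicates what the client claims to be, not proof of origin.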
Block-rate · top 25k sites
0.33%
Technical details
- Name: Gaisbot
- Pattern: Gaisbot
- Tags: academic
- Reference: http://gais.cs.ccu.edu.tw/robot.php
- Added: 2026/04/26
- Instances: 1 known sample
Sample User-Agent strings
Gaisbot/3.0 ([email protected]; http://gais.cs.ccu.edu.tw/robot.php)
Block this crawler
robots.txt — disallow Gaisbot:
User-agent: Gaisbot
Disallow: /
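To confirm that the robots.txt rule above behaves as intended before deploying it, one option is to parse it with Python's standard urllib.robotparser module. This is only a sketch: the helper name can_fetch and the example.com URLs are placeholders, and it shows how a compliant parser reads the rule, not how Gaisbot itself interprets it.

```python
from urllib.robotparser import RobotFileParser

# The rule from this page, inlined so the check needs no network access.
ROBOTS_TXT = """\
User-agent: Gaisbot
Disallow: /
"""


def can_fetch(agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Report whether `agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    print(can_fetch("Gaisbot", "https://example.com/page"))
    print(can_fetch("SomeOtherBot", "https://example.com/page"))
```

Keep in mind that robots.txt is advisory: it only affects crawlers that choose to honour it, which is why the server-level rules are listed as well.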
Apache .htaccess — return 403:
RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} Gaisbot [NC]
RewriteRule .* - [F,L]
Nginx — return 403 inside a server block:
if ($http_user_agent ~* "Gaisbot") {
    return 403;
}
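Where the web server configuration is not editable, the same 403 behaviour can be approximated inside the application. The sketch below is a small WSGI middleware using only the Python standard library; block_gaisbot, hello_app, and status_for are hypothetical names introduced for illustration, and the case-insensitive match is an assumption copied from the Apache and Nginx rules above.

```python
import re
from wsgiref.util import setup_testing_defaults

# Same pattern as the server rules; IGNORECASE mirrors [NC] and ~*.
GAISBOT_RE = re.compile(r"Gaisbot", re.IGNORECASE)


def block_gaisbot(app):
    """Wrap a WSGI app so requests with a matching User-Agent get a 403."""
    def middleware(environ, start_response):
        user_agent = environ.get("HTTP_USER_AGENT", "")
        if GAISBOT_RE.search(user_agent):
            body = b"Forbidden\n"
            start_response("403 Forbidden", [
                ("Content-Type", "text/plain; charset=utf-8"),
                ("Content-Length", str(len(body))),
            ])
            return [body]
        # Any other client is passed through to the wrapped application.
        return app(environ, start_response)
    return middleware


def hello_app(environ, start_response):
    """Placeholder application standing in for a real site."""
    start_response("200 OK", [("Content-Type", "text/plain; charset=utf-8")])
    return [b"hello\n"]


def status_for(user_agent):
    """Run one fake request through the middleware and return its status line."""
    environ = {}
    setup_testing_defaults(environ)
    environ["HTTP_USER_AGENT"] = user_agent
    captured = {}

    def start_response(status, headers):
        captured["status"] = status

    list(block_gaisbot(hello_app)(environ, start_response))
    return captured["status"]
```

Blocking at the server or CDN edge is usually cheaper, since the request never reaches the application; this variant is mainly useful on hosting where only application code can be changed.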