ToutiaoSpider

search-engine ToutiaoSpider

Toutiao news platform web crawler bot

About this crawler

ToutiaoSpider is a web crawler identified by the regular-expression pattern ToutiaoSpider in the User-Agent request header. It is categorised as search-engine. Use the regex above to detect, log, allow, or block ToutiaoSpider traffic in your web server, CDN edge rules, or robots.txt.

Block-rate · top 25k sites

0.26%
latest snapshot
2026-06-04
matched key: ToutiaoSpider
2026-05-012026-06-040.30%

Technical details

Name
ToutiaoSpider
Pattern
ToutiaoSpider
Tags
search-engine
Reference
https://www.toutiao.com/media_cooperation/
Added
2017/11/02
rDNS suffixes
.toutiao.com
Instances
1 known sample(s)

rDNS verification (FCrDNS)

Verify a request is genuinely ToutiaoSpider with forward-confirmed reverse DNS: the client IP's PTR record must end in one of the suffixes below and a forward A/AAAA lookup of that hostname must return the same IP. UA strings alone are spoofable; FCrDNS is not.

Sample User-Agent strings

Mozilla/5.0 (compatible; ToutiaoSpider/1.0; http://web.toutiao.com/media_cooperation/;)

Block this crawler

robots.txt — disallow ToutiaoSpider:

User-agent: ToutiaoSpider Disallow: /

Apache .htaccess — return 403:

RewriteEngine On RewriteCond %{HTTP_USER_AGENT} ToutiaoSpider [NC] RewriteRule .* - [F,L]

Nginx — return 403 inside a server block:

if ($http_user_agent ~* "ToutiaoSpider") { return 403; }
← back to all crawlers