How do I block Googlebot and other search engine crawlers?
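Q: How do I block Google's crawler and other search engine crawlers? Is there a plugin that can control this? The crawlers hit the site so frequently that they put a heavy load on the server.

A: You can block search engine crawlers with .htaccess, for example: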
SetEnvIfNoCase User-Agent "pyspider|Applebot|Apache-HttpClient|CCBot|Abonti|aggregator|AhrefsBot|YisouSpider|BLEXBot|DotBot|YandexBot|trendictionbot|MagiBot|Exabot|ScooperBot|YandexImages|SemrushBot|MJ12bot|startmebot|ltx71|DuckDuckGo|IndeedBot|SEOkicks|GrapeshotCrawler|crawler4j|Pinterestbot|StormCrawler|paracrawl" bad_bot
Deny from env=bad_bot

Separate the individual crawler names with a vertical bar (|). The match is case-insensitive and applies to any part of the User-Agent header. The list above covers some common junk crawlers and can be used as-is. Google's crawler identifies itself as Googlebot; add that name to the list yourself if you want to block it as well.
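Note that `Deny from` is the Apache 2.2 syntax; on Apache 2.4 it only works if mod_access_compat is loaded. The same rule can probably be written with `Require` instead. This is a minimal sketch, assuming Apache 2.4 with mod_setenvif and mod_authz_core enabled and an `AllowOverride` setting that permits these directives in .htaccess. The shortened bot list with Googlebot added is only an illustration, not a recommended set:

```apache
# Flag any request whose User-Agent contains one of these names (case-insensitive).
# The list is shortened for illustration; Googlebot is added here as an example.
SetEnvIfNoCase User-Agent "Googlebot|AhrefsBot|SemrushBot|MJ12bot|DotBot" bad_bot

# Apache 2.4 style: allow everyone except requests flagged as bad_bot.
<RequireAll>
    Require all granted
    Require not env bad_bot
</RequireAll>
```

Flagged requests should then receive a 403 Forbidden response. One way to check is to send a request with a matching User-Agent (for example with curl's -A option) and confirm the status code.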