去评论
dz插件网

大批量爬CPMozilla5.0 (Linux; Android 5.0; SM-G900P BuildLRX21T) AppleWebKit...

哥斯拉
2022/11/21 09:59:46
服务器CPU,100%, 日志发现大量这个

222.137.6.70 - - [21/Nov/2022:09:07:00 +0800] "GET /house/list-b55-r8-j5-eY-h1.html HTTP/1.1" 403 548 "-" "Mozilla/5.0 (Linux; Android 5.0; SM-G900P Build/LRX21T) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/92.0.4515.159 Mobile Safari/537.36"
222.137.3.165 - - [21/Nov/2022:09:07:00 +0800] "GET /house/list-b56-t68-eD.html HTTP/1.1" 403 548 "-" "Mozilla/5.0 (Linux; Android 5.0; SM-G900P Build/LRX21T) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/92.0.4515.159 Mobile Safari/537.36"
171.8.236.214 - - [21/Nov/2022:09:07:00 +0800] "GET /house/list-b56-j5-g4.html HTTP/1.1" 403 548 "-" "Mozilla/5.0 (Linux; Android 5.0; SM-G900P Build/LRX21T) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/92.0.4515.159 Mobile Safari/537.36"
123.149.78.203 - - [21/Nov/2022:09:07:00 +0800] "GET /house/list-b54-r8-t4-j8-eY.html HTTP/1.1" 403 548 "-" "Mozilla/5.0 (Linux; Android 5.0; SM-G900P Build/LRX21T) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/92.0.4515.159 Mobile Safari/537.36"
115.60.210.216 - - [21/Nov/2022:09:07:00 +0800] "GET /house/list-b58-r9-l20-n4.html HTTP/1.1" 403 548 "-" "Mozilla/5.0 (Linux; Android 5.0; SM-G900P Build/LRX21T) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/92.0.4515.159 Mobile Safari/537.36"
61.52.106.190 - - [21/Nov/2022:09:07:00 +0800] "GET /house/list-b54-r8-t2-l6-eH-n4.html HTTP/1.1" 403 548 "-" "Mozilla/5.0 (Linux; Android 5.0; SM-G900P Build/LRX21T) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/92.0.4515.159 Mobile Safari/537.36"
1.192.240.209 - - [21/Nov/2022:09:07:00 +0800] "GET /house/list-b55-h1-eT.html HTTP/1.1" 403 548 "-" "Mozilla/5.0 (Linux; Android 5.0; SM-G900P Build/LRX21T) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/92.0.4515.159 Mobile Safari/537.36"
120.245.61.28 - - [21/Nov/2022:09:07:00 +0800] "GET /house/list-b55-eY-g14.html HTTP/1.1" 403 548 "-" "Mozilla/5.0 (Linux; Android 5.0; SM-G900P Build/LRX21T) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/92.0.4515.159 Mobile Safari/537.36"
171.8.173.217 - - [21/Nov/2022:09:07:00 +0800] "GET /house/list-b53-r8-p1-l11-n4.html HTTP/1.1" 403 548 "-" "Mozilla/5.0 (Linux; Android 5.0;


不知道什么蜘蛛,还是人工采集的,抓取不是正常人访问的页面,ip查了下,大部分都是河南郑州的

设置了屏蔽,还在爬采集