当前位置:   article > 正文

sogou spider 抓取网站robots.txt 400问题?_106.38.241.167

106.38.241.167

首先,我要说,网站正常访问是没问题的。而且,百度,360 spider都访问ok。
但sogou站长工具测试没问题,后台日志显示,抓取的时候,就是400.
不过,确实看不出来400的与其他有什么差别。由于采用了https访问,所以,做了301转向调整。另外 panjishengwu.com 转向了www.panjisheng.com的转向跳转。都是301.
浏览器测试都是正常。

对于400错误,我一定办法没有。而且,只有这个文件是400.
但这个文件影响了我的收录。 我调整域名,调整nginx的robots.txt配置,都无用。
请求根本到不了后端。到目前为止问题依然没有解决。看到的200状态,都是我利用sogou的站长工具测试的。测试是没有问题的。
看了一些文章,有说是域名不对。我域名设置为所有。针对非我域名做301跳转。
但我收到还是400.

也有说是客户端问题。那这个我就无法验证了。具体怎么回事,如果有大拿清楚原因,还请赐教。

123.126.113.90 - - [02/Mar/2019:15:21:30 +0800] “GET /robots.txt HTTP/1.1” 400 271 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
106.120.173.73 - - [02/Mar/2019:19:22:31 +0800] “GET /robots.txt HTTP/1.1” 400 271 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
123.126.113.91 - - [29/Jan/2019:08:58:34 +0800] “GET /robots.txt HTTP/1.1” 301 185 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
106.120.173.82 - - [29/Jan/2019:23:21:35 +0800] “GET /robots.txt HTTP/1.1” 301 185 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
106.120.173.82 - - [29/Jan/2019:23:22:57 +0800] “GET /robots.txt HTTP/1.1” 400 271 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
123.126.113.90 - - [30/Jan/2019:00:21:36 +0800] “GET /robots.txt HTTP/1.1” 301 185 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
123.126.113.90 - - [30/Jan/2019:00:21:51 +0800] “GET /robots.txt HTTP/1.1” 400 271 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
123.126.113.131 - - [30/Jan/2019:03:22:19 +0800] “GET /robots.txt HTTP/1.1” 301 185 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
123.126.113.131 - - [30/Jan/2019:03:24:23 +0800] “GET /robots.txt HTTP/1.1” 400 271 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
106.120.173.158 - - [30/Jan/2019:07:20:45 +0800] “GET /robots.txt HTTP/1.1” 301 185 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
106.120.173.158 - - [30/Jan/2019:07:21:54 +0800] “GET /robots.txt HTTP/1.1” 400 271 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
123.126.113.91 - - [30/Jan/2019:08:54:13 +0800] “GET /robots.txt HTTP/1.1” 400 271 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
123.126.113.91 - - [30/Jan/2019:21:06:15 +0800] “GET /robots.txt HTTP/1.1” 301 185 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
123.126.113.132 - - [31/Jan/2019:00:22:14 +0800] “GET /robots.txt HTTP/1.1” 301 185 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
123.126.113.132 - - [31/Jan/2019:00:22:14 +0800] “GET /robots.txt HTTP/1.1” 400 271 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
106.38.241.121 - - [31/Jan/2019:03:24:24 +0800] “GET /robots.txt HTTP/1.1” 400 271 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
123.126.113.91 - - [31/Jan/2019:21:26:51 +0800] “GET /robots.txt HTTP/1.1” 400 271 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
123.126.113.91 - - [01/Feb/2019:09:14:04 +0800] “GET /robots.txt HTTP/1.1” 301 185 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
106.120.173.133 - - [02/Feb/2019:00:17:31 +0800] “GET /robots.txt HTTP/1.1” 301 185 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
106.120.173.133 - - [02/Feb/2019:00:18:01 +0800] “GET /robots.txt HTTP/1.1” 400 271 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
111.202.101.250 - - [02/Feb/2019:08:30:09 +0800] “GET /robots.txt HTTP/1.1” 200 21 “-” “Mozilla/5.0 (Linux; Android 6.0.1) AppleWebKit/601.1 (KHTML,like Gecko) Version/9.0 Mobile/13B143 Safari/601.1 (compatible; Sogou web spider/4.0; +http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
123.126.113.91 - - [02/Feb/2019:09:37:55 +0800] “GET /robots.txt HTTP/1.1” 400 271 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
123.126.113.91 - - [02/Feb/2019:22:18:44 +0800] “GET /robots.txt HTTP/1.1” 301 185 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
218.30.103.29 - - [03/Feb/2019:03:23:56 +0800] “GET /robots.txt HTTP/1.1” 400 271 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
123.126.113.91 - - [03/Feb/2019:17:53:33 +0800] “GET /robots.txt HTTP/1.1” 400 271 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
106.120.173.104 - - [04/Feb/2019:00:22:41 +0800] “GET /robots.txt HTTP/1.1” 301 185 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
106.120.173.104 - - [04/Feb/2019:00:23:02 +0800] “GET /robots.txt HTTP/1.1” 400 271 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
106.38.241.111 - - [04/Feb/2019:03:21:36 +0800] “GET /robots.txt HTTP/1.1” 400 271 “-” “Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)” “-”
123.126.113.91 - - [04/Feb/2019:11:54:26 +0800] “GET /robots.txt HTTP/1.1” 301 185 “-

声明:本文内容由网友自发贡献,不代表【wpsshop博客】立场,版权归原作者所有,本站不承担相应法律责任。如您发现有侵权的内容,请联系我们。转载请注明出处:https://www.wpsshop.cn/w/凡人多烦事01/article/detail/659221
推荐阅读
相关标签
  

闽ICP备14008679号