巧用Robots避免蜘蛛黑洞_威海佰年网络技术有限公司_网站建设_软件开发_私有云_商标注册_公众号_小程序_APP_物联网_ChatGPT

Categories

Tags

巧用Robots避免蜘蛛黑洞

对于百度搜索引擎来说，蜘蛛黑洞特指网站通过极低的成本制造出大量参数过多，及内容雷同但具体参数不同的动态URL ，就像一个无限循环的“黑洞”将spider困住，Baiduspider浪费了大量资源抓取的却是无效网页。

比如很多网站都有筛选功能，通过筛选功能产生的网页经常会被搜索引擎大量抓取，而这其中很大一部分检索价值不高，如“500-1000之间价格的租房”，首先网站（包括现实中）上基本没有相关资源，其次站内用户和搜索引擎用户都没有这种检索习惯。这种网页被搜索引擎大量抓取，只能是占用网站宝贵的抓取配额。那么该如何避免这种情况呢？

我们以北京某团购网站为例，看看该网站是如何利用robots巧妙避免这种蜘蛛黑洞的：

对于普通的筛选结果页，该网站选择使用静态链接，如：http://bj.XXXXX.com/category/zizhucan/weigongcun

同样是条件筛选结果页，当用户选择不同排序条件后，会生成带有不同参数的动态链接，而且即使是同一种排序条件（如：都是按销量降序排列），生成的参数也都是不同的。如：http://bj.XXXXX.com/category/zizhucan/weigongcun/hot?mtt=1.index%2Fpoi.0.0.i1afqhek

http://bj.XXXXX.com/category/zizhucan/weigongcun/hot?mtt=1.index%2Fpoi.0.0.i1afqi5c

对于该团购网来说，只让搜索引擎抓取筛选结果页就可以了，而各种带参数的结果排序页面则通过robots规则拒绝提供给搜索引擎。

robots.txt的文件用法中有这样一条规则：Disallow: /*?* ，即禁止搜索引擎访问网站中所有的动态页面。该网站恰是通过这种方式，对Baiduspider优先展示高质量页面、屏蔽了低质量页面，为Baiduspider提供了更友好的网站结构，避免了黑洞的形成。

来源：百度搜索资源平台百度搜索学堂

Public @ 2020-05-11 16:08:55

怎么做301转向

如果网站使用LAMP（Linux+Apache+MySQL+PHP）主机，可以使用.htaccess文件做301转向。.htaccess是一个普通文字文件，用Notepad等文字编辑软件创建和编辑，存在网站根目录下。.htaccess文件中的指令用于目录特定操作，如转向、错误处理、密码保护等。如果网站用的是Windows主机，可以在控制面板做301转向设定。纯静态HTML页面无法做301转向。在H

Public @ 2018-08-02 16:09:37

301重定向的意义你是否真的了解？能正确运用301重定向？

一、301重定向的作用301重定向的作用有很多，平时站长在做301重定向的时候「网站优化」网站优化宝典之301重定向，你真的会用吗？①是为了URL规范化并集中权重不让权重分散；②是为了网站改版，将旧版本的页面的所有指标全部转移到新版本的页面上；③而实施301重定向可以做到这些，在作用上也是非常的强大；二、在什么情况下必须做301重定向「网站优化」网站优化宝典之301重定向，你真的会用吗？以下七种情

Public @ 2019-08-11 16:09:39

robots.txt文件放在哪里?

robots.txt文件通常放在网站的根目录下，即与主页文件（如index.html）同一级目录下。例如，如果网站的域名是www.example.com，那么robots.txt文件的完整路径可能是www.example.com/robots.txt。

Public @ 2023-06-29 06:00:06

robots文件中屏蔽的为什么还可以收录？

我今天来给大家详细讲解下，先了解几个概念1、robots只是禁止抓取，不是禁止收录2、另外还有nofollow的作用不是不抓取这个链接，是不从这个链接传递权重了解这2个概念后，我们再来讨论怎么处理这类收录问题：robots写正确的同时，不要在任何一家收录的网站发外链，友链，也不要主动提交百度，这样才可以保证不被搜索引擎收录，为什么呢？大家百度查一下淘宝，如图：按照道理淘宝写了robots怎么还是收

Public @ 2021-04-26 16:09:29

更多您感兴趣的搜索

基本文件流程错误 SQL 调试

/www/wwwroot/briline.net/public/index.php ( 0.79 KB )
/www/wwwroot/briline.net/public/public.php ( 1.08 KB )
/www/wwwroot/briline.net/thinkphp/start.php ( 0.73 KB )
/www/wwwroot/briline.net/thinkphp/base.php ( 2.66 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Loader.php ( 19.47 KB )
/www/wwwroot/briline.net/vendor/composer/autoload_namespaces.php ( 0.21 KB )
/www/wwwroot/briline.net/vendor/composer/autoload_psr4.php ( 0.84 KB )
/www/wwwroot/briline.net/vendor/composer/autoload_classmap.php ( 0.14 KB )
/www/wwwroot/briline.net/vendor/composer/autoload_files.php ( 0.42 KB )
/www/wwwroot/briline.net/vendor/qiniu/php-sdk/src/Qiniu/functions.php ( 7.10 KB )
/www/wwwroot/briline.net/vendor/qiniu/php-sdk/src/Qiniu/Config.php ( 0.70 KB )
/www/wwwroot/briline.net/vendor/topthink/think-captcha/src/helper.php ( 1.59 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Route.php ( 59.82 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Config.php ( 6.03 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Validate.php ( 40.27 KB )
/www/wwwroot/briline.net/vendor/topthink/think-queue/src/config.php ( 0.77 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Console.php ( 21.22 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Error.php ( 3.59 KB )
/www/wwwroot/briline.net/thinkphp/convention.php ( 10.31 KB )
/www/wwwroot/briline.net/thinkphp/library/think/App.php ( 21.04 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Request.php ( 50.94 KB )
/www/wwwroot/briline.net/app/config.php ( 11.25 KB )
/www/wwwroot/briline.net/app/database.php ( 1.41 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Hook.php ( 4.76 KB )
/www/wwwroot/briline.net/app/tags.php ( 1.16 KB )
/www/wwwroot/briline.net/app/common/behavior/InitBase.php ( 8.17 KB )
/www/wwwroot/briline.net/app/common.php ( 23.29 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Env.php ( 1.25 KB )
/www/wwwroot/briline.net/thinkphp/helper.php ( 17.86 KB )
/www/wwwroot/briline.net/app/function.php ( 0.78 KB )
/www/wwwroot/briline.net/app/extend.php ( 13.29 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Debug.php ( 7.06 KB )
/www/wwwroot/briline.net/app/common/model/Config.php ( 0.78 KB )
/www/wwwroot/briline.net/app/common/model/ModelBase.php ( 12.18 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Model.php ( 66.83 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Db.php ( 6.54 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Log.php ( 5.84 KB )
/www/wwwroot/briline.net/thinkphp/library/think/db/connector/Mysql.php ( 3.94 KB )
/www/wwwroot/briline.net/thinkphp/library/think/db/Connection.php ( 29.97 KB )
/www/wwwroot/briline.net/thinkphp/library/think/db/Query.php ( 86.80 KB )
/www/wwwroot/briline.net/thinkphp/library/think/db/builder/Mysql.php ( 2.16 KB )
/www/wwwroot/briline.net/thinkphp/library/think/db/Builder.php ( 30.47 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Cache.php ( 6.17 KB )
/www/wwwroot/briline.net/thinkphp/library/think/cache/driver/File.php ( 7.46 KB )
/www/wwwroot/briline.net/thinkphp/library/think/cache/Driver.php ( 5.52 KB )
/www/wwwroot/briline.net/app/common/behavior/InitHook.php ( 1.25 KB )
/www/wwwroot/briline.net/app/common/model/Hook.php ( 0.77 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Lang.php ( 6.95 KB )
/www/wwwroot/briline.net/thinkphp/lang/zh-cn.php ( 3.85 KB )
/www/wwwroot/briline.net/app/route.php ( 0.91 KB )
/www/wwwroot/briline.net/app/index/config.php ( 0.96 KB )
/www/wwwroot/briline.net/app/index/common.php ( 0.68 KB )
/www/wwwroot/briline.net/app/index/controller/Wiki.php ( 2.44 KB )
/www/wwwroot/briline.net/app/index/controller/IndexBase.php ( 1.10 KB )
/www/wwwroot/briline.net/app/common/controller/ControllerBase.php ( 4.75 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Controller.php ( 6.20 KB )
/www/wwwroot/briline.net/thinkphp/library/traits/controller/Jump.php ( 4.97 KB )
/www/wwwroot/briline.net/thinkphp/library/think/View.php ( 6.86 KB )
/www/wwwroot/briline.net/thinkphp/library/think/view/driver/Think.php ( 5.61 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Template.php ( 46.46 KB )
/www/wwwroot/briline.net/thinkphp/library/think/template/driver/File.php ( 2.24 KB )
/www/wwwroot/briline.net/app/index/logic/Wiki.php ( 6.16 KB )
/www/wwwroot/briline.net/app/index/logic/IndexBase.php ( 0.79 KB )
/www/wwwroot/briline.net/app/common/logic/LogicBase.php ( 0.83 KB )
/www/wwwroot/briline.net/app/common/model/Article.php ( 0.78 KB )
/www/wwwroot/briline.net/app/common/model/ArticleTongji.php ( 0.79 KB )
/www/wwwroot/briline.net/thinkphp/library/think/paginator/driver/Bootstrap.php ( 5.90 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Paginator.php ( 9.45 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Collection.php ( 8.63 KB )
/www/wwwroot/briline.net/runtime/temp/ead4923c25a6b3f986358f7070f93dfa.php ( 56.51 KB )
/www/wwwroot/briline.net/thinkphp/library/think/Response.php ( 8.64 KB )
/www/wwwroot/briline.net/thinkphp/library/think/debug/Html.php ( 4.27 KB )

[ DB ] CONNECT:[ UseTime:0.025113s ] mysql:dbname=briline.net;host=106.14.77.182;port=3306;charset=utf8
[ SQL ] SHOW COLUMNS FROM `ob_article` [ RunTime:0.017862s ]
[ SQL ] SELECT * FROM `ob_article` WHERE `id` = 4376 LIMIT 1 [ RunTime:0.017120s ]
[ EXPLAIN : array ( 'id' => 1, 'select_type' => 'SIMPLE', 'table' => 'ob_article', 'type' => 'const', 'possible_keys' => 'PRIMARY', 'key' => 'PRIMARY', 'key_len' => '4', 'ref' => 'const', 'rows' => 1, 'extra' => NULL, ) ]
[ SQL ] select * from `ob_article_tongji` where category_id=12 and mark_type='cate' order by times desc limit 15 [ RunTime:0.017324s ]
[ EXPLAIN : array ( 'id' => 1, 'select_type' => 'SIMPLE', 'table' => 'ob_article_tongji', 'type' => 'ALL', 'possible_keys' => NULL, 'key' => NULL, 'key_len' => NULL, 'ref' => NULL, 'rows' => 608, 'extra' => 'Using where; Using filesort', ) ]
[ SQL ] select * from `ob_article_tongji` where category_id=12 and mark_type='tags' order by times desc limit 100 [ RunTime:0.017520s ]
[ EXPLAIN : array ( 'id' => 1, 'select_type' => 'SIMPLE', 'table' => 'ob_article_tongji', 'type' => 'ALL', 'possible_keys' => NULL, 'key' => NULL, 'key_len' => NULL, 'ref' => NULL, 'rows' => 608, 'extra' => 'Using where; Using filesort', ) ]
[ SQL ] select * from `ob_article_tongji` where category_id=12 and mark_type='tags' order by rand() limit 30 [ RunTime:0.017966s ]
[ EXPLAIN : array ( 'id' => 1, 'select_type' => 'SIMPLE', 'table' => 'ob_article_tongji', 'type' => 'ALL', 'possible_keys' => NULL, 'key' => NULL, 'key_len' => NULL, 'ref' => NULL, 'rows' => 608, 'extra' => 'Using where; Using temporary; Using filesort', ) ]
[ SQL ] SELECT * FROM `ob_article` WHERE `id` = 4376 LIMIT 1 [ RunTime:0.017047s ]
[ EXPLAIN : array ( 'id' => 1, 'select_type' => 'SIMPLE', 'table' => 'ob_article', 'type' => 'const', 'possible_keys' => 'PRIMARY', 'key' => 'PRIMARY', 'key_len' => '4', 'ref' => 'const', 'rows' => 1, 'extra' => NULL, ) ]
[ SQL ] update `ob_article` set views=views+1 where id=4376 [ RunTime:0.018209s ]
[ SQL ] SELECT COUNT(*) AS tp_count FROM `ob_article` WHERE `category_id` = 12 AND `cate` = '威海网站结构优化' AND `status` <> -1 LIMIT 1 [ RunTime:0.025319s ]
[ EXPLAIN : array ( 'id' => 1, 'select_type' => 'SIMPLE', 'table' => 'ob_article', 'type' => 'ALL', 'possible_keys' => NULL, 'key' => NULL, 'key_len' => NULL, 'ref' => NULL, 'rows' => 8035, 'extra' => 'Using where', ) ]
[ SQL ] SELECT * FROM `ob_article` WHERE `category_id` = 12 AND `cate` = '威海网站结构优化' AND `status` <> -1 ORDER BY rand() LIMIT 0,2 [ RunTime:0.037137s ]
[ EXPLAIN : array ( 'id' => 1, 'select_type' => 'SIMPLE', 'table' => 'ob_article', 'type' => 'ALL', 'possible_keys' => NULL, 'key' => NULL, 'key_len' => NULL, 'ref' => NULL, 'rows' => 8035, 'extra' => 'Using where; Using temporary; Using filesort', ) ]
[ SQL ] SELECT COUNT(*) AS tp_count FROM `ob_article` WHERE `category_id` = 12 AND `tags` = '威海Robots' AND `status` <> -1 LIMIT 1 [ RunTime:0.025547s ]
[ EXPLAIN : array ( 'id' => 1, 'select_type' => 'SIMPLE', 'table' => 'ob_article', 'type' => 'ALL', 'possible_keys' => NULL, 'key' => NULL, 'key_len' => NULL, 'ref' => NULL, 'rows' => 8035, 'extra' => 'Using where', ) ]
[ SQL ] SELECT * FROM `ob_article` WHERE `category_id` = 12 AND `tags` = '威海Robots' AND `status` <> -1 ORDER BY rand() LIMIT 0,2 [ RunTime:0.033955s ]
[ EXPLAIN : array ( 'id' => 1, 'select_type' => 'SIMPLE', 'table' => 'ob_article', 'type' => 'ALL', 'possible_keys' => NULL, 'key' => NULL, 'key_len' => NULL, 'ref' => NULL, 'rows' => 8035, 'extra' => 'Using where; Using temporary; Using filesort', ) ]

0.461684s

ShowPageTrace