baiduserp 2.0.5 → 2.0.8

checksums.yaml CHANGED
@@ -1,7 +1,7 @@
1
1
  ---
2
2
  SHA1:
3
- metadata.gz: 1138e5971b4ff973f8abd7096fe76997887f7f9d
4
- data.tar.gz: 9582079945b7ad67a128107429927ea1831bbfb6
3
+ metadata.gz: 7e641beffc3596d64307f569ee35549c9fd8df39
4
+ data.tar.gz: 9c3ead060412c5a70fa07a01fedf8b903f0fce8f
5
5
  SHA512:
6
- metadata.gz: 5e8e18d34e4820b2e1b7e2d55c23d15b562c5e8b14a7084c15657fe1dd718de50b35897c0e5642d551acb3e342a7b824e91eae462bcd8448c1a8944e30d3d174
7
- data.tar.gz: dea624fc9c1231affe3d0a9e7344dc6ae325ab0c9b6f1c77ecd4caa9d8eff2909b1d911ab3f16574a453617683d759e7c0da92746370965b9bfae4814ec02b08
6
+ metadata.gz: 9a76f5a9d0cd00076bc2c30367626239b77a820ec6a268180efc9b3e1d063b5ecb658efb8ec140df0959b3a794d576b2af92f12c4707f9bdf7d8510f45e13037
7
+ data.tar.gz: 08a5df964b5b9c88c1a540e6049708743d4f8be88804829d0ebad80540ff35db6ce8c7f8607616563d2a998d1ded4324c5b9d7912645468b7870e35593ad24f9
data/README.md CHANGED
@@ -1,8 +1,26 @@
1
1
  # Baiduserp
2
2
 
3
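- This gem is dedicated to parsing Baidu search result pages. Its main function is to extract the rank and URL of the ten organic results, plus the rank and URL of the left-side and right-side ads.
3
+ This gem is dedicated to parsing Baidu search result pages, with the goal of extracting as much information from the SERP as possible.
4
+ (Note: this is not currently a program for bulk keyword rank checking, but it can serve as the Baidu SERP parsing module inside bulk rank-checking software.)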
4
5
 
5
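- This is currently only a basically usable version and may have all sorts of problems; bug reports are welcome.
6
+ ## Features
7
+ ### Parses the SERP as completely as possible
8
+ Baidu's SERP keeps getting more complex: new result styles keep appearing on the left side, and a lot of content has been added on the right side as well.
9
+ This gem parses the SERP into Ruby data structures.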
10
+
11
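+ When analyzing a SERP you may want to look at many kinds of information on the page, such as SEO rank, SEM rank, competitor rank,
12
+ title/description text, related keywords, right-side recommendations, whether a Baidu Open Platform result is present, and so on.
13
+
14
+ This gem parses all of the information above for later analysis. If usage grows, or Baidu launches new products,
15
+ parsers for new modules can be added.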
16
+
17
+
18
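+ ### Command-line interface (usable for testing, and can output JSON)
19
+ Besides the Ruby API, users of other programming languages can use the command-line interface, which outputs the result data as JSON.
20
+ See below for detailed usage.
21
+
22
+ ### Known issues
23
+ This is currently only a basically usable version and may have all sorts of problems; bug reports are welcome. [List of known issues](https://github.com/semseo/baiduserp/issues).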
6
24
 
7
25
  ## Installation
8
26
 
@@ -12,7 +30,7 @@ Linux or Mac. On Linux, a recent Ubuntu or Fedora release is recommended.
12
30
 
13
31
  2 Install the Ruby environment
14
32
 
15
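- Only Ruby 1.9 and above is supported. The best way to install Ruby is via [RVM](https://rvm.io/); search for how to use RVM, there are many tutorials.
33
+ Only Ruby 1.9 and above is supported. The best way to install Ruby is via [RVM](https://rvm.io/). For how to use RVM, see [http://ruby-china.org/wiki/install_ruby_guide](http://ruby-china.org/wiki/install_ruby_guide); it also installs some Rails-related software you do not need, but the guide is very detailed.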
16
34
 
17
35
  On recent Ubuntu or Fedora releases, Ruby 1.9 can also be installed via apt-get or yum.
18
36
 
@@ -64,120 +82,140 @@ Usage:
64
82
  -f, --file File Parse Local File
65
83
  ```
66
84
 
67
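- The final result is a data structure of nested hashes and arrays. In each result's paid value, 0 means an organic result, 1 a left-side ad, and 2 a right-side ad. Sample output:
85
+ The final result is a data structure of nested hashes and arrays. Sample output: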
68
86
 
69
87
  ```
70
88
  $ baiduserp -s 香港
71
- {:serp_results=>
72
- [{:rank=>4001,
73
- :url=>"Agoda.Com.Cn/Hong_Kong",
74
- :title=>nil,
75
- :content=>"Agoda.Com.Cn/Hong_Kong",
76
- :paid=>1},
77
- {:rank=>4002,
78
- :url=>"WWW.XKCITS.COM///TEL0755-86178888",
79
- :title=>nil,
80
- :content=>"WWW.XKCITS.COM///TEL0755-86178888",
81
- :paid=>1},
82
- {:rank=>1,
83
- :url=>"baike.baidu.com/view/2607.htm",
84
- :title=>"香港_百度百科",
85
- :content=>
86
- "香港是一座繁华的国际化大都市。地处珠江以东,与广东省深圳市相接。1842年至1997年,香港曾经是英国的殖民地,1997年7月1日,依据中英政府共同签... 简介 - 历史概况 - 大事年表 - 地名来源 - 地理环境 - 更多>> baike.baidu.com/view/2607.htm 2013-2-5 ",
87
- :paid=>0,
88
- :mu=>"http://baike.baidu.com/view/2607.htm"},
89
+ {:ads_right=>
90
+ [{:rank=>1,
91
+ :title=>"预订香港酒店上携程,全景图..",
92
+ :content=>"订香港酒店,享受有房保障,服务好,折扣低,返现高达201元,订香港酒店上携程超划算!",
93
+ :site=>"www.ctrip.com"},
89
94
  {:rank=>2,
90
- :url=>"www.gov.hk/",
91
- :title=>"GovHK 香港政府一站通:本港居民",
92
- :content=>
93
- "香港政府为当地居民提供的资讯和服务,内容包括通讯及科技、文化、康乐及运动、教育及培训、就业、环境、政府、法律及治安、保健及医疗服务、房屋及社会服务、入境事务...",
94
- :paid=>0},
95
+ :title=>"香港-去哪儿网度假频道,聪明..",
96
+ :content=>"香港-去哪儿网度假频道,比价首选!180000条报价实时更新,先比价后出行!",
97
+ :site=>"dujia.qunar.com"},
95
98
  {:rank=>3,
96
- :url=>"www.mafengwo.cn/travel-scenic-spot/m...",
97
- :title=>"2013香港旅游攻略,香港自助游攻略,蚂蜂窝香港出游攻略游记 - 蚂蜂窝",
98
- :content=>
99
- "2013香港旅游攻略,介绍了香港旅游景点、线路、美食、住宿、地图等香港旅游攻略信息,了解香港旅游如莎莎化妆品、迪士尼、购物天堂和美食小吃等自助游攻略信息来蚂蜂窝...",
100
- :paid=>0},
99
+ :title=>"香港香港怎么玩最划算?",
100
+ :content=>"深圳旅行社香港,香港旅游您的超值之选!天天出团港澳游专线,缤纷全程绝不",
101
+ :site=>"www.sztygl128.com"},
101
102
  {:rank=>4,
102
- :url=>"image.baidu.com",
103
- :title=>"香港_百度图片 - 举报图片",
104
- :content=>"相关推荐:香港电影金像奖提名香港地图香港迪士尼乐园香港海洋公园",
105
- :paid=>0,
106
- :mu=>
107
- "http://image.baidu.com/i?tn=baiduimage&ct=201326592&lm=-1&cl=2&fr=ala1&word=%CF%E3%B8%DB"},
103
+ :title=>"香港旅游攻略-150000条点评",
104
+ :content=>"还没来过香港?507个景点都玩遍?到到网告诉你网友怎么玩(150000张游记照片)!",
105
+ :site=>"www.daodao.com"},
108
106
  {:rank=>5,
109
- :url=>"lvyou.baidu.com/",
110
- :title=>" 香港旅游攻略_百度旅游",
111
- :content=>
112
- " 香港是亚洲繁华的大都市,地区及国际金融中心之一,条件优越的天然深水港,1842年至1997年是英国的殖民地,1997年7月1日回归中国。\n作为“...,旅游旺季为秋冬季节最佳...",
113
- :paid=>0,
114
- :mu=>"http://lvyou.baidu.com/scene/view/79c0adc41efa15d8330ab4f5"},
107
+ :title=>"香港旅游首选北京青年旅行社..",
108
+ :content=>"北京青年旅行社专业香港旅游旅行社,高品质服务,天天折扣价,",
109
+ :site=>"www.hqly8.com"},
115
110
  {:rank=>6,
116
- :url=>"map.baidu.com/",
117
- :title=>"香港特别行政区地图",
111
+ :title=>"香港旅游特价啦!香港旅游价..",
112
+ :content=>"本社邀你一起体验超值香港旅游,全程绝无强制购物,行程安排合理.",
113
+ :site=>"www.cctbj.net"},
114
+ {:rank=>7,
115
+ :title=>"香港旅游线路",
116
+ :content=>"北京旅行社提供香港咨询服务,多条精品旅游线路供您选择.",
117
+ :site=>"www.ctslyw.com"},
118
+ {:rank=>8,
119
+ :title=>"全新香港旅游报价,香港旅游..",
120
+ :content=>"北京国际旅行社,精选多条香港旅游线路,信誉保证,全程无隐性消费.",
121
+ :site=>"www.quly8.net"}],
122
+ :ads_top=>
123
+ [{:rank=>1,
124
+ :title=>"香港酒店预订 在Agoda立享1-7折",
125
+ :content=>"香港酒店预订,尽在Agoda,网上订购低价回馈,为您节省75%.",
126
+ :site=>"www.agoda.com"},
127
+ {:rank=>2,
128
+ :title=>"香港酒店预订 在Agoda立享1-7折",
129
+ :content=>"香港酒店预订,尽在Agoda,网上订购低价回馈,为您节省75%.",
130
+ :site=>"www.agoda.com"}],
131
+ :pinpaizhuanqu=>false,
132
+ :ranks=>
133
+ [{:rank=>1,
134
+ :url=>
135
+ "http://baike.baidu.com/link?url=Ujomxkw-4Whq7C7TI6do9nxHr3G0sO6ywJ3SZfr-lX4qQiht-2rnuGomrclwc4bJ",
136
+ :title=>"香港_百度百科",
137
+ :content=>nil,
138
+ :mu=>"http://baike.baidu.com/view/2607.htm",
139
+ :baiduopen=>false},
140
+ {:rank=>2,
141
+ :url=>"http://lvyou.baidu.com/xianggang/",
142
+ :title=>"2013香港旅游攻略_香港景点线路游记_百度旅游",
143
+ :content=>nil,
144
+ :mu=>"http://lvyou.baidu.com/xianggang/",
145
+ :baiduopen=>false},
146
+ {:rank=>3,
147
+ :url=>
148
+ "http://image.baidu.com/i?tn=baiduimage&ct=201326592&lm=-1&cl=2&fr=ala1&word=%CF%E3%B8%DB",
149
+ :title=>"香港_百度图片 - 举报图片",
118
150
  :content=>nil,
119
- :paid=>0,
120
151
  :mu=>
121
- "http://map.baidu.com/?newmap=1&s=s%26wd%3D%25E9%25A6%2599%25E6%25B8%25AF%25E7%2589%25B9%25E5%2588%25AB%25E8%25A1%258C%25E6%2594%25BF%25E5%258C%25BA%26c%3D2912&fr=alac0&from=alamap"},
152
+ "http://image.baidu.com/i?tn=baiduimage&ct=201326592&lm=-1&cl=2&fr=ala1&word=%CF%E3%B8%DB",
153
+ :baiduopen=>false},
154
+ {:rank=>4,
155
+ :url=>"http://www.gov.hk/sc/residents/",
156
+ :title=>"GovHK 香港政府一站通:本港居民",
157
+ :content=>
158
+ "香港政府为当地居民提供的资讯和服务,内容包括通讯及科技、文化、康乐及运动、教育及培训、就业、环境、政府、法律及治安、保健及医疗服务、房屋及社会服务、入境事务...",
159
+ :mu=>nil,
160
+ :baiduopen=>false},
161
+ {:rank=>5,
162
+ :url=>"http://www.baidu.com/s?rtt=2&tn=baiduwb&rn=20&cl=2&wd=%CF%E3%B8%DB",
163
+ :title=>"香港的最新微博结果",
164
+ :content=>nil,
165
+ :mu=>"http://www.baidu.com/s?rtt=2&tn=baiduwb&rn=20&cl=2&wd=%CF%E3%B8%DB",
166
+ :baiduopen=>false},
167
+ {:rank=>6,
168
+ :url=>"http://tieba.baidu.com/f?kw=%CF%E3%B8%DB&fr=ala0",
169
+ :title=>"香港吧 百度贴吧",
170
+ :content=>
171
+ "月活跃用户:38万人  累计发贴:202万 图片(1856)  |  视频(61)  |  精品贴(335) 香港和上海的夜景那个美?????????? 点击:439 回复:259 最近怎么那么多自以为漂亮的S!B女求认证啊。 点击:303 回复:69 什么时候,去香港不必签证,和去北京上海一样容... 点击:839 回复:187 查看更多香港吧内容>> tieba.baidu.com/香港?fr=ala0 2013-10-18",
172
+ :mu=>"http://tieba.baidu.com/f?kw=%CF%E3%B8%DB&fr=ala0",
173
+ :baiduopen=>false},
122
174
  {:rank=>7,
123
- :url=>nil,
124
- :title=>" 中国香港天气预报_春节期间未来一周天气_中国天气网 - 最近访问:",
175
+ :url=>"http://www.weather.com.cn/html/weather/101320101.shtml",
176
+ :title=>"香港天气预报_一周天气预报_中国天气网 - 最近访问:",
125
177
  :content=>nil,
126
- :paid=>0,
127
- :baiduopen=>1,
128
- :mu=>"http://www.weather.com.cn/weather/101320101.shtml"},
178
+ :mu=>"http://www.weather.com.cn/html/weather/101320101.shtml",
179
+ :baiduopen=>true},
129
180
  {:rank=>8,
130
- :url=>nil,
131
- :title=>" 香港吧 百度贴吧 ",
132
- :content=>nil,
133
- :paid=>0,
134
- :mu=>"http://tieba.baidu.com/f?kw=%CF%E3%B8%DB&fr=ala0"},
181
+ :url=>"http://www.mafengwo.cn/travel-scenic-spot/mafengwo/10189.html",
182
+ :title=>"2013香港旅游攻略,香港自助游攻略,蚂蜂窝香港出游攻略游记 - 蚂蜂窝",
183
+ :content=>
184
+ "在香港寻吃完全就是一场舌尖的盛宴,从街边小吃到世界顶级的米其林餐厅任您选择,茶餐厅、早茶、烧腊和及甜品极具港式风味,世界各地的美食料理也一个不落单。 香港...",
185
+ :mu=>nil,
186
+ :baiduopen=>false},
135
187
  {:rank=>9,
136
- :url=>nil,
137
- :title=>"香港的最新相关信息",
188
+ :url=>"http://hongkong.cncn.com/",
189
+ :title=>"香港旅游攻略_香港香港旅游景点_香港旅游网",
138
190
  :content=>
139
- "联手香港知名餐企 陶然居赴港开店 新华网重庆频道 2小时前今后,在香港也能吃到正宗芋儿鸡和田螺了。严琦昨日告诉商报记者,陶然居已与香港金百加集团达成合作协议——今年,金百加到重庆发展,陶然居则在香港开...",
140
- :paid=>0,
141
- :mu=>"http://www.baidu.com/s?tn=baidurt&rtt=1&bsst=1&wd=%CF%E3%B8%DB"},
191
+ "香港欣欣旅游网,提供香港香港旅游景点推荐、10月香港旅游攻略、香港旅行社、香港旅游线路、香港酒店预订、香港旅游地图等出行指南及旅游服务●欣欣旅游网 CNCN.com ...",
192
+ :mu=>nil,
193
+ :baiduopen=>false},
142
194
  {:rank=>10,
143
- :url=>"hongkong.cncn.com/",
144
- :title=>"香港旅游攻略2011_香港旅游网",
145
- :content=>
146
- "香港欣欣旅游网,提供香港香港旅游景点推荐、2月香港旅游攻略、香港旅行社、香港旅游线路、香港酒店预订、香港旅游地图等出行指南及旅游服务●欣欣旅游网 CNCN.com 更新...",
147
- :paid=>0},
148
- {:paid=>2,
149
- :rank=>1,
150
- :url=>"www.gay688.com",
151
- :title=>"香港旅游",
152
- :content=>"在深圳找香港旅游,我亲身体验,觉着最好的还是国际香港旅游,"},
153
- {:paid=>2,
154
- :rank=>2,
155
- :url=>"www.galy678.com",
156
- :title=>"国旅香港 港澳游四天三晚贵..",
157
- :content=>"香港深圳国旅,2013特别港澳游观光路线,港澳游天天出团.贴心的"},
158
- {:paid=>2,
159
- :rank=>3,
160
- :url=>"Hotel.Qunar.Com",
161
- :title=>"香港-香港订酒店,就上去哪儿..",
162
- :content=>"订香港酒店?驴友真实点评,酒店高清图,实时优惠价格,出行乐无忧!"},
163
- {:paid=>2,
164
- :rank=>4,
165
- :url=>"DaoDao.com",
166
- :title=>"去香港,看真实用户点评,来Da..",
167
- :content=>"来DaoDao.com看香港相关的380家酒店,更有120000条旅客真实点评."}],
168
- :result_num=>100000000,
169
- :baidubrand=>0,
195
+ :url=>"http://www.baidu.com/s?tn=baidurt&rtt=1&bsst=1&wd=%CF%E3%B8%DB",
196
+ :title=>"香港的最新相关信息",
197
+ :content=>nil,
198
+ :mu=>"http://www.baidu.com/s?tn=baidurt&rtt=1&bsst=1&wd=%CF%E3%B8%DB",
199
+ :baiduopen=>false}],
170
200
  :related_keywords=>
171
- ["香港天气",
172
- "香港电影",
201
+ ["香港电影",
202
+ "香港旅游",
203
+ "香港天气",
173
204
  "香港大学",
205
+ "香港购物",
174
206
  "香港地图",
175
- "苹果香港官网",
207
+ "香港电视剧",
176
208
  "香港地铁",
177
209
  "香港中文大学",
178
- "香港电视剧",
179
- "香港旅游",
180
- "苹果香港"]}
210
+ "香港苹果官网"],
211
+ :result_num=>100000000,
212
+ :right_hotel=>nil,
213
+ :right_personinfo=>nil,
214
+ :right_relaperson=>
215
+ [{:title=>"香港特别行政区行政区划", :names=>["油尖旺区", "九龙城区", "湾仔", "元朗区", "西贡区"]},
216
+ {:title=>"全球性国际金融中心", :names=>["新加坡", "纽约", "东京", "伦敦"]},
217
+ {:title=>"其他人还搜", :names=>["台北", "上海", "直布罗陀", "海南", "深圳"]}],
218
+ :right_weather=>nil}
181
219
  ```
182
220
 
183
221
  ## Contributing
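
Reviewer note on the README hunk above: the 2.0.8 sample drops the per-result `:paid` flag and instead splits results into separate `:ads_top`, `:ads_right` and `:ranks` arrays, with `:baiduopen` now a boolean. The sketch below shows one way a caller might read that structure. The hash is a hand-trimmed copy of two entries from the sample, and `rank_for` is a hypothetical helper, not part of the gem.

```ruby
require "uri"

# Hand-trimmed copy of two :ranks entries from the sample output above.
result = {
  ads_top: [{ rank: 1, site: "www.agoda.com" }],
  ranks: [
    { rank: 4, url: "http://www.gov.hk/sc/residents/", mu: nil, baiduopen: false },
    { rank: 7, url: "http://www.weather.com.cn/html/weather/101320101.shtml",
      mu: "http://www.weather.com.cn/html/weather/101320101.shtml", baiduopen: true }
  ]
}

# Hypothetical helper: organic rank of the first result whose host ends with
# `domain`, or nil when no organic result matches.
def rank_for(result, domain)
  hit = result[:ranks].find do |r|
    host = URI.parse(r[:url]).host
    host && host.end_with?(domain)
  end
  hit && hit[:rank]
end

# Ranks of organic results flagged as Baidu Open Platform entries.
open_platform_ranks = result[:ranks].select { |r| r[:baiduopen] }.map { |r| r[:rank] }
```

Code written against 2.0.5 that filtered `:serp_results` on `paid == 0` would need a change along these lines after upgrading.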
@@ -2,8 +2,8 @@ require "baiduserp/version"
2
2
  require 'baiduserp/parser'
3
3
 
4
4
  module Baiduserp
5
- def self.search(keyword)
6
- Parser.new.search keyword
5
+ def self.search(keyword,page=1)
6
+ Parser.new.search(keyword,page)
7
7
  end
8
8
 
9
9
  def self.parse(html)
@@ -2,8 +2,29 @@ require 'httparty'
2
2
 
3
3
  module Baiduserp
4
4
  class Client
5
+ AllUserAgents = YAML.load(open(File.expand_path('../user_agents.yml',__FILE__)))
6
+
7
+ def self.rand_ua
8
+ AllUserAgents[rand(AllUserAgents.size)]
9
+ end
10
+
5
11
  include HTTParty
6
12
  base_uri 'www.baidu.com'
7
13
  follow_redirects false
14
+ headers "User-Agent" => self.rand_ua
15
+
16
+ def self.get_serp(url, retries = 6)
17
+ if retries > 0
18
+ response = self.get(url)
19
+ if response.code == 301
20
+ sleep(rand(60)+60)
21
+ response = self.get_serp(url,retries - 1)
22
+ end
23
+ return response.body
24
+ else
25
+ return nil
26
+ end
27
+ end
28
+
8
29
  end
9
30
  end
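
Reviewer note on the `get_serp` hunk above: after a 301, the recursive call already returns a body string (or `nil`), yet the result is assigned to `response` and `response.body` is called on it again, which would raise `NoMethodError`. The sketch below expresses the same bounded-retry idea as a loop. It is not the gem's code: `get_serp_with_retries`, `fetch` and `pause` are hypothetical names, and `fetch` stands in for the HTTP call so the example runs without network access.

```ruby
# Minimal stand-in for an HTTP response (assumption: only #code and #body matter).
Response = Struct.new(:code, :body)

# Try `fetch` up to `retries` times. A 301 is treated as "blocked, wait and
# retry", as in the diff. Returns the body string, or nil if every attempt
# was redirected.
def get_serp_with_retries(url, fetch:, retries: 6,
                          pause: ->(_attempt) { sleep(60 + rand(60)) })
  retries.times do |attempt|
    response = fetch.call(url)
    return response.body unless response.code == 301
    pause.call(attempt) # back off 60-119 seconds by default
  end
  nil
end

# Usage with a fake fetcher: first a redirect, then a normal page.
responses = [Response.new(301, ""), Response.new(200, "<html>ok</html>")]
fake_fetch = ->(_url) { responses.shift }
body = get_serp_with_retries("http://www.baidu.com/s?wd=test",
                             fetch: fake_fetch, pause: ->(_attempt) {})
```

Injecting `pause` keeps the long sleep out of tests while leaving the default behaviour close to the diff.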
@@ -26,15 +26,18 @@ module Baiduserp
26
26
  @serp
27
27
  end
28
28
 
29
- def search(keyword)
30
- parse_file("http://www.baidu.com/s?wd=#{keyword}")
29
+ def search(keyword,page=1)
30
+ keyword = keyword.gsub(" ","+")
31
+ page = page.to_i > 1 ? "&pn=#{page.to_i-1}0" : ""
32
+ serp_url = URI.escape("http://www.baidu.com/s?wd=#{keyword}#{page}&ie=utf-8")
33
+ parse_file(serp_url)
31
34
  end
32
35
 
33
36
  def parse_file(file_path)
34
37
  if File.exists? file_path
35
38
  html = open(file_path).read
36
39
  else
37
- html = Client.get(URI.escape(file_path)).body
40
+ html = Client.get_serp(file_path)
38
41
  end
39
42
  parse html
40
43
  end
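
Reviewer note on the `search` hunk above: page 1 adds no offset, and page N appends `pn` equal to (N - 1) * 10, so page 3 yields `&pn=20`. The hunk relies on `URI.escape`, which was removed in Ruby 3.0, and the surrounding context uses `File.exists?`, which was removed in Ruby 3.2, so the code as shown would likely need changes on current Rubies. A minimal sketch of the same URL construction with `URI.encode_www_form` follows; `serp_url` is a hypothetical name, not part of the gem.

```ruby
require "uri"

# Hypothetical helper mirroring the diff's logic: no offset for page 1,
# pn = (page - 1) * 10 for later pages, and ie=utf-8 always appended.
def serp_url(keyword, page = 1)
  params = { "wd" => keyword }
  params["pn"] = (page.to_i - 1) * 10 if page.to_i > 1
  params["ie"] = "utf-8"
  # encode_www_form percent-encodes values and turns spaces into "+",
  # which covers the manual gsub(" ", "+") in the diff.
  "http://www.baidu.com/s?" + URI.encode_www_form(params)
end

first_page = serp_url("hong kong")
third_page = serp_url("hong kong", 3)
```

Here `first_page` carries no `pn` parameter, while `third_page` carries `pn=20`.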
@@ -0,0 +1,402 @@
1
+ ---
2
+ - 'Mozilla/5.0 (Windows; U; Windows NT 6.0; en-US) AppleWebKit/527 (KHTML, like Gecko,
3
+ Safari/419.3) Arora/0.6 (Change: )'
4
+ - Mozilla/5.0 (Windows; U; ; en-NZ) AppleWebKit/527 (KHTML, like Gecko, Safari/419.3)
5
+ Arora/0.8.0
6
+ - Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.1; Avant Browser; Avant Browser;
7
+ .NET CLR 1.0.3705; .NET CLR 1.1.4322; Media Center PC 4.0; .NET CLR 2.0.50727; .NET
8
+ CLR 3.0.04506.30)
9
+ - Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/535.8 (KHTML, like Gecko) Beamrise/17.2.0.9
10
+ Chrome/17.0.939.0 Safari/535.8
11
+ - Mozilla/5.0 (Windows NT 6.1) AppleWebKit/535.2 (KHTML, like Gecko) Chrome/18.6.872.0
12
+ Safari/535.2 UNTRUSTED/1.0 3gpp-gba UNTRUSTED/1.0
13
+ - Mozilla/5.0 (Windows NT 6.2) AppleWebKit/536.3 (KHTML, like Gecko) Chrome/19.0.1061.1
14
+ Safari/536.3
15
+ - Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/536.6 (KHTML, like Gecko) Chrome/20.0.1092.0
16
+ Safari/536.6
17
+ - Mozilla/5.0 (Windows NT 6.2) AppleWebKit/536.6 (KHTML, like Gecko) Chrome/20.0.1090.0
18
+ Safari/536.6
19
+ - Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.1 (KHTML, like Gecko) Chrome/22.0.1207.1
20
+ Safari/537.1
21
+ - Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML like Gecko) Chrome/28.0.1469.0
22
+ Safari/537.36
23
+ - Mozilla/5.0 (Windows NT 6.2; WOW64) AppleWebKit/537.36 (KHTML like Gecko) Chrome/28.0.1469.0
24
+ Safari/537.36
25
+ - Mozilla/5.0 (Windows NT 6.1; rv:12.0) Gecko/20120403211507 Firefox/12.0
26
+ - Mozilla/5.0 (Windows NT 6.0; rv:14.0) Gecko/20100101 Firefox/14.0.1
27
+ - Mozilla/5.0 (Windows NT 6.1; WOW64; rv:15.0) Gecko/20120427 Firefox/15.0a1
28
+ - Mozilla/5.0 (Windows NT 6.2; Win64; x64; rv:16.0) Gecko/16.0 Firefox/16.0
29
+ - Mozilla/5.0 (Windows NT 6.2; rv:19.0) Gecko/20121129 Firefox/19.0
30
+ - Mozilla/5.0 (Windows NT 6.2; rv:20.0) Gecko/20121202 Firefox/20.0
31
+ - Mozilla/5.0 (Windows NT 6.1; rv:21.0) Gecko/20130401 Firefox/21.0
32
+ - Mozilla/5.0 (compatible; Konqueror/4.5; Windows) KHTML/4.5.4 (like Gecko)
33
+ - Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.1; Trident/4.0; SLCC2; .NET CLR
34
+ 2.0.50727; .NET CLR 3.5.30729; .NET CLR 3.0.30729; Media Center PC 6.0; Maxthon
35
+ 2.0)
36
+ - Mozilla/5.0 (Windows; U; Windows NT 6.0; en-US) AppleWebKit/533.1 (KHTML, like Gecko)
37
+ Maxthon/3.0.8.2 Safari/533.1
38
+ - Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.1 (KHTML like Gecko) Maxthon/4.0.0.2000
39
+ Chrome/22.0.1229.79 Safari/537.1
40
+ - Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)
41
+ - Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.0)
42
+ - Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 2.0.50727;
43
+ .NET CLR 3.0.04506.648; .NET CLR 3.5.21022; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729)
44
+ - Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0)
45
+ - Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.0; Trident/4.0)
46
+ - Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.0; Trident/4.0)
47
+ - Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.0; Trident/5.0)
48
+ - Mozilla/5.0 (compatible; MSIE 9.0; Windows NT 6.1; Trident/5.0)
49
+ - Mozilla/5.0 (compatible; MSIE 9.0; Windows NT 6.2; Trident/5.0)
50
+ - Mozilla/5.0 (compatible; MSIE 9.0; Windows NT 6.2; WOW64; Trident/5.0)
51
+ - Mozilla/5.0 (compatible; MSIE 9.0; Windows NT 6.1; WOW64; Trident/5.0; SLCC2; Media
52
+ Center PC 6.0; InfoPath.3; MS-RTC LM 8; Zune 4.7)
53
+ - Mozilla/5.0 (compatible; MSIE 10.0; Windows NT 6.1; WOW64; Trident/6.0)
54
+ - Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.1; Trident/6.0)
55
+ - Mozilla/5.0 (compatible; MSIE 10.6; Windows NT 6.1; Trident/5.0; InfoPath.2; SLCC1;
56
+ .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; .NET CLR 2.0.50727) 3gpp-gba UNTRUSTED/1.0
57
+ - Opera/9.25 (Windows NT 6.0; U; en)
58
+ - Opera/9.80 (Windows NT 5.2; U; en) Presto/2.2.15 Version/10.10
59
+ - Opera/9.80 (Windows NT 5.1; U; ru) Presto/2.7.39 Version/11.00
60
+ - Opera/9.80 (Windows NT 6.1; U; en) Presto/2.7.62 Version/11.01
61
+ - Opera/9.80 (Windows NT 5.1; U; zh-tw) Presto/2.8.131 Version/11.10
62
+ - Opera/9.80 (Windows NT 6.1; U; es-ES) Presto/2.9.181 Version/12.00
63
+ - Opera/9.80 (Windows NT 6.0) Presto/2.12.388 Version/12.14
64
+ - Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US) AppleWebKit/531.21.8 (KHTML, like
65
+ Gecko) Version/4.0.4 Safari/531.21.10
66
+ - Mozilla/5.0 (Windows; U; Windows NT 5.2; en-US) AppleWebKit/533.17.8 (KHTML, like
67
+ Gecko) Version/5.0.1 Safari/533.17.8
68
+ - Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US) AppleWebKit/533.19.4 (KHTML, like
69
+ Gecko) Version/5.0.2 Safari/533.18.5
70
+ - Mozilla/5.0 (Windows; U; Windows NT 6.2; es-US ) AppleWebKit/540.0 (KHTML like Gecko)
71
+ Version/6.0 Safari/8900.00
72
+ - Mozilla/5.0 (Windows; U; Windows NT 6.1; en-GB; rv:1.9.1.17) Gecko/20110123 (like
73
+ Firefox/3.x) SeaMonkey/2.0.12
74
+ - Mozilla/5.0 (Windows NT 5.2; rv:10.0.1) Gecko/20100101 Firefox/10.0.1 SeaMonkey/2.7.1
75
+ - Mozilla/5.0 (Windows NT 6.1; WOW64; rv:12.0) Gecko/20120422 Firefox/12.0 SeaMonkey/2.9
76
+ - Avant Browser/1.2.789rel1 (http://www.avantbrowser.com)
77
+ - Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US) AppleWebKit/532.5 (KHTML, like Gecko)
78
+ Chrome/4.0.249.0 Safari/532.5
79
+ - Mozilla/5.0 (Windows; U; Windows NT 5.2; en-US) AppleWebKit/532.9 (KHTML, like Gecko)
80
+ Chrome/5.0.310.0 Safari/532.9
81
+ - Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US) AppleWebKit/534.7 (KHTML, like Gecko)
82
+ Chrome/7.0.514.0 Safari/534.7
83
+ - Mozilla/5.0 (Windows; U; Windows NT 6.0; en-US) AppleWebKit/534.14 (KHTML, like
84
+ Gecko) Chrome/9.0.601.0 Safari/534.14
85
+ - Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US) AppleWebKit/534.14 (KHTML, like
86
+ Gecko) Chrome/10.0.601.0 Safari/534.14
87
+ - Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US) AppleWebKit/534.20 (KHTML, like
88
+ Gecko) Chrome/11.0.672.2 Safari/534.20
89
+ - Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/534.27 (KHTML, like Gecko) Chrome/12.0.712.0
90
+ Safari/534.27
91
+ - Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/535.1 (KHTML, like Gecko) Chrome/13.0.782.24
92
+ Safari/535.1
93
+ - Mozilla/5.0 (Windows NT 6.0) AppleWebKit/535.2 (KHTML, like Gecko) Chrome/15.0.874.120
94
+ Safari/535.2
95
+ - Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/535.7 (KHTML, like Gecko) Chrome/16.0.912.36
96
+ Safari/535.7
97
+ - Mozilla/5.0 (Windows; U; Windows NT 6.0 x64; en-US; rv:1.9pre) Gecko/2008072421
98
+ Minefield/3.0.2pre
99
+ - Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.0.10) Gecko/2009042316 Firefox/3.0.10
100
+ - Mozilla/5.0 (Windows; U; Windows NT 6.0; en-GB; rv:1.9.0.11) Gecko/2009060215 Firefox/3.0.11
101
+ (.NET CLR 3.5.30729)
102
+ - Mozilla/5.0 (Windows; U; Windows NT 6.0; en-US; rv:1.9.1.6) Gecko/20091201 Firefox/3.5.6
103
+ GTB5
104
+ - Mozilla/5.0 (Windows; U; Windows NT 5.1; tr; rv:1.9.2.8) Gecko/20100722 Firefox/3.6.8
105
+ ( .NET CLR 3.5.30729; .NET4.0E)
106
+ - Mozilla/5.0 (Windows NT 6.1; rv:2.0.1) Gecko/20100101 Firefox/4.0.1
107
+ - Mozilla/5.0 (Windows NT 6.1; Win64; x64; rv:2.0.1) Gecko/20100101 Firefox/4.0.1
108
+ - Mozilla/5.0 (Windows NT 5.1; rv:5.0) Gecko/20100101 Firefox/5.0
109
+ - Mozilla/5.0 (Windows NT 6.1; WOW64; rv:6.0a2) Gecko/20110622 Firefox/6.0a2
110
+ - Mozilla/5.0 (Windows NT 6.1; WOW64; rv:7.0.1) Gecko/20100101 Firefox/7.0.1
111
+ - Mozilla/5.0 (Windows NT 6.1; WOW64; rv:10.0.1) Gecko/20100101 Firefox/10.0.1
112
+ - Mozilla/5.0 (Windows NT 6.1; WOW64; rv:2.0b4pre) Gecko/20100815 Minefield/4.0b4pre
113
+ - Mozilla/4.0 (compatible; MSIE 5.5; Windows NT 5.0 )
114
+ - Mozilla/4.0 (compatible; MSIE 5.5; Windows 98; Win 9x 4.90)
115
+ - Mozilla/5.0 (Windows; U; Windows XP) Gecko MultiZilla/1.6.1.0a
116
+ - Mozilla/2.02E (Win95; U)
117
+ - Mozilla/3.01Gold (Win95; I)
118
+ - Mozilla/4.8 [en] (Windows NT 5.1; U)
119
+ - Mozilla/5.0 (Windows; U; Win98; en-US; rv:1.4) Gecko Netscape/7.1 (ax)
120
+ - Opera/7.50 (Windows XP; U)
121
+ - Opera/7.50 (Windows ME; U) [en]
122
+ - Opera/7.51 (Windows NT 5.1; U) [en]
123
+ - Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0; en) Opera 8.0
124
+ - Mozilla/5.0 (Windows; U; WinNT4.0; en-US; rv:1.2b) Gecko/20021001 Phoenix/0.2
125
+ - Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8.1.23) Gecko/20090825 SeaMonkey/1.1.18
126
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10.6; rv:2.0.1) Gecko/20100101 Firefox/4.0.1
127
+ Camino/2.2.1
128
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10.6; rv:2.0b6pre) Gecko/20100907 Firefox/4.0b6pre
129
+ Camino/2.2a1pre
130
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10_8_0) AppleWebKit/536.3 (KHTML, like Gecko)
131
+ Chrome/19.0.1063.0 Safari/536.3
132
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10_8_2) AppleWebKit/537.4 (KHTML like Gecko)
133
+ Chrome/22.0.1229.79 Safari/537.4
134
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10_8_4) AppleWebKit/537.31 (KHTML like Gecko)
135
+ Chrome/26.0.1410.63 Safari/537.31
136
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 1083) AppleWebKit/537.36 (KHTML like Gecko)
137
+ Chrome/28.0.1469.0 Safari/537.36
138
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10_7_2; rv:10.0.1) Gecko/20100101 Firefox/10.0.1
139
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10.8; rv:16.0) Gecko/20120813 Firefox/16.0
140
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10.7; rv:20.0) Gecko/20100101 Firefox/20.0
141
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10.8; rv:21.0) Gecko/20100101 Firefox/21.0
142
+ - iTunes/4.2 (Macintosh; U; PPC Mac OS X 10.2)
143
+ - iTunes/9.0.3 (Macintosh; U; Intel Mac OS X 10_6_2; en-ca)
144
+ - Mozilla/5.0 (Macintosh; U; Intel Mac OS X; en-US) AppleWebKit/528.16 (KHTML, like
145
+ Gecko, Safari/528.16) OmniWeb/v622.8.0.112941
146
+ - Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10_5_6; en-US) AppleWebKit/528.16 (KHTML,
147
+ like Gecko, Safari/528.16) OmniWeb/v622.8.0
148
+ - Opera/9.20 (Macintosh; Intel Mac OS X; U; en)
149
+ - Opera/9.64 (Macintosh; PPC Mac OS X; U; en) Presto/2.1.1
150
+ - Opera/9.80 (Macintosh; Intel Mac OS X; U; en) Presto/2.6.30 Version/10.61
151
+ - Opera/9.80 (Macintosh; Intel Mac OS X 10.4.11; U; en) Presto/2.7.62 Version/11.00
152
+ - Opera/9.80 (Macintosh; Intel Mac OS X 10.6.8; U; fr) Presto/2.9.168 Version/11.52
153
+ - Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10_6_2; en-us) AppleWebKit/531.21.8 (KHTML,
154
+ like Gecko) Version/4.0.4 Safari/531.21.10
155
+ - Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10_6_5; de-de) AppleWebKit/534.15 (KHTML,
156
+ like Gecko) Version/5.0.3 Safari/533.19.4
157
+ - Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10_6_6; en-us) AppleWebKit/533.20.25 (KHTML,
158
+ like Gecko) Version/5.0.4 Safari/533.20.27
159
+ - Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10_7; en-us) AppleWebKit/534.20.8 (KHTML,
160
+ like Gecko) Version/5.1 Safari/534.20.8
161
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10_7_3) AppleWebKit/534.55.3 (KHTML, like
162
+ Gecko) Version/5.1.3 Safari/534.53.10
163
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10_7_5) AppleWebKit/536.26.17 (KHTML like
164
+ Gecko) Version/6.0.2 Safari/536.26.17
165
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10.5; rv:10.0.1) Gecko/20100101 Firefox/10.0.1
166
+ SeaMonkey/2.7.1
167
+ - Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10_5_8; en-US) AppleWebKit/532.8 (KHTML,
168
+ like Gecko) Chrome/4.0.302.2 Safari/532.8
169
+ - Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10_6_4; en-US) AppleWebKit/534.3 (KHTML,
170
+ like Gecko) Chrome/6.0.464.0 Safari/534.3
171
+ - Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10_6_5; en-US) AppleWebKit/534.13 (KHTML,
172
+ like Gecko) Chrome/9.0.597.15 Safari/534.13
173
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10_7_2) AppleWebKit/535.1 (KHTML, like Gecko)
174
+ Chrome/14.0.835.186 Safari/535.1
175
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10_6_8) AppleWebKit/535.2 (KHTML, like Gecko)
176
+ Chrome/15.0.874.54 Safari/535.2
177
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10_6_8) AppleWebKit/535.7 (KHTML, like Gecko)
178
+ Chrome/16.0.912.36 Safari/535.7
179
+ - 'Mozilla/5.0 (Macintosh; U; Mac OS X Mach-O; en-US; rv:2.0a) Gecko/20040614 Firefox/3.0.0 '
180
+ - Mozilla/5.0 (Macintosh; U; PPC Mac OS X 10.5; en-US; rv:1.9.0.3) Gecko/2008092414
181
+ Firefox/3.0.3
182
+ - Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.5; en-US; rv:1.9.1) Gecko/20090624
183
+ Firefox/3.5
184
+ - Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.6; en-US; rv:1.9.2.14) Gecko/20110218
185
+ AlexaToolbar/alxf-2.0 Firefox/3.6.14
186
+ - Mozilla/5.0 (Macintosh; U; PPC Mac OS X 10.5; en-US; rv:1.9.2.15) Gecko/20110303
187
+ Firefox/3.6.15
188
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10.6; rv:2.0.1) Gecko/20100101 Firefox/4.0.1
189
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10.6; rv:5.0) Gecko/20100101 Firefox/5.0
190
+ - Mozilla/5.0 (Macintosh; Intel Mac OS X 10.6; rv:9.0) Gecko/20100101 Firefox/9.0
191
+ - Mozilla/4.0 (compatible; MSIE 5.15; Mac_PowerPC)
192
+ - Mozilla/5.0 (Macintosh; U; PPC Mac OS X; en-US) AppleWebKit/125.4 (KHTML, like Gecko,
193
+ Safari) OmniWeb/v563.15
194
+ - Opera/9.0 (Macintosh; PPC Mac OS X; U; en)
195
+ - Mozilla/5.0 (Macintosh; U; PPC Mac OS X; en) AppleWebKit/125.2 (KHTML, like Gecko)
196
+ Safari/85.8
197
+ - Mozilla/5.0 (Macintosh; U; PPC Mac OS X; en) AppleWebKit/125.2 (KHTML, like Gecko)
198
+ Safari/125.8
199
+ - Mozilla/5.0 (Macintosh; U; PPC Mac OS X; fr-fr) AppleWebKit/312.5 (KHTML, like Gecko)
200
+ Safari/312.3
201
+ - Mozilla/5.0 (Macintosh; U; PPC Mac OS X; en) AppleWebKit/418.8 (KHTML, like Gecko)
202
+ Safari/419.3
203
+ - Mozilla/5.0 (X11; U; Linux; en-US) AppleWebKit/527 (KHTML, like Gecko, Safari/419.3)
204
+ Arora/0.10.1
205
+ - Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/536.5 (KHTML, like Gecko) Chrome/19.0.1084.9
206
+ Safari/536.5
207
+ - Mozilla/5.0 (X11; CrOS i686 2268.111.0) AppleWebKit/536.11 (KHTML, like Gecko) Chrome/20.0.1132.57
208
+ Safari/536.11
209
+ - Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.4 (KHTML like Gecko) Chrome/22.0.1229.56
210
+ Safari/537.4
211
+ - Mozilla/5.0 (X11; Linux i686) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/28.0.1478.0
212
+ Safari/537.36
213
+ - Mozilla/5.0 (X11; Linux i686) AppleWebKit/537.22 (KHTML like Gecko) Ubuntu Chromium/25.0.1364.160
214
+ Chrome/25.0.1364.160 Safari/537.22
215
+ - Mozilla/4.0 (compatible; Dillo 3.0)
216
+ - Mozilla/5.0 (X11; U; Linux i686; en-us) AppleWebKit/528.5 (KHTML, like Gecko, Safari/528.5
217
+ ) lt-GtkLauncher
218
+ - Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1.16) Gecko/20120421 Gecko Firefox/11.0
219
+ - 'Mozilla/5.0 (X11; Linux i686; rv:12.0) Gecko/20100101 Firefox/12.0 '
220
+ - Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:14.0) Gecko/20100101 Firefox/14.0.1
221
+ - Mozilla/5.0 (X11; Linux i686; rv:16.0) Gecko/20100101 Firefox/16.0
222
+ - Mozilla/5.0 (X11; U; Linux i686; rv:19.0) Gecko/20100101 Slackware/13 Firefox/19.0
223
+ - Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:20.0) Gecko/20100101 Firefox/20.0
224
+ - Mozilla/5.0 (X11; Linux i686; rv:20.0) Gecko/20100101 Firefox/20.0
225
+ - Mozilla/5.0 (X11; Linux i686; rv:21.0) Gecko/20100101 Firefox/21.0
226
+ - Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.0.8) Gecko Galeon/2.0.6 (Ubuntu 2.0.6-2)
227
+ - Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1.16) Gecko/20080716 (Gentoo) Galeon/2.0.6
228
+ - Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.13) Gecko/20100916 Iceape/2.0.8
229
+ - Mozilla/5.0 (X11; Linux i686; rv:14.0) Gecko/20100101 Firefox/14.0.1 Iceweasel/14.0.1
230
+ - Mozilla/5.0 (X11; Linux x86_64; rv:15.0) Gecko/20120724 Debian Iceweasel/15.02
231
+ - Mozilla/5.0 (X11; Linux x86_64; rv:19.0) Gecko/20100101 Firefox/19.0 Iceweasel/19.0.2
232
+ - Mozilla/5.0 (compatible; Konqueror/4.2; Linux) KHTML/4.2.4 (like Gecko) Slackware/13.0
233
+ - Mozilla/5.0 (compatible; Konqueror/4.3; Linux) KHTML/4.3.1 (like Gecko) Fedora/4.3.1-3.fc11
234
+ - Mozilla/5.0 (compatible; Konqueror/4.4; Linux) KHTML/4.4.1 (like Gecko) Fedora/4.4.1-1.fc12
235
+ - Mozilla/5.0 (compatible; Konqueror/4.4; Linux 2.6.32-22-generic; X11; en_US) KHTML/4.4.3
236
+ (like Gecko) Kubuntu
237
+ - Mozilla/5.0 (compatible; Konqueror/4.4; Linux 2.6.32-22-generic; X11; en_US) KHTML/4.4.3
238
+ (like Gecko) Kubuntu
239
+ - Mozilla/5.0 (X11; Linux 3.8-6.dmz.1-liquorix-686) KHTML/4.8.4 (like Gecko) Konqueror/4.8
240
+ - 'Midori/0.1.10 (X11; Linux i686; U; en-us) WebKit/(531).(2) '
241
+ - Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.0.3) Gecko/2008092814 (Debian-3.0.1-1)
242
+ - Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9a3pre) Gecko/20070330
243
+ - Opera/9.64 (X11; Linux i686; U; Linux Mint; nb) Presto/2.1.1
244
+ - Opera/9.80 (X11; Linux i686; U; en) Presto/2.2.15 Version/10.10
245
+ - Opera/9.80 (X11; Linux x86_64; U; pl) Presto/2.7.62 Version/11.00
246
+ - Mozilla/5.0 (X11; Linux i686) AppleWebKit/534.34 (KHTML, like Gecko) QupZilla/1.2.0
247
+ Safari/534.34
248
+ - Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.17) Gecko/20110123 SeaMonkey/2.0.12
249
+ - Mozilla/5.0 (X11; Linux i686; rv:10.0.1) Gecko/20100101 Firefox/10.0.1 SeaMonkey/2.7.1
250
+ - Mozilla/5.0 (X11; Linux i686; rv:12.0) Gecko/20120502 Firefox/12.0 SeaMonkey/2.9.1
251
+ - Mozilla/5.0 (X11; U; Linux x86_64; us; rv:1.9.1.19) Gecko/20110430 shadowfox/7.0
252
+ (like Firefox/7.0
253
+ - Mozilla/5.0 (X11; U; Linux i686; it; rv:1.9.2.3) Gecko/20100406 Firefox/3.6.3 (Swiftfox)
254
+ - Mozilla/5.0 (X11; U; Linux i686; en-US) AppleWebKit/532.4 (KHTML, like Gecko) Chrome/4.0.237.0
255
+ Safari/532.4 Debian
256
+ - Mozilla/5.0 (X11; U; Linux i686; en-US) AppleWebKit/532.8 (KHTML, like Gecko) Chrome/4.0.277.0
257
+ Safari/532.8
258
+ - Mozilla/5.0 (X11; U; Linux x86_64; en-US) AppleWebKit/532.9 (KHTML, like Gecko)
259
+ Chrome/5.0.309.0 Safari/532.9
260
+ - Mozilla/5.0 (X11; U; Linux x86_64; en-US) AppleWebKit/534.7 (KHTML, like Gecko)
261
+ Chrome/7.0.514.0 Safari/534.7
262
+ - Mozilla/5.0 (X11; U; Linux x86_64; en-US) AppleWebKit/540.0 (KHTML, like Gecko)
263
+ Ubuntu/10.10 Chrome/9.1.0.0 Safari/540.0
264
+ - Mozilla/5.0 (X11; U; Linux x86_64; en-US) AppleWebKit/534.15 (KHTML, like Gecko)
265
+ Chrome/10.0.613.0 Safari/534.15
266
+ - Mozilla/5.0 (X11; U; Linux i686; en-US) AppleWebKit/534.15 (KHTML, like Gecko) Ubuntu/10.10
267
+ Chromium/10.0.613.0 Chrome/10.0.613.0 Safari/534.15
268
+ - Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/534.24 (KHTML, like Gecko) Ubuntu/10.10
269
+ Chromium/12.0.703.0 Chrome/12.0.703.0 Safari/534.24
270
+ - Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/535.1 (KHTML, like Gecko) Chrome/13.0.782.20
271
+ Safari/535.1
272
+ - Mozilla/5.0 Slackware/13.37 (X11; U; Linux x86_64; en-US) AppleWebKit/535.1 (KHTML,
273
+ like Gecko) Chrome/13.0.782.41
274
+ - Mozilla/5.0 (X11; Linux i686) AppleWebKit/535.1 (KHTML, like Gecko) Ubuntu/11.04
275
+ Chromium/14.0.825.0 Chrome/14.0.825.0 Safari/535.1
276
+ - Mozilla/5.0 (X11; Linux i686) AppleWebKit/535.2 (KHTML, like Gecko) Ubuntu/11.10
277
+ Chromium/15.0.874.120 Chrome/15.0.874.120 Safari/535.2
278
+ - Mozilla/5.0 (X11; U; Linux; i686; en-US; rv:1.6) Gecko Epiphany/1.2.5
279
+ - Mozilla/5.0 (X11; U; Linux i586; en-US; rv:1.7.3) Gecko/20040924 Epiphany/1.4.4
280
+ (Ubuntu)
281
+ - Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.6) Gecko/20040614 Firefox/0.8
282
+ - Mozilla/5.0 (X11; U; Linux x86_64; sv-SE; rv:1.8.1.12) Gecko/20080207 Ubuntu/7.10
283
+ (gutsy) Firefox/2.0.0.12
284
+ - Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.0.11) Gecko/2009060309 Ubuntu/9.10
285
+ (karmic) Firefox/3.0.11
286
+ - Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1.2) Gecko/20090803 Ubuntu/9.04 (jaunty)
287
+ Shiretoko/3.5.2
288
+ - Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.5) Gecko/20091107 Firefox/3.5.5
289
+ - Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.3) Gecko/20091020 Linux Mint/8
290
+ (Helena) Firefox/3.5.3
291
+ - Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.9) Gecko/20100915 Gentoo Firefox/3.6.9
292
+ - Mozilla/5.0 (X11; U; Linux i686; pl-PL; rv:1.9.0.2) Gecko/20121223 Ubuntu/9.25 (jaunty)
293
+ Firefox/3.8
294
+ - Mozilla/5.0 (X11; Linux i686; rv:2.0b6pre) Gecko/20100907 Firefox/4.0b6pre
295
+ - Mozilla/5.0 (X11; Linux i686 on x86_64; rv:2.0.1) Gecko/20100101 Firefox/4.0.1
296
+ - Mozilla/5.0 (X11; Linux i686; rv:2.0.1) Gecko/20100101 Firefox/4.0.1
297
+ - Mozilla/5.0 (X11; Linux x86_64; rv:2.0.1) Gecko/20100101 Firefox/4.0.1
298
+ - Mozilla/5.0 (X11; Linux x86_64; rv:2.2a1pre) Gecko/20100101 Firefox/4.2a1pre
299
+ - Mozilla/5.0 (X11; Linux i686; rv:5.0) Gecko/20100101 Firefox/5.0
300
+ - Mozilla/5.0 (X11; Linux i686; rv:6.0) Gecko/20100101 Firefox/6.0
301
+ - Mozilla/5.0 (X11; Linux x86_64; rv:7.0a1) Gecko/20110623 Firefox/7.0a1
302
+ - Mozilla/5.0 (X11; Linux i686; rv:8.0) Gecko/20100101 Firefox/8.0
303
+ - Mozilla/5.0 (X11; Linux x86_64; rv:10.0.1) Gecko/20100101 Firefox/10.0.1
304
+ - Mozilla/5.0 (X11; U; Linux; i686; en-US; rv:1.6) Gecko Galeon/1.3.14
305
+ - Mozilla/5.0 (X11; U; Linux ppc; en-US; rv:1.8.1.13) Gecko/20080313 Iceape/1.1.9
306
+ (Debian-1.1.9-5)
307
+ - Mozilla/5.0 (X11; U; Linux i686; pt-PT; rv:1.9.2.3) Gecko/20100402 Iceweasel/3.6.3
308
+ (like Firefox/3.6.3) GTB7.0
309
+ - Mozilla/5.0 (X11; Linux x86_64; rv:5.0) Gecko/20100101 Firefox/5.0 Iceweasel/5.0
310
+ - Mozilla/5.0 (X11; Linux i686; rv:6.0a2) Gecko/20110615 Firefox/6.0a2 Iceweasel/6.0a2
311
+ - Konqueror/3.0-rc4; (Konqueror/3.0-rc4; i686 Linux;;datecode)
312
+ - Mozilla/5.0 (compatible; Konqueror/3.3; Linux 2.6.8-gentoo-r3; X11;
313
+ - Mozilla/5.0 (compatible; Konqueror/3.5; Linux 2.6.30-7.dmz.1-liquorix-686; X11)
314
+ KHTML/3.5.10 (like Gecko) (Debian package 4:3.5.10.dfsg.1-1 b1)
315
+ - Mozilla/5.0 (compatible; Konqueror/3.5; Linux; en_US) KHTML/3.5.6 (like Gecko) (Kubuntu)
316
+ - Mozilla/5.0 (X11; Linux x86_64; en-US; rv:2.0b2pre) Gecko/20100712 Minefield/4.0b2pre
317
+ - Mozilla/5.0 (X11; U; Linux; i686; en-US; rv:1.6) Gecko Debian/1.6-7
318
+ - MSIE (MSIE 6.0; X11; Linux; i686) Opera 7.23
319
+ - Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1) Gecko/20061024 Firefox/2.0 (Swiftfox)
320
+ - Mozilla/5.0 (Unknown; U; UNIX BSD/SYSV system; C -) AppleWebKit/527 (KHTML, like
321
+ Gecko, Safari/419.3) Arora/0.10.2
322
+ - Mozilla/5.0 (X11; FreeBSD amd64) AppleWebKit/536.5 (KHTML like Gecko) Chrome/19.0.1084.56
323
+ Safari/536.5
324
+ - Mozilla/5.0 (X11; FreeBSD amd64) AppleWebKit/537.4 (KHTML like Gecko) Chrome/22.0.1229.79
325
+ Safari/537.4
326
+ - Mozilla/5.0 (X11; U; OpenBSD arm; en-us) AppleWebKit/531.2 (KHTML, like Gecko)
327
+ Safari/531.2 Epiphany/2.30.0
328
+ - Mozilla/5.0 (X11; U; FreeBSD amd64; en-us) AppleWebKit/531.2 (KHTML, like Gecko)
329
+ Safari/531.2 Epiphany/2.30.0
330
+ - Mozilla/5.0 (X11; U; SunOS i86pc; en-US; rv:1.9.1b3) Gecko/20090429 Firefox/3.1b3
331
+ - Mozilla/5.0 (X11; U; OpenBSD i386; en-US; rv:1.9.1) Gecko/20090702 Firefox/3.5
332
+ - Mozilla/5.0 (X11; U; FreeBSD i386; de-CH; rv:1.9.2.8) Gecko/20100729 Firefox/3.6.8
333
+ - Mozilla/5.0 (X11; FreeBSD amd64; rv:5.0) Gecko/20100101 Firefox/5.0
334
+ - Mozilla/5.0 (compatible; Konqueror/4.1; DragonFly) KHTML/4.1.4 (like Gecko)
335
+ - Mozilla/5.0 (compatible; Konqueror/4.1; OpenBSD) KHTML/4.1.4 (like Gecko)
336
+ - Mozilla/5.0 (compatible; Konqueror/4.5; NetBSD 5.0.2; X11; amd64; en_US) KHTML/4.5.4
337
+ (like Gecko)
338
+ - Mozilla/5.0 (compatible; Konqueror/4.5; FreeBSD) KHTML/4.5.4 (like Gecko)
339
+ - Mozilla/5.0 (X11; U; NetBSD amd64; en-US; rv:1.9.2.15) Gecko/20110308 Namoroka/3.6.15
340
+ - NetSurf/1.2 (NetBSD; amd64)
341
+ - Opera/9.80 (X11; FreeBSD 8.1-RELEASE i386; Edition Next) Presto/2.12.388 Version/12.10
342
+ - Mozilla/5.0 (X11; U; SunOS i86pc; en-US; rv:1.8.1.12) Gecko/20080303 SeaMonkey/1.1.8
343
+ - Mozilla/5.0 (X11; U; FreeBSD i386; en-US) AppleWebKit/532.0 (KHTML, like Gecko)
344
+ Chrome/4.0.207.0 Safari/532.0
345
+ - Mozilla/5.0 (X11; U; OpenBSD i386; en-US) AppleWebKit/533.3 (KHTML, like Gecko)
346
+ Chrome/5.0.359.0 Safari/533.3
347
+ - Mozilla/5.0 (X11; U; FreeBSD x86_64; en-US) AppleWebKit/534.16 (KHTML, like Gecko)
348
+ Chrome/10.0.648.204 Safari/534.16
349
+ - Mozilla/5.0 (X11; U; SunOS sun4m; en-US; rv:1.4b) Gecko/20030517 Mozilla Firebird/0.6
350
+ - Mozilla/5.0 (X11; U; FreeBSD i386; en-US; rv:1.6) Gecko/20040406 Galeon/1.3.15
351
+ - Mozilla/5.0 (compatible; Konqueror/3.5; NetBSD 4.0_RC3; X11) KHTML/3.5.7 (like Gecko)
352
+ - Mozilla/5.0 (compatible; Konqueror/3.5; SunOS) KHTML/3.5.1 (like Gecko)
353
+ - Mozilla/5.0 (X11; U; FreeBSD; i386; en-US; rv:1.7) Gecko
354
+ - Mozilla/4.77 [en] (X11; I; IRIX;64 6.5 IP30)
355
+ - Mozilla/4.8 [en] (X11; U; SunOS; 5.7 sun4u)
356
+ - Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; BOLT/2.800) AppleWebKit/534.6 (KHTML,
357
+ like Gecko) Version/5.0 Safari/534.6.3
358
+ - Mozilla/4.0 (compatible; MSIE 6.0; Windows CE; IEMobile 6.12; Microsoft ZuneHD 4.3)
359
+ - 'Mozilla/4.0 (compatible; MSIE 6.0; Windows CE; IEMobile 7.11) '
360
+ - Mozilla/4.0 (compatible; MSIE 7.0; Windows Phone OS 7.0; Trident/3.1; IEMobile/7.0)
361
+ Asus;Galaxy6
362
+ - Mozilla/5.0 (compatible; MSIE 9.0; Windows Phone OS 7.5; Trident/5.0; IEMobile/9.0)
363
+ - Mozilla/5.0 (compatible; MSIE 9.0; Windows Phone OS 7.5; Trident/5.0; IEMobile/9.0)
364
+ - 'Mozilla/5.0 (compatible; MSIE 10.0; Windows Phone 8.0; Trident/6.0; IEMobile/10.0;
365
+ ARM; Touch) '
366
+ - Mozilla/1.22 (compatible; MSIE 5.01; PalmOS 3.0) EudoraWeb 2.1
367
+ - Mozilla/5.0 (WindowsCE 6.0; rv:2.0.1) Gecko/20100101 Firefox/4.0.1
368
+ - Mozilla/5.0 (X11; U; Linux armv61; en-US; rv:1.9.1b2pre) Gecko/20081015 Fennec/1.0a1
369
+ - Mozilla/5.0 (Maemo; Linux armv7l; rv:2.0.1) Gecko/20100101 Firefox/4.0.1 Fennec/2.0.1
370
+ - Mozilla/5.0 (Maemo; Linux armv7l; rv:10.0.1) Gecko/20100101 Firefox/10.0.1 Fennec/10.0.1
371
+ - Mozilla/5.0 (Windows; U; Windows CE 5.1; rv:1.8.1a3) Gecko/20060610 Minimo/0.016
372
+ - Mozilla/5.0 (X11; U; Linux armv6l; rv 1.8.1.5pre) Gecko/20070619 Minimo/0.020
373
+ - Mozilla/5.0 (X11; U; Linux arm7tdmi; rv:1.8.1.11) Gecko/20071130 Minimo/0.025
374
+ - Mozilla/4.0 (PDA; PalmOS/sony/model prmr/Revision:1.1.54 (en)) NetFront/3.0
375
+ - Opera/9.51 Beta (Microsoft Windows; PPC; Opera Mobi/1718; U; en)
376
+ - Opera/9.60 (J2ME/MIDP; Opera Mini/4.1.11320/608; U; en) Presto/2.2.0
377
+ - Opera/9.60 (J2ME/MIDP; Opera Mini/4.2.14320/554; U; cs) Presto/2.2.0
378
+ - Opera/9.80 (S60; SymbOS; Opera Mobi/499; U; ru) Presto/2.4.18 Version/10.00
379
+ - Opera/10.61 (J2ME/MIDP; Opera Mini/5.1.21219/19.999; en-US; rv:1.9.3a5) WebKit/534.5
380
+ Presto/2.6.30
381
+ - POLARIS/6.01 (BREW 3.1.5; U; en-us; LG; LX265; POLARIS/6.01/WAP) MMP/2.0 profile/MIDP-2.1
382
+ Configuration/CLDC-1.1
383
+ - Mozilla/5.0 (Linux; U; Android 2.0; en-us; Droid Build/ESD20) AppleWebKit/530.17
384
+ (KHTML, like Gecko) Version/4.0 Mobile Safari/530.17
385
+ - Mozilla/5.0 (iPad; U; CPU OS 4_2_1 like Mac OS X; ja-jp) AppleWebKit/533.17.9 (KHTML,
386
+ like Gecko) Version/5.0.2 Mobile/8C148 Safari/6533.18.5
387
+ - Mozilla/5.0 (iPhone; U; CPU iPhone OS 4_2_1 like Mac OS X; da-dk) AppleWebKit/533.17.9
388
+ (KHTML, like Gecko) Version/5.0.2 Mobile/8C148 Safari/6533.18.5
389
+ - Mozilla/5.0 (iPad; CPU OS 6_0 like Mac OS X) AppleWebKit/536.26 (KHTML, like Gecko)
390
+ Version/6.0 Mobile/10A5355d Safari/8536.25
391
+ - Mozilla/5.0 (compatible; MSIE 9.0; Windows NT 6.1; Trident/5.0; XBLWP7; ZuneWP7)
392
+ UCBrowser/2.9.0.263
393
+ - Mozilla/5.0 (Linux; U; Android 2.3.3; en-us ; LS670 Build/GRI40) AppleWebKit/533.1
394
+ (KHTML, like Gecko) Version/4.0 Mobile Safari/533.1/UCBrowser/8.6.1.262/145/355
395
+ - Mozilla/3.0 (compatible; NetPositive/2.1.1; BeOS)
396
+ - Mozilla/5.0 (BeOS; U; BeOS BePC; en-US; rv:1.9a1) Gecko/20060702 SeaMonkey/1.5a
397
+ - Mozilla/5.0 (OS/2; Warp 4.5; rv:10.0.12) Gecko/20100101 Firefox/10.0.12
398
+ - Mozilla/5.0 (OS/2; Warp 4.5; rv:10.0.12) Gecko/20130108 Firefox/10.0.12 SeaMonkey/2.7.2
399
+ - 'Mozilla/5.0 (OS/2; U; OS/2; en-US) AppleWebKit/533.3 (KHTML, like Gecko) Arora/0.11.0
400
+ Safari/533.3 '
401
+ - 'Mozilla/5.0 (OS/2; U; OS/2; en-US) AppleWebKit/533.3 (KHTML, like Gecko) QupZilla/1.3.1
402
+ Safari/533.3 '
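Note on the list above: the entries are plain YAML sequence items. Long strings wrap onto an indented continuation line, and entries with a trailing space are single-quoted. The sketch below shows how such a file parses with Ruby's standard YAML library and how a caller might pick one entry at random. The `SAMPLE` excerpt and the `pick_user_agent` helper are illustrative assumptions. This part of the diff does not show how the gem itself consumes `user_agents.yml`.

```ruby
require "yaml"

# Three entries copied from the list above: one wrapped onto a second line,
# one short, and one single-quoted with a trailing space.
SAMPLE = <<~YAML
  ---
  - Mozilla/5.0 (X11; FreeBSD amd64) AppleWebKit/536.5 (KHTML like Gecko) Chrome/19.0.1084.56
    Safari/536.5
  - NetSurf/1.2 (NetBSD; amd64)
  - 'Mozilla/4.0 (compatible; MSIE 6.0; Windows CE; IEMobile 7.11) '
YAML

# safe_load only builds plain data types, which is all a list of strings needs.
AGENTS = YAML.safe_load(SAMPLE)

# Hypothetical helper (not part of the gem): choose one agent at random and
# strip the trailing space that some quoted entries carry.
def pick_user_agent(agents, rng = Random.new)
  agents.sample(random: rng).strip
end

puts pick_user_agent(AGENTS)
```

The wrapped entry comes back as a single string, with the line break folded into one space, so the continuation lines in the file need no special handling.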
@@ -1,3 +1,3 @@
1
1
  module Baiduserp
2
- VERSION = "2.0.5"
2
+ VERSION = "2.0.8"
3
3
  end
@@ -1,10 +1,10 @@
1
1
  class Baiduserp::Parser
2
2
  def _parse_ranks(file)
3
3
  result = []
4
- file[:doc].search("//table").each do |table|
4
+ file[:doc].search("div[@id='content_left']").first.children.each do |table|
5
5
  next if table.nil?
6
6
  id = table['id'].to_i
7
- next unless id > 0
7
+ next unless id > 0 && id < 3000
8
8
  r = {:rank => id}
9
9
 
10
10
  url = table.search('h3/a').first
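Note on the hunk above: the loop now walks the children of `div#content_left` instead of every `table`, and keeps only nodes whose `id` converts to an integer strictly between 0 and 3000. The sketch below isolates that filter, using plain hashes as hypothetical stand-ins for Nokogiri nodes. `rank_ids` and the sample data are assumptions for illustration only. The diff does not say what ids of 3000 and above represent.

```ruby
# Hypothetical stand-ins for the children of div#content_left; each hash is
# read the same way the parser reads a node, through node["id"].
children = [
  { "id" => "1" },     # numeric id in range
  { "id" => "2" },
  { "id" => nil },     # node without an id attribute
  { "id" => "rs" },    # non-numeric id
  { "id" => "3001" }   # numeric id outside the accepted range
]

# Same test as the hunk: keep ids that convert to an integer with 0 < id < 3000.
def rank_ids(nodes)
  nodes.each_with_object([]) do |node, ranks|
    next if node.nil?
    id = node["id"].to_i            # nil.to_i and "rs".to_i are both 0
    next unless id > 0 && id < 3000
    ranks << id
  end
end

RANKS = rank_ids(children)
puts RANKS.inspect
```

One thing to watch in the new line: `search(...).first` returns `nil` when the page has no `content_left` div, so the chained `.children` call would raise `NoMethodError` on such a page.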
@@ -5,7 +5,8 @@ class Baiduserp::Parser
5
5
 
6
6
  result = []
7
7
  relapersons.each do |rr|
8
- title = rr.search('span.opr-relaperson-subtitle-tip').first.content
8
+ title = rr.search('div.cr-title/span').first
9
+ title = title.content unless title.nil?
9
10
  r = []
10
11
  rr.search('p.opr-relaperson-name/a').each do |p|
11
12
  r << p['title']
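Note on the hunk above: the old line called `.content` directly on `search(...).first`, which raises `NoMethodError` when the selector matches nothing. The new pair of lines reads the content only when a node was found. The sketch below reproduces that guard with a hypothetical `FakeNode` struct in place of a Nokogiri node. `title_of` is an illustrative name, not part of the gem.

```ruby
# Hypothetical stand-in for a Nokogiri node: only #content is modelled.
FakeNode = Struct.new(:content)

# Mirrors the two-step assignment in the hunk: take the first match, then
# read its content only if a match exists.
def title_of(matches)
  title = matches.first
  title = title.content unless title.nil?
  title
end

puts title_of([FakeNode.new("Related people")]).inspect
puts title_of([]).inspect
```

With this guard a missing title becomes `nil` rather than an exception, so whatever consumes the result has to allow for a `nil` title.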
metadata CHANGED
@@ -1,14 +1,14 @@
1
1
  --- !ruby/object:Gem::Specification
2
2
  name: baiduserp
3
3
  version: !ruby/object:Gem::Version
4
- version: 2.0.5
4
+ version: 2.0.8
5
5
  platform: ruby
6
6
  authors:
7
7
  - MingQian Zhang
8
8
  autorequire:
9
9
  bindir: bin
10
10
  cert_chain: []
11
- date: 2013-09-23 00:00:00.000000000 Z
11
+ date: 2013-11-01 00:00:00.000000000 Z
12
12
  dependencies:
13
13
  - !ruby/object:Gem::Dependency
14
14
  name: nokogiri
@@ -63,7 +63,8 @@ files:
63
63
  - lib/parsers/right_weather.rb
64
64
  - bin/baiduserp
65
65
  - README.md
66
- homepage: https://github.com/mqzhang/baiduserp
66
+ - lib/baiduserp/user_agents.yml
67
+ homepage: https://github.com/semseo/baiduserp
67
68
  licenses: []
68
69
  metadata: {}
69
70
  post_install_message:
@@ -82,7 +83,7 @@ required_rubygems_version: !ruby/object:Gem::Requirement
82
83
  version: '0'
83
84
  requirements: []
84
85
  rubyforge_project:
85
- rubygems_version: 2.0.3
86
+ rubygems_version: 2.0.0
86
87
  signing_key:
87
88
  specification_version: 4
88
89
  summary: Baidu SERP