bot_checker 0.0.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -0,0 +1,1454 @@
1
+ - <a href='http://www.unchaos.com/'> UnChaos </a> From Chaos To Order Hybrid Web Search Engine.(vadim_gonchar@unchaos.com)
2
+ - <a href='http://www.unchaos.com/'> UnChaos Bot Hybrid Web Search Engine. </a> (vadim_gonchar@unchaos.com)
3
+ - <b> UnChaosBot From Chaos To Order UnChaos Hybrid Web Search Engine at www.unchaos.com </b> (info@unchaos.com)
4
+ - <http://www.sygol.com/> http://www.sygol.com
5
+ - /Nutch-0.9-devF
6
+ - +SitiDi.net/SitiDiBot/1.0 (+Have Good Day)
7
+ - -DIE-KRAEHE- META-SEARCH-ENGINE/1.1 http://www.die-kraehe.de
8
+ - 192.comAgent
9
+ - 4anything.com LinkChecker v2.0
10
+ - 8484 Boston Project v 1.0
11
+ - :robot/1.0 (linux) ( admin e-mail: undefined http://www.neofonie.de/loesungen/search/robot.html )
12
+ - A-Online Search
13
+ - A1 Sitemap Generator/1.0 (+http://www.micro-sys.dk/products/sitemap-generator/) miggibot/2006.01.24
14
+ - aardvark-crawler
15
+ - AbachoBOT
16
+ - AbachoBOT (Mozilla compatible)
17
+ - ABCdatos BotLink/5.xx.xxx#BBL
18
+ - Aberja Checkomat
19
+ - abot/0.1 (abot; http://www.abot.com; abot@abot.com)
20
+ - About/0.1libwww-perl/5.47
21
+ - Accelatech RSSCrawler/0.4
22
+ - accoona
23
+ - Accoona-AI-Agent/1.1.1 (crawler at accoona dot com)
24
+ - Accoona-AI-Agent/1.1.2 (aicrawler at accoonabot dot com)
25
+ - Ack (http://www.ackerm.com/)
26
+ - AcoiRobot
27
+ - Acoon Robot v1.50.001
28
+ - Acoon Robot v1.52 (http://www.acoon.de)
29
+ - Acoon-Robot 4.0.x.[xx] (http://www.acoon.de)
30
+ - Acoon-Robot v3.xx (http://www.acoon.de and http://www.acoon.com)
31
+ - Acorn/Nutch-0.9 (Non-Profit Search Engine; acorn.isara.org; acorn at isara dot org)
32
+ - AESOP_com_SpiderMan
33
+ - agadine/1.x.x (+http://www.agada.de)
34
+ - Agent-SharewarePlazaFileCheckBot/2.0+(+http://www.SharewarePlaza.com)
35
+ - AgentName/0.1 libwww-perl/5.48
36
+ - AIBOT/2.1 By +(www.21seek.com A Real artificial intelligence search engine China)
37
+ - aipbot/1.0 (aipbot; http://www.aipbot.com; aipbot@aipbot.com)
38
+ - aipbot/2-beta (aipbot dev; http://aipbot.com; aipbot@aipbot.com)
39
+ - Aladin/3.324
40
+ - Aleksika Spider/1.0 (+http://www.aleksika.com/)
41
+ - AlkalineBOT/1.3
42
+ - AlkalineBOT/1.4 (1.4.0326.0 RTM)
43
+ - Allesklar/0.1 libwww-perl/5.46
44
+ - Allrati/1.1 (+)
45
+ - AltaVista Intranet V2.0 AVS EVAL search@freeit.com
46
+ - AltaVista Intranet V2.0 Compaq Altavista Eval sveand@altavista.net
47
+ - AltaVista Intranet V2.0 evreka.com crawler@evreka.com
48
+ - AltaVista V2.0B crawler@evreka.com
49
+ - AmfibiBOT
50
+ - Amfibibot/0.06 (Amfibi Web Search; http://www.amfibi.com; agent@amfibi.com)
51
+ - Amfibibot/0.07 (Amfibi Robot; http://www.amfibi.com; agent@amfibi.com)
52
+ - amibot
53
+ - AnnoMille spider 0.1 alpha - http://www.annomille.it
54
+ - AnswerBus (http://www.answerbus.com/)
55
+ - antibot-V1.1.5/i586-linux-2.2
56
+ - AnzwersCrawl/2.0 (anzwerscrawl@anzwers.com.au;Engine)
57
+ - Apexoo Spider 1.x
58
+ - Aport
59
+ - appie 1.1 (www.walhello.com)
60
+ - ArabyBot (compatible; Mozilla/5.0; GoogleBot; FAST Crawler 6.4; http://www.araby.com;)
61
+ - ArachBot
62
+ - Arachnoidea (arachnoidea@euroseek.com)
63
+ - ArchitextSpider
64
+ - archive.org_bot
65
+ - Arikus_Spider
66
+ - Arquivo-web-crawler (compatible; heritrix/1.12.1 +http://arquivo-web.fccn.pt)
67
+ - ASAHA Search Engine Turkey V.001 (http://www.asaha.com/)
68
+ - Asahina-Antenna/1.x
69
+ - Asahina-Antenna/1.x (libhina.pl/x.x ; libtime.pl/x.x)
70
+ - ask.24x.info
71
+ - AskAboutOil/0.06-rcp (Nutch; http://www.nutch.org/docs/en/bot.html; nutch-agent@askaboutoil.com)
72
+ - asked/Nutch-0.8 (web crawler; http://asked.jp; epicurus at gmail dot com)
73
+ - ASPSeek/1.2.5
74
+ - ASPseek/1.2.9d
75
+ - ASPSeek/1.2.x
76
+ - ASPSeek/1.2.xa
77
+ - ASPseek/1.2.xx
78
+ - ASPSeek/1.2.xxpre
79
+ - ASSORT/0.10
80
+ - asterias/2.0
81
+ - AtlocalBot/1.1 +(http://www.atlocal.com/local-web-site-owner.html)
82
+ - Atomic_Email_Hunter/4.0
83
+ - Atomz/1.0
84
+ - atSpider/1.0
85
+ - Attentio/Nutch-0.9-dev (Attentio's beta blog crawler; www.attentio.com; info@attentio.com)
86
+ - augurfind
87
+ - augurnfind V-1.x
88
+ - autoemailspider
89
+ - autowebdir 1.1 (www.autowebdir.com)
90
+ - AV Fetch 1.0
91
+ - AVSearch-1.0(peter.turney@nrc.ca)
92
+ - AVSearch-3.0(AltaVista/AVC)
93
+ - axadine/ (Axadine Crawler; http://www.axada.de/; )
94
+ - AxmoRobot - Crawling your site for better indexing on www.axmo.com search engine.
95
+ - BabalooSpider/1.3 (BabalooSpider; http://www.babaloo.si; spider@babaloo.si)
96
+ - BaboomBot/1.x.x (+http://www.baboom.us)
97
+ - BaiduImagespider+(+http://www.baidu.jp/search/s308.html)
98
+ - BaiDuSpider
99
+ - Baiduspider+(+http://help.baidu.jp/system/05.html)
100
+ - Baiduspider+(+http://www.baidu.com/search/spider.htm)
101
+ - Baiduspider+(+http://www.baidu.com/search/spider_jp.html)
102
+ - Balihoo/Nutch-1.0-dev (Crawler for Balihoo.com search engine - obeys robots.txt and robots meta tags ; http://balihoo.com/index.aspx; robot at balihoo dot com)
103
+ - BarraHomeCrawler (albertof@barrahome.org)
104
+ - bdcindexer_2.6.2 (research@bdc)
105
+ - BDFetch
106
+ - BDNcentral Crawler v2.3 [en] (http://www.bdncentral.com/robot.html) (X11; I; Linux 2.0.44 i686)
107
+ - beautybot/1.0 (+http://www.uchoose.de/crawler/beautybot/)
108
+ - BebopBot/2.5.1 ( crawler http://www.apassion4jazz.net/bebopbot.html )
109
+ - BigCliqueBOT/1.03-dev (bigclicbot; http://www.bigclique.com; bot@bigclique.com)
110
+ - BIGLOTRON (Beta 2;GNU/Linux)
111
+ - Bigsearch.ca/Nutch-x.x-dev (Bigsearch.ca Internet Spider; http://www.bigsearch.ca/; info@enhancededge.com)
112
+ - BilgiBetaBot/0.8-dev (bilgi.com (Beta) ; http://lucene.apache.org/nutch/bot.html; nutch-agent@lucene.apache.org)
113
+ - BilgiBot/1.0(beta) (http://www.bilgi.com/; bilgi at bilgi dot com)
114
+ - Bitacle bot/1.1
115
+ - Bitacle Robot (V:1.0;) (http://www.bitacle.com)
116
+ - BlackWidow
117
+ - Blaiz-Bee/1.0 (+http://www.blaiz.net)
118
+ - Blaiz-Bee/2.00.8222 (BE Internet Search Engine http://www.rawgrunt.com)
119
+ - Blaiz-Bee/2.00.xxxx (+http://www.blaiz.net)
120
+ - BlitzBOT@tricus.net
121
+ - BlitzBOT@tricus.net (Mozilla compatible)
122
+ - BlogBot/1.x
123
+ - Bloglines Title Fetch/1.0 (http://www.bloglines.com)
124
+ - Bloglines-Images/0.1 (http://www.bloglines.com)
125
+ - Bloglines/3.1 (http://www.bloglines.com)
126
+ - Blogpulse (info@blogpulse.com)
127
+ - BlogPulseLive (support@blogpulse.com)
128
+ - BlogSearch/1.x +http://www.icerocket.com/
129
+ - blogsearchbot-pumpkin-3
130
+ - BlogsNowBot V 2.01 (+http://www.blogsnow.com/)
131
+ - BlogVibeBot-v1.1 (spider@blogvibe.nl)
132
+ - blogWatcher_Spider/0.1 (http://www.lr.pi.titech.ac.jp/blogWatcher/)
133
+ - BlogzIce/1.0 (+http://icerocket.com; rhodes@icerocket.com)
134
+ - BlogzIce/1.0 +http://www.icerocket.com/
135
+ - BloobyBot
136
+ - Bloodhound/Nutch-0.9 (Testing Crawler for Research - obeys robots.txt and robots meta tags ; http://balihoo.com/index.aspx; robot at balihoo dot com)
137
+ - boitho.com-dc/0.xx (http://www.boitho.com/dcbot.html)
138
+ - boitho.com-robot/1.x
139
+ - boitho.com-robot/1.x (http://www.boitho.com/bot.html)
140
+ - BPImageWalker/2.0 (www.bdbrandprotect.com)
141
+ - BravoBrian SpiderEngine MarcoPolo
142
+ - BruinBot (+http://webarchive.cs.ucla.edu/bruinbot.html)
143
+ - BSDSeek/1.0
144
+ - BTbot/0.x (+http://www.btbot.com/btbot.html)
145
+ - BuildCMS crawler (http://www.buildcms.com/crawler)
146
+ - BullsEye
147
+ - bumblebee@relevare.com
148
+ - BurstFindCrawler/1.1 (crawler.burstfind.com; http://crawler.burstfind.com; crawler@burstfind.com)
149
+ - Buscaplus Robi/1.0 (http://www.buscaplus.com/robi/)
150
+ - bwh3_user_agent
151
+ - Cabot/Nutch-0.9 (Amfibi's web-crawling robot; http://www.amfibi.com/cabot/; agent@amfibi.com)
152
+ - Cabot/Nutch-1.0-dev (Amfibi's web-crawling robot; http://www.amfibi.com/cabot/; agent@amfibi.com)
153
+ - carleson/1.0
154
+ - Carnegie_Mellon_University_Research_WebBOT-->PLEASE READ-->http://www.andrew.cmu.edu/~brgordon/webbot/index.html http://www.andrew.cmu.edu/~brgordon/webbot/index.html
155
+ - Carnegie_Mellon_University_WebCrawler http://www.andrew.cmu.edu/~brgordon/webbot/index.html
156
+ - Catall Spider
157
+ - CazoodleBot/CazoodleBot-0.1 (CazoodleBot Crawler; http://www.cazoodle.com/cazoodlebot; cazoodlebot@cazoodle.com)
158
+ - CCBot/1.0 (+http://www.commoncrawl.org/bot.html)
159
+ - ccubee/x.x
160
+ - Ceramic Tile Installation Guide (http://www.floorstransformed.com)
161
+ - cfetch/1.0
162
+ - China Local Browse 2.6
163
+ - ChristCRAWLER 2.0
164
+ - CipinetBot (http://www.cipinet.com/bot.html)
165
+ - ClariaBot/1.0
166
+ - Claymont.com
167
+ - CloakDetect/0.9 (+http://fulltext.seznam.cz/)
168
+ - Clushbot/2.x (+http://www.clush.com/bot.html)
169
+ - Clushbot/3.x-BinaryFury (+http://www.clush.com/bot.html)
170
+ - Clushbot/3.xx-Ajax (+http://www.clush.com/bot.html)
171
+ - Clushbot/3.xx-Hector (+http://www.clush.com/bot.html)
172
+ - Clushbot/3.xx-Peleus (+http://www.clush.com/bot.html)
173
+ - Cogentbot/1.X (+http://www.cogentsoftwaresolutions.com/bot.html)
174
+ - combine/0.0
175
+ - Combine/2.0 http://combine.it.lth.se/
176
+ - Combine/3 http://combine.it.lth.se/
177
+ - Combine/x.0
178
+ - cometrics-bot http://www.cometrics.de
179
+ - Computer_and_Automation_Research_Institute_Crawler crawler@ilab.sztaki.hu
180
+ - Comrite/0.7.1 (Nutch; http://lucene.apache.org/nutch/bot.html; nutch-agent@lucene.apache.org)
181
+ - ContactBot/0.2
182
+ - ContentSmartz
183
+ - Convera Internet Spider V6.x
184
+ - ConveraCrawler/0.2
185
+ - ConveraCrawler/0.9d (+http://www.authoritativeweb.com/crawl)
186
+ - ConveraMultiMediaCrawler/0.1 (+http://www.authoritativeweb.com/crawl)
187
+ - CoolBot
188
+ - cosmos/0.8_(robot@xyleme.com)
189
+ - cosmos/0.9_(robot@xyleme.com)
190
+ - CougarSearch/0.x (+http://www.cougarsearch.com/faq.shtml)
191
+ - Covac TexAs Arachbot
192
+ - Cowbot-0.1 (NHN Corp. / +82-2-3011-1954 / nhnbot@naver.com)
193
+ - Cowbot-0.1.x (NHN Corp. / +82-2-3011-1954 / nhnbot@naver.com)
194
+ - CrawlConvera0.1 (CrawlConvera@yahoo.com)
195
+ - Crawler (cometsearch@cometsystems.com)
196
+ - Crawler admin@crawler.de
197
+ - Crawler V 0.2.x admin@crawler.de
198
+ - crawler@alexa.com
199
+ - CrawlerBoy Pinpoint.com
200
+ - Crawllybot/0.1 (Crawllybot; +http://www.crawlly.com; crawler@crawlly.com)
201
+ - CreativeCommons/0.06-dev (Nutch; http://www.nutch.org/docs/en/bot.html; nutch-agent@lists.sourceforge.net)
202
+ - CrocCrawler vx.3 [en] (http://www.croccrawler.com) (X11; I; Linux 2.0.44 i686)
203
+ - csci_b659/0.13
204
+ - Cuasarbot/0.9b http://www.cuasar.com/spider_beta/
205
+ - CurryGuide SiteScan 1.1
206
+ - Custom Spider www.bisnisseek.com /1.0
207
+ - CyberPatrol SiteCat Webbot (http://www.cyberpatrol.com/cyberpatrolcrawler.asp)
208
+ - CydralSpider/1.x (Cydral Web Image Search; http://www.cydral.com)
209
+ - CydralSpider/3.0 (Cydral Image Search; http://www.cydral.com)
210
+ - DataCha0s/2.0
211
+ - DataCha0s/2.0
212
+ - DataFountains/DMOZ Downloader
213
+ - DataFountains/Dmoz Downloader (http://ivia.ucr.edu/useragents.shtml)
214
+ - DataFountains/DMOZ Feature Vector Corpus Creator (http://ivia.ucr.edu/useragents.shtml)
215
+ - DataparkSearch/4.47 (+http://dataparksearch.org/bot)
216
+ - DataparkSearch/4.xx (http://www.dataparksearch.org/)
217
+ - DataSpear/1.0 (Spider; http://www.dataspear.com/spider.html; spider@dataspear.com)
218
+ - DataSpearSpiderBot/0.2 (DataSpear Spider Bot; http://dssb.dataspear.com/bot.html; dssb@dataspear.com)
219
+ - DatenBot( http://www.sicher-durchs-netz.de/bot.html)
220
+ - DaviesBot/1.7 (www.wholeweb.net)
221
+ - daypopbot/0.x
222
+ - dbDig(http://www.prairielandconsulting.com)
223
+ - DBrowse 1.4b
224
+ - DBrowse 1.4d
225
+ - dCSbot/1.1
226
+ - de.searchengine.comBot 1.2 (http://de.searchengine.com/spider)
227
+ - deepak-USC/ISI
228
+ - DeepIndex
229
+ - DeepIndex ( http://www.zetbot.com )
230
+ - DeepIndex (www.en.deepindex.com)
231
+ - DeepIndexer.ca
232
+ - Demo Bot DOT 16b
233
+ - Demo Bot Z 16b
234
+ - Denmex websearch (http://search.denmex.com)
235
+ - dev-spider2.searchpsider.com/1.3b
236
+ - DiaGem/1.1 (http://www.skyrocket.gr.jp/diagem.html)
237
+ - Diamond/x.0
238
+ - DiamondBot
239
+ - Digger/1.0 JDK/1.3.0rc3
240
+ - DigOut4U
241
+ - DIIbot/1.2
242
+ - disco/Nutch-0.9 (experimental crawler; www.discoveryengine.com; disco-crawl@discoveryengine.com)
243
+ - disco/Nutch-1.0-dev (experimental crawler; www.discoveryengine.com; disco-crawl@discoveryengine.com)
244
+ - DittoSpyder
245
+ - dloader(NaverRobot)/1.0
246
+ - DoCoMo/1.0/Nxxxi/c10
247
+ - DoCoMo/1.0/Nxxxi/c10/TB
248
+ - DoCoMo/2.0 P900iV(c100;TB;W24H11)
249
+ - DoCoMo/2.0 SH902i (compatible; Y!J-SRD/1.0; http://help.yahoo.co.jp/help/jp/search/indexing/indexing-27.html)
250
+ - DoCoMo/2.0/SO502i (compatible; Y!J-SRD/1.0; http://help.yahoo.co.jp/help/jp/search/indexing/indexing-27.html)
251
+ - dodgebot/experimental
252
+ - Download-Tipp Linkcheck (http://download-tipp.de/)
253
+ - Drecombot/1.0 (http://career.drecom.jp/bot.html)
254
+ - DSurf15a 01
255
+ - DSurf15a 71
256
+ - DSurf15a 81
257
+ - DSurf15a VA
258
+ - dtSearchSpider
259
+ - DuckDuckBot/1.0; (+http://duckduckgo.com/duckduckbot.html)
260
+ - Dumbot(version 0.1 beta - dumbfind.com)
261
+ - Dumbot(version 0.1 beta - http://www.dumbfind.com/dumbot.html)
262
+ - Dumbot(version 0.1 beta)
263
+ - e-sense 1.0 ea(www.vigiltech.com/esensedisclaim.html)
264
+ - e-SocietyRobot(http://www.yama.info.waseda.ac.jp/~yamana/es/)
265
+ - eApolloBot/2.0 (compatible; heritrix/2.0.0-SNAPSHOT-20071024.170148 +http://www.eapollo-opto.com)
266
+ - EARTHCOM.info/1.x [www.earthcom.info]
267
+ - EARTHCOM.info/1.xbeta [www.earthcom.info]
268
+ - EasyDL/3.xx
269
+ - EasyDL/3.xx http://keywen.com/Encyclopedia/Bot
270
+ - EBrowse 1.4b
271
+ - EchO!/2.0
272
+ - Educate Search VxB
273
+ - egothor/3.0a (+http://www.xdefine.org/robot.html)
274
+ - EgotoBot/4.8 (+http://www.egoto.com/about.htm)
275
+ - ejupiter.com
276
+ - elfbot/1.0 (+http://www.uchoose.de/crawler/elfbot/)
277
+ - ELI/20070402:2.0 (DAUM RSS Robot Daum Communications Corp.; +http://ws.daum.net/aboutkr.html)
278
+ - EmailSiphon
279
+ - EmailSpider
280
+ - EmailWolf 1.00
281
+ - EMPAS_ROBOT
282
+ - EnaBot/1.x (http://www.enaball.com/crawler.html)
283
+ - Enfish Tracker
284
+ - Enterprise_Search/1.0
285
+ - Enterprise_Search/1.0.xxx
286
+ - Enterprise_Search/1.00.xxx;MSSQL (http://www.innerprise.net/es-spider.asp)
287
+ - envolk/1.7 (+http://www.envolk.com/envolkspiderinfo.php)
288
+ - envolk[ITS]spider/1.6(+http://www.envolk.com/envolkspider.html)
289
+ - EroCrawler
290
+ - ES.NET_Crawler/2.0 (http://search.innerprise.net/)
291
+ - eseek-larbin_2.6.2 (crawler@exactseek.com)
292
+ - ESISmartSpider
293
+ - eStyleSearch 4 (compatible; MSIE 6.0; Windows NT 5.0)
294
+ - ESurf15a 15
295
+ - EuripBot/0.x (+http://www.eurip.com) GetFile
296
+ - EuripBot/0.x (+http://www.eurip.com) GetRobots
297
+ - EuripBot/0.x (+http://www.eurip.com) PreCheck
298
+ - Eurobot/1.0 (http://www.ayell.eu)
299
+ - EvaalSE - bot@evaal.com
300
+ - eventax/1.3 (eventax; http://www.eventax.de/; info@eventax.de)
301
+ - Everest-Vulcan Inc./0.1 (R&D project; host=e-1-24; http://everest.vulcan.com/crawlerhelp)
302
+ - Everest-Vulcan Inc./0.1 (R&D project; http://everest.vulcan.com/crawlerhelp)
303
+ - Exabot-Images/1.0
304
+ - Exabot-Test/1.0
305
+ - Exabot/2.0
306
+ - Exabot/3.0
307
+ - ExactSeek Crawler/0.1
308
+ - exactseek-crawler-2.63 (crawler@exactseek.com)
309
+ - exactseek-pagereaper-2.63 (crawler@exactseek.com)
310
+ - exactseek.com
311
+ - Exalead NG/MimeLive Client (convert/http/0.120)
312
+ - Excalibur Internet Spider V6.5.4
313
+ - Execrawl/1.0 (Execrawl; http://www.execrawl.com/; bot@execrawl.com)
314
+ - exooba crawler/exooba crawler (crawler for exooba.com; http://www.exooba.com/; info at exooba dot com)
315
+ - exooba/exooba crawler (exooba; exooba)
316
+ - ExperimentalHenrytheMiragoRobot
317
+ - ExtractorPro
318
+ - EyeCatcher (Download-tipp.de)/1.0
319
+ - Factbot 1.09 (see http://www.factbites.com/webmasters.php)
320
+ - factbot : http://www.factbites.com/robots
321
+ - Fast Crawler Gold Edition
322
+ - FAST Enterprise Crawler 6 (Experimental)
323
+ - FAST Enterprise Crawler 6 / Scirus scirus-crawler@fast.no; http://www.scirus.com/srsapp/contactus/
324
+ - FAST Enterprise Crawler 6 used by Cobra Development (admin@fastsearch.com)
325
+ - FAST Enterprise Crawler 6 used by Comperio AS (sts@comperio.no)
326
+ - FAST Enterprise Crawler 6 used by FAST (FAST)
327
+ - FAST Enterprise Crawler 6 used by Pages Jaunes (pvincent@pagesjaunes.fr)
328
+ - FAST Enterprise Crawler 6 used by Sensis.com.au Web Crawler (search_comments\\at\\sensis\\dot\\com\\dot\\au)
329
+ - FAST Enterprise Crawler 6 used by Singapore Press Holdings (crawler@sphsearch.sg)
330
+ - FAST Enterprise Crawler/6 (www.fastsearch.com)
331
+ - FAST Enterprise Crawler/6.4 (helpdesk at fast.no)
332
+ - FAST FirstPage retriever (compatible; MSIE 5.5; Mozilla/4.0)
333
+ - FAST MetaWeb Crawler (helpdesk at fastsearch dot com)
334
+ - Fast PartnerSite Crawler
335
+ - FAST-WebCrawler/2.2.10 (Multimedia Search) (crawler@fast.no; http://www.fast.no/faq/faqfastwebsearch/faqfastwebcrawler.html)
336
+ - FAST-WebCrawler/2.2.6 (crawler@fast.no; http://www.fast.no/faq/faqfastwebsearch/faqfastwebcrawler.html)
337
+ - FAST-WebCrawler/2.2.7 (crawler@fast.no; http://www.fast.no/faq/faqfastwebsearch/faqfastwebcrawler.html)http://www.fast.no
338
+ - FAST-WebCrawler/2.2.8 (crawler@fast.no; http://www.fast.no/faq/faqfastwebsearch/faqfastwebcrawler.html)http://www.fast.no
339
+ - FAST-WebCrawler/3.2 test
340
+ - FAST-WebCrawler/3.3 (crawler@fast.no; http://fast.no/support.php?c=faqs/crawler)
341
+ - FAST-WebCrawler/3.4/Nirvana (crawler@fast.no; http://fast.no/support.php?c=faqs/crawler)
342
+ - FAST-WebCrawler/3.4/PartnerSite (crawler@fast.no; http://fast.no/support.php?c=faqs/crawler)
343
+ - FAST-WebCrawler/3.5 (atw-crawler at fast dot no; http://fast.no/support.php?c=faqs/crawler)
344
+ - FAST-WebCrawler/3.6 (atw-crawler at fast dot no; http://fast.no/support/crawler.asp)
345
+ - FAST-WebCrawler/3.6/FirstPage (crawler@fast.no; http://fast.no/support.php?c=faqs/crawler)
346
+ - FAST-WebCrawler/3.7 (atw-crawler at fast dot no; http://fast.no/support/crawler.asp)
347
+ - FAST-WebCrawler/3.7/FirstPage (atw-crawler at fast dot no;http://fast.no/support/crawler.asp)
348
+ - FAST-WebCrawler/3.8 (atw-crawler at fast dot no; http://fast.no/support/crawler.asp)
349
+ - FAST-WebCrawler/3.8/Fresh (atw-crawler at fast dot no; http://fast.no/support/crawler.asp)
350
+ - FAST-WebCrawler/3.x Multimedia
351
+ - FAST-WebCrawler/3.x Multimedia (mm dash crawler at fast dot no)
352
+ - fastbot crawler beta 2.0 (+http://www.fastbot.de)
353
+ - FastBug http://www.ay-up.com
354
+ - FastCrawler 3.0.1 (crawler@1klik.dk)
355
+ - FastSearch Web Crawler for Verizon SuperPages (kevin.watters@fastsearch.com)
356
+ - Favcollector/2.0 (info@favcollector.com http://www.favcollector.com/)
357
+ - favo.eu crawler/0.6 (http://www.favo.eu)
358
+ - Faxobot/1.0
359
+ - Feed Seeker Bot (RSS Feed Seeker http://www.MyNewFavoriteThing.com/fsb.php)
360
+ - Feed24.com
361
+ - FeedChecker/0.01
362
+ - Feedfetcher-Google; (+http://www.google.com/feedfetcher.html)
363
+ - FeedHub FeedDiscovery/1.0 (http://www.feedhub.com)
364
+ - FeedHub MetaDataFetcher/1.0 (http://www.feedhub.com)
365
+ - Feedjit Favicon Crawler 1.0
366
+ - Feedster Crawler/3.0; Feedster Inc.
367
+ - Felix - Mixcat Crawler (+http://mixcat.com)
368
+ - FFC Trap Door Spider
369
+ - Filtrbox/1.0
370
+ - Findexa Crawler (http://www.findexa.no/gulesider/article26548.ece)
371
+ - findlinks/x.xxx (+http://wortschatz.uni-leipzig.de/findlinks/)
372
+ - FineBot
373
+ - Firefly/1.0
374
+ - Firefly/1.0 (compatible; Mozilla 4.0; MSIE 5.5)
375
+ - Firefox (kastaneta03@hotmail.com)
376
+ - Firefox_1.0.6 (kasparek@naparek.cz)
377
+ - FirstGov.gov Search - POC:firstgov.webmasters@gsa.gov
378
+ - firstsbot
379
+ - Flapbot/0.7.2 (Flaptor Crawler; http://www.flaptor.com; crawler at flaptor period com)
380
+ - Flexum spider
381
+ - Flexum/2.0
382
+ - FlickBot 2.0 RPT-HTTPClient/0.3-3
383
+ - flunky
384
+ - FnooleBot/2.5.2 (+http://www.fnoole.com/addurl.html)
385
+ - FocusedSampler/1.0
386
+ - Folkd.com Spider/0.1 beta 1 (www.folkd.com)
387
+ - Fooky.com/ScorpionBot/ScoutOut; http://www.fooky.com/scorpionbots
388
+ - Francis/1.0 (francis@neomo.de http://www.neomo.de/)
389
+ - Franklin Locator 1.8
390
+ - FreeFind.com-SiteSearchEngine/1.0 (http://freefind.com; spiderinfo@freefind.com)
391
+ - FreshNotes crawler< report problems to crawler-at-freshnotes-dot-com
392
+ - FSurf15a 01
393
+ - FTB-Bot http://www.findthebest.co.uk/
394
+ - Full Web Bot 0416B
395
+ - Full Web Bot 0516B
396
+ - Full Web Bot 2816B
397
+ - FuseBulb.Com
398
+ - FyberSpider (+http://www.fybersearch.com/fyberspider.php)
399
+ - GAIS Robot/1.0B2
400
+ - Gaisbot/3.0 (indexer@gais.cs.ccu.edu.tw; http://gais.cs.ccu.edu.tw/robot.php)
401
+ - Gaisbot/3.0+(robot06@gais.cs.ccu.edu.tw;+http://gais.cs.ccu.edu.tw/robot.php)
402
+ - GalaxyBot/1.0 (http://www.galaxy.com/galaxybot.html)
403
+ - Gallent Search Spider v1.4 Robot 2 (http://robot.GallentSearch.com)
404
+ - gamekitbot/1.0 (+http://www.uchoose.de/crawler/gamekitbot/)
405
+ - GammaSpider/1.0
406
+ - gazz/x.x (gazz@nttrd.com)
407
+ - generic_crawler/01.0217/
408
+ - genieBot (http://64.5.245.11/faq/faq.html)
409
+ - geniebot wgao@genieknows.com
410
+ - GeonaBot 1.x; http://www.geona.com/
411
+ - gigabaz/3.1x (baz@gigabaz.com; http://gigabaz.com/gigabaz/)
412
+ - Gigabot/2.0 (gigablast.com)
413
+ - Gigabot/2.0/gigablast.com/spider.html
414
+ - Gigabot/2.0; http://www.gigablast.com/spider.html
415
+ - Gigabot/2.0att
416
+ - Gigabot/3.0 (http://www.gigablast.com/spider.html)
417
+ - Gigabot/x.0
418
+ - GigabotSiteSearch/2.0 (sitesearch.gigablast.com)
419
+ - GNODSPIDER (www.gnod.net)
420
+ - Goblin/0.9 (http://www.goguides.org/)
421
+ - Goblin/0.9.x (http://www.goguides.org/goblin-info.html)
422
+ - GoForIt.com
423
+ - GOFORITBOT ( http://www.goforit.com/about/ )
424
+ - gonzo1[P] +http://www.suchen.de/popups/faq.jsp
425
+ - gonzo2[P] +http://www.suchen.de/faq.html
426
+ - Goofer/0.2
427
+ - Googlebot-Image/1.0
428
+ - Googlebot-Image/1.0 ( http://www.googlebot.com/bot.html)
429
+ - Googlebot/2.1 ( http://www.google.com/bot.html)
430
+ - Googlebot/2.1 ( http://www.googlebot.com/bot.html)
431
+ - Googlebot/Test ( http://www.googlebot.com/bot.html)
432
+ - GrapeFX/0.3 libwww/5.4.0
433
+ - great-plains-web-spider/flatlandbot (Flatland Industries Web Spider; http://www.flatlandindustries.com/flatlandbot.php; jason@flatlandindustries.com)
434
+ - GrigorBot 0.8 (http://www.grigor.biz/bot.html)
435
+ - Gromit/1.0
436
+ - grub crawler(http://www.grub.org)
437
+ - grub-client
438
+ - gsa-crawler (Enterprise; GID-01422; jplastiras@google.com)
439
+ - gsa-crawler (Enterprise; GID-01742;gsatesting@rediffmail.com)
440
+ - gsa-crawler (Enterprise; GIX-02057; dm@enhesa.com)
441
+ - gsa-crawler (Enterprise; GIX-03519; cknuetter@stubhub.com)
442
+ - gsa-crawler (Enterprise; GIX-0xxxx; enterprise-training@google.com)
443
+ - Guestbook Auto Submitter
444
+ - Gulliver/1.3
445
+ - Gulper Web Bot 0.2.4 (www.ecsl.cs.sunysb.edu/~maxim/cgi-bin/Link/GulperBot)
446
+ - Gungho/0.08004 (http://code.google.com/p/gungho-crawler/wiki/Index)
447
+ - GurujiBot/1.0 (+http://www.guruji.com/WebmasterFAQ.html)
448
+ - GurujiImageBot/1.0 (+http://www.guruji.com/en/WebmasterFAQ.html)
449
+ - HappyFunBot/1.1
450
+ - Harvest-NG/1.0.2
451
+ - Hatena Antenna/0.4 (http://a.hatena.ne.jp/help#robot)
452
+ - Hatena Pagetitle Agent/1.0
453
+ - Hatena RSS/0.3 (http://r.hatena.ne.jp)
454
+ - hbtronix.spider.2 -- http://hbtronix.de/spider.php
455
+ - HeinrichderMiragoRobot
456
+ - HeinrichderMiragoRobot (http://www.miragorobot.com/scripts/deinfo.asp)
457
+ - Helix/1.x ( http://www.sitesearch.ca/helix/)
458
+ - HenriLeRobotMirago (http://www.miragorobot.com/scripts/frinfo.asp)
459
+ - HenrytheMiragoRobot
460
+ - HenryTheMiragoRobot (http://www.miragorobot.com/scripts/mrinfo.asp)
461
+ - Hi! I'm CsCrawler my homepage: http://www.kde.cs.uni-kassel.de/lehre/ss2005/googlespam/crawler.html RPT-HTTPClient/0.3-3
462
+ - Hippias/0.9 Beta
463
+ - HitList
464
+ - Hitwise Spider v1.0 http://www.hitwise.com
465
+ - holmes/3.11 (http://morfeo.centrum.cz/bot)
466
+ - holmes/3.9 (onet.pl)
467
+ - holmes/3.xx (OnetSzukaj/5.0; +http://szukaj.onet.pl)
468
+ - holmes/x.x
469
+ - HolmesBot (http://holmes.ge)
470
+ - HomePageSearch(hpsearch.uni-trier.de)
471
+ - Homerbot: www.homerweb.com
472
+ - Honda-Search/0.7.2 (Nutch; http://lucene.apache.org/nutch/bot.html; search@honda-search.com)
473
+ - HooWWWer/2.1.3 (debugging run) (+http://cosco.hiit.fi/search/hoowwwer/ | mailto:crawler-info<at>hiit.fi)
474
+ - HooWWWer/2.1.x ( http://cosco.hiit.fi/search/hoowwwer/ | mailto:crawler-info<at>hiit.fi)
475
+ - HPL/Nutch-0.9 -
476
+ - htdig/3.1.6 (http://computerorgs.com)
477
+ - htdig/3.1.6 (unconfigured@htdig.searchengine.maintainer)
478
+ - htdig/3.1.x (root@localhost)
479
+ - http://Ask.24x.Info/ (http://narres.it/)
480
+ - http://hilfe.acont.de/bot.html ACONTBOT
481
+ - http://www.almaden.ibm.com/cs/crawler
482
+ - http://www.almaden.ibm.com/cs/crawler [rc1.wf.ibm.com]
483
+ - http://www.almaden.ibm.com/cs/crawler [wf216]
484
+ - http://www.istarthere.com_spider@istarthere.com
485
+ - http://www.monogol.de
486
+ - http://www.trendtech.dk/spider.asp)
487
+ - i1searchbot/2.0 (i1search web crawler; http://www.i1search.com; crawler@i1search.com)
488
+ - IAArchiver-1.0
489
+ - iaskspider2 (iask@staff.sina.com.cn)
490
+ - ia_archiver
491
+ - ia_archiver-web.archive.org
492
+ - ia_archiver/1.6
493
+ - ICC-Crawler(Mozilla-compatible; http://kc.nict.go.jp/icc/crawl.html; icc-crawl(at)ml(dot)nict(dot)go(dot)jp)
494
+ - ICC-Crawler(Mozilla-compatible;http://kc.nict.go.jp/icc/crawl.html;icc-crawl-contact(at)ml(dot)nict(dot)go(dot)jp)
495
+ - iCCrawler (http://www.iccenter.net)
496
+ - ICCrawler - ICjobs (http://www.icjobs.de/bot.htm)
497
+ - ichiro/x.0 (http://help.goo.ne.jp/door/crawler.html)
498
+ - ichiro/x.0 (ichiro@nttr.co.jp)
499
+ - IconSurf/2.0 favicon finder (see http://iconsurf.com/robot.html)
500
+ - IconSurf/2.0 favicon monitor (see http://iconsurf.com/robot.html)
501
+ - ICRA_label_spider/x.0
502
+ - icsbot-0.1
503
+ - ideare - SignSite/1.x
504
+ - iFeed.jp/2.0 (www.psychedelix.com/agents/agents.rss; 0 subscribers)
505
+ - igdeSpyder (compatible; igde.ru; +http://igde.ru/doc/tech.html)
506
+ - IIITBOT/1.1 (Indian Language Web Search Engine; http://webkhoj.iiit.net; pvvpr at iiit dot ac dot in)
507
+ - ilial/Nutch-0.9 (Ilial Inc. is a Los Angeles based Internet startup company. For more information please visit http://www.ilial.com/crawler; http://www.ilial.com/crawler; crawl@ilial.com)
508
+ - ilial/Nutch-0.9-dev
509
+ - IlseBot/1.x
510
+ - IlTrovatore-Setaccio ( http://www.iltrovatore.it)
511
+ - Iltrovatore-Setaccio/0.3-dev (Indexing; http://www.iltrovatore.it/bot.html; info@iltrovatore.it)
512
+ - IlTrovatore-Setaccio/1.2 ( http://www.iltrovatore.it/aiuto/faq.html)
513
+ - Iltrovatore-Setaccio/1.2 (It-bot; http://www.iltrovatore.it/bot.html; info@iltrovatore.it)
514
+ - iltrovatore-setaccio/1.2-dev (spidering; http://www.iltrovatore.it/aiuto/.....)
515
+ - IlTrovatore/1.2 (IlTrovatore; http://www.iltrovatore.it/bot.html; bot@iltrovatore.it)
516
+ - ImageWalker/2.0 (www.bdbrandprotect.com)
517
+ - IncyWincy data gatherer(webmaster@loopimprovements.com
518
+ - IncyWincy page crawler(webmaster@loopimprovements.com
519
+ - IncyWincy(http://www.look.com)
520
+ - IncyWincy(http://www.loopimprovements.com/robot.html)
521
+ - IncyWincy/2.1(loopimprovements.com/robot.html)
522
+ - IndexTheWeb.com Crawler7
523
+ - Industry Program 1.0.x
524
+ - Inet library
525
+ - info@pubblisito.com- (http://www.pubblisito.com) il Sud dei Motori di Ricerca
526
+ - InfoFly/1.0 (http://www.versions-project.org/)
527
+ - INFOMINE/8.0 Adders
528
+ - INFOMINE/8.0 RemoteServices
529
+ - INFOMINE/8.0 VLCrawler (http://infomine.ucr.edu/useragents)
530
+ - InfoNaviRobot(F107)
531
+ - InfoSeek Sidewinder/0.9
532
+ - InfoSeek Sidewinder/1.0A
533
+ - InfoSeek Sidewinder/1.1A
534
+ - Infoseek SideWinder/1.45 (Compatible; MSIE 10.0; UNIX)
535
+ - Infoseek SideWinder/2.0B (Linux 2.4 i686)
536
+ - INGRID/3.0 MT (webcrawler@NOSPAMexperimental.net; http://webmaster.ilse.nl/jsp/webmaster.jsp)
537
+ - Inktomi Search
538
+ - InnerpriseBot/1.0 (http://www.innerprise.com/)
539
+ - Insitor.com search and find world wide!
540
+ - Insitornaut
541
+ - Internet Ninja x.0
542
+ - InternetArchive/0.8-dev(Nutch;http://lucene.apache.org/nutch/bot.html;nutch-agent@lucene.apache
543
+ - InternetSeer.com
544
+ - IOI/2.0 (ISC Open Index crawler; http://index.isc.org/; bot@index.isc.org)
545
+ - IPiumBot laurion(dot)com
546
+ - IpselonBot/0.xx-beta (Ipselon; http://www.ipselon.com; ipselonbot@ipselon.com)
547
+ - IRLbot/1.0 ( http://irl.cs.tamu.edu/crawler)
548
+ - IRLbot/3.0 (compatible; MSIE 6.0; http://irl.cs.tamu.edu/crawler/)
549
+ - ISC Systems iRc Search 2.1
550
+ - IUPUI Research Bot v 1.9a
551
+ - IWAgent/ 1.0 - www.brandprotect.com
552
+ - Jabot/6.x (http://odin.ingrid.org/)
553
+ - Jabot/7.x.x (http://odin.ingrid.org/)
554
+ - Jack
555
+ - Jambot/0.1.x (Jambot; http://www.jambot.com/blog; crawler@jambot.com)
556
+ - Jambot/0.2.1 (Jambot; http://www.jambot.com/blog/static.php?page=webmaster-robot; crawler@jambot.com)
557
+ - Jayde Crawler. http://www.jayde.com
558
+ - Jetbot/1.0
559
+ - JobSpider_BA/1.1
560
+ - Jyxobot/x
561
+ - k2spider
562
+ - KAIST AITrc Crawler
563
+ - KakleBot - www.kakle.com/0.1 (KakleBot - www.kakle.com; http:// www.kakle.com/bot.html; support@kakle.com)
564
+ - kalooga/kalooga-4.0-dev-datahouse (Kalooga; http://www.kalooga.com; info@kalooga.com)
565
+ - kalooga/KaloogaBot (Kalooga; http://www.kalooga.com/info.html?page=crawler; crawler@kalooga.com)
566
+ - Kenjin Spider
567
+ - Kevin http://dznet.com/kevin/
568
+ - Kevin http://websitealert.net/kevin/
569
+ - KE_1.0/2.0 libwww/5.2.8
570
+ - KFSW-Bot (Version: 1.01 powered by KFSW www.kfsw.de)
571
+ - kinja-imagebot (http://www.kinja.com/)
572
+ - kinjabot (http://www.kinja.com)
573
+ - KIT-Fireball/2.0
574
+ - KIT-Fireball/2.0 (compatible; Mozilla 4.0; MSIE 5.5)
575
+ - KnowItAll(knowitall@cs.washington.edu)
576
+ - Knowledge.com/0.x
577
+ - Krugle/KrugleNutch/0.8+ (Krugle web crawler; http://www.krugle.com/crawler/info.html; webcrawler@krugle.com)
578
+ - KSbot/1.0 (KnowledgeStorm crawler; http://www.knowledgestorm.com/resources/content/crawler/index.html; crawleradmin@knowledgestorm.com)
579
+ - kuloko-bot/0.x
580
+ - kulokobot www.kuloko.com kuloko@backweave.com
581
+ - kulturarw3/0.1
582
+ - LapozzBot/1.4 ( http://robot.lapozz.com)
583
+ - LapozzBot/1.5 (+http://robot.lapozz.hu)
584
+ - larbin (samualt9@bigfoot.com)
585
+ - LARBIN-EXPERIMENTAL (efp@gmx.net)
586
+ - larbin_2.1.1 larbin2.1.1@somewhere.com
587
+ - larbin_2.2.0 (crawl@compete.com)
588
+ - larbin_2.2.1_de_Viennot (Laurent.Viennot@inria.fr)
589
+ - larbin_2.2.2 (sugayama@lab7.kuis.kyoto-u.ac.jp)
590
+ - larbin_2.2.2_guillaume (guillaume@liafa.jussieu.fr)
591
+ - larbin_2.6.0 (larbin2.6.0@unspecified.mail)
592
+ - larbin_2.6.1 (larbin2.6.1@unspecified.mail)
593
+ - larbin_2.6.2 (hamasaki@grad.nii.ac.jp)
594
+ - larbin_2.6.2 (larbin2.6.2@unspecified.mail)
595
+ - larbin_2.6.2 (listonATccDOTgatechDOTedu)
596
+ - larbin_2.6.2 (pimenas@systems.tuc.gr)
597
+ - larbin_2.6.2 (tom@lemurconsulting.com)
598
+ - larbin_2.6.2 (vitalbox1@hotmail.com)
599
+ - larbin_2.6.3 (ltaa_web_crawler@groupes.epfl.ch)
600
+ - larbin_2.6.3 (wgao@genieknows.com)
601
+ - larbin_2.6.3_for_(http://cosco.hiit.fi/search/) tsilande@hiit.fi
602
+ - larbin_2.6_basileocaml (basile.starynkevitch@cea.fr)
603
+ - larbin_devel (http://pauillac.inria.fr/~ailleret/prog/larbin/)
604
+ - lawinfo-crawler/Nutch-0.9-dev (Crawler for lawinfo.com pages; http://www.lawinfo.com; webmaster@lawinfo.com)
605
+ - LECodeChecker/3.0 libgetdoc/1.0
606
+ - LEIA/2.90
607
+ - LEIA/3.01pr (LEIAcrawler; [SNIP])
608
+ - LetsCrawl.com/1.0 +http://letscrawl.com/
609
+ - LexiBot/1.00
610
+ - Libby_1.1/libwww-perl/5.47
611
+ - LibertyW (+http://www.lw01.com)
612
+ - libWeb/clsHTTP -- hiongun@kt.co.kr
613
+ - libwww-perl/5.41
614
+ - libwww-perl/5.45
615
+ - libwww-perl/5.48
616
+ - libwww-perl/5.52 FP/2.1
617
+ - libwww-perl/5.52 FP/4.0
618
+ - libwww-perl/5.65
619
+ - libwww-perl/5.800
620
+ - libwww/5.3.2
621
+ - LijitSpider/Nutch-0.9 (Reports crawler; http://www.lijit.com/; info(a)lijit(d)com)
622
+ - Lincoln State Web Browser
623
+ - linkbot
624
+ - linknzbot
625
+ - Links 2.0 (http://gossamer-threads.com/scripts/links/)
626
+ - Links SQL (http://gossamer-threads.com/scripts/links-sql/)
627
+ - LinkScan/11.0beta2 UnixShareware robot from Elsop.com (used by Indiafocus/Indiainfo)
628
+ - LinkScan/9.0g Unix
629
+ - LinkScan/x.x Unix
630
+ - LiveTrans/Nutch-0.9 (maintainer: cobain at iis dot sinica dot edu dot tw; http://wkd.iis.sinica.edu.tw/LiveTrans/)
631
+ - Llaut/1.0 (http://mnm.uib.es/~gallir/llaut/bot.html)
632
+ - LMQueueBot/0.2
633
+ - lmspider (lmspider@scansoft.com)
634
+ - LNSpiderguy
635
+ - LocalBot/1.0 ( http://www.localbot.co.uk/)
636
+ - LocalcomBot/1.2.x ( http://www.local.com/bot.htm)
637
+ - Lockstep Spider/1.0
638
+ - Look.com
639
+ - Lovel as 1.0 ( +http://www.everatom.com)
640
+ - LTI/LemurProject Nutch Spider/Nutch-1.0-dev (lti crawler for CMU; http://www.lti.cs.cmu.edu; changkuk at cmu dot edu)
641
+ - LTI/LemurProject Nutch Spider/Nutch-1.0-dev (Research spider using Nutch; http://www.lemurproject.org; mhoy@cs.cmu.edu)
642
+ - lwp-trivial/1.32
643
+ - lwp-trivial/1.34
644
+ - lwp-trivial/1.34
645
+ - LWP::Simple/5.22
646
+ - LWP::Simple/5.36
647
+ - LWP::Simple/5.48
648
+ - LWP::Simple/5.50
649
+ - LWP::Simple/5.51
650
+ - LWP::Simple/5.53
651
+ - LWP::Simple/5.63
652
+ - LWP::Simple/5.803
653
+ - Lycos_Spider_(modspider)
654
+ - Lycos_Spider_(T-Rex)
655
+ - Lynx/2.8.4rel.1 libwww-FM/2.14 SSL-MM/1.4.1 OpenSSL/0.9.6c (human-guided@lerly.net)
656
+ - Mac Finder 1.0.xx
657
+ - Mackster( http://www.ukwizz.com )
658
+ - Mahiti.Com/Mahiti Crawler-1.0 (Mahiti.Com; http://mahiti.com ; mahiti.com)
659
+ - Mail.Ru/1.0
660
+ - mailto:webcraft@bea.com
661
+ - mammoth/1.0 ( http://www.sli-systems.com/)
662
+ - MantraAgent
663
+ - MapoftheInternet.com ( http://MapoftheInternet.com)
664
+ - Mariner/5.1b [de] (Win95; I ;Kolibri gncwebbot)
665
+ - Marketwave Hit List
666
+ - Martini
667
+ - MARTINI
668
+ - Marvin v0.3
669
+ - MaSagool/1.0 (MaSagool; http://sagool.jp/; info@sagool.jp)
670
+ - MasterSeek
671
+ - Mata Hari/2.00
672
+ - Matrix S.p.A. - FAST Enterprise Crawler 6 (Unknown admin e-mail address)
673
+ - maxomobot/dev-20051201 (maxomo; http://67.102.134.34:4047/MAXOMO/MAXOMObot.html; maxomobot@maxomo.com)
674
+ - MDbot/1.0 (+http://www.megadownload.net/bot.html)
675
+ - MediaCrawler-1.0 (Experimental)
676
+ - Mediapartners-Google/2.1 ( http://www.googlebot.com/bot.html)
677
+ - MediaSearch/0.1
678
+ - MegaSheep v1.0 (www.searchuk.com internet sheep)
679
+ - Megite2.0 (http://www.megite.com)
680
+ - Mercator-1.x
681
+ - Mercator-2.0
682
+ - Mercator-Scrub-1.1
683
+ - Metaeuro Web Crawler/0.2 (MetaEuro Web Search Clustering Engine; http://www.metaeuro.com; crawler at metaeuro dot com)
684
+ - MetaGer-LinkChecker
685
+ - MetagerBot/0.8-dev (MetagerBot; http://metager.de; )
686
+ - MetaGer_PreChecker0.1
687
+ - Metaspinner/0.01 (Metaspinner; http://www.meta-spinner.de/; support@meta-spinner.de/)
688
+ - metatagsdir/0.7 (+http://metatagsdir.com/directory/)
689
+ - MFC Foundation Class Library 4.0
690
+ - MicroBaz
691
+ - Microsoft Small Business Indexer
692
+ - Microsoft URL Control - 6.00.8xxx
693
+ - MicrosoftPrototypeCrawler (How's my crawling? mailto:newbiecrawler@hotmail.com)
694
+ - Missauga Locate 1.0.0
695
+ - Missigua Locator 1.9
696
+ - Missouri College Browse
697
+ - Misterbot-Nutch/0.7.1 (Misterbot-Nutch; http://www.misterbot.fr; admin@misterbot.fr)
698
+ - Miva (AlgoFeedback@miva.com)
699
+ - Mizzu Labs 2.2
700
+ - MJ12bot/vx.x.x (http://majestic12.co.uk/bot.php?+)
701
+ - MJ12bot/vx.x.x (http://www.majestic12.co.uk/projects/dsearch/mj12bot.php)
702
+ - MJBot (SEO assessment)
703
+ - MLBot (www.metadatalabs.com)
704
+ - MnogoSearch/3.2.xx
705
+ - Mo College 1.9
706
+ - moget/x.x (moget@goo.ne.jp)
707
+ - mogimogi/1.0
708
+ - MojeekBot/0.x (archi; http://www.mojeek.com/bot.html)
709
+ - Morris - Mixcat Crawler ( http://mixcat.com)
710
+ - Mouse-House/7.4 (spider_monkey spider info at www.mobrien.com/sm.shtml)
711
+ - mozDex/0.xx-dev (mozDex; http://www.mozdex.com/en/bot.html; spider@mozdex.com)
712
+ - Mozilla (Mozilla@somewhere.com)
713
+ - Mozilla 4.0(compatible; BotSeer/1.0; +http://botseer.ist.psu.edu)
714
+ - Mozilla/2.0 (compatible; Ask Jeeves)
715
+ - Mozilla/2.0 (compatible; Ask Jeeves/Teoma)
716
+ - Mozilla/2.0 (compatible; Ask Jeeves/Teoma; http://about.ask.com/en/docs/about/webmasters.shtml)
717
+ - Mozilla/2.0 (compatible; Ask Jeeves/Teoma; http://sp.ask.com/docs/about/tech_crawling.html)
718
+ - Mozilla/2.0 (compatible; EZResult -- Internet Search Engine)
719
+ - Mozilla/2.0 (compatible; NEWT ActiveX; Win32)
720
+ - Mozilla/2.0 (compatible; T-H-U-N-D-E-R-S-T-O-N-E)
721
+ - Mozilla/3.0 (compatible; Fluffy the spider; http://www.searchhippo.com/; info@searchhippo.com)
722
+ - Mozilla/3.0 (compatible; Indy Library)
723
+ - Mozilla/3.0 (compatible; MuscatFerret/1.5.4; claude@euroferret.com)
724
+ - Mozilla/3.0 (compatible; MuscatFerret/1.5; olly@muscat.co.uk)
725
+ - Mozilla/3.0 (compatible; MuscatFerret/1.6.x; claude@euroferret.com)
726
+ - Mozilla/3.0 (compatible; scan4mail (advanced version) http://www.peterspages.net/?scan4mail)
727
+ - Mozilla/3.0 (compatible; ScollSpider; http://www.webwobot.com)
728
+ - Mozilla/3.0 (compatible; Webinator-DEV01.home.iprospect.com/2.56)
729
+ - Mozilla/3.0 (compatible; Webinator-indexer.cyberalert.com/2.56)
730
+ - Mozilla/3.0 (INGRID/3.0 MT; webcrawler@NOSPAMexperimental.net; http://aanmelden.ilse.nl/?aanmeld_mode=webhints)
731
+ - Mozilla/3.0 (Slurp.so/Goo; slurp@inktomi.com; http://www.inktomi.com/slurp.html)
732
+ - Mozilla/3.0 (Slurp/cat; slurp@inktomi.com; http://www.inktomi.com/slurp.html)
733
+ - Mozilla/3.0 (Slurp/si; slurp@inktomi.com; http://www.inktomi.com/slurp.html)
734
+ - Mozilla/3.0 (Vagabondo/1.1 MT; webcrawler@NOSPAMwise-guys.nl; http://webagent.wise-guys.nl/)
735
+ - Mozilla/3.0 (Vagabondo/1.x MT; webagent@wise-guys.nl; http://webagent.wise-guys.nl/)
736
+ - Mozilla/3.0 (Vagabondo/2.0 MT; webcrawler@NOSPAMexperimental.net; http://aanmelden.ilse.nl/?aanmeld_mode=webhints)
737
+ - Mozilla/3.0 (Vagabondo/2.0 MT; webcrawler@NOSPAMwise-guys.nl; http://webagent.wise-guys.nl/)
738
+ - Mozilla/3.01 (Compatible; Links2Go Similarity Engine)
739
+ - Mozilla/4.0
740
+ - Mozilla/4.0 (agadine3.0) www.agada.de
741
+ - Mozilla/4.0 (compatible; Vagabondo/2.2; webcrawler at wise-guys dot nl; http://webagent.wise-guys.nl/)
742
+ - Mozilla/4.0 (compatible; Vagabondo/4.0Beta; webcrawler at wise-guys dot nl; http://webagent.wise-guys.nl/)
743
+ - Mozilla/4.0 (compatible; Advanced Email Extractor v2.xx)
744
+ - Mozilla/4.0 (compatible; B_L_I_T_Z_B_O_T)
745
+ - Mozilla/4.0 (compatible; ChristCrawler.com ChristCrawler@ChristCENTRAL.com)
746
+ - Mozilla/4.0 (compatible; crawlx crawler@trd.overture.com)
747
+ - Mozilla/4.0 (compatible; DAUMOA-video; +http://ws.daum.net/aboutkr.html)
748
+ - Mozilla/4.0 (compatible; FastCrawler3 support-fastcrawler3@fast.no)
749
+ - Mozilla/4.0 (compatible; FDSE robot)
750
+ - Mozilla/4.0 (compatible; GPU p2p crawler http://gpu.sourceforge.net/search_engine.php)
751
+ - Mozilla/4.0 (compatible; grub-client-0.2.x; Crawl your stuff with http://grub.org)
752
+ - Mozilla/4.0 (compatible; grub-client-0.3.x; Crawl your own stuff with http://grub.org)
753
+ - Mozilla/4.0 (compatible; grub-client-2.x)
754
+ - Mozilla/4.0 (compatible; Iplexx Spider/1.0 http://www.iplexx.at)
755
+ - Mozilla/4.0 (compatible; MSIE 4.01; Vonna.com b o t)
756
+ - Mozilla/4.0 (compatible; MSIE 4.01; Windows CE; PPC; 240x320; SPV M700; OpVer 19.123.2.733) OrangeBot-Mobile 2008.0 (mobilesearch.support@orange-ftgroup.com)
757
+ - Mozilla/4.0 (compatible; MSIE 4.0; Windows NT; Site Server 3.0 Robot) Indonesia Interactive
758
+ - Mozilla/4.0 (compatible; MSIE 5.01; Windows NT 5.0) (samualt9@bigfoot.com)
759
+ - Mozilla/4.0 (compatible; MSIE 5.0; Windows 95) TrueRobot; 1.5
760
+ - Mozilla/4.0 (compatible; MSIE 5.0; Windows 95) VoilaBot BETA 1.2 (http://www.voila.com/)
761
+ - Mozilla/4.0 (compatible; MSIE 5.0; Windows 95) VoilaBot; 1.6
762
+ - Mozilla/4.0 (compatible; MSIE 5.0; Windows NT; DigExt; DTS Agent
763
+ - Mozilla/4.0 (compatible; MSIE 5.0; www.galaxy.com; www.psychedelix.com)
764
+ - Mozilla/4.0 (compatible; MSIE 5.0; www.galaxy.com; www.psychedelix.com/; http://www.galaxy.com/info/crawler.html)
765
+ - Mozilla/4.0 (compatible; MSIE 5.0; YANDEX)
766
+ - Mozilla/4.0 (compatible; MSIE 5.5; Windows NT 4.0; obot)
767
+ - Mozilla/4.0 (compatible; MSIE 5.5; Windows NT 4.0; QXW03018)
768
+ - Mozilla/4.0 (compatible; MSIE 6.0 compatible; Asterias Crawler v4; +http://www.singingfish.com/help/spider.html; webmaster@singingfish.com); SpiderThread Revision: 3.10
769
+ - Mozilla/4.0 (compatible; MSIE 6.0; MSIE 5.5; Windows NT 5.1) Skampy/0.9.x [en]
770
+ - Mozilla/4.0 (compatible; MSIE 6.0; TargetSeek/1.0; +http://www.targetgroups.net/TargetSeek.html)
771
+ - Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0; ODP entries t_st; http://tuezilla.de/t_st-odp-entries-agent.html)
772
+ - Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0; ODP links test; http://tuezilla.de/test-odp-links-agent.html)
773
+ - Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0; ZoomSpider.net bot; .NET CLR 1.1.4322)
774
+ - Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; heritrix/1.3.0 http://www.cs.washington.edu/research/networking/websys/)
775
+ - Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; QihooBot 1.0 qihoobot@qihoo.net)
776
+ - Mozilla/4.0 (compatible; MSIE 6.0; Windows NT; MS Search 4.0 Robot)
777
+ - Mozilla/4.0 (compatible; MSIE enviable; DAUMOA 2.0; DAUM Web Robot; Daum Communications Corp. Korea; +http://ws.daum.net/aboutkr.html)
778
+ - Mozilla/4.0 (compatible; MSIE is not me; DAUMOA/1.0.1; DAUM Web Robot; Daum Communications Corp. Korea)
779
+ - Mozilla/4.0 (compatible; NaverBot/1.0; http://help.naver.com/delete_main.asp)
780
+ - Mozilla/4.0 (compatible; SpeedySpider; www.entireweb.com)
781
+ - Mozilla/4.0 (compatible; www.galaxy.com)
782
+ - Mozilla/4.0 (compatible; Y!J; for robot study; keyoshid)
783
+ - Mozilla/4.0 (compatible; Yahoo Japan; for robot study; kasugiya)
784
+ - Mozilla/4.0 (JemmaTheTourist;http://www.activtourist.com)
785
+ - Mozilla/4.0 (MobilePhone SCP-5500/US/1.0) NetFront/3.0 MMP/2.0 (compatible; Googlebot/2.1; http://www.google.com/bot.html)
786
+ - Mozilla/4.0 (MobilePhone SCP-5500/US/1.0) NetFront/3.0 MMP/2.0 FAKE (compatible; Googlebot/2.1; http://www.google.com/bot.html)
787
+ - Mozilla/4.0 (Mozilla; http://www.mozilla.org/docs/en/bot.html; master@mozilla.com)
788
+ - Mozilla/4.0 (Sleek Spider/1.2)
789
+ - Mozilla/4.0 compatible FurlBot/Furl Search 2.0 (FurlBot; http://www.furl.net; wn.furlbot@looksmart.net)
790
+ - Mozilla/4.0 compatible ZyBorg/1.0 (wn.zyborg@looksmart.net; http://www.WISEnutbot.com)
791
+ - Mozilla/4.0 compatible ZyBorg/1.0 (ZyBorg@WISEnutbot.com; http://www.WISEnutbot.com)
792
+ - Mozilla/4.0 compatible ZyBorg/1.0 Dead Link Checker (wn.zyborg@looksmart.net; http://www.WISEnutbot.com)
793
+ - Mozilla/4.0 compatible ZyBorg/1.0 for Homepage (ZyBorg@WISEnutbot.com; http://www.WISEnutbot.com)
794
+ - Mozilla/4.0 efp@gmx.net
795
+ - Mozilla/4.0 [en] (Ask Jeeves Corporate Spider)
796
+ - Mozilla/4.0(compatible; Zealbot 1.0)
797
+ - Mozilla/4.04 (compatible; Dulance bot; +http://www.dulance.com/bot.jsp)
798
+ - Mozilla/4.0_(compatible;_MSIE_5.0;_Windows_95)_TrueRobot/1.4 libwww/5.2.8
799
+ - Mozilla/4.0_(compatible;_MSIE_5.0;_Windows_95)_VoilaBot/1.6 libwww/5.3.2
800
+ - Mozilla/4.6 [en] (http://www.cnet.com/)
801
+ - Mozilla/4.7
802
+ - Mozilla/4.7 (compatible; http://eidetica.com/spider)
803
+ - Mozilla/4.7 (compatible; Intelliseek; http://www.intelliseek.com)
804
+ - Mozilla/4.7 (compatible; Whizbang)
805
+ - Mozilla/4.7 (compatible; WhizBang; http://www.whizbang.com/crawler)
806
+ - Mozilla/4.7 [en](BecomeBot@exava.com)
807
+ - Mozilla/4.7 [en](Exabot@exava.com)
808
+ - Mozilla/4.72 [en] (BACS http://www.ba.be)
809
+ - Mozilla/5.0
810
+ - Mozilla/5.0 (+http://www.eurekster.com/mammoth) Mammoth/0.1
811
+ - Mozilla/5.0 (+http://www.sli-systems.com/) Mammoth/0.1
812
+ - Mozilla/5.0 (Clustered-Search-Bot/1.0; support@clush.com; http://www.clush.com/)
813
+ - Mozilla/5.0 (compatible; +http://www.evri.com/evrinid)
814
+ - Mozilla/5.0 (compatible; 008/0.83; http://www.80legs.com/spider.html;) Gecko/2008032620
815
+ - Mozilla/5.0 (compatible; Abonti/0.8 - http://www.abonti.com)
816
+ - Mozilla/5.0 (compatible; aiHitBot/1.0; +http://www.aihit.com/)
817
+ - Mozilla/5.0 (compatible; AnsearchBot/1.x; +http://www.ansearch.com.au/)
818
+ - Mozilla/5.0 (compatible; archive.org_bot/1.10.0 +http://www.loc.gov/minerva/crawl.html)
819
+ - Mozilla/5.0 (compatible; archive.org_bot/1.13.1x http://crawler.archive.org)
820
+ - Mozilla/5.0 (compatible; archive.org_bot/1.5.0-200506132127 http://crawler.archive.org) Hurricane Katrina
821
+ - Mozilla/5.0 (compatible; Ask Jeeves/Teoma; http://about.ask.com/en/docs/about/webmasters.shtml)
822
+ - Mozilla/5.0 (compatible; BecomeBot/1.23; http://www.become.com/webmasters.html)
823
+ - Mozilla/5.0 (compatible; BecomeBot/1.xx; MSIE 6.0 compatible; http://www.become.com/webmasters.html)
824
+ - Mozilla/5.0 (compatible; BecomeBot/2.0beta; http://www.become.com/webmasters.html)
825
+ - Mozilla/5.0 (compatible; BecomeBot/2.x; MSIE 6.0 compatible; http://www.become.com/site_owners.html)
826
+ - Mozilla/5.0 (compatible; BecomeJPBot/2.3; MSIE 6.0 compatible; +http://www.become.co.jp/site_owners.html)
827
+ - Mozilla/5.0 (compatible; BlogRefsBot/0.1; http://www.blogrefs.com/about/bloggers)
828
+ - Mozilla/5.0 (compatible; Bot; +http://pressemitteilung.ws/spamfilter
829
+ - Mozilla/5.0 (compatible; BuzzRankingBot/1.0; +http://www.buzzrankingbot.com/)
830
+ - Mozilla/5.0 (compatible; Charlotte/1.0b; charlotte@betaspider.com)
831
+ - Mozilla/5.0 (compatible; Charlotte/1.0b; http://www.searchme.com/support/)
832
+ - Mozilla/5.0 (compatible; Crawling jpeg; http://www.yama.info.waseda.ac.jp)
833
+ - Mozilla/5.0 (compatible; de/1.13.2 +http://www.de.com)
834
+ - Mozilla/5.0 (compatible; Diffbot/0.1; +http://www.diffbot.com)
835
+ - Mozilla/5.0 (compatible; DNS-Digger-Explorer/1.0; +http://www.dnsdigger.com)
836
+ - Mozilla/5.0 (compatible; DNS-Digger/1.0; +http://www.dnsdigger.com)
837
+ - Mozilla/5.0 (compatible; EARTHCOM.info/2.01; http://www.earthcom.info)
838
+ - Mozilla/5.0 (compatible; EARTHCOM/2.2; +http://enter4u.eu)
839
+ - Mozilla/5.0 (compatible; Exabot Test/3.0; +http://www.exabot.com/go/robot)
840
+ - Mozilla/5.0 (compatible; FatBot 2.0; http://www.thefind.com/main/CrawlerFAQs.fhtml)
841
+ - Mozilla/5.0 (compatible; Galbot/1.0; +http://www.galbot.com/bot.html)
842
+ - mozilla/5.0 (compatible; genevabot http://www.healthdash.com)
843
+ - Mozilla/5.0 (compatible; Googlebot/2.1; http://www.google.com/bot.html)
844
+ - Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)
845
+ - mozilla/5.0 (compatible; heritrix/1.0.4 http://innovationblog.com)
846
+ - Mozilla/5.0 (compatible; heritrix/1.10.2 +http://i.stanford.edu/)
847
+ - Mozilla/5.0 (compatible; heritrix/1.12.1 +http://newstin.com/)
848
+ - Mozilla/5.0 (compatible; heritrix/1.12.1 +http://www.page-store.com)
849
+ - Mozilla/5.0 (compatible; heritrix/1.12.1 +http://www.page-store.com) [email:paul@page-store.com]
850
+ - mozilla/5.0 (compatible; heritrix/1.3.0 http://archive.crawler.org)
851
+ - Mozilla/5.0 (compatible; heritrix/1.4.0 +http://www.chepi.net)
852
+ - Mozilla/5.0 (compatible; heritrix/1.4t http://www.truveo.com/)
853
+ - Mozilla/5.0 (compatible; heritrix/1.5.0 http://www.l3s.de/~kohlschuetter/projects/crawling/)
854
+ - Mozilla/5.0 (compatible; heritrix/1.5.0-200506231921 http://pandora.nla.gov.au/crawl.html)
855
+ - Mozilla/5.0 (compatible; heritrix/1.6.0 http://www.worio.com/)
856
+ - Mozilla/5.0 (compatible; heritrix/1.7.0 +http://www.greaterera.com/)
857
+ - Mozilla/5.0 (compatible; heritrix/1.x.x +http://www.accelobot.com)
858
+ - Mozilla/5.0 (compatible; heritrix/2.0.0-RC1 +http://www.aol.com)
859
+ - Mozilla/5.0 (compatible; Hermit Search. Com; +http://www.hermitsearch.com)
860
+ - Mozilla/5.0 (compatible; HyperixScoop/1.3; +http://www.hyperix.com)
861
+ - Mozilla/5.0 (compatible; IDBot/1.0; +http://www.id-search.org/bot.html)
862
+ - Mozilla/5.0 (compatible; InterseekWeb/3.x)
863
+ - Mozilla/5.0 (compatible; Konqueror/3.5; Linux) KHTML/3.5.5 (like Gecko) (Exabot-Thumbnails)
864
+ - Mozilla/5.0 (compatible; LemSpider 0.1)
865
+ - Mozilla/5.0 (compatible; MojeekBot/2.0; http://www.mojeek.com/bot.html)
866
+ - Mozilla/5.0 (compatible; MSIE 6.0; Podtech Network; crawler_admin@podtech.net)
867
+ - Mozilla/5.0 (compatible; OnetSzukaj/5.0; http://szukaj.onet.pl)
868
+ - Mozilla/5.0 (compatible; PalmeraBot; http://www.links24h.com/help/palmera) Version 0.001
869
+ - Mozilla/5.0 (compatible; pogodak.ba/3.x)
870
+ - Mozilla/5.0 (compatible; Pogodak.hr/3.1)
871
+ - Mozilla/5.0 (compatible; PWeBot/3.1; http://www.programacionweb.net/robot.php)
872
+ - Mozilla/5.0 (compatible; Quantcastbot/1.0; www.quantcast.com)
873
+ - Mozilla/5.0 (compatible; ScoutJet; +http://www.scoutjet.com/)
874
+ - Mozilla/5.0 (compatible; Scrubby/2.2; http://www.scrubtheweb.com/)
875
+ - Mozilla/5.0 (compatible; ShunixBot/1.x.x +http://www.shunix.com/robot.htm)
876
+ - Mozilla/5.0 (compatible; ShunixBot/1.x; http://www.shunix.com/bot.htm)
877
+ - Mozilla/5.0 (compatible; SkreemRBot +http://skreemr.com)
878
+ - Mozilla/5.0 (compatible; SummizeBot +http://www.summize.com)
879
+ - Mozilla/5.0 (compatible; Synoobot/0.9; http://www.synoo.com/search/bot.html)
880
+ - Mozilla/5.0 (compatible; Theophrastus/x.x; http://users.cs.cf.ac.uk/N.A.Smith/theophrastus.php)
881
+ - Mozilla/5.0 (compatible; TridentSpider/3.1)
882
+ - Mozilla/5.0 (compatible; Vagabondo/2.1; webcrawler at wise-guys dot nl; http://webagent.wise-guys.nl/)
883
+ - Mozilla/5.0 (compatible; Webduniabot/1.0; +http://search.webdunia.com/bot.aspx)
884
+ - Mozilla/5.0 (compatible; worio bot heritrix/1.10.0 +http://worio.com)
885
+ - Mozilla/5.0 (compatible; WoW Lemmings Kathune/2.0;http://www.wowlemmings.com/kathune.html)
886
+ - Mozilla/5.0 (compatible; Yahoo! DE Slurp; http://help.yahoo.com/help/us/ysearch/slurp)
887
+ - Mozilla/5.0 (compatible; Yahoo! Slurp China; http://misc.yahoo.com.cn/help.html)
888
+ - Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)
889
+ - Mozilla/5.0 (compatible; YandexBot/3.0; +http://yandex.com/bots)
890
+ - Mozilla/5.0 (compatible; Yoono; http://www.yoono.com/)
891
+ - Mozilla/5.0 (compatible; YoudaoBot/1.0; http://www.youdao.com/help/webmaster/spider/; )
892
+ - Mozilla/5.0 (compatible; Zenbot/1.3; +http://zen.co.za/webmasters/)
893
+ - Mozilla/5.0 (compatible; zermelo +http://www.powerset.com) [email:paul@page-store.comcrawl@powerset.com]
894
+ - Mozilla/5.0 (compatible;archive.org_bot/1.7.1; collectionId=316; Archive-It; +http://www.archive-it.org)
895
+ - Mozilla/5.0 (compatible;archive.org_bot/heritrix-1.9.0-200608171144 +http://pandora.nla.gov.au/crawl.html)
896
+ - Mozilla/5.0 (compatible;MAINSEEK_BOT)
897
+ - Mozilla/5.0 (Slurp/cat; slurp@inktomi.com; http://www.inktomi.com/slurp.html)
898
+ - Mozilla/5.0 (Slurp/si; slurp@inktomi.com; http://www.inktomi.com/slurp.html)
899
+ - Mozilla/5.0 (Twiceler-0.9 http://www.cuill.com/twiceler/robot.html)
900
+ - Mozilla/5.0 (Version: xxxx Type:xx)
901
+ - Mozilla/5.0 (wgao@genieknows.com)
902
+ - Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US; rv:1.7.7) NimbleCrawler 1.11 obeys UserAgent NimbleCrawler For problems contact: crawler_at_dataalchemy.com
903
+ - Mozilla/5.0 (Windows; U; Windows NT 5.1; fr; rv:1.8.1) VoilaBot BETA 1.2 (support.voilabot@orange-ftgroup.com)
904
+ - Mozilla/5.0 (Windows; U; Windows NT 5.1; fr; rv:1.8.1) VoilaBot BETA 1.2 (support.voilabot@orange-ftgroup.com)
905
+ - Mozilla/5.0 (Windows;) NimbleCrawler 1.12 obeys UserAgent NimbleCrawler For problems contact: crawler@health
906
+ - Mozilla/5.0 (Windows;) NimbleCrawler 1.12 obeys UserAgent NimbleCrawler For problems contact: crawler@healthline.com
907
+ - Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.2.1; aggregator:Spinn3r (Spinn3r 3.1); http://spinn3r.com/robot) Gecko/20021130
908
+ - Mozilla/5.0 URL-Spider
909
+ - Mozilla/5.0 usww.com-Spider-for-w8.net
910
+ - Mozilla/5.0 wgao@genieknows.com
911
+ - Mozilla/5.0 [en] (compatible; Gulper Web Bot 0.2.4 www.ecsl.cs.sunysb.edu/~maxim/cgi-bin/Link/GulperBot)
912
+ - MQbot metaquerier.cs.uiuc.edu/crawler
913
+ - MQBOT/Nutch-0.9-dev (MQBOT Nutch Crawler; http://falcon.cs.uiuc.edu; mqbot@cs.uiuc.edu)
914
+ - msnbot-media/1.0 (+http://search.msn.com/msnbot.htm)
915
+ - msnbot-Products/1.0 (+http://search.msn.com/msnbot.htm)
916
+ - MSNBOT/0.xx (http://search.msn.com/msnbot.htm)
917
+ - msnbot/x.xx ( http://search.msn.com/msnbot.htm)
918
+ - MSNBOT_Mobile MSMOBOT Mozilla/2.0 (compatible; MSIE 4.02; Windows CE; Default)
919
+ - MSNPTC/1.0
920
+ - MSRBOT (http://research.microsoft.com/research/sv/msrbot)
921
+ - multicrawler ( http://sw.deri.org/2006/04/multicrawler/robots.html)
922
+ - MultiText/0.1
923
+ - MusicWalker2.0 ( http://www.somusical.com)
924
+ - MVAClient
925
+ - Mylinea.com Crawler 2.0
926
+ - Naamah 1.0.1/Blogbot (http://blogbot.de/)
927
+ - Naamah 1.0a/Blogbot (http://blogbot.de/)
928
+ - NABOT/5.0
929
+ - nabot_1.0
930
+ - NameOfAgent (CMS Spider)
931
+ - NASA Search 1.0
932
+ - NationalDirectory-WebSpider/1.3
933
+ - NationalDirectoryAddURL/1.0
934
+ - NaverBot-1.0 (NHN Corp. / +82-2-3011-1954 / nhnbot@naver.com)
935
+ - NaverBot_dloader/1.5
936
+ - NavissoBot
937
+ - NavissoBot/1.7 (+http://navisso.com/)
938
+ - NCSA Beta 1 (http://vias.ncsa.uiuc.edu/viasarchivinginformation.html)
939
+ - Nebullabot/2.2 (http://bot.nebulla.info)
940
+ - NEC Research Agent -- compuman at research.nj.nec.com
941
+ - Net-Seekr Bot/Net-Seekr Bot V1 (http://www.net-seekr.com)
942
+ - NetinfoBot/1.0 (http://netinfo.bg/netinfobot.html)
943
+ - NetLookout/2.24
944
+ - Netluchs/0.8-dev ( ; http://www.netluchs.de/; ___don't___spam_me_@netluchs.de)
945
+ - NetNoseCrawler/v1.0
946
+ - Netprospector JavaCrawler
947
+ - NetResearchServer(http://www.look.com)
948
+ - NetResearchServer/x.x(loopimprovements.com/robot.html)
949
+ - NetSeer/Nutch-0.9 (NetSeer Crawler; http://www.netseer.com; crawler@netseer.com)
950
+ - NetSprint -- 2.0
951
+ - NetWhatCrawler/0.06-dev (NetWhatCrawler from NetWhat.com; http://www.netwhat.com; support@netwhat.com)
952
+ - NetZippy
953
+ - NextGenSearchBot 1 (for information visit http://www.eliyon.com/NextGenSearchBot)
954
+ - NextopiaBOT (+http://www.nextopia.com) distributed crawler client beta v0.x
955
+ - NG-Search/0.90 (NG-SearchBot; http://www.ng-search.com; )
956
+ - NG/1.0
957
+ - NG/4.0.1229
958
+ - NITLE Blog Spider/0.01
959
+ - Noago Spider
960
+ - Nokia-WAPToolkit/1.2 googlebot(at)googlebot.com
961
+ - Nokia6610/1.0 (3.09) Profile/MIDP-1.0 Configuration/CLDC-1.0 (compatible;YahooSeeker/M1A1-R2D2; http://help.yahoo.com/help/us/ysearch/crawling/crawling-01.html)
962
+ - NokodoBot/1.x (+http://nokodo.com/bot.htm)
963
+ - Norbert the Spider(Burf.com)
964
+ - noxtrumbot/1.0 (crawler@noxtrum.com)
965
+ - noyona_0_1
966
+ - NP/0.1 (NP; http://www.nameprotect.com; npbot@nameprotect.com)
967
+ - NPBot (http://www.nameprotect.com/botinfo.html)
968
+ - NPBot-1/2.0
969
+ - Nsauditor/1.x
970
+ - nsyght.com/Nutch-1.0-dev (nsyght.com; Nsyght.com)
971
+ - nsyght.com/Nutch-x.x (nsyght.com; search.nsyght.com)
972
+ - nttdirectory_robot/0.9 (super-robot@super.navi.ocn.ne.jp)
973
+ - nuSearch Spider <a href='http://www.nusearch.com'>www.nusearch.com</a> (compatible; MSIE 4.01)
974
+ - NuSearch Spider (compatible; MSIE 6.0)
975
+ - NuSearch Spider www.nusearch.com
976
+ - Nutch
977
+ - Nutch crawler/Nutch-0.9 (picapage.com; admin@picapage.com)
978
+ - Nutch/Nutch-0.9 (Eurobot; http://www.ayell.eu )
979
+ - NutchCVS/0.0x-dev (Nutch; http://www.nutch.org/docs/bot.html; nutch-agent@lists.sourceforge.net)
980
+ - NutchCVS/0.7.1 (Nutch running at UW; http://www.nutch.org/docs/en/bot.html; sycrawl@cs.washington.edu)
981
+ - NutchEC2Test/Nutch-0.9-dev (Testing Nutch on Amazon EC2.; http://lucene.apache.org/nutch/bot.html; ec2test at lucene.com)
982
+ - NutchOrg/0.0x-dev (Nutch; http://www.nutch.org/docs/bot.html; nutch-agent@lists.sourceforge.net)
983
+ - nutchsearch/Nutch-0.9 (Nutch Search 1.0; herceg_novi at yahoo dot com)
984
+ - NutchVinegarCrawl/Nutch-0.8.1 (Vinegar; http://www.cs.washington.edu; eytanadar at gmail dot com)
985
+ - obidos-bot (just looking for books.)
986
+ - ObjectsSearch/0.01-dev (ObjectsSearch;http://www.ObjectsSearch.com/bot.html; support@thesoftwareobjects.com)
987
+ - ObjectsSearch/0.0x (ObjectsSearch; http://www.ObjectsSearch.com/bot.html; support@thesoftwareobjects.com)
988
+ - oBot ((compatible;Win32))
989
+ - Ocelli/1.x (http://www.globalspec.com/Ocelli)
990
+ - Octora Beta - www.octora.com
991
+ - Octora Beta Bot - www.octora.com
992
+ - OmniExplorer_Bot/1.0x (+http://www.omni-explorer.com) Internet CategorizerOmniExplorer http://www.omni-explorer.com/ car & shopping search (64.62.175.xxx)
993
+ - OmniExplorer_Bot/1.0x (+http://www.omni-explorer.com) Job Crawler
994
+ - OmniExplorer_Bot/1.1x (+http://www.omni-explorer.com) Torrent Crawler
995
+ - OmniExplorer_Bot/x.xx (+http://www.omni-explorer.com) WorldIndexer
996
+ - Onet.pl SA- http://szukaj.onet.pl
997
+ - OntoSpider/1.0 libwww-perl/5.65
998
+ - OOZBOT/0.20 ( http://www.setooz.com/oozbot.html ; agentname at setooz dot_com )
999
+ - OpenAcoon v4.0.x (www.openacoon.de)
1000
+ - Openbot/3.0+(robot-response@openfind.com.tw;+http://www.openfind.com.tw/robot.html)
1001
+ - Openfind data gatherer- Openbot/3.0+(robot-response@openfind.com.tw;+http://www.openfind.com.tw/robot.html)
1002
+ - Openfind Robot/1.1A2
1003
+ - OpenISearch/1.x (www.openisearch.com)
1004
+ - OpenTaggerBot (http://www.opentagger.com/opentaggerbot.htm)
1005
+ - OpenTextSiteCrawler/2.9.2
1006
+ - OpenWebSpider/0.x.x (http://www.openwebspider.org)
1007
+ - OpenWebSpider/x
1008
+ - OpidooBOT (larbin2.6.3@unspecified.mail)
1009
+ - Oracle Ultra Search
1010
+ - OrangeSpider
1011
+ - Orbiter/T-2.0 (+http://www.dailyorbit.com/bot.htm)
1012
+ - Overture-WebCrawler/3.8/Fresh (atw-crawler at fast dot no; http://fast.no/support/crawler.asp)
1013
+ - ozelot/2.7.3 (Search engine indexer; www.flying-cat.de/ozelot; ozelot@flying-cat.de)
1014
+ - PADLibrary Spider
1015
+ - PageBitesHyperBot/600 (http://www.pagebites.com/)
1016
+ - Pagebull http://www.pagebull.com/
1017
+ - page_verifier (http://www.securecomputing.com/goto/pv)
1018
+ - parallelContextFocusCrawler1.1parallelContextFocusCrawler1.1
1019
+ - ParaSite/1.0b (http://www.ianett.com/parasite/)
1020
+ - Patwebbot (http://www.herz-power.de/technik.html)
1021
+ - PBrowse 1.4b
1022
+ - pd02_1.0.0 pd02_1.0.0@dzimi@post.sk
1023
+ - PEERbot www.peerbot.com
1024
+ - PEval 1.4b
1025
+ - PicoSearch/1.0
1026
+ - Piffany_Web_Scraper_v0.x
1027
+ - Piffany_Web_Spider_v0.x
1028
+ - pipeLiner/0.3a (PipeLine Spider;http://www.pipeline-search.com/webmaster.html; webmaster'at'pipeline-search.com)
1029
+ - pipeLiner/0.xx (PipeLine Spider; http://www.pipeline-search.com/webmaster.html)
1030
+ - Pita
1031
+ - PJspider/3.0 (pjspider@portaljuice.com; http://www.portaljuice.com)
1032
+ - PlagiarBot/1.0
1033
+ - PluckFeedCrawler/2.0 (compatible; Mozilla 4.0; MSIE 5.5; http://www.pluck.com; 1 subscribers)
1034
+ - Pluggd/Nutch-0.9 (automated crawler http://www.pluggd.com;support at pluggd dot com)
1035
+ - Poirot
1036
+ - polybot 1.0 (http://cis.poly.edu/polybot/)
1037
+ - Pompos/1.x http://dir.com/pompos.html
1038
+ - Pompos/1.x pompos@iliad.fr
1039
+ - Popdexter/1.0
1040
+ - Port Huron Labs
1041
+ - PortalBSpider/2.0 (spider@portalb.com)
1042
+ - potbot 1.0
1043
+ - PRCrawler/Nutch-0.9 (data mining development project; crawler@projectrialto.com)
1044
+ - PrivacyFinder Cache Bot v1.0
1045
+ - PrivacyFinder/1.1
1046
+ - Production Bot 0116B
1047
+ - Production Bot 2016B
1048
+ - Production Bot DOT 3016B
1049
+ - Program Shareware 1.0.2
1050
+ - Project XP5 [2.03.07-111203]
1051
+ - PROve AnswerBot 4.0
1052
+ - ProWebGuide Link Checker (http://www.prowebguide.com)
1053
+ - psbot/0.1 (+http://www.picsearch.com/bot.html)
1054
+ - PSurf15a 11
1055
+ - PSurf15a 51
1056
+ - PSurf15a VA
1057
+ - psycheclone
1058
+ - PubCrawl (pubcrawl.stanford.edu)
1059
+ - pulseBot (pulse Web Miner)
1060
+ - PWeBot/1.2 Inspector (http://www.programacionweb.net/robot.php)
1061
+ - PycURL
1062
+ - Python-urllib/1.1x
1063
+ - Python-urllib/2.0a1
1064
+ - Qango.com Web Directory (http://www.qango.com/)
1065
+ - QEAVis Agent/Nutch-0.9 (Quantitative Evaluation of Academic Websites Visibility; http://nlp.uned.es/qeavis
1066
+ - QPCreep Test Rig ( We are not indexing- just testing )
1067
+ - QuepasaCreep ( crawler@quepasacorp.com )
1068
+ - QuepasaCreep v0.9.1x
1069
+ - QueryN Metasearch
1070
+ - QweeryBot/3.01 ( http://qweerybot.qweery.nl)
1071
+ - Qweery_robot.txt_CheckBot/3.01 (http://qweerybot.qweery.com)
1072
+ - R6_CommentReader_(www.radian6.com/crawler)
1073
+ - R6_FeedFetcher_(www.radian6.com/crawler)
1074
+ - rabaz (rabaz at gigabaz dot com)
1075
+ - RaBot/1.0 Agent-admin/phortse@hanmail.net
1076
+ - ramBot xtreme x.x
1077
+ - RAMPyBot - www.giveRAMP.com/0.1 (RAMPyBot - www.giveRAMP.com; http://www.giveramp.com/bot.html; support@giveRAMP.com)
1078
+ - RAMPyBot/0.8-dev (Nutch; http://lucene.apache.org/nutch/bot.html; nutch-agent@lucene.apache.org)
1079
+ - Rankivabot/3.2 (www.rankiva.com; 3.2; vzmxikn)
1080
+ - Rational SiteCheck (Windows NT)
1081
+ - Reaper [2.03.10-031204] (http://www.sitesearch.ca/reaper/)
1082
+ - Reaper/2.0x (+http://www.sitesearch.ca/reaper)
1083
+ - RedCarpet/1.2 (http://www.redcarpet-inc.com/robots.html)
1084
+ - RedCell/0.1 (InfoSec Search Bot (Coming Soon); http://www.telegenetic.net/bot.html; lhall@telegenetic.net)
1085
+ - RedCell/0.1 (RedCell; telegenetic.net/bot.html; lhall_at_telegenetic.net)
1086
+ - RedKernel WWW-Spider 2/0 (+http://www-spider.redkernel-softwares.com/)
1087
+ - rico/0.1
1088
+ - RixBot (http://babelserver.org/rix)
1089
+ - RoboCrawl (http://www.canadiancontent.net)
1090
+ - RoboCrawl (www.canadiancontent.net)
1091
+ - RoboPal (http://www.findpal.com/)
1092
+ - Robot/www.pj-search.com
1093
+ - Robot@SuperSnooper.Com
1094
+ - Robozilla/1.0
1095
+ - Rotondo/3.1 libwww/5.3.1
1096
+ - RRC (crawler_admin@bigfoot.com)
1097
+ - RSSMicro.com RSS/Atom Feed Robot
1098
+ - RSurf15a 41
1099
+ - RSurf15a 51
1100
+ - RSurf15a 81
1101
+ - RufusBot (Rufus Web Miner; http://64.124.122.252/feedback.html)
1102
+ - RufusBot (Rufus Web Miner; http://www.webaroo.com/rooSiteOwners.html)
1103
+ - sait/Nutch-0.9 (SAIT Research; http://www.samsung.com)
1104
+ - SandCrawler - Compatibility Testing
1105
+ - SapphireWebCrawler/1.0 (Sapphire Web Crawler using Nutch; http://boston.lti.cs.cmu.edu/crawler/; mhoy@cs.cmu.edu)
1106
+ - SapphireWebCrawler/Nutch-1.0-dev (Sapphire Web Crawler using Nutch; http://boston.lti.cs.cmu.edu/crawler/; mhoy@cs.cmu.edu)
1107
+ - savvybot/0.2
1108
+ - SBIder/0.7 (SBIder; http://www.sitesell.com/sbider.html; http://support.sitesell.com/contact-support.html)
1109
+ - SBIder/0.8-dev (SBIder; http://www.sitesell.com/sbider.html; http://support.sitesell.com/contact-support.html)
1110
+ - ScanWeb
1111
+ - ScholarUniverse/0.8 (Nutch;+http://scholaruniverse.com/bot.jsp; fetch-agent@scholaruniverse.com)
1112
+ - schwarzmann.biz-Spider_for_paddel.org+(http://www.innerprise.net/usp-spider.asp)
1113
+ - ScollSpider/2.0 (+http://www.webwobot.com/ScollSpider.php)
1114
+ - Scooter-3.0.EU
1115
+ - Scooter-3.0.FS
1116
+ - Scooter-3.0.HD
1117
+ - Scooter-3.0.VNS
1118
+ - Scooter-3.0QI
1119
+ - Scooter-3.2
1120
+ - Scooter-3.2.BT
1121
+ - Scooter-3.2.DIL
1122
+ - Scooter-3.2.EX
1123
+ - Scooter-3.2.JT
1124
+ - Scooter-3.2.NIV
1125
+ - Scooter-3.2.SF0
1126
+ - Scooter-3.2.snippet
1127
+ - Scooter-3.3dev
1128
+ - Scooter-ARS-1.1
1129
+ - Scooter-ARS-1.1-ih
1130
+ - scooter-venus-3.0.vns
1131
+ - Scooter-W3-1.0
1132
+ - Scooter-W3.1.2
1133
+ - Scooter/1.0
1134
+ - Scooter/1.0 scooter@pa.dec.com
1135
+ - Scooter/1.1 (custom)
1136
+ - Scooter/2.0 G.R.A.B. V1.1.0
1137
+ - Scooter/2.0 G.R.A.B. X2.0
1138
+ - Scooter/3.3
1139
+ - Scooter/3.3.QA.pczukor
1140
+ - Scooter/3.3.vscooter
1141
+ - Scooter/3.3_SF
1142
+ - Scooter2_Mercator_x-x.0
1143
+ - Scooter_bh0-3.0.3
1144
+ - Scooter_trk3-3.0.3
1145
+ - ScoutAbout
1146
+ - ScoutAnt/0.1; +http://www.ant.com/what_is_ant.com/
1147
+ - scoutmaster
1148
+ - Scrubby/2.x (http://www.scrubtheweb.com/)
1149
+ - Scrubby/3.0 (+http://www.scrubtheweb.com/help/technology.html)
1150
+ - Search+
1151
+ - Search-Engine-Studio
1152
+ - search.ch V1.4
1153
+ - search.ch V1.4.2 (spiderman@search.ch; http://www.search.ch)
1154
+ - Search/1.0 (http://www.innerprise.net/es-spider.asp)
1155
+ - searchbot admin@google.com
1156
+ - SearchByUsa/2 (SearchByUsa; http://www.SearchByUsa.com/bot.html; info@SearchByUsa.com)
1157
+ - SearchdayBot
1158
+ - SearchExpress Spider0.99
1159
+ - SearchGuild/DMOZ/Experiment (searchguild@gmail.com)
1160
+ - SearchGuild_DMOZ_Experiment (chris@searchguild.com)
1161
+ - Searchit-Now Robot/2.2 (+http://www.searchit-now.co.uk)
1162
+ - Searchmee! Spider v0.98a
1163
+ - SearchSight/2.0 (http://SearchSight.com/)
1164
+ - SearchSpider.com/1.1
1165
+ - Searchspider/1.2 (SearchSpider; http://www.searchspider.com; webmaster@searchspider.com)
1166
+ - SearchTone2.0 - IDEARE
1167
+ - Seekbot/1.0 (http://www.seekbot.net/bot.html) HTTPFetcher/0.3
1168
+ - Seekbot/1.0 (http://www.seekbot.net/bot.html) RobotsTxtFetcher/1.0 (XDF)
1169
+ - Seekbot/1.0 (http://www.seekbot.net/bot.html) RobotsTxtFetcher/1.2
1170
+ - Seeker.lookseek.com
1171
+ - Semager/1.1 (http://www.semager.de/blog/semager-bots/)
1172
+ - Semager/1.x (http://www.semager.de)
1173
+ - Sensis Web Crawler (search_comments\\at\\sensis\\dot\\com\\dot\\au)
1174
+ - Sensis.com.au Web Crawler (search_comments\\at\\sensis\\dot\\com\\dot\\au)
1175
+ - SeznamBot/1.0
1176
+ - SeznamBot/1.0 (+http://fulltext.seznam.cz/)
1177
+ - SeznamBot/2.0-test (+http://fulltext.sblog.cz/)
1178
+ - ShablastBot 1.0
1179
+ - Shim Crawler
1180
+ - Shim-Crawler(Mozilla-compatible; http://www.logos.ic.i.u-tokyo.ac.jp/crawler/; crawl@logos.ic.i.u-tokyo.ac.jp)
1181
+ - ShopWiki/1.0 ( +http://www.shopwiki.com/)
1182
+ - ShopWiki/1.0 ( +http://www.shopwiki.com/wiki/Help:Bot)
1183
+ - Shoula.com Crawler 2.0
1184
+ - SietsCrawler/1.1 (+http://www.siets.biz)
1185
+ - Sigram/Nutch-1.0-dev (Test agent for Nutch development; http://www.sigram.com/bot.html; bot at sigram dot com)
1186
+ - Siigle Orumcex v.001 Turkey (http://www.siigle.com)
1187
+ - silk/1.0
1188
+ - silk/1.0 (+http://www.slider.com/silk.htm)/3.7
1189
+ - Sirketcebot/v.01 (http://www.sirketce.com/bot.html)
1190
+ - SiteSpider +(http://www.SiteSpider.com/)
1191
+ - SiteTruth.com site rating system
1192
+ - SiteXpert
1193
+ - Skampy/0.9.x (http://www.skaffe.com/skampy-info.html)
1194
+ - Skimpy/0.x (http://www.skaffe.com/skampy-info.html)
1195
+ - Skywalker/0.1 (Skywalker; anonymous; anonymous)
1196
+ - Slarp/0.1
1197
+ - Slider_Search_v1-de
1198
+ - Slurp/2.0 (slurp@inktomi.com; http://www.inktomi.com/slurp.html)
1199
+ - Slurp/2.0-KiteWeekly (slurp@inktomi.com; http://www.inktomi.com/slurp.html)
1200
+ - Slurp/si (slurp@inktomi.com; http://www.inktomi.com/slurp.html)
1201
+ - Slurpy Verifier/1.0
1202
+ - SlySearch (slysearch@slysearch.com)
1203
+ - SlySearch/1.0 http://www.plagiarism.org/crawler/robotinfo.html
1204
+ - SlySearch/1.x http://www.slysearch.com
1205
+ - smartwit.com
1206
+ - SmiffyDCMetaSpider/1.0
1207
+ - snap.com beta crawler v0
1208
+ - Snapbot/1.0
1209
+ - Snapbot/1.0 (Snap Shots +http://www.snap.com)
1210
+ - SnykeBot/0.6 (http://www.snyke.com)
1211
+ - SocSciBot ()
1212
+ - SoftHypermarketFileCheckBot/1.0+(+http://www.softhypermaket.com)
1213
+ - sogou develop spider
1214
+ - Sogou Orion spider/3.0(+http://www.sogou.com/docs/help/webmasters.htm#07)
1215
+ - sogou spider
1216
+ - Sogou web spider/3.0(+http://www.sogou.com/docs/help/webmasters.htm#07)
1217
+ - sohu agent
1218
+ - sohu-search
1219
+ - Sosospider+(+http://help.soso.com/webspider.htm)
1220
+ - speedfind ramBot xtreme 8.1
1221
+ - Speedy Spider (Beta/x.x; speedy@entireweb.com)
1222
+ - Speedy Spider (Entireweb; Beta/1.0; http://www.entireweb.com/about/search_tech/speedyspider/)
1223
+ - Speedy_Spider (http://www.entireweb.com)
1224
+ - Sphere Scout&v4.0 - scout at sphere dot com
1225
+ - Sphider
1226
+ - Spida/0.1
1227
+ - Spider-Sleek/2.0 (+http://search-info.com/linktous.html)
1228
+ - spider.batsch.com
1229
+ - spider.yellopet.com - www.yellopet.com
1230
+ - Spider/maxbot.com admin@maxbot.com
1231
+ - SpiderKU/0.x
1232
+ - SpiderMan
1233
+ - SpiderMonkey/7.0x (SpiderMonkey.ca info at http://spidermonkey.ca/sm.shtml)
1234
+ - Spinne/2.0
1235
+ - Spinne/2.0 med
1236
+ - Spinne/2.0 med_AH
1237
+ - Spock Crawler (http://www.spock.com/crawler)
1238
+ - sportsuchmaschine.de-Robot (Version: 1.02- powered by www.sportsuchmaschine.de)
1239
+ - sproose/0.1-alpha (sproose crawler; http://www.sproose.com/bot.html; crawler@sproose.com)
1240
+ - Sqworm/2.9.81-BETA (beta_release; 20011102-760; i686-pc-linux-gnu)
1241
+ - Sqworm/2.9.85-BETA (beta_release; 20011115-775; i686-pc-linux-gnu)
1242
+ - SSurf15a 11
1243
+ - StackRambler/x.x
1244
+ - stat statcrawler@gmail.com
1245
+ - Steeler/1.x (http://www.tkl.iis.u-tokyo.ac.jp/~crawler/)
1246
+ - Steeler/3.3 (http://www.tkl.iis.u-tokyo.ac.jp/~crawler/)
1247
+ - Strategic Board Bot (+http://www.strategicboard.com)
1248
+ - Strategic Board Bot (+http://www.strategicboard.com)
1249
+ - Submission Spider at surfsafely.com
1250
+ - suchbaer.de
1251
+ - suchbaer.de (CrawlerAgent v0.103)
1252
+ - suchbot
1253
+ - Suchknecht.at-Robot
1254
+ - suchpadbot/1.0 (+http://www.suchpad.de)
1255
+ - SurferF3 1/0
1256
+ - suzuran
1257
+ - Swooglebot/2.0. (+http://swoogle.umbc.edu/swooglebot.htm)
1258
+ - SWSBot-Images/1.2 http://www.smartwaresoft.com/swsbot12.html
1259
+ - SygolBot http://www.sygol.net
1260
+ - SynoBot
1261
+ - Syntryx ANT Scout Chassis Pheromone; Mozilla/4.0 compatible crawler
1262
+ - Szukacz/1.x
1263
+ - Szukacz/1.x (robot; www.szukacz.pl/jakdzialarobot.html; szukacz@proszynski.pl)
1264
+ - tags2dir.com/0.8 (+http://tags2dir.com/directory/)
1265
+ - Tagword (http://tagword.com/dmoz_survey.php)
1266
+ - TCDBOT/Nutch-0.8 (PhD student research;http://www.tcd.ie; mcgettrs at t c d dot IE)
1267
+ - TECOMAC-Crawler/0.x
1268
+ - Tecomi Bot (http://www.tecomi.com/bot.htm)
1269
+ - Teemer (NetSeer Inc. is a Los Angeles based Internet startup company.; http://www.netseer.com/crawler.html; crawler@netseer.com)
1270
+ - Teoma MP
1271
+ - teomaagent crawler-admin@teoma.com
1272
+ - teomaagent1 [crawler-admin@teoma.com]
1273
+ - teoma_agent1
1274
+ - Teradex Mapper; mapper@teradex.com; http://www.teradex.com
1275
+ - terraminds-bot/1.0 (support@terraminds.de)
1276
+ - TerrawizBot/1.0 (+http://www.terrawiz.com/bot.html)
1277
+ - Test spider
1278
+ - TestCrawler/Nutch-0.9 (Testing Crawler for Research ; http://balihoo.com/index.aspx; tgautier at balihoo dot com)
1279
+ - TheRarestParser/0.2a (http://therarestwords.com/)
1280
+ - TheSuBot/0.1 (www.thesubot.de)
1281
+ - thumbshots-de-Bot (Version: 1.02- powered by www.thumbshots.de)
1282
+ - timboBot/0.9 http://www.breakingblogs.com/timbo_bot.html
1283
+ - TinEye/1.1 (http://tineye.com/crawler.html)
1284
+ - tivraSpider/1.0 (crawler@tivra.com)
1285
+ - TJG/Spider
1286
+ - Tkensaku/x.x(http://www.tkensaku.com/q.html)
1287
+ - Topodia/1.2-dev (Topodia - Crawler for HTTP content indexing; http://www.topodia.com/; support@topodia.com)
1288
+ - Toutatis x-xx.x (hoppa.com)
1289
+ - Toutatis x.x (hoppa.com)
1290
+ - Toutatis x.x-x
1291
+ - traazibot/testengine (+http://www.traazi.de)
1292
+ - Trampelpfad-Spider
1293
+ - Trampelpfad-Spider-v0.1
1294
+ - TSurf15a 11
1295
+ - Tumblr/1.0 RSS syndication (+http://www.tumblr.com/) (support@tumblr.com)
1296
+ - TurnitinBot/x.x (http://www.turnitin.com/robot/crawlerinfo.html)
1297
+ - Turnpike Emporium LinkChecker/0.1
1298
+ - TutorGig/1.5 (+http://www.tutorgig.com/crawler)
1299
+ - Tutorial Crawler 1.4 (http://www.tutorgig.com/crawler)
1300
+ - Twiceler www.cuill.com/robots.html
1301
+ - Twiceler-0.9 http://www.cuill.com/twiceler/robot.html
1302
+ - Tycoon Agent/Nutch-1.0-dev
1303
+ - TygoBot
1304
+ - TygoProwler
1305
+ - UIowaCrawler/1.0
1306
+ - UKWizz/Nutch-0.8.1 (UKWizz Nutch crawler; http://www.ukwizz.com/)
1307
+ - Ultraseek
1308
+ - Under the Rainbow 2.2
1309
+ - UofTDB_experiment (leehyun@cs.toronto.edu)
1310
+ - updated/0.1-alpha (updated crawler; http://www.updated.com; crawler@updated.com)
1311
+ - updated/0.1beta (updated.com; http://www.updated.com; crawler@updated.om)
1312
+ - Uptimebot
1313
+ - UptimeBot(www.uptimebot.com)
1314
+ - URL Spider Pro/x.xx (innerprise.net)
1315
+ - urlfan-bot/1.0; +http://www.urlfan.com/site/bot/350.html
1316
+ - URL_Spider_Pro/x.x
1317
+ - URL_Spider_Pro/x.x+(http://www.innerprise.net/usp-spider.asp)
1318
+ - User-Agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)
1319
+ - User-Agent: Mozilla/4.0 (SKIZZLE! Distributed Internet Spider v1.0 - www.SKIZZLE.com)
1320
+ - USyd-NLP-Spider (http://www.it.usyd.edu.au/~vinci/bot.html)
1321
+ - VadixBot
1322
+ - Vagabondo-WAP/2.0 (webcrawler at wise-guys dot nl; http://webagent.wise-guys.nl/)/1.0 Profile
1323
+ - Vagabondo/1.x MT (webagent@wise-guys.nl)
1324
+ - Vagabondo/2.0 MT
1325
+ - Vagabondo/2.0 MT (webagent at wise-guys dot nl)
1326
+ - Vagabondo/2.0 MT (webagent@NOSPAMwise-guys.nl)
1327
+ - Vagabondo/3.0 (webagent at wise-guys dot nl)
1328
+ - Vakes/0.01 (Vakes; http://www.vakes.com/; search@vakes.com)
1329
+ - versus 0.2 (+http://versus.integis.ch)
1330
+ - versus crawler eda.baykan@epfl.ch
1331
+ - VeryGoodSearch.com.DaddyLongLegs
1332
+ - verzamelgids.nl - Networking4all Bot/x.x
1333
+ - Verzamelgids/2.2 (http://www.verzamelgids.nl)
1334
+ - Vespa Crawler
1335
+ - VisBot/2.0 (Visvo.com Crawler; http://www.visvo.com/bot.html; bot@visvo.com)
1336
+ - Vision Research Lab image spider at vision.ece.ucsb.edu
1337
+ - VMBot/0.x.x (VMBot; http://www.VerticalMatch.com/; vmbot@tradedot.com)
1338
+ - Vortex/2.2 (+http://marty.anstey.ca/robots/vortex/)
1339
+ - voyager-hc/1.0
1340
+ - voyager/1.0
1341
+ - voyager/2.0 (http://www.kosmix.com/html/crawler.html)
1342
+ - VSE/1.0 (testcrawler@hotmail.com)
1343
+ - VSE/1.0 (testcrawler@vivisimo.com)
1344
+ - vspider
1345
+ - vspider/3.x
1346
+ - VWBOT/Nutch-0.9-dev (VWBOT Nutch Crawler; http://vwbot.cs.uiuc.edu;+vwbot@cs.uiuc.edu
1347
+ - W3SiteSearch Crawler_v1.1 http://www.w3sitesearch.de
1348
+ - wadaino.jp-crawler 0.2 (http://wadaino.jp/)
1349
+ - Wavefire/0.8-dev (Wavefire; http://www.wavefire.com; info@wavefire.com)
1350
+ - Waypath development crawler - info at waypath dot com
1351
+ - Waypath Scout v2.x - info at waypath dot com
1352
+ - Web Snooper
1353
+ - web2express.org/Nutch-0.9-dev (leveled playing field; http://web2express.org/; info at web2express.org)
1354
+ - WebAlta Crawler/1.2.1 (http://www.webalta.ru/bot.html)
1355
+ - WebarooBot (Webaroo Bot; http://64.124.122.252/feedback.html)
1356
+ - WebarooBot (Webaroo Bot; http://www.webaroo.com/rooSiteOwners.html)
1357
+ - webbandit/4.xx.0
1358
+ - Webclipping.com
1359
+ - WebCompass 2.0
1360
+ - WebCorp/1.0
1361
+ - webcrawl.net
1362
+ - WebFindBot(http://www.web-find.com)
1363
+ - Webglimpse 2.xx.x (http://webglimpse.net)
1364
+ - Weblog Attitude Diffusion 1.0
1365
+ - webmeasurement-bot http://rvs.informatik.uni-leipzig.de
1366
+ - WebRankSpider/1.37 (+http://ulm191.server4you.de/crawler/)
1367
+ - WebSearch.COM.AU/3.0.1 (The Australian Search Engine; http://WebSearch.COM.AU; Search@WebSearch.COM.AU)
1368
+ - WebSearchBench WebCrawler v0.1(Experimental)
1369
+ - WebsiteWorth v1.0
1370
+ - Webspinne/1.0 webmaster@webspinne.de
1371
+ - Websquash.com (Add url robot)
1372
+ - WebStat/1.0 (Unix; beta; 20040314)
1373
+ - Webster v0.3 ( http://webster.healeys.net/ )
1374
+ - WebVac (webmaster@pita.stanford.edu)
1375
+ - Webverzeichnis.de - Telefon: 01908 / 26005
1376
+ - WebVulnCrawl.unknown/1.0 libwww-perl/5.803
1377
+ - Wells Search II
1378
+ - WEP Search 00
1379
+ - WFARC
1380
+ - whatUseek_winona/3.0
1381
+ - WhizBang! Lab
1382
+ - Willow Internet Crawler by Twotrees V2.1
1383
+ - WinHTTP Example/1.0
1384
+ - WinkBot/0.06 (Wink.com search engine web crawler; http://www.wink.com/Wink:WinkBot; winkbot@wink.com)
1385
+ - WIRE/0.11 (Linux; i686; BotRobotSpiderCrawleraromano@cli.di.unipi.it)
1386
+ - WIRE/0.x (Linux; i686; BotRobotSpiderCrawler)
1387
+ - WISEbot/1.0 (WISEbot@koreawisenut.com; http://wisebot.koreawisenut.com)
1388
+ - worio heritrix bot (+http://worio.com/)
1389
+ - woriobot ( http://www.worio.com/)
1390
+ - WorldLight
1391
+ - Wotbox/alpha0.6 (bot@wotbox.com; http://www.wotbox.com)
1392
+ - Wotbox/alpha0.x.x (bot@wotbox.com; http://www.wotbox.com) Java/1.4.1_02
1393
+ - WSB WebCrawler V1.0 (Beta)- cl@cs.uni-dortmund.de
1394
+ - WSB http://websearchbench.cs.uni-dortmund.de
1395
+ - wume_crawler/1.1 (http://wume.cse.lehigh.edu/~xiq204/crawler/)
1396
+ - Wwlib/Linux
1397
+ - www.arianna.it
1398
+ - WWWeasel Robot v1.00 (http://wwweasel.de)
1399
+ - wwwster/1.x (Beta- mailto:gue@cis.uni-muenchen.de)
1400
+ - X-Crawler
1401
+ - xirq/0.1-beta (xirq; http://www.xirq.com; xirq@xirq.com)
1402
+ - xyro_(xcrawler@cosmos.inria.fr)
1403
+ - Y!J-BSC/1.0 (http://help.yahoo.co.jp/help/jp/search/indexing/indexing-15.html)
1404
+ - Y!J-SRD/1.0
1405
+ - Y!J/1.0 (http://help.yahoo.co.jp/help/jp/search/indexing/indexing-15.html)
1406
+ - yacy (www.yacy.net; v20040602; i386 Linux 2.4.26-gentoo-r13; java 1.4.2_06; MET/en)
1407
+ - yacybot (x86 Windows XP 5.1; java 1.5.0_06; Europe/de) yacy.net
1408
+ - Yahoo Pipes 1.0
1409
+ - Yahoo! Mindset
1410
+ - Yahoo-Blogs/v3.9 (compatible; Mozilla 4.0; MSIE 5.5; http://help.yahoo.com/help/us/ysearch/crawling/crawling-02.html )
1411
+ - Yahoo-MMAudVid/1.0 (mms dash mmaudvidcrawler dash support at yahoo dash inc dot com)
1412
+ - Yahoo-MMAudVid/2.0(mms dash mm aud vid crawler dash support at yahoo dash inc.com ;Mozilla 4.0 compatible; MSIE 7.0;Windows NT 5.0; .NET CLR 2.0)
1413
+ - Yahoo-MMCrawler/3.x (mm dash crawler at trd dot overture dot com)
1414
+ - Yahoo-Test/4.0
1415
+ - Yahoo-VerticalCrawler-FormerWebCrawler/3.9 crawler at trd dot overture dot com; http://www.alltheweb.com/help/webmaster/crawler
1416
+ - YahooFeedSeeker/2.0 (compatible; Mozilla 4.0; MSIE 5.5; http://publisher.yahoo.com/rssguide)
1417
+ - YahooSeeker-Testing/v3.9 (compatible; Mozilla 4.0; MSIE 5.5; http://search.yahoo.com/)
1418
+ - YahooSeeker/1.0 (compatible; Mozilla 4.0; MSIE 5.5; http://help.yahoo.com/help/us/shop/merchant/)
1419
+ - YahooSeeker/1.0 (compatible; Mozilla 4.0; MSIE 5.5; http://search.yahoo.com/yahooseeker.html)
1420
+ - YahooSeeker/1.1 (compatible; Mozilla 4.0; MSIE 5.5; http://help.yahoo.com/help/us/shop/merchant/)
1421
+ - YahooSeeker/bsv3.9 (compatible; Mozilla 4.0; MSIE 5.5; http://help.yahoo.com/help/us/ysearch/crawling/crawling-02.html )
1422
+ - YahooSeeker/CafeKelsa-dev (compatible; Konqueror/3.2; FreeBSD ;cafekelsa-dev-webmaster@yahoo-inc.com )
1423
+ - Yandex/1.01.001 (compatible; Win16; I)
1424
+ - Yanga WorldSearch Bot v1.1/beta (http://www.yanga.co.uk/)
1425
+ - yarienavoir.net/0.2
1426
+ - Yeti
1427
+ - Yeti/0.01 (nhn/1noon yetibot@naver.com check robots.txt daily and follows it)
1428
+ - Yeti/1.0 (NHN Corp.; http://help.naver.com/robots/)
1429
+ - yggdrasil/Nutch-0.9 (yggdrasil biorelated search engine; www dot biotec dot tu minus dresden do de slash schroeder; heiko dot dietze at biotec dot tu minus dresden dot de)
1430
+ - YodaoBot/1.0 (http://www.yodao.com/help/webmaster/spider/; )
1431
+ - yoofind/yoofind-0.1-dev (yoono webcrawler; http://www.yoono.com ; MyEmail)
1432
+ - yoogliFetchAgent/0.1
1433
+ - yoono/1.0 web-crawler/1.0
1434
+ - YottaCars_Bot/4.12 (+http://www.yottacars.com) Car Search Engine
1435
+ - YottaShopping_Bot/4.12 (+http://www.yottashopping.com) Shopping Search Engine
1436
+ - Zao-Crawler
1437
+ - Zao-Crawler 0.2b
1438
+ - Zao/0.1 (http://www.kototoi.org/zao/)
1439
+ - ZBot/1.00 (icaulfield@zeus.com)
1440
+ - Zearchit
1441
+ - ZeBot_lseek.net (bot@ze.bz)
1442
+ - ZeBot_www.ze.bz (ze.bz@hotmail.com)
1443
+ - zedzo.digest/0.1 (http://www.zedzo.com/)
1444
+ - zermelo Mozilla/5.0 compatible; heritrix/1.12.1 (+http://www.powerset.com) [email:crawl@powerset.comemail:paul@page-store.com]
1445
+ - zerxbot/Version 0.6 libwww-perl/5.79
1446
+ - Zeus ThemeSite Viewer Webster Pro V2.9 Win32
1447
+ - Zeus xxxxx Webster Pro V2.9 Win32
1448
+ - Zeusbot/0.07 (Ulysseek's web-crawling robot; http://www.zeusbot.com; agent@zeusbot.com)
1449
+ - ZipppBot/0.xx (ZipppBot; http://www.zippp.net; webmaster@zippp.net)
1450
+ - ZIPPPCVS/0.xx (ZipppBot/.xx;http://www.zippp.net; webmaster@zippp.net)
1451
+ - Zippy v2.0 - Zippyfinder.com
1452
+ - ZoomSpider - wrensoft.com
1453
+ - zspider/0.9-dev http://feedback.redkolibri.com/
1454
+ - ZyBorg/1.0 (ZyBorg@WISEnut.com; http://www.WISEnut.com)