websiteclassificationapi 2.10__tar.gz → 2.11__tar.gz

@@ -1,325 +1,331 @@
1
- Metadata-Version: 2.1
2
- Name: websiteclassificationapi
3
- Version: 2.10
4
- Summary: Website classification API
5
- Home-page: https://github.com/websitecategorization/websiteclassificationnapi
6
- Author-email: info@websitecategorizationapi.com
7
- License: UNKNOWN
8
- Description: # Website Classification API
9
-
10
- This is the Python3 client library for [URL Classification](https://www.websitecategorizationapi.com) service.
11
-
12
- Our Website Categorization API provides accurate URL/webpage classification based on the widely trusted IAB Taxonomy.
13
-
14
- Categorizations are done in real-time and using full-path URLs.You can also use our API to classify plain text.
15
-
16
- We return categorizations of URLS for the following taxonomies:
17
-
18
- IAB, version 3, with 4 Tiers (taxonomy from Internet Advertising Bureau - IAB,
19
- IAB, version 2, with 4 Tiers (taxonomy from Internet Advertising Bureau - IAB
20
- IPTC NewsCodes, especially suitable for News Categorization
21
- Web Content Filtering Taxonomy (44 categories)
22
- Google Shopping Taxonomy, used by millions of online retailers (5474 categories)
23
- Shopify Taxonomy, used by millions of online stores (10560 categories)
24
- Amazon Taxonomy (39004 categories)
25
-
26
-
27
- We also return the following classifications / enriched data about each URL:
28
-
29
- Detection of Malware/Social Engineering/Mailicious Software
30
- Web Technologies used
31
- Likely Buyer Personas
32
- Topics
33
- Key Entities Named
34
- Sentiment Analysis
35
- Similar companies / Competitors
36
- Similar domains
37
- Tags
38
- Keywords
39
-
40
-
41
- For those looking for eCommerce classification, we also provide [Smart Product Categorization AI](https://www.productcategorization.com). It supports Shopify, Google Shopping, eBay and 120 other marketplaces.
42
-
43
- Website classification API s a python library that allows to classify websites based on IAB.
44
-
45
- ## Installation
46
- ```
47
- pip install websiteclassificationapi
48
- ```
49
- ## Requirements
50
-
51
- Only Python 3 is supported. You need an API key which you can obtain at our website.
52
- Python library requires only requests package.
53
-
54
- ## Documentation
55
-
56
- More detailed API documentation on URL Classification is [available here](https://www.websitecategorizationapi.com/api.php).
57
-
58
- ## Examples
59
-
60
- Please check our API documentation page on our website for most up to date examples.
61
- ```
62
- from websiteclassificationapi import websiteclassificationapi
63
-
64
- api_key = 'h2XurA' # you can get API key from www.websitecategorizationapi.com
65
- url = 'www.alpha-quantum.com' # can be set to any valid URL
66
- classifier_type = 'iab1' # should be set to either iab1 (Tier 1 categorization) or iab2 (Tier 2 categorization) for general websites or ecommerce1, ecommerce2 and ecommerce3 for E-commerce or product websites
67
-
68
- # calling the API
69
- print(websiteclassificationapi.get_categorization(url,api_key,classifier_type))
70
- ```
71
-
72
- ## How to select classifiers of different taxonomies
73
-
74
- NEW (update October 2024): Our newest version of API supports classifications for up to 4 Tiers. It returns one or more of 700 IAB categories.
75
-
76
- Classifier_type should be set to either iab1 (Tier 1 categorization) or iab2 (Tier 2 categorization) for general websites or ecommerce1, ecommerce2 and ecommerce3 for E-commerce or product websites.
77
-
78
- IAB Tier 1 categorization returns probabilities of text being classified as one of 29 possible categories.
79
-
80
- IAB Tier 2 categorization returns probabilities of text being classified as one of 447 possible categories.
81
-
82
- Ecommerce Tier 1 categorization returns probabilities of text being classified as one of 21 possible categories.
83
-
84
- Ecommerce Tier 2 website categorization returns probabilities of text being classified as one of 182 possible categories.
85
-
86
- Ecommerce Tier 3 website categorization returns probabilities of text being classified as one of 1113 possible categories.
87
-
88
- ## Taxonomies
89
-
90
- The list of categories available by classifier is also known as Taxonomy. There are many taxonomies available, some are standard are well known, e.g. IAB taxonomy is well suited for ads and advertising in general, whereas Facebook product categories taxonomy
91
- is appropriate for ecommerce field.
92
-
93
- Taxonomy also differ in how many tiers, levels, or depths do they support. E.g. taxonomy may only support 1 set of main categories, or it can further subcategories.
94
-
95
- The categorization in the form of Tier 1/Tier 2/Tier 3/.... is also known as taxonomy path.
96
-
97
- The classifiers can be either built in a way that they predict single Tier categories or they can return full taxonomy paths. It really depends on the use case what is most appropriate.
98
-
99
- You can find more information about IAB taxonomy at this page: https://www.iab.com/guidelines/content-taxonomy/.
100
-
101
- Taxonomy should be chosen in a way that it suits your use case. E.g. let us say you have an online store and currently you just list your products without any categorizations.
102
-
103
- Then it may be very valuable if you could provide some kind of menus that categorize products in different verticals.
104
-
105
- Why? Because your users may more easily find your products, you will have more subpages that can be indexed by search engines and thus provide you with more traffic and visits.
106
-
107
- Having verticals set up may also mean better filtering and lead to higher conversions and thus lower cost of acquisition. There are a multitude of opportunities in adding categorization to an online store.
108
-
109
- ## Our other services
110
-
111
- We also provide APIs for other tasks: [Redaction API](https://www.redactionapi.net) and [Content Moderation API](https://www.contentmoderationapi.net) as well as [Anonymization API](https://www.anonymizationapi.com).
112
-
113
- ## AI explainability
114
-
115
- One of the unique features of classifiers is that they provide machine learning interpretability or artificial intelligence explainability (XAI) in the form of words that most contribute to resulting classification.
116
-
117
- Example 1 of explainability:
118
- ![Image1](https://www.websitecategorizationapi.com/product_categorization.png)
119
-
120
- Example 2 of explainability:
121
- ![Image1](https://www.websitecategorizationapi.com/productcategorizationnew1.jpg)
122
-
123
- Why the need of AI explainability?
124
-
125
- AI models are increasingly being used in ways that affect humans. E.g. you may apply for a loan at the bank and get rejected, but even though a human may have sent or explained you this, the decision may have actually been made by a machine learning model.
126
-
127
- Machine learning models making decisions is increasingly part of every day and because often these decisions are made by what could be termed black boxes, there is increasing desire for having ML decisions made in a way that are explainable.
128
-
129
- There are also many regulations that demand this, e.g. GDPR.
130
-
131
- ## Support for languages
132
-
133
- Classification service supports classifications of websites in 150 languages.
134
-
135
- ## Offline database of categorized domains
136
-
137
- We offer offline [URL database](https://www.websitecategorizationapi.com/url_database.php) of millions of categorized domains. It can be used web content filtering, AdTech marketing, cybersecurity, brand safety, contextual targeting.
138
-
139
- It is ideal for those use cases where you require very low latency of requests, which can be achieved with pre-classified websites stored in database.
140
-
141
- Another great source for [URL categorization database](https://www.urlcategorizationdatabase.com/)
142
-
143
- ## Handling websites with no texts
144
-
145
- When encountering websites that have no text and just images, our classifier relies on online optical character recognition API service to extract text (if any available) from images on the website. And then classify it.
146
-
147
- To deal with potential duplicates we use the reverse IP lookup of domains to find similar domains that are hosted on the same IP.
148
-
149
- ## Application of website categorization to technologies usage
150
-
151
- We have collected usage of technologies by millions of websites, by combining this with categorization, one can find interesting results.
152
-
153
- Here is for example usage of Intercom across industry verticals:
154
-
155
- ![Image1](https://user-images.githubusercontent.com/58834207/238138013-d59d68b8-d7e3-42d2-bdda-100924669b72.png)
156
-
157
- Based on 50 millions of usage points we built an AI recommender which can predict which technologies for company using a set of technologies.
158
-
159
- Here are e.g. recommendations for company using Mouse Flow: <table class="table" style="font-size:20px;line-height:30px"><thead><tr><th>Technology</th><th>AI Recommendation Score </th><th>Website</th></tr></thead><tbody><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-AppNexus">AppNexus</a></td><td>0.15</td><td>http://appnexus.com</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-Microsoft Clarity">Microsoft Clarity</a></td><td>0.14</td><td>https://clarity.microsoft.com</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-Osano">Osano</a></td><td>0.14</td><td>https://www.osano.com/</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-Jetpack">Jetpack</a></td><td>0.14</td><td>https://jetpack.com</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-Raphael">Raphael</a></td><td>0.14</td><td>https://dmitrybaranovskiy.github.io/raphael/</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-Svelte">Svelte</a></td><td>0.13</td><td>https://svelte.dev</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-AWS Certificate Manager">AWS Certificate Manager</a></td><td>0.12</td><td>https://aws.amazon.com/certificate-manager/</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-Extendify">Extendify</a></td><td>0.11</td><td>https://extendify.com</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-Kendo UI">Kendo UI</a></td><td>0.11</td><td>https://www.telerik.com/kendo-ui</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-Flywheel">Flywheel</a></td><td>0.11</td><td>https://getflywheel.com</td></tr></tbody></table>
160
-
161
-
162
-
163
- Our digital solutions are trusted by organizations seeking reliable, secure, and innovative tools. With the [image moderation api](https://www.contentmoderationapi.net), companies gain the ability to filter and manage user-generated visuals, all while staying informed with insights from [Harvard](https://www.harvard.edu). Protect privacy in every scenario using our [image anonymization](https://www.anomyizationapi.com) platform, engineered to meet global standards and inspired by research from [MIT](https://web.mit.edu). When sensitive information must be protected, our [pii redaction service](https://www.redactionapi.net) ensures that personal data is removed efficiently, reflecting approaches found at [Stanford](https://www.stanford.edu).
164
-
165
- Business growth is supported through our [corporate enrichment data](https://www.companydataapi.com), delivering actionable intelligence much like the work at [Cambridge](https://www.cam.ac.uk). For cybersecurity and research teams, our [domain dataset](https://www.urlcategorizationdatabase.com) offers extensive categorization, paralleling the data-centric methods at [Berkeley](https://www.berkeley.edu). Legal departments benefit from our [automated contract review](https://www.aicontractreviewtool.com), streamlining compliance and review processes as promoted by [Oxford](https://www.ox.ac.uk).
166
-
167
- By aligning our solutions with the excellence demonstrated at [Yale](https://www.yale.edu) and [Cornell](https://www.cornell.edu), we ensure that your organization stays at the cutting edge—combining technology, research, and practical results for every challenge.
168
-
169
-
170
- ## Example classifications
171
-
172
- Example classification for website www.github.com:
173
- ```
174
- {
175
- "classification": [
176
- {
177
- "category": "Technology & Computing",
178
- "value": 0.7621352908406164
179
- },
180
- {
181
- "category": "Business and Finance",
182
- "value": 0.0785701408756428
183
- },
184
- {
185
- "category": "Video Gaming",
186
- "value": 0.06626958968249749
187
- },
188
- {
189
- "category": "Fine Art",
190
- "value": 0.017105357862223433
191
- },
192
- {
193
- "category": "Hobbies & Interests",
194
- "value": 0.016812511656388394
195
- },
196
- {
197
- "category": "Sports",
198
- "value": 0.011396157737341801
199
- },
200
- {
201
- "category": "Home & Garden",
202
- "value": 0.009099685741207822
203
- },
204
- {
205
- "category": "Personal Finance",
206
- "value": 0.0076400890345109055
207
- },
208
- {
209
- "category": "News and Politics",
210
- "value": 0.006692288300928684
211
- },
212
- {
213
- "category": "Careers",
214
- "value": 0.0039930258544077606
215
- },
216
- {
217
- "category": "Automotive",
218
- "value": 0.0029276292555247764
219
- },
220
- {
221
- "category": "Events and Attractions",
222
- "value": 0.0026449624402393084
223
- },
224
- {
225
- "category": "Shopping",
226
- "value": 0.0023606962223306537
227
- },
228
- {
229
- "category": "Family and Relationships",
230
- "value": 0.0023174171750800186
231
- },
232
- {
233
- "category": "Music and Audio",
234
- "value": 0.0020517145262615513
235
- },
236
- {
237
- "category": "Movies",
238
- "value": 0.0018936850100483473
239
- },
240
- {
241
- "category": "Travel",
242
- "value": 0.0009448942095545797
243
- },
244
- {
245
- "category": "Science",
246
- "value": 0.0008432696857311802
247
- },
248
- {
249
- "category": "Pets",
250
- "value": 0.0006956402098649299
251
- },
252
- {
253
- "category": "Television",
254
- "value": 0.0005261918310662409
255
- },
256
- {
257
- "category": "Real Estate",
258
- "value": 0.0005058920662560916
259
- },
260
- {
261
- "category": "Religion & Spirituality",
262
- "value": 0.000492253420442475
263
- },
264
- {
265
- "category": "Healthy Living",
266
- "value": 0.0004690261931844088
267
- },
268
- {
269
- "category": "Medical Health",
270
- "value": 0.0004467617749304944
271
- },
272
- {
273
- "category": "Education",
274
- "value": 0.00036333686743226124
275
- },
276
- {
277
- "category": "Food & Drink",
278
- "value": 0.0003463620639422737
279
- },
280
- {
281
- "category": "Books and Literature",
282
- "value": 0.00027078317064036986
283
- },
284
- {
285
- "category": "Style & Fashion",
286
- "value": 0.00011770141998920516
287
- },
288
- {
289
- "category": "Pop Culture",
290
- "value": 0.00006764487171529734
291
- }
292
- ],
293
- "html": "29101",
294
- "language": "en",
295
- "status": 200
296
- }
297
- ```
298
-
299
- ## Useful resources used in development of website categorization
300
-
301
- - [Tensorflow](https://www.tensorflow.org/)
302
-
303
- - [Website categorization](https://medium.com/website-categorization/website-categorization-api-ca6c3e0f6c4d)
304
-
305
- - [Sklearn](https://scikit-learn.org/stable/)
306
-
307
- - [Smart product categorization](https://medium.com/product-categorization/product-categorization-introduction-d62bb92e8515)
308
-
309
- - [Introduction to URL Categorization Database](https://www.alpha-quantum.com/blog/url-database/url-database/)
310
-
311
- - [Top shopify stores](https://www.leadsquantum.com)
312
-
313
-
314
-
315
-
316
- Keywords: website categorization,classification,categorization
317
- Platform: UNKNOWN
318
- Classifier: License :: OSI Approved :: MIT License
319
- Classifier: Environment :: GPU :: NVIDIA CUDA :: 11.3
320
- Classifier: Environment :: GPU :: NVIDIA CUDA :: 11.0
321
- Classifier: Environment :: GPU :: NVIDIA CUDA
322
- Classifier: Environment :: GPU :: NVIDIA CUDA :: 11.2
323
- Classifier: Environment :: GPU :: NVIDIA CUDA :: 10.1
324
- Classifier: Programming Language :: Python :: 3 :: Only
325
- Description-Content-Type: text/markdown
1
+ Metadata-Version: 2.1
2
+ Name: websiteclassificationapi
3
+ Version: 2.11
4
+ Summary: Website classification API
5
+ Home-page: https://github.com/websitecategorization/websiteclassificationnapi
6
+ Author-email: info@websitecategorizationapi.com
7
+ License: UNKNOWN
8
+ Description: # Website Classification API
9
+
10
+ This is the Python3 client library for [URL Classification](https://www.websitecategorizationapi.com) service.
11
+
12
+ Our Website Categorization API provides accurate URL/webpage classification based on the widely trusted IAB Taxonomy.
13
+
14
+ Categorizations are done in real-time and using full-path URLs.You can also use our API to classify plain text.
15
+
16
+ We return categorizations of URLS for the following taxonomies:
17
+
18
+ IAB, version 3, with 4 Tiers (taxonomy from Internet Advertising Bureau - IAB,
19
+ IAB, version 2, with 4 Tiers (taxonomy from Internet Advertising Bureau - IAB
20
+ IPTC NewsCodes, especially suitable for News Categorization
21
+ Web Content Filtering Taxonomy (44 categories)
22
+ Google Shopping Taxonomy, used by millions of online retailers (5474 categories)
23
+ Shopify Taxonomy, used by millions of online stores (10560 categories)
24
+ Amazon Taxonomy (39004 categories)
25
+
26
+
27
+ We also return the following classifications / enriched data about each URL:
28
+
29
+ Detection of Malware/Social Engineering/Mailicious Software
30
+ Web Technologies used
31
+ Likely Buyer Personas
32
+ Topics
33
+ Key Entities Named
34
+ Sentiment Analysis
35
+ Similar companies / Competitors
36
+ Similar domains
37
+ Tags
38
+ Keywords
39
+
40
+
41
+ For those looking for eCommerce classification, we also provide [Smart Product Categorization AI](https://www.productcategorization.com). It supports Shopify, Google Shopping, eBay and 120 other marketplaces.
42
+
43
+ Website classification API s a python library that allows to classify websites based on IAB.
44
+
45
+ ## Installation
46
+ ```
47
+ pip install websiteclassificationapi
48
+ ```
49
+ ## Requirements
50
+
51
+ Only Python 3 is supported. You need an API key which you can obtain at our website.
52
+ Python library requires only requests package.
53
+
54
+ ## Documentation
55
+
56
+ More detailed API documentation on URL Classification is [available here](https://www.websitecategorizationapi.com/api.php).
57
+
58
+ ## Examples
59
+
60
+ Please check our API documentation page on our website for most up to date examples.
61
+ ```
62
+ from websiteclassificationapi import websiteclassificationapi
63
+
64
+ api_key = 'h2XurA' # you can get API key from www.websitecategorizationapi.com
65
+ url = 'www.alpha-quantum.com' # can be set to any valid URL
66
+ classifier_type = 'iab1' # should be set to either iab1 (Tier 1 categorization) or iab2 (Tier 2 categorization) for general websites or ecommerce1, ecommerce2 and ecommerce3 for E-commerce or product websites
67
+
68
+ # calling the API
69
+ print(websiteclassificationapi.get_categorization(url,api_key,classifier_type))
70
+ ```
71
+
72
+ ## How to select classifiers of different taxonomies
73
+
74
+ NEW (update October 2024): Our newest version of API supports classifications for up to 4 Tiers. It returns one or more of 700 IAB categories.
75
+
76
+ Classifier_type should be set to either iab1 (Tier 1 categorization) or iab2 (Tier 2 categorization) for general websites or ecommerce1, ecommerce2 and ecommerce3 for E-commerce or product websites.
77
+
78
+ IAB Tier 1 categorization returns probabilities of text being classified as one of 29 possible categories.
79
+
80
+ IAB Tier 2 categorization returns probabilities of text being classified as one of 447 possible categories.
81
+
82
+ Ecommerce Tier 1 categorization returns probabilities of text being classified as one of 21 possible categories.
83
+
84
+ Ecommerce Tier 2 website categorization returns probabilities of text being classified as one of 182 possible categories.
85
+
86
+ Ecommerce Tier 3 website categorization returns probabilities of text being classified as one of 1113 possible categories.
87
+
88
+ ## Taxonomies
89
+
90
+ The list of categories available by classifier is also known as Taxonomy. There are many taxonomies available, some are standard are well known, e.g. IAB taxonomy is well suited for ads and advertising in general, whereas Facebook product categories taxonomy
91
+ is appropriate for ecommerce field.
92
+
93
+ Taxonomy also differ in how many tiers, levels, or depths do they support. E.g. taxonomy may only support 1 set of main categories, or it can further subcategories.
94
+
95
+ The categorization in the form of Tier 1/Tier 2/Tier 3/.... is also known as taxonomy path.
96
+
97
+ The classifiers can be either built in a way that they predict single Tier categories or they can return full taxonomy paths. It really depends on the use case what is most appropriate.
98
+
99
+ You can find more information about IAB taxonomy at this page: https://www.iab.com/guidelines/content-taxonomy/.
100
+
101
+ Taxonomy should be chosen in a way that it suits your use case. E.g. let us say you have an online store and currently you just list your products without any categorizations.
102
+
103
+ Then it may be very valuable if you could provide some kind of menus that categorize products in different verticals.
104
+
105
+ Why? Because your users may more easily find your products, you will have more subpages that can be indexed by search engines and thus provide you with more traffic and visits.
106
+
107
+ Having verticals set up may also mean better filtering and lead to higher conversions and thus lower cost of acquisition. There are a multitude of opportunities in adding categorization to an online store.
108
+
109
+ ## Our other services
110
+
111
+ Companies that manage internet access often start with a [web filtering database](https://webfilteringdatabase.com) to review how domains are categorized. This supports better decisions around policy cr
112
+ eation, reporting, and risk visibility. Many of those teams also evaluate a [web filtering service](https://webfilteringservice.com) to apply those controls in production.
113
+
114
+
115
+ We also provide APIs for other tasks: [Redaction API](https://www.redactionapi.net) and [Content Moderation API](https://www.contentmoderationapi.net) as well as [Anonymization API](https://www.anonymizationapi.com).
116
+
117
+ ## AI explainability
118
+
119
+ One of the unique features of classifiers is that they provide machine learning interpretability or artificial intelligence explainability (XAI) in the form of words that most contribute to resulting classification.
120
+
121
+ Example 1 of explainability:
122
+ ![Image1](https://www.websitecategorizationapi.com/product_categorization.png)
123
+
124
+ Example 2 of explainability:
125
+ ![Image1](https://www.websitecategorizationapi.com/productcategorizationnew1.jpg)
126
+
127
+ Why the need of AI explainability?
128
+
129
+ AI models are increasingly being used in ways that affect humans. E.g. you may apply for a loan at the bank and get rejected, but even though a human may have sent or explained you this, the decision may have actually been made by a machine learning model.
130
+
131
+ Machine learning models making decisions is increasingly part of every day and because often these decisions are made by what could be termed black boxes, there is increasing desire for having ML decisions made in a way that are explainable.
132
+
133
+ There are also many regulations that demand this, e.g. GDPR.
134
+
135
+ ## Support for languages
136
+
137
+ Classification service supports classifications of websites in 150 languages.
138
+
139
+ ## Offline database of categorized domains
140
+
141
+ We offer offline [URL database](https://www.websitecategorizationapi.com/url_database.php) of millions of categorized domains. It can be used web content filtering, AdTech marketing, cybersecurity, brand safety, contextual targeting.
142
+
143
+ For network administrators and IT security teams specifically, our [URL filtering for firewall](https://www.webfilteringdatabase.com) solution provides pre-categorized domains in 59 categories that can be loaded directly into firewalls, DNS resolvers, and secure web gateways for real-time content policy enforcement.
144
+
145
+ It is ideal for those use cases where you require very low latency of requests, which can be achieved with pre-classified websites stored in database.
146
+
147
+ Another great source for [URL categorization database](https://www.urlcategorizationdatabase.com/)
148
+
149
+ ## Handling websites with no texts
150
+
151
+ When encountering websites that have no text and just images, our classifier relies on online optical character recognition API service to extract text (if any available) from images on the website. And then classify it.
152
+
153
+ To deal with potential duplicates we use the reverse IP lookup of domains to find similar domains that are hosted on the same IP.
154
+
155
+ ## Application of website categorization to technologies usage
156
+
157
+ We have collected usage of technologies by millions of websites, by combining this with categorization, one can find interesting results.
158
+
159
+ Here is for example usage of Intercom across industry verticals:
160
+
161
+ ![Image1](https://user-images.githubusercontent.com/58834207/238138013-d59d68b8-d7e3-42d2-bdda-100924669b72.png)
162
+
163
+ Based on 50 millions of usage points we built an AI recommender which can predict which technologies for company using a set of technologies.
164
+
165
+ Here are e.g. recommendations for company using Mouse Flow: <table class="table" style="font-size:20px;line-height:30px"><thead><tr><th>Technology</th><th>AI Recommendation Score </th><th>Website</th></tr></thead><tbody><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-AppNexus">AppNexus</a></td><td>0.15</td><td>http://appnexus.com</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-Microsoft Clarity">Microsoft Clarity</a></td><td>0.14</td><td>https://clarity.microsoft.com</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-Osano">Osano</a></td><td>0.14</td><td>https://www.osano.com/</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-Jetpack">Jetpack</a></td><td>0.14</td><td>https://jetpack.com</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-Raphael">Raphael</a></td><td>0.14</td><td>https://dmitrybaranovskiy.github.io/raphael/</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-Svelte">Svelte</a></td><td>0.13</td><td>https://svelte.dev</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-AWS Certificate Manager">AWS Certificate Manager</a></td><td>0.12</td><td>https://aws.amazon.com/certificate-manager/</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-Extendify">Extendify</a></td><td>0.11</td><td>https://extendify.com</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-Kendo UI">Kendo UI</a></td><td>0.11</td><td>https://www.telerik.com/kendo-ui</td></tr><tr><td><a href="https://www.alpha-quantum.com/technologies/websites-using-Flywheel">Flywheel</a></td><td>0.11</td><td>https://getflywheel.com</td></tr></tbody></table>
166
+
167
+
168
+
169
+ Our digital solutions are trusted by organizations seeking reliable, secure, and innovative tools. With the [image moderation api](https://www.contentmoderationapi.net), companies gain the ability to filter and manage user-generated visuals, all while staying informed with insights from [Harvard](https://www.harvard.edu). Protect privacy in every scenario using our [image anonymization](https://www.anomyizationapi.com) platform, engineered to meet global standards and inspired by research from [MIT](https://web.mit.edu). When sensitive information must be protected, our [pii redaction service](https://www.redactionapi.net) ensures that personal data is removed efficiently, reflecting approaches found at [Stanford](https://www.stanford.edu).
170
+
171
+ Business growth is supported through our [corporate enrichment data](https://www.companydataapi.com), delivering actionable intelligence much like the work at [Cambridge](https://www.cam.ac.uk). For cybersecurity and research teams, our [domain dataset](https://www.urlcategorizationdatabase.com) offers extensive categorization, paralleling the data-centric methods at [Berkeley](https://www.berkeley.edu). Legal departments benefit from our [automated contract review](https://www.aicontractreviewtool.com), streamlining compliance and review processes as promoted by [Oxford](https://www.ox.ac.uk).
172
+
173
+ By aligning our solutions with the excellence demonstrated at [Yale](https://www.yale.edu) and [Cornell](https://www.cornell.edu), we ensure that your organization stays at the cutting edge—combining technology, research, and practical results for every challenge.
174
+
175
+
176
+ ## Example classifications
177
+
178
+ Example classification for website www.github.com:
179
+ ```
180
+ {
181
+ "classification": [
182
+ {
183
+ "category": "Technology & Computing",
184
+ "value": 0.7621352908406164
185
+ },
186
+ {
187
+ "category": "Business and Finance",
188
+ "value": 0.0785701408756428
189
+ },
190
+ {
191
+ "category": "Video Gaming",
192
+ "value": 0.06626958968249749
193
+ },
194
+ {
195
+ "category": "Fine Art",
196
+ "value": 0.017105357862223433
197
+ },
198
+ {
199
+ "category": "Hobbies & Interests",
200
+ "value": 0.016812511656388394
201
+ },
202
+ {
203
+ "category": "Sports",
204
+ "value": 0.011396157737341801
205
+ },
206
+ {
207
+ "category": "Home & Garden",
208
+ "value": 0.009099685741207822
209
+ },
210
+ {
211
+ "category": "Personal Finance",
212
+ "value": 0.0076400890345109055
213
+ },
214
+ {
215
+ "category": "News and Politics",
216
+ "value": 0.006692288300928684
217
+ },
218
+ {
219
+ "category": "Careers",
220
+ "value": 0.0039930258544077606
221
+ },
222
+ {
223
+ "category": "Automotive",
224
+ "value": 0.0029276292555247764
225
+ },
226
+ {
227
+ "category": "Events and Attractions",
228
+ "value": 0.0026449624402393084
229
+ },
230
+ {
231
+ "category": "Shopping",
232
+ "value": 0.0023606962223306537
233
+ },
234
+ {
235
+ "category": "Family and Relationships",
236
+ "value": 0.0023174171750800186
237
+ },
238
+ {
239
+ "category": "Music and Audio",
240
+ "value": 0.0020517145262615513
241
+ },
242
+ {
243
+ "category": "Movies",
244
+ "value": 0.0018936850100483473
245
+ },
246
+ {
247
+ "category": "Travel",
248
+ "value": 0.0009448942095545797
249
+ },
250
+ {
251
+ "category": "Science",
252
+ "value": 0.0008432696857311802
253
+ },
254
+ {
255
+ "category": "Pets",
256
+ "value": 0.0006956402098649299
257
+ },
258
+ {
259
+ "category": "Television",
260
+ "value": 0.0005261918310662409
261
+ },
262
+ {
263
+ "category": "Real Estate",
264
+ "value": 0.0005058920662560916
265
+ },
266
+ {
267
+ "category": "Religion & Spirituality",
268
+ "value": 0.000492253420442475
269
+ },
270
+ {
271
+ "category": "Healthy Living",
272
+ "value": 0.0004690261931844088
273
+ },
274
+ {
275
+ "category": "Medical Health",
276
+ "value": 0.0004467617749304944
277
+ },
278
+ {
279
+ "category": "Education",
280
+ "value": 0.00036333686743226124
281
+ },
282
+ {
283
+ "category": "Food & Drink",
284
+ "value": 0.0003463620639422737
285
+ },
286
+ {
287
+ "category": "Books and Literature",
288
+ "value": 0.00027078317064036986
289
+ },
290
+ {
291
+ "category": "Style & Fashion",
292
+ "value": 0.00011770141998920516
293
+ },
294
+ {
295
+ "category": "Pop Culture",
296
+ "value": 0.00006764487171529734
297
+ }
298
+ ],
299
+ "html": "29101",
300
+ "language": "en",
301
+ "status": 200
302
+ }
303
+ ```
304
+
305
+ ## Useful resources used in development of website categorization
306
+
307
+ - [Tensorflow](https://www.tensorflow.org/)
308
+
309
+ - [Website categorization](https://medium.com/website-categorization/website-categorization-api-ca6c3e0f6c4d)
310
+
311
+ - [Sklearn](https://scikit-learn.org/stable/)
312
+
313
+ - [Smart product categorization](https://medium.com/product-categorization/product-categorization-introduction-d62bb92e8515)
314
+
315
+ - [Introduction to URL Categorization Database](https://www.alpha-quantum.com/blog/url-database/url-database/)
316
+
317
+ - [Top shopify stores](https://www.leadsquantum.com)
318
+
319
+
320
+
321
+
322
+ Keywords: website categorization,classification,categorization
323
+ Platform: UNKNOWN
324
+ Classifier: License :: OSI Approved :: MIT License
325
+ Classifier: Environment :: GPU :: NVIDIA CUDA :: 11.3
326
+ Classifier: Environment :: GPU :: NVIDIA CUDA :: 11.0
327
+ Classifier: Environment :: GPU :: NVIDIA CUDA
328
+ Classifier: Environment :: GPU :: NVIDIA CUDA :: 11.2
329
+ Classifier: Environment :: GPU :: NVIDIA CUDA :: 10.1
330
+ Classifier: Programming Language :: Python :: 3 :: Only
331
+ Description-Content-Type: text/markdown
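
The README in this diff shows an example response for www.github.com with one probability per IAB Tier 1 category. As a minimal sketch of how such a response might be consumed, the snippet below ranks the categories by value and keeps the top few. It assumes the result has already been parsed into a Python dict shaped like that example; the README does not state what type `get_categorization` returns. The helper name `top_categories` and its `n` and `min_value` parameters are hypothetical and not part of the `websiteclassificationapi` package.

```python
from __future__ import annotations

from typing import Any

# Hypothetical input, trimmed from the www.github.com example in the README.
# Assumption: the API result has been parsed into a dict of this shape.
sample_response: dict[str, Any] = {
    "classification": [
        {"category": "Business and Finance", "value": 0.0785701408756428},
        {"category": "Technology & Computing", "value": 0.7621352908406164},
        {"category": "Video Gaming", "value": 0.06626958968249749},
    ],
    "language": "en",
    "status": 200,
}


def top_categories(
    response: dict[str, Any], n: int = 3, min_value: float = 0.0
) -> list[tuple[str, float]]:
    """Return up to n (category, value) pairs, highest value first.

    Hypothetical helper: returns an empty list when the status is not 200.
    """
    if response.get("status") != 200:
        return []
    entries = response.get("classification", [])
    # Sort explicitly instead of relying on the order of the response.
    ranked = sorted(entries, key=lambda entry: entry["value"], reverse=True)
    kept = [(e["category"], e["value"]) for e in ranked if e["value"] >= min_value]
    return kept[:n]


if __name__ == "__main__":
    for category, value in top_categories(sample_response, n=2):
        print(f"{category}: {value:.3f}")
```

Sorting inside the helper avoids depending on the service returning categories in descending order, which the example output suggests but the README does not promise.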