websiteclassificationapi 2.3__tar.gz → 2.4__tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- {websiteclassificationapi-2.3 → websiteclassificationapi-2.4}/PKG-INFO +11 -7
- {websiteclassificationapi-2.3 → websiteclassificationapi-2.4}/README.md +10 -6
- {websiteclassificationapi-2.3 → websiteclassificationapi-2.4}/setup.py +1 -1
- {websiteclassificationapi-2.3 → websiteclassificationapi-2.4}/websiteclassificationapi.egg-info/PKG-INFO +11 -7
- {websiteclassificationapi-2.3 → websiteclassificationapi-2.4}/setup.cfg +0 -0
- {websiteclassificationapi-2.3 → websiteclassificationapi-2.4}/test/__init__.py +0 -0
- {websiteclassificationapi-2.3 → websiteclassificationapi-2.4}/websiteclassificationapi/__init__.py +0 -0
- {websiteclassificationapi-2.3 → websiteclassificationapi-2.4}/websiteclassificationapi.egg-info/SOURCES.txt +0 -0
- {websiteclassificationapi-2.3 → websiteclassificationapi-2.4}/websiteclassificationapi.egg-info/dependency_links.txt +0 -0
- {websiteclassificationapi-2.3 → websiteclassificationapi-2.4}/websiteclassificationapi.egg-info/top_level.txt +0 -0
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
Metadata-Version: 2.1
|
|
2
2
|
Name: websiteclassificationapi
|
|
3
|
-
Version: 2.
|
|
3
|
+
Version: 2.4
|
|
4
4
|
Summary: Website classification API
|
|
5
5
|
Home-page: https://github.com/websitecategorization/websiteclassificationnapi
|
|
6
6
|
Author-email: info@websitecategorizationapi.com
|
|
@@ -9,7 +9,9 @@ Description: # Website Classification API
|
|
|
9
9
|
|
|
10
10
|
Python3 client library for [URL Classification](https://www.websitecategorizationapi.com).
|
|
11
11
|
|
|
12
|
-
|
|
12
|
+
For those looking for eCommerce classification, we also provide [Product Categorization API](https://www.productcategorization.com). It supports Shopify, Google Shopping, eBay and 120 other marketplaces.
|
|
13
|
+
|
|
14
|
+
Website classification API s a python library that allows to classify websites based on IAB.
|
|
13
15
|
|
|
14
16
|
## Installation
|
|
15
17
|
```
|
|
@@ -39,6 +41,8 @@ Description: # Website Classification API
|
|
|
39
41
|
|
|
40
42
|
## How to select classifiers of different taxonomies
|
|
41
43
|
|
|
44
|
+
NEW (update October 2024): Our newest version of API supports classifications for up to 4 Tiers. It returns one or more of 700 IAB categories.
|
|
45
|
+
|
|
42
46
|
Classifier_type should be set to either iab1 (Tier 1 categorization) or iab2 (Tier 2 categorization) for general websites or ecommerce1, ecommerce2 and ecommerce3 for E-commerce or product websites.
|
|
43
47
|
|
|
44
48
|
IAB Tier 1 categorization returns probabilities of text being classified as one of 29 possible categories.
|
|
@@ -92,7 +96,7 @@ Description: # Website Classification API
|
|
|
92
96
|
|
|
93
97
|
## Support for languages
|
|
94
98
|
|
|
95
|
-
Classification service supports classifications of websites in
|
|
99
|
+
Classification service supports classifications of websites in 150 languages.
|
|
96
100
|
|
|
97
101
|
## Offline database of categorized domains
|
|
98
102
|
|
|
@@ -102,9 +106,9 @@ Description: # Website Classification API
|
|
|
102
106
|
|
|
103
107
|
## Handling websites with no texts
|
|
104
108
|
|
|
105
|
-
When encountering websites that have no text and just images, our classifier relies on
|
|
109
|
+
When encountering websites that have no text and just images, our classifier relies on online optical character recognition API service to extract text (if any available) from images on the website. And then classify it.
|
|
106
110
|
|
|
107
|
-
To deal with potential duplicates we use the
|
|
111
|
+
To deal with potential duplicates we use the reverse IP lookup of domains to find similar domains that are hosted on the same IP.
|
|
108
112
|
|
|
109
113
|
## Application of website categorization to technologies usage
|
|
110
114
|
|
|
@@ -259,11 +263,11 @@ Description: # Website Classification API
|
|
|
259
263
|
|
|
260
264
|
- [Sklearn](https://scikit-learn.org/stable/)
|
|
261
265
|
|
|
262
|
-
- [
|
|
266
|
+
- [Smart product categorization](https://medium.com/product-categorization/product-categorization-introduction-d62bb92e8515)
|
|
263
267
|
|
|
264
268
|
- [Introduction to URL Categorization Database](https://www.alpha-quantum.com/blog/url-database/url-database/)
|
|
265
269
|
|
|
266
|
-
- [Top shopify stores](https://www.
|
|
270
|
+
- [Top shopify stores](https://www.leadsquantum.com)
|
|
267
271
|
|
|
268
272
|
|
|
269
273
|
|
|
@@ -2,7 +2,9 @@
|
|
|
2
2
|
|
|
3
3
|
Python3 client library for [URL Classification](https://www.websitecategorizationapi.com).
|
|
4
4
|
|
|
5
|
-
|
|
5
|
+
For those looking for eCommerce classification, we also provide [Product Categorization API](https://www.productcategorization.com). It supports Shopify, Google Shopping, eBay and 120 other marketplaces.
|
|
6
|
+
|
|
7
|
+
Website classification API s a python library that allows to classify websites based on IAB.
|
|
6
8
|
|
|
7
9
|
## Installation
|
|
8
10
|
```
|
|
@@ -32,6 +34,8 @@ print(websiteclassificationapi.get_categorization(url,api_key,classifier_type))
|
|
|
32
34
|
|
|
33
35
|
## How to select classifiers of different taxonomies
|
|
34
36
|
|
|
37
|
+
NEW (update October 2024): Our newest version of API supports classifications for up to 4 Tiers. It returns one or more of 700 IAB categories.
|
|
38
|
+
|
|
35
39
|
Classifier_type should be set to either iab1 (Tier 1 categorization) or iab2 (Tier 2 categorization) for general websites or ecommerce1, ecommerce2 and ecommerce3 for E-commerce or product websites.
|
|
36
40
|
|
|
37
41
|
IAB Tier 1 categorization returns probabilities of text being classified as one of 29 possible categories.
|
|
@@ -85,7 +89,7 @@ There are also many regulations that demand this, e.g. GDPR.
|
|
|
85
89
|
|
|
86
90
|
## Support for languages
|
|
87
91
|
|
|
88
|
-
Classification service supports classifications of websites in
|
|
92
|
+
Classification service supports classifications of websites in 150 languages.
|
|
89
93
|
|
|
90
94
|
## Offline database of categorized domains
|
|
91
95
|
|
|
@@ -95,9 +99,9 @@ It is ideal for those use cases where you require very low latency of requests,
|
|
|
95
99
|
|
|
96
100
|
## Handling websites with no texts
|
|
97
101
|
|
|
98
|
-
When encountering websites that have no text and just images, our classifier relies on
|
|
102
|
+
When encountering websites that have no text and just images, our classifier relies on online optical character recognition API service to extract text (if any available) from images on the website. And then classify it.
|
|
99
103
|
|
|
100
|
-
To deal with potential duplicates we use the
|
|
104
|
+
To deal with potential duplicates we use the reverse IP lookup of domains to find similar domains that are hosted on the same IP.
|
|
101
105
|
|
|
102
106
|
## Application of website categorization to technologies usage
|
|
103
107
|
|
|
@@ -252,11 +256,11 @@ Example classification for website www.github.com:
|
|
|
252
256
|
|
|
253
257
|
- [Sklearn](https://scikit-learn.org/stable/)
|
|
254
258
|
|
|
255
|
-
- [
|
|
259
|
+
- [Smart product categorization](https://medium.com/product-categorization/product-categorization-introduction-d62bb92e8515)
|
|
256
260
|
|
|
257
261
|
- [Introduction to URL Categorization Database](https://www.alpha-quantum.com/blog/url-database/url-database/)
|
|
258
262
|
|
|
259
|
-
- [Top shopify stores](https://www.
|
|
263
|
+
- [Top shopify stores](https://www.leadsquantum.com)
|
|
260
264
|
|
|
261
265
|
|
|
262
266
|
|
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
Metadata-Version: 2.1
|
|
2
2
|
Name: websiteclassificationapi
|
|
3
|
-
Version: 2.
|
|
3
|
+
Version: 2.4
|
|
4
4
|
Summary: Website classification API
|
|
5
5
|
Home-page: https://github.com/websitecategorization/websiteclassificationnapi
|
|
6
6
|
Author-email: info@websitecategorizationapi.com
|
|
@@ -9,7 +9,9 @@ Description: # Website Classification API
|
|
|
9
9
|
|
|
10
10
|
Python3 client library for [URL Classification](https://www.websitecategorizationapi.com).
|
|
11
11
|
|
|
12
|
-
|
|
12
|
+
For those looking for eCommerce classification, we also provide [Product Categorization API](https://www.productcategorization.com). It supports Shopify, Google Shopping, eBay and 120 other marketplaces.
|
|
13
|
+
|
|
14
|
+
Website classification API s a python library that allows to classify websites based on IAB.
|
|
13
15
|
|
|
14
16
|
## Installation
|
|
15
17
|
```
|
|
@@ -39,6 +41,8 @@ Description: # Website Classification API
|
|
|
39
41
|
|
|
40
42
|
## How to select classifiers of different taxonomies
|
|
41
43
|
|
|
44
|
+
NEW (update October 2024): Our newest version of API supports classifications for up to 4 Tiers. It returns one or more of 700 IAB categories.
|
|
45
|
+
|
|
42
46
|
Classifier_type should be set to either iab1 (Tier 1 categorization) or iab2 (Tier 2 categorization) for general websites or ecommerce1, ecommerce2 and ecommerce3 for E-commerce or product websites.
|
|
43
47
|
|
|
44
48
|
IAB Tier 1 categorization returns probabilities of text being classified as one of 29 possible categories.
|
|
@@ -92,7 +96,7 @@ Description: # Website Classification API
|
|
|
92
96
|
|
|
93
97
|
## Support for languages
|
|
94
98
|
|
|
95
|
-
Classification service supports classifications of websites in
|
|
99
|
+
Classification service supports classifications of websites in 150 languages.
|
|
96
100
|
|
|
97
101
|
## Offline database of categorized domains
|
|
98
102
|
|
|
@@ -102,9 +106,9 @@ Description: # Website Classification API
|
|
|
102
106
|
|
|
103
107
|
## Handling websites with no texts
|
|
104
108
|
|
|
105
|
-
When encountering websites that have no text and just images, our classifier relies on
|
|
109
|
+
When encountering websites that have no text and just images, our classifier relies on online optical character recognition API service to extract text (if any available) from images on the website. And then classify it.
|
|
106
110
|
|
|
107
|
-
To deal with potential duplicates we use the
|
|
111
|
+
To deal with potential duplicates we use the reverse IP lookup of domains to find similar domains that are hosted on the same IP.
|
|
108
112
|
|
|
109
113
|
## Application of website categorization to technologies usage
|
|
110
114
|
|
|
@@ -259,11 +263,11 @@ Description: # Website Classification API
|
|
|
259
263
|
|
|
260
264
|
- [Sklearn](https://scikit-learn.org/stable/)
|
|
261
265
|
|
|
262
|
-
- [
|
|
266
|
+
- [Smart product categorization](https://medium.com/product-categorization/product-categorization-introduction-d62bb92e8515)
|
|
263
267
|
|
|
264
268
|
- [Introduction to URL Categorization Database](https://www.alpha-quantum.com/blog/url-database/url-database/)
|
|
265
269
|
|
|
266
|
-
- [Top shopify stores](https://www.
|
|
270
|
+
- [Top shopify stores](https://www.leadsquantum.com)
|
|
267
271
|
|
|
268
272
|
|
|
269
273
|
|
|
File without changes
|
|
File without changes
|
{websiteclassificationapi-2.3 → websiteclassificationapi-2.4}/websiteclassificationapi/__init__.py
RENAMED
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|