websiteclassificationapi 2.3__py3-none-any.whl → 2.4__py3-none-any.whl
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- {websiteclassificationapi-2.3.dist-info → websiteclassificationapi-2.4.dist-info}/METADATA +11 -7
- websiteclassificationapi-2.4.dist-info/RECORD +7 -0
- websiteclassificationapi-2.3.dist-info/RECORD +0 -7
- {websiteclassificationapi-2.3.dist-info → websiteclassificationapi-2.4.dist-info}/LICENSE +0 -0
- {websiteclassificationapi-2.3.dist-info → websiteclassificationapi-2.4.dist-info}/WHEEL +0 -0
- {websiteclassificationapi-2.3.dist-info → websiteclassificationapi-2.4.dist-info}/top_level.txt +0 -0
@@ -1,6 +1,6 @@
 Metadata-Version: 2.1
 Name: websiteclassificationapi
-Version: 2.
+Version: 2.4
 Summary: Website classification API
 Home-page: https://github.com/websitecategorization/websiteclassificationnapi
 Author-email: info@websitecategorizationapi.com
@@ -20,7 +20,9 @@ Description-Content-Type: text/markdown
 
 Python3 client library for [URL Classification](https://www.websitecategorizationapi.com).
 
-
+For those looking for eCommerce classification, we also provide [Product Categorization API](https://www.productcategorization.com). It supports Shopify, Google Shopping, eBay and 120 other marketplaces.
+
+Website classification API s a python library that allows to classify websites based on IAB.
 
 ## Installation
 ```
@@ -50,6 +52,8 @@ print(websiteclassificationapi.get_categorization(url,api_key,classifier_type))
 
 ## How to select classifiers of different taxonomies
 
+NEW (update October 2024): Our newest version of API supports classifications for up to 4 Tiers. It returns one or more of 700 IAB categories.
+
 Classifier_type should be set to either iab1 (Tier 1 categorization) or iab2 (Tier 2 categorization) for general websites or ecommerce1, ecommerce2 and ecommerce3 for E-commerce or product websites.
 
 IAB Tier 1 categorization returns probabilities of text being classified as one of 29 possible categories.
@@ -103,7 +107,7 @@ There are also many regulations that demand this, e.g. GDPR.
 
 ## Support for languages
 
-Classification service supports classifications of websites in
+Classification service supports classifications of websites in 150 languages.
 
 ## Offline database of categorized domains
 
@@ -113,9 +117,9 @@ It is ideal for those use cases where you require very low latency of requests,
 
 ## Handling websites with no texts
 
-When encountering websites that have no text and just images, our classifier relies on
+When encountering websites that have no text and just images, our classifier relies on online optical character recognition API service to extract text (if any available) from images on the website. And then classify it.
 
-To deal with potential duplicates we use the
+To deal with potential duplicates we use the reverse IP lookup of domains to find similar domains that are hosted on the same IP.
 
 ## Application of website categorization to technologies usage
 
@@ -270,11 +274,11 @@ Example classification for website www.github.com:
 
 - [Sklearn](https://scikit-learn.org/stable/)
 
-- [
+- [Smart product categorization](https://medium.com/product-categorization/product-categorization-introduction-d62bb92e8515)
 
 - [Introduction to URL Categorization Database](https://www.alpha-quantum.com/blog/url-database/url-database/)
 
-- [Top shopify stores](https://www.
+- [Top shopify stores](https://www.leadsquantum.com)
 
 
 
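The README text in the METADATA diff above names five `classifier_type` values (`iab1`, `iab2`, `ecommerce1`, `ecommerce2`, `ecommerce3`). As a minimal sketch, a caller might check the value before passing it to `get_categorization`. The helper `check_classifier_type` is hypothetical and not part of the package. Whether the October 2024 update adds further values for Tiers 3 and 4 is not stated in the diff, so none are assumed here.

```python
# Classifier names copied from the README text in the METADATA diff.
GENERAL_CLASSIFIERS = ("iab1", "iab2")
ECOMMERCE_CLASSIFIERS = ("ecommerce1", "ecommerce2", "ecommerce3")


def check_classifier_type(classifier_type: str) -> str:
    """Hypothetical helper: return the name unchanged if the README lists it."""
    allowed = GENERAL_CLASSIFIERS + ECOMMERCE_CLASSIFIERS
    if classifier_type not in allowed:
        raise ValueError(
            f"unknown classifier_type {classifier_type!r}; expected one of {allowed}"
        )
    return classifier_type


if __name__ == "__main__":
    # The real call, as it appears in the README, needs the installed package
    # and a valid API key, so it is left commented out:
    # import websiteclassificationapi
    # print(websiteclassificationapi.get_categorization(url, api_key, "iab1"))
    print(check_classifier_type("iab1"))
```

Checking locally keeps a typo in the classifier name from costing an API request.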
@@ -0,0 +1,7 @@
+test/__init__.py,sha256=wTdBA0QbwpUvhKcdWegp481T0tvA6XcAb9BkdSDkzvE,2869
+websiteclassificationapi/__init__.py,sha256=wTdBA0QbwpUvhKcdWegp481T0tvA6XcAb9BkdSDkzvE,2869
+websiteclassificationapi-2.4.dist-info/LICENSE,sha256=7Fog1crdAQKBoXwk300JwPHIMJV-386OAuvTrd-N8hs,1083
+websiteclassificationapi-2.4.dist-info/METADATA,sha256=32HKsecFTmfadp1rAmNqV5H5jfNYI2dsgp9-j8uP39I,12146
+websiteclassificationapi-2.4.dist-info/WHEEL,sha256=D1Wh14kWDxPnrM-5t_6UCB-UuQNrEODtRa3vF4OsvQY,97
+websiteclassificationapi-2.4.dist-info/top_level.txt,sha256=e76DP_x4tfbB3dn42joSGBVORT6U6GlMaReN7W-X8nY,30
+websiteclassificationapi-2.4.dist-info/RECORD,,
@@ -1,7 +0,0 @@
-test/__init__.py,sha256=wTdBA0QbwpUvhKcdWegp481T0tvA6XcAb9BkdSDkzvE,2869
-websiteclassificationapi/__init__.py,sha256=wTdBA0QbwpUvhKcdWegp481T0tvA6XcAb9BkdSDkzvE,2869
-websiteclassificationapi-2.3.dist-info/LICENSE,sha256=7Fog1crdAQKBoXwk300JwPHIMJV-386OAuvTrd-N8hs,1083
-websiteclassificationapi-2.3.dist-info/METADATA,sha256=0PYGTiEiP-c7ADRCkxVZmXzxquzsR8x92D9D19fIEn8,11888
-websiteclassificationapi-2.3.dist-info/WHEEL,sha256=D1Wh14kWDxPnrM-5t_6UCB-UuQNrEODtRa3vF4OsvQY,97
-websiteclassificationapi-2.3.dist-info/top_level.txt,sha256=e76DP_x4tfbB3dn42joSGBVORT6U6GlMaReN7W-X8nY,30
-websiteclassificationapi-2.3.dist-info/RECORD,,
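Each RECORD line above is a CSV row of path, hash and size in bytes. In the wheel format the hash field is `sha256=` followed by the URL-safe base64 encoding of the digest with the trailing `=` padding removed, and the RECORD file lists itself with both fields empty. Both `__init__.py` entries carry the same hash and size in 2.3 and 2.4, which suggests the Python code is unchanged between the two versions and only METADATA differs. The sketch below shows how such an entry could be parsed and recomputed; the function names `record_hash` and `parse_record_line` are illustrative, not part of any packaging tool.

```python
import base64
import csv
import hashlib


def record_hash(data: bytes) -> str:
    """Return the RECORD-style hash field for the given file contents."""
    digest = hashlib.sha256(data).digest()
    # URL-safe base64 with the trailing '=' padding stripped.
    encoded = base64.urlsafe_b64encode(digest).rstrip(b"=").decode("ascii")
    return "sha256=" + encoded


def parse_record_line(line: str):
    """Split one RECORD row into (path, hash_field, size or None)."""
    path, hash_field, size = next(csv.reader([line]))
    return path, hash_field, int(size) if size else None


if __name__ == "__main__":
    entry = (
        "websiteclassificationapi/__init__.py,"
        "sha256=wTdBA0QbwpUvhKcdWegp481T0tvA6XcAb9BkdSDkzvE,2869"
    )
    print(parse_record_line(entry))
    # To verify an unpacked wheel, compare record_hash(file_bytes) with the
    # hash field and len(file_bytes) with the size from the same row.
    print(record_hash(b"example bytes"))
```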
File without changes
File without changes
{websiteclassificationapi-2.3.dist-info → websiteclassificationapi-2.4.dist-info}/top_level.txt RENAMED
File without changes