PyPI - SqueakyCleanText - Versions diffs - 0.2.0__tar.gz → 0.2.1__tar.gz - Mend

SqueakyCleanText 0.2.0tar.gz → 0.2.1tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (29) hide show

{SqueakyCleanText-0.2.0/SqueakyCleanText.egg-info → SqueakyCleanText-0.2.1}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.1
 Name: SqueakyCleanText
-Version: 0.2.0
+Version: 0.2.1
 Summary: A comprehensive text cleaning and preprocessing pipeline.
 Home-page: https://github.com/rhnfzl/SqueakyCleanText
 Author: Rehan Fazal
@@ -53,8 +53,6 @@ SqueakyCleanText offers functionality to streamline this process, ensuring that
 Depending on sigle model for Name Entity recognition is not be ideal, as there is a high chance it might skip the entity all together. Also combining the language specific NER model makes it more specific for text and reduces the chance of missing out the entity.
 The package NER model has the chunking mechanism which helps to do the NER process even if the text is longer than the model token size.
-Important : Model
 By automating these text cleaning steps, SqueakyCleanText ensures your data is prepared efficiently and effectively, saving time and improving model performance.
 ## Installation

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/README.md RENAMED Viewed

@@ -28,8 +28,6 @@ SqueakyCleanText offers functionality to streamline this process, ensuring that
 Depending on sigle model for Name Entity recognition is not be ideal, as there is a high chance it might skip the entity all together. Also combining the language specific NER model makes it more specific for text and reduces the chance of missing out the entity.
 The package NER model has the chunking mechanism which helps to do the NER process even if the text is longer than the model token size.
-Important : Model
 By automating these text cleaning steps, SqueakyCleanText ensures your data is prepared efficiently and effectively, saving time and improving model performance.
 ## Installation

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1/SqueakyCleanText.egg-info}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.1
 Name: SqueakyCleanText
-Version: 0.2.0
+Version: 0.2.1
 Summary: A comprehensive text cleaning and preprocessing pipeline.
 Home-page: https://github.com/rhnfzl/SqueakyCleanText
 Author: Rehan Fazal
@@ -53,8 +53,6 @@ SqueakyCleanText offers functionality to streamline this process, ensuring that
 Depending on sigle model for Name Entity recognition is not be ideal, as there is a high chance it might skip the entity all together. Also combining the language specific NER model makes it more specific for text and reduces the chance of missing out the entity.
 The package NER model has the chunking mechanism which helps to do the NER process even if the text is longer than the model token size.
-Important : Model
 By automating these text cleaning steps, SqueakyCleanText ensures your data is prepared efficiently and effectively, saving time and improving model performance.
 ## Installation

SqueakyCleanText-0.2.1/SqueakyCleanText.egg-info/requires.txt ADDED Viewed

@@ -0,0 +1,19 @@
+lingua-language-detector>=2.0.0
+nltk>=3.8
+emoji>=2.8
+ftfy>=6.1
+Unidecode>=1.3
+beautifulsoup4>=4.12
+transformers>=4.30
+torch>=2.0.0
+presidio_anonymizer>=2.2.355
+[dev]
+hypothesis==6.82.7
+faker==20.1.0
+flake8==6.1.0
+pytest==7.5.0
+[test]
+coverage==7.3.1
+pytest-cov==4.1.0

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/sct/config.py RENAMED Viewed

@@ -52,5 +52,4 @@ LANGUAGE = None
 NER_MODELS_LIST = ["FacebookAI/xlm-roberta-large-finetuned-conll03-english",
               "FacebookAI/xlm-roberta-large-finetuned-conll02-dutch",
               "FacebookAI/xlm-roberta-large-finetuned-conll03-german",
-              "FacebookAI/xlm-roberta-large-finetuned-conll03-spanish",
-              "Babelscape/wikineural-multilingual-ner"]
+              "FacebookAI/xlm-roberta-large-finetuned-conll02-spanish",

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/sct/utils/ner.py RENAMED Viewed

@@ -36,7 +36,7 @@ class GeneralNER:
             model_name = ["FacebookAI/xlm-roberta-large-finetuned-conll03-english",
                           "FacebookAI/xlm-roberta-large-finetuned-conll02-dutch",
                         "FacebookAI/xlm-roberta-large-finetuned-conll03-german",
-                        "FacebookAI/xlm-roberta-large-finetuned-conll03-spanish",
+                        "FacebookAI/xlm-roberta-large-finetuned-conll02-spanish",
                         "Babelscape/wikineural-multilingual-ner"]
             english_model_name = model_name[0]

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/setup.py RENAMED Viewed

@@ -2,7 +2,7 @@ from setuptools import setup, find_packages
 setup(
     name='SqueakyCleanText',
-    version='0.2.0',
+    version='0.2.1',
     author='Rehan Fazal',
     description='A comprehensive text cleaning and preprocessing pipeline.',
     long_description=open('README.md', encoding='utf-8').read(),
@@ -11,15 +11,15 @@ setup(
     license='MIT',
     packages=find_packages(),
     install_requires=[
-        'lingua-language-detector>=2.0.0,<2.1',
-        'nltk>=3.8,<3.9',
-        'emoji>=2.8,<2.9',
-        'ftfy>=6.1,<6.2',
-        'Unidecode>=1.3,<1.4',
-        'beautifulsoup4>=4.12,<4.13',
-        'transformers>=4.30,<4.31',
-        'torch>=2.0,<2.1',
-        'presidio_anonymizer>=2.2.355,<2.3',
+        'lingua-language-detector>=2.0.0',
+        'nltk>=3.8',
+        'emoji>=2.8',
+        'ftfy>=6.1',
+        'Unidecode>=1.3',
+        'beautifulsoup4>=4.12',
+        'transformers>=4.30',
+        'torch>=2.0.0',
+        'presidio_anonymizer>=2.2.355',
     ],
     extras_require={
         'dev': [

SqueakyCleanText-0.2.0/SqueakyCleanText.egg-info/requires.txt DELETED Viewed

@@ -1,19 +0,0 @@
-lingua-language-detector<2.1,>=2.0.0
-nltk<3.9,>=3.8
-emoji<2.9,>=2.8
-ftfy<6.2,>=6.1
-Unidecode<1.4,>=1.3
-beautifulsoup4<4.13,>=4.12
-transformers<4.31,>=4.30
-torch<2.1,>=2.0
-presidio_anonymizer<2.3,>=2.2.355
-[dev]
-hypothesis==6.82.7
-faker==20.1.0
-flake8==6.1.0
-pytest==7.5.0
-[test]
-coverage==7.3.1
-pytest-cov==4.1.0

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/LICENSE RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/MANIFEST.in RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/SqueakyCleanText.egg-info/SOURCES.txt RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/SqueakyCleanText.egg-info/dependency_links.txt RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/SqueakyCleanText.egg-info/entry_points.txt RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/SqueakyCleanText.egg-info/top_level.txt RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/sct/__init__.py RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/sct/scripts/__init__.py RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/sct/scripts/download_nltk_stopwords.py RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/sct/sct.py RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/sct/utils/__init__.py RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/sct/utils/constants.py RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/sct/utils/contact.py RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/sct/utils/datetime.py RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/sct/utils/normtext.py RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/sct/utils/resources.py RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/sct/utils/special.py RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/sct/utils/stopwords.py RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/setup.cfg RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/tests/__init__.py RENAMED Viewed

File without changes

{SqueakyCleanText-0.2.0 → SqueakyCleanText-0.2.1}/tests/test_sct.py RENAMED Viewed

File without changes

SqueakyCleanText 0.2.0__tar.gz → 0.2.1__tar.gz

SqueakyCleanText 0.2.0tar.gz → 0.2.1tar.gz