SinaTools-0.1.4-py2.py3-none-any.whl → SinaTools-0.1.8-py2.py3-none-any.whl
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- {SinaTools-0.1.4.dist-info → SinaTools-0.1.8.dist-info}/METADATA +10 -10
- SinaTools-0.1.8.dist-info/RECORD +101 -0
- SinaTools-0.1.8.dist-info/entry_points.txt +18 -0
- SinaTools-0.1.8.dist-info/top_level.txt +1 -0
- {nlptools → sinatools}/CLI/DataDownload/download_files.py +9 -9
- {nlptools → sinatools}/CLI/morphology/ALMA_multi_word.py +10 -20
- sinatools/CLI/morphology/morph_analyzer.py +80 -0
- nlptools/CLI/arabiner/bin/infer2.py → sinatools/CLI/ner/corpus_entity_extractor.py +5 -9
- nlptools/CLI/arabiner/bin/infer.py → sinatools/CLI/ner/entity_extractor.py +4 -8
- {nlptools → sinatools}/CLI/salma/salma_tools.py +8 -8
- {nlptools → sinatools}/CLI/utils/arStrip.py +10 -21
- sinatools/CLI/utils/corpus_tokenizer.py +50 -0
- {nlptools → sinatools}/CLI/utils/implication.py +9 -9
- {nlptools → sinatools}/CLI/utils/jaccard.py +10 -10
- sinatools/CLI/utils/remove_latin.py +34 -0
- sinatools/CLI/utils/remove_punctuation.py +42 -0
- {nlptools → sinatools}/CLI/utils/sentence_tokenizer.py +9 -22
- {nlptools → sinatools}/CLI/utils/text_transliteration.py +10 -17
- {nlptools → sinatools}/DataDownload/downloader.py +9 -9
- sinatools/VERSION +1 -0
- {nlptools → sinatools}/__init__.py +1 -1
- {nlptools → sinatools}/morphology/ALMA_multi_word.py +4 -5
- {nlptools → sinatools}/morphology/__init__.py +4 -14
- sinatools/morphology/morph_analyzer.py +172 -0
- sinatools/ner/__init__.py +12 -0
- nlptools/arabiner/bin/infer.py → sinatools/ner/entity_extractor.py +9 -8
- {nlptools → sinatools}/salma/__init__.py +2 -2
- {nlptools → sinatools}/salma/settings.py +1 -1
- {nlptools → sinatools}/salma/views.py +9 -9
- {nlptools → sinatools}/salma/wsd.py +2 -2
- {nlptools/morphology → sinatools/utils}/charsets.py +1 -3
- {nlptools → sinatools}/utils/implication.py +10 -10
- {nlptools → sinatools}/utils/jaccard.py +2 -2
- {nlptools → sinatools}/utils/parser.py +18 -21
- {nlptools → sinatools}/utils/text_transliteration.py +1 -1
- nlptools/utils/corpus_tokenizer.py → sinatools/utils/tokenizer.py +58 -5
- {nlptools/morphology → sinatools/utils}/tokenizers_words.py +3 -6
- SinaTools-0.1.4.dist-info/RECORD +0 -122
- SinaTools-0.1.4.dist-info/entry_points.txt +0 -18
- SinaTools-0.1.4.dist-info/top_level.txt +0 -1
- nlptools/CLI/morphology/morph_analyzer.py +0 -91
- nlptools/CLI/utils/corpus_tokenizer.py +0 -74
- nlptools/CLI/utils/latin_remove.py +0 -51
- nlptools/CLI/utils/remove_Punc.py +0 -53
- nlptools/VERSION +0 -1
- nlptools/arabiner/bin/__init__.py +0 -14
- nlptools/arabiner/bin/eval.py +0 -87
- nlptools/arabiner/bin/process.py +0 -140
- nlptools/arabiner/bin/train.py +0 -221
- nlptools/arabiner/data/__init__.py +0 -1
- nlptools/arabiner/data/datasets.py +0 -146
- nlptools/arabiner/data/transforms.py +0 -118
- nlptools/arabiner/nn/BaseModel.py +0 -22
- nlptools/arabiner/nn/BertNestedTagger.py +0 -34
- nlptools/arabiner/nn/BertSeqTagger.py +0 -17
- nlptools/arabiner/nn/__init__.py +0 -3
- nlptools/arabiner/trainers/BaseTrainer.py +0 -117
- nlptools/arabiner/trainers/BertNestedTrainer.py +0 -203
- nlptools/arabiner/trainers/BertTrainer.py +0 -163
- nlptools/arabiner/trainers/__init__.py +0 -3
- nlptools/arabiner/utils/__init__.py +0 -0
- nlptools/arabiner/utils/data.py +0 -124
- nlptools/arabiner/utils/helpers.py +0 -151
- nlptools/arabiner/utils/metrics.py +0 -69
- nlptools/morphology/morph_analyzer.py +0 -171
- nlptools/morphology/settings.py +0 -8
- nlptools/utils/__init__.py +0 -0
- nlptools/utils/sentence_tokenizer.py +0 -53
- {SinaTools-0.1.4.data/data/nlptools → SinaTools-0.1.8.data/data/sinatools}/environment.yml +0 -0
- {SinaTools-0.1.4.dist-info → SinaTools-0.1.8.dist-info}/AUTHORS.rst +0 -0
- {SinaTools-0.1.4.dist-info → SinaTools-0.1.8.dist-info}/LICENSE +0 -0
- {SinaTools-0.1.4.dist-info → SinaTools-0.1.8.dist-info}/WHEEL +0 -0
- {nlptools → sinatools}/CLI/utils/__init__.py +0 -0
- {nlptools → sinatools}/DataDownload/__init__.py +0 -0
- {nlptools → sinatools}/arabert/__init__.py +0 -0
- {nlptools → sinatools}/arabert/arabert/__init__.py +0 -0
- {nlptools → sinatools}/arabert/arabert/create_classification_data.py +0 -0
- {nlptools → sinatools}/arabert/arabert/create_pretraining_data.py +0 -0
- {nlptools → sinatools}/arabert/arabert/extract_features.py +0 -0
- {nlptools → sinatools}/arabert/arabert/lamb_optimizer.py +0 -0
- {nlptools → sinatools}/arabert/arabert/modeling.py +0 -0
- {nlptools → sinatools}/arabert/arabert/optimization.py +0 -0
- {nlptools → sinatools}/arabert/arabert/run_classifier.py +0 -0
- {nlptools → sinatools}/arabert/arabert/run_pretraining.py +0 -0
- {nlptools → sinatools}/arabert/arabert/run_squad.py +0 -0
- {nlptools → sinatools}/arabert/arabert/tokenization.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/__init__.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/build_openwebtext_pretraining_dataset.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/build_pretraining_dataset.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/build_pretraining_dataset_single_file.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/configure_finetuning.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/configure_pretraining.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/finetune/__init__.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/finetune/feature_spec.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/finetune/preprocessing.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/finetune/scorer.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/finetune/task.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/finetune/task_builder.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/flops_computation.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/model/__init__.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/model/modeling.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/model/optimization.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/model/tokenization.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/pretrain/__init__.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/pretrain/pretrain_data.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/pretrain/pretrain_helpers.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/run_finetuning.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/run_pretraining.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/util/__init__.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/util/training_utils.py +0 -0
- {nlptools → sinatools}/arabert/araelectra/util/utils.py +0 -0
- {nlptools → sinatools}/arabert/aragpt2/__init__.py +0 -0
- {nlptools → sinatools}/arabert/aragpt2/create_pretraining_data.py +0 -0
- {nlptools → sinatools}/arabert/aragpt2/gpt2/__init__.py +0 -0
- {nlptools → sinatools}/arabert/aragpt2/gpt2/lamb_optimizer.py +0 -0
- {nlptools → sinatools}/arabert/aragpt2/gpt2/optimization.py +0 -0
- {nlptools → sinatools}/arabert/aragpt2/gpt2/run_pretraining.py +0 -0
- {nlptools → sinatools}/arabert/aragpt2/grover/__init__.py +0 -0
- {nlptools → sinatools}/arabert/aragpt2/grover/dataloader.py +0 -0
- {nlptools → sinatools}/arabert/aragpt2/grover/modeling.py +0 -0
- {nlptools → sinatools}/arabert/aragpt2/grover/modeling_gpt2.py +0 -0
- {nlptools → sinatools}/arabert/aragpt2/grover/optimization_adafactor.py +0 -0
- {nlptools → sinatools}/arabert/aragpt2/grover/train_tpu.py +0 -0
- {nlptools → sinatools}/arabert/aragpt2/grover/utils.py +0 -0
- {nlptools → sinatools}/arabert/aragpt2/train_bpe_tokenizer.py +0 -0
- {nlptools → sinatools}/arabert/preprocess.py +0 -0
- {nlptools → sinatools}/environment.yml +0 -0
- {nlptools → sinatools}/install_env.py +0 -0
- /nlptools/nlptools.py → /sinatools/sinatools.py +0 -0
- {nlptools/arabiner → sinatools/utils}/__init__.py +0 -0
- {nlptools → sinatools}/utils/readfile.py +0 -0
- {nlptools → sinatools}/utils/utils.py +0 -0
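As the rename map above shows, release 0.1.8 ships the same modules under the sinatools package instead of nlptools, with a few utilities consolidated (for example, nlptools/utils/corpus_tokenizer.py now lives in sinatools/utils/tokenizer.py). A minimal migration sketch follows; the old import path is taken from the 0.1.4 docstrings, while the new path is only inferred from the renames listed above and should be checked against the 0.1.8 documentation, since function-level APIs are not visible in this listing.

    # SinaTools 0.1.4: utilities were importable from the nlptools package.
    from nlptools.utils import sentence_tokenizer
    sentences = sentence_tokenizer.sent_tokenize("مختبر سينا لحوسبة اللغة والذكاء الإصطناعي. في جامعة بيرزيت.")

    # SinaTools 0.1.8: the package is named sinatools; tokenization helpers
    # appear to be consolidated in sinatools/utils/tokenizer.py (assumption
    # based on the rename above -- verify the exported function names there).
    from sinatools.utils import tokenizer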
nlptools/utils/__init__.py
DELETED
File without changes
nlptools/utils/sentence_tokenizer.py
DELETED
@@ -1,53 +0,0 @@
-def remove_empty_values(sentences):
-    return [value for value in sentences if value != '']
-
-
-def sent_tokenize(text, dot=True, new_line=True, question_mark=True, exclamation_mark=True):
-    """
-    This method tokenizes a text into a set of sentences based on the selected separators, including the dot, new line, question mark, and exclamation mark.
-
-    Args:
-        text (:obj:`str`): Arabic text to be tokenized.
-        dot (:obj:`str`): flag to split text based on Dot (default is True).
-        new_line (:obj:`str`): flag to split text based on new_line (default is True).
-        question_mark (:obj:`str`): flag to split text based on question_mark (default is True).
-        exclamation_mark (:obj:`str`): flag to split text based on exclamation_mark (default is True).
-
-    Returns:
-        :obj:`list`: list of sentences.
-
-    **Example:**
-
-    .. highlight:: python
-    .. code-block:: python
-
-        from nlptools.utils import sentence_tokenizer
-        sentences = sentence_tokenizer.sent_tokenize("مختبر سينا لحوسبة اللغة والذكاء الإصطناعي. في جامعة بيرزيت.", dot=True, new_line=True, question_mark=True, exclamation_mark=True)
-        print(sentences)
-
-        #output
-        ['مختبر سينا لحوسبة اللغة والذكاء الإصطناعي.', 'في جامعة بيرزيت.']
-    """
-    separators = []
-    split_text = [text]
-    if new_line==True:
-        separators.append('\n')
-    if dot==True:
-        separators.append('.')
-    if question_mark==True:
-        separators.append('?')
-        separators.append('؟')
-    if exclamation_mark==True:
-        separators.append('!')
-
-    for sep in separators:
-        new_split_text = []
-        for part in split_text:
-            tokens = part.split(sep)
-            tokens_with_separator = [token + sep for token in tokens[:-1]]
-            tokens_with_separator.append(tokens[-1].strip())
-            new_split_text.extend(tokens_with_separator)
-        split_text = new_split_text
-
-    split_text = remove_empty_values(split_text)
-    return split_text
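The removed helper splits on each selected separator while keeping that separator attached to the end of the preceding sentence. For readers who only need that behaviour, here is a compact regex-based sketch; it is illustrative only, not part of either package, and the function name and signature are invented for this example:

    import re

    def split_keep_separators(text, seps=(".", "\n", "?", "؟", "!")):
        # Split on any separator, keeping it attached to the preceding chunk,
        # then strip whitespace and drop empty chunks -- roughly the behaviour
        # of the deleted sent_tokenize (which strips only the final token).
        pattern = "([" + re.escape("".join(seps)) + "])"
        parts = re.split(pattern, text)
        chunks = parts[0::2]           # text pieces
        tails = parts[1::2] + [""]     # separator following each piece
        sentences = [(c + s).strip() for c, s in zip(chunks, tails)]
        return [s for s in sentences if s]

    split_keep_separators("مختبر سينا لحوسبة اللغة والذكاء الإصطناعي. في جامعة بيرزيت.")
    # ['مختبر سينا لحوسبة اللغة والذكاء الإصطناعي.', 'في جامعة بيرزيت.']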