PyPI - hamtaa-texttools - Versions diffs - 1.1.20__py3-none-any.whl → 1.1.22__py3-none-any.whl - Mend

hamtaa-texttools 1.1.20py3-none-any.whl → 1.1.22py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (32) hide show

{hamtaa_texttools-1.1.20.dist-info → hamtaa_texttools-1.1.22.dist-info}/METADATA +49 -109
hamtaa_texttools-1.1.22.dist-info/RECORD +32 -0
texttools/__init__.py +3 -3
texttools/batch/batch_config.py +14 -1
texttools/batch/batch_runner.py +2 -2
texttools/internals/async_operator.py +49 -92
texttools/internals/models.py +74 -105
texttools/internals/operator_utils.py +25 -27
texttools/internals/prompt_loader.py +3 -20
texttools/internals/sync_operator.py +49 -92
texttools/prompts/README.md +2 -2
texttools/prompts/categorize.yaml +35 -77
texttools/prompts/check_fact.yaml +2 -2
texttools/prompts/extract_entities.yaml +2 -2
texttools/prompts/extract_keywords.yaml +6 -6
texttools/prompts/is_question.yaml +2 -2
texttools/prompts/merge_questions.yaml +4 -4
texttools/prompts/propositionize.yaml +2 -2
texttools/prompts/rewrite.yaml +6 -6
texttools/prompts/run_custom.yaml +1 -1
texttools/prompts/subject_to_question.yaml +2 -2
texttools/prompts/summarize.yaml +2 -2
texttools/prompts/text_to_question.yaml +2 -2
texttools/prompts/translate.yaml +2 -2
texttools/tools/async_tools.py +393 -487
texttools/tools/sync_tools.py +394 -488
hamtaa_texttools-1.1.20.dist-info/RECORD +0 -33
texttools/batch/internals/utils.py +0 -13
{hamtaa_texttools-1.1.20.dist-info → hamtaa_texttools-1.1.22.dist-info}/WHEEL +0 -0
{hamtaa_texttools-1.1.20.dist-info → hamtaa_texttools-1.1.22.dist-info}/licenses/LICENSE +0 -0
{hamtaa_texttools-1.1.20.dist-info → hamtaa_texttools-1.1.22.dist-info}/top_level.txt +0 -0
/texttools/batch/{internals/batch_manager.py → batch_manager.py} +0 -0

texttools/prompts/categorize.yaml CHANGED Viewed

@@ -1,77 +1,35 @@
-main_template:
-  category_list: |
-    You are an expert classification agent.
-    You receive a list of categories.
-    Your task:
-    - Read all provided categories carefully.
-    - Consider the user query, intent, and task explanation.
-    - Select exactly one category name from the list that best matches the user’s intent.
-    - Return only the category name, nothing else.
-    Rules:
-    - Never invent categories that are not in the list.
-    - If multiple categories seem possible, choose the closest match based on the description and user intent.
-    - If descriptions are missing or empty, rely on the category name.
-    - If the correct answer cannot be determined with certainty, choose the most likely one.
-    Output format:
-    {{
-    "reason": "Explanation of why the input belongs to the category"
-    "result": "<category_name_only>"
-    }}
-    Available categories with their descriptions:
-    {category_list}
-    The text that has to be categorized:
-    {input}
-  category_tree: |
-    You are an expert classification agent.
-    You receive a list of categories at the current level of a hierarchical category tree.
-    Your task:
-    - Read all provided categories carefully.
-    - Consider the user query, intent, and task explanation.
-    - Select exactly one category name from the list that best matches the user’s intent.
-    - Return only the category name, nothing else.
-    Rules:
-    - Never invent categories that are not in the list.
-    - If multiple categories seem possible, choose the closest match based on the description and user intent.
-    - If descriptions are missing or empty, rely on the category name.
-    - If the correct answer cannot be determined with certainty, choose the most likely one.
-    Output format:
-    {{
-    "reason": "Explanation of why the input belongs to the category"
-    "result": "<category_name_only>"
-    }}
-    Available categories with their descriptions at this level:
-    {category_list}
-    Do not include category descriptions at all. Only write the raw category.
-    The text that has to be categorized:
-    {input}
-analyze_template:
-  category_list: |
-    We want to categorize the given text.
-    To improve categorization, we need an analysis of the text.
-    Analyze the given text and write its main idea and a short analysis of that.
-    Analysis should be very short.
-    Text:
-    {input}
-  category_tree: |
-    We want to categorize the given text.
-    To improve categorization, we need an analysis of the text.
-    Analyze the given text and write its main idea and a short analysis of that.
-    Analysis should be very short.
-    Text:
-    {input}
+main_template: |
+  You are an expert classification agent.
+  You receive a list of categories.
+  Your task:
+  - Read all provided categories carefully.
+  - Consider the user query, intent, and task explanation.
+  - Select exactly one category name from the list that best matches the user’s intent.
+  - Return only the category name, nothing else.
+  Rules:
+  - Never invent categories that are not in the list.
+  - If multiple categories seem possible, choose the closest match based on the description and user intent.
+  - If descriptions are missing or empty, rely on the category name.
+  - If the correct answer cannot be determined with certainty, choose the most likely one.
+  Output format:
+  {{
+  "reason": "Explanation of why the input belongs to the category"
+  "result": "<category_name_only>"
+  }}
+  Available categories with their descriptions:
+  {category_list}
+  The text that has to be categorized:
+  {text}
+analyze_template: |
+  We want to categorize the given text.
+  To improve categorization, we need an analysis of the text.
+  Analyze the given text and write its main idea and a short analysis of that.
+  Analysis should be very short.
+  Text:
+  {text}

texttools/prompts/check_fact.yaml CHANGED Viewed

@@ -5,7 +5,7 @@ main_template: |
   Respond only in JSON format (Output should be a boolean):
   {{"result": True/False}}
   The statement is:
-  {input}
+  {text}
   The source text is:
   {source_text}
@@ -14,6 +14,6 @@ analyze_template: |
   summarized analysis that could help in determining that can the statement
   be concluded from the source or not.
   The statement is:
-  {input}
+  {text}
   The source text is:
   {source_text}

texttools/prompts/extract_entities.yaml CHANGED Viewed

@@ -12,9 +12,9 @@ main_template: |
     ]
   }}
   Here is the text:
-  {input}
+  {text}
 analyze_template: |
   Read the following text and identify any proper nouns, key concepts, or specific mentions that might represent named entities.
   Provide a brief, summarized analysis that could help in categorizing these entities.
-  {input}
+  {text}

texttools/prompts/extract_keywords.yaml CHANGED Viewed

@@ -12,7 +12,7 @@ main_template:
     - Respond only in JSON format:
     {{"result": ["keyword1", "keyword2", etc.]}}
     Here is the text:
-    {input}
+    {text}
   threshold: |
     You are an expert keyword extractor specialized in fine-grained concept identification.
@@ -32,7 +32,7 @@ main_template:
     - Respond only in JSON format:
     {{"result": ["keyword1", "keyword2", etc.]}}
     Here is the text:
-    {input}
+    {text}
   count: |
     You are an expert keyword extractor with precise output requirements.
@@ -49,20 +49,20 @@ main_template:
     {{"result": ["keyword1", "keyword2", "keyword3", ...]}}
     Here is the text:
-    {input}
+    {text}
 analyze_template:
   auto: |
     Analyze the following text to identify its main topics, concepts, and important terms.
     Provide a concise summary of your findings that will help in extracting relevant keywords.
-    {input}
+    {text}
   threshold: |
     Analyze the following text to identify its main topics, concepts, and important terms.
     Provide a concise summary of your findings that will help in extracting relevant keywords.
-    {input}
+    {text}
   count: |
     Analyze the following text to identify its main topics, concepts, and important terms.
     Provide a concise summary of your findings that will help in extracting relevant keywords.
-    {input}
+    {text}

texttools/prompts/is_question.yaml CHANGED Viewed

@@ -4,11 +4,11 @@ main_template: |
   Respond only in JSON format (Output should be a boolean):
   {{"result": True/False}}
   Here is the text:
-  {input}
+  {text}
 analyze_template: |
   We want to analyze this text snippet to see if it contains any question or request of some kind or not.
   Read the text, and reason about it being a request or not.
   Summerized, short answer.
-  {input}
+  {text}

texttools/prompts/merge_questions.yaml CHANGED Viewed

@@ -12,7 +12,7 @@ main_template:
     - Respond only in JSON format:
     {{"result": "string"}}
     Here is the questions:
-    {input}
+    {text}
   reason: |
     You are an AI assistant helping to unify semantically similar questions.
@@ -23,7 +23,7 @@ main_template:
     Respond only in JSON format:
     {{"result": "string"}}
     Here is the questions:
-    {input}
+    {text}
 analyze_template:
@@ -34,7 +34,7 @@ analyze_template:
     Provide a brief, summarized understanding of the questions' meaning that
     will help in merging and rephrasing it accurately without changing its intent.
     Here is the question:
-    {input}
+    {text}
   reason: |
     Analyze the following questions to identify their exact wording, phrasing,
@@ -42,5 +42,5 @@ analyze_template:
     Provide a brief, summarized analysis of their linguistic structure and current meaning,
     which will then be used to create a new question containing all of their contents.
     Here is the question:
-    {input}
+    {text}

texttools/prompts/propositionize.yaml CHANGED Viewed

@@ -12,11 +12,11 @@ main_template: |
   4. No Redundancy: Do not extract summary statements that merely repeat facts already listed.
   Extract the atomic propositions from the following text:
-  {input}
+  {text}
 analyze_template: |
   We want to analyze this text snippet and think about where we can split sentence to atomic meaningful propositions.
   An atomic proposition is a single, self-contained fact that is concise,
   verifiable, and does not rely on external context.
   You just have to think around the possible propositions in the text and how a proposition can be made.
-  {input}
+  {text}

texttools/prompts/rewrite.yaml CHANGED Viewed

@@ -18,7 +18,7 @@ main_template:
     {{"result": "str"}}
     Anchor Text:
-    "{input}"
+    "{text}"
   negative: |
     You are an AI assistant designed to generate high-quality training data for semantic text embedding models.
@@ -35,7 +35,7 @@ main_template:
     {{"result": "str"}}
     Anchor Text:
-    "{input}"
+    "{text}"
   hard_negative: |
       You are an AI assistant designed to generate high-quality training data for semantic text embedding models.
@@ -57,7 +57,7 @@ main_template:
       {{"result": "str"}}
       Anchor Text:
-      "{input}"
+      "{text}"
 analyze_template:
@@ -74,7 +74,7 @@ analyze_template:
     Your analysis should capture the ESSENTIAL MEANING that must be preserved in any paraphrase.
     Text:
-    {input}
+    {text}
   negative: |
     Analyze the following text to identify its SPECIFIC TOPIC and DOMAIN for creating a high-quality NEGATIVE sample.
@@ -88,7 +88,7 @@ analyze_template:
     The goal is to find topics that are in the same domain but semantically unrelated to this specific text.
     Text:
-    {input}
+    {text}
   hard_negative: |
     Analyze this text to identify EXACTLY ONE ELEMENT that can be changed to create a hard-negative sample.
@@ -107,5 +107,5 @@ analyze_template:
     - 80-90% of the vocabulary
     Text:
-    {input}
+    {text}

texttools/prompts/run_custom.yaml CHANGED Viewed

@@ -1,5 +1,5 @@
 main_template: |
-  {input}
+  {text}
   Respond only in JSON format:
   {output_model_str}

texttools/prompts/subject_to_question.yaml CHANGED Viewed

@@ -9,7 +9,7 @@ main_template: |
   Respond only in JSON format:
   {{"result": ["question1", "question2", ...], "reason": "string"}}
   Here is the text:
-  {input}
+  {text}
 analyze_template: |
   Our goal is to generate questions from the given subject.
@@ -19,4 +19,4 @@ analyze_template: |
   What is the subject about?
   What point of views can we see and generate questoins from it? (Questions that real users might have.)
   Here is the subject:
-  {input}
+  {text}

texttools/prompts/summarize.yaml CHANGED Viewed

@@ -4,11 +4,11 @@ main_template: |
   Respond only in JSON format:
   {{"result": "string"}}
   Provide a concise summary of the following text:
-  {input}
+  {text}
 analyze_template: |
   Read the following text and identify its main points, key arguments, and overall purpose.
   Provide a brief, summarized analysis that will help in generating an accurate and concise summary.
-  {input}
+  {text}

texttools/prompts/text_to_question.yaml CHANGED Viewed

@@ -9,7 +9,7 @@ main_template: |
   Respond only in JSON format:
   {{"result": ["question1", "question2", ...], "reason": "string"}}
   Here is the answer:
-  {input}
+  {text}
 analyze_template: |
   Analyze the following answer to identify its key facts,
@@ -18,5 +18,5 @@ analyze_template: |
   help in formulating relevant and direct questions.
   Just mention the keypoints that was provided in the answer
   Here is the answer:
-  {input}
+  {text}

texttools/prompts/translate.yaml CHANGED Viewed

@@ -5,11 +5,11 @@ main_template: |
   {{"result": "string"}}
   Don't translate proper name, only transliterate them to {target_language}
   Translate the following text to {target_language}:
-  {input}
+  {text}
 analyze_template: |
   Analyze the following text and identify important linguistic considerations for translation.
   Point out any idioms, cultural references, or complex structures that need special attention.
   Also, list all proper nouns that should not be translated. Write your analysis in the {target_language}.
-  {input}
+  {text}

hamtaa-texttools 1.1.20__py3-none-any.whl → 1.1.22__py3-none-any.whl

hamtaa-texttools 1.1.20py3-none-any.whl → 1.1.22py3-none-any.whl