PyPI - hamtaa-texttools - Versions diffs - 1.3.2__tar.gz → 2.0.0__tar.gz - Mend

hamtaa-texttools 1.3.2tar.gz → 2.0.0tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (45) hide show

{hamtaa_texttools-1.3.2 → hamtaa_texttools-2.0.0}/LICENSE RENAMED Viewed

@@ -18,4 +18,4 @@ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
-SOFTWARE.
+SOFTWARE.

{hamtaa_texttools-1.3.2 → hamtaa_texttools-2.0.0}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: hamtaa-texttools
-Version: 1.3.2
+Version: 2.0.0
 Summary: A high-level NLP toolkit built on top of modern LLMs.
 Author-email: Tohidi <the.mohammad.tohidi@gmail.com>, Erfan Moosavi <erfanmoosavi84@gmail.com>, Montazer <montazerh82@gmail.com>, Givechi <mohamad.m.givechi@gmail.com>, Zareshahi <a.zareshahi1377@gmail.com>
 Maintainer-email: Erfan Moosavi <erfanmoosavi84@gmail.com>, Tohidi <the.mohammad.tohidi@gmail.com>
@@ -11,7 +11,7 @@ Classifier: License :: OSI Approved :: MIT License
 Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
 Classifier: Topic :: Text Processing
 Classifier: Operating System :: OS Independent
-Requires-Python: >=3.9
+Requires-Python: >=3.11
 Description-Content-Type: text/markdown
 License-File: LICENSE
 Requires-Dist: openai>=1.97.1
@@ -30,30 +30,27 @@ Dynamic: license-file
 It provides both **sync (`TheTool`)** and **async (`AsyncTheTool`)** APIs for maximum flexibility.
-It provides ready-to-use utilities for **translation, question detection, keyword extraction, categorization, NER extraction, and more** - designed to help you integrate AI-powered text processing into your applications with minimal effort.
-**Note:** Most features of `texttools` are reliable when you use `google/gemma-3n-e4b-it` model.
+It provides ready-to-use utilities for **translation, question detection, categorization, NER extraction, and more** - designed to help you integrate AI-powered text processing into your applications with minimal effort.
 ---
 ## ✨ Features
-TextTools provides a rich collection of high-level NLP utilities,
+TextTools provides a collection of high-level NLP utilities.
 Each tool is designed to work with structured outputs.
-- **`categorize()`** - Classifies text into given categories
-- **`extract_keywords()`** - Extracts keywords from the text
-- **`extract_entities()`** - Named Entity Recognition (NER) system
-- **`is_question()`** - Binary question detection
-- **`text_to_question()`** - Generates questions from text
-- **`merge_questions()`** - Merges multiple questions into one
-- **`rewrite()`** - Rewrites text in a different way
-- **`subject_to_question()`** - Generates questions about a given subject
-- **`summarize()`** - Text summarization
-- **`translate()`** - Text translation
-- **`propositionize()`** - Convert text to atomic independent meaningful sentences
-- **`check_fact()`** - Check whether a statement is relevant to the source text
-- **`run_custom()`** - Allows users to define a custom tool with an arbitrary BaseModel
+- **`categorize()`** - Classify text into given categories
+- **`extract_keywords()`** - Extract keywords from the text
+- **`extract_entities()`** - Perform Named Entity Recognition (NER)
+- **`is_question()`** - Detect if the input is phrased as a question
+- **`to_question()`** - Generate questions from the given text / subject
+- **`merge_questions()`** - Merge multiple questions into one
+- **`augment()`** - Rewrite text in different augmentations
+- **`summarize()`** - Summarize the given text
+- **`translate()`** - Translate text between languages
+- **`propositionize()`** - Convert a text into atomic, independent, meaningful sentences
+- **`is_fact()`** - Check whether a statement is a fact based on the source text
+- **`run_custom()`** - Custom tool that can do almost anything
 ---
@@ -71,14 +68,12 @@ pip install -U hamtaa-texttools
 | Status | Meaning | Tools | Safe for Production? |
 |--------|---------|----------|-------------------|
-| **✅ Production** | Evaluated, tested, stable. | `categorize()` (list mode), `extract_keywords()`, `extract_entities()`, `is_question()`, `text_to_question()`, `merge_questions()`, `rewrite()`, `subject_to_question()`, `summarize()`, `run_custom()` | **Yes** - ready for reliable use. |
-| **🧪 Experimental** | Added to the package but **not fully evaluated**. Functional, but quality may vary. | `categorize()` (tree mode), `translate()`, `propositionize()`, `check_fact()` | **Use with caution** - outputs not yet validated. |
+| **✅ Production** | Evaluated and tested. | `categorize()` (list mode), `extract_keywords()`, `extract_entities()`, `is_question()`, `to_question()`, `merge_questions()`, `augment()`, `summarize()`, `run_custom()` | **Yes** - ready for reliable use. |
+| **🧪 Experimental** | Added to the package but **not fully evaluated**. | `categorize()` (tree mode), `translate()`, `propositionize()`, `is_fact()` | **Use with caution** |
 ---
-## ⚙️ `with_analysis`, `logprobs`, `output_lang`, `user_prompt`, `temperature`, `validator`, `priority` and `timeout` parameters
-TextTools provides several optional flags to customize LLM behavior:
+## ⚙️ Additional Parameters
 - **`with_analysis: bool`** → Adds a reasoning step before generating the final output.
 **Note:** This doubles token usage per call.
@@ -88,17 +83,17 @@ TextTools provides several optional flags to customize LLM behavior:
 - **`output_lang: str`** → Forces the model to respond in a specific language.
-- **`user_prompt: str`** → Allows you to inject a custom instruction or into the model alongside the main template. This gives you fine-grained control over how the model interprets or modifies the input text.
+- **`user_prompt: str`** → Allows you to inject a custom instruction into the model alongside the main template.
-- **`temperature: float`** → Determines how creative the model should respond. Takes a float number from `0.0` to `2.0`.
+- **`temperature: float`** → Determines how creative the model should respond. Takes a float number between `0.0` and `2.0`.
-- **`validator: Callable (Experimental)`** → Forces TheTool to validate the output result based on your custom validator. Validator should return a boolean. If the validator fails, TheTool will retry to get another output by modifying `temperature`. You can also specify `max_validation_retries=<N>`.
+- **`validator: Callable (Experimental)`** → Forces the tool to validate the output result based on your validator function. Validator should return a boolean. If the validator fails, TheTool will retry to get another output by modifying `temperature`. You can also specify `max_validation_retries=<N>`.
-- **`priority: int (Experimental)`** → Task execution priority level. Affects processing order in queues.
+- **`priority: int (Experimental)`** → Affects processing order in queues.
 **Note:** This feature works if it's supported by the model and vLLM.
-- **`timeout: float`** → Maximum time in seconds to wait for the response before raising a timeout error
-**Note:** This feature only exists in `AsyncTheTool`.
+- **`timeout: float`** → Maximum time in seconds to wait for the response before raising a timeout error.
+**Note:** This feature is only available in `AsyncTheTool`.
 ---
@@ -110,12 +105,14 @@ Every tool of `TextTools` returns a `ToolOutput` object which is a BaseModel wit
 - **`analysis: str`**
 - **`logprobs: list`**
 - **`errors: list[str]`**
-- **`ToolOutputMetadata`** →
+- **`ToolOutputMetadata`**
     - **`tool_name: str`**
     - **`processed_at: datetime`**
     - **`execution_time: float`**
-**Note:** You can use `repr(ToolOutput)` to print your output with all the details.
+- Serialize output to JSON using the `to_json()` method.
+- Verify operation success with the `is_successful()` method.
+- Convert output to a dictionary with the `to_dict()` method.
 ---
@@ -133,13 +130,13 @@ Every tool of `TextTools` returns a `ToolOutput` object which is a BaseModel wit
 from openai import OpenAI
 from texttools import TheTool
-client = OpenAI(base_url = "your_url", API_KEY = "your_api_key")
+client = OpenAI(base_url="your_url", API_KEY="your_api_key")
 model = "model_name"
 the_tool = TheTool(client=client, model=model)
 detection = the_tool.is_question("Is this project open source?")
-print(repr(detection))
+print(detection.to_json())
 ```
 ---
@@ -157,24 +154,24 @@ async def main():
     async_the_tool = AsyncTheTool(client=async_client, model=model)
-    translation_task = async_the_tool.translate("سلام، حالت چطوره؟", target_language="English")
-    keywords_task = async_the_tool.extract_keywords("Tomorrow, we will be dead by the car crash")
+    translation_task = async_the_tool.translate("سلام، حالت چطوره؟", target_lang="English")
+    keywords_task = async_the_tool.extract_keywords("This open source project is great for processing large datasets!")
     (translation, keywords) = await asyncio.gather(translation_task, keywords_task)
-    print(repr(translation))
-    print(repr(keywords))
+    print(translation.to_json())
+    print(keywords.to_json())
 asyncio.run(main())
 ```
 ---
-## 👍 Use Cases
+## ✅ Use Cases
 Use **TextTools** when you need to:
-- 🔍 **Classify** large datasets quickly without model training
-- 🌍 **Translate** and process multilingual corpora with ease
+- 🔍 **Classify** large datasets quickly without model training
 - 🧩 **Integrate** LLMs into production pipelines (structured outputs)
 - 📊 **Analyze** large text collections using embeddings and categorization

{hamtaa_texttools-1.3.2 → hamtaa_texttools-2.0.0}/README.md RENAMED Viewed

@@ -9,30 +9,27 @@
 It provides both **sync (`TheTool`)** and **async (`AsyncTheTool`)** APIs for maximum flexibility.
-It provides ready-to-use utilities for **translation, question detection, keyword extraction, categorization, NER extraction, and more** - designed to help you integrate AI-powered text processing into your applications with minimal effort.
-**Note:** Most features of `texttools` are reliable when you use `google/gemma-3n-e4b-it` model.
+It provides ready-to-use utilities for **translation, question detection, categorization, NER extraction, and more** - designed to help you integrate AI-powered text processing into your applications with minimal effort.
 ---
 ## ✨ Features
-TextTools provides a rich collection of high-level NLP utilities,
+TextTools provides a collection of high-level NLP utilities.
 Each tool is designed to work with structured outputs.
-- **`categorize()`** - Classifies text into given categories
-- **`extract_keywords()`** - Extracts keywords from the text
-- **`extract_entities()`** - Named Entity Recognition (NER) system
-- **`is_question()`** - Binary question detection
-- **`text_to_question()`** - Generates questions from text
-- **`merge_questions()`** - Merges multiple questions into one
-- **`rewrite()`** - Rewrites text in a different way
-- **`subject_to_question()`** - Generates questions about a given subject
-- **`summarize()`** - Text summarization
-- **`translate()`** - Text translation
-- **`propositionize()`** - Convert text to atomic independent meaningful sentences
-- **`check_fact()`** - Check whether a statement is relevant to the source text
-- **`run_custom()`** - Allows users to define a custom tool with an arbitrary BaseModel
+- **`categorize()`** - Classify text into given categories
+- **`extract_keywords()`** - Extract keywords from the text
+- **`extract_entities()`** - Perform Named Entity Recognition (NER)
+- **`is_question()`** - Detect if the input is phrased as a question
+- **`to_question()`** - Generate questions from the given text / subject
+- **`merge_questions()`** - Merge multiple questions into one
+- **`augment()`** - Rewrite text in different augmentations
+- **`summarize()`** - Summarize the given text
+- **`translate()`** - Translate text between languages
+- **`propositionize()`** - Convert a text into atomic, independent, meaningful sentences
+- **`is_fact()`** - Check whether a statement is a fact based on the source text
+- **`run_custom()`** - Custom tool that can do almost anything
 ---
@@ -50,14 +47,12 @@ pip install -U hamtaa-texttools
 | Status | Meaning | Tools | Safe for Production? |
 |--------|---------|----------|-------------------|
-| **✅ Production** | Evaluated, tested, stable. | `categorize()` (list mode), `extract_keywords()`, `extract_entities()`, `is_question()`, `text_to_question()`, `merge_questions()`, `rewrite()`, `subject_to_question()`, `summarize()`, `run_custom()` | **Yes** - ready for reliable use. |
-| **🧪 Experimental** | Added to the package but **not fully evaluated**. Functional, but quality may vary. | `categorize()` (tree mode), `translate()`, `propositionize()`, `check_fact()` | **Use with caution** - outputs not yet validated. |
+| **✅ Production** | Evaluated and tested. | `categorize()` (list mode), `extract_keywords()`, `extract_entities()`, `is_question()`, `to_question()`, `merge_questions()`, `augment()`, `summarize()`, `run_custom()` | **Yes** - ready for reliable use. |
+| **🧪 Experimental** | Added to the package but **not fully evaluated**. | `categorize()` (tree mode), `translate()`, `propositionize()`, `is_fact()` | **Use with caution** |
 ---
-## ⚙️ `with_analysis`, `logprobs`, `output_lang`, `user_prompt`, `temperature`, `validator`, `priority` and `timeout` parameters
-TextTools provides several optional flags to customize LLM behavior:
+## ⚙️ Additional Parameters
 - **`with_analysis: bool`** → Adds a reasoning step before generating the final output.
 **Note:** This doubles token usage per call.
@@ -67,17 +62,17 @@ TextTools provides several optional flags to customize LLM behavior:
 - **`output_lang: str`** → Forces the model to respond in a specific language.
-- **`user_prompt: str`** → Allows you to inject a custom instruction or into the model alongside the main template. This gives you fine-grained control over how the model interprets or modifies the input text.
+- **`user_prompt: str`** → Allows you to inject a custom instruction into the model alongside the main template.
-- **`temperature: float`** → Determines how creative the model should respond. Takes a float number from `0.0` to `2.0`.
+- **`temperature: float`** → Determines how creative the model should respond. Takes a float number between `0.0` and `2.0`.
-- **`validator: Callable (Experimental)`** → Forces TheTool to validate the output result based on your custom validator. Validator should return a boolean. If the validator fails, TheTool will retry to get another output by modifying `temperature`. You can also specify `max_validation_retries=<N>`.
+- **`validator: Callable (Experimental)`** → Forces the tool to validate the output result based on your validator function. Validator should return a boolean. If the validator fails, TheTool will retry to get another output by modifying `temperature`. You can also specify `max_validation_retries=<N>`.
-- **`priority: int (Experimental)`** → Task execution priority level. Affects processing order in queues.
+- **`priority: int (Experimental)`** → Affects processing order in queues.
 **Note:** This feature works if it's supported by the model and vLLM.
-- **`timeout: float`** → Maximum time in seconds to wait for the response before raising a timeout error
-**Note:** This feature only exists in `AsyncTheTool`.
+- **`timeout: float`** → Maximum time in seconds to wait for the response before raising a timeout error.
+**Note:** This feature is only available in `AsyncTheTool`.
 ---
@@ -89,12 +84,14 @@ Every tool of `TextTools` returns a `ToolOutput` object which is a BaseModel wit
 - **`analysis: str`**
 - **`logprobs: list`**
 - **`errors: list[str]`**
-- **`ToolOutputMetadata`** →
+- **`ToolOutputMetadata`**
     - **`tool_name: str`**
     - **`processed_at: datetime`**
     - **`execution_time: float`**
-**Note:** You can use `repr(ToolOutput)` to print your output with all the details.
+- Serialize output to JSON using the `to_json()` method.
+- Verify operation success with the `is_successful()` method.
+- Convert output to a dictionary with the `to_dict()` method.
 ---
@@ -112,13 +109,13 @@ Every tool of `TextTools` returns a `ToolOutput` object which is a BaseModel wit
 from openai import OpenAI
 from texttools import TheTool
-client = OpenAI(base_url = "your_url", API_KEY = "your_api_key")
+client = OpenAI(base_url="your_url", API_KEY="your_api_key")
 model = "model_name"
 the_tool = TheTool(client=client, model=model)
 detection = the_tool.is_question("Is this project open source?")
-print(repr(detection))
+print(detection.to_json())
 ```
 ---
@@ -136,24 +133,24 @@ async def main():
     async_the_tool = AsyncTheTool(client=async_client, model=model)
-    translation_task = async_the_tool.translate("سلام، حالت چطوره؟", target_language="English")
-    keywords_task = async_the_tool.extract_keywords("Tomorrow, we will be dead by the car crash")
+    translation_task = async_the_tool.translate("سلام، حالت چطوره؟", target_lang="English")
+    keywords_task = async_the_tool.extract_keywords("This open source project is great for processing large datasets!")
     (translation, keywords) = await asyncio.gather(translation_task, keywords_task)
-    print(repr(translation))
-    print(repr(keywords))
+    print(translation.to_json())
+    print(keywords.to_json())
 asyncio.run(main())
 ```
 ---
-## 👍 Use Cases
+## ✅ Use Cases
 Use **TextTools** when you need to:
-- 🔍 **Classify** large datasets quickly without model training
-- 🌍 **Translate** and process multilingual corpora with ease
+- 🔍 **Classify** large datasets quickly without model training
 - 🧩 **Integrate** LLMs into production pipelines (structured outputs)
 - 📊 **Analyze** large text collections using embeddings and categorization

{hamtaa_texttools-1.3.2 → hamtaa_texttools-2.0.0}/hamtaa_texttools.egg-info/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: hamtaa-texttools
-Version: 1.3.2
+Version: 2.0.0
 Summary: A high-level NLP toolkit built on top of modern LLMs.
 Author-email: Tohidi <the.mohammad.tohidi@gmail.com>, Erfan Moosavi <erfanmoosavi84@gmail.com>, Montazer <montazerh82@gmail.com>, Givechi <mohamad.m.givechi@gmail.com>, Zareshahi <a.zareshahi1377@gmail.com>
 Maintainer-email: Erfan Moosavi <erfanmoosavi84@gmail.com>, Tohidi <the.mohammad.tohidi@gmail.com>
@@ -11,7 +11,7 @@ Classifier: License :: OSI Approved :: MIT License
 Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
 Classifier: Topic :: Text Processing
 Classifier: Operating System :: OS Independent
-Requires-Python: >=3.9
+Requires-Python: >=3.11
 Description-Content-Type: text/markdown
 License-File: LICENSE
 Requires-Dist: openai>=1.97.1
@@ -30,30 +30,27 @@ Dynamic: license-file
 It provides both **sync (`TheTool`)** and **async (`AsyncTheTool`)** APIs for maximum flexibility.
-It provides ready-to-use utilities for **translation, question detection, keyword extraction, categorization, NER extraction, and more** - designed to help you integrate AI-powered text processing into your applications with minimal effort.
-**Note:** Most features of `texttools` are reliable when you use `google/gemma-3n-e4b-it` model.
+It provides ready-to-use utilities for **translation, question detection, categorization, NER extraction, and more** - designed to help you integrate AI-powered text processing into your applications with minimal effort.
 ---
 ## ✨ Features
-TextTools provides a rich collection of high-level NLP utilities,
+TextTools provides a collection of high-level NLP utilities.
 Each tool is designed to work with structured outputs.
-- **`categorize()`** - Classifies text into given categories
-- **`extract_keywords()`** - Extracts keywords from the text
-- **`extract_entities()`** - Named Entity Recognition (NER) system
-- **`is_question()`** - Binary question detection
-- **`text_to_question()`** - Generates questions from text
-- **`merge_questions()`** - Merges multiple questions into one
-- **`rewrite()`** - Rewrites text in a different way
-- **`subject_to_question()`** - Generates questions about a given subject
-- **`summarize()`** - Text summarization
-- **`translate()`** - Text translation
-- **`propositionize()`** - Convert text to atomic independent meaningful sentences
-- **`check_fact()`** - Check whether a statement is relevant to the source text
-- **`run_custom()`** - Allows users to define a custom tool with an arbitrary BaseModel
+- **`categorize()`** - Classify text into given categories
+- **`extract_keywords()`** - Extract keywords from the text
+- **`extract_entities()`** - Perform Named Entity Recognition (NER)
+- **`is_question()`** - Detect if the input is phrased as a question
+- **`to_question()`** - Generate questions from the given text / subject
+- **`merge_questions()`** - Merge multiple questions into one
+- **`augment()`** - Rewrite text in different augmentations
+- **`summarize()`** - Summarize the given text
+- **`translate()`** - Translate text between languages
+- **`propositionize()`** - Convert a text into atomic, independent, meaningful sentences
+- **`is_fact()`** - Check whether a statement is a fact based on the source text
+- **`run_custom()`** - Custom tool that can do almost anything
 ---
@@ -71,14 +68,12 @@ pip install -U hamtaa-texttools
 | Status | Meaning | Tools | Safe for Production? |
 |--------|---------|----------|-------------------|
-| **✅ Production** | Evaluated, tested, stable. | `categorize()` (list mode), `extract_keywords()`, `extract_entities()`, `is_question()`, `text_to_question()`, `merge_questions()`, `rewrite()`, `subject_to_question()`, `summarize()`, `run_custom()` | **Yes** - ready for reliable use. |
-| **🧪 Experimental** | Added to the package but **not fully evaluated**. Functional, but quality may vary. | `categorize()` (tree mode), `translate()`, `propositionize()`, `check_fact()` | **Use with caution** - outputs not yet validated. |
+| **✅ Production** | Evaluated and tested. | `categorize()` (list mode), `extract_keywords()`, `extract_entities()`, `is_question()`, `to_question()`, `merge_questions()`, `augment()`, `summarize()`, `run_custom()` | **Yes** - ready for reliable use. |
+| **🧪 Experimental** | Added to the package but **not fully evaluated**. | `categorize()` (tree mode), `translate()`, `propositionize()`, `is_fact()` | **Use with caution** |
 ---
-## ⚙️ `with_analysis`, `logprobs`, `output_lang`, `user_prompt`, `temperature`, `validator`, `priority` and `timeout` parameters
-TextTools provides several optional flags to customize LLM behavior:
+## ⚙️ Additional Parameters
 - **`with_analysis: bool`** → Adds a reasoning step before generating the final output.
 **Note:** This doubles token usage per call.
@@ -88,17 +83,17 @@ TextTools provides several optional flags to customize LLM behavior:
 - **`output_lang: str`** → Forces the model to respond in a specific language.
-- **`user_prompt: str`** → Allows you to inject a custom instruction or into the model alongside the main template. This gives you fine-grained control over how the model interprets or modifies the input text.
+- **`user_prompt: str`** → Allows you to inject a custom instruction into the model alongside the main template.
-- **`temperature: float`** → Determines how creative the model should respond. Takes a float number from `0.0` to `2.0`.
+- **`temperature: float`** → Determines how creative the model should respond. Takes a float number between `0.0` and `2.0`.
-- **`validator: Callable (Experimental)`** → Forces TheTool to validate the output result based on your custom validator. Validator should return a boolean. If the validator fails, TheTool will retry to get another output by modifying `temperature`. You can also specify `max_validation_retries=<N>`.
+- **`validator: Callable (Experimental)`** → Forces the tool to validate the output result based on your validator function. Validator should return a boolean. If the validator fails, TheTool will retry to get another output by modifying `temperature`. You can also specify `max_validation_retries=<N>`.
-- **`priority: int (Experimental)`** → Task execution priority level. Affects processing order in queues.
+- **`priority: int (Experimental)`** → Affects processing order in queues.
 **Note:** This feature works if it's supported by the model and vLLM.
-- **`timeout: float`** → Maximum time in seconds to wait for the response before raising a timeout error
-**Note:** This feature only exists in `AsyncTheTool`.
+- **`timeout: float`** → Maximum time in seconds to wait for the response before raising a timeout error.
+**Note:** This feature is only available in `AsyncTheTool`.
 ---
@@ -110,12 +105,14 @@ Every tool of `TextTools` returns a `ToolOutput` object which is a BaseModel wit
 - **`analysis: str`**
 - **`logprobs: list`**
 - **`errors: list[str]`**
-- **`ToolOutputMetadata`** →
+- **`ToolOutputMetadata`**
     - **`tool_name: str`**
     - **`processed_at: datetime`**
     - **`execution_time: float`**
-**Note:** You can use `repr(ToolOutput)` to print your output with all the details.
+- Serialize output to JSON using the `to_json()` method.
+- Verify operation success with the `is_successful()` method.
+- Convert output to a dictionary with the `to_dict()` method.
 ---
@@ -133,13 +130,13 @@ Every tool of `TextTools` returns a `ToolOutput` object which is a BaseModel wit
 from openai import OpenAI
 from texttools import TheTool
-client = OpenAI(base_url = "your_url", API_KEY = "your_api_key")
+client = OpenAI(base_url="your_url", API_KEY="your_api_key")
 model = "model_name"
 the_tool = TheTool(client=client, model=model)
 detection = the_tool.is_question("Is this project open source?")
-print(repr(detection))
+print(detection.to_json())
 ```
 ---
@@ -157,24 +154,24 @@ async def main():
     async_the_tool = AsyncTheTool(client=async_client, model=model)
-    translation_task = async_the_tool.translate("سلام، حالت چطوره؟", target_language="English")
-    keywords_task = async_the_tool.extract_keywords("Tomorrow, we will be dead by the car crash")
+    translation_task = async_the_tool.translate("سلام، حالت چطوره؟", target_lang="English")
+    keywords_task = async_the_tool.extract_keywords("This open source project is great for processing large datasets!")
     (translation, keywords) = await asyncio.gather(translation_task, keywords_task)
-    print(repr(translation))
-    print(repr(keywords))
+    print(translation.to_json())
+    print(keywords.to_json())
 asyncio.run(main())
 ```
 ---
-## 👍 Use Cases
+## ✅ Use Cases
 Use **TextTools** when you need to:
-- 🔍 **Classify** large datasets quickly without model training
-- 🌍 **Translate** and process multilingual corpora with ease
+- 🔍 **Classify** large datasets quickly without model training
 - 🧩 **Integrate** LLMs into production pipelines (structured outputs)
 - 📊 **Analyze** large text collections using embeddings and categorization

{hamtaa_texttools-1.3.2 → hamtaa_texttools-2.0.0}/hamtaa_texttools.egg-info/SOURCES.txt RENAMED Viewed

@@ -6,31 +6,29 @@ hamtaa_texttools.egg-info/SOURCES.txt
 hamtaa_texttools.egg-info/dependency_links.txt
 hamtaa_texttools.egg-info/requires.txt
 hamtaa_texttools.egg-info/top_level.txt
-tests/test_all_async_tools.py
-tests/test_all_tools.py
-tests/test_output_validation.py
+tests/test_category_tree.py
+tests/test_to_chunks.py
 texttools/__init__.py
 texttools/models.py
 texttools/py.typed
 texttools/core/__init__.py
-texttools/core/engine.py
 texttools/core/exceptions.py
 texttools/core/internal_models.py
+texttools/core/utils.py
 texttools/core/operators/__init__.py
 texttools/core/operators/async_operator.py
 texttools/core/operators/sync_operator.py
+texttools/prompts/augment.yaml
 texttools/prompts/categorize.yaml
-texttools/prompts/check_fact.yaml
 texttools/prompts/extract_entities.yaml
 texttools/prompts/extract_keywords.yaml
+texttools/prompts/is_fact.yaml
 texttools/prompts/is_question.yaml
 texttools/prompts/merge_questions.yaml
 texttools/prompts/propositionize.yaml
-texttools/prompts/rewrite.yaml
 texttools/prompts/run_custom.yaml
-texttools/prompts/subject_to_question.yaml
 texttools/prompts/summarize.yaml
-texttools/prompts/text_to_question.yaml
+texttools/prompts/to_question.yaml
 texttools/prompts/translate.yaml
 texttools/tools/__init__.py
 texttools/tools/async_tools.py

{hamtaa_texttools-1.3.2 → hamtaa_texttools-2.0.0}/pyproject.toml RENAMED Viewed

@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
 [project]
 name = "hamtaa-texttools"
-version = "1.3.2"
+version = "2.0.0"
 authors = [
   {name = "Tohidi", email = "the.mohammad.tohidi@gmail.com"},
   {name = "Erfan Moosavi", email = "erfanmoosavi84@gmail.com"},
@@ -19,7 +19,7 @@ maintainers = [
 description = "A high-level NLP toolkit built on top of modern LLMs."
 readme = "README.md"
 license = {text = "MIT"}
-requires-python = ">=3.9"
+requires-python = ">=3.11"
 dependencies = [
   "openai>=1.97.1",
   "pydantic>=2.0.0",

hamtaa_texttools-2.0.0/tests/test_category_tree.py ADDED Viewed

@@ -0,0 +1,48 @@
+import pytest
+from texttools.models import CategoryTree, Node
+@pytest.fixture
+def tree():
+    tree = CategoryTree()
+    tree.add_node("اخلاق", "root")
+    tree.add_node("معرفت شناسی", "root")
+    tree.add_node("متافیزیک", "root")
+    tree.add_node("فلسفه ذهن", "root")
+    tree.add_node("آگاهی", "فلسفه ذهن")
+    tree.add_node("ذهن و بدن", "فلسفه ذهن")
+    tree.add_node("امکان و ضرورت", "متافیزیک")
+    tree.add_node("مغز و ترشحات", "ذهن و بدن")
+    return tree
+def test_level_count(tree):
+    assert tree.get_level_count() == 3
+def test_none_node(tree):
+    assert tree.get_node("سلامت") is None
+def test_get_node(tree):
+    assert isinstance(tree.get_node("آگاهی"), Node)
+def test_add_duplicate_node(tree):
+    with pytest.raises(ValueError, match="Cannot add آگاهی category twice"):
+        tree.add_node("آگاهی", "root")
+def test_wrong_parent(tree):
+    with pytest.raises(ValueError, match="Parent category امکان not found"):
+        tree.add_node("ضرورت", "امکان")
+def test_remove_root(tree):
+    with pytest.raises(ValueError, match="Cannot remove the root node"):
+        tree.remove_node("root")
+def test_remove_none(tree):
+    with pytest.raises(ValueError, match="Category: ایجاب not found"):
+        tree.remove_node("ایجاب")

hamtaa_texttools-2.0.0/tests/test_to_chunks.py ADDED Viewed

@@ -0,0 +1,13 @@
+from texttools.core.utils import TheToolUtils
+def test_single_chunk():
+    text = "Short text"
+    chunks = TheToolUtils.to_chunks(text, size=100, overlap=0)
+    assert len(chunks) == 1
+    assert chunks[0] == "Short text"
+def test_empty_text():
+    chunks = TheToolUtils.to_chunks("", size=10, overlap=0)
+    assert len(chunks) == 0

{hamtaa_texttools-1.3.2 → hamtaa_texttools-2.0.0}/texttools/__init__.py RENAMED Viewed

@@ -2,4 +2,4 @@ from .models import CategoryTree
 from .tools.async_tools import AsyncTheTool
 from .tools.sync_tools import TheTool
-__all__ = ["TheTool", "AsyncTheTool", "CategoryTree"]
+__all__ = ["CategoryTree", "AsyncTheTool", "TheTool"]

hamtaa-texttools 1.3.2__tar.gz → 2.0.0__tar.gz

hamtaa-texttools 1.3.2tar.gz → 2.0.0tar.gz