PyPI - swarmauri_parser_keywordextractor - Versions diffs - 0.9.0.dev4__tar.gz → 0.9.0.dev33__tar.gz - Mend

swarmauri_parser_keywordextractor 0.9.0.dev4tar.gz → 0.9.0.dev33tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (6) hide show

{swarmauri_parser_keywordextractor-0.9.0.dev4 → swarmauri_parser_keywordextractor-0.9.0.dev33}/PKG-INFO RENAMED Viewed

@@ -1,8 +1,10 @@
-Metadata-Version: 2.3
+Metadata-Version: 2.4
 Name: swarmauri_parser_keywordextractor
-Version: 0.9.0.dev4
+Version: 0.9.0.dev33
 Summary: Keyword Extractor Parser for Swarmauri.
-License: Apache-2.0
+License-Expression: Apache-2.0
+License-File: LICENSE
+Keywords: swarmauri,sdk,standards,parser,keywordextractor
 Author: Jacob Stewart
 Author-email: jacob@swarmauri.com
 Requires-Python: >=3.10,<3.13
@@ -10,6 +12,9 @@ Classifier: License :: OSI Approved :: Apache Software License
 Classifier: Programming Language :: Python :: 3.10
 Classifier: Programming Language :: Python :: 3.11
 Classifier: Programming Language :: Python :: 3.12
+Classifier: Programming Language :: Python
+Classifier: Programming Language :: Python :: 3
+Classifier: Programming Language :: Python :: 3 :: Only
 Requires-Dist: swarmauri_base
 Requires-Dist: swarmauri_core
 Requires-Dist: swarmauri_standard
@@ -17,7 +22,7 @@ Requires-Dist: yake (==0.4.8)
 Description-Content-Type: text/markdown
-![Swamauri Logo](https://res.cloudinary.com/dbjmpekvl/image/upload/v1730099724/Swarmauri-logo-lockup-2048x757_hww01w.png)
+![Swarmauri Logo](https://github.com/swarmauri/swarmauri-sdk/blob/3d4d1cfa949399d7019ae9d8f296afba773dfb7f/assets/swarmauri.brand.theme.svg)
 <p align="center">
     <a href="https://pypi.org/project/swarmauri_parser_keywordextractor/">
@@ -36,31 +41,54 @@ Description-Content-Type: text/markdown
 # Swarmauri Parser Keywordextractor
-A parser component that extracts keywords from text using the YAKE keyword extraction library.
+`KeywordExtractorParser` wraps the [YAKE](https://github.com/LIAAD/yake) keyword
+extraction library to turn arbitrary text into a ranked list of
+`swarmauri_standard.documents.Document` instances. Each returned document stores
+the detected keyword in `content` and the YAKE importance score in
+`metadata["score"]`.
+The parser normalizes any input into a string before analysis and, by default,
+extracts up to 10 keywords using the English language model, three-word maximum
+phrases, and YAKE's sequence-matching deduplication (`dedupLim=0.9`). Override
+`lang` or `num_keywords` when instantiating the parser to tailor the output to
+your dataset.
 ## Installation
+Choose the tool that matches your workflow:
 ```bash
+# pip
 pip install swarmauri_parser_keywordextractor
+# Poetry
+poetry add swarmauri_parser_keywordextractor
+# uv
+uv add swarmauri_parser_keywordextractor
 ```
 ## Usage
-Here's a basic example of how to use the KeywordExtractorParser:
+Here's a basic example of how to use the `KeywordExtractorParser`:
 ```python
-from swarmauri_parser_keywordextractor.KeywordExtractorParser import KeywordExtractorParser
+from swarmauri_parser_keywordextractor import KeywordExtractorParser
-# Initialize the parser
-parser = KeywordExtractorParser()
+# Initialize the parser for three keywords in English
+parser = KeywordExtractorParser(num_keywords=3, lang="en")
-# Parse text and extract keywords
 text = "Artificial intelligence and machine learning are transforming technology"
 documents = parser.parse(text)
-# Access extracted keywords and their scores
-for doc in documents:
-    print(f"Keyword: {doc.content}, Score: {doc.metadata['score']}")
+for document in documents:
+    score = document.metadata["score"]
+    print(f"Keyword: {document.content}, Score: {score:.4f}")
 ```
+Each call to `parse` returns a list of `Document` objects ranked by YAKE so you
+can feed them directly into downstream Swarmauri pipelines.
 ## Want to help?
 If you want to contribute to swarmauri-sdk, read up on our [guidelines for contributing](https://github.com/swarmauri/swarmauri-sdk/blob/master/contributing.md) that will help you get started.

{swarmauri_parser_keywordextractor-0.9.0.dev4 → swarmauri_parser_keywordextractor-0.9.0.dev33}/README.md RENAMED Viewed

@@ -1,5 +1,5 @@
-![Swamauri Logo](https://res.cloudinary.com/dbjmpekvl/image/upload/v1730099724/Swarmauri-logo-lockup-2048x757_hww01w.png)
+![Swarmauri Logo](https://github.com/swarmauri/swarmauri-sdk/blob/3d4d1cfa949399d7019ae9d8f296afba773dfb7f/assets/swarmauri.brand.theme.svg)
 <p align="center">
     <a href="https://pypi.org/project/swarmauri_parser_keywordextractor/">
@@ -18,31 +18,54 @@
 # Swarmauri Parser Keywordextractor
-A parser component that extracts keywords from text using the YAKE keyword extraction library.
+`KeywordExtractorParser` wraps the [YAKE](https://github.com/LIAAD/yake) keyword
+extraction library to turn arbitrary text into a ranked list of
+`swarmauri_standard.documents.Document` instances. Each returned document stores
+the detected keyword in `content` and the YAKE importance score in
+`metadata["score"]`.
+The parser normalizes any input into a string before analysis and, by default,
+extracts up to 10 keywords using the English language model, three-word maximum
+phrases, and YAKE's sequence-matching deduplication (`dedupLim=0.9`). Override
+`lang` or `num_keywords` when instantiating the parser to tailor the output to
+your dataset.
 ## Installation
+Choose the tool that matches your workflow:
 ```bash
+# pip
 pip install swarmauri_parser_keywordextractor
+# Poetry
+poetry add swarmauri_parser_keywordextractor
+# uv
+uv add swarmauri_parser_keywordextractor
 ```
 ## Usage
-Here's a basic example of how to use the KeywordExtractorParser:
+Here's a basic example of how to use the `KeywordExtractorParser`:
 ```python
-from swarmauri_parser_keywordextractor.KeywordExtractorParser import KeywordExtractorParser
+from swarmauri_parser_keywordextractor import KeywordExtractorParser
-# Initialize the parser
-parser = KeywordExtractorParser()
+# Initialize the parser for three keywords in English
+parser = KeywordExtractorParser(num_keywords=3, lang="en")
-# Parse text and extract keywords
 text = "Artificial intelligence and machine learning are transforming technology"
 documents = parser.parse(text)
-# Access extracted keywords and their scores
-for doc in documents:
-    print(f"Keyword: {doc.content}, Score: {doc.metadata['score']}")
+for document in documents:
+    score = document.metadata["score"]
+    print(f"Keyword: {document.content}, Score: {score:.4f}")
 ```
+Each call to `parse` returns a list of `Document` objects ranked by YAKE so you
+can feed them directly into downstream Swarmauri pipelines.
 ## Want to help?
 If you want to contribute to swarmauri-sdk, read up on our [guidelines for contributing](https://github.com/swarmauri/swarmauri-sdk/blob/master/contributing.md) that will help you get started.

{swarmauri_parser_keywordextractor-0.9.0.dev4 → swarmauri_parser_keywordextractor-0.9.0.dev33}/pyproject.toml RENAMED Viewed

@@ -1,6 +1,6 @@
 [project]
 name = "swarmauri_parser_keywordextractor"
-version = "0.9.0.dev4"
+version = "0.9.0.dev33"
 description = "Keyword Extractor Parser for Swarmauri."
 license = "Apache-2.0"
 readme = "README.md"
@@ -11,6 +11,9 @@ classifiers = [
     "Programming Language :: Python :: 3.10",
     "Programming Language :: Python :: 3.11",
     "Programming Language :: Python :: 3.12",
+    "Programming Language :: Python",
+    "Programming Language :: Python :: 3",
+    "Programming Language :: Python :: 3 :: Only",
 ]
 authors = [{ name = "Jacob Stewart", email = "jacob@swarmauri.com" }]
 dependencies = [
@@ -19,6 +22,13 @@ dependencies = [
     "swarmauri_base",
     "swarmauri_standard",
 ]
+keywords = [
+    'swarmauri',
+    'sdk',
+    'standards',
+    'parser',
+    'keywordextractor',
+]
 [tool.uv.sources]
 swarmauri_core = { workspace = true }
@@ -37,6 +47,7 @@ markers = [
     "xfail: Expected failures",
     "acceptance: Acceptance tests",
     "perf: Performance tests that measure execution time and resource usage",
+    "example: Example usage tests",
 ]
 timeout = 300
 log_cli = true

{swarmauri_parser_keywordextractor-0.9.0.dev4 → swarmauri_parser_keywordextractor-0.9.0.dev33}/LICENSE RENAMED Viewed

File without changes

{swarmauri_parser_keywordextractor-0.9.0.dev4 → swarmauri_parser_keywordextractor-0.9.0.dev33}/swarmauri_parser_keywordextractor/KeywordExtractorParser.py RENAMED Viewed

File without changes

{swarmauri_parser_keywordextractor-0.9.0.dev4 → swarmauri_parser_keywordextractor-0.9.0.dev33}/swarmauri_parser_keywordextractor/__init__.py RENAMED Viewed

File without changes

swarmauri_parser_keywordextractor 0.9.0.dev4__tar.gz → 0.9.0.dev33__tar.gz

swarmauri_parser_keywordextractor 0.9.0.dev4tar.gz → 0.9.0.dev33tar.gz