PyPI - fast-sentence-segment - Versions diffs - 1.2.0__py3-none-any.whl → 1.2.1__py3-none-any.whl - Mend

fast-sentence-segment 1.2.0__py3-none-any.whl → 1.2.1__py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (5)

{fast_sentence_segment-1.2.0.dist-info → fast_sentence_segment-1.2.1.dist-info}/METADATA RENAMED

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: fast-sentence-segment
-Version: 1.2.0
+Version: 1.2.1
 Summary: Fast and Efficient Sentence Segmentation
 License: MIT
 License-File: LICENSE
@@ -67,12 +67,21 @@ python -m spacy download en_core_web_sm
 ```python
 from fast_sentence_segment import segment_text
-text = "Here is a Dr. who says something. And then again, what else? I don't know. Do you?"
+text = "Do you like Dr. Who? I prefer Dr. Strange! Mr. T is also cool."
-results = segment_text(text)
-# Returns: [['Here is a Dr. who says something.', 'And then again, what else?', "I don't know.", 'Do you?']]
+results = segment_text(text, flatten=True)
 ```
+```json
+[
+  "Do you like Dr. Who?",
+  "I prefer Dr. Strange!",
+  "Mr. T is also cool."
+]
+```
+Notice how "Dr. Who?" stays together as a single sentence—the library correctly recognizes that a title followed by a single-word name ending in `?` or `!` is a name reference, not a sentence boundary.
 ## Usage
 ### Basic Segmentation
@@ -82,16 +91,24 @@ The `segment_text` function returns a list of lists, where each inner list repre
 ```python
 from fast_sentence_segment import segment_text
-text = """First paragraph here. It has two sentences.
+text = """Gandalf spoke softly. "All we have to decide is what to do with the time given us."
-Second paragraph starts here. This one also has multiple sentences. And a third."""
+Frodo nodded. The weight of the Ring pressed against his chest."""
 results = segment_text(text)
-# Returns:
-# [
-#     ['First paragraph here.', 'It has two sentences.'],
-#     ['Second paragraph starts here.', 'This one also has multiple sentences.', 'And a third.']
-# ]
+```
+```json
+[
+  [
+    "Gandalf spoke softly.",
+    "\"All we have to decide is what to do with the time given us.\"."
+  ],
+  [
+    "Frodo nodded.",
+    "The weight of the Ring pressed against his chest."
+  ]
+]
 ```
 ### Flattened Output
@@ -99,8 +116,17 @@ results = segment_text(text)
 If you don't need paragraph boundaries, use the `flatten` parameter:
 ```python
+text = "At 9 a.m. the hobbits set out. By 3 p.m. they reached Rivendell. Mr. Frodo was exhausted."
 results = segment_text(text, flatten=True)
-# Returns: ['First paragraph here.', 'It has two sentences.', 'Second paragraph starts here.', ...]
+```
+```json
+[
+  "At 9 a.m. the hobbits set out.",
+  "By 3 p.m. they reached Rivendell.",
+  "Mr. Frodo was exhausted."
+]
 ```
 ### Direct Segmenter Access
@@ -120,16 +146,28 @@ Segment text directly from the terminal:
 ```bash
 # Direct text input
-segment "Hello world. How are you? I am fine."
+echo "Have you seen Dr. Who? It's brilliant!" | segment
+```
+```
+Have you seen Dr. Who?
+It's brilliant!
+```
+```bash
 # Numbered output
-segment -n "First sentence. Second sentence."
+segment -n "Gandalf paused... You shall not pass! The Balrog roared."
+```
-# From stdin
-echo "Some text here. Another sentence." | segment
+```
+1. Gandalf paused...
+2. You shall not pass!
+3. The Balrog roared.
+```
+```bash
 # From file
-segment -f document.txt
+segment -f silmarillion.txt
 ```
 ## API Reference
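
The README text added in this hunk describes a rule: a title followed by a single-word name ending in `?` or `!` is treated as a name reference, not a sentence boundary. The sketch below shows one way such a rule could work as a post-processing step over fragments from a naive splitter. It is an illustration only, not the code in the package's `title_name_merger.py`; the `TITLES` tuple, the `SINGLE_NAME` pattern, and the `merge_title_names` function are hypothetical names chosen for this example.

```python
import re

# Hypothetical title list; the library's actual list is not shown in this diff.
TITLES = ("Dr.", "Mr.", "Mrs.", "Ms.", "Prof.")

# A single capitalized word followed by "?" or "!", e.g. "Who?" or "Strange!".
SINGLE_NAME = re.compile(r"^[A-Z][\w'-]*[?!]$")


def merge_title_names(fragments):
    """Rejoin a fragment ending in a title with a following one-word name."""
    merged = []
    for fragment in fragments:
        # Merge only when the previous fragment ends in a title and the
        # current fragment is exactly one name-like word ending in ? or !.
        if merged and merged[-1].endswith(TITLES) and SINGLE_NAME.match(fragment):
            merged[-1] = f"{merged[-1]} {fragment}"
        else:
            merged.append(fragment)
    return merged


# Fragments as a naive splitter might produce them for the README's example text.
fragments = ["Do you like Dr.", "Who?", "I prefer Dr.", "Strange!", "Mr. T is also cool."]
print(merge_title_names(fragments))
# ['Do you like Dr. Who?', 'I prefer Dr. Strange!', 'Mr. T is also cool.']
```

Merging after the split keeps the rule separate from the underlying splitter, at the cost of depending on the splitter breaking after the title in the first place.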

{fast_sentence_segment-1.2.0.dist-info → fast_sentence_segment-1.2.1.dist-info}/RECORD RENAMED

@@ -20,8 +20,8 @@ fast_sentence_segment/dmo/title_name_merger.py,sha256=zbG04_VjwM8TtT8LhavvmZqIZL
 fast_sentence_segment/svc/__init__.py,sha256=9B12mXxBnlalH4OAm1AMLwUMa-RLi2ilv7qhqv26q7g,144
 fast_sentence_segment/svc/perform_paragraph_segmentation.py,sha256=zLKw9rSzb0NNfx4MyEeoGrHwhxTtH5oDrYcAL2LMVHY,1378
 fast_sentence_segment/svc/perform_sentence_segmentation.py,sha256=dqGxFsJoP6ox_MJwtB85R9avEbBAR4x9YKaRaQ5fAXo,5723
-fast_sentence_segment-1.2.0.dist-info/METADATA,sha256=05V3aFKHCD9JaYN8va_vIuMtaoAbGmKgFAOUDJWfM80,6405
-fast_sentence_segment-1.2.0.dist-info/WHEEL,sha256=zp0Cn7JsFoX2ATtOhtaFYIiE2rmFAD4OcMhtUki8W3U,88
-fast_sentence_segment-1.2.0.dist-info/entry_points.txt,sha256=mDiRuKOZlOeqmtH1eZwqGEGM6KUh0RTzwyETGMpxSDI,58
-fast_sentence_segment-1.2.0.dist-info/licenses/LICENSE,sha256=vou5JCLAT5nHcsUv-AkjUYAihYfN9mwPDXxV2DHyHBo,1067
-fast_sentence_segment-1.2.0.dist-info/RECORD,,
+fast_sentence_segment-1.2.1.dist-info/METADATA,sha256=OsUlH-UhmI6fw-ChvsF83G_WwTXBlhZPINo243CaziQ,6889
+fast_sentence_segment-1.2.1.dist-info/WHEEL,sha256=zp0Cn7JsFoX2ATtOhtaFYIiE2rmFAD4OcMhtUki8W3U,88
+fast_sentence_segment-1.2.1.dist-info/entry_points.txt,sha256=mDiRuKOZlOeqmtH1eZwqGEGM6KUh0RTzwyETGMpxSDI,58
+fast_sentence_segment-1.2.1.dist-info/licenses/LICENSE,sha256=vou5JCLAT5nHcsUv-AkjUYAihYfN9mwPDXxV2DHyHBo,1067
+fast_sentence_segment-1.2.1.dist-info/RECORD,,
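
Each RECORD line above has the form `path,sha256=<digest>,<size>`, where the digest is the URL-safe base64 encoding of the SHA-256 hash with the trailing `=` padding removed, and the line for RECORD itself leaves both fields empty. In this hunk only the METADATA entry changes hash and size (6405 to 6889 bytes); the WHEEL, entry_points.txt, and LICENSE entries keep the same hashes under the renamed directory, which agrees with the "File without changes" notes for those files. The sketch below shows how such an entry can be parsed and checked against file contents. The helper names `record_hash`, `parse_record_line`, and `entry_matches` are hypothetical and chosen for this example.

```python
import base64
import csv
import hashlib
import io


def record_hash(data: bytes) -> str:
    """Return the hash field RECORD would store for these bytes."""
    digest = hashlib.sha256(data).digest()
    # RECORD uses URL-safe base64 with the "=" padding stripped.
    return "sha256=" + base64.urlsafe_b64encode(digest).rstrip(b"=").decode("ascii")


def parse_record_line(line: str):
    """Split one RECORD line into (path, hash_field, size or None)."""
    path, hash_field, size = next(csv.reader(io.StringIO(line)))
    return path, hash_field, int(size) if size else None


def entry_matches(data: bytes, hash_field: str, size) -> bool:
    """Check file contents against the hash and size from a RECORD entry."""
    return record_hash(data) == hash_field and len(data) == size


# An entry copied from the hunk above; the file contents are not part of
# this diff, so only the parsing step is shown for it.
line = ("fast_sentence_segment-1.2.1.dist-info/WHEEL,"
        "sha256=zp0Cn7JsFoX2ATtOhtaFYIiE2rmFAD4OcMhtUki8W3U,88")
print(parse_record_line(line))

# Round trip on in-memory bytes to show the check itself.
payload = b"example contents\n"
print(entry_matches(payload, record_hash(payload), len(payload)))
```

To check a real wheel, the bytes for each path could be read with the standard `zipfile` module and passed to `entry_matches`.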

{fast_sentence_segment-1.2.0.dist-info → fast_sentence_segment-1.2.1.dist-info}/WHEEL RENAMED

File without changes

{fast_sentence_segment-1.2.0.dist-info → fast_sentence_segment-1.2.1.dist-info}/entry_points.txt RENAMED

File without changes

{fast_sentence_segment-1.2.0.dist-info → fast_sentence_segment-1.2.1.dist-info}/licenses/LICENSE RENAMED

File without changes
