PyPI - ttsforge - Versions diffs - 0.1.0__tar.gz → 0.1.1__tar.gz - Mend

ttsforge 0.1.0tar.gz → 0.1.1tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (79) hide show

{ttsforge-0.1.0 → ttsforge-0.1.1}/.codecrate.toml RENAMED Viewed

@@ -1,8 +1,9 @@
 [codecrate]
 output = "context_ttsforge.md"
 keep_docstrings = true
-dedupe = true
+dedupe = false
 metadata = false
+manifest = false
 respect_gitignore = true
 exclude = ["*/.venv/*"]
 include = ["**/*.py", "**/*.toml", "**/*.rst", "**/*.md"]

{ttsforge-0.1.0 → ttsforge-0.1.1}/.gitignore RENAMED Viewed

@@ -216,7 +216,6 @@ onnx/
 *.m4a
 # Test/demo scripts at project root
-test_*.py
 demo_*.py
 # Binary data files

{ttsforge-0.1.0 → ttsforge-0.1.1}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: ttsforge
-Version: 0.1.0
+Version: 0.1.1
 Summary: Generate audiobooks from EPUB files using Kokoro ONNX TTS.
 Author-email: Holger Nahrstaedt <nahrstaedt@gmail.com>
 License: MIT License
@@ -396,14 +396,14 @@ SSMD files use a simple markdown-like syntax:
 **Custom Phonemes**:
 ```
-[Hermione](ph: /hɝmˈIni/)    # Override pronunciation
-[API](ph: /ˌeɪpiˈaɪ/)        # Technical terms
+[Hermione]{ph="hɝmˈIni"}    # Override pronunciation
+[API]{ph="ˌeɪpiˈaɪ"}        # Technical terms
 ```
 **Language Switching** (planned):
 ```
-[Bonjour](fr)    # Mark text as French
+[Bonjour]{lang="fr"}    # Mark text as French
 ```
 #### Example SSMD File
@@ -411,7 +411,7 @@ SSMD files use a simple markdown-like syntax:
 ```ssmd
 Chapter One ...p
-[Harry](ph: /hæɹi/) Potter was a *highly unusual* boy in many ways. ...s
+[Harry]{ph="hæɹi"} Potter was a *highly unusual* boy in many ways. ...s
 For one thing, he **hated** the summer holidays more than any other
 time of year. ...s For another, he really wanted to do his homework,
 but was forced to do it in secret, in the dead of the night. ...p
@@ -498,12 +498,12 @@ Edit `custom_phonemes.json` to fix any incorrect phonemes. The file format is:
   },
   "entries": {
     "Hermione": {
-      "phoneme": "/hɝmˈIni/",
+      "phoneme": "hɝmˈIni",
       "occurrences": 847,
       "verified": false
     },
     "Kubernetes": {
-      "phoneme": "/kubɚnˈɛtɪs/",
+      "phoneme": "kubɚnˈɛtɪs",
       "occurrences": 12,
       "verified": false
     }
@@ -515,8 +515,8 @@ Or use the simple format:
 ```json
 {
-  "Hermione": "/hɝmˈIni/",
-  "Kubernetes": "/kubɚnˈɛtɪs/"
+  "Hermione": "hɝmˈIni",
+  "Kubernetes": "kubɚnˈɛtɪs"
 }
 ```
@@ -548,9 +548,9 @@ You can create a dictionary manually without extraction:
 ```json
 {
-  "Katniss": "/kætnɪs/",
-  "Peeta": "/pitə/",
-  "Panem": "/pænəm/"
+  "Katniss": "kætnɪs",
+  "Peeta": "pitə",
+  "Panem": "pænəm"
 }
 ```

{ttsforge-0.1.0 → ttsforge-0.1.1}/README.md RENAMED Viewed

@@ -333,14 +333,14 @@ SSMD files use a simple markdown-like syntax:
 **Custom Phonemes**:
 ```
-[Hermione](ph: /hɝmˈIni/)    # Override pronunciation
-[API](ph: /ˌeɪpiˈaɪ/)        # Technical terms
+[Hermione]{ph="hɝmˈIni"}    # Override pronunciation
+[API]{ph="ˌeɪpiˈaɪ"}        # Technical terms
 ```
 **Language Switching** (planned):
 ```
-[Bonjour](fr)    # Mark text as French
+[Bonjour]{lang="fr"}    # Mark text as French
 ```
 #### Example SSMD File
@@ -348,7 +348,7 @@ SSMD files use a simple markdown-like syntax:
 ```ssmd
 Chapter One ...p
-[Harry](ph: /hæɹi/) Potter was a *highly unusual* boy in many ways. ...s
+[Harry]{ph="hæɹi"} Potter was a *highly unusual* boy in many ways. ...s
 For one thing, he **hated** the summer holidays more than any other
 time of year. ...s For another, he really wanted to do his homework,
 but was forced to do it in secret, in the dead of the night. ...p
@@ -435,12 +435,12 @@ Edit `custom_phonemes.json` to fix any incorrect phonemes. The file format is:
   },
   "entries": {
     "Hermione": {
-      "phoneme": "/hɝmˈIni/",
+      "phoneme": "hɝmˈIni",
       "occurrences": 847,
       "verified": false
     },
     "Kubernetes": {
-      "phoneme": "/kubɚnˈɛtɪs/",
+      "phoneme": "kubɚnˈɛtɪs",
       "occurrences": 12,
       "verified": false
     }
@@ -452,8 +452,8 @@ Or use the simple format:
 ```json
 {
-  "Hermione": "/hɝmˈIni/",
-  "Kubernetes": "/kubɚnˈɛtɪs/"
+  "Hermione": "hɝmˈIni",
+  "Kubernetes": "kubɚnˈɛtɪs"
 }
 ```
@@ -485,9 +485,9 @@ You can create a dictionary manually without extraction:
 ```json
 {
-  "Katniss": "/kætnɪs/",
-  "Peeta": "/pitə/",
-  "Panem": "/pænəm/"
+  "Katniss": "kætnɪs",
+  "Peeta": "pitə",
+  "Panem": "pænəm"
 }
 ```

{ttsforge-0.1.0 → ttsforge-0.1.1}/docs/api/index.rst RENAMED Viewed

@@ -60,10 +60,6 @@ Utilities
 **ttsforge.vocab**
    Vocabulary utilities and metadata.
-**ttsforge.trim**
-   Audio trimming utilities for silence removal.
 Quick API Examples
 ------------------
@@ -80,9 +76,9 @@ Basic Text-to-Speech
        voice="af_heart",
        speed=1.0,
        use_gpu=False,
-       pause_clause=0.25,
-       pause_sentence=0.2,
-       pause_paragraph=0.75,
+       pause_clause=0.3,
+       pause_sentence=0.5,
+       pause_paragraph=0.9,
        pause_variance=0.05,
    )
    runner = KokoroRunner(opts, log=print)
@@ -242,8 +238,3 @@ Auto-generated API Documentation
    :members:
    :undoc-members:
    :show-inheritance:
-.. automodule:: ttsforge.trim
-   :members:
-   :undoc-members:
-   :show-inheritance:

{ttsforge-0.1.0 → ttsforge-0.1.1}/docs/ssmd.rst RENAMED Viewed

@@ -101,15 +101,15 @@ Override pronunciation using IPA phonemes:
 .. code-block:: ssmd
-   [word](ph: /phoneme/)
+   [word]{ph="phoneme"}
 Examples:
 .. code-block:: ssmd
-   [Hermione](ph: /hɝmˈIni/) Granger was Harry's best friend. ...s
-   The [API](ph: /ˌeɪpiˈaɪ/) supports [JSON](ph: /dʒˈeɪsɑn/). ...s
-   [Kubernetes](ph: /kubɚnˈɛtɪs/) is a container orchestrator. ...s
+   [Hermione]{ph="hɝmˈIni"} Granger was Harry's best friend. ...s
+   The [API]{ph="ˌeɪpiˈaɪ"} supports [JSON]{ph="dʒˈeɪsɑn"}. ...s
+   [Kubernetes]{ph="kubɚnˈɛtɪs"} is a container orchestrator. ...s
 Language Switching (Planned)
@@ -119,8 +119,8 @@ Mark text as a different language (placeholder for future):
 .. code-block:: ssmd
-   [Bonjour](fr)      # French text
-   [Hola](es)         # Spanish text
+   [Bonjour]{lang="fr"}      # French text
+   [Hola]{lang="es"}         # Spanish text
 Complete Example
@@ -132,14 +132,14 @@ Here's a complete SSMD file example:
    Chapter One ...p
-   [Harry](ph: /hæɹi/) Potter was a *highly unusual* boy in many ways. ...s
+   [Harry]{ph="hæɹi"} Potter was a *highly unusual* boy in many ways. ...s
    For one thing, he **hated** the summer holidays more than any other
    time of year. ...s For another, he really wanted to do his homework,
    but was forced to do it in secret, in the dead of the night. ...p
    And he also happened to be a wizard. ...p
-   The [Dursleys](ph: /dɝzliz/) had everything they wanted, but they
+   The [Dursleys]{ph="dɝzliz"} had everything they wanted, but they
    also had a secret. ...s And their greatest fear was that somebody
    would discover it. ...p
@@ -163,7 +163,7 @@ The generated SSMD will include:
 .. code-block:: ssmd
-   [Hermione](ph: /hɝmˈIni/) loved reading books. ...s
+   [Hermione]{ph="hɝmˈIni"} loved reading books. ...s
 HTML Emphasis Detection

ttsforge-0.1.1/tests/test_chapter_marker_leading_space.py ADDED Viewed

@@ -0,0 +1,88 @@
+"""Test chapter marker removal with leading whitespace."""
+import re
+class TestChapterMarkerLeadingWhitespace:
+    """Test that chapter markers are removed even with leading whitespace."""
+    def test_marker_with_no_leading_space(self):
+        """Test normal case - marker at start of line."""
+        text = "<<CHAPTER: Test Chapter>>\n\nThis is the content."
+        pattern = r"^<<CHAPTER:[^>]*>>\s*\n*"
+        result = re.sub(pattern, "", text, count=1, flags=re.MULTILINE)
+        assert result == "This is the content."
+    def test_marker_with_leading_space(self):
+        """Test marker with a leading space - should be removed with new pattern."""
+        text = " <<CHAPTER: Test Chapter>>\n\nThis is the content."
+        # New pattern handles leading whitespace
+        pattern = r"^\s*<<CHAPTER:[^>]*>>\s*\n*"
+        result = re.sub(pattern, "", text, count=1, flags=re.MULTILINE)
+        assert result == "This is the content."
+    def test_marker_with_leading_tabs(self):
+        """Test marker with leading tabs - should be removed with new pattern."""
+        text = "\t<<CHAPTER: Test Chapter>>\n\nThis is the content."
+        pattern = r"^\s*<<CHAPTER:[^>]*>>\s*\n*"
+        result = re.sub(pattern, "", text, count=1, flags=re.MULTILINE)
+        assert result == "This is the content."
+    def test_marker_with_multiple_spaces(self):
+        """Test marker with multiple leading spaces -
+        should be removed with new pattern."""
+        text = "   <<CHAPTER: Test Chapter>>\n\nThis is the content."
+        pattern = r"^\s*<<CHAPTER:[^>]*>>\s*\n*"
+        result = re.sub(pattern, "", text, count=1, flags=re.MULTILINE)
+        assert result == "This is the content."
+    def test_improved_pattern_handles_leading_whitespace(self):
+        """Test that improved pattern handles all leading whitespace cases."""
+        # Improved pattern that handles leading whitespace
+        improved_pattern = r"^\s*<<CHAPTER:[^>]*>>\s*\n*"
+        test_cases = [
+            ("<<CHAPTER: Test>>\n\nContent", "Content"),
+            (" <<CHAPTER: Test>>\n\nContent", "Content"),
+            ("\t<<CHAPTER: Test>>\n\nContent", "Content"),
+            ("   <<CHAPTER: Test>>\n\nContent", "Content"),
+            (" \t <<CHAPTER: Test>>\n\nContent", "Content"),
+        ]
+        for text, expected in test_cases:
+            result = re.sub(improved_pattern, "", text, count=1, flags=re.MULTILINE)
+            assert result == expected, f"Failed for input: {repr(text)}"
+    def test_marker_not_at_line_start_still_removed_with_multiline(self):
+        """Test that marker after newline is removed (MULTILINE mode)."""
+        text = "Some text\n<<CHAPTER: Test>>\n\nContent"
+        improved_pattern = r"^\s*<<CHAPTER:[^>]*>>\s*\n*"
+        result = re.sub(improved_pattern, "", text, count=1, flags=re.MULTILINE)
+        assert result == "Some text\nContent"
+    def test_only_first_marker_removed(self):
+        """Test that only the first marker is removed (count=1)."""
+        text = "<<CHAPTER: One>>\n\nSome text <<CHAPTER: Two>> inside it."
+        improved_pattern = r"^\s*<<CHAPTER:[^>]*>>\s*\n*"
+        result = re.sub(improved_pattern, "", text, count=1, flags=re.MULTILINE)
+        assert result == "Some text <<CHAPTER: Two>> inside it."
+    def test_real_world_epub_scenario(self):
+        """Test realistic epub2text output with potential whitespace issues."""
+        # Simulate what epub2text might return with whitespace quirks
+        epub_content = " <<CHAPTER: THE STORY SO FAR>>\n\nIn the shadow of the Apt..."
+        # Old pattern (fails)
+        old_pattern = r"^<<CHAPTER:[^>]*>>\s*\n*"
+        old_result = re.sub(old_pattern, "", epub_content, count=1, flags=re.MULTILINE)
+        # New pattern (works)
+        new_pattern = r"^\s*<<CHAPTER:[^>]*>>\s*\n*"
+        new_result = re.sub(new_pattern, "", epub_content, count=1, flags=re.MULTILINE)
+        # Verify old pattern fails to remove marker
+        assert "<<CHAPTER:" in old_result, "Old pattern should fail with leading space"
+        # Verify new pattern successfully removes marker
+        assert "<<CHAPTER:" not in new_result, "New pattern should remove marker"
+        assert new_result == "In the shadow of the Apt..."

ttsforge-0.1.1/tests/test_chapter_selection.py ADDED Viewed

@@ -0,0 +1,20 @@
+import pytest
+from ttsforge.chapter_selection import parse_chapter_selection
+def test_parse_all() -> None:
+    assert parse_chapter_selection("all", 5) == [0, 1, 2, 3, 4]
+def test_parse_ranges_and_commas() -> None:
+    assert parse_chapter_selection("1-3,5", 6) == [0, 1, 2, 4]
+def test_parse_open_ended_range() -> None:
+    assert parse_chapter_selection("3-", 5) == [2, 3, 4]
+def test_parse_invalid_range() -> None:
+    with pytest.raises(ValueError):
+        parse_chapter_selection("5-2", 6)

ttsforge-0.1.1/tests/test_cli_smoke.py ADDED Viewed

@@ -0,0 +1,27 @@
+from pathlib import Path
+from click.testing import CliRunner
+from ttsforge.cli import main
+def test_info_and_list_smoke(tmp_path: Path) -> None:
+    text = """Title: Sample Book
+Author: Jane Doe
+Language: English
+CHAPTER I
+This is the first chapter.
+CHAPTER II
+This is the second chapter.
+"""
+    input_file = tmp_path / "sample.txt"
+    input_file.write_text(text, encoding="utf-8")
+    runner = CliRunner()
+    info_result = runner.invoke(main, ["info", str(input_file)])
+    assert info_result.exit_code == 0
+    list_result = runner.invoke(main, ["list", str(input_file)])
+    assert list_result.exit_code == 0

ttsforge-0.1.1/tests/test_conversion_state.py ADDED Viewed

@@ -0,0 +1,84 @@
+import json
+from pathlib import Path
+from ttsforge.conversion import ChapterState, ConversionState
+def test_conversion_state_roundtrip(tmp_path: Path) -> None:
+    state = ConversionState(
+        source_file="book.epub",
+        source_hash="abc123",
+        output_file="book.m4b",
+        work_dir=str(tmp_path),
+        voice="af_heart",
+        language="a",
+        speed=1.0,
+        split_mode="auto",
+        output_format="m4b",
+        chapters=[
+            ChapterState(
+                index=0,
+                title="Chapter 1",
+                content_hash="hash",
+                completed=True,
+                audio_file="chapter_001.wav",
+                duration=1.2,
+                char_count=100,
+                ssmd_file="chapter_001.ssmd",
+                ssmd_hash="ssmdhash",
+            )
+        ],
+        started_at="2024-01-01 00:00:00",
+    )
+    state_file = tmp_path / "state.json"
+    state.save(state_file)
+    loaded = ConversionState.load(state_file)
+    assert loaded is not None
+    assert loaded.voice == "af_heart"
+    assert loaded.chapters[0].audio_file == "chapter_001.wav"
+    assert loaded.chapters[0].completed is True
+    assert not (tmp_path / "state.json.tmp").exists()
+def test_conversion_state_backward_compat(tmp_path: Path) -> None:
+    data = {
+        "version": 1,
+        "source_file": "book.epub",
+        "source_hash": "hash",
+        "output_file": "book.m4b",
+        "work_dir": str(tmp_path),
+        "voice": "af_heart",
+        "language": "a",
+        "speed": 1.0,
+        "split_mode": "auto",
+        "output_format": "m4b",
+        "chapters": [
+            {
+                "index": 0,
+                "title": "Chapter 1",
+                "content_hash": "hash",
+                "completed": False,
+                "audio_file": None,
+                "duration": 0.0,
+                "char_count": 10,
+                "ssmd_file": None,
+                "ssmd_hash": None,
+            }
+        ],
+        "segment_pause_min": 0.1,
+        "segment_pause_max": 0.3,
+        "paragraph_pause_min": 0.5,
+        "paragraph_pause_max": 1.0,
+    }
+    state_file = tmp_path / "legacy_state.json"
+    state_file.write_text(json.dumps(data), encoding="utf-8")
+    loaded = ConversionState.load(state_file)
+    assert loaded is not None
+    assert loaded.pause_sentence == 0.2
+    assert loaded.pause_paragraph == 0.75
+    assert loaded.pause_clause == 0.3
+    assert loaded.pause_variance >= 0.01

{ttsforge-0.1.0 → ttsforge-0.1.1}/tests/test_phoneme_dictionary.py RENAMED Viewed

@@ -13,8 +13,8 @@ class TestPhonemeDictionary:
     def test_load_simple_dictionary(self):
         """Test loading a simple phoneme dictionary."""
         test_dict = {
-            "Misaki": "/misˈɑki/",
-            "Kubernetes": "/kubɚnˈɛtɪs/",
+            "Misaki": "misˈɑki",
+            "Kubernetes": "kubɚnˈɛtɪs",
         }
         with tempfile.NamedTemporaryFile(mode="w", suffix=".json", delete=False) as f:
@@ -40,8 +40,8 @@ class TestPhonemeDictionary:
                 "language": "en-us",
             },
             "entries": {
-                "Misaki": {"phoneme": "/misˈɑki/", "occurrences": 42},
-                "nginx": {"phoneme": "/ˈɛnʤɪnˈɛks/", "occurrences": 8},
+                "Misaki": {"phoneme": "misˈɑki", "occurrences": 42},
+                "nginx": {"phoneme": "ˈɛnʤɪnˈɛks", "occurrences": 8},
             },
         }
@@ -67,8 +67,8 @@ class TestPhonemeDictionary:
         """Test loading dictionary with metadata format but simple string values."""
         test_dict = {
             "entries": {
-                "Misaki": "/misˈɑki/",
-                "nginx": "/ˈɛnʤɪnˈɛks/",
+                "Misaki": "misˈɑki",
+                "nginx": "ˈɛnʤɪnˈɛks",
             }
         }
@@ -112,8 +112,8 @@ class TestPhonemeDictionary:
     def test_phonemize_with_dictionary(self):
         """Test phonemization with custom dictionary - through SSMD notation."""
         test_dict = {
-            "Misaki": "/misˈɑki/",
-            "Kubernetes": "/kubɚnˈɛtɪs/",
+            "Misaki": "misˈɑki",
+            "Kubernetes": "kubɚnˈɛtɪs",
         }
         with tempfile.NamedTemporaryFile(mode="w", suffix=".json", delete=False) as f:
@@ -130,8 +130,8 @@ class TestPhonemeDictionary:
             ssmd_text = tokenizer._phoneme_dictionary_obj.apply(text)
             # Verify SSMD notation is applied
-            assert "[Misaki]{ph=" in ssmd_text or "[Misaki](ph:" in ssmd_text
-            assert "[Kubernetes]{ph=" in ssmd_text or "[Kubernetes](ph:" in ssmd_text
+            assert "[Misaki]{ph=" in ssmd_text
+            assert "[Kubernetes]{ph=" in ssmd_text
         finally:
             Path(temp_path).unlink()
@@ -162,7 +162,7 @@ class TestPhonemeDictionary:
     def test_case_sensitive_matching(self):
         """Test case-sensitive dictionary matching."""
-        test_dict = {"Misaki": "/misˈɑki/"}
+        test_dict = {"Misaki": "misˈɑki"}
         with tempfile.NamedTemporaryFile(mode="w", suffix=".json", delete=False) as f:
             json.dump(test_dict, f)
@@ -184,13 +184,13 @@ class TestPhonemeDictionary:
             assert phoneme_count == 1, f"Expected 1 match, got {phoneme_count}"
             # Verify it's "Misaki" that matched
-            assert "[Misaki]{ph=" in ssmd_text or "[Misaki](ph:" in ssmd_text
+            assert "[Misaki]{ph=" in ssmd_text
         finally:
             Path(temp_path).unlink()
     def test_word_boundaries(self):
         """Test that word boundaries are respected."""
-        test_dict = {"test": "/tˈɛst/"}
+        test_dict = {"test": "tˈɛst"}
         with tempfile.NamedTemporaryFile(mode="w", suffix=".json", delete=False) as f:
             json.dump(test_dict, f)
@@ -246,7 +246,7 @@ class TestPhonemeDictionary:
     def test_special_characters_in_words(self):
         """Test dictionary words with special regex characters (periods, etc.)."""
         # Use a simple word that can be phonemized
-        test_dict = {"Misaki": "/misˈɑki/"}
+        test_dict = {"Misaki": "misˈɑki"}
         with tempfile.NamedTemporaryFile(mode="w", suffix=".json", delete=False) as f:
             json.dump(test_dict, f)
@@ -262,13 +262,13 @@ class TestPhonemeDictionary:
             ssmd_text = tokenizer._phoneme_dictionary_obj.apply(text)
             # Should use custom phoneme
-            assert "[Misaki]{ph=" in ssmd_text or "[Misaki](ph:" in ssmd_text
+            assert "[Misaki]{ph=" in ssmd_text
         finally:
             Path(temp_path).unlink()
     def test_multiple_occurrences(self):
         """Test that all occurrences of a word are replaced."""
-        test_dict = {"test": "/tˈɛst/"}
+        test_dict = {"test": "tˈɛst"}
         with tempfile.NamedTemporaryFile(mode="w", suffix=".json", delete=False) as f:
             json.dump(test_dict, f)
@@ -293,8 +293,8 @@ class TestPhonemeDictionary:
         # Note: Multi-word phoneme annotations have limitations in kokorog2p's
         # markdown processing. Testing with overlapping single words instead.
         test_dict = {
-            "testing": "/tˈɛstɪŋ/",
-            "test": "/tˈɛst/",  # Shorter word, different pronunciation
+            "testing": "tˈɛstɪŋ",
+            "test": "tˈɛst",  # Shorter word, different pronunciation
         }
         with tempfile.NamedTemporaryFile(mode="w", suffix=".json", delete=False) as f:

ttsforge-0.1.1/tests/test_ssmd_generator.py ADDED Viewed

@@ -0,0 +1,25 @@
+from ttsforge.ssmd_generator import chapter_to_ssmd
+def test_emphasis_repeated_phrases() -> None:
+    html = "This is <em>very</em> good. This is <em>very</em> good."
+    text = "This is very good. This is very good."
+    ssmd = chapter_to_ssmd(
+        chapter_title="",
+        chapter_text=text,
+        html_content=html,
+        include_title=False,
+    )
+    assert ssmd.count("*very*") == 2
+def test_emphasis_with_punctuation() -> None:
+    html = "Wait, <strong>now</strong>."
+    text = "Wait, now."
+    ssmd = chapter_to_ssmd(
+        chapter_title="",
+        chapter_text=text,
+        html_content=html,
+        include_title=False,
+    )
+    assert "**now**" in ssmd

{ttsforge-0.1.0 → ttsforge-0.1.1}/ttsforge/_version.py RENAMED Viewed

@@ -28,7 +28,7 @@ version_tuple: VERSION_TUPLE
 commit_id: COMMIT_ID
 __commit_id__: COMMIT_ID
-__version__ = version = '0.1.0'
-__version_tuple__ = version_tuple = (0, 1, 0)
+__version__ = version = '0.1.1'
+__version_tuple__ = version_tuple = (0, 1, 1)
-__commit_id__ = commit_id = 'gac9dbcdb5'
+__commit_id__ = commit_id = 'g08367e850'

{ttsforge-0.1.0 → ttsforge-0.1.1}/ttsforge/cli/commands_conversion.py RENAMED Viewed

@@ -536,17 +536,17 @@ def convert(  # noqa: C901
         pause_clause=(
             pause_clause
             if pause_clause is not None
-            else config.get("pause_clause", 0.25)
+            else config.get("pause_clause", 0.3)
         ),
         pause_sentence=(
             pause_sentence
             if pause_sentence is not None
-            else config.get("pause_sentence", 0.2)
+            else config.get("pause_sentence", 0.5)
         ),
         pause_paragraph=(
             pause_paragraph
             if pause_paragraph is not None
-            else config.get("pause_paragraph", 0.75)
+            else config.get("pause_paragraph", 0.9)
         ),
         pause_variance=(
             pause_variance
@@ -1369,17 +1369,17 @@ def read(  # noqa: C901
         effective_split_mode = config_split_mode
     # Pause settings
     effective_pause_clause = (
-        pause_clause if pause_clause is not None else config.get("pause_clause", 0.25)
+        pause_clause if pause_clause is not None else config.get("pause_clause", 0.3)
     )
     effective_pause_sentence = (
         pause_sentence
         if pause_sentence is not None
-        else config.get("pause_sentence", 0.2)
+        else config.get("pause_sentence", 0.5)
     )
     effective_pause_paragraph = (
         pause_paragraph
         if pause_paragraph is not None
-        else config.get("pause_paragraph", 0.75)
+        else config.get("pause_paragraph", 0.9)
     )
     effective_pause_variance = (
         pause_variance
@@ -1695,7 +1695,6 @@ def read(  # noqa: C901
         def generate_audio(text_segment: str) -> tuple[np.ndarray, int]:
             """Generate audio for a text segment."""
-            print(text_segment)
             result = pipeline.run(text_segment)
             return result.audio, result.sample_rate

{ttsforge-0.1.0 → ttsforge-0.1.1}/ttsforge/cli/commands_phonemes.py RENAMED Viewed

@@ -603,17 +603,17 @@ def phonemes_convert(
         pause_clause=(
             pause_clause
             if pause_clause is not None
-            else config.get("pause_clause", 0.25)
+            else config.get("pause_clause", 0.3)
         ),
         pause_sentence=(
             pause_sentence
             if pause_sentence is not None
-            else config.get("pause_sentence", 0.2)
+            else config.get("pause_sentence", 0.5)
         ),
         pause_paragraph=(
             pause_paragraph
             if pause_paragraph is not None
-            else config.get("pause_paragraph", 0.75)
+            else config.get("pause_paragraph", 0.9)
         ),
         pause_variance=(
             pause_variance

{ttsforge-0.1.0 → ttsforge-0.1.1}/ttsforge/constants.py RENAMED Viewed

@@ -123,8 +123,8 @@ DEFAULT_CONFIG = {
     "default_split_mode": "auto",
     "default_content_mode": "chapters",  # Content mode for read: chapters or pages
     "default_page_size": 2000,  # Synthetic page size in characters for pages mode
-    "pause_clause": 0.5,
-    "pause_sentence": 0.7,
+    "pause_clause": 0.3,
+    "pause_sentence": 0.5,
     "pause_paragraph": 0.9,
     "pause_variance": 0.05,
     "pause_mode": "auto",  # "tts", "manual", or "auto

{ttsforge-0.1.0 → ttsforge-0.1.1}/ttsforge/conversion.py RENAMED Viewed

@@ -124,9 +124,9 @@ class ConversionState:
     split_mode: str = "auto"
     output_format: str = "m4b"
     silence_between_chapters: float = 2.0
-    pause_clause: float = 0.25
-    pause_sentence: float = 0.2
-    pause_paragraph: float = 0.75
+    pause_clause: float = 0.3
+    pause_sentence: float = 0.5
+    pause_paragraph: float = 0.9
     pause_variance: float = 0.05
     pause_mode: str = "auto"  # "tts", "manual", or "auto
     lang: str | None = None  # Language override for phonemization
@@ -174,11 +174,11 @@ class ConversionState:
             # Set defaults for new parameters
             if "pause_clause" not in data:
-                data["pause_clause"] = 0.25
+                data["pause_clause"] = 0.3
             if "pause_sentence" not in data:
-                data["pause_sentence"] = 0.2
+                data["pause_sentence"] = 0.5
             if "pause_paragraph" not in data:
-                data["pause_paragraph"] = 0.75
+                data["pause_paragraph"] = 0.9
             if "pause_variance" not in data:
                 data["pause_variance"] = 0.05
             if "pause_mode" not in data:
@@ -289,9 +289,9 @@ class ConversionOptions:
     phoneme_dictionary_path: str | None = None
     phoneme_dict_case_sensitive: bool = False
     # Pause settings (pykokoro built-in pause handling)
-    pause_clause: float = 0.25  # For clause boundaries (commas)
-    pause_sentence: float = 0.2  # For sentence boundaries
-    pause_paragraph: float = 0.75  # For paragraph boundaries
+    pause_clause: float = 0.3  # For clause boundaries (commas)
+    pause_sentence: float = 0.5  # For sentence boundaries
+    pause_paragraph: float = 0.9  # For paragraph boundaries
     pause_variance: float = 0.05  # Standard deviation for natural variation
     pause_mode: str = "auto"  # "tts", "manual", or "auto
     # Chapter announcement settings

{ttsforge-0.1.0 → ttsforge-0.1.1}/ttsforge/name_extractor.py RENAMED Viewed

@@ -174,7 +174,7 @@ def generate_phoneme_suggestions(
         Dictionary with phoneme suggestions and metadata:
         {
             "name": {
-                "phoneme": "/phoneme/",
+                "phoneme": "phoneme",
                 "occurrences": count,
                 "suggestion_quality": "auto"
             }
@@ -190,7 +190,7 @@ def generate_phoneme_suggestions(
             phoneme = phonemize(name, language=language).phonemes
             # Wrap in / / format for dictionary
-            phoneme_formatted = f"/{phoneme}/"
+            phoneme_formatted = f"{phoneme}"
             suggestions[name] = {
                 "phoneme": phoneme_formatted,
@@ -201,7 +201,7 @@ def generate_phoneme_suggestions(
             logger.warning(f"Failed to generate phoneme for '{name}': {e}")
             # Add placeholder
             suggestions[name] = {
-                "phoneme": "/FIXME/",
+                "phoneme": "FIXME",
                 "occurrences": count,
                 "suggestion_quality": "error",
                 "error": str(e),

{ttsforge-0.1.0 → ttsforge-0.1.1}/ttsforge/phoneme_conversion.py RENAMED Viewed

@@ -95,9 +95,9 @@ class PhonemeConversionState:
     speed: float = 1.0
     output_format: str = "m4b"
     silence_between_chapters: float = 2.0
-    pause_clause: float = 0.25
-    pause_sentence: float = 0.2
-    pause_paragraph: float = 0.75
+    pause_clause: float = 0.3
+    pause_sentence: float = 0.5
+    pause_paragraph: float = 0.9
     pause_variance: float = 0.05
     pause_mode: str = "auto"
     lang: str | None = None  # Language override for phonemization
@@ -145,11 +145,11 @@ class PhonemeConversionState:
             # Set defaults for new parameters
             if "pause_clause" not in data:
-                data["pause_clause"] = 0.25
+                data["pause_clause"] = 0.3
             if "pause_sentence" not in data:
-                data["pause_sentence"] = 0.2
+                data["pause_sentence"] = 0.5
             if "pause_paragraph" not in data:
-                data["pause_paragraph"] = 0.75
+                data["pause_paragraph"] = 0.9
             if "pause_variance" not in data:
                 data["pause_variance"] = 0.05
             if "pause_mode" not in data:
@@ -210,9 +210,9 @@ class PhonemeConversionOptions:
     # If None, language from PhonemeSegments is used
     lang: str | None = None
     # Pause settings (pykokoro built-in pause handling)
-    pause_clause: float = 0.25  # For clause boundaries (commas)
-    pause_sentence: float = 0.2  # For sentence boundaries
-    pause_paragraph: float = 0.75  # For paragraph boundaries
+    pause_clause: float = 0.3  # For clause boundaries (commas)
+    pause_sentence: float = 0.5  # For sentence boundaries
+    pause_paragraph: float = 0.9  # For paragraph boundaries
     pause_variance: float = 0.05  # Standard deviation for natural variation
     pause_mode: str = "auto"  # "tts", "manual", or "auto"
     # Chapter announcement settings

{ttsforge-0.1.0 → ttsforge-0.1.1}/ttsforge/ssmd_generator.py RENAMED Viewed

@@ -2,8 +2,8 @@
 This module converts chapter text to SSMD format with markup for:
 - Emphasis (*text* for moderate, **text** for strong)
-- Language switches ([text](lang_code))
-- Phoneme substitutions ([word](ph: /phoneme/))
+- Language switches ([text]{lang="lang_code"})
+- Phoneme substitutions ([word]{ph="phoneme"})
 Note: Structural breaks (paragraphs, sentences, clauses) are NOT automatically
 added. The SSMD parser in pykokoro handles sentence detection automatically.
@@ -170,7 +170,7 @@ def _inject_phoneme_substitutions(
         if not phoneme:
             return matched_word
         clean_phoneme = phoneme.strip("/")
-        return f"[{matched_word}](ph: /{clean_phoneme}/)"
+        return f"[{matched_word}]" + "{" + f'ph="{clean_phoneme}"' + "}"
     segments: list[str] = []
     last_index = 0
@@ -260,7 +260,7 @@ def _strip_redundant_title(chapter_title: str, chapter_text: str) -> str:
         return chapter_text
     trimmed_line = title_pattern.sub("", first_line, count=1).lstrip(
-        " \t:;\-\u2013\u2014"
+        " \t:;-\u2013\u2014"
     )
     if trimmed_line:
         lines[first_idx] = trimmed_line

{ttsforge-0.1.0 → ttsforge-0.1.1}/ttsforge.egg-info/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: ttsforge
-Version: 0.1.0
+Version: 0.1.1
 Summary: Generate audiobooks from EPUB files using Kokoro ONNX TTS.
 Author-email: Holger Nahrstaedt <nahrstaedt@gmail.com>
 License: MIT License
@@ -396,14 +396,14 @@ SSMD files use a simple markdown-like syntax:
 **Custom Phonemes**:
 ```
-[Hermione](ph: /hɝmˈIni/)    # Override pronunciation
-[API](ph: /ˌeɪpiˈaɪ/)        # Technical terms
+[Hermione]{ph="hɝmˈIni"}    # Override pronunciation
+[API]{ph="ˌeɪpiˈaɪ"}        # Technical terms
 ```
 **Language Switching** (planned):
 ```
-[Bonjour](fr)    # Mark text as French
+[Bonjour]{lang="fr"}    # Mark text as French
 ```
 #### Example SSMD File
@@ -411,7 +411,7 @@ SSMD files use a simple markdown-like syntax:
 ```ssmd
 Chapter One ...p
-[Harry](ph: /hæɹi/) Potter was a *highly unusual* boy in many ways. ...s
+[Harry]{ph="hæɹi"} Potter was a *highly unusual* boy in many ways. ...s
 For one thing, he **hated** the summer holidays more than any other
 time of year. ...s For another, he really wanted to do his homework,
 but was forced to do it in secret, in the dead of the night. ...p
@@ -498,12 +498,12 @@ Edit `custom_phonemes.json` to fix any incorrect phonemes. The file format is:
   },
   "entries": {
     "Hermione": {
-      "phoneme": "/hɝmˈIni/",
+      "phoneme": "hɝmˈIni",
       "occurrences": 847,
       "verified": false
     },
     "Kubernetes": {
-      "phoneme": "/kubɚnˈɛtɪs/",
+      "phoneme": "kubɚnˈɛtɪs",
       "occurrences": 12,
       "verified": false
     }
@@ -515,8 +515,8 @@ Or use the simple format:
 ```json
 {
-  "Hermione": "/hɝmˈIni/",
-  "Kubernetes": "/kubɚnˈɛtɪs/"
+  "Hermione": "hɝmˈIni",
+  "Kubernetes": "kubɚnˈɛtɪs"
 }
 ```
@@ -548,9 +548,9 @@ You can create a dictionary manually without extraction:
 ```json
 {
-  "Katniss": "/kætnɪs/",
-  "Peeta": "/pitə/",
-  "Panem": "/pænəm/"
+  "Katniss": "kætnɪs",
+  "Peeta": "pitə",
+  "Panem": "pænəm"
 }
 ```

{ttsforge-0.1.0 → ttsforge-0.1.1}/ttsforge.egg-info/SOURCES.txt RENAMED Viewed

@@ -32,15 +32,20 @@ examples/__init__.py
 examples/phoneme_export.py
 tests/__init__.py
 tests/test_chapter_announcement.py
+tests/test_chapter_marker_leading_space.py
+tests/test_chapter_selection.py
 tests/test_cli.py
+tests/test_cli_smoke.py
 tests/test_constants.py
 tests/test_conversion.py
+tests/test_conversion_state.py
 tests/test_epub_chapter_markers.py
 tests/test_name_extractor.py
 tests/test_onnx_backend.py
 tests/test_phoneme_conversion.py
 tests/test_phoneme_dictionary.py
 tests/test_phonemes.py
+tests/test_ssmd_generator.py
 tests/test_tokenizer.py
 tests/test_utils.py
 ttsforge/__init__.py