PyPI - abstractvoice - Versions diffs - 0.4.1__tar.gz → 0.5.0__tar.gz - Mend

abstractvoice 0.4.1tar.gz → 0.5.0tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (29) hide show

{abstractvoice-0.4.1 → abstractvoice-0.5.0}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: abstractvoice
-Version: 0.4.1
+Version: 0.5.0
 Summary: A modular Python library for voice interactions with AI systems
 Author-email: Laurent-Philippe Albou <contact@abstractcore.ai>
 License-Expression: MIT
@@ -19,6 +19,14 @@ Description-Content-Type: text/markdown
 License-File: LICENSE
 Requires-Dist: numpy>=1.24.0
 Requires-Dist: requests>=2.31.0
+Requires-Dist: appdirs>=1.4.0
+Requires-Dist: coqui-tts<0.30.0,>=0.27.0
+Requires-Dist: torch<2.4.0,>=2.0.0
+Requires-Dist: torchvision<0.19.0,>=0.15.0
+Requires-Dist: torchaudio<2.4.0,>=2.0.0
+Requires-Dist: librosa>=0.10.0
+Requires-Dist: sounddevice>=0.4.6
+Requires-Dist: soundfile>=0.12.1
 Provides-Extra: voice
 Requires-Dist: sounddevice>=0.4.6; extra == "voice"
 Requires-Dist: webrtcvad>=2.0.10; extra == "voice"
@@ -164,38 +172,51 @@ AbstractVoice automatically detects espeak-ng and upgrades to premium quality vo
 ## Quick Start
-### ⚡ Instant TTS (v0.4.0+)
+### ⚡ Instant TTS (v0.5.0+)
 ```python
 from abstractvoice import VoiceManager
-# Initialize voice manager - automatically downloads essential model if needed
+# Initialize voice manager - works immediately with included dependencies
 vm = VoiceManager()
-# Text-to-speech works immediately!
+# Text-to-speech works right away!
 vm.speak("Hello! TTS works out of the box!")
+# Language switching with automatic model download
+vm.set_language('fr')
+vm.speak("Bonjour! Le français fonctionne aussi!")
 ```
-**That's it!** AbstractVoice v0.4.0+ automatically:
-- ✅ Downloads essential English model (107MB) on first use
-- ✅ Caches models permanently for offline use
-- ✅ Works immediately after first setup
+**That's it!** AbstractVoice v0.5.0+ automatically:
+- ✅ Includes essential TTS dependencies in base installation
+- ✅ Downloads models automatically when switching languages/voices
+- ✅ Works immediately after `pip install abstractvoice`
+- ✅ No silent failures - clear error messages if download fails
 - ✅ No complex configuration needed
-### 🌍 Multi-Language Support
+### 🌍 Multi-Language Support (Auto-Download in v0.5.0+)
 ```python
-# Download and use French voice
-vm.download_model('fr.css10_vits')  # Downloads automatically
+# Simply switch language - downloads model automatically if needed!
 vm.set_language('fr')
 vm.speak("Bonjour! Je parle français maintenant.")
-# Download and use German voice
-vm.download_model('de.thorsten_vits')
+# Switch to German - no manual download needed
 vm.set_language('de')
 vm.speak("Hallo! Ich spreche jetzt Deutsch.")
+# Spanish, Italian also supported
+vm.set_language('es')
+vm.speak("¡Hola! Hablo español ahora.")
+# If download fails, you'll get clear error messages with instructions
+# Example: "❌ Cannot switch to French: Model download failed"
+#          "   Try: abstractvoice download-models --language fr"
 ```
+**New in v0.5.0:** No more manual `download_model()` calls! Language switching handles downloads automatically.
 ### 🔧 Check System Status
 ```python
@@ -1363,20 +1384,22 @@ abstractvoice check-deps
 ### CLI Voice Commands
-In the CLI REPL, use these commands:
+In the CLI REPL, use these commands (v0.5.0+):
 ```bash
 # List all available voices with download status
 /setvoice
-# Download and set specific voice
-/setvoice fr.css10_vits      # French CSS10 VITS
-/setvoice de.thorsten_vits   # German Thorsten
-/setvoice it.mai_male_vits   # Italian Male
+# Automatically download and set specific voice (NEW in v0.5.0!)
+/setvoice fr.css10_vits      # Downloads French CSS10 if needed
+/setvoice de.thorsten_vits   # Downloads German Thorsten if needed
+/setvoice it.mai_male_vits   # Downloads Italian Male if needed
+/setvoice en.jenny           # Downloads Jenny voice if needed
-# Change language
-/language fr
-/language de
+# Change language (automatically downloads models if needed - NEW!)
+/language fr                 # Switches to French, downloads if needed
+/language de                 # Switches to German, downloads if needed
+/language es                 # Switches to Spanish, downloads if needed
 # Voice controls
 /pause                       # Pause current speech
@@ -1387,6 +1410,8 @@ In the CLI REPL, use these commands:
 /exit
 ```
+**New in v0.5.0:** Language and voice commands now automatically download missing models with progress indicators. No more silent failures!
 ## Perspectives
 This is a test project that I designed with examples to work with Ollama, but I will adapt the examples and abstractvoice to work with any LLM provider (anthropic, openai, etc).

{abstractvoice-0.4.1 → abstractvoice-0.5.0}/README.md RENAMED Viewed

@@ -82,38 +82,51 @@ AbstractVoice automatically detects espeak-ng and upgrades to premium quality vo
 ## Quick Start
-### ⚡ Instant TTS (v0.4.0+)
+### ⚡ Instant TTS (v0.5.0+)
 ```python
 from abstractvoice import VoiceManager
-# Initialize voice manager - automatically downloads essential model if needed
+# Initialize voice manager - works immediately with included dependencies
 vm = VoiceManager()
-# Text-to-speech works immediately!
+# Text-to-speech works right away!
 vm.speak("Hello! TTS works out of the box!")
+# Language switching with automatic model download
+vm.set_language('fr')
+vm.speak("Bonjour! Le français fonctionne aussi!")
 ```
-**That's it!** AbstractVoice v0.4.0+ automatically:
-- ✅ Downloads essential English model (107MB) on first use
-- ✅ Caches models permanently for offline use
-- ✅ Works immediately after first setup
+**That's it!** AbstractVoice v0.5.0+ automatically:
+- ✅ Includes essential TTS dependencies in base installation
+- ✅ Downloads models automatically when switching languages/voices
+- ✅ Works immediately after `pip install abstractvoice`
+- ✅ No silent failures - clear error messages if download fails
 - ✅ No complex configuration needed
-### 🌍 Multi-Language Support
+### 🌍 Multi-Language Support (Auto-Download in v0.5.0+)
 ```python
-# Download and use French voice
-vm.download_model('fr.css10_vits')  # Downloads automatically
+# Simply switch language - downloads model automatically if needed!
 vm.set_language('fr')
 vm.speak("Bonjour! Je parle français maintenant.")
-# Download and use German voice
-vm.download_model('de.thorsten_vits')
+# Switch to German - no manual download needed
 vm.set_language('de')
 vm.speak("Hallo! Ich spreche jetzt Deutsch.")
+# Spanish, Italian also supported
+vm.set_language('es')
+vm.speak("¡Hola! Hablo español ahora.")
+# If download fails, you'll get clear error messages with instructions
+# Example: "❌ Cannot switch to French: Model download failed"
+#          "   Try: abstractvoice download-models --language fr"
 ```
+**New in v0.5.0:** No more manual `download_model()` calls! Language switching handles downloads automatically.
 ### 🔧 Check System Status
 ```python
@@ -1281,20 +1294,22 @@ abstractvoice check-deps
 ### CLI Voice Commands
-In the CLI REPL, use these commands:
+In the CLI REPL, use these commands (v0.5.0+):
 ```bash
 # List all available voices with download status
 /setvoice
-# Download and set specific voice
-/setvoice fr.css10_vits      # French CSS10 VITS
-/setvoice de.thorsten_vits   # German Thorsten
-/setvoice it.mai_male_vits   # Italian Male
+# Automatically download and set specific voice (NEW in v0.5.0!)
+/setvoice fr.css10_vits      # Downloads French CSS10 if needed
+/setvoice de.thorsten_vits   # Downloads German Thorsten if needed
+/setvoice it.mai_male_vits   # Downloads Italian Male if needed
+/setvoice en.jenny           # Downloads Jenny voice if needed
-# Change language
-/language fr
-/language de
+# Change language (automatically downloads models if needed - NEW!)
+/language fr                 # Switches to French, downloads if needed
+/language de                 # Switches to German, downloads if needed
+/language es                 # Switches to Spanish, downloads if needed
 # Voice controls
 /pause                       # Pause current speech
@@ -1305,6 +1320,8 @@ In the CLI REPL, use these commands:
 /exit
 ```
+**New in v0.5.0:** Language and voice commands now automatically download missing models with progress indicators. No more silent failures!
 ## Perspectives
 This is a test project that I designed with examples to work with Ollama, but I will adapt the examples and abstractvoice to work with any LLM provider (anthropic, openai, etc).

{abstractvoice-0.4.1 → abstractvoice-0.5.0}/abstractvoice/__init__.py RENAMED Viewed

@@ -32,5 +32,5 @@ from .voice_manager import VoiceManager
 # Import simple APIs for third-party applications
 from .simple_model_manager import list_models, download_model, get_status, is_ready
-__version__ = "0.4.1"
+__version__ = "0.5.0"
 __all__ = ['VoiceManager', 'list_models', 'download_model', 'get_status', 'is_ready']

{abstractvoice-0.4.1 → abstractvoice-0.5.0}/abstractvoice/examples/voice_cli.py RENAMED Viewed

@@ -158,7 +158,7 @@ def main():
                     traceback.print_exc()
             return
         elif args.command == "download-models":
-            from abstractvoice.model_manager import download_models_cli
+            from abstractvoice.simple_model_manager import download_models_cli
             # Pass remaining arguments to download_models_cli
             import sys
             original_argv = sys.argv

abstractvoice-0.5.0/abstractvoice/instant_setup.py ADDED Viewed

@@ -0,0 +1,83 @@
+"""
+Instant Setup Module for AbstractVoice
+Provides immediate TTS functionality with seamless model download.
+"""
+import os
+import sys
+from pathlib import Path
+# Essential model for instant functionality (lightweight, reliable)
+ESSENTIAL_MODEL = "tts_models/en/ljspeech/fast_pitch"
+ESSENTIAL_MODEL_SIZE = "~100MB"
+def ensure_instant_tts():
+    """
+    Ensure TTS is ready for immediate use.
+    Downloads essential model if needed with progress indicator.
+    Returns:
+        bool: True if TTS is ready, False if failed
+    """
+    try:
+        from TTS.api import TTS
+        from TTS.utils.manage import ModelManager
+        manager = ModelManager()
+        # Check if essential model is already cached
+        if is_model_cached(ESSENTIAL_MODEL):
+            return True
+        # Download essential model with user-friendly progress
+        print(f"🚀 AbstractVoice: Setting up TTS ({ESSENTIAL_MODEL_SIZE})...")
+        print(f"   This happens once and takes ~30 seconds")
+        try:
+            # Download with progress bar
+            tts = TTS(model_name=ESSENTIAL_MODEL, progress_bar=True)
+            print(f"✅ TTS ready! AbstractVoice is now fully functional.")
+            return True
+        except Exception as e:
+            print(f"❌ Setup failed: {e}")
+            print(f"💡 Try: pip install abstractvoice[all]")
+            return False
+    except ImportError as e:
+        print(f"❌ Missing dependencies: {e}")
+        print(f"💡 Install with: pip install abstractvoice[all]")
+        return False
+def is_model_cached(model_name):
+    """Check if a model is already cached."""
+    try:
+        from TTS.utils.manage import ModelManager
+        manager = ModelManager()
+        # Get cached models list
+        models_file = os.path.join(manager.output_prefix, ".models.json")
+        if os.path.exists(models_file):
+            import json
+            with open(models_file, 'r') as f:
+                cached_models = json.load(f)
+                return model_name in cached_models
+        # Fallback: check if model directory exists and has content
+        model_dir = model_name.replace("/", "--")
+        model_path = os.path.join(manager.output_prefix, model_dir)
+        return os.path.exists(model_path) and bool(os.listdir(model_path))
+    except:
+        # If anything fails, assume not cached
+        return False
+def get_instant_model():
+    """Get the essential model name for instant setup."""
+    return ESSENTIAL_MODEL
+if __name__ == "__main__":
+    # CLI test
+    print("🧪 Testing instant setup...")
+    success = ensure_instant_tts()
+    print(f"Result: {'✅ Ready' if success else '❌ Failed'}")

{abstractvoice-0.4.1 → abstractvoice-0.5.0}/abstractvoice/simple_model_manager.py RENAMED Viewed

@@ -31,37 +31,65 @@ class SimpleModelManager:
     """Simple, clean model manager for AbstractVoice."""
     # Essential model - guaranteed to work everywhere, reasonable size
-    ESSENTIAL_MODEL = "tts_models/en/ljspeech/fast_pitch"
+    # Changed from fast_pitch to tacotron2-DDC because fast_pitch downloads are failing
+    ESSENTIAL_MODEL = "tts_models/en/ljspeech/tacotron2-DDC"
     # Available models organized by language with metadata
     AVAILABLE_MODELS = {
         "en": {
+            "tacotron2": {
+                "model": "tts_models/en/ljspeech/tacotron2-DDC",
+                "name": "Linda (LJSpeech)",
+                "quality": "good",
+                "size_mb": 362,
+                "description": "Standard female voice (LJSpeech speaker)",
+                "requires_espeak": False,
+                "default": True
+            },
+            "jenny": {
+                "model": "tts_models/en/jenny/jenny",
+                "name": "Jenny",
+                "quality": "excellent",
+                "size_mb": 368,
+                "description": "Different female voice, clear and natural",
+                "requires_espeak": False,
+                "default": False
+            },
+            "ek1": {
+                "model": "tts_models/en/ek1/tacotron2",
+                "name": "Edward (EK1)",
+                "quality": "excellent",
+                "size_mb": 310,
+                "description": "Male voice with British accent",
+                "requires_espeak": False,
+                "default": False
+            },
+            "sam": {
+                "model": "tts_models/en/sam/tacotron-DDC",
+                "name": "Sam",
+                "quality": "good",
+                "size_mb": 370,
+                "description": "Different male voice, deeper tone",
+                "requires_espeak": False,
+                "default": False
+            },
             "fast_pitch": {
                 "model": "tts_models/en/ljspeech/fast_pitch",
-                "name": "Fast Pitch (English)",
+                "name": "Linda Fast (LJSpeech)",
                 "quality": "good",
                 "size_mb": 107,
-                "description": "Lightweight, reliable English voice",
+                "description": "Same speaker as Linda but faster engine",
                 "requires_espeak": False,
-                "default": True
+                "default": False
             },
             "vits": {
                 "model": "tts_models/en/ljspeech/vits",
-                "name": "VITS (English)",
+                "name": "Linda Premium (LJSpeech)",
                 "quality": "excellent",
                 "size_mb": 328,
-                "description": "High-quality English voice with natural prosody",
+                "description": "Same speaker as Linda but premium quality",
                 "requires_espeak": True,
                 "default": False
-            },
-            "tacotron2": {
-                "model": "tts_models/en/ljspeech/tacotron2-DDC",
-                "name": "Tacotron2 (English)",
-                "quality": "good",
-                "size_mb": 362,
-                "description": "Classic English voice, reliable",
-                "requires_espeak": False,
-                "default": False
             }
         },
         "fr": {
@@ -184,7 +212,7 @@ class SimpleModelManager:
             return False
     def download_model(self, model_name: str, progress_callback: Optional[Callable[[str, bool], None]] = None) -> bool:
-        """Download a specific model.
+        """Download a specific model with improved error handling.
         Args:
             model_name: TTS model name (e.g., 'tts_models/en/ljspeech/fast_pitch')
@@ -203,25 +231,56 @@ class SimpleModelManager:
         try:
             TTS, _ = _import_tts()
-            if self.debug_mode:
-                print(f"📥 Downloading {model_name}...")
+            print(f"📥 Downloading {model_name}...")
+            print(f"   This may take a few minutes depending on your connection...")
             start_time = time.time()
             # Initialize TTS to trigger download
-            tts = TTS(model_name=model_name, progress_bar=True)
+            # Set gpu=False to avoid CUDA errors on systems without GPU
+            try:
+                tts = TTS(model_name=model_name, progress_bar=True, gpu=False)
+                # Verify the model actually downloaded
+                if not self.is_model_cached(model_name):
+                    print(f"⚠️ Model download completed but not found in cache")
+                    return False
+            except Exception as init_error:
+                # Try alternative download method
+                error_msg = str(init_error).lower()
+                if "connection" in error_msg or "timeout" in error_msg:
+                    print(f"❌ Network error: Check your internet connection")
+                elif "not found" in error_msg:
+                    print(f"❌ Model '{model_name}' not found in registry")
+                else:
+                    print(f"❌ Download error: {init_error}")
+                raise
             download_time = time.time() - start_time
-            if self.debug_mode:
-                print(f"✅ Downloaded {model_name} in {download_time:.1f}s")
+            print(f"✅ Downloaded {model_name} in {download_time:.1f}s")
             if progress_callback:
                 progress_callback(model_name, True)
             return True
         except Exception as e:
-            if self.debug_mode:
-                print(f"❌ Failed to download {model_name}: {e}")
+            error_msg = str(e).lower()
+            # Provide helpful error messages
+            if "connection" in error_msg or "timeout" in error_msg:
+                print(f"❌ Failed to download {model_name}: Network issue")
+                print(f"   Check your internet connection and try again")
+            elif "permission" in error_msg:
+                print(f"❌ Failed to download {model_name}: Permission denied")
+                print(f"   Check write permissions for cache directory")
+            elif "space" in error_msg:
+                print(f"❌ Failed to download {model_name}: Insufficient disk space")
+            else:
+                print(f"❌ Failed to download {model_name}")
+                if self.debug_mode:
+                    print(f"   Error: {e}")
             if progress_callback:
                 progress_callback(model_name, False)
             return False
@@ -395,4 +454,86 @@ def get_status() -> str:
 def is_ready() -> bool:
     """Check if essential model is ready for immediate use."""
     manager = get_model_manager()
-    return manager.is_model_cached(manager.ESSENTIAL_MODEL)
+    return manager.is_model_cached(manager.ESSENTIAL_MODEL)
+def download_models_cli():
+    """Simple CLI entry point for downloading models."""
+    import argparse
+    import sys
+    parser = argparse.ArgumentParser(description="Download TTS models for offline use")
+    parser.add_argument("--essential", action="store_true",
+                       help="Download essential model (default)")
+    parser.add_argument("--all", action="store_true",
+                       help="Download all available models")
+    parser.add_argument("--model", type=str,
+                       help="Download specific model by name")
+    parser.add_argument("--language", type=str,
+                       help="Download models for specific language (en, fr, es, de, it)")
+    parser.add_argument("--status", action="store_true",
+                       help="Show current cache status")
+    parser.add_argument("--clear", action="store_true",
+                       help="Clear model cache")
+    args = parser.parse_args()
+    manager = get_model_manager(debug_mode=True)
+    if args.status:
+        print(get_status())
+        return
+    if args.clear:
+        # Ask for confirmation
+        response = input("⚠️ This will delete all downloaded TTS models. Continue? (y/N): ")
+        if response.lower() == 'y':
+            success = manager.clear_cache(confirm=True)
+            if success:
+                print("✅ Model cache cleared")
+            else:
+                print("❌ Failed to clear cache")
+        else:
+            print("Cancelled")
+        return
+    if args.model:
+        success = download_model(args.model)
+        if success:
+            print(f"✅ Downloaded {args.model}")
+        else:
+            print(f"❌ Failed to download {args.model}")
+        sys.exit(0 if success else 1)
+    if args.language:
+        # Language-specific downloads using our simple API
+        lang_models = {
+            'en': ['en.tacotron2', 'en.jenny', 'en.ek1'],
+            'fr': ['fr.css10_vits', 'fr.mai_tacotron2'],
+            'es': ['es.mai_tacotron2'],
+            'de': ['de.thorsten_vits'],
+            'it': ['it.mai_male_vits', 'it.mai_female_vits']
+        }
+        if args.language not in lang_models:
+            print(f"❌ Language '{args.language}' not supported")
+            print(f"   Available: {list(lang_models.keys())}")
+            sys.exit(1)
+        success = False
+        for model_id in lang_models[args.language]:
+            if download_model(model_id):
+                print(f"✅ Downloaded {model_id}")
+                success = True
+                break
+        sys.exit(0 if success else 1)
+    # Default: download essential model
+    print("📦 Downloading essential TTS model...")
+    success = download_model(manager.ESSENTIAL_MODEL)
+    if success:
+        print("✅ Essential model ready!")
+    else:
+        print("❌ Failed to download essential model")
+    sys.exit(0 if success else 1)

abstractvoice 0.4.1__tar.gz → 0.5.0__tar.gz

abstractvoice 0.4.1tar.gz → 0.5.0tar.gz