PyPI - audio-scribe - Versions diffs - 0.1.4__tar.gz → 0.1.6__tar.gz - Mend

audio-scribe 0.1.4tar.gz → 0.1.6tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (25)

{audio_scribe-0.1.4/src/audio_scribe.egg-info → audio_scribe-0.1.6}/PKG-INFO RENAMED

@@ -1,13 +1,13 @@
-Metadata-Version: 2.2
+Metadata-Version: 2.4
 Name: audio_scribe
-Version: 0.1.4
+Version: 0.1.6
 Summary: A command-line tool for audio transcription with Whisper and Pyannote.
-Home-page: https://gitlab.genomicops.cloud/genomicops/audio-scribe
+Home-page: https://gitlab.genomicops.cloud/innovation-hub/audio-scribe
 Author: Gurasis Osahan
 Author-email: contact@genomicops.com
 License: Apache-2.0
-Project-URL: Source, https://gitlab.genomicops.cloud/genomicops/audio-scribe
-Project-URL: Tracker, https://gitlab.genomicops.cloud/genomicops/audio-scribe/-/issues
+Project-URL: Source, https://gitlab.genomicops.cloud/innovation-hub/audio-scribe
+Project-URL: Tracker, https://gitlab.genomicops.cloud/innovation-hub/audio-scribe/-/issues
 Keywords: whisper pyannote transcription audio diarization
 Classifier: Development Status :: 3 - Alpha
 Classifier: Intended Audience :: Developers
@@ -23,15 +23,15 @@ Classifier: Operating System :: OS Independent
 Requires-Python: >=3.8
 Description-Content-Type: text/markdown
 License-File: LICENSE
-Requires-Dist: torch
+Requires-Dist: torch>=2.7.1
 Requires-Dist: openai-whisper
-Requires-Dist: pyannote.audio
+Requires-Dist: pyannote.audio>=3.3.2
 Requires-Dist: pytorch-lightning
-Requires-Dist: keyring
+Requires-Dist: keyring>=25.6.0
 Requires-Dist: cryptography
-Requires-Dist: alive-progress
-Requires-Dist: psutil
-Requires-Dist: GPUtil
+Requires-Dist: alive-progress>=3.2.0
+Requires-Dist: psutil>=7.0.0
+Requires-Dist: GPUtil>=1.4.0
 Dynamic: author
 Dynamic: author-email
 Dynamic: classifier
@@ -40,6 +40,7 @@ Dynamic: description-content-type
 Dynamic: home-page
 Dynamic: keywords
 Dynamic: license
+Dynamic: license-file
 Dynamic: project-url
 Dynamic: requires-dist
 Dynamic: requires-python
@@ -52,7 +53,7 @@ Dynamic: summary
 <p align="center" style="margin: 0px auto;">
   <img src="https://img.shields.io/gitlab/pipeline-status/innovation-hub%2Faudio-scribe?gitlab_url=https%3A%2F%2Fgitlab.genomicops.cloud&style=for-the-badge&logo=gitlab&logoColor=white&color=green" alt="Pipeline Status">
-  <img src="https://img.shields.io/gitlab/pipeline-coverage/innovation-hub%2Faudio-scribe?gitlab_url=https%3A%2F%2Fgitlab.genomicops.cloud&style=for-the-badge&logo=tag&logoColor=white&color=red" alt="Coverage">
+  <img src="https://img.shields.io/gitlab/pipeline-coverage/innovation-hub%2Faudio-scribe?gitlab_url=https%3A%2F%2Fgitlab.genomicops.cloud&branch=main&style=for-the-badge&logo=tag&logoColor=white&color=red" alt="Coverage">
   <img src="https://img.shields.io/pypi/pyversions/audio-scribe?style=for-the-badge&logo=python&logoColor=white&logoWidth=30&color=yellow" alt="Python Versions">
   <img src="https://img.shields.io/pypi/dm/audio-scribe?style=for-the-badge&logo=pypi&logoColor=white&logoWidth=30&color=orange" alt="PyPI Downloads">
   <img src="https://img.shields.io/gitlab/v/tag/innovation-hub%2Faudio-scribe?gitlab_url=https%3A%2F%2Fgitlab.genomicops.cloud&style=for-the-badge&logo=tag&logoColor=white&color=red" alt="Version">
@@ -67,9 +68,9 @@ Dynamic: summary
 ## Support the Project ☕
-<p align="center">
-  <a href="https://www.buymeacoffee.com/gosahan" target="_blank">
-    <img src="https://cdn.buymeacoffee.com/buttons/v2/default-green.png" alt="Buy Me A Coffee" height="60">
+<p align="center" style="margin: 0px auto;">
+  <a href="https://buymeacoffee.com/gosahan" target="_blank">
+    <img src="https://img.shields.io/badge/Buy%20Me%20A%20Coffee-Support-yellow?style=for-the-badge&logo=buymeacoffee&logoColor=white" alt="Buy Me A Coffee Badge"/>
   </a>
 </p>
@@ -107,11 +108,13 @@ This repository is licensed under the [Apache License 2.0](#license).
   - [Usage](#usage)
   - [Dependencies](#dependencies)
     - [Sample `requirements.txt`](#sample-requirementstxt)
+  - [Troubleshooting](#troubleshooting)
+    - [IndexError: list index out of range](#indexerror-list-index-out-of-range)
+      - [Option 1: System-level Installation (requires sudo access)](#option-1-system-level-installation-requires-sudo-access)
+      - [Option 2: Conda-only Installation (no sudo required)](#option-2-conda-only-installation-no-sudo-required)
   - [Contributing](#contributing)
   - [License](#license)
----
 ## Features
 - **Whisper Transcription**
@@ -127,8 +130,6 @@ This repository is licensed under the [Apache License 2.0](#license).
 - **Configurable Models**
   Default is `base.en` but you can specify any other Whisper model using `--whisper-model`.
----
 ## Installation
 ### Installing from PyPI
@@ -157,8 +158,6 @@ pip install -r requirements.txt
 This approach is particularly useful if you want the newest changes or plan to contribute.
----
 ## Quick Start
 1. **Obtain a Hugging Face Token**
@@ -174,7 +173,6 @@ This approach is particularly useful if you want the newest changes or plan to c
 3. **Watch the Progress Bar**
    - The tool displays a progress bar for each diarized speaker turn, along with real-time CPU, GPU, and memory usage.
----
 ## Usage
@@ -222,7 +220,6 @@ optional arguments:
   # When prompted for an audio file path, press Tab to autocomplete
   ```
----
 ## Dependencies
@@ -258,11 +255,89 @@ GPUtil
 pyreadline3; sys_platform == "win32"
 ```
-> Note:
+> Note:
 > - `pyreadline3` is appended with a [PEP 508 marker](https://peps.python.org/pep-0508/) (`; sys_platform == "win32"`) so it only installs on Windows.
 > - For GPU support, ensure you install a compatible PyTorch version with CUDA.
----
+## Troubleshooting
+### IndexError: list index out of range
+**Symptom**
+You encounter the following error when running `audio-scribe` or importing `pyannote.audio`:
+```
+IndexError: list index out of range
+  File ".../pyannote/audio/core/io.py", line 214, in __init__
+    backend = "soundfile" if "soundfile" in backends else backends[0]
+```
+This occurs when `pyannote.audio` is unable to detect any supported audio backend. Most commonly, the `soundfile` module is missing or its dependency `libsndfile` is not properly installed.
+**Solution**
+You have two ways to resolve this issue:
+#### Option 1: System-level Installation (requires sudo access)
+Install the system-level audio backend library:
+```bash
+sudo apt-get update
+sudo apt-get install libsndfile1
+```
+Then reinstall the `soundfile` Python package inside your environment:
+```bash
+# If using conda
+conda activate your-environment-name
+pip uninstall soundfile -y
+pip install soundfile
+# If using pip/virtualenv
+source your-venv/bin/activate  # or equivalent activation command
+pip uninstall soundfile -y
+pip install soundfile
+```
+#### Option 2: Conda-only Installation (no sudo required)
+Inside your Conda environment:
+```bash
+conda activate your-environment-name
+conda install -c conda-forge libsndfile
+```
+Then ensure Python uses the correct bindings:
+```bash
+pip uninstall soundfile -y
+pip install soundfile
+```
+**Verification**
+Test that audio backends are now available:
+```bash
+python -c "import soundfile as sf; print(sf.available_formats())"
+```
+Expected output:
+```python
+{'WAV': 'Microsoft WAV format (little endian)', 'FLAC': 'FLAC format', ...}
+```
+Then re-run `audio-scribe`:
+```bash
+audio-scribe --audio path/to/your/audio.wav
+```
+The tool should now initialize without error.
 ## Contributing
@@ -275,8 +350,6 @@ We welcome contributions to **Audio Scribe**!
 Please read any available guidelines or templates in our repository (such as `CONTRIBUTING.md` or `CODE_OF_CONDUCT.md`) before submitting.
----
 ## License
 This project is licensed under the [Apache License 2.0](https://www.apache.org/licenses/LICENSE-2.0).
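
Reviewer note: the `Requires-Dist` hunk above replaces six unpinned dependencies with lower bounds. The following is a minimal sketch, not part of the package, of checking an existing environment against those floors before upgrading. The names `FLOORS`, `parse_version`, `installed_version`, and `check_floors` are hypothetical, and the version parsing is a deliberate simplification that assumes plain numeric versions (pre-release and local tags are ignored); a production check would more likely rely on the `packaging` library.

```python
import re
from importlib import metadata

# Lower bounds copied from the 0.1.6 Requires-Dist lines in the diff above.
FLOORS = {
    "torch": "2.7.1",
    "pyannote.audio": "3.3.2",
    "keyring": "25.6.0",
    "alive-progress": "3.2.0",
    "psutil": "7.0.0",
    "GPUtil": "1.4.0",
}


def parse_version(text, width=4):
    """Return the leading numeric components of a version, zero-padded to `width`.

    Simplification (assumption): suffixes such as "+cu126" or "rc1" are ignored.
    """
    match = re.match(r"\d+(?:\.\d+)*", text)
    if match is None:
        raise ValueError(f"unparseable version: {text!r}")
    parts = [int(part) for part in match.group(0).split(".")]
    parts += [0] * (width - len(parts))  # no-op when already long enough
    return tuple(parts)


def installed_version(name):
    """Look up an installed distribution's version, or None if it is absent."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return None


def check_floors(floors, lookup=installed_version):
    """Map each package that is missing or below its floor to the version found."""
    problems = {}
    for name, floor in floors.items():
        found = lookup(name)
        if found is None or parse_version(found) < parse_version(floor):
            problems[name] = found
    return problems


if __name__ == "__main__":
    for name, found in check_floors(FLOORS).items():
        print(f"{name}: found {found}, need >= {FLOORS[name]}")
```

Run as a script, this prints one line per dependency that is missing or older than its 0.1.6 floor, and nothing when the environment already satisfies all six.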

audio_scribe-0.1.4/PKG-INFO → audio_scribe-0.1.6/README.md RENAMED

@@ -1,50 +1,3 @@
-Metadata-Version: 2.2
-Name: audio_scribe
-Version: 0.1.4
-Summary: A command-line tool for audio transcription with Whisper and Pyannote.
-Home-page: https://gitlab.genomicops.cloud/genomicops/audio-scribe
-Author: Gurasis Osahan
-Author-email: contact@genomicops.com
-License: Apache-2.0
-Project-URL: Source, https://gitlab.genomicops.cloud/genomicops/audio-scribe
-Project-URL: Tracker, https://gitlab.genomicops.cloud/genomicops/audio-scribe/-/issues
-Keywords: whisper pyannote transcription audio diarization
-Classifier: Development Status :: 3 - Alpha
-Classifier: Intended Audience :: Developers
-Classifier: Intended Audience :: Science/Research
-Classifier: Topic :: Multimedia :: Sound/Audio
-Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
-Classifier: License :: OSI Approved :: Apache Software License
-Classifier: Programming Language :: Python :: 3
-Classifier: Programming Language :: Python :: 3.8
-Classifier: Programming Language :: Python :: 3.9
-Classifier: Programming Language :: Python :: 3.10
-Classifier: Operating System :: OS Independent
-Requires-Python: >=3.8
-Description-Content-Type: text/markdown
-License-File: LICENSE
-Requires-Dist: torch
-Requires-Dist: openai-whisper
-Requires-Dist: pyannote.audio
-Requires-Dist: pytorch-lightning
-Requires-Dist: keyring
-Requires-Dist: cryptography
-Requires-Dist: alive-progress
-Requires-Dist: psutil
-Requires-Dist: GPUtil
-Dynamic: author
-Dynamic: author-email
-Dynamic: classifier
-Dynamic: description
-Dynamic: description-content-type
-Dynamic: home-page
-Dynamic: keywords
-Dynamic: license
-Dynamic: project-url
-Dynamic: requires-dist
-Dynamic: requires-python
-Dynamic: summary
 # Audio Scribe
 **A Command-Line Tool for Audio Transcription and Speaker Diarization Using OpenAI Whisper and Pyannote**
@@ -52,7 +5,7 @@ Dynamic: summary
 <p align="center" style="margin: 0px auto;">
   <img src="https://img.shields.io/gitlab/pipeline-status/innovation-hub%2Faudio-scribe?gitlab_url=https%3A%2F%2Fgitlab.genomicops.cloud&style=for-the-badge&logo=gitlab&logoColor=white&color=green" alt="Pipeline Status">
-  <img src="https://img.shields.io/gitlab/pipeline-coverage/innovation-hub%2Faudio-scribe?gitlab_url=https%3A%2F%2Fgitlab.genomicops.cloud&style=for-the-badge&logo=tag&logoColor=white&color=red" alt="Coverage">
+  <img src="https://img.shields.io/gitlab/pipeline-coverage/innovation-hub%2Faudio-scribe?gitlab_url=https%3A%2F%2Fgitlab.genomicops.cloud&branch=main&style=for-the-badge&logo=tag&logoColor=white&color=red" alt="Coverage">
   <img src="https://img.shields.io/pypi/pyversions/audio-scribe?style=for-the-badge&logo=python&logoColor=white&logoWidth=30&color=yellow" alt="Python Versions">
   <img src="https://img.shields.io/pypi/dm/audio-scribe?style=for-the-badge&logo=pypi&logoColor=white&logoWidth=30&color=orange" alt="PyPI Downloads">
   <img src="https://img.shields.io/gitlab/v/tag/innovation-hub%2Faudio-scribe?gitlab_url=https%3A%2F%2Fgitlab.genomicops.cloud&style=for-the-badge&logo=tag&logoColor=white&color=red" alt="Version">
@@ -67,9 +20,9 @@ Dynamic: summary
 ## Support the Project ☕
-<p align="center">
-  <a href="https://www.buymeacoffee.com/gosahan" target="_blank">
-    <img src="https://cdn.buymeacoffee.com/buttons/v2/default-green.png" alt="Buy Me A Coffee" height="60">
+<p align="center" style="margin: 0px auto;">
+  <a href="https://buymeacoffee.com/gosahan" target="_blank">
+    <img src="https://img.shields.io/badge/Buy%20Me%20A%20Coffee-Support-yellow?style=for-the-badge&logo=buymeacoffee&logoColor=white" alt="Buy Me A Coffee Badge"/>
   </a>
 </p>
@@ -107,11 +60,13 @@ This repository is licensed under the [Apache License 2.0](#license).
   - [Usage](#usage)
   - [Dependencies](#dependencies)
     - [Sample `requirements.txt`](#sample-requirementstxt)
+  - [Troubleshooting](#troubleshooting)
+    - [IndexError: list index out of range](#indexerror-list-index-out-of-range)
+      - [Option 1: System-level Installation (requires sudo access)](#option-1-system-level-installation-requires-sudo-access)
+      - [Option 2: Conda-only Installation (no sudo required)](#option-2-conda-only-installation-no-sudo-required)
   - [Contributing](#contributing)
   - [License](#license)
----
 ## Features
 - **Whisper Transcription**
@@ -127,8 +82,6 @@ This repository is licensed under the [Apache License 2.0](#license).
 - **Configurable Models**
   Default is `base.en` but you can specify any other Whisper model using `--whisper-model`.
----
 ## Installation
 ### Installing from PyPI
@@ -157,8 +110,6 @@ pip install -r requirements.txt
 This approach is particularly useful if you want the newest changes or plan to contribute.
----
 ## Quick Start
 1. **Obtain a Hugging Face Token**
@@ -174,7 +125,6 @@ This approach is particularly useful if you want the newest changes or plan to c
 3. **Watch the Progress Bar**
    - The tool displays a progress bar for each diarized speaker turn, along with real-time CPU, GPU, and memory usage.
----
 ## Usage
@@ -222,7 +172,6 @@ optional arguments:
   # When prompted for an audio file path, press Tab to autocomplete
   ```
----
 ## Dependencies
@@ -258,11 +207,89 @@ GPUtil
 pyreadline3; sys_platform == "win32"
 ```
-> Note:
+> Note:
 > - `pyreadline3` is appended with a [PEP 508 marker](https://peps.python.org/pep-0508/) (`; sys_platform == "win32"`) so it only installs on Windows.
 > - For GPU support, ensure you install a compatible PyTorch version with CUDA.
----
+## Troubleshooting
+### IndexError: list index out of range
+**Symptom**
+You encounter the following error when running `audio-scribe` or importing `pyannote.audio`:
+```
+IndexError: list index out of range
+  File ".../pyannote/audio/core/io.py", line 214, in __init__
+    backend = "soundfile" if "soundfile" in backends else backends[0]
+```
+This occurs when `pyannote.audio` is unable to detect any supported audio backend. Most commonly, the `soundfile` module is missing or its dependency `libsndfile` is not properly installed.
+**Solution**
+You have two ways to resolve this issue:
+#### Option 1: System-level Installation (requires sudo access)
+Install the system-level audio backend library:
+```bash
+sudo apt-get update
+sudo apt-get install libsndfile1
+```
+Then reinstall the `soundfile` Python package inside your environment:
+```bash
+# If using conda
+conda activate your-environment-name
+pip uninstall soundfile -y
+pip install soundfile
+# If using pip/virtualenv
+source your-venv/bin/activate  # or equivalent activation command
+pip uninstall soundfile -y
+pip install soundfile
+```
+#### Option 2: Conda-only Installation (no sudo required)
+Inside your Conda environment:
+```bash
+conda activate your-environment-name
+conda install -c conda-forge libsndfile
+```
+Then ensure Python uses the correct bindings:
+```bash
+pip uninstall soundfile -y
+pip install soundfile
+```
+**Verification**
+Test that audio backends are now available:
+```bash
+python -c "import soundfile as sf; print(sf.available_formats())"
+```
+Expected output:
+```python
+{'WAV': 'Microsoft WAV format (little endian)', 'FLAC': 'FLAC format', ...}
+```
+Then re-run `audio-scribe`:
+```bash
+audio-scribe --audio path/to/your/audio.wav
+```
+The tool should now initialize without error.
 ## Contributing
@@ -275,8 +302,6 @@ We welcome contributions to **Audio Scribe**!
 Please read any available guidelines or templates in our repository (such as `CONTRIBUTING.md` or `CODE_OF_CONDUCT.md`) before submitting.
----
 ## License
 This project is licensed under the [Apache License 2.0](https://www.apache.org/licenses/LICENSE-2.0).
@@ -300,4 +325,4 @@ limitations under the License.
 ---
 **Thank you for using Audio Scribe!**
-For questions or feedback, please open a [GitHub issue](https://gitlab.genomicops.cloud/innovation-hub/audio-scribe/-/issues) or contact the maintainers.
+For questions or feedback, please open a [GitHub issue](https://gitlab.genomicops.cloud/innovation-hub/audio-scribe/-/issues) or contact the maintainers.
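
Reviewer note: the new Troubleshooting section quotes the expression `backend = "soundfile" if "soundfile" in backends else backends[0]` from the traceback. The sketch below reproduces only that expression to show why an empty backend list ends in `IndexError`. `pick_backend` and `describe` are hypothetical stand-ins written for illustration; they are not functions from `pyannote.audio`, and the backend names in the sample calls are placeholders.

```python
def pick_backend(backends):
    """Mirror the quoted expression: prefer "soundfile", else take the first entry."""
    return "soundfile" if "soundfile" in backends else backends[0]


def describe(backends):
    """Return the chosen backend, or the error text when selection fails."""
    try:
        return pick_backend(backends)
    except IndexError as exc:
        return f"IndexError: {exc}"


print(describe(["ffmpeg", "soundfile"]))  # "soundfile" wins whenever it is listed
print(describe(["sox"]))                  # otherwise the first listed backend is used
print(describe([]))                       # nothing detected, so backends[0] fails
```

The last call is the case the README describes: with no backend detected, the fallback `backends[0]` indexes an empty list, which is why installing `libsndfile` and reinstalling `soundfile` is the suggested remedy.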

{audio_scribe-0.1.4 → audio_scribe-0.1.6}/setup.py RENAMED

@@ -5,33 +5,38 @@ with open("README.md", "r", encoding="utf-8") as fh:
 setuptools.setup(
     name="audio_scribe",
-    version="0.1.4",
+    version="0.1.6",
     author="Gurasis Osahan",
     author_email="contact@genomicops.com",
     description="A command-line tool for audio transcription with Whisper and Pyannote.",
     long_description=long_description,
     long_description_content_type="text/markdown",
-    url="https://gitlab.genomicops.cloud/genomicops/audio-scribe",
+    url="https://gitlab.genomicops.cloud/innovation-hub/audio-scribe",
     package_dir={"": "src"},
     packages=setuptools.find_packages(where="src"),
     python_requires=">=3.8",
     install_requires=[
-        "torch",
+        "torch>=2.7.1",
         "openai-whisper",
-        "pyannote.audio",
+        "pyannote.audio>=3.3.2",
         "pytorch-lightning",
-        "keyring",
+        "keyring>=25.6.0",
         "cryptography",
-        "alive-progress",
-        "psutil",
-        "GPUtil",
+        "alive-progress>=3.2.0",
+        "psutil>=7.0.0",
+        "GPUtil>=1.4.0",
     ],
-    entry_points={"console_scripts": ["audio-scribe=audio_scribe.transcriber:main"]},
+    entry_points={
+        "console_scripts": [
+            "audio-scribe=audio_scribe.transcriber:main",
+            "audioscribe=audio_scribe.transcriber:main",
+        ]
+    },
     keywords="whisper pyannote transcription audio diarization",
     license="Apache-2.0",
     project_urls={
-        "Source": "https://gitlab.genomicops.cloud/genomicops/audio-scribe",
-        "Tracker": "https://gitlab.genomicops.cloud/genomicops/audio-scribe/-/issues",
+        "Source": "https://gitlab.genomicops.cloud/innovation-hub/audio-scribe",
+        "Tracker": "https://gitlab.genomicops.cloud/innovation-hub/audio-scribe/-/issues",
     },
     classifiers=[
         "Development Status :: 3 - Alpha",

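Reviewer note: the `entry_points` hunk in `setup.py` registers a second console script, `audioscribe`, next to the existing `audio-scribe`. A minimal sketch, using only the two spec strings from the diff, that parses them and confirms both commands point at the same callable. `CONSOLE_SCRIPTS`, `parse_entry_point`, and `targets_by_command` are hypothetical names introduced here; the parsing handles only the simple `name=module:attr` form seen in this diff.

```python
# Entry-point specs copied from the 0.1.6 setup.py in the diff above.
CONSOLE_SCRIPTS = [
    "audio-scribe=audio_scribe.transcriber:main",
    "audioscribe=audio_scribe.transcriber:main",
]


def parse_entry_point(spec):
    """Split a "name=module:attr" spec into (command, module, attribute)."""
    command, _, target = spec.partition("=")
    module, _, attr = target.partition(":")
    return command.strip(), module.strip(), attr.strip()


def targets_by_command(specs):
    """Map each command name to the (module, attribute) pair it invokes."""
    return {cmd: (mod, attr) for cmd, mod, attr in map(parse_entry_point, specs)}


if __name__ == "__main__":
    for command, (module, attr) in targets_by_command(CONSOLE_SCRIPTS).items():
        print(f"{command} -> {module}:{attr}")
```

Since both specs resolve to `audio_scribe.transcriber:main`, the new name appears to be a pure alias: either command should start the same CLI after installing 0.1.6.
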
{audio_scribe-0.1.4 → audio_scribe-0.1.6}/src/audio_scribe/__init__.py RENAMED

@@ -5,13 +5,13 @@ A Python package for transcribing audio files with speaker diarization
 using Whisper and Pyannote.
 """
-from .transcriber import main
-from .models import TranscriptionPipeline, AudioProcessor
-from .config import TranscriptionConfig
-from .auth import TokenManager
-from .utils import DependencyManager, complete_path
+from audio_scribe.transcriber import main
+from audio_scribe.models import TranscriptionPipeline, AudioProcessor
+from audio_scribe.config import TranscriptionConfig
+from audio_scribe.auth import TokenManager
+from audio_scribe.utils import DependencyManager, complete_path
-__version__ = "0.1.4"
+__version__ = "0.1.6"
 __all__ = [
     "main",

{audio_scribe-0.1.4 → audio_scribe-0.1.6}/src/audio_scribe/models.py RENAMED

@@ -11,8 +11,8 @@ from datetime import datetime
 from pathlib import Path
 from pyannote.audio import Pipeline  # type: ignore
-from .config import TranscriptionConfig
-from .auth import TokenManager
+from audio_scribe.config import TranscriptionConfig
+from audio_scribe.auth import TokenManager
 logger = logging.getLogger(__name__)
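
Reviewer note: the `__init__.py` and `models.py` hunks rewrite relative imports (`from .config import ...`) as absolute ones (`from audio_scribe.config import ...`). The diff does not state the motivation. The sketch below only illustrates that, for a module inside the `audio_scribe` package, each relative name resolves to the absolute name now spelled out. It uses the standard-library `importlib.util.resolve_name`; `to_absolute` is a hypothetical helper.

```python
from importlib.util import resolve_name

# Relative module names that appear on the removed lines of the two hunks.
RELATIVE_IMPORTS = [".transcriber", ".models", ".config", ".auth", ".utils"]


def to_absolute(relative, package="audio_scribe"):
    """Resolve a relative module name against the package that contains it."""
    return resolve_name(relative, package)


if __name__ == "__main__":
    for relative in RELATIVE_IMPORTS:
        print(f"from {relative} import ...  ->  from {to_absolute(relative)} import ...")
```

When the package is installed and imported as `audio_scribe`, the two spellings should load the same modules, so this change reads as stylistic rather than behavioral.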
