npm - voice-mcp-server - Versions diffs - 0.2.0 → 0.3.0 - Mend

voice-mcp-server 0.2.0 → 0.3.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (21) hide show

package/README.md CHANGED Viewed

@@ -5,6 +5,7 @@
 **Give your AI agents a voice, real ears, and the ability to handle interruptions in real-time.**
 [![npm version](https://img.shields.io/npm/v/voice-mcp-server.svg?color=red&style=flat-square&logo=npm)](https://www.npmjs.com/package/voice-mcp-server)
+[![Publish to NPM](https://img.shields.io/github/actions/workflow/status/erickvs/voice-mcp-server/publish.yml?style=flat-square&logo=github)](https://github.com/erickvs/voice-mcp-server/actions)
 [![Platform: macOS Apple Silicon](https://img.shields.io/badge/Platform-macOS%20%7C%20Apple%20Silicon-lightgrey?style=flat-square&logo=apple)](#-target-environment)
 [![Python](https://img.shields.io/badge/Python-3.10%2B-blue?logo=python&style=flat-square)](https://python.org)
 [![MCP Compatible](https://img.shields.io/badge/MCP-Compatible-success?style=flat-square)](https://modelcontextprotocol.io/)
@@ -149,7 +150,34 @@ Simply use `voice-mcp-server` as the command in your configuration.
 > [!NOTE]
 > **First Run Performance:** The very first time you invoke the voice tool, it will take a few minutes to initialize the Python environment and download the heavy ML weights (~4GB). **The tools will not be available until this background setup completes.** You can monitor progress in your terminal logs. *Depending on your AI client, you may need to restart the application/CLI for the tools to appear after setup.*
-### 4. Uninstalling
+### 4. Customizing the Voice (ElevenLabs)
+If you prefer to use **ElevenLabs** for ultra-realistic cloud TTS instead of the default local Kokoro engine, you can easily configure it using Environment Variables!
+> [!WARNING]
+> **Privacy Notice:** By configuring and using ElevenLabs, the text generated by your LLM will be transmitted over the internet to ElevenLabs' servers for audio rendering. This data is subject to ElevenLabs' own privacy policies and terms of service. If you require absolute privacy and air-gapped security, do not configure this key and continue using the default local MLX engine.
+When adding the server to your MCP Client (like `claude_desktop_config.json`), simply provide your API key and your preferred Voice ID in the `env` object:
+```json
+{
+  "mcpServers": {
+    "voice-mcp-server": {
+      "command": "voice-mcp-server",
+      "args": [],
+      "env": {
+        "ELEVENLABS_API_KEY": "sk_your_api_key_here",
+        "ELEVENLABS_VOICE_ID": "aEO01A4wXwd1O8GPgGlF"
+      }
+    }
+  }
+}
+```
+*(If you are using Gemini CLI or Claude Code, you can simply `export` these variables in your terminal profile like `.zshrc`!)*
+Once configured, simply tell your AI: *"Switch your audio engine to use the elevenlabs_speaker adapter."*
+### 5. Uninstalling
 If you wish to completely remove the server and its downloaded ML models from your system to free up space:

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "voice-mcp-server",
-  "version": "0.2.0",
+  "version": "0.3.0",
   "description": "An MCP server to allow LLMs to speak and listen via bidirectional voice loops",
   "main": "build/index.js",
   "type": "module",
@@ -30,6 +30,10 @@
   ],
   "author": "Erick Vazquez Santillan",
   "license": "MIT",
+  "repository": {
+    "type": "git",
+    "url": "git+https://github.com/erickvs/voice-mcp-server.git"
+  },
   "dependencies": {
     "@modelcontextprotocol/sdk": "^1.5.0"
   },

package/src/adapters_real/elevenlabs_speaker.py CHANGED Viewed

@@ -16,7 +16,7 @@ class ElevenLabsSpeaker(ISpeaker):
         self.words = []
         self.process = None
         self.start_time = 0
-        self.voice_id = voice_id
+        self.voice_id = os.getenv("ELEVENLABS_VOICE_ID", voice_id)
         self.api_key = os.getenv("ELEVENLABS_API_KEY")
         self.temp_file = "/tmp/elevenlabs_output.mp3"