npm - claude-voice - Versions diffs - 1.5.10 → 1.5.11 - Mend

claude-voice 1.5.10 → 1.5.11

Files changed (2)

package/README.md +62 -10
package/package.json +1 -1

package/README.md CHANGED

@@ -18,7 +18,21 @@
 npm install -g claude-voice
 ```
-That's it. The extension auto-installs hooks and starts with Claude Code. Say **"Hey Jarvis"** and speak your command.
+That's it. The extension auto-installs hooks, downloads speech models, and starts with Claude Code. Say **"Hey Jarvis"** and speak your command.
+## What Gets Installed
+The postinstall script sets up everything automatically:
+1. Default configuration (`~/.claude-voice/config.json`)
+2. Claude Code hooks (session-start, stop, post-tool-use, notification)
+3. Claude Code plugin (voice skill)
+4. Wake word model (openWakeWord or Sherpa-ONNX KWS)
+5. **STT model** — Whisper Small (~488MB) for high-accuracy speech recognition
+6. **TTS voice** — Piper neural voice (`en_US-joe-medium`, ~50MB)
+7. Audio tools check (sox, afplay, etc.)
+Total download: ~1GB on first install. All models run locally — no API keys needed.
 ## Why Claude Voice?
@@ -26,10 +40,12 @@ That's it. The extension auto-installs hooks and starts with Claude Code. Say **
 **Hear every response** — Claude doesn't just respond in text. It speaks back to you. Step away from the screen and still follow along.
-**Works offline** — Local STT (Sherpa-ONNX) and local TTS (Piper, macOS Say) mean zero API calls, zero latency, zero cost. Your voice data never leaves your machine.
+**Works offline** — Local STT (Whisper Small via Sherpa-ONNX) and local TTS (Piper neural voice) mean zero API calls, zero latency, zero cost. Your voice data never leaves your machine.
 **5 TTS providers, 3 STT providers** — Choose the quality/cost/privacy balance that works for you. From free local voices to premium OpenAI and ElevenLabs.
+**Custom wake words** — Train your own Picovoice keyword model and drop the `.ppn` file in `~/.claude-voice/models/`. Say "Hey Claude" instead of "Hey Jarvis".
 **Tool announcements** — Hear "Build completed", "Tests passed", "File written" while you're making coffee. Know what Claude is doing without watching the screen.
 **Multi-language** — Speak in English, Turkish, German, French, Spanish, Japanese, Chinese, and more. Change with one command: `claude-voice config set stt.language=tr`
@@ -47,9 +63,9 @@ The extension integrates via Claude Code hooks: auto-start on session, speak res
 | | Local (Free) | Cloud |
 |---|---|---|
-| **TTS** | macOS Say, Piper, espeak | OpenAI, ElevenLabs |
-| **STT** | Sherpa-ONNX Whisper | OpenAI Whisper |
-| **Wake Word** | openWakeWord, Sherpa-ONNX | Picovoice |
+| **TTS** | Piper (default), macOS Say, espeak | OpenAI, ElevenLabs |
+| **STT** | Sherpa-ONNX Whisper Small (default) | OpenAI Whisper |
+| **Wake Word** | openWakeWord (default), Sherpa-ONNX | Picovoice |
 **Quick presets:**
@@ -59,6 +75,33 @@ claude-voice openai             # Cloud TTS + STT (requires API key)
 claude-voice local --download   # Piper TTS + larger Whisper model (offline)
 ```
+## Wake Word
+### Default: openWakeWord
+Works out of the box. Say **"Hey Jarvis"** to activate.
+### Picovoice (best accuracy)
+Picovoice offers 97%+ keyword detection accuracy with custom wake words.
+```bash
+claude-voice picovoice --key YOUR_ACCESS_KEY
+```
+Get a free access key at [console.picovoice.ai](https://console.picovoice.ai/).
+### Custom Wake Words (Picovoice)
+Train a custom keyword at [Picovoice Console](https://console.picovoice.ai/), download the `.ppn` file, and place it in `~/.claude-voice/models/`:
+```bash
+# Example: use "Hey Claude" as wake word
+cp hey-claude_en_mac_v3_0_0.ppn ~/.claude-voice/models/hey-claude.ppn
+claude-voice config set wakeWord.provider=picovoice
+claude-voice config set wakeWord.keyword=hey-claude
+claude-voice restart
+```
+The detector searches for `<keyword>.ppn` in `~/.claude-voice/models/` first, then falls back to Picovoice built-in keywords (jarvis, alexa, computer, etc.).
 ## Configuration
 ```bash
@@ -75,13 +118,15 @@ Config file: `~/.claude-voice/config.json`
 | Option | Default | Description |
 |--------|---------|-------------|
-| `tts.provider` | `macos-say` | macos-say, piper, openai, elevenlabs, espeak, disabled |
-| `tts.autoSpeak` | `true` | Auto-speak Claude responses |
+| `tts.provider` | `piper` | piper, macos-say, openai, elevenlabs, espeak, disabled |
+| `tts.autoSpeak` | `false` | Auto-speak Claude responses |
 | `tts.maxSpeechLength` | `5000` | Max characters to speak |
 | `stt.provider` | `sherpa-onnx` | sherpa-onnx, openai, whisper-local, disabled |
+| `stt.sherpaOnnx.model` | `whisper-small` | whisper-tiny (75MB), whisper-base (142MB), whisper-small (488MB) |
 | `stt.language` | `en` | Language code (en, tr, de, fr, es, ja, zh...) |
 | `wakeWord.enabled` | `true` | Enable wake word detection |
 | `wakeWord.provider` | `openwakeword` | openwakeword, sherpa-onnx, picovoice |
+| `wakeWord.keyword` | `jarvis` | Wake word (or custom .ppn filename without extension) |
 | `wakeWord.sensitivity` | `0.5` | Detection sensitivity (0.0-1.0) |
 | `voiceOutput.enabled` | `false` | TTS-friendly response formatting |
 | `toolTTS.enabled` | `false` | Announce tool completions |
@@ -100,6 +145,11 @@ claude-voice start / stop / restart / status
 claude-voice setup                # Interactive wizard
 claude-voice doctor               # Diagnose issues
+# Provider Presets
+claude-voice openai               # Cloud TTS + STT
+claude-voice local --download     # Local with model downloads
+claude-voice picovoice --key KEY  # Best wake word accuracy
 # Models & Voices
 claude-voice model list / download <id>     # STT models (whisper-tiny/base/small)
 claude-voice voice list / download <id>     # Piper TTS voices
@@ -120,11 +170,11 @@ claude-voice devices              # List audio devices
 | | macOS | Linux |
 |---|---|---|
-| TTS | Say, Piper, OpenAI, ElevenLabs | espeak, Piper, OpenAI, ElevenLabs |
+| TTS | Piper, Say, OpenAI, ElevenLabs | Piper, espeak, OpenAI, ElevenLabs |
 | STT | Sherpa-ONNX, OpenAI | Sherpa-ONNX, OpenAI |
 | Wake Word | openWakeWord, Sherpa-ONNX, Picovoice | openWakeWord, Sherpa-ONNX, Picovoice |
-**Requires:** Node.js 18+, microphone access. Python 3 recommended (for openWakeWord).
+**Requires:** Node.js 18+, microphone access. Python 3 recommended (for Piper TTS and openWakeWord).
 ## Troubleshooting
@@ -134,7 +184,9 @@ claude-voice logs                 # Check daemon logs
 claude-voice start -f             # Run in foreground for debugging
 ```
-**Wake word not detecting?** Run `claude-voice openwakeword --install` for better accuracy.
+**Wake word not detecting?** Run `claude-voice openwakeword --install` for better accuracy, or switch to Picovoice for 97%+ detection rate.
+**STT not working?** Check model is installed: `claude-voice model list`. Re-download if needed: `claude-voice model download whisper-small`.
 ## License
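
Note on the wake word hunk in the README diff: the added text says the detector looks for `<keyword>.ppn` in `~/.claude-voice/models/` first and falls back to Picovoice built-in keywords otherwise. The package's detector code is not part of this diff, so the following is only a minimal shell sketch of that documented lookup order. The function name `resolve_wake_keyword` and the `MODELS_DIR` override are hypothetical and exist only for illustration; they are not claude-voice APIs or settings.

```bash
#!/usr/bin/env bash
# Hypothetical sketch of the lookup order described in the README diff.
# Not taken from the claude-voice source.

resolve_wake_keyword() {
  local keyword="$1"
  # MODELS_DIR is an assumed override used only in this sketch; the README
  # documents just the default path ~/.claude-voice/models/.
  local models_dir="${MODELS_DIR:-$HOME/.claude-voice/models}"
  local custom="$models_dir/$keyword.ppn"

  if [ -f "$custom" ]; then
    # A custom .ppn file with a matching name takes priority.
    printf 'custom %s\n' "$custom"
  else
    # Otherwise the name is treated as a Picovoice built-in keyword.
    printf 'builtin %s\n' "$keyword"
  fi
}
```

With a file copied to `~/.claude-voice/models/hey-claude.ppn` as in the README example, `resolve_wake_keyword hey-claude` would print the custom path, while `resolve_wake_keyword jarvis` would report a built-in keyword as long as no `jarvis.ppn` file exists in that directory.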

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "claude-voice",
-  "version": "1.5.10",
+  "version": "1.5.11",
   "description": "Voice interface extension for Claude Code - TTS, STT, and wake word detection",
   "main": "dist/index.js",
   "types": "dist/index.d.ts",