agentvibes 2.0.17 → 2.0.18

Files changed (42)
  1. package/.claude/activation-instructions +54 -0
  2. package/.claude/github-star-reminder.txt +1 -1
  3. package/.claude/hooks/bmad-tts-injector.sh +33 -4
  4. package/.claude/hooks/bmad-voice-manager.sh +146 -10
  5. package/.claude/hooks/check-output-style.sh +60 -8
  6. package/.claude/hooks/github-star-reminder.sh +67 -7
  7. package/.claude/hooks/language-manager.sh +34 -2
  8. package/.claude/hooks/learn-manager.sh +34 -2
  9. package/.claude/hooks/personality-manager.sh +34 -1
  10. package/.claude/hooks/piper-download-voices.sh +131 -9
  11. package/.claude/hooks/piper-installer.sh +38 -4
  12. package/.claude/hooks/piper-voice-manager.sh +102 -36
  13. package/.claude/hooks/play-tts-elevenlabs.sh +32 -6
  14. package/.claude/hooks/play-tts-piper.sh +79 -18
  15. package/.claude/hooks/play-tts.sh +34 -3
  16. package/.claude/hooks/provider-commands.sh +36 -1
  17. package/.claude/hooks/provider-manager.sh +32 -1
  18. package/.claude/hooks/replay-target-audio.sh +34 -3
  19. package/.claude/hooks/sentiment-manager.sh +39 -1
  20. package/.claude/hooks/speed-manager.sh +36 -2
  21. package/.claude/hooks/voice-manager.sh +85 -6
  22. package/.claude/hooks/voices-config.sh +39 -2
  23. package/README.md +77 -137
  24. package/RELEASE_NOTES.md +269 -0
  25. package/RELEASE_NOTES_v2.0.17_DRAFT.md +650 -0
  26. package/docs/ai-optimized-documentation-standards.md +78 -4
  27. package/docs/bryce-beattie-voice-licensing.md +131 -0
  28. package/docs/commands.md +21 -1
  29. package/docs/mcp-setup.md +1 -6
  30. package/github-star-reminder.txt +1 -0
  31. package/mcp-server/WINDOWS_SETUP.md +182 -199
  32. package/mcp-server/docs/elevenlabs-setup.md +213 -0
  33. package/mcp-server/docs/troubleshooting-audio.md +316 -0
  34. package/mcp-server/server.py +38 -4
  35. package/mcp-server/test_server.py +38 -3
  36. package/package.json +1 -1
  37. package/test/helpers/test-helper.bash +31 -4
  38. package/test/unit/personality-manager.bats +22 -6
  39. package/test/unit/play-tts.bats +21 -3
  40. package/test/unit/provider-manager.bats +508 -0
  41. package/test/unit/speed-manager.bats +436 -0
  42. package/test/unit/voice-manager.bats +4 -1
@@ -20,6 +20,37 @@ Every file must begin with a comprehensive context header that provides AI assis
 
  ```javascript
  /**
+ * File: [path/to/file.js]
+ *
+ * AgentVibes - Finally, your AI Agents can Talk Back! Text-to-Speech WITH personality for AI Assistants!
+ * Website: https://agentvibes.org
+ * Repository: https://github.com/paulpreibisch/AgentVibes
+ *
+ * Co-created by Paul Preibisch with Claude AI
+ * Copyright (c) 2025 Paul Preibisch
+ *
+ * Licensed under the Apache License, Version 2.0 (the "License");
+ * you may not use this file except in compliance with the License.
+ * You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ *
+ * DISCLAIMER: This software is provided "AS IS", WITHOUT WARRANTY OF ANY KIND,
+ * express or implied, including but not limited to the warranties of
+ * merchantability, fitness for a particular purpose and noninfringement.
+ * In no event shall the authors or copyright holders be liable for any claim,
+ * damages or other liability, whether in an action of contract, tort or
+ * otherwise, arising from, out of or in connection with the software or the
+ * use or other dealings in the software.
+ *
+ * ---
+ *
  * @fileoverview [Brief description]
  * @context [Why this exists, what problem it solves]
  * @architecture [How it fits in the system, patterns used]
@@ -58,6 +89,33 @@ Every file must begin with a comprehensive context header that provides AI assis
 
  ```bash
  #!/bin/bash
+ #
+ # File: .claude/hooks/play-tts-elevenlabs.sh
+ #
+ # AgentVibes - Professional Text-to-Speech for AI Assistants
+ # Website: https://agentvibes.org
+ # Repository: https://github.com/paulpreibisch/AgentVibes
+ #
+ # Co-created by Paul Preibisch with Claude AI
+ # Copyright (c) 2025 Paul Preibisch
+ #
+ # Licensed under the Apache License, Version 2.0 (the "License");
+ # you may not use this file except in compliance with the License.
+ # You may obtain a copy of the License at
+ #
+ # http://www.apache.org/licenses/LICENSE-2.0
+ #
+ # Unless required by applicable law or agreed to in writing, software
+ # distributed under the License is distributed on an "AS IS" BASIS,
+ # WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ # See the License for the specific language governing permissions and
+ # limitations under the License.
+ #
+ # DISCLAIMER: This software is provided "AS IS", WITHOUT WARRANTY OF ANY KIND,
+ # express or implied. Use at your own risk. See the Apache License for details.
+ #
+ # ---
+ #
  # @fileoverview ElevenLabs TTS provider implementation
  # @context Handles all ElevenLabs API interactions for voice synthesis
  # @architecture Provider interface implementation, follows play-tts.sh contract
@@ -297,10 +355,26 @@ When writing new code:
 
  - **Source PRD**: `docs/prd.md` (Multi-Provider TTS System)
  - **Example Implementation**: `.claude/hooks/play-tts.sh`, `provider-manager.sh`
- - **GitHub Issue**: #25 (Multi-Provider TTS Feature)
+ - **GitHub Repository**: https://github.com/paulpreibisch/AgentVibes
+ - **Project Website**: https://agentvibes.org
 
  ---
 
- **Generated from**: AgentVibes Multi-Provider TTS System PRD v1.0
- **Extracted by**: John (PM)
- **Date**: 2025-01-05
+ ## Document Information
+
+ **AgentVibes - Finally, your AI Agents can Talk Back! Text-to-Speech WITH personality for AI Assistants!**
+
+ - **Website**: https://agentvibes.org
+ - **Repository**: https://github.com/paulpreibisch/AgentVibes
+ - **Co-created by**: Paul Preibisch with Claude AI
+ - **Copyright**: © 2025 Paul Preibisch
+ - **License**: Apache-2.0
+ - **Version**: 1.0
+ - **Last Updated**: 2025-01-17
+
+ ---
+
+ **DISCLAIMER**: This software is provided "AS IS", WITHOUT WARRANTY OF ANY KIND, express or implied, including but not limited to the warranties of merchantability, fitness for a particular purpose and noninfringement. In no event shall the authors or copyright holders be liable for any claim, damages or other liability, whether in an action of contract, tort or otherwise, arising from, out of or in connection with the software or the use or other dealings in the software.
+
+ For full license terms, see the LICENSE file in the project root or visit:
+ http://www.apache.org/licenses/LICENSE-2.0
@@ -0,0 +1,131 @@
+ # Bryce Beattie Custom Piper Voices - Licensing Documentation
+
+ **Source:** https://brycebeattie.com/files/tts/
+ **Date Captured:** 2025-10-17
+ **Purpose:** Legal documentation for including custom voices in AgentVibes repository
+
+ ---
+
+ ## Creator's Explicit Permission
+
+ > "Feel free to use these for any legal and ethical purpose. If somebody wants to upload these to HuggingFace or somewhere similar, you have my blessing."
+ >
+ > — Bryce Beattie
+
+ **Source URL:** https://brycebeattie.com/files/tts/
+
+ ---
+
+ ## Voices We're Including
+
+ ### 1. Kristin
+ - **Type:** US English female
+ - **License:** Public Domain
+ - **Training:** 2000 epochs on medium quality settings
+ - **Dataset:** ~11.5 hours from public domain LibriVox.org recordings
+ - **Download Links:**
+ - .onnx: https://sfo3.digitaloceanspaces.com/bkmdls/kristin.onnx
+ - .json: https://sfo3.digitaloceanspaces.com/bkmdls/kristin.onnx.json
+
+ ### 2. Jenny (Dioco)
+ - **Type:** UK English female (Irish accent)
+ - **License:** CC BY (Attribution required)
+ - **Attribution Note:** Similar to Creative Commons Attribution
+ - **Original Source:** Kaggle dataset (details on source page)
+ - **Download Links:**
+ - .onnx: https://sfo3.digitaloceanspaces.com/bkmdls/jenny.onnx
+ - .json: https://sfo3.digitaloceanspaces.com/bkmdls/jenny.onnx.json
+
+ ### 3. ManyVoice
+ - **Type:** Multi-speaker (16 voices: 12 US English, 4 UK English)
+ - **License:** Public Domain
+ - **Training:** 400 epochs on medium quality settings
+ - **Dataset:** ~8 hours per voice from public domain LibriVox recordings with sound quality enhancements
+ - **Download Links:**
+ - .onnx: https://sfo3.digitaloceanspaces.com/bkmdls/mv2.onnx
+ - .json: https://sfo3.digitaloceanspaces.com/bkmdls/mv2.onnx.json
+
+ ---
+
+ ## License Compatibility with Apache 2.0
+
+ ### Public Domain Voices (Kristin, ManyVoice)
+ ✅ **Fully compatible** - No restrictions on redistribution, modification, or commercial use
+
+ ### CC BY Licensed Voice (Jenny)
+ ✅ **Compatible** - Requires attribution:
+ - Include credit to Bryce Beattie
+ - Link to source: https://brycebeattie.com/files/tts/
+ - Mention CC BY license
+ - Already added to AgentVibes README and credits
+
+ ---
+
+ ## Legal Basis for Inclusion in AgentVibes Repository
+
+ 1. **Explicit Permission:** Creator granted blessing for redistribution
+ 2. **Public Domain:** Kristin and ManyVoice have no licensing restrictions
+ 3. **CC BY Compliance:** Jenny voice attributed properly in:
+ - README.md (Useful Links section)
+ - README.md (Credits/Special Thanks)
+ - VOICE_CREDITS.md (dedicated attribution file)
+ 4. **Apache 2.0 Compatibility:** All licenses compatible with our project license
+
+ ---
+
+ ## Attribution Requirements
+
+ ### For Jenny Voice (CC BY):
+ ```
+ Jenny voice by Bryce Beattie
+ Source: https://brycebeattie.com/files/tts/
+ License: CC BY (Creative Commons Attribution)
+ Original dataset: Dioco dataset (Kaggle)
+ ```
+
+ ### For Kristin and ManyVoice (Public Domain):
+ ```
+ Kristin and ManyVoice by Bryce Beattie
+ Source: https://brycebeattie.com/files/tts/
+ License: Public Domain
+ Dataset: LibriVox.org public domain recordings
+ ```
+
+ ---
+
+ ## File Locations in AgentVibes
+
+ - Voice models: `mcp-server/voices/` (to be created)
+ - Download script: `.claude/hooks/piper-download-voices.sh`
+ - Credits: `README.md` and `VOICE_CREDITS.md`
+
+ ---
+
+ ## Additional Voices Available (Not Included)
+
+ Other voices available from Bryce Beattie that we chose not to include:
+ - LJSpeech (medium and high)
+ - Cori (high and medium)
+ - John
+ - Norman
+ - Clean 100 (excluded per user preference)
+
+ ---
+
+ ## Notes
+
+ - All voice models are ONNX format for Piper TTS
+ - Each voice includes .onnx model file and .onnx.json config file
+ - Voices trained on LibriVox public domain audiobooks
+ - Sound quality enhancements applied during training
+ - Compatible with Piper TTS engine (https://github.com/rhasspy/piper)
+
+ ---
+
+ ## Legal Disclaimer
+
+ This documentation represents our good-faith interpretation of the licensing information provided by Bryce Beattie. The explicit permission granted ("you have my blessing") combined with Public Domain and CC BY licenses provides clear legal basis for inclusion in the AgentVibes repository under Apache 2.0 license.
+
+ **Date of Legal Review:** 2025-10-17
+ **Reviewed By:** AgentVibes Project Team
+ **Status:** ✅ Approved for inclusion with proper attribution
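Reviewer note: every download link in the licensing document shares one base URL and pairs an `.onnx` model with an `.onnx.json` config. The sketch below shows how those pairs could be derived from a file stem. It assumes a hypothetical helper named `voice_urls`; it is not taken from the package, and the actual logic in `.claude/hooks/piper-download-voices.sh` is not visible in this diff excerpt.

```bash
#!/usr/bin/env bash
# Hypothetical helper for illustration only; not part of AgentVibes.
# Base URL copied from the download links in the licensing document.
BASE_URL="https://sfo3.digitaloceanspaces.com/bkmdls"

# Print the model URL and the config URL for one voice file stem
# (kristin, jenny, or mv2 for ManyVoice), one URL per line.
voice_urls() {
  local stem="$1"
  if [[ -z "$stem" ]]; then
    echo "usage: voice_urls <stem>" >&2
    return 1
  fi
  printf '%s/%s.onnx\n' "$BASE_URL" "$stem"
  printf '%s/%s.onnx.json\n' "$BASE_URL" "$stem"
}
```

One possible use is piping the result to a downloader, for example `voice_urls kristin | xargs -n 1 curl -fLO`, which fetches both files into the current directory.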
package/docs/commands.md CHANGED
@@ -6,8 +6,9 @@ All commands are prefixed with `/agent-vibes:`
 
  | Command | AgentVibes MCP Equivalent | Description |
  |---------|----------------|-------------|
- | `/agent-vibes:list` | "List all voices" or "What voices are available?" | Show all available voices |
+ | `/agent-vibes:list` | "List all voices" or "What voices are available?" | Show all available voices (expands multi-speaker voices) |
  | `/agent-vibes:switch <voice>` | "Switch to Aria voice" or "Change voice to Cowboy Bob" | Change to a different voice |
+ | `/agent-vibes:switch <voice>#<speaker>` | "Switch to 16Speakers#15" or "Use Kristin from 16Speakers" | Switch to specific speaker in multi-speaker voice |
  | `/agent-vibes:whoami` | "What's my current voice?" or "Show my configuration" | Show current voice, sentiment & personality |
  | `/agent-vibes:preview [N]` | "Preview voices" or "Let me hear the first 5 voices" | Preview voices with audio samples |
  | `/agent-vibes:sample <voice>` | "Test Aria voice" or "Let me hear Cowboy Bob" | Test a specific voice |
@@ -15,6 +16,25 @@ All commands are prefixed with `/agent-vibes:`
  | `/agent-vibes:replay [N]` | "Replay last message" or "Replay the 3rd message" | Replay recent TTS audio |
  | `/agent-vibes:get` | "What voice am I using?" or "Get current voice" | Get currently selected voice |
 
+ ### Multi-Speaker Voices
+
+ AgentVibes supports multi-speaker Piper voices that contain multiple voice personalities in one model:
+
+ - **16Speakers** - Contains 16 different voices (Cori, Kara, Kristin, Maria, Mike, Mark, Michael, James, Rose, progressingamerica, Steve, Owlivia, Paul, Jennifer, Emily, Martin)
+
+ **Usage:**
+ ```bash
+ # List all speakers in a multi-speaker voice
+ /agent-vibes:list
+
+ # Switch to a specific speaker (0-based index)
+ /agent-vibes:switch 16Speakers#0 # Cori Samuel
+ /agent-vibes:switch 16Speakers#2 # Kristin Hughes
+ /agent-vibes:switch 16Speakers#15 # Martin Clifton
+ ```
+
+ The list command will show all available speakers with their names and IDs, marking your current selection with `▶`.
+
  ## System Commands
 
  | Command | AgentVibes MCP Equivalent | Description |
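Reviewer note: the `<voice>#<speaker>` syntax added in this hunk combines a voice name with a 0-based speaker index. The sketch below shows one way such an argument could be split and validated. It assumes a hypothetical function named `parse_voice_spec`; the real handling in `voice-manager.sh` is not shown in this excerpt and may differ.

```bash
#!/usr/bin/env bash
# Hypothetical helper for illustration only; not taken from voice-manager.sh.
# Splits "Voice#N" into the voice name and a 0-based speaker index.
# Prints "<voice><TAB><index>"; the index is empty for a plain voice name.
parse_voice_spec() {
  local spec="$1"
  local voice="${spec%%#*}"   # text before the first '#'
  local speaker=""
  if [[ "$spec" == *"#"* ]]; then
    speaker="${spec#*#}"      # text after the first '#'
    # Reject anything that is not a non-negative integer.
    if [[ ! "$speaker" =~ ^[0-9]+$ ]]; then
      echo "invalid speaker index: '$speaker'" >&2
      return 1
    fi
  fi
  printf '%s\t%s\n' "$voice" "$speaker"
}
```

A tab separates the two fields because voice names such as `Cowboy Bob` contain spaces. A caller could read them back with `IFS=$'\t' read -r voice speaker < <(parse_voice_spec "16Speakers#2")`.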
package/docs/mcp-setup.md CHANGED
@@ -26,12 +26,7 @@ Instead of remembering slash commands like `/agent-vibes:switch Aria`, just say:
 
  ### For Claude Desktop
 
- **⚠️ IMPORTANT Requirements:**
-
- - **Windows:** WSL (Windows Subsystem for Linux) must be enabled
- - **Python 3:** Must be installed on your system ([Download Python](https://www.python.org/downloads/))
- - The MCP server will automatically install Python packages (mcp, pipx, Piper TTS)
- - But Python 3 itself must be installed first by you
+ **📖 Read This First:** If you haven't installed Claude Desktop yet, [click here for the Windows Setup Guide](../mcp-server/WINDOWS_SETUP.md) - it covers WSL setup, Python installation, and all prerequisites.
 
  **Step 1: Set Your ElevenLabs API Key (Windows)**
 
@@ -0,0 +1 @@
+ 20251017