agentvibes 3.3.0-alpha.5 → 3.3.0-alpha.6
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +224 -23
- package/package.json +1 -1
package/README.md
CHANGED
|
@@ -4,14 +4,14 @@
|
|
|
4
4
|
>
|
|
5
5
|
> 🌐 **[agentvibes.org](https://agentvibes.org)**
|
|
6
6
|
>
|
|
7
|
-
> Professional text-to-speech for **Claude Code**, **Claude Desktop**, **Warp Terminal**, and **
|
|
7
|
+
> Professional text-to-speech for **Claude Code**, **Claude Desktop**, **Warp Terminal**, and **OpenClaw** - **Piper TTS (Free!)** or **macOS Say (Built-in!)**
|
|
8
8
|
|
|
9
9
|
[](https://www.npmjs.com/package/agentvibes)
|
|
10
10
|
[](https://github.com/paulpreibisch/AgentVibes/actions/workflows/test.yml)
|
|
11
11
|
[](https://github.com/paulpreibisch/AgentVibes/actions/workflows/publish.yml)
|
|
12
12
|
[](https://opensource.org/licenses/Apache-2.0)
|
|
13
13
|
|
|
14
|
-
**Author**: Paul Preibisch ([@997Fire](https://x.com/997Fire)) | **Version**: v3.
|
|
14
|
+
**Author**: Paul Preibisch ([@997Fire](https://x.com/997Fire)) | **Version**: v3.3.0-alpha.5
|
|
15
15
|
|
|
16
16
|
---
|
|
17
17
|
|
|
@@ -22,7 +22,7 @@
|
|
|
22
22
|
| **Install AgentVibes** (just `npx`, no git!) | [Quick Start Guide](docs/quick-start.md) |
|
|
23
23
|
| **Understand what I need** (spoiler: just Node.js!) | [Prerequisites](#-prerequisites) |
|
|
24
24
|
| **Set up on Windows (Claude Desktop)** | [Windows Setup Guide](mcp-server/WINDOWS_SETUP.md) |
|
|
25
|
-
| **Use with
|
|
25
|
+
| **Use with OpenClaw** | [OpenClaw Integration](#-openclaw-integration) |
|
|
26
26
|
| **Run Claude Code on Android** | [Android/Termux Setup](#-android--termux) |
|
|
27
27
|
| **Use natural language** | [MCP Setup](docs/mcp-setup.md) |
|
|
28
28
|
| **Switch voices** | [Voice Library](docs/voice-library.md) |
|
|
@@ -35,7 +35,7 @@
|
|
|
35
35
|
|
|
36
36
|
**AgentVibes adds lively voice narration to your Claude AI sessions!**
|
|
37
37
|
|
|
38
|
-
Whether you're coding in Claude Code, chatting in Claude Desktop, using Warp Terminal, or running
|
|
38
|
+
Whether you're coding in Claude Code, chatting in Claude Desktop, using Warp Terminal, or running OpenClaw - AgentVibes brings AI to life with professional voices and personalities.
|
|
39
39
|
|
|
40
40
|
### 🎯 Key Features
|
|
41
41
|
|
|
@@ -101,6 +101,7 @@ All 50+ Piper voices AgentVibes provides are sourced from Hugging Face's open-so
|
|
|
101
101
|
- [💡 Common Workflows](#-common-workflows) - Quick examples
|
|
102
102
|
- [🔧 Advanced Features](#-advanced-features) - Custom voices & personalities
|
|
103
103
|
- [🔊 Remote Audio Setup](#-remote-audio-setup) - Play TTS from remote servers
|
|
104
|
+
- [🔐 Security Hardening Guide](docs/security-hardening-guide.md) - **Required for production**: SSH, Fail2Ban, Tailscale, UFW, AIDE
|
|
104
105
|
- [🔬 Technical Deep Dive](docs/technical-deep-dive.md) - How AgentVibes works under the hood
|
|
105
106
|
- [❓ Troubleshooting](#-troubleshooting) - Common issues & fixes
|
|
106
107
|
|
|
@@ -117,17 +118,26 @@ All 50+ Piper voices AgentVibes provides are sourced from Hugging Face's open-so
|
|
|
117
118
|
|
|
118
119
|
## 📰 Latest Release
|
|
119
120
|
|
|
120
|
-
**[v3.
|
|
121
|
+
**[v3.3.0-alpha.5 - Remote Audio Revolution: Voiceless Servers → Phone & Mobile Playback](https://github.com/paulpreibisch/AgentVibes/releases/tag/v3.3.0-alpha.5)** 📱🔊
|
|
121
122
|
|
|
122
|
-
AgentVibes v3.
|
|
123
|
+
AgentVibes v3.3.0-alpha.5 brings breakthrough remote audio capabilities, turning your messaging apps into voice-enabled AI assistants! **Use Case:** Install [OpenClaw](https://openclaw.ai/) on a remote server, message it via Telegram or WhatsApp from anywhere, and AgentVibes (running in Termux on your phone) plays the TTS responses through your phone speakers—making it work like Siri, but powered by AgentVibes! This release enables voiceless servers to play audio remotely on phones, mobile devices, or any machine via SSH/PulseAudio tunneling. Also includes audio tracks directory structure fix and comprehensive OpenClaw skill documentation.
|
|
123
124
|
|
|
124
125
|
**Key Highlights:**
|
|
125
|
-
-
|
|
126
|
-
-
|
|
127
|
-
-
|
|
128
|
-
-
|
|
129
|
-
-
|
|
130
|
-
-
|
|
126
|
+
- 📱 **Voiceless Server Support** - Generate TTS on servers without audio hardware, play on remote devices
|
|
127
|
+
- 🔊 **Phone/Mobile Playback** - Audio tunnels from cloud servers to your phone or local machine via SSH
|
|
128
|
+
- 🌐 **PulseAudio SSH Tunneling** - Automatic audio routing through SSH reverse forwarding (port 14713)
|
|
129
|
+
- 🤖 **Enhanced OpenClaw Integration** - Complete skill documentation with 50+ voices and remote audio setup
|
|
130
|
+
- 🎵 **Audio Tracks Fix** - Corrected directory structure (backgrounds → tracks) with proper .npmignore entries
|
|
131
|
+
- 📦 **Package Size Optimization** - 8.3 MB unpacked, 172 files, optimized for npm distribution
|
|
132
|
+
- 🛡️ **Security & Quality** - Removed sensitive data from git history, SonarCloud compliance
|
|
133
|
+
|
|
134
|
+
**Perfect For:**
|
|
135
|
+
- Running AgentVibes/OpenClaw on AWS, GCP, Azure, DigitalOcean
|
|
136
|
+
- VS Code Remote SSH development with TTS feedback
|
|
137
|
+
- Android/Termux with audio playback on phone speakers
|
|
138
|
+
- Any headless server → local audio scenario
|
|
139
|
+
|
|
140
|
+
💡 **Alpha Testing:** This is a pre-release version. Report issues at [GitHub Issues](https://github.com/paulpreibisch/AgentVibes/issues)
|
|
131
141
|
|
|
132
142
|
💡 **Tip:** If `npx agentvibes` shows an older version or missing commands, clear your npm cache: `npm cache clean --force && npx agentvibes@latest --help`
|
|
133
143
|
|
|
@@ -626,15 +636,17 @@ This design means **any TTS provider** can integrate with BMAD by replacing thes
|
|
|
626
636
|
|
|
627
637
|
---
|
|
628
638
|
|
|
629
|
-
## 🤖
|
|
639
|
+
## 🤖 OpenClaw Integration
|
|
630
640
|
|
|
631
|
-
**Use AgentVibes TTS with
|
|
641
|
+
**Use AgentVibes TTS with OpenClaw - the revolutionary AI assistant you can access via any instant messenger!**
|
|
632
642
|
|
|
633
|
-
**What is
|
|
643
|
+
**What is OpenClaw?** [OpenClaw](https://openclaw.ai/) is a revolutionary AI assistant that brings Claude AI to your favorite messaging platforms - WhatsApp, Telegram, Discord, and more. No apps to install, no websites to visit - just message your AI assistant like you would a friend.
|
|
634
644
|
|
|
635
|
-
🌐 **Website**: https://
|
|
645
|
+
🌐 **Website**: https://openclaw.ai/
|
|
636
646
|
|
|
637
|
-
AgentVibes seamlessly integrates with
|
|
647
|
+
AgentVibes seamlessly integrates with OpenClaw, providing professional text-to-speech for AI assistants running on messaging platforms and remote servers.
|
|
648
|
+
|
|
649
|
+
⚠️ **Security Notice**: Running OpenClaw on a remote server with SSH access exposes your infrastructure to security risks. See our **[Security Hardening Guide](docs/security-hardening-guide.md)** for comprehensive setup instructions covering SSH hardening, Fail2Ban, Tailscale VPN, UFW firewall, and intrusion detection.
|
|
638
650
|
|
|
639
651
|
### 🎯 Key Benefits
|
|
640
652
|
|
|
@@ -645,17 +657,115 @@ AgentVibes seamlessly integrates with Clawdbot, providing professional text-to-s
|
|
|
645
657
|
|
|
646
658
|
### 🚀 Installation
|
|
647
659
|
|
|
660
|
+
AgentVibes includes a ready-to-use OpenClaw skill that enables TTS on messaging platforms. The setup involves two components:
|
|
661
|
+
|
|
662
|
+
#### Component 1: OpenClaw Server (Remote)
|
|
663
|
+
|
|
664
|
+
Install AgentVibes on your OpenClaw server:
|
|
665
|
+
|
|
666
|
+
```bash
|
|
667
|
+
# On your remote server where OpenClaw is running
|
|
668
|
+
npx agentvibes install
|
|
669
|
+
```
|
|
670
|
+
|
|
671
|
+
The OpenClaw skill is **automatically included** in the AgentVibes npm package at `.clawdbot/skill/SKILL.md`.
|
|
672
|
+
|
|
673
|
+
**How to activate the skill in OpenClaw:**
|
|
674
|
+
|
|
675
|
+
1. **Locate the skill** - After installing AgentVibes, the skill is at:
|
|
676
|
+
```
|
|
677
|
+
node_modules/agentvibes/.clawdbot/skill/SKILL.md
|
|
678
|
+
```
|
|
679
|
+
|
|
680
|
+
2. **Link to OpenClaw skills directory** (if OpenClaw uses skills):
|
|
681
|
+
```bash
|
|
682
|
+
# Example - adjust path based on your OpenClaw installation
|
|
683
|
+
ln -s $(npm root -g)/agentvibes/.clawdbot/skill/SKILL.md ~/.openclaw/skills/agentvibes.md
|
|
684
|
+
```
|
|
685
|
+
|
|
686
|
+
3. **OpenClaw auto-detection** - Many OpenClaw setups automatically detect AgentVibes when it's installed. Check your OpenClaw logs for:
|
|
687
|
+
```
|
|
688
|
+
✓ AgentVibes skill detected and loaded
|
|
689
|
+
```
|
|
690
|
+
|
|
691
|
+
#### Component 2: AgentVibes Receiver (Local/Phone) ⚠️ REQUIRED
|
|
692
|
+
|
|
693
|
+
**CRITICAL: You MUST install AgentVibes on your phone (or local machine) to receive and play audio!**
|
|
694
|
+
|
|
695
|
+
Without this, audio cannot be heard - the server generates TTS but needs a receiver to play it.
|
|
696
|
+
|
|
697
|
+
**Install on Android Phone (Termux):**
|
|
698
|
+
|
|
699
|
+
1. **Install Termux from F-Droid** (NOT Google Play):
|
|
700
|
+
- Download: https://f-droid.org/en/packages/com.termux/
|
|
701
|
+
|
|
702
|
+
2. **Install Node.js in Termux:**
|
|
703
|
+
```bash
|
|
704
|
+
pkg update && pkg upgrade
|
|
705
|
+
pkg install nodejs-lts
|
|
706
|
+
```
|
|
707
|
+
|
|
708
|
+
3. **Install AgentVibes in Termux:**
|
|
709
|
+
```bash
|
|
710
|
+
npx agentvibes install
|
|
711
|
+
```
|
|
712
|
+
|
|
713
|
+
4. **Install Termux:API** (for audio playback):
|
|
714
|
+
- Download: https://f-droid.org/en/packages/com.termux.api/
|
|
715
|
+
- Then in Termux: `pkg install termux-api`
|
|
716
|
+
|
|
717
|
+
**Install on Local Mac/Linux:**
|
|
718
|
+
|
|
648
719
|
```bash
|
|
649
720
|
npx agentvibes install
|
|
650
721
|
```
|
|
651
722
|
|
|
652
|
-
|
|
723
|
+
**Why is this needed?**
|
|
724
|
+
- The **server generates TTS** but has no speakers (headless)
|
|
725
|
+
- AgentVibes on your **phone acts as the audio receiver** via SSH tunnel
|
|
726
|
+
- Audio tunnels from server → SSH → phone → speakers 🔊
|
|
727
|
+
|
|
728
|
+
Without AgentVibes installed on the receiving device, you'll generate audio but hear nothing!
|
|
729
|
+
|
|
730
|
+
#### How It Works: Server → SSH Tunnel → Local Playback
|
|
731
|
+
|
|
732
|
+
```
|
|
733
|
+
┌─────────────────────────────────────────────────────────┐
|
|
734
|
+
│ 1. User messages OpenClaw via Telegram/WhatsApp │
|
|
735
|
+
│ "Tell me about the weather" │
|
|
736
|
+
└─────────────────────────────────────────────────────────┘
|
|
737
|
+
↓
|
|
738
|
+
┌─────────────────────────────────────────────────────────┐
|
|
739
|
+
│ 2. OpenClaw (Server) processes request with Claude │
|
|
740
|
+
│ AgentVibes skill generates TTS audio │
|
|
741
|
+
└─────────────────────────────────────────────────────────┘
|
|
742
|
+
↓
|
|
743
|
+
┌─────────────────────────────────────────────────────────┐
|
|
744
|
+
│ 3. Audio tunnels through SSH → PulseAudio (port 14713)│
|
|
745
|
+
│ Server: PULSE_SERVER=tcp:localhost:14713 │
|
|
746
|
+
└─────────────────────────────────────────────────────────┘
|
|
747
|
+
↓
|
|
748
|
+
┌─────────────────────────────────────────────────────────┐
|
|
749
|
+
│ 4. Local AgentVibes receives and plays audio │
|
|
750
|
+
│ Phone speakers, laptop speakers, etc. │
|
|
751
|
+
│ 🔊 "The weather is sunny and 72 degrees" │
|
|
752
|
+
└─────────────────────────────────────────────────────────┘
|
|
753
|
+
```
|
|
754
|
+
|
|
755
|
+
**Architecture:**
|
|
756
|
+
- **Server (OpenClaw)**: Generates TTS, sends via PulseAudio
|
|
757
|
+
- **SSH Tunnel**: RemoteForward port 14713 (encrypted transport)
|
|
758
|
+
- **Local (Termux/Desktop)**: AgentVibes receives audio, plays on speakers
|
|
759
|
+
|
|
760
|
+
This creates a **Siri-like experience** - message from anywhere, hear responses on your phone! 📱🎤
|
|
653
761
|
|
|
654
762
|
### 📝 Usage
|
|
655
763
|
|
|
764
|
+
#### Basic TTS Commands
|
|
765
|
+
|
|
656
766
|
```bash
|
|
657
767
|
# Basic TTS
|
|
658
|
-
npx agentvibes speak "Hello from
|
|
768
|
+
npx agentvibes speak "Hello from OpenClaw"
|
|
659
769
|
|
|
660
770
|
# With different voices
|
|
661
771
|
npx agentvibes speak "Hello" --voice en_US-amy-medium
|
|
@@ -665,9 +775,99 @@ npx agentvibes speak "Bonjour" --voice fr_FR-siwis-medium
|
|
|
665
775
|
npx agentvibes voices
|
|
666
776
|
```
|
|
667
777
|
|
|
778
|
+
#### Advanced: Direct Hook Usage with Voice Override
|
|
779
|
+
|
|
780
|
+
For programmatic control, use the TTS hook directly:
|
|
781
|
+
|
|
782
|
+
```bash
|
|
783
|
+
# Basic: Use default voice
|
|
784
|
+
bash ~/.claude/hooks/play-tts.sh "Hello from OpenClaw"
|
|
785
|
+
|
|
786
|
+
# Advanced: Override voice per message
|
|
787
|
+
bash ~/.claude/hooks/play-tts.sh "Welcome message" "en_US-amy-medium"
|
|
788
|
+
bash ~/.claude/hooks/play-tts.sh "Bonjour!" "fr_FR-siwis-medium"
|
|
789
|
+
bash ~/.claude/hooks/play-tts.sh "British greeting" "en_GB-alan-medium"
|
|
790
|
+
```
|
|
791
|
+
|
|
792
|
+
**Parameters:**
|
|
793
|
+
- `$1` - **TEXT** (required): Message to speak
|
|
794
|
+
- `$2` - **VOICE** (optional): Voice name to override default
|
|
795
|
+
|
|
796
|
+
#### Audio Effects Configuration for OpenClaw
|
|
797
|
+
|
|
798
|
+
**File**: `.claude/config/audio-effects.cfg`
|
|
799
|
+
|
|
800
|
+
Customize audio effects, background music, and voice processing per agent or use default settings:
|
|
801
|
+
|
|
802
|
+
**Format:**
|
|
803
|
+
```
|
|
804
|
+
AGENT_NAME|SOX_EFFECTS|BACKGROUND_FILE|BACKGROUND_VOLUME
|
|
805
|
+
```
|
|
806
|
+
|
|
807
|
+
**Example Configuration:**
|
|
808
|
+
|
|
809
|
+
```bash
|
|
810
|
+
# Default - subtle background music
|
|
811
|
+
default||agentvibes_soft_flamenco_loop.mp3|0.30
|
|
812
|
+
|
|
813
|
+
# Custom agent with reverb + background
|
|
814
|
+
MyAgent|reverb 40 50 90 gain -2|agentvibes_soft_flamenco_loop.mp3|0.20
|
|
815
|
+
|
|
816
|
+
# Agent with pitch shift and EQ
|
|
817
|
+
Assistant|pitch -100 equalizer 3000 1q +2|agentvibes_dark_chill_step_loop.mp3|0.15
|
|
818
|
+
```
|
|
819
|
+
|
|
820
|
+
**Available SOX Effects:**
|
|
821
|
+
|
|
822
|
+
| Effect | Syntax | Example | Description |
|
|
823
|
+
|--------|--------|---------|-------------|
|
|
824
|
+
| **Reverb** | `reverb <reverberance> <HF-damping> <room-scale>` | `reverb 40 50 90` | Adds room ambiance (light: 30 40 70, heavy: 50 60 100) |
|
|
825
|
+
| **Pitch** | `pitch <cents>` | `pitch -100` | Shift pitch (100 cents = 1 semitone, negative = lower) |
|
|
826
|
+
| **Equalizer** | `equalizer <freq> <width>q <gain-dB>` | `equalizer 3000 1q +2` | Boost/cut frequencies (bass: 200Hz, treble: 4000Hz) |
|
|
827
|
+
| **Gain** | `gain <dB>` | `gain -2` | Adjust volume (negative = quieter, positive = louder) |
|
|
828
|
+
| **Compand** | `compand <attack,decay> <threshold:in,out>` | `compand 0.3,1 6:-70,-60,-20` | Dynamic range compression (makes quiet parts louder) |
|
|
829
|
+
|
|
830
|
+
**Background Music Tracks:**
|
|
831
|
+
|
|
832
|
+
Built-in tracks available in `.claude/audio/tracks/`:
|
|
833
|
+
- `agentvibes_soft_flamenco_loop.mp3` - Warm, rhythmic flamenco
|
|
834
|
+
- `agentvibes_dark_chill_step_loop.mp3` - Modern chill electronic
|
|
835
|
+
- (50+ additional tracks available)
|
|
836
|
+
|
|
837
|
+
**Background Volume:**
|
|
838
|
+
- `0.10` - Very subtle (10%)
|
|
839
|
+
- `0.20` - Subtle (20%)
|
|
840
|
+
- `0.30` - Moderate (30%, recommended default)
|
|
841
|
+
- `0.40` - Noticeable (40%, party mode)
|
|
842
|
+
|
|
843
|
+
**Example: OpenClaw Custom Configuration**
|
|
844
|
+
|
|
845
|
+
Create `.claude/config/audio-effects.cfg` on your OpenClaw server:
|
|
846
|
+
|
|
847
|
+
```bash
|
|
848
|
+
# OpenClaw assistant - warm voice with subtle reverb
|
|
849
|
+
OpenClaw|reverb 30 40 70 gain -1|agentvibes_soft_flamenco_loop.mp3|0.25
|
|
850
|
+
|
|
851
|
+
# Help desk agent - clear, bright voice
|
|
852
|
+
HelpDesk|equalizer 4000 1q +3 compand 0.2,0.5 6:-70,-60,-20|agentvibes_dark_chill_step_loop.mp3|0.15
|
|
853
|
+
|
|
854
|
+
# Default fallback
|
|
855
|
+
default||agentvibes_soft_flamenco_loop.mp3|0.30
|
|
856
|
+
```
|
|
857
|
+
|
|
858
|
+
**How AgentVibes Applies Effects:**
|
|
859
|
+
|
|
860
|
+
1. **Generate TTS** - Create base audio with Piper TTS
|
|
861
|
+
2. **Apply SOX effects** - Process audio (reverb, EQ, pitch, etc.)
|
|
862
|
+
3. **Mix background** - Blend background music at specified volume
|
|
863
|
+
4. **Tunnel via SSH** - Send processed audio to local receiver
|
|
864
|
+
5. **Play on device** - Output to phone/laptop speakers
|
|
865
|
+
|
|
866
|
+
This allows **per-message customization** or **consistent agent branding** with unique audio signatures!
|
|
867
|
+
|
|
668
868
|
### 🔊 Remote SSH Audio
|
|
669
869
|
|
|
670
|
-
Perfect for running
|
|
870
|
+
Perfect for running OpenClaw on a remote server with audio on your local machine:
|
|
671
871
|
|
|
672
872
|
**Quick Setup:**
|
|
673
873
|
|
|
@@ -686,16 +886,17 @@ Host your-server
|
|
|
686
886
|
3. **Connect and test**:
|
|
687
887
|
```bash
|
|
688
888
|
ssh your-server
|
|
689
|
-
agentvibes speak "Testing remote audio from
|
|
889
|
+
agentvibes speak "Testing remote audio from OpenClaw"
|
|
690
890
|
```
|
|
691
891
|
|
|
692
892
|
Audio plays on your local speakers! 🔊
|
|
693
893
|
|
|
694
894
|
### 📚 Documentation
|
|
695
895
|
|
|
696
|
-
- **
|
|
697
|
-
- **
|
|
896
|
+
- **OpenClaw Skill**: [.clawdbot/README.md](.clawdbot/README.md)
|
|
897
|
+
- **OpenClaw Website**: https://openclaw.ai/
|
|
698
898
|
- **Remote Audio Setup**: [docs/remote-audio-setup.md](docs/remote-audio-setup.md)
|
|
899
|
+
- **Security Hardening**: [docs/security-hardening-guide.md](docs/security-hardening-guide.md) ⚠️
|
|
699
900
|
|
|
700
901
|
[↑ Back to top](#-table-of-contents)
|
|
701
902
|
|
package/package.json
CHANGED
|
@@ -1,7 +1,7 @@
|
|
|
1
1
|
{
|
|
2
2
|
"$schema": "https://json.schemastore.org/package.json",
|
|
3
3
|
"name": "agentvibes",
|
|
4
|
-
"version": "3.3.0-alpha.
|
|
4
|
+
"version": "3.3.0-alpha.6",
|
|
5
5
|
"description": "Now your AI Agents can finally talk back! Professional TTS voice for Claude Code, Claude Desktop (via MCP), and Clawdbot with multi-provider support.",
|
|
6
6
|
"homepage": "https://agentvibes.org",
|
|
7
7
|
"keywords": [
|