agentvibes 3.3.0-alpha.5 → 3.3.0-alpha.6

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (2) hide show
  1. package/README.md +224 -23
  2. package/package.json +1 -1
package/README.md CHANGED
@@ -4,14 +4,14 @@
4
4
  >
5
5
  > 🌐 **[agentvibes.org](https://agentvibes.org)**
6
6
  >
7
- > Professional text-to-speech for **Claude Code**, **Claude Desktop**, **Warp Terminal**, and **Clawdbot** - **Piper TTS (Free!)** or **macOS Say (Built-in!)**
7
+ > Professional text-to-speech for **Claude Code**, **Claude Desktop**, **Warp Terminal**, and **OpenClaw** - **Piper TTS (Free!)** or **macOS Say (Built-in!)**
8
8
 
9
9
  [![npm version](https://img.shields.io/npm/v/agentvibes)](https://www.npmjs.com/package/agentvibes)
10
10
  [![Test Suite](https://github.com/paulpreibisch/AgentVibes/actions/workflows/test.yml/badge.svg)](https://github.com/paulpreibisch/AgentVibes/actions/workflows/test.yml)
11
11
  [![Publish](https://github.com/paulpreibisch/AgentVibes/actions/workflows/publish.yml/badge.svg)](https://github.com/paulpreibisch/AgentVibes/actions/workflows/publish.yml)
12
12
  [![License](https://img.shields.io/badge/License-Apache_2.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)
13
13
 
14
- **Author**: Paul Preibisch ([@997Fire](https://x.com/997Fire)) | **Version**: v3.2.0
14
+ **Author**: Paul Preibisch ([@997Fire](https://x.com/997Fire)) | **Version**: v3.3.0-alpha.5
15
15
 
16
16
  ---
17
17
 
@@ -22,7 +22,7 @@
22
22
  | **Install AgentVibes** (just `npx`, no git!) | [Quick Start Guide](docs/quick-start.md) |
23
23
  | **Understand what I need** (spoiler: just Node.js!) | [Prerequisites](#-prerequisites) |
24
24
  | **Set up on Windows (Claude Desktop)** | [Windows Setup Guide](mcp-server/WINDOWS_SETUP.md) |
25
- | **Use with Clawdbot** | [Clawdbot Integration](#-clawdbot-integration) |
25
+ | **Use with OpenClaw** | [OpenClaw Integration](#-openclaw-integration) |
26
26
  | **Run Claude Code on Android** | [Android/Termux Setup](#-android--termux) |
27
27
  | **Use natural language** | [MCP Setup](docs/mcp-setup.md) |
28
28
  | **Switch voices** | [Voice Library](docs/voice-library.md) |
@@ -35,7 +35,7 @@
35
35
 
36
36
  **AgentVibes adds lively voice narration to your Claude AI sessions!**
37
37
 
38
- Whether you're coding in Claude Code, chatting in Claude Desktop, using Warp Terminal, or running Clawdbot - AgentVibes brings AI to life with professional voices and personalities.
38
+ Whether you're coding in Claude Code, chatting in Claude Desktop, using Warp Terminal, or running OpenClaw - AgentVibes brings AI to life with professional voices and personalities.
39
39
 
40
40
  ### 🎯 Key Features
41
41
 
@@ -101,6 +101,7 @@ All 50+ Piper voices AgentVibes provides are sourced from Hugging Face's open-so
101
101
  - [💡 Common Workflows](#-common-workflows) - Quick examples
102
102
  - [🔧 Advanced Features](#-advanced-features) - Custom voices & personalities
103
103
  - [🔊 Remote Audio Setup](#-remote-audio-setup) - Play TTS from remote servers
104
+ - [🔐 Security Hardening Guide](docs/security-hardening-guide.md) - **Required for production**: SSH, Fail2Ban, Tailscale, UFW, AIDE
104
105
  - [🔬 Technical Deep Dive](docs/technical-deep-dive.md) - How AgentVibes works under the hood
105
106
  - [❓ Troubleshooting](#-troubleshooting) - Common issues & fixes
106
107
 
@@ -117,17 +118,26 @@ All 50+ Piper voices AgentVibes provides are sourced from Hugging Face's open-so
117
118
 
118
119
  ## 📰 Latest Release
119
120
 
120
- **[v3.2.0 - Clawdbot Integration: AI Assistants on Any Messenger](https://github.com/paulpreibisch/AgentVibes/releases/tag/v3.2.0)** 🤖
121
+ **[v3.3.0-alpha.5 - Remote Audio Revolution: Voiceless Servers Phone & Mobile Playback](https://github.com/paulpreibisch/AgentVibes/releases/tag/v3.3.0-alpha.5)** 📱🔊
121
122
 
122
- AgentVibes v3.2.0 introduces seamless integration with Clawdbot, the revolutionary AI assistant accessible via any instant messenger. With this release, Clawdbot users get professional TTS with 50+ voices, remote SSH audio support for server deployments, and zero-configuration setupjust install AgentVibes and the Clawdbot skill is ready. The release also includes SonarCloud workflow fixes and improved test coverage validation.
123
+ AgentVibes v3.3.0-alpha.5 brings breakthrough remote audio capabilities, turning your messaging apps into voice-enabled AI assistants! **Use Case:** Install [OpenClaw](https://openclaw.ai/) on a remote server, message it via Telegram or WhatsApp from anywhere, and AgentVibes (running in Termux on your phone) plays the TTS responses through your phone speakersmaking it work like Siri, but powered by AgentVibes! This release enables voiceless servers to play audio remotely on phones, mobile devices, or any machine via SSH/PulseAudio tunneling. Also includes audio tracks directory structure fix and comprehensive OpenClaw skill documentation.
123
124
 
124
125
  **Key Highlights:**
125
- - 🤖 **Clawdbot Integration** - Native TTS support for Clawdbot AI assistant framework
126
- - 💬 **Messenger Platforms** - Works with WhatsApp, Telegram, Discord via Clawdbot
127
- - 🔊 **Remote SSH Audio** - Perfect for Clawdbot on remote servers with PulseAudio tunneling
128
- - 📦 **Simple Install** - Just `npx agentvibes install` and it works
129
- - 🛡️ **SonarCloud Fixes** - Quality gate workflow improvements and documentation
130
- - **Full Test Coverage** - All 213 BATS + 47 Node tests passing
126
+ - 📱 **Voiceless Server Support** - Generate TTS on servers without audio hardware, play on remote devices
127
+ - 🔊 **Phone/Mobile Playback** - Audio tunnels from cloud servers to your phone or local machine via SSH
128
+ - 🌐 **PulseAudio SSH Tunneling** - Automatic audio routing through SSH reverse forwarding (port 14713)
129
+ - 🤖 **Enhanced OpenClaw Integration** - Complete skill documentation with 50+ voices and remote audio setup
130
+ - 🎵 **Audio Tracks Fix** - Corrected directory structure (backgrounds tracks) with proper .npmignore entries
131
+ - 📦 **Package Size Optimization** - 8.3 MB unpacked, 172 files, optimized for npm distribution
132
+ - 🛡️ **Security & Quality** - Removed sensitive data from git history, SonarCloud compliance
133
+
134
+ **Perfect For:**
135
+ - Running AgentVibes/OpenClaw on AWS, GCP, Azure, DigitalOcean
136
+ - VS Code Remote SSH development with TTS feedback
137
+ - Android/Termux with audio playback on phone speakers
138
+ - Any headless server → local audio scenario
139
+
140
+ 💡 **Alpha Testing:** This is a pre-release version. Report issues at [GitHub Issues](https://github.com/paulpreibisch/AgentVibes/issues)
131
141
 
132
142
  💡 **Tip:** If `npx agentvibes` shows an older version or missing commands, clear your npm cache: `npm cache clean --force && npx agentvibes@latest --help`
133
143
 
@@ -626,15 +636,17 @@ This design means **any TTS provider** can integrate with BMAD by replacing thes
626
636
 
627
637
  ---
628
638
 
629
- ## 🤖 Clawdbot Integration
639
+ ## 🤖 OpenClaw Integration
630
640
 
631
- **Use AgentVibes TTS with Clawdbot - the revolutionary AI assistant you can access via any instant messenger!**
641
+ **Use AgentVibes TTS with OpenClaw - the revolutionary AI assistant you can access via any instant messenger!**
632
642
 
633
- **What is Clawdbot?** [Clawdbot](https://clawd.bot) is a revolutionary AI assistant that brings Claude AI to your favorite messaging platforms - WhatsApp, Telegram, Discord, and more. No apps to install, no websites to visit - just message your AI assistant like you would a friend.
643
+ **What is OpenClaw?** [OpenClaw](https://openclaw.ai/) is a revolutionary AI assistant that brings Claude AI to your favorite messaging platforms - WhatsApp, Telegram, Discord, and more. No apps to install, no websites to visit - just message your AI assistant like you would a friend.
634
644
 
635
- 🌐 **Website**: https://clawd.bot
645
+ 🌐 **Website**: https://openclaw.ai/
636
646
 
637
- AgentVibes seamlessly integrates with Clawdbot, providing professional text-to-speech for AI assistants running on messaging platforms and remote servers.
647
+ AgentVibes seamlessly integrates with OpenClaw, providing professional text-to-speech for AI assistants running on messaging platforms and remote servers.
648
+
649
+ ⚠️ **Security Notice**: Running OpenClaw on a remote server with SSH access exposes your infrastructure to security risks. See our **[Security Hardening Guide](docs/security-hardening-guide.md)** for comprehensive setup instructions covering SSH hardening, Fail2Ban, Tailscale VPN, UFW firewall, and intrusion detection.
638
650
 
639
651
  ### 🎯 Key Benefits
640
652
 
@@ -645,17 +657,115 @@ AgentVibes seamlessly integrates with Clawdbot, providing professional text-to-s
645
657
 
646
658
  ### 🚀 Installation
647
659
 
660
+ AgentVibes includes a ready-to-use OpenClaw skill that enables TTS on messaging platforms. The setup involves two components:
661
+
662
+ #### Component 1: OpenClaw Server (Remote)
663
+
664
+ Install AgentVibes on your OpenClaw server:
665
+
666
+ ```bash
667
+ # On your remote server where OpenClaw is running
668
+ npx agentvibes install
669
+ ```
670
+
671
+ The OpenClaw skill is **automatically included** in the AgentVibes npm package at `.clawdbot/skill/SKILL.md`.
672
+
673
+ **How to activate the skill in OpenClaw:**
674
+
675
+ 1. **Locate the skill** - After installing AgentVibes, the skill is at:
676
+ ```
677
+ node_modules/agentvibes/.clawdbot/skill/SKILL.md
678
+ ```
679
+
680
+ 2. **Link to OpenClaw skills directory** (if OpenClaw uses skills):
681
+ ```bash
682
+ # Example - adjust path based on your OpenClaw installation
683
+ ln -s $(npm root -g)/agentvibes/.clawdbot/skill/SKILL.md ~/.openclaw/skills/agentvibes.md
684
+ ```
685
+
686
+ 3. **OpenClaw auto-detection** - Many OpenClaw setups automatically detect AgentVibes when it's installed. Check your OpenClaw logs for:
687
+ ```
688
+ ✓ AgentVibes skill detected and loaded
689
+ ```
690
+
691
+ #### Component 2: AgentVibes Receiver (Local/Phone) ⚠️ REQUIRED
692
+
693
+ **CRITICAL: You MUST install AgentVibes on your phone (or local machine) to receive and play audio!**
694
+
695
+ Without this, audio cannot be heard - the server generates TTS but needs a receiver to play it.
696
+
697
+ **Install on Android Phone (Termux):**
698
+
699
+ 1. **Install Termux from F-Droid** (NOT Google Play):
700
+ - Download: https://f-droid.org/en/packages/com.termux/
701
+
702
+ 2. **Install Node.js in Termux:**
703
+ ```bash
704
+ pkg update && pkg upgrade
705
+ pkg install nodejs-lts
706
+ ```
707
+
708
+ 3. **Install AgentVibes in Termux:**
709
+ ```bash
710
+ npx agentvibes install
711
+ ```
712
+
713
+ 4. **Install Termux:API** (for audio playback):
714
+ - Download: https://f-droid.org/en/packages/com.termux.api/
715
+ - Then in Termux: `pkg install termux-api`
716
+
717
+ **Install on Local Mac/Linux:**
718
+
648
719
  ```bash
649
720
  npx agentvibes install
650
721
  ```
651
722
 
652
- That's it! AgentVibes is ready to use.
723
+ **Why is this needed?**
724
+ - The **server generates TTS** but has no speakers (headless)
725
+ - AgentVibes on your **phone acts as the audio receiver** via SSH tunnel
726
+ - Audio tunnels from server → SSH → phone → speakers 🔊
727
+
728
+ Without AgentVibes installed on the receiving device, you'll generate audio but hear nothing!
729
+
730
+ #### How It Works: Server → SSH Tunnel → Local Playback
731
+
732
+ ```
733
+ ┌─────────────────────────────────────────────────────────┐
734
+ │ 1. User messages OpenClaw via Telegram/WhatsApp │
735
+ │ "Tell me about the weather" │
736
+ └─────────────────────────────────────────────────────────┘
737
+
738
+ ┌─────────────────────────────────────────────────────────┐
739
+ │ 2. OpenClaw (Server) processes request with Claude │
740
+ │ AgentVibes skill generates TTS audio │
741
+ └─────────────────────────────────────────────────────────┘
742
+
743
+ ┌─────────────────────────────────────────────────────────┐
744
+ │ 3. Audio tunnels through SSH → PulseAudio (port 14713)│
745
+ │ Server: PULSE_SERVER=tcp:localhost:14713 │
746
+ └─────────────────────────────────────────────────────────┘
747
+
748
+ ┌─────────────────────────────────────────────────────────┐
749
+ │ 4. Local AgentVibes receives and plays audio │
750
+ │ Phone speakers, laptop speakers, etc. │
751
+ │ 🔊 "The weather is sunny and 72 degrees" │
752
+ └─────────────────────────────────────────────────────────┘
753
+ ```
754
+
755
+ **Architecture:**
756
+ - **Server (OpenClaw)**: Generates TTS, sends via PulseAudio
757
+ - **SSH Tunnel**: RemoteForward port 14713 (encrypted transport)
758
+ - **Local (Termux/Desktop)**: AgentVibes receives audio, plays on speakers
759
+
760
+ This creates a **Siri-like experience** - message from anywhere, hear responses on your phone! 📱🎤
653
761
 
654
762
  ### 📝 Usage
655
763
 
764
+ #### Basic TTS Commands
765
+
656
766
  ```bash
657
767
  # Basic TTS
658
- npx agentvibes speak "Hello from Clawdbot"
768
+ npx agentvibes speak "Hello from OpenClaw"
659
769
 
660
770
  # With different voices
661
771
  npx agentvibes speak "Hello" --voice en_US-amy-medium
@@ -665,9 +775,99 @@ npx agentvibes speak "Bonjour" --voice fr_FR-siwis-medium
665
775
  npx agentvibes voices
666
776
  ```
667
777
 
778
+ #### Advanced: Direct Hook Usage with Voice Override
779
+
780
+ For programmatic control, use the TTS hook directly:
781
+
782
+ ```bash
783
+ # Basic: Use default voice
784
+ bash ~/.claude/hooks/play-tts.sh "Hello from OpenClaw"
785
+
786
+ # Advanced: Override voice per message
787
+ bash ~/.claude/hooks/play-tts.sh "Welcome message" "en_US-amy-medium"
788
+ bash ~/.claude/hooks/play-tts.sh "Bonjour!" "fr_FR-siwis-medium"
789
+ bash ~/.claude/hooks/play-tts.sh "British greeting" "en_GB-alan-medium"
790
+ ```
791
+
792
+ **Parameters:**
793
+ - `$1` - **TEXT** (required): Message to speak
794
+ - `$2` - **VOICE** (optional): Voice name to override default
795
+
796
+ #### Audio Effects Configuration for OpenClaw
797
+
798
+ **File**: `.claude/config/audio-effects.cfg`
799
+
800
+ Customize audio effects, background music, and voice processing per agent or use default settings:
801
+
802
+ **Format:**
803
+ ```
804
+ AGENT_NAME|SOX_EFFECTS|BACKGROUND_FILE|BACKGROUND_VOLUME
805
+ ```
806
+
807
+ **Example Configuration:**
808
+
809
+ ```bash
810
+ # Default - subtle background music
811
+ default||agentvibes_soft_flamenco_loop.mp3|0.30
812
+
813
+ # Custom agent with reverb + background
814
+ MyAgent|reverb 40 50 90 gain -2|agentvibes_soft_flamenco_loop.mp3|0.20
815
+
816
+ # Agent with pitch shift and EQ
817
+ Assistant|pitch -100 equalizer 3000 1q +2|agentvibes_dark_chill_step_loop.mp3|0.15
818
+ ```
819
+
820
+ **Available SOX Effects:**
821
+
822
+ | Effect | Syntax | Example | Description |
823
+ |--------|--------|---------|-------------|
824
+ | **Reverb** | `reverb <reverberance> <HF-damping> <room-scale>` | `reverb 40 50 90` | Adds room ambiance (light: 30 40 70, heavy: 50 60 100) |
825
+ | **Pitch** | `pitch <cents>` | `pitch -100` | Shift pitch (100 cents = 1 semitone, negative = lower) |
826
+ | **Equalizer** | `equalizer <freq> <width>q <gain-dB>` | `equalizer 3000 1q +2` | Boost/cut frequencies (bass: 200Hz, treble: 4000Hz) |
827
+ | **Gain** | `gain <dB>` | `gain -2` | Adjust volume (negative = quieter, positive = louder) |
828
+ | **Compand** | `compand <attack,decay> <threshold:in,out>` | `compand 0.3,1 6:-70,-60,-20` | Dynamic range compression (makes quiet parts louder) |
829
+
830
+ **Background Music Tracks:**
831
+
832
+ Built-in tracks available in `.claude/audio/tracks/`:
833
+ - `agentvibes_soft_flamenco_loop.mp3` - Warm, rhythmic flamenco
834
+ - `agentvibes_dark_chill_step_loop.mp3` - Modern chill electronic
835
+ - (50+ additional tracks available)
836
+
837
+ **Background Volume:**
838
+ - `0.10` - Very subtle (10%)
839
+ - `0.20` - Subtle (20%)
840
+ - `0.30` - Moderate (30%, recommended default)
841
+ - `0.40` - Noticeable (40%, party mode)
842
+
843
+ **Example: OpenClaw Custom Configuration**
844
+
845
+ Create `.claude/config/audio-effects.cfg` on your OpenClaw server:
846
+
847
+ ```bash
848
+ # OpenClaw assistant - warm voice with subtle reverb
849
+ OpenClaw|reverb 30 40 70 gain -1|agentvibes_soft_flamenco_loop.mp3|0.25
850
+
851
+ # Help desk agent - clear, bright voice
852
+ HelpDesk|equalizer 4000 1q +3 compand 0.2,0.5 6:-70,-60,-20|agentvibes_dark_chill_step_loop.mp3|0.15
853
+
854
+ # Default fallback
855
+ default||agentvibes_soft_flamenco_loop.mp3|0.30
856
+ ```
857
+
858
+ **How AgentVibes Applies Effects:**
859
+
860
+ 1. **Generate TTS** - Create base audio with Piper TTS
861
+ 2. **Apply SOX effects** - Process audio (reverb, EQ, pitch, etc.)
862
+ 3. **Mix background** - Blend background music at specified volume
863
+ 4. **Tunnel via SSH** - Send processed audio to local receiver
864
+ 5. **Play on device** - Output to phone/laptop speakers
865
+
866
+ This allows **per-message customization** or **consistent agent branding** with unique audio signatures!
867
+
668
868
  ### 🔊 Remote SSH Audio
669
869
 
670
- Perfect for running Clawdbot on a remote server with audio on your local machine:
870
+ Perfect for running OpenClaw on a remote server with audio on your local machine:
671
871
 
672
872
  **Quick Setup:**
673
873
 
@@ -686,16 +886,17 @@ Host your-server
686
886
  3. **Connect and test**:
687
887
  ```bash
688
888
  ssh your-server
689
- agentvibes speak "Testing remote audio from Clawdbot"
889
+ agentvibes speak "Testing remote audio from OpenClaw"
690
890
  ```
691
891
 
692
892
  Audio plays on your local speakers! 🔊
693
893
 
694
894
  ### 📚 Documentation
695
895
 
696
- - **Clawdbot Skill**: [.clawdbot/README.md](.clawdbot/README.md)
697
- - **Clawdbot Website**: https://clawd.bot
896
+ - **OpenClaw Skill**: [.clawdbot/README.md](.clawdbot/README.md)
897
+ - **OpenClaw Website**: https://openclaw.ai/
698
898
  - **Remote Audio Setup**: [docs/remote-audio-setup.md](docs/remote-audio-setup.md)
899
+ - **Security Hardening**: [docs/security-hardening-guide.md](docs/security-hardening-guide.md) ⚠️
699
900
 
700
901
  [↑ Back to top](#-table-of-contents)
701
902
 
package/package.json CHANGED
@@ -1,7 +1,7 @@
1
1
  {
2
2
  "$schema": "https://json.schemastore.org/package.json",
3
3
  "name": "agentvibes",
4
- "version": "3.3.0-alpha.5",
4
+ "version": "3.3.0-alpha.6",
5
5
  "description": "Now your AI Agents can finally talk back! Professional TTS voice for Claude Code, Claude Desktop (via MCP), and Clawdbot with multi-provider support.",
6
6
  "homepage": "https://agentvibes.org",
7
7
  "keywords": [