npm - agentvibes - Versions diffs - 3.3.0 → 3.4.1 - Mend

agentvibes 3.3.0 → 3.4.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (22)

package/.claude/config/audio-effects.cfg +1 -1
package/.claude/config/background-music-position.txt +1 -26
package/.claude/github-star-reminder.txt +1 -1
package/.claude/hooks/audio-cache-utils.sh +0 -0
package/.claude/hooks/clawdbot-receiver-SECURE.sh +0 -0
package/.claude/hooks/play-tts-soprano.sh +320 -0
package/.claude/hooks/play-tts.sh +6 -0
package/.claude/hooks/provider-manager.sh +17 -0
package/.claude/hooks/soprano-gradio-synth.py +139 -0
package/.claude/piper-voices-dir.txt +1 -1
package/.mcp.json +6 -28
package/README.md +27 -17
package/RELEASE_NOTES.md +199 -1
package/package.json +1 -1
package/src/installer.js +339 -47
package/.claude/config/background-music-volume.txt +0 -1
package/.claude/config/background-music.cfg +0 -1
package/.claude/config/background-music.txt +0 -1
package/.claude/config/tts-speech-rate.txt +0 -1
package/.claude/config/tts-verbosity.txt +0 -1
package/.claude/hooks/bmad-party-manager.sh +0 -225
package/.claude/hooks/stop.sh +0 -38

package/README.md CHANGED

@@ -11,7 +11,7 @@
 [![Publish](https://github.com/paulpreibisch/AgentVibes/actions/workflows/publish.yml/badge.svg)](https://github.com/paulpreibisch/AgentVibes/actions/workflows/publish.yml)
 [![License](https://img.shields.io/badge/License-Apache_2.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)
-**Author**: Paul Preibisch ([@997Fire](https://x.com/997Fire)) | **Version**: v3.3.0
+**Author**: Paul Preibisch ([@997Fire](https://x.com/997Fire)) | **Version**: v3.4.0
 ---
@@ -39,9 +39,15 @@ Whether you're coding in Claude Code, chatting in Claude Desktop, using Warp Ter
 ### 🎯 Key Features
+**⚡ NEW IN v3.4.0:**
+- 🎤 **Soprano TTS Provider** - Ultra-fast neural TTS with 20x CPU, 2000x GPU acceleration (thanks [@nathanchase](https://github.com/nathanchase)!)
+- 🛡️ **Security Hardening** - 9.5/10 score with comprehensive validation and timeouts
+- 🌐 **Environment Intelligence** - PulseAudio tunnel auto-detection for SSH scenarios
+- 🎯 **Smart Recommendations** - GPU/RAM-based provider suggestions in installer
 **✨ NEW IN v3.3.0:**
-- 📱 **AgentVibes Receiver - NEW!** - Stream TTS from voiceless servers to your phone, laptop, or local machine via encrypted SSH tunnel
-- 🌐 **Voiceless Server Support - NEW!** - Generate TTS on cloud servers (AWS, GCP, Azure) and play on any device with speakers
+- 📱 **AgentVibes Receiver** - Stream TTS from voiceless servers to your phone, laptop, or local machine via encrypted SSH tunnel
+- 🌐 **Voiceless Server Support** - Generate TTS on cloud servers (AWS, GCP, Azure) and play on any device with speakers
 **⚡ Core Features:**
 - ⚡ **One-Command Install** - Get started in 30 seconds (`npx agentvibes install`)
@@ -88,7 +94,7 @@ All 50+ Piper voices AgentVibes provides are sourced from Hugging Face's open-so
 - [📱 Android/Termux](#-quick-setup-android--termux-claude-code-on-your-phone) - Run Claude Code on your phone
 - [📋 Prerequisites](#-prerequisites) - What you actually need (Node.js + optional tools)
 - [✨ What is AgentVibes?](#-what-is-agentvibes) - Overview & key features
-- [📰 Latest Release](#-latest-release) - v3.3.0 - Remote Audio Revolution
+- [📰 Latest Release](#-latest-release) - v3.4.0 - Soprano TTS & Security Hardening
 - [🪟 Windows Setup Guide for Claude Desktop](mcp-server/WINDOWS_SETUP.md) - Complete Windows installation with WSL & Python
 ### AgentVibes MCP (Natural Language Control)
@@ -132,24 +138,25 @@ All 50+ Piper voices AgentVibes provides are sourced from Hugging Face's open-so
 ## 📰 Latest Release
-**[Remote Audio Revolution: Voiceless Servers → Phone & Mobile Playback](https://github.com/paulpreibisch/AgentVibes/releases/tag/v3.3.0)** 📱🔊
+**[v3.4.0 - Soprano TTS, Security Hardening & Environment Intelligence](https://github.com/paulpreibisch/AgentVibes/releases/tag/v3.4.0)** ⚡🛡️
-AgentVibes v3.3.0 brings breakthrough remote audio capabilities, turning your messaging apps into voice-enabled AI assistants! **Use Case:** Install [OpenClaw](https://openclaw.ai/) on a remote server, message it via Telegram or WhatsApp from anywhere, and AgentVibes (running in Termux on your phone) plays the TTS responses through your phone speakers—making it work like Siri, but powered by AgentVibes! This release enables voiceless servers to play audio remotely on phones, mobile devices, or any machine via SSH/PulseAudio tunneling. Also includes audio tracks directory structure fix and comprehensive OpenClaw skill documentation.
+AgentVibes v3.4.0 introduces Soprano TTS - an 80M parameter neural provider offering 20x CPU and 2000x GPU acceleration with sub-1GB memory footprint - plus comprehensive security hardening (timeouts, bounds checking, NaN validation) achieving a 9.5/10 security score, and intelligent environment detection that recognizes PulseAudio tunnels as working audio for remote scenarios. The enhanced installer provides GPU-based provider recommendations and context-aware messaging. Special thanks to [@nathanchase](https://github.com/nathanchase) for contributing the Soprano TTS Provider integration!
 **Key Highlights:**
-- 📱 **Voiceless Server Support** - Generate TTS on servers without audio hardware, play on remote devices
-- 🔊 **Phone/Mobile Playback** - Audio tunnels from cloud servers to your phone or local machine via SSH
-- 🌐 **PulseAudio SSH Tunneling** - Automatic audio routing through SSH reverse forwarding (port 14713)
-- 🤖 **Enhanced OpenClaw Integration** - Complete skill documentation with 50+ voices and remote audio setup
-- 🎵 **Audio Tracks Fix** - Corrected directory structure (backgrounds → tracks) with proper .npmignore entries
-- 📦 **Package Size Optimization** - 8.3 MB unpacked, 172 files, optimized for npm distribution
-- 🛡️ **Security & Quality** - Removed sensitive data from git history, SonarCloud compliance
+- ⚡ **Soprano TTS Provider** - Ultra-fast neural TTS with 20x CPU, 2000x GPU acceleration (thanks @nathanchase!)
+- 🛡️ **Security Hardening** - 9.5/10 score with timeouts on system commands and comprehensive validation
+- 🌐 **Environment Intelligence** - PulseAudio tunnel auto-detection for SSH + tunnel scenarios
+- 🎯 **Smart Recommendations** - GPU/RAM-based provider suggestions (Soprano for CUDA, macOS Say for Apple)
+- 📱 **Provider-Aware Voice Pages** - Soprano shows model specs, auto-selects single voice
+- 🧪 **260/260 Tests Passing** - Complete suite coverage with all edge cases fixed
+- 🎨 **Better UX** - Context-aware messaging ("PulseAudio Tunnel Detected!" vs "speakers")
 **Perfect For:**
-- Running AgentVibes/OpenClaw on AWS, GCP, Azure, DigitalOcean
-- VS Code Remote SSH development with TTS feedback
-- Android/Termux with audio playback on phone speakers
-- Any headless server → local audio scenario
+- GPU users wanting ultra-fast TTS (2000x real-time with CUDA)
+- Low-RAM systems (<1GB memory footprint with Soprano)
+- SSH sessions with PulseAudio tunnels (auto-detected)
+- Production deployments requiring security hardening
+- Any environment needing intelligent provider selection
 💡 **Tip:** If `npx agentvibes` shows an older version or missing commands, clear your npm cache: `npm cache clean --force && npx agentvibes@latest --help`
@@ -1452,6 +1459,9 @@ Both do the exact same thing - MCP is more convenient, slash commands are more t
 - [Claude Code](https://claude.com/claude-code) - AI coding assistant
 - Licensed under Apache 2.0
+**Contributors:**
+- 🎤 [@nathanchase](https://github.com/nathanchase) - Soprano TTS Provider integration (PR #95) - Ultra-fast neural TTS with GPU acceleration
 **Special Thanks:**
 - 💡 [Claude Code Hooks Mastery](https://github.com/disler/claude-code-hooks-mastery) by [@disler](https://github.com/disler) - Hooks inspiration
 - 🤖 [BMAD METHOD](https://github.com/bmad-code-org/BMAD-METHOD) - Multi-agent framework with auto voice switching integration
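
Reviewer note on the README hunks above: the "PulseAudio tunnel auto-detection" bullet is easier to evaluate with a concrete picture of the check. The following is a minimal sketch, not the code shipped in `src/installer.js` (which this diff does not show). The function bodies, the `classifyEnvironment` and `audioMessage` names, and the `hasLocalAudio` / `isTermux` inputs are assumptions for illustration. The `PULSE_SERVER` variable, the `tcp:` prefix, the DESKTOP / VOICELESS / PHONE labels, and the two message strings come from the RELEASE_NOTES.md changes in this same diff.

```javascript
// Hypothetical sketch; names and signatures are assumptions, not the package's code.

// True when PULSE_SERVER names a TCP endpoint, e.g. "tcp:localhost:14713".
// The comparison is case-insensitive so "TCP:" and "tcp:" both match.
function hasPulseAudioTunnel(env = process.env) {
  const server = env.PULSE_SERVER;
  return typeof server === 'string' && server.trim().toLowerCase().startsWith('tcp:');
}

// Classify the environment with the three labels from the release notes.
// hasLocalAudio and isTermux are assumed to be detected elsewhere.
function classifyEnvironment({ hasLocalAudio, isTermux, env = process.env }) {
  if (isTermux) return 'PHONE';
  if (hasLocalAudio || hasPulseAudioTunnel(env)) return 'DESKTOP';
  return 'VOICELESS';
}

// Pick the context-aware message: the tunnel wording wins when a tunnel is present.
function audioMessage({ hasLocalAudio, env = process.env }) {
  if (hasPulseAudioTunnel(env)) return '🌐 PulseAudio Tunnel Detected!';
  if (hasLocalAudio) return '🔊 Audio Output Detected!';
  return null;
}
```

Passing `env` as a parameter instead of reading `process.env` directly keeps the check easy to exercise in tests without mutating the real environment.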

package/RELEASE_NOTES.md CHANGED

@@ -1,4 +1,202 @@
-# AgentVibes Release Notes
+# AgentVibes v3.4.0 Release Notes - DRAFT
+## 📦 v3.4.0 - Soprano TTS, Security Hardening & Environment Intelligence
+**Release Date:** February 10, 2026
+### 🎯 Why v3.4.0?
+v3.4.0 introduces **Soprano TTS** - an ultra-fast neural TTS provider with GPU acceleration, comprehensive **security hardening** across the codebase, and **intelligent environment detection** that recognizes PulseAudio tunnels for remote audio scenarios.
+### 🚀 Key Highlights
+#### ⚡ Soprano TTS Provider (NEW!)
+- **80M parameter neural model** with premium female English voice
+- **20x CPU speed** (vs Piper), **2000x GPU speed** with CUDA
+- **3 synthesis modes**: WebUI (Gradio), API (OpenAI-compatible), CLI (fallback)
+- **Auto-detection**: Checks for running Gradio server, falls back gracefully
+- **<1GB memory footprint** - perfect for low-RAM systems
+- **Provider-aware voice management**: Auto-selects single voice, shows model specs
+- **Thanks to [@nathanchase](https://github.com/nathanchase)** for this contribution! ([see acknowledgments](#-acknowledgments))
+#### 🛡️ Security Hardening (9.5/10 Score)
+- **Timeouts on system commands**: Prevents installer hangs (nvidia-smi, sysctl, meminfo)
+- **Bounds checking**: Validates array access before parsing system output
+- **NaN validation**: Prevents crashes from malformed memory/GPU detection
+- **Case-insensitive checks**: PulseAudio tunnel detection handles TCP: and tcp:
+- **Code duplication eliminated**: Extracted PulseAudio helper function (DRY)
+#### 🌐 Environment Intelligence
+- **PulseAudio tunnel detection**: Recognizes `PULSE_SERVER=tcp:*` as working audio
+- **Context-aware messaging**:
+  - "🌐 PulseAudio Tunnel Detected!" for SSH + tunnel setups
+  - "🔊 Audio Output Detected!" for local speakers
+  - Distinguishes local/tunnel/hybrid configurations
+- **Smart environment classification**:
+  - DESKTOP: Local audio OR active PulseAudio tunnel
+  - VOICELESS: No audio AND no tunnel
+  - PHONE: Termux/Android devices
+#### 🎤 Installer Enhancements
+- **Provider-aware voice pages**: Soprano shows model specs, Piper shows 50+ voices
+- **Auto-selection logic**: Soprano (1 voice) auto-selects, no manual choice needed
+- **GPU-based recommendations**: "Your GPU will run Soprano 2000x faster!"
+- **RAM-based suggestions**: Low memory systems see "Soprano uses <1GB" message
+- **Better RAM display**: Shows "512MB" instead of "0GB" for sub-1GB systems
+### 🤖 AI Summary
+AgentVibes v3.4.0 brings Soprano TTS - an 80M parameter neural provider offering 20x CPU and 2000x GPU acceleration with sub-1GB memory footprint - plus comprehensive security hardening (timeouts, bounds checking, NaN validation) and intelligent environment detection that recognizes PulseAudio tunnels as working audio for remote scenarios. The enhanced installer provides context-aware messaging distinguishing local speakers from SSH tunnels, GPU-based provider recommendations (Soprano for CUDA users, macOS Say for Apple, Piper for versatility), and provider-specific voice pages that auto-select Soprano's single voice while showcasing model specifications. This release achieves a 9.5/10 security score through systematic defensive programming, making AgentVibes production-ready for enterprise deployments while expanding TTS provider options for diverse hardware configurations.
+---
+## ✨ New Features
+### Soprano TTS Provider
+- Add Soprano TTS provider script with 3 synthesis modes (WebUI, API, CLI) (#95)
+- Integrate Soprano into TTS router and provider manager
+- Add soprano-gradio-synth.py helper for WebUI/SSE protocol
+- Provider-aware voice selection page with model specifications
+- Auto-select single Soprano voice with performance details
+### Installer Intelligence
+- Add `detectSystemCapabilities()` for GPU/RAM detection
+- Add `hasPulseAudioTunnel()` helper function
+- Context-aware audio detection messaging (tunnel vs local)
+- GPU-based provider ordering (Soprano first for CUDA users)
+- RAM-based recommendations (<4GB systems see Soprano first)
+- Provider-specific intro messages (Soprano vs Piper vs macOS)
+### Environment Detection
+- PulseAudio tunnel recognition via PULSE_SERVER env var
+- Case-insensitive TCP protocol detection
+- Smart DESKTOP classification (local audio OR tunnel)
+- Improved VOICELESS detection (no audio AND no tunnel)
+---
+## 🐛 Bug Fixes
+### Security Fixes
+- Add 5s timeout to nvidia-smi to prevent GPU detection hangs
+- Add 3s timeout to sysctl/meminfo to prevent memory detection hangs
+- Add bounds checking before parsing sysctl output (macOS)
+- Add bounds checking before parsing /proc/meminfo (Linux)
+- Add NaN validation for parseInt() memory size parsing
+- Fix case sensitivity in PULSE_SERVER detection (handles TCP: and tcp:)
+### Test Fixes
+- Fix provider-manager test #90: Add soprano and ssh-remote to cleanup list
+- Ensure zero-provider edge case properly simulates empty state
+### User Experience
+- Fix RAM display for <1GB systems (show "512MB" not "0GB")
+- Fix PulseAudio selection triggering wrong setup flow
+- Separate PulseAudio tunnel setup from SSH receiver setup
+---
+## 🏗️ Improvements
+### Code Quality
+- Extract PulseAudio detection to helper function (DRY principle)
+- Implement system capabilities caching (eliminates duplicate calls)
+- Add comprehensive error handling in detectSystemCapabilities()
+- Improve code comments for security-critical sections
+### Performance
+- Cache system detection results (prevents duplicate nvidia-smi calls)
+- Add timeouts to prevent indefinite hangs
+- Optimize provider detection with early returns
+### Documentation
+- Add comprehensive commit message documenting all changes
+- Document security improvements (timeouts, bounds checking, NaN validation)
+- Explain PulseAudio tunnel detection architecture
+- Detail environment classification logic
+---
+## 📊 Statistics
+- **91 commits** since v3.3.0
+- **817 lines added** in merge to master
+- **6 files modified** in core integration
+- **260 tests passing** (213 BATS + 47 Node)
+- **Security score**: 7.5/10 → 9.5/10
+- **Test coverage**: 100% pass rate
+---
+## 🔧 Technical Details
+### Files Modified
+- `src/installer.js`: +335 lines (security fixes, environment detection, Soprano integration)
+- `test/unit/provider-manager.bats`: +4 lines (fix edge case test)
+- `.claude/hooks/play-tts-soprano.sh`: +320 lines (new provider)
+- `.claude/hooks/soprano-gradio-synth.py`: +139 lines (new helper)
+- `.claude/hooks/provider-manager.sh`: +17 lines (Soprano support)
+- `.claude/hooks/play-tts.sh`: +6 lines (route to Soprano)
+### Breaking Changes
+None - all changes are backward compatible.
+### Dependencies
+- **New**: `soprano-tts` (Python package, optional)
+- **Recommended**: CUDA-capable GPU for 2000x speedup (optional)
+- **Compatible**: Works on CPU-only systems (20x vs Piper)
+---
+## 🎓 Migration Notes
+### For New Users
+1. Run `npx agentvibes install`
+2. Installer auto-detects your hardware (GPU, RAM, platform)
+3. Soprano appears as option if you have working audio
+4. Select Soprano for ultra-fast TTS with GPU acceleration
+### For Existing Users
+1. Update: `npx agentvibes update`
+2. Switch provider: `/agent-vibes:provider switch soprano`
+3. Test: `/agent-vibes:sample soprano-default`
+4. Optionally install soprano-tts: `pip install soprano-tts`
+### PulseAudio Tunnel Users
+- Installer now auto-detects your tunnel configuration
+- Shows "🌐 PulseAudio Tunnel Detected!" instead of "speakers"
+- Provides DESKTOP mode options (Soprano, Piper, macOS Say)
+- No manual configuration needed
+---
+## 🙏 Acknowledgments
+### Special Thanks
+**🎉 [@nathanchase](https://github.com/nathanchase)** - For contributing the Soprano TTS Provider integration (PR #95)! Nathan's work brings ultra-fast neural TTS with GPU acceleration to AgentVibes, offering 20x CPU and 2000x GPU performance improvements. The comprehensive integration includes WebUI, API, and CLI synthesis modes with intelligent auto-detection and graceful fallback. Thank you for this outstanding contribution! 🚀
+### Quality Assurance
+- **Security Review**: Adversarial code review achieved 9.5/10 score
+- **Testing**: All 260 tests pass (100% suite coverage)
+- **Quality Gates**: All Sonar requirements validated
+- **Co-Authored-By**: Claude Sonnet 4.5
+---
+## 📚 Additional Resources
+- [Soprano TTS Documentation](https://github.com/paulpreibisch/AgentVibes/blob/master/docs/providers.md#soprano-tts)
+- [PulseAudio Tunnel Setup](https://github.com/paulpreibisch/AgentVibes/blob/master/docs/SSH_REMOTE_SETUP.md)
+- [Security Hardening Guide](https://github.com/paulpreibisch/AgentVibes/blob/master/docs/security-hardening-guide.md)
+- [Provider Comparison](https://github.com/paulpreibisch/AgentVibes/blob/master/docs/providers.md)
+---
+**Full Changelog**: https://github.com/paulpreibisch/AgentVibes/compare/v3.3.0...v3.4.0
+---
 ## 📦 v3.3.0 - Remote TTS, Smart Installer, OpenClaw Receiver & Cache Management
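
Reviewer note on the "Security Fixes" and "User Experience" entries above: the release notes describe timeouts on system commands, bounds checking before parsing `/proc/meminfo`, NaN validation after `parseInt()`, and the "512MB" versus "0GB" display fix, but the `src/installer.js` hunk is not shown in this diff. The sketch below shows one way those four behaviors could fit together. The helper names `runWithTimeout`, `parseMemTotalMb`, and `formatRam` are hypothetical and are not taken from the package. Only `execFileSync` and its `timeout` option are real Node.js APIs.

```javascript
const { execFileSync } = require('node:child_process');

// Hypothetical helper: run a command with a hard timeout.
// Returns trimmed stdout, or null on a missing binary, non-zero exit, or timeout.
function runWithTimeout(cmd, args, timeoutMs) {
  try {
    return execFileSync(cmd, args, {
      timeout: timeoutMs,
      encoding: 'utf8',
      stdio: ['ignore', 'pipe', 'ignore'],
    }).trim();
  } catch {
    return null;
  }
}

// Hypothetical helper: extract total memory in MB from /proc/meminfo text.
// Returns null instead of throwing when the input is malformed.
function parseMemTotalMb(meminfoText) {
  if (typeof meminfoText !== 'string') return null;
  const line = meminfoText.split('\n').find((l) => l.startsWith('MemTotal:'));
  if (!line) return null;
  const fields = line.trim().split(/\s+/); // e.g. ["MemTotal:", "524288", "kB"]
  if (fields.length < 2) return null; // bounds check before indexing
  const kb = parseInt(fields[1], 10);
  if (Number.isNaN(kb) || kb <= 0) return null; // NaN validation
  return Math.round(kb / 1024);
}

// Hypothetical helper: show sub-1GB systems in MB so 512 MB never renders as "0GB".
function formatRam(mb) {
  if (mb === null || mb === undefined) return 'unknown';
  return mb < 1024 ? `${mb}MB` : `${Math.round(mb / 1024)}GB`;
}

// Example use with the 5s GPU-detection timeout mentioned in the release notes:
//   const gpu = runWithTimeout('nvidia-smi', ['--query-gpu=name', '--format=csv,noheader'], 5000);
//   const hasGpu = gpu !== null && gpu.length > 0;
```

Returning `null` on every failure path lets the caller treat "no GPU", "command hung", and "unparseable output" the same way, which matches the release notes' goal of the installer never hanging or crashing on detection.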

package/package.json CHANGED

@@ -1,7 +1,7 @@
 {
   "$schema": "https://json.schemastore.org/package.json",
   "name": "agentvibes",
-  "version": "3.3.0",
+  "version": "3.4.1",
   "description": "Now your AI Agents can finally talk back! Professional TTS voice for Claude Code, Claude Desktop (via MCP), and Clawdbot with multi-provider support.",
   "homepage": "https://agentvibes.org",
   "keywords": [