npm - opencode-smart-voice-notify - Versions diffs - 1.0.7 → 1.0.9 - Mend

opencode-smart-voice-notify 1.0.7 → 1.0.9

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (6) hide show

package/README.md CHANGED Viewed

@@ -1,16 +1,22 @@
-# OpenCode Smart Voice Notify
-> **Disclaimer**: This project is not built by the OpenCode team and is not affiliated with [OpenCode](https://opencode.ai) in any way. It is an independent community plugin.
-A smart voice notification plugin for [OpenCode](https://opencode.ai) with **multiple TTS engines** and an intelligent reminder system.
-## Features
-### Smart TTS Engine Selection
-The plugin automatically tries multiple TTS engines in order, falling back if one fails:
+<!-- Dynamic Header -->
+<img width="100%" src="https://capsule-render.vercel.app/api?type=waving&color=0:667eea,100:764ba2&height=120&section=header"/>
+# OpenCode Smart Voice Notify
+> **Disclaimer**: This project is not built by the OpenCode team and is not affiliated with [OpenCode](https://opencode.ai) in any way. It is an independent community plugin.
+A smart voice notification plugin for [OpenCode](https://opencode.ai) with **multiple TTS engines** and an intelligent reminder system.
+<img width="1456" height="720" alt="image" src="https://github.com/user-attachments/assets/52ccf357-2548-400b-a346-6362f2fc3180" />
+## Features
+### Smart TTS Engine Selection
+The plugin automatically tries multiple TTS engines in order, falling back if one fails:
 1. **ElevenLabs** (Online) - High-quality, anime-like voices with natural expression
-2. **Edge TTS** (Free) - Microsoft's neural voices, no API key required
+2. **Edge TTS** (Free) - Microsoft's neural voices, native Node.js implementation (no Python required)
 3. **Windows SAPI** (Offline) - Built-in Windows speech synthesis
 4. **Local Sound Files** (Fallback) - Plays bundled MP3 files if all TTS fails
@@ -25,185 +31,187 @@ The plugin automatically tries multiple TTS engines in order, falling back if on
 - Follow-up reminders with exponential backoff
 - Automatic cancellation when user responds
 - Per-notification type delays (permission requests are more urgent)
+- **Smart Quota Handling**: Automatically falls back to free Edge TTS if ElevenLabs quota is exceeded
 ### System Integration
+- **Native Edge TTS**: No external dependencies (Python/pip) required
 - Wake monitor from sleep before notifying
 - Auto-boost volume if too low
 - TUI toast notifications
 - Cross-platform support (Windows, macOS, Linux)
-## Installation
-### Option 1: From npm (Recommended)
-Add to your OpenCode config file (`~/.config/opencode/opencode.json`):
-```json
-{
-  "$schema": "https://opencode.ai/config.json",
-  "plugin": ["opencode-smart-voice-notify@latest"]
-}
-```
-### Option 2: From GitHub
-```json
-{
-  "$schema": "https://opencode.ai/config.json",
-  "plugin": ["github:MasuRii/opencode-smart-voice-notify"]
-}
-```
-### Option 3: Local Development
-1. Clone the repository:
-   ```bash
-   git clone https://github.com/MasuRii/opencode-smart-voice-notify.git
-   ```
-2. Reference the local path in your config:
-   ```json
-   {
-     "plugin": ["file:///path/to/opencode-smart-voice-notify"]
-   }
-   ```
-## Configuration
-### Automatic Setup
-When you first run OpenCode with this plugin installed, it will **automatically create**:
-1. **`~/.config/opencode/smart-voice-notify.jsonc`** - A comprehensive configuration file with all available options fully documented.
-2. **`~/.config/opencode/assets/*.mp3`** - Bundled notification sound files.
-The auto-generated configuration includes all advanced settings, message arrays, and engine options, so you don't have to refer back to the documentation for available settings.
-### Manual Configuration
-If you prefer to create the config manually, add a `smart-voice-notify.jsonc` file in your OpenCode config directory (`~/.config/opencode/`):
-```jsonc
-{
-    // ============================================================
-    // NOTIFICATION MODE SETTINGS (Smart Notification System)
-    // ============================================================
-    // Controls how notifications are delivered:
-    //   'sound-first' - Play sound immediately, TTS reminder after delay (RECOMMENDED)
-    //   'tts-first'   - Speak TTS immediately, no sound
-    //   'both'        - Play sound AND speak TTS immediately
-    //   'sound-only'  - Only play sound, no TTS at all
-    "notificationMode": "sound-first",
-    // ============================================================
-    // TTS REMINDER SETTINGS (When user doesn't respond to sound)
-    // ============================================================
-    // Enable TTS reminder if user doesn't respond after sound notification
-    "enableTTSReminder": true,
-    // Delay (in seconds) before TTS reminder fires
-    "ttsReminderDelaySeconds": 30,         // Global default
-    "idleReminderDelaySeconds": 30,        // For task completion notifications
-    "permissionReminderDelaySeconds": 20,  // For permission requests (more urgent)
-    // Follow-up reminders if user STILL doesn't respond after first TTS
-    "enableFollowUpReminders": true,
-    "maxFollowUpReminders": 3,              // Max number of follow-up TTS reminders
-    "reminderBackoffMultiplier": 1.5,       // Each follow-up waits longer (30s, 45s, 67s...)
+## Installation
+### Option 1: From npm (Recommended)
+Add to your OpenCode config file (`~/.config/opencode/opencode.json`):
+```json
+{
+  "$schema": "https://opencode.ai/config.json",
+  "plugin": ["opencode-smart-voice-notify@latest"]
+}
+```
+### Option 2: From GitHub
+```json
+{
+  "$schema": "https://opencode.ai/config.json",
+  "plugin": ["github:MasuRii/opencode-smart-voice-notify"]
+}
+```
+### Option 3: Local Development
+1. Clone the repository:
+   ```bash
+   git clone https://github.com/MasuRii/opencode-smart-voice-notify.git
+   ```
+2. Reference the local path in your config:
+   ```json
+   {
+     "plugin": ["file:///path/to/opencode-smart-voice-notify"]
+   }
+   ```
+## Configuration
+### Automatic Setup
+When you first run OpenCode with this plugin installed, it will **automatically create**:
+1. **`~/.config/opencode/smart-voice-notify.jsonc`** - A comprehensive configuration file with all available options fully documented.
+2. **`~/.config/opencode/assets/*.mp3`** - Bundled notification sound files.
+The auto-generated configuration includes all advanced settings, message arrays, and engine options, so you don't have to refer back to the documentation for available settings.
+### Manual Configuration
+If you prefer to create the config manually, add a `smart-voice-notify.jsonc` file in your OpenCode config directory (`~/.config/opencode/`):
+```jsonc
+{
+    // ============================================================
+    // NOTIFICATION MODE SETTINGS (Smart Notification System)
+    // ============================================================
+    // Controls how notifications are delivered:
+    //   'sound-first' - Play sound immediately, TTS reminder after delay (RECOMMENDED)
+    //   'tts-first'   - Speak TTS immediately, no sound
+    //   'both'        - Play sound AND speak TTS immediately
+    //   'sound-only'  - Only play sound, no TTS at all
+    "notificationMode": "sound-first",
+    // ============================================================
+    // TTS REMINDER SETTINGS (When user doesn't respond to sound)
+    // ============================================================
+    // Enable TTS reminder if user doesn't respond after sound notification
+    "enableTTSReminder": true,
+    // Delay (in seconds) before TTS reminder fires
+    "ttsReminderDelaySeconds": 30,         // Global default
+    "idleReminderDelaySeconds": 30,        // For task completion notifications
+    "permissionReminderDelaySeconds": 20,  // For permission requests (more urgent)
+    // Follow-up reminders if user STILL doesn't respond after first TTS
+    "enableFollowUpReminders": true,
+    "maxFollowUpReminders": 3,              // Max number of follow-up TTS reminders
+    "reminderBackoffMultiplier": 1.5,       // Each follow-up waits longer (30s, 45s, 67s...)
     // ============================================================
     // TTS ENGINE SELECTION
     // ============================================================
     // 'elevenlabs' - Best quality, anime-like voices (requires API key)
-    // 'edge'       - Good quality neural voices (free, requires: pip install edge-tts)
+    // 'edge'       - Good quality neural voices (Free, Native Node.js implementation)
     // 'sapi'       - Windows built-in voices (free, offline)
     "ttsEngine": "edge",
     "enableTTS": true,
-    // ============================================================
-    // ELEVENLABS SETTINGS (Best Quality - Anime-like Voices)
-    // ============================================================
-    // Get your API key from: https://elevenlabs.io/app/settings/api-keys
-    // "elevenLabsApiKey": "YOUR_API_KEY_HERE",
-    "elevenLabsVoiceId": "cgSgspJ2msm6clMCkdW9",
-    "elevenLabsModel": "eleven_turbo_v2_5",
-    "elevenLabsStability": 0.5,
-    "elevenLabsSimilarity": 0.75,
-    "elevenLabsStyle": 0.5,
-    // ============================================================
-    // EDGE TTS SETTINGS (Free Neural Voices - Default Engine)
-    // ============================================================
-    "edgeVoice": "en-US-AnaNeural",
-    "edgePitch": "+50Hz",
-    "edgeRate": "+10%",
-    // ============================================================
-    // SAPI SETTINGS (Windows Built-in - Last Resort Fallback)
-    // ============================================================
-    "sapiVoice": "Microsoft Zira Desktop",
-    "sapiRate": -1,
-    "sapiPitch": "medium",
-    "sapiVolume": "loud",
-    // ============================================================
-    // INITIAL TTS MESSAGES (Used immediately or after sound)
-    // ============================================================
-    "idleTTSMessages": [
-        "All done! Your task has been completed successfully.",
-        "Hey there! I finished working on your request.",
-        "Task complete! Ready for your review whenever you are.",
-        "Good news! Everything is done and ready for you.",
-        "Finished! Let me know if you need anything else."
-    ],
-    "permissionTTSMessages": [
-        "Attention please! I need your permission to continue.",
-        "Hey! Quick approval needed to proceed with the task.",
-        "Heads up! There is a permission request waiting for you.",
-        "Excuse me! I need your authorization before I can continue.",
-        "Permission required! Please review and approve when ready."
-    ],
-    // ============================================================
-    // TTS REMINDER MESSAGES (Used after delay if no response)
-    // ============================================================
-    "idleReminderTTSMessages": [
-        "Hey, are you still there? Your task has been waiting for review.",
-        "Just a gentle reminder - I finished your request a while ago!",
-        "Hello? I completed your task. Please take a look when you can.",
-        "Still waiting for you! The work is done and ready for review.",
-        "Knock knock! Your completed task is patiently waiting for you."
-    ],
-    "permissionReminderTTSMessages": [
-        "Hey! I still need your permission to continue. Please respond!",
-        "Reminder: There is a pending permission request. I cannot proceed without you.",
-        "Hello? I am waiting for your approval. This is getting urgent!",
-        "Please check your screen! I really need your permission to move forward.",
-        "Still waiting for authorization! The task is on hold until you respond."
-    ],
-    // ============================================================
-    // SOUND FILES (relative to OpenCode config directory)
-    // ============================================================
-    "idleSound": "assets/Soft-high-tech-notification-sound-effect.mp3",
-    "permissionSound": "assets/Machine-alert-beep-sound-effect.mp3",
-    // ============================================================
-    // GENERAL SETTINGS
-    // ============================================================
-    "wakeMonitor": true,
-    "forceVolume": true,
-    "volumeThreshold": 50,
-    "enableToast": true,
-    "enableSound": true,
-    "idleThresholdSeconds": 60,
-    "debugLog": false
-}
-```
-See `example.config.jsonc` for more details.
+    // ============================================================
+    // ELEVENLABS SETTINGS (Best Quality - Anime-like Voices)
+    // ============================================================
+    // Get your API key from: https://elevenlabs.io/app/settings/api-keys
+    // "elevenLabsApiKey": "YOUR_API_KEY_HERE",
+    "elevenLabsVoiceId": "cgSgspJ2msm6clMCkdW9",
+    "elevenLabsModel": "eleven_turbo_v2_5",
+    "elevenLabsStability": 0.5,
+    "elevenLabsSimilarity": 0.75,
+    "elevenLabsStyle": 0.5,
+    // ============================================================
+    // EDGE TTS SETTINGS (Free Neural Voices - Default Engine)
+    // ============================================================
+    "edgeVoice": "en-US-AnaNeural",
+    "edgePitch": "+50Hz",
+    "edgeRate": "+10%",
+    // ============================================================
+    // SAPI SETTINGS (Windows Built-in - Last Resort Fallback)
+    // ============================================================
+    "sapiVoice": "Microsoft Zira Desktop",
+    "sapiRate": -1,
+    "sapiPitch": "medium",
+    "sapiVolume": "loud",
+    // ============================================================
+    // INITIAL TTS MESSAGES (Used immediately or after sound)
+    // ============================================================
+    "idleTTSMessages": [
+        "All done! Your task has been completed successfully.",
+        "Hey there! I finished working on your request.",
+        "Task complete! Ready for your review whenever you are.",
+        "Good news! Everything is done and ready for you.",
+        "Finished! Let me know if you need anything else."
+    ],
+    "permissionTTSMessages": [
+        "Attention please! I need your permission to continue.",
+        "Hey! Quick approval needed to proceed with the task.",
+        "Heads up! There is a permission request waiting for you.",
+        "Excuse me! I need your authorization before I can continue.",
+        "Permission required! Please review and approve when ready."
+    ],
+    // ============================================================
+    // TTS REMINDER MESSAGES (Used after delay if no response)
+    // ============================================================
+    "idleReminderTTSMessages": [
+        "Hey, are you still there? Your task has been waiting for review.",
+        "Just a gentle reminder - I finished your request a while ago!",
+        "Hello? I completed your task. Please take a look when you can.",
+        "Still waiting for you! The work is done and ready for review.",
+        "Knock knock! Your completed task is patiently waiting for you."
+    ],
+    "permissionReminderTTSMessages": [
+        "Hey! I still need your permission to continue. Please respond!",
+        "Reminder: There is a pending permission request. I cannot proceed without you.",
+        "Hello? I am waiting for your approval. This is getting urgent!",
+        "Please check your screen! I really need your permission to move forward.",
+        "Still waiting for authorization! The task is on hold until you respond."
+    ],
+    // ============================================================
+    // SOUND FILES (relative to OpenCode config directory)
+    // ============================================================
+    "idleSound": "assets/Soft-high-tech-notification-sound-effect.mp3",
+    "permissionSound": "assets/Machine-alert-beep-sound-effect.mp3",
+    // ============================================================
+    // GENERAL SETTINGS
+    // ============================================================
+    "wakeMonitor": true,
+    "forceVolume": true,
+    "volumeThreshold": 50,
+    "enableToast": true,
+    "enableSound": true,
+    "idleThresholdSeconds": 60,
+    "debugLog": false
+}
+```
+See `example.config.jsonc` for more details.
 ## Requirements
 ### For ElevenLabs TTS
@@ -211,64 +219,64 @@ See `example.config.jsonc` for more details.
 - Internet connection
 ### For Edge TTS
-- Python with `edge-tts` package:
-  ```bash
-  pip install edge-tts
-  ```
+- Internet connection (No external dependencies required)
 ### For Windows SAPI
 - Windows OS (uses built-in System.Speech)
 ### For Sound Playback
-- **Windows**: Built-in (uses Windows Media Player)
-- **macOS**: Built-in (`afplay`)
-- **Linux**: `paplay` or `aplay`
-## Events Handled
-| Event | Action |
-|-------|--------|
-| `session.idle` | Agent finished working - notify user |
-| `permission.updated` | Permission request - alert user |
-| `permission.replied` | User responded - cancel pending reminders |
-| `message.updated` | New user message - cancel pending reminders |
-| `session.created` | New session - reset state |
-## Development
-To develop on this plugin locally:
-1. Clone the repository:
-   ```bash
-   git clone https://github.com/MasuRii/opencode-smart-voice-notify.git
-   cd opencode-smart-voice-notify
-   bun install  # or npm install
-   ```
-2. Link to your OpenCode config:
-   ```json
-   {
-     "plugin": ["file:///absolute/path/to/opencode-smart-voice-notify"]
-   }
-   ```
-## Updating
-OpenCode does not automatically update plugins. To update to the latest version:
-```bash
-# Clear the cached plugin
-rm -rf ~/.cache/opencode/node_modules/opencode-smart-voice-notify
-# Run OpenCode to trigger a fresh install
-opencode
-```
-## License
-MIT
-## Support
-- Open an issue on [GitHub](https://github.com/MasuRii/opencode-smart-voice-notify/issues)
-- Check the [OpenCode docs](https://opencode.ai/docs/plugins)
+- **Windows**: Built-in (uses Windows Media Player)
+- **macOS**: Built-in (`afplay`)
+- **Linux**: `paplay` or `aplay`
+## Events Handled
+| Event | Action |
+|-------|--------|
+| `session.idle` | Agent finished working - notify user |
+| `permission.updated` | Permission request - alert user |
+| `permission.replied` | User responded - cancel pending reminders |
+| `message.updated` | New user message - cancel pending reminders |
+| `session.created` | New session - reset state |
+## Development
+To develop on this plugin locally:
+1. Clone the repository:
+   ```bash
+   git clone https://github.com/MasuRii/opencode-smart-voice-notify.git
+   cd opencode-smart-voice-notify
+   bun install  # or npm install
+   ```
+2. Link to your OpenCode config:
+   ```json
+   {
+     "plugin": ["file:///absolute/path/to/opencode-smart-voice-notify"]
+   }
+   ```
+## Updating
+OpenCode does not automatically update plugins. To update to the latest version:
+```bash
+# Clear the cached plugin
+rm -rf ~/.cache/opencode/node_modules/opencode-smart-voice-notify
+# Run OpenCode to trigger a fresh install
+opencode
+```
+## License
+MIT
+## Support
+- Open an issue on [GitHub](https://github.com/MasuRii/opencode-smart-voice-notify/issues)
+- Check the [OpenCode docs](https://opencode.ai/docs/plugins)
+<!-- Dynamic Header -->
+<img width="100%" src="https://capsule-render.vercel.app/api?type=waving&color=0:667eea,100:764ba2&height=120&section=header"/>

package/example.config.jsonc CHANGED Viewed

@@ -48,7 +48,7 @@
     // TTS ENGINE SELECTION
     // ============================================================
     // 'elevenlabs' - Best quality, anime-like voices (requires API key, free tier: 10k chars/month)
-    // 'edge'       - Good quality neural voices (free, requires: pip install edge-tts)
+    // 'edge'       - Good quality neural voices (Free, Native Node.js implementation)
     // 'sapi'       - Windows built-in voices (free, offline, robotic)
     "ttsEngine": "elevenlabs",
@@ -81,7 +81,7 @@
     // ============================================================
     // EDGE TTS SETTINGS (Free Neural Voices - Fallback)
     // ============================================================
-    // Requires: pip install edge-tts
+    // Native Node.js implementation (No external dependencies)
     // Voice options (run 'edge-tts --list-voices' to see all):
     //   'en-US-AnaNeural'   - Young, cute, cartoon-like (RECOMMENDED)

package/index.js CHANGED Viewed

@@ -198,6 +198,12 @@ export default async function SmartVoiceNotifyPlugin({ project, client, $, direc
           fallbackSound: options.fallbackSound
         });
+        // CRITICAL FIX: Check if cancelled during playback (user responded while TTS was speaking)
+        if (!pendingReminders.has(type)) {
+          debugLog(`scheduleTTSReminder: ${type} cancelled during playback - aborting follow-up`);
+          return;
+        }
         // Clean up
         pendingReminders.delete(type);
@@ -270,6 +276,18 @@ export default async function SmartVoiceNotifyPlugin({ project, client, $, direc
       await playSound(soundFile, soundLoops);
     }
+    // CRITICAL FIX: Check if user responded during sound playback
+    // For idle notifications: check if there was new activity after the idle start
+    if (type === 'idle' && lastUserActivityTime > lastSessionIdleTime) {
+      debugLog(`smartNotify: user active during sound - aborting idle reminder`);
+      return;
+    }
+    // For permission notifications: check if the permission was already handled
+    if (type === 'permission' && !activePermissionId) {
+      debugLog(`smartNotify: permission handled during sound - aborting reminder`);
+      return;
+    }
     // Step 2: Schedule TTS reminder if user doesn't respond
     if (config.enableTTSReminder && ttsMessage) {
       scheduleTTSReminder(type, ttsMessage, { fallbackSound });
@@ -347,9 +365,12 @@ export default async function SmartVoiceNotifyPlugin({ project, client, $, direc
           // CRITICAL: Clear activePermissionId FIRST to prevent race condition
           // where permission.updated handler is still running async operations
           const repliedPermissionId = event.properties?.permissionID;
-          if (activePermissionId === repliedPermissionId) {
+          // Match if IDs are equal, or if we have an active permission with unknown ID (undefined)
+          // (This happens if permission.updated received an event without permissionID)
+          if (activePermissionId === repliedPermissionId || activePermissionId === undefined) {
             activePermissionId = null;
-            debugLog(`Permission replied: cleared activePermissionId ${repliedPermissionId}`);
+            debugLog(`Permission replied: cleared activePermissionId ${repliedPermissionId || '(unknown)'}`);
           }
           lastUserActivityTime = Date.now();
           cancelPendingReminder('permission'); // Cancel permission-specific reminder
@@ -402,7 +423,13 @@ export default async function SmartVoiceNotifyPlugin({ project, client, $, direc
         if (event.type === "permission.updated") {
           // CRITICAL: Capture permissionID IMMEDIATELY (before any async work)
           // This prevents race condition where user responds before we finish notifying
-          const permissionId = event.properties?.permissionID;
+          // NOTE: In permission.updated, the property is 'id', but in permission.replied it is 'permissionID'
+          const permissionId = event.properties?.id;
+          if (!permissionId) {
+             debugLog('permission.updated: permission ID missing. properties keys: ' + Object.keys(event.properties || {}).join(', '));
+          }
           activePermissionId = permissionId;
           debugLog(`permission.updated: notifying (permissionId=${permissionId})`);

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "opencode-smart-voice-notify",
-  "version": "1.0.7",
+  "version": "1.0.9",
   "description": "Smart voice notification plugin for OpenCode with multiple TTS engines (ElevenLabs, Edge TTS, Windows SAPI) and intelligent reminder system",
   "main": "index.js",
   "type": "module",
@@ -38,9 +38,10 @@
     "node": ">=18.0.0"
   },
   "dependencies": {
-    "@elevenlabs/elevenlabs-js": "^2.28.0"
+    "@elevenlabs/elevenlabs-js": "^2.28.0",
+    "msedge-tts": "^2.0.3"
   },
   "peerDependencies": {
     "@opencode-ai/plugin": "^1.0.0"
   }
-}
+}

package/util/linux.js ADDED Viewed

@@ -0,0 +1,468 @@
+/**
+ * Linux Platform Compatibility Module
+ *
+ * Provides Linux-specific implementations for:
+ * - Wake monitor from sleep (X11 and Wayland)
+ * - Get current system volume (PulseAudio/PipeWire and ALSA)
+ * - Force system volume up (PulseAudio/PipeWire and ALSA)
+ * - Play audio files (PulseAudio and ALSA)
+ *
+ * Dependencies (optional - graceful fallback if missing):
+ * - x11-xserver-utils (for xset on X11)
+ * - pulseaudio-utils or pipewire-pulse (for pactl)
+ * - alsa-utils (for amixer, aplay, paplay)
+ *
+ * @module util/linux
+ */
+/**
+ * Creates a Linux platform utilities instance
+ * @param {object} params - { $: shell runner, debugLog: logging function }
+ * @returns {object} Linux platform API
+ */
+export const createLinuxPlatform = ({ $, debugLog = () => {} }) => {
+  // ============================================================
+  // DISPLAY SESSION DETECTION
+  // ============================================================
+  /**
+   * Detect if running under Wayland
+   * @returns {boolean}
+   */
+  const isWayland = () => {
+    return !!process.env.WAYLAND_DISPLAY;
+  };
+  /**
+   * Detect if running under X11
+   * @returns {boolean}
+   */
+  const isX11 = () => {
+    return !!process.env.DISPLAY && !isWayland();
+  };
+  /**
+   * Get the current session type
+   * @returns {'x11' | 'wayland' | 'tty' | 'unknown'}
+   */
+  const getSessionType = () => {
+    const sessionType = process.env.XDG_SESSION_TYPE;
+    if (sessionType === 'x11' || sessionType === 'wayland' || sessionType === 'tty') {
+      return sessionType;
+    }
+    if (isWayland()) return 'wayland';
+    if (isX11()) return 'x11';
+    return 'unknown';
+  };
+  // ============================================================
+  // WAKE MONITOR
+  // ============================================================
+  /**
+   * Wake monitor using X11 DPMS (works on X11 and often XWayland)
+   * @returns {Promise<boolean>} Success status
+   */
+  const wakeMonitorX11 = async () => {
+    if (!$) return false;
+    try {
+      await $`xset dpms force on`.quiet();
+      debugLog('wakeMonitor: X11 xset dpms force on succeeded');
+      return true;
+    } catch (e) {
+      debugLog(`wakeMonitor: X11 xset failed: ${e.message}`);
+      return false;
+    }
+  };
+  /**
+   * Wake monitor using GNOME D-Bus (for GNOME on Wayland)
+   * Triggers a brightness step which wakes the display
+   * @returns {Promise<boolean>} Success status
+   */
+  const wakeMonitorGnomeDBus = async () => {
+    if (!$) return false;
+    try {
+      await $`gdbus call --session --dest org.gnome.SettingsDaemon.Power --object-path /org/gnome/SettingsDaemon/Power --method org.gnome.SettingsDaemon.Power.Screen.StepUp`.quiet();
+      debugLog('wakeMonitor: GNOME D-Bus StepUp succeeded');
+      return true;
+    } catch (e) {
+      debugLog(`wakeMonitor: GNOME D-Bus failed: ${e.message}`);
+      return false;
+    }
+  };
+  /**
+   * Wake monitor from sleep/DPMS standby
+   * Tries multiple methods with graceful fallback:
+   * 1. X11 xset (works on X11 and XWayland)
+   * 2. GNOME D-Bus (works on GNOME Wayland)
+   *
+   * @returns {Promise<boolean>} True if any method succeeded
+   */
+  const wakeMonitor = async () => {
+    // Try X11 method first (most compatible, works on XWayland too)
+    if (await wakeMonitorX11()) return true;
+    // Try GNOME Wayland D-Bus method
+    if (await wakeMonitorGnomeDBus()) return true;
+    debugLog('wakeMonitor: all methods failed');
+    return false;
+  };
+  // ============================================================
+  // VOLUME CONTROL - PULSEAUDIO / PIPEWIRE
+  // ============================================================
+  /**
+   * Get current volume using PulseAudio/PipeWire (pactl)
+   * @returns {Promise<number>} Volume percentage (0-100) or -1 if failed
+   */
+  const getVolumePulse = async () => {
+    if (!$) return -1;
+    try {
+      const result = await $`pactl get-sink-volume @DEFAULT_SINK@`.quiet();
+      const output = result.stdout?.toString() || '';
+      // Parse output like: "Volume: front-left: 65536 / 100% / 0.00 dB, ..."
+      const match = output.match(/(\d+)%/);
+      if (match) {
+        const volume = parseInt(match[1], 10);
+        debugLog(`getVolume: pactl returned ${volume}%`);
+        return volume;
+      }
+    } catch (e) {
+      debugLog(`getVolume: pactl failed: ${e.message}`);
+    }
+    return -1;
+  };
+  /**
+   * Set volume using PulseAudio/PipeWire (pactl)
+   * @param {number} volume - Volume percentage (0-100)
+   * @returns {Promise<boolean>} Success status
+   */
+  const setVolumePulse = async (volume) => {
+    if (!$) return false;
+    try {
+      const clampedVolume = Math.max(0, Math.min(100, volume));
+      await $`pactl set-sink-volume @DEFAULT_SINK@ ${clampedVolume}%`.quiet();
+      debugLog(`setVolume: pactl set to ${clampedVolume}%`);
+      return true;
+    } catch (e) {
+      debugLog(`setVolume: pactl failed: ${e.message}`);
+      return false;
+    }
+  };
+  /**
+   * Unmute using PulseAudio/PipeWire (pactl)
+   * @returns {Promise<boolean>} Success status
+   */
+  const unmutePulse = async () => {
+    if (!$) return false;
+    try {
+      await $`pactl set-sink-mute @DEFAULT_SINK@ 0`.quiet();
+      debugLog('unmute: pactl succeeded');
+      return true;
+    } catch (e) {
+      debugLog(`unmute: pactl failed: ${e.message}`);
+      return false;
+    }
+  };
+  /**
+   * Check if muted using PulseAudio/PipeWire
+   * @returns {Promise<boolean|null>} True if muted, false if not, null if failed
+   */
+  const isMutedPulse = async () => {
+    if (!$) return null;
+    try {
+      const result = await $`pactl get-sink-mute @DEFAULT_SINK@`.quiet();
+      const output = result.stdout?.toString() || '';
+      // Output: "Mute: yes" or "Mute: no"
+      return /yes|true/i.test(output);
+    } catch (e) {
+      debugLog(`isMuted: pactl failed: ${e.message}`);
+      return null;
+    }
+  };
+  // ============================================================
+  // VOLUME CONTROL - ALSA (FALLBACK)
+  // ============================================================
+  /**
+   * Get current volume using ALSA (amixer)
+   * @returns {Promise<number>} Volume percentage (0-100) or -1 if failed
+   */
+  const getVolumeAlsa = async () => {
+    if (!$) return -1;
+    try {
+      const result = await $`amixer get Master`.quiet();
+      const output = result.stdout?.toString() || '';
+      // Parse output like: "Front Left: Playback 65536 [75%] [on]"
+      const match = output.match(/\[(\d+)%\]/);
+      if (match) {
+        const volume = parseInt(match[1], 10);
+        debugLog(`getVolume: amixer returned ${volume}%`);
+        return volume;
+      }
+    } catch (e) {
+      debugLog(`getVolume: amixer failed: ${e.message}`);
+    }
+    return -1;
+  };
+  /**
+   * Set volume using ALSA (amixer)
+   * @param {number} volume - Volume percentage (0-100)
+   * @returns {Promise<boolean>} Success status
+   */
+  const setVolumeAlsa = async (volume) => {
+    if (!$) return false;
+    try {
+      const clampedVolume = Math.max(0, Math.min(100, volume));
+      await $`amixer set Master ${clampedVolume}%`.quiet();
+      debugLog(`setVolume: amixer set to ${clampedVolume}%`);
+      return true;
+    } catch (e) {
+      debugLog(`setVolume: amixer failed: ${e.message}`);
+      return false;
+    }
+  };
+  /**
+   * Unmute using ALSA (amixer)
+   * @returns {Promise<boolean>} Success status
+   */
+  const unmuteAlsa = async () => {
+    if (!$) return false;
+    try {
+      await $`amixer set Master unmute`.quiet();
+      debugLog('unmute: amixer succeeded');
+      return true;
+    } catch (e) {
+      debugLog(`unmute: amixer failed: ${e.message}`);
+      return false;
+    }
+  };
+  /**
+   * Check if muted using ALSA
+   * @returns {Promise<boolean|null>} True if muted, false if not, null if failed
+   */
+  const isMutedAlsa = async () => {
+    if (!$) return null;
+    try {
+      const result = await $`amixer get Master`.quiet();
+      const output = result.stdout?.toString() || '';
+      // Look for [off] or [mute] in output
+      return /\[off\]|\[mute\]/i.test(output);
+    } catch (e) {
+      debugLog(`isMuted: amixer failed: ${e.message}`);
+      return null;
+    }
+  };
+  // ============================================================
+  // UNIFIED VOLUME CONTROL (AUTO-DETECT BACKEND)
+  // ============================================================
+  /**
+   * Get current system volume
+   * Tries PulseAudio first, then falls back to ALSA
+   * @returns {Promise<number>} Volume percentage (0-100) or -1 if failed
+   */
+  const getCurrentVolume = async () => {
+    // Try PulseAudio/PipeWire first (most common on desktop Linux)
+    let volume = await getVolumePulse();
+    if (volume >= 0) return volume;
+    // Fallback to ALSA
+    volume = await getVolumeAlsa();
+    return volume;
+  };
+  /**
+   * Set system volume
+   * Tries PulseAudio first, then falls back to ALSA
+   * @param {number} volume - Volume percentage (0-100)
+   * @returns {Promise<boolean>} Success status
+   */
+  const setVolume = async (volume) => {
+    // Try PulseAudio/PipeWire first
+    if (await setVolumePulse(volume)) return true;
+    // Fallback to ALSA
+    return await setVolumeAlsa(volume);
+  };
+  /**
+   * Unmute system audio
+   * Tries PulseAudio first, then falls back to ALSA
+   * @returns {Promise<boolean>} Success status
+   */
+  const unmute = async () => {
+    // Try PulseAudio/PipeWire first
+    if (await unmutePulse()) return true;
+    // Fallback to ALSA
+    return await unmuteAlsa();
+  };
+  /**
+   * Check if system audio is muted
+   * Tries PulseAudio first, then falls back to ALSA
+   * @returns {Promise<boolean|null>} True if muted, false if not, null if detection failed
+   */
+  const isMuted = async () => {
+    // Try PulseAudio/PipeWire first
+    let muted = await isMutedPulse();
+    if (muted !== null) return muted;
+    // Fallback to ALSA
+    return await isMutedAlsa();
+  };
+  /**
+   * Force volume to maximum (unmute + set to 100%)
+   * Used to ensure notifications are audible
+   * @returns {Promise<boolean>} Success status
+   */
+  const forceVolume = async () => {
+    const unmuted = await unmute();
+    const volumeSet = await setVolume(100);
+    return unmuted || volumeSet;
+  };
+  /**
+   * Force volume if below threshold
+   * @param {number} threshold - Minimum volume threshold (0-100)
+   * @returns {Promise<boolean>} True if volume was forced, false if already adequate
+   */
+  const forceVolumeIfNeeded = async (threshold = 50) => {
+    const currentVolume = await getCurrentVolume();
+    // If we couldn't detect volume, force it to be safe
+    if (currentVolume < 0) {
+      debugLog('forceVolumeIfNeeded: could not detect volume, forcing');
+      return await forceVolume();
+    }
+    // Check if already above threshold
+    if (currentVolume >= threshold) {
+      debugLog(`forceVolumeIfNeeded: volume ${currentVolume}% >= ${threshold}%, no action needed`);
+      return false;
+    }
+    // Force volume up
+    debugLog(`forceVolumeIfNeeded: volume ${currentVolume}% < ${threshold}%, forcing to 100%`);
+    return await forceVolume();
+  };
+  // ============================================================
+  // AUDIO PLAYBACK
+  // ============================================================
+  /**
+   * Play an audio file using PulseAudio (paplay)
+   * @param {string} filePath - Path to audio file
+   * @returns {Promise<boolean>} Success status
+   */
+  const playAudioPulse = async (filePath) => {
+    if (!$) return false;
+    try {
+      await $`paplay ${filePath}`.quiet();
+      debugLog(`playAudio: paplay succeeded for ${filePath}`);
+      return true;
+    } catch (e) {
+      debugLog(`playAudio: paplay failed: ${e.message}`);
+      return false;
+    }
+  };
+  /**
+   * Play an audio file using ALSA (aplay)
+   * Note: aplay only supports WAV files natively
+   * @param {string} filePath - Path to audio file
+   * @returns {Promise<boolean>} Success status
+   */
+  const playAudioAlsa = async (filePath) => {
+    if (!$) return false;
+    try {
+      await $`aplay ${filePath}`.quiet();
+      debugLog(`playAudio: aplay succeeded for ${filePath}`);
+      return true;
+    } catch (e) {
+      debugLog(`playAudio: aplay failed: ${e.message}`);
+      return false;
+    }
+  };
+  /**
+   * Play an audio file
+   * Tries PulseAudio (paplay) first, then falls back to ALSA (aplay)
+   * @param {string} filePath - Path to audio file
+   * @param {number} loops - Number of times to play (default: 1)
+   * @returns {Promise<boolean>} Success status
+   */
+  const playAudioFile = async (filePath, loops = 1) => {
+    for (let i = 0; i < loops; i++) {
+      // Try PulseAudio first (supports more formats including MP3)
+      if (await playAudioPulse(filePath)) continue;
+      // Fallback to ALSA
+      if (await playAudioAlsa(filePath)) continue;
+      // Both failed
+      debugLog(`playAudioFile: all methods failed for ${filePath}`);
+      return false;
+    }
+    return true;
+  };
+  // ============================================================
+  // PUBLIC API
+  // ============================================================
+  return {
+    // Session detection
+    isWayland,
+    isX11,
+    getSessionType,
+    // Wake monitor
+    wakeMonitor,
+    wakeMonitorX11,
+    wakeMonitorGnomeDBus,
+    // Volume control (unified)
+    getCurrentVolume,
+    setVolume,
+    unmute,
+    isMuted,
+    forceVolume,
+    forceVolumeIfNeeded,
+    // Volume control (specific backends)
+    pulse: {
+      getVolume: getVolumePulse,
+      setVolume: setVolumePulse,
+      unmute: unmutePulse,
+      isMuted: isMutedPulse,
+    },
+    alsa: {
+      getVolume: getVolumeAlsa,
+      setVolume: setVolumeAlsa,
+      unmute: unmuteAlsa,
+      isMuted: isMutedAlsa,
+    },
+    // Audio playback
+    playAudioFile,
+    playAudioPulse,
+    playAudioAlsa,
+  };
+};

package/util/tts.js CHANGED Viewed

@@ -2,6 +2,7 @@ import path from 'path';
 import os from 'os';
 import fs from 'fs';
 import { loadConfig } from './config.js';
+import { createLinuxPlatform } from './linux.js';
 const platform = os.platform();
 const configDir = process.env.OPENCODE_CONFIG_DIR || path.join(os.homedir(), '.config', 'opencode');
@@ -110,6 +111,8 @@ export const getTTSConfig = () => {
   });
 };
+let elevenLabsQuotaExceeded = false;
 /**
  * Creates a TTS utility instance
  * @param {object} params - { $, client }
@@ -119,6 +122,7 @@ export const createTTS = ({ $, client }) => {
   const config = getTTSConfig();
   const logFile = path.join(configDir, 'smart-voice-notify-debug.log');
+  // Debug logging function (defined early so it can be passed to Linux platform)
   const debugLog = (message) => {
     if (!config.debugLog) return;
     try {
@@ -127,6 +131,24 @@ export const createTTS = ({ $, client }) => {
     } catch (e) {}
   };
+  // Initialize Linux platform utilities (only used on Linux)
+  const linux = platform === 'linux' ? createLinuxPlatform({ $, debugLog }) : null;
+  const showToast = async (message, variant = 'info') => {
+    if (!config.enableToast) return;
+    try {
+      if (typeof client?.tui?.showToast === 'function') {
+        await client.tui.showToast({
+          body: {
+            message: message,
+            variant: variant,
+            duration: 6000
+          }
+        });
+      }
+    } catch (e) {}
+  };
   /**
    * Play an audio file using system media player
    */
@@ -156,7 +178,11 @@ export const createTTS = ({ $, client }) => {
         for (let i = 0; i < loops; i++) {
           await $`afplay ${filePath}`.quiet();
         }
+      } else if (platform === 'linux' && linux) {
+        // Use the Linux platform module for audio playback
+        await linux.playAudioFile(filePath, loops);
       } else {
+        // Generic fallback for other Unix-like systems
         for (let i = 0; i < loops; i++) {
           try {
             await $`paplay ${filePath}`.quiet();
@@ -174,6 +200,8 @@ export const createTTS = ({ $, client }) => {
    * ElevenLabs Engine (Online, High Quality, Anime-like voices)
    */
   const speakWithElevenLabs = async (text) => {
+    if (elevenLabsQuotaExceeded) return false;
     if (!config.elevenLabsApiKey) {
       debugLog('speakWithElevenLabs: No API key configured');
       return false;
@@ -204,6 +232,19 @@ export const createTTS = ({ $, client }) => {
       return true;
     } catch (e) {
       debugLog(`speakWithElevenLabs error: ${e.message}`);
+      // Handle quota exceeded (401 specifically, or specific error message)
+      const isQuotaError =
+        e.statusCode === 401 ||
+        e.message?.includes('401') ||
+        e.message?.toLowerCase().includes('quota_exceeded') ||
+        e.message?.toLowerCase().includes('quota exceeded');
+      if (isQuotaError) {
+        elevenLabsQuotaExceeded = true;
+        await showToast("⚠️ ElevenLabs quota exceeded! Switching to Edge TTS for this session.", "error");
+      }
       return false;
     }
   };
@@ -212,16 +253,20 @@ export const createTTS = ({ $, client }) => {
    * Edge TTS Engine (Free, Neural voices)
    */
   const speakWithEdgeTTS = async (text) => {
-    if (!$) return false;
     try {
-      const voice = config.edgeVoice || 'en-US-AnaNeural';
+      const { MsEdgeTTS, OUTPUT_FORMAT } = await import('msedge-tts');
+      const tts = new MsEdgeTTS();
+      const voice = config.edgeVoice || 'en-US-JennyNeural';
       const pitch = config.edgePitch || '+0Hz';
-      const rate = config.edgeRate || '+0%';
-      const tempFile = path.join(os.tmpdir(), `opencode-edge-${Date.now()}.mp3`);
+      const rate = config.edgeRate || '+10%';
+      const volume = config.edgeVolume || '+0%';
-      await $`edge-tts --voice ${voice} --pitch ${pitch} --rate ${rate} --text ${text} --write-media ${tempFile}`.quiet();
-      await playAudioFile(tempFile);
-      try { fs.unlinkSync(tempFile); } catch (e) {}
+      await tts.setMetadata(voice, OUTPUT_FORMAT.AUDIO_24KHZ_48KBITRATE_MONO_MP3);
+      const { audioFilePath } = await tts.toFile(os.tmpdir(), text, { pitch, rate, volume });
+      await playAudioFile(audioFilePath);
+      try { fs.unlinkSync(audioFilePath); } catch (e) {}
       return true;
     } catch (e) {
       debugLog(`speakWithEdgeTTS error: ${e.message}`);
@@ -301,8 +346,15 @@ ${ssml}
   /**
    * Check if the system has been idle long enough that the monitor might be asleep.
+   * On Linux, we always return true (assume monitor might be asleep) since idle detection
+   * varies significantly across desktop environments.
    */
   const isMonitorLikelyAsleep = async () => {
+    if (platform === 'linux') {
+      // On Linux, we can't reliably detect idle time across all DEs
+      // Return true to always attempt wake (it's a no-op if already awake)
+      return true;
+    }
     if (platform !== 'win32' || !$) return true;
     try {
       const idleThreshold = config.idleThresholdSeconds || 60;
@@ -342,6 +394,10 @@ public static class IdleCheck {
    * Get the current system volume level (0-100).
    */
   const getCurrentVolume = async () => {
+    // Use Linux platform module
+    if (platform === 'linux' && linux) {
+      return await linux.getCurrentVolume();
+    }
     if (platform !== 'win32' || !$) return -1;
     try {
       const cmd = `
@@ -380,6 +436,9 @@ public static extern int waveOutGetVolume(IntPtr hwo, out uint dwVolume);
         await $`powershell.exe -NoProfile -ExecutionPolicy Bypass -Command ${cmd}`.quiet();
       } else if (platform === 'darwin') {
         await $`caffeinate -u -t 1`.quiet();
+      } else if (platform === 'linux' && linux) {
+        // Use the Linux platform module for wake monitor
+        await linux.wakeMonitor();
       }
     } catch (e) {
       debugLog(`wakeMonitor error: ${e.message}`);
@@ -403,6 +462,9 @@ public static extern int waveOutGetVolume(IntPtr hwo, out uint dwVolume);
         await $`powershell.exe -NoProfile -ExecutionPolicy Bypass -Command ${cmd}`.quiet();
       } else if (platform === 'darwin') {
         await $`osascript -e "set volume output volume 100"`.quiet();
+      } else if (platform === 'linux' && linux) {
+        // Use the Linux platform module for force volume
+        await linux.forceVolume();
       }
     } catch (e) {
       debugLog(`forceVolume error: ${e.message}`);