npm - speechflow - Versions diffs - 1.4.5 → 1.5.0 - Mend

speechflow 1.4.5 → 1.5.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (166) hide show

package/CHANGELOG.md CHANGED Viewed

@@ -2,6 +2,34 @@
 ChangeLog
 =========
+1.5.0 (2025-08-31)
+------------------
+- IMPROVEMENT: add improved dashboard infrastructure and allow nodes to publish dashboard info
+- IMPROVEMENT: add CLI option for exporting dashboard info via OSC
+- IMPROVEMENT: add new audio processing nodes (compressor with sidechain, expander, gain, filler)
+- IMPROVEMENT: add AWS integration nodes (Polly, Translate, Transcribe)
+- IMPROVEMENT: add OpenAI Transcribe node for speech-to-text
+- IMPROVEMENT: add noise suppression nodes (rnnoise, speex)
+- IMPROVEMENT: provide audio helper utilities and access bus functionality
+- IMPROVEMENT: improve types and error handling
+- IMPROVEMENT: switch to GPT-5 with improved error handling and timeout support
+- IMPROVEMENT: switch from native compressor to custom implementation
+- BUGFIX: fix usage of AudioIO quit and abort methods
+- BUGFIX: fix operator order in audio processing
+- BUGFIX: reset envelope array when channels change
+- BUGFIX: fix parameter configuration in audio nodes
+- BUGFIX: fix private field access and remove unnecessary casts
+- UPDATE: upgrade NPM dependencies
+- UPDATE: update OxLint rules and configuration
+- CLEANUP: cleanup and simplify code throughout project
+- CLEANUP: cleanup expander node implementation and remove stereoLink feature
+- CLEANUP: cleanup gender, ffmpeg, filler, and AWS nodes
+- CLEANUP: reduce code depth in multiple components
+- CLEANUP: align identifiers with remaining code
+- CLEANUP: make code compliant with updated linter rules
+- CLEANUP: fix indentation and remove duplicate entries
 1.4.5 (2025-08-07)
 ------------------

package/README.md CHANGED Viewed

@@ -31,10 +31,20 @@ remote MQTT network I/O,
 local Voice Activity Detection (VAD),
 local voice gender recognition,
 local audio LUFS-S/RMS metering,
+local audio Speex noise suppression,
+local audio RNNoise noise suppression,
+local audio compressor processing,
+local audio expander processing,
+local audio gain processing,
+local audio filler processing,
 remote-controlable local audio muting,
+cloud-based [Amazon Transcribe](https://aws.amazon.com/transcribe/) speech-to-text conversion,
+cloud-based [OpenAI GPT Transcribe](https://platform.openai.com/docs/models/gpt-4o-mini-transcribe) speech-to-text conversion,
 cloud-based [Deepgram](https://deepgram.com) speech-to-text conversion,
 cloud-based [ElevenLabs](https://elevenlabs.io/) text-to-speech conversion,
+cloud-based [Amazon Polly](https://aws.amazon.com/polly/) text-to-speech conversion,
 cloud-based [DeepL](https://deepl.com) text-to-text translation,
+cloud-based [Amazon Translate](https://aws.amazon.com/translate/) text-to-text translation,
 cloud-based [OpenAI/GPT](https://openai.com) text-to-text translation (or spelling correction),
 local [Ollama/Gemma](https://ollama.com) text-to-text translation (or spelling correction),
 local [OPUS/ONNX](https://github.com/Helsinki-NLP/Opus-MT) text-to-text translation,
@@ -288,18 +298,29 @@ First a short overview of the available processing nodes:
   **mute**,
   **meter**,
   **vad**,
-  **gender**.
+  **gender**,
+  **speex**,
+  **rrnoise**,
+  **compressor**,
+  **expander**,
+  **gain**,
+  **filler**.
 - Audio-to-Text nodes:
+  **openaitranscribe**,
+  **awstranscribe**,
   **deepgram**.
 - Text-to-Text nodes:
   **deepl**,
+  **awstranslate**,
   **openai**,
   **ollama**,
   **transformers**,
   **subtitle**,
   **format**.
 - Text-to-Audio nodes:
+  **awspolly**.
   **elevenlabs**.
+  **kokoro**.
 - Any-to-Any nodes:
   **filter**,
   **trace**.
@@ -503,10 +524,160 @@ The following nodes process audio chunks only.
   | ----------- | --------- | -------- | ------------------------ |
   | **window**  | 0         | 500      | *none*                   |
+- Node: **speex**<br/>
+  Purpose: **Speex Noise Suppression node**<br/>
+  Example: `speex(attentuate: -18)`
+  > This node uses the Speex DSP pre-processor to perform noise
+  > suppression, i.e., it detects and attenuates (by a certain level of
+  > dB) the noise in the audio stream.
+  | Port    | Payload     |
+  | ------- | ----------- |
+  | input   | audio       |
+  | output  | audio       |
+  | Parameter   | Position  | Default  | Requirement              |
+  | ----------- | --------- | -------- | ------------------------ |
+  | **attentuate** | 0 | -18  | *none* | `-60 <= n <= 0` |
+- Node: **rnnoise**<br/>
+  Purpose: **RNNoise Noise Suppression node**<br/>
+  Example: `rnnoise()`
+  > This node uses RNNoise to perform noise suppression, i.e., it
+  > detects and attenuates the noise in the audio stream.
+  | Port    | Payload     |
+  | ------- | ----------- |
+  | input   | audio       |
+  | output  | audio       |
+  | Parameter   | Position  | Default  | Requirement              |
+  | ----------- | --------- | -------- | ------------------------ |
+- Node: **compressor**<br/>
+  Purpose: **audio compressor node**<br/>
+  Example: `compressor(thresholdDb: -18)`
+  > This node applies a dynamics compressor, i.e., it attenuates the
+  > volume by a certain ratio whenever the volume is above the threshold.
+  | Port    | Payload     |
+  | ------- | ----------- |
+  | input   | audio       |
+  | output  | audio       |
+  | Parameter   | Position  | Default  | Requirement              |
+  | ----------- | --------- | -------- | ------------------------ |
+  | **thresholdDb** | *none* | -18 | `n <= 0 && n >= -60` |
+  | **ratio**       | *none* | 4   | `n >= 1 && n <= 20`  |
+  | **attackMs**    | *none* | 10  | `n >= 0 && n <= 100` |
+  | **releaseMs**   | *none* | 50  | `n >= 0 && n <= 100` |
+  | **kneeDb**      | *none* | 6   | `n >= 0 && n <= 100` |
+  | **makeupDb**    | *none* | 0   | `n >= 0 && n <= 100` |
+- Node: **expander**<br/>
+  Purpose: **audio expander node**<br/>
+  Example: `expander(thresholdDb: -46)`
+  > This node applies a dynamics expander, i.e., it attenuates the
+  > volume by a certain ratio whenever the volume is below the threshold.
+  | Port    | Payload     |
+  | ------- | ----------- |
+  | input   | audio       |
+  | output  | audio       |
+  | Parameter   | Position  | Default  | Requirement              |
+  | ----------- | --------- | -------- | ------------------------ |
+  | **thresholdDb** | *none* | -45 | `n <= 0 && n >= -60` |
+  | **ratio**       | *none* | 4   | `n >= 1 && n <= 20`  |
+  | **attackMs**    | *none* | 10  | `n >= 0 && n <= 100` |
+  | **releaseMs**   | *none* | 50  | `n >= 0 && n <= 100` |
+  | **kneeDb**      | *none* | 6   | `n >= 0 && n <= 100` |
+  | **makeupDb**    | *none* | 0   | `n >= 0 && n <= 100` |
+- Node: **gain**<br/>
+  Purpose: **audio gain adjustment node**<br/>
+  Example: `gain(db: 12)`
+  > This node applies a gain adjustment to audio, i.e., it increases or
+  > decreases the volume by certain decibels
+  | Port    | Payload     |
+  | ------- | ----------- |
+  | input   | audio       |
+  | output  | audio       |
+  | Parameter   | Position  | Default  | Requirement              |
+  | ----------- | --------- | -------- | ------------------------ |
+  | **db** | *none* | 12 | `n >= -60 && n <= -60` |
+- Node: **filler**<br/>
+  Purpose: **audio filler node**<br/>
+  Example: `filler()`
+  > This node adds missing audio frames of silence in order to fill
+  > the chronological gaps between generated audio frames (from
+  > text-to-speech).
+  | Port    | Payload     |
+  | ------- | ----------- |
+  | input   | audio       |
+  | output  | audio       |
+  | Parameter   | Position  | Default  | Requirement              |
+  | ----------- | --------- | -------- | ------------------------ |
 ### Audio-to-Text Nodes
 The following nodes convert audio to text chunks.
+- Node: **openaitranscribe**<br/>
+  Purpose: **OpenAI/GPT Speech-to-Text conversion**<br/>
+  Example: `openaitranscribe(language: "de")`<br/>
+  Notice: this node requires an OpenAI API key!
+  > This node uses OpenAI GPT to perform Speech-to-Text (S2T)
+  > conversion, i.e., it recognizes speech in the input audio stream and
+  > outputs a corresponding text stream.
+  | Port    | Payload     |
+  | ------- | ----------- |
+  | input   | text        |
+  | output  | text        |
+  | Parameter    | Position  | Default  | Requirement        |
+  | ------------ | --------- | -------- | ------------------ |
+  | **key**      | *none*    | env.SPEECHFLOW\_OPENAI\_KEY | *none* |
+  | **api**      | *none*    | "https://api.openai.com" | `/^https?:\/\/.+?:\d+$/` |
+  | **model**    | *none*    | "gpt-4o-mini-transcribe" | *none* |
+  | **language** | *none*    | "en"     | `/^(?:de\|en)$/` |
+  | **interim**  | *none*    | false    | *none* |
+- Node: **awstranscribe**<br/>
+  Purpose: **Amazon Transcribe Speech-to-Text conversion**<br/>
+  Example: `awstranscribe(language: "de")`<br/>
+  Notice: this node requires an API key!
+  > This node uses Amazon Trancribe to perform Speech-to-Text (S2T)
+  > conversion, i.e., it recognizes speech in the input audio stream and
+  > outputs a corresponding text stream.
+  | Port    | Payload     |
+  | ------- | ----------- |
+  | input   | audio       |
+  | output  | text        |
+  | Parameter    | Position  | Default  | Requirement        |
+  | ------------ | --------- | -------- | ------------------ |
+  | **key**      | *none*    | env.SPEECHFLOW\_AMAZON\_KEY | *none* |
+  | **secKey**   | *none*    | env.SPEECHFLOW\_AMAZON\_KEY\_SEC | *none* |
+  | **region**   | *none*    | "eu-central-1" | *none* |
+  | **language** | *none*    | "en" | `/^(?:en|de)$/` |
+  | **interim**  | *none*    | false | *none* |
 - Node: **deepgram**<br/>
   Purpose: **Deepgram Speech-to-Text conversion**<br/>
   Example: `deepgram(language: "de")`<br/>
@@ -551,6 +722,26 @@ The following nodes process text chunks only.
   | **src**      | 0         | "de"     | `/^(?:de\|en)$/` |
   | **dst**      | 1         | "en"     | `/^(?:de\|en)$/` |
+- Node: **awstranslate**<br/>
+  Purpose: **AWS Translate Text-to-Text translation**<br/>
+  Example: `awstranslate(src: "de", dst: "en")`<br/>
+  Notice: this node requires an API key!
+  > This node performs translation between English and German languages.
+  | Port    | Payload     |
+  | ------- | ----------- |
+  | input   | text        |
+  | output  | text        |
+  | Parameter    | Position  | Default  | Requirement        |
+  | ------------ | --------- | -------- | ------------------ |
+  | **key**      | *none*    | env.SPEECHFLOW\_AMAZON\_KEY | *none* |
+  | **secKey**   | *none*    | env.SPEECHFLOW\_AMAZON\_KEY\_SEC | *none* |
+  | **region**   | *none*    | "eu-central-1" | *none* |
+  | **src**      | 0         | "de"     | `/^(?:de\|en)$/` |
+  | **dst**      | 1         | "en"     | `/^(?:de\|en)$/` |
 - Node: **openai**<br/>
   Purpose: **OpenAI/GPT Text-to-Text translation and spelling correction**<br/>
   Example: `openai(src: "de", dst: "en")`<br/>
@@ -671,14 +862,36 @@ The following nodes process text chunks only.
 The following nodes convert text chunks to audio chunks.
+- Node: **awspolly**<br/>
+  Purpose: **Amazon Polly Text-to-Speech conversion**<br/>
+  Example: `awspolly(language: "en", voice: "Danielle)`<br/>
+  Notice: this node requires an Amazon API key!
+  > This node uses Amazon Polly to perform Text-to-Speech (T2S)
+  > conversion, i.e., it converts the input text stream into an output
+  > audio stream. It is intended to generate speech.
+  | Port    | Payload     |
+  | ------- | ----------- |
+  | input   | text        |
+  | output  | audio       |
+  | Parameter      | Position  | Default   | Requirement        |
+  | -------------- | --------- | --------- | ------------------ |
+  | **key**        | *none*    | env.SPEECHFLOW\_AMAZON\_KEY | *none* |
+  | **secKey**     | *none*    | env.SPEECHFLOW\_AMAZON\_KEY\_SEC | *none* |
+  | **region**     | *none*    | "eu-central-1" | *none* |
+  | **voice**      | 0         | "Amy"     | `^(?:Amy|Danielle|Joanna|Matthew|Ruth|Stephen|Viki|Daniel)$/` |
+  | **language**   | 1         | "en"      | `/^(?:de\|en)$/`  |
 - Node: **elevenlabs**<br/>
   Purpose: **ElevenLabs Text-to-Speech conversion**<br/>
   Example: `elevenlabs(language: "en")`<br/>
   Notice: this node requires an ElevenLabs API key!
-  > This node perform Text-to-Speech (T2S) conversion, i.e., it converts
-  > the input text stream into an output audio stream. It is intended to
-  > generate speech.
+  > This node uses ElevenLabs to perform Text-to-Speech (T2S)
+  > conversion, i.e., it converts the input text stream into an output
+  > audio stream. It is intended to generate speech.
   | Port    | Payload     |
   | ------- | ----------- |
@@ -700,9 +913,9 @@ The following nodes convert text chunks to audio chunks.
   Example: `kokoro(language: "en")`<br/>
   Notice: this currently support English language only!
-  > This node perform Text-to-Speech (T2S) conversion, i.e., it converts
-  > the input text stream into an output audio stream. It is intended to
-  > generate speech.
+  > This node uses Kokoro to perform Text-to-Speech (T2S) conversion,
+  > i.e., it converts the input text stream into an output audio stream.
+  > It is intended to generate speech.
   | Port    | Payload     |
   | ------- | ----------- |

package/etc/claude.md ADDED Viewed

@@ -0,0 +1,70 @@
+# CLAUDE.md
+This file provides guidance to Claude Code (claude.ai/code) when working
+with code in this repository.
+## Project Overview
+SpeechFlow is a command-line interface tool for establishing directed
+data flow graphs of audio and text processing nodes. It enables flexible
+speech processing tasks including capturing audio, text-to-speech,
+speech-to-text, and speech-to-speech translation.
+## Architecture
+SpeechFlow uses a modular node-based architecture:
+- **Core Engine**: TypeScript-based CLI tool that orchestrates processing flows
+- **Processing Nodes**: Modular components for different speech processing tasks (see `src/speechflow-node-*.ts`)
+- **Flow Expression Language**: Based on FlowLink for defining processing graphs
+- **Web Interfaces**: Two Vue.js applications for dashboard and subtitle display
+- **REST/WebSocket API**: External control interface for nodes
+### Key Components
+- **Main CLI**:
+  `src/speechflow.ts` - Entry point and CLI parsing
+- **Nodes**:
+  - Input/Output: `file`, `device`, `websocket`, `mqtt`
+  - Audio-to-Audio: `ffmpeg`, `wav`, `mute`, `meter`, `vad`, `gender`
+  - Audio-to-Text: `deepgram`
+  - Text-to-Text: `deepl`, `openai`, `ollama`, `transformers`, `subtitle`, `format`, `sentence`
+  - Text-to-Audio: `elevenlabs`, `kokoro`
+  - Any-to-Any: `filter`, `trace`
+## Development Commands
+The project uses STX (Simple Task eXecutor) for build automation. Main commands:
+### Core Project
+```bash
+npm start lint          # Static code analysis (TypeScript, ESLint, Biome, Oxlint)
+npm start build         # Compile TypeScript to JavaScript in dst/
+npm start dev           # Multi-pane development dashboard with linting, building, and server
+npm start server        # Run the main speechflow program
+npm start clean         # Remove generated files
+```
+## Project Structure
+- `src/` - Main TypeScript source files
+- `dst/` - Compiled JavaScript output
+- `etc/` - Configuration files (TypeScript, ESLint, Biome, etc.)
+- `package.d/` - NPM package patches
+## Development Notes
+- Node.js 22+ required
+- Uses object-mode streaming with timestamps for audio/text processing
+- External services integration: Deepgram, ElevenLabs, DeepL, OpenAI, Ollama
+- Supports local processing: FFmpeg, WAV, Voice Activity Detection, Gender Detection
+- REST/WebSocket API on port 8484 for external control
+## Configuration
+Main configuration in `etc/speechflow.yaml` with example
+processing graphs. Environment variables used for API keys (e.g.,
+`SPEECHFLOW_DEEPGRAM_KEY`, `SPEECHFLOW_ELEVENLABS_KEY`).

package/etc/speechflow.yaml CHANGED Viewed

@@ -68,8 +68,10 @@ studio-transcription: |
                 subtitle(format: "vtt") |
                     file(path: argv.2, mode: "w", type: "text"),
                 subtitle(format: "srt") |
-                    file(path: argv.3, mode: "w", type: "text")
-                elevenlabs(voice: "Mark", optimize: "quality", speed: 1.05, language: "en")
+                    file(path: argv.3, mode: "w", type: "text"),
+                elevenlabs(voice: "Mark", optimize: "quality", speed: 1.05, language: "en") |
+                    wav(mode: "encode") |
+                        file(path: argv.4, mode: "w", type: "audio")
             }
         }
     }
@@ -102,7 +104,7 @@ studio-translation: |
                                 filter(name: "S2T-female", type: "text", var: "meta:gender", op: "==", val: "female") |
                                     elevenlabs(voice: "Brittney", optimize: "latency", speed: 1.05, language: "en")
                             } | {
-                                meter(interval: 250, dashboard: "meter2", dashboard: "meter2"),
+                                meter(interval: 250, dashboard: "meter2"),
                                 wav(mode: "encode") |
                                     file(path: "program-en.wav", mode: "w", type: "audio"),
                                 device(device: "coreaudio:USBAudio2.0", mode: "w")

package/etc/stx.conf CHANGED Viewed

@@ -17,6 +17,13 @@ upd
     (cd speechflow-ui-db && npx -y upd) && \
     (cd speechflow-ui-st && npx -y upd)
+#   [top-level] provide statistics about code base
+cloc
+    cloc etc \
+        speechflow-cli/etc   speechflow-cli/src   \
+        speechflow-ui-db/etc speechflow-ui-db/src \
+        speechflow-ui-st/etc speechflow-ui-st/src
 #   [top-level] lint components for development
 lint
     npm --prefix speechflow-cli   start lint && \

package/package.json CHANGED Viewed

@@ -1,10 +1,11 @@
 {
     "name":             "speechflow",
-    "version":          "1.4.5",
-    "x-stdver":         "1.4.5-GA",
-    "x-release":        "2025-08-07",
+    "version":          "1.5.0",
+    "x-stdver":         "1.5.0-GA",
+    "x-release":        "2025-08-31",
     "homepage":         "https://github.com/rse/speechflow",
     "description":      "Speech Processing Flow Graph",
+    "keywords":         [ "speech", "audio", "flow", "graph" ],
     "license":          "GPL-3.0-only",
     "author": {
         "name":         "Dr. Ralf S. Engelschall",
@@ -16,17 +17,17 @@
         "url":          "git+https://github.com/rse/speechflow.git"
     },
     "dependencies": {
-        "@rse/stx":     "1.0.7"
+        "@rse/stx":     "1.0.9"
     },
     "devDependencies": {
         "nodemon":      "3.1.10",
         "watch":        "1.0.2",
-        "concurrently": "9.2.0",
+        "concurrently": "9.2.1",
         "wait-on":      "8.0.4",
         "cross-env":    "10.0.0",
         "shx":          "0.4.0"
     },
-    "engines" : {
+    "engines": {
         "npm":          ">=10.0.0",
         "node":         ">=22.0.0"
     },

package/speechflow-cli/dst/speechflow-node-a2a-compressor-wt.d.ts ADDED Viewed

	@@ -0,0 +1 @@
1	+ export {};

package/speechflow-cli/dst/speechflow-node-a2a-compressor-wt.js ADDED Viewed

@@ -0,0 +1,155 @@
+"use strict";
+/*
+**  SpeechFlow - Speech Processing Flow Graph
+**  Copyright (c) 2024-2025 Dr. Ralf S. Engelschall <rse@engelschall.com>
+**  Licensed under GPL 3.0 <https://spdx.org/licenses/GPL-3.0-only>
+*/
+var __createBinding = (this && this.__createBinding) || (Object.create ? (function(o, m, k, k2) {
+    if (k2 === undefined) k2 = k;
+    var desc = Object.getOwnPropertyDescriptor(m, k);
+    if (!desc || ("get" in desc ? !m.__esModule : desc.writable || desc.configurable)) {
+      desc = { enumerable: true, get: function() { return m[k]; } };
+    }
+    Object.defineProperty(o, k2, desc);
+}) : (function(o, m, k, k2) {
+    if (k2 === undefined) k2 = k;
+    o[k2] = m[k];
+}));
+var __setModuleDefault = (this && this.__setModuleDefault) || (Object.create ? (function(o, v) {
+    Object.defineProperty(o, "default", { enumerable: true, value: v });
+}) : function(o, v) {
+    o["default"] = v;
+});
+var __importStar = (this && this.__importStar) || (function () {
+    var ownKeys = function(o) {
+        ownKeys = Object.getOwnPropertyNames || function (o) {
+            var ar = [];
+            for (var k in o) if (Object.prototype.hasOwnProperty.call(o, k)) ar[ar.length] = k;
+            return ar;
+        };
+        return ownKeys(o);
+    };
+    return function (mod) {
+        if (mod && mod.__esModule) return mod;
+        var result = {};
+        if (mod != null) for (var k = ownKeys(mod), i = 0; i < k.length; i++) if (k[i] !== "default") __createBinding(result, mod, k[i]);
+        __setModuleDefault(result, mod);
+        return result;
+    };
+})();
+Object.defineProperty(exports, "__esModule", { value: true });
+const utils = __importStar(require("./speechflow-utils"));
+/*  downward compressor with soft knee  */
+class CompressorProcessor extends AudioWorkletProcessor {
+    /*  internal state  */
+    env = [];
+    sampleRate;
+    reduction = 0;
+    /*  eslint no-undef: off */
+    static get parameterDescriptors() {
+        return [
+            { name: "threshold", defaultValue: -23, minValue: -100, maxValue: 0, automationRate: "k-rate" }, // dBFS
+            { name: "ratio", defaultValue: 4.0, minValue: 1.0, maxValue: 20, automationRate: "k-rate" }, // compression ratio
+            { name: "attack", defaultValue: 0.010, minValue: 0.0, maxValue: 1, automationRate: "k-rate" }, // seconds
+            { name: "release", defaultValue: 0.050, minValue: 0.0, maxValue: 1, automationRate: "k-rate" }, // seconds
+            { name: "knee", defaultValue: 6.0, minValue: 0.0, maxValue: 40, automationRate: "k-rate" }, // dB
+            { name: "makeup", defaultValue: 0.0, minValue: -24, maxValue: 24, automationRate: "k-rate" } // dB
+        ];
+    }
+    /*  class constructor for custom option processing  */
+    constructor(options) {
+        super();
+        const { sampleRate } = options.processorOptions;
+        this.sampleRate = sampleRate;
+    }
+    /*  determine gain difference  */
+    gainDBFor(levelDB, thresholdDB, ratio, kneeDB) {
+        /*  short-circuit for unreasonable ratio  */
+        if (ratio <= 1.0)
+            return 0;
+        /*  determine thresholds  */
+        const halfKnee = kneeDB * 0.5;
+        const belowThr = levelDB < thresholdDB;
+        const aboveKnee = levelDB >= (thresholdDB + halfKnee);
+        /*  short-circuit for no compression (below threshold)  */
+        if (belowThr)
+            return 0;
+        /*  apply soft-knee  */
+        if (kneeDB > 0 && !aboveKnee) {
+            const x = (levelDB - thresholdDB) / kneeDB;
+            const idealGainDB = (thresholdDB + (levelDB - thresholdDB) / ratio) - levelDB;
+            return idealGainDB * x * x;
+        }
+        /*  determine target level  */
+        const targetOut = thresholdDB + (levelDB - thresholdDB) / ratio;
+        /*  return gain difference  */
+        return targetOut - levelDB;
+    }
+    /*  update envelope (smoothed amplitude contour) for single channel  */
+    updateEnvelopeForChannel(chan, samples, attack, release) {
+        /*  fetch old envelope value  */
+        if (this.env[chan] === undefined)
+            this.env[chan] = 1e-12;
+        let env = this.env[chan];
+        /*  calculate attack/release alpha values  */
+        const alphaA = Math.exp(-1 / (attack * this.sampleRate));
+        const alphaR = Math.exp(-1 / (release * this.sampleRate));
+        /*  iterate over all samples and calculate RMS  */
+        for (const s of samples) {
+            const x = Math.abs(s);
+            const det = x * x;
+            if (det > env)
+                env = alphaA * env + (1 - alphaA) * det;
+            else
+                env = alphaR * env + (1 - alphaR) * det;
+        }
+        this.env[chan] = Math.sqrt(Math.max(env, 1e-12));
+    }
+    /*  process a single sample frame  */
+    process(inputs, outputs, parameters) {
+        /*  sanity check  */
+        const input = inputs[0];
+        const output = outputs[0];
+        if (!input || input.length === 0 || !output)
+            return true;
+        /*  determine number of channels  */
+        const nCh = input.length;
+        /*  initially just copy input to output (pass-through)  */
+        for (let c = 0; c < output.length; c++) {
+            if (!output[c] || !input[c])
+                continue;
+            output[c].set(input[c]);
+        }
+        /*  fetch parameters  */
+        const thresholdDB = parameters["threshold"][0];
+        const ratio = parameters["ratio"][0];
+        const kneeDB = parameters["knee"][0];
+        const attackS = Math.max(parameters["attack"][0], 1 / this.sampleRate);
+        const releaseS = Math.max(parameters["release"][0], 1 / this.sampleRate);
+        const makeupDB = parameters["makeup"][0];
+        /*  update envelope per channel  */
+        for (let ch = 0; ch < nCh; ch++)
+            this.updateEnvelopeForChannel(ch, input[ch], attackS, releaseS);
+        /*  determine linear value from decibel makeup value */
+        const makeUpLin = utils.dB2lin(makeupDB);
+        /*  iterate over all channels  */
+        this.reduction = 0;
+        for (let ch = 0; ch < nCh; ch++) {
+            const levelDB = utils.lin2dB(this.env[ch]);
+            const gainDB = this.gainDBFor(levelDB, thresholdDB, ratio, kneeDB);
+            const gainLin = utils.dB2lin(gainDB) * makeUpLin;
+            /*  on first channel, calculate reduction  */
+            if (ch === 0)
+                this.reduction = Math.min(0, gainDB);
+            /*  apply gain change to channel  */
+            const inp = input[ch];
+            const out = output[ch];
+            for (let i = 0; i < inp.length; i++)
+                out[i] = inp[i] * gainLin;
+        }
+        return true;
+    }
+}
+/*  register the new audio nodes  */
+registerProcessor("compressor", CompressorProcessor);
+//# sourceMappingURL=speechflow-node-a2a-compressor-wt.js.map

package/speechflow-cli/dst/speechflow-node-a2a-compressor-wt.js.map ADDED Viewed

@@ -0,0 +1 @@

+ {"version":3,"file":"speechflow-node-a2a-compressor-wt.js","sourceRoot":"","sources":["../src/speechflow-node-a2a-compressor-wt.ts"],"names":[],"mappings":";AAAA;;;;EAIE;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;AAEF,0DAA2C;AAE3C,0CAA0C;AAC1C,MAAM,mBAAoB,SAAQ,qBAAqB;IACnD,sBAAsB;IACd,GAAG,GAAa,EAAE,CAAA;IAClB,UAAU,CAAQ;IACnB,SAAS,GAAG,CAAC,CAAA;IAEpB,2BAA2B;IAC3B,MAAM,KAAK,oBAAoB;QAC3B,OAAO;YACH,EAAE,IAAI,EAAE,WAAW,EAAG,YAAY,EAAE,CAAC,EAAE,EAAI,QAAQ,EAAE,CAAC,GAAG,EAAI,QAAQ,EAAE,CAAC,EAAI,cAAc,EAAE,QAAQ,EAAE,EAAE,OAAO;YAC/G,EAAE,IAAI,EAAE,OAAO,EAAO,YAAY,EAAE,GAAG,EAAI,QAAQ,EAAE,GAAG,EAAK,QAAQ,EAAE,EAAE,EAAG,cAAc,EAAE,QAAQ,EAAE,EAAE,oBAAoB;YAC5H,EAAE,IAAI,EAAE,QAAQ,EAAM,YAAY,EAAE,KAAK,EAAE,QAAQ,EAAE,GAAG,EAAK,QAAQ,EAAE,CAAC,EAAI,cAAc,EAAE,QAAQ,EAAE,EAAE,UAAU;YAClH,EAAE,IAAI,EAAE,SAAS,EAAK,YAAY,EAAE,KAAK,EAAE,QAAQ,EAAE,GAAG,EAAK,QAAQ,EAAE,CAAC,EAAI,cAAc,EAAE,QAAQ,EAAE,EAAE,UAAU;YAClH,EAAE,IAAI,EAAE,MAAM,EAAQ,YAAY,EAAE,GAAG,EAAI,QAAQ,EAAE,GAAG,EAAK,QAAQ,EAAE,EAAE,EAAG,cAAc,EAAE,QAAQ,EAAE,EAAE,KAAK;YAC7G,EAAE,IAAI,EAAE,QAAQ,EAAM,YAAY,EAAE,GAAG,EAAI,QAAQ,EAAE,CAAC,EAAE,EAAK,QAAQ,EAAE,EAAE,EAAG,cAAc,EAAE,QAAQ,EAAE,CAAE,KAAK;SAChH,CAAA;IACL,CAAC;IAED,sDAAsD;IACtD,YAAa,OAAY;QACrB,KAAK,EAAE,CAAA;QACP,MAAM,EAAE,UAAU,EAAE,GAAG,OAAO,CAAC,gBAAgB,CAAA;QAC/C,IAAI,CAAC,UAAU,GAAG,UAAoB,CAAA;IAC1C,CAAC;IAED,iCAAiC;IACzB,SAAS,CAAE,OAAe,EAAE,WAAmB,EAAE,KAAa,EAAE,MAAc;QAClF,4CAA4C;QAC5C,IAAI,KAAK,IAAI,GAAG;YACZ,OAAO,CAAC,CAAA;QAEZ,4BAA4B;QAC5B,MAAM,QAAQ,GAAI,MAAM,GAAG,GAAG,CAAA;QAC9B,MAAM,QAAQ,GAAI,OAAO,GAAG,WAAW,CAAA;QACvC,MAAM,SAAS,GAAG,OAAO,IAAI,CAAC,WAAW,GAAG,QAAQ,CAAC,CAAA;QAErD,0DAA0D;QAC1D,IAAI,QAAQ;YACR,OAAO,CAAC,CAAA;QAEZ,uBAAuB;QACvB,IAAI,MAAM,GAAG,CAAC,IAAI,CAAC,SAAS,EAAE,CAAC;YAC3B,MAAM,CAAC,GAAG,CAAC,OAAO,GAAG,WAAW,CAAC,GAAG,MAAM,CAAA;YAC1C,MAAM,WAAW,GAAG,CAAC,WAAW,GAAG,CAAC,OAAO,GAAG,WAAW,CAAC,GAAG,KAAK,CAAC,GAAG,OAAO,CAAA;YAC7E,OAAO,WAAW,GAAG,CAAC,GAAG,CAAC,CAAA;QAC9B,CAAC;QAED,8BAA8B;QAC9B,MAAM,SAAS,GAAG,WAAW,GAAG,CAAC,OAAO,GAAG,WAAW,CAAC,GAAG,KAAK,CAAA;QAE/D,8BAA8B;QAC9B,OAAO,SAAS,GAAG,OAAO,CAAA;IAC9B,CAAC;IAED,uEAAuE;IAC/D,wBAAwB,CAC5B,IAAsB,EACtB,OAA4B,EAC5B,MAAsB,EACtB,OAAsB;QAEtB,gCAAgC;QAChC,IAAI,IAAI,CAAC,GAAG,CAAC,IAAI,CAAC,KAAK,SAAS;YAC5B,IAAI,CAAC,GAAG,CAAC,IAAI,CAAC,GAAG,KAAK,CAAA;QAC1B,IAAI,GAAG,GAAG,IAAI,CAAC,GAAG,CAAC,IAAI,CAAC,CAAA;QAExB,6CAA6C;QAC7C,MAAM,MAAM,GAAG,IAAI,CAAC,GAAG,CAAC,CAAC,CAAC,GAAG,CAAC,MAAM,GAAI,IAAI,CAAC,UAAU,CAAC,CAAC,CAAA;QACzD,MAAM,MAAM,GAAG,IAAI,CAAC,GAAG,CAAC,CAAC,CAAC,GAAG,CAAC,OAAO,GAAG,IAAI,CAAC,UAAU,CAAC,CAAC,CAAA;QAEzD,kDAAkD;QAClD,KAAK,MAAM,CAAC,IAAI,OAAO,EAAE,CAAC;YACtB,MAAM,CAAC,GAAG,IAAI,CAAC,GAAG,CAAC,CAAC,CAAC,CAAA;YACrB,MAAM,GAAG,GAAG,CAAC,GAAG,CAAC,CAAA;YACjB,IAAI,GAAG,GAAG,GAAG;gBACT,GAAG,GAAG,MAAM,GAAG,GAAG,GAAG,CAAC,CAAC,GAAG,MAAM,CAAC,GAAG,GAAG,CAAA;;gBAEvC,GAAG,GAAG,MAAM,GAAG,GAAG,GAAG,CAAC,CAAC,GAAG,MAAM,CAAC,GAAG,GAAG,CAAA;QAC/C,CAAC;QACD,IAAI,CAAC,GAAG,CAAC,IAAI,CAAC,GAAG,IAAI,CAAC,IAAI,CAAC,IAAI,CAAC,GAAG,CAAC,GAAG,EAAE,KAAK,CAAC,CAAC,CAAA;IACpD,CAAC;IAED,qCAAqC;IACrC,OAAO,CACH,MAA4B,EAC5B,OAA4B,EAC5B,UAAwC;QAExC,oBAAoB;QACpB,MAAM,KAAK,GAAI,MAAM,CAAC,CAAC,CAAC,CAAA;QACxB,MAAM,MAAM,GAAG,OAAO,CAAC,CAAC,CAAC,CAAA;QACzB,IAAI,CAAC,KAAK,IAAI,KAAK,CAAC,MAAM,KAAK,CAAC,IAAI,CAAC,MAAM;YACvC,OAAO,IAAI,CAAA;QAEf,oCAAoC;QACpC,MAAM,GAAG,GAAG,KAAK,CAAC,MAAM,CAAA;QAExB,0DAA0D;QAC1D,KAAK,IAAI,CAAC,GAAG,CAAC,EAAE,CAAC,GAAG,MAAM,CAAC,MAAM,EAAE,CAAC,EAAE,EAAE,CAAC;YACrC,IAAI,CAAC,MAAM,CAAC,CAAC,CAAC,IAAI,CAAC,KAAK,CAAC,CAAC,CAAC;gBACvB,SAAQ;YACZ,MAAM,CAAC,CAAC,CAAC,CAAC,GAAG,CAAC,KAAK,CAAC,CAAC,CAAC,CAAC,CAAA;QAC3B,CAAC;QAED,wBAAwB;QACxB,MAAM,WAAW,GAAG,UAAU,CAAC,WAAW,CAAC,CAAC,CAAC,CAAC,CAAA;QAC9C,MAAM,KAAK,GAAS,UAAU,CAAC,OAAO,CAAC,CAAC,CAAC,CAAC,CAAA;QAC1C,MAAM,MAAM,GAAQ,UAAU,CAAC,MAAM,CAAC,CAAC,CAAC,CAAC,CAAA;QACzC,MAAM,OAAO,GAAO,IAAI,CAAC,GAAG,CAAC,UAAU,CAAC,QAAQ,CAAC,CAAC,CAAC,CAAC,EAAG,CAAC,GAAG,IAAI,CAAC,UAAU,CAAC,CAAA;QAC3E,MAAM,QAAQ,GAAM,IAAI,CAAC,GAAG,CAAC,UAAU,CAAC,SAAS,CAAC,CAAC,CAAC,CAAC,EAAE,CAAC,GAAG,IAAI,CAAC,UAAU,CAAC,CAAA;QAC3E,MAAM,QAAQ,GAAM,UAAU,CAAC,QAAQ,CAAC,CAAC,CAAC,CAAC,CAAA;QAE3C,mCAAmC;QACnC,KAAK,IAAI,EAAE,GAAG,CAAC,EAAE,EAAE,GAAG,GAAG,EAAE,EAAE,EAAE;YAC3B,IAAI,CAAC,wBAAwB,CAAC,EAAE,EAAE,KAAK,CAAC,EAAE,CAAC,EAAE,OAAO,EAAE,QAAQ,CAAC,CAAA;QAEnE,uDAAuD;QACvD,MAAM,SAAS,GAAG,KAAK,CAAC,MAAM,CAAC,QAAQ,CAAC,CAAA;QAExC,iCAAiC;QACjC,IAAI,CAAC,SAAS,GAAG,CAAC,CAAA;QAClB,KAAK,IAAI,EAAE,GAAG,CAAC,EAAE,EAAE,GAAG,GAAG,EAAE,EAAE,EAAE,EAAE,CAAC;YAC9B,MAAM,OAAO,GAAG,KAAK,CAAC,MAAM,CAAC,IAAI,CAAC,GAAG,CAAC,EAAE,CAAC,CAAC,CAAA;YAC1C,MAAM,MAAM,GAAI,IAAI,CAAC,SAAS,CAAC,OAAO,EAAE,WAAW,EAAE,KAAK,EAAE,MAAM,CAAC,CAAA;YACnE,MAAM,OAAO,GAAG,KAAK,CAAC,MAAM,CAAC,MAAM,CAAC,GAAG,SAAS,CAAA;YAEhD,6CAA6C;YAC7C,IAAI,EAAE,KAAK,CAAC;gBACR,IAAI,CAAC,SAAS,GAAG,IAAI,CAAC,GAAG,CAAC,CAAC,EAAE,MAAM,CAAC,CAAA;YAExC,oCAAoC;YACpC,MAAM,GAAG,GAAG,KAAK,CAAC,EAAE,CAAC,CAAA;YACrB,MAAM,GAAG,GAAG,MAAM,CAAC,EAAE,CAAC,CAAA;YACtB,KAAK,IAAI,CAAC,GAAG,CAAC,EAAE,CAAC,GAAG,GAAG,CAAC,MAAM,EAAE,CAAC,EAAE;gBAC/B,GAAG,CAAC,CAAC,CAAC,GAAG,GAAG,CAAC,CAAC,CAAC,GAAG,OAAO,CAAA;QACjC,CAAC;QACD,OAAO,IAAI,CAAA;IACf,CAAC;CACJ;AAED,oCAAoC;AACpC,iBAAiB,CAAC,YAAY,EAAE,mBAAmB,CAAC,CAAA"}

package/speechflow-cli/dst/speechflow-node-a2a-compressor.d.ts ADDED Viewed

@@ -0,0 +1,15 @@
+import SpeechFlowNode from "./speechflow-node";
+export default class SpeechFlowNodeCompressor extends SpeechFlowNode {
+    static name: string;
+    private destroyed;
+    private compressor;
+    private bus;
+    private intervalId;
+    constructor(id: string, cfg: {
+        [id: string]: any;
+    }, opts: {
+        [id: string]: any;
+    }, args: any[]);
+    open(): Promise<void>;
+    close(): Promise<void>;
+}