whisper-coreml 0.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/LICENSE ADDED
@@ -0,0 +1,22 @@
+ MIT License
+
+ Copyright (c) 2026 Sebastian Werner
+
+ Permission is hereby granted, free of charge, to any person obtaining a copy
+ of this software and associated documentation files (the "Software"), to deal
+ in the Software without restriction, including without limitation the rights
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+ copies of the Software, and to permit persons to whom the Software is
+ furnished to do so, subject to the following conditions:
+
+ The above copyright notice and this permission notice shall be included in all
+ copies or substantial portions of the Software.
+
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+ SOFTWARE.
+
package/README.md ADDED
@@ -0,0 +1,290 @@
+ # whisper-coreml
+
+ <p align="center">
+ <img src="logo.svg" alt="whisper-coreml" width="128" height="128">
+ </p>
+
+ <p align="center">
+ <strong>OpenAI Whisper ASR for Node.js with CoreML/ANE acceleration on Apple Silicon</strong>
+ </p>
+
+ <p align="center">
+ <a href="https://github.com/sebastian-software/whisper-coreml/actions/workflows/ci.yml"><img src="https://github.com/sebastian-software/whisper-coreml/actions/workflows/ci.yml/badge.svg" alt="CI"></a>
+ <a href="https://www.npmjs.com/package/whisper-coreml"><img src="https://img.shields.io/npm/v/whisper-coreml.svg" alt="npm version"></a>
+ <a href="https://www.npmjs.com/package/whisper-coreml"><img src="https://img.shields.io/npm/dm/whisper-coreml.svg" alt="npm downloads"></a>
+ <br>
+ <a href="https://www.typescriptlang.org/"><img src="https://img.shields.io/badge/TypeScript-5.x-blue.svg" alt="TypeScript"></a>
+ <a href="https://nodejs.org/"><img src="https://img.shields.io/badge/Node.js-20+-green.svg" alt="Node.js"></a>
+ <a href="https://opensource.org/licenses/MIT"><img src="https://img.shields.io/badge/License-MIT-yellow.svg" alt="License: MIT"></a>
+ </p>
+
+ Powered by [whisper.cpp](https://github.com/ggerganov/whisper.cpp) running on Apple's Neural Engine
+ via CoreML.
+
+ ## Why whisper-coreml?
+
+ When you need **higher transcription quality** than
+ [parakeet-coreml](https://github.com/sebastian-software/parakeet-coreml) can provide, Whisper's
+ large-v3-turbo model delivers. It offers:
+
+ - **Support for 99 languages** vs Parakeet's 25 European languages
+ - **Better accuracy** on challenging audio (accents, background noise)
+ - **Translation capability** (any language → English)
+ - **Word-level confidence scores**
+
+ ### When to Use Which
+
+ | Use Case                            | Recommended                                                              |
+ | ----------------------------------- | ------------------------------------------------------------------------ |
+ | Fast transcription, major languages | [parakeet-coreml](https://github.com/sebastian-software/parakeet-coreml) |
+ | Maximum accuracy, any language      | **whisper-coreml**                                                       |
+ | Translation to English              | **whisper-coreml**                                                       |
+ | Edge cases (accents, noise)         | **whisper-coreml**                                                       |
+
+ ## Features
+
+ - šŸŽÆ **99 Languages** – Full Whisper multilingual support
+ - šŸš€ **14x real-time** – Transcribe 1 hour of audio in ~4.5 minutes (M1 Ultra, measured)
+ - šŸŽ **Neural Engine Acceleration** – Runs on Apple's dedicated ML silicon via CoreML
+ - šŸ”’ **Fully Offline** – All processing happens locally
+ - šŸ“¦ **Zero Runtime Dependencies** – No Python, no subprocess
+ - šŸ“ **Timestamps** – Segment-level timing for subtitles
+ - šŸ”„ **Translation** – Translate any language to English
+ - ā¬‡ļø **Easy Setup** – Single CLI command to download the model
+
+ ## Performance
+
+ The CoreML encoder runs on Apple's Neural Engine for accelerated inference:
+
+ **Measured: M1 Ultra**
+
+ ```
+ 5 minutes of audio → 22.5 seconds
+ Speed: 14x real-time
+ 1 hour of audio in ~4.5 minutes
+ ```
+
+ Run your own benchmark:
+
+ ```bash
+ git clone https://github.com/sebastian-software/whisper-coreml
+ cd whisper-coreml && npm install && npm run benchmark
+ ```
+
+ ### Comparison with parakeet-coreml
+
+ | Metric           | whisper-coreml | parakeet-coreml |
+ | ---------------- | -------------- | --------------- |
+ | Speed (M1 Ultra) | 14x real-time  | 40x real-time   |
+ | Languages        | 99             | 25 European     |
+ | Translation      | āœ… Yes         | āŒ No           |
+ | Accuracy (WER)   | Lower (better) | Higher          |
+ | Model Size       | ~3 GB          | ~1.5 GB         |
+
+ **When to choose whisper-coreml:** Maximum accuracy, rare languages, translation, challenging audio.
+
+ **When to choose parakeet-coreml:** Maximum speed, major languages only.
+
+ ## Requirements
+
+ - macOS 14.0+ (Sonoma or later)
+ - Apple Silicon (M1, M2, M3, M4 – any variant)
+ - Node.js 20+
+
+ ## Installation
+
+ ```bash
+ npm install whisper-coreml
+ ```
+
+ ### Download the Model
+
+ ```bash
+ npx whisper-coreml download
+ ```
+
+ This downloads the **large-v3-turbo** model (~1.5 GB) – the only model we support, as it offers the
+ best speed/quality ratio.
+
+ ## Quick Start
+
+ ```typescript
+ import { WhisperAsrEngine, getModelPath } from "whisper-coreml"
+
+ const engine = new WhisperAsrEngine({
+   modelPath: getModelPath()
+ })
+
+ await engine.initialize()
+
+ // Transcribe audio (16kHz, mono, Float32Array)
+ const result = await engine.transcribe(audioSamples, 16000)
+
+ console.log(result.text)
+ // "Hello, this is a test transcription."
+
+ console.log(`Language: ${result.language}`)
+ console.log(`Processed in ${result.durationMs}ms`)
+
+ // Segments include timestamps
+ for (const seg of result.segments) {
+   console.log(`[${seg.startMs}ms - ${seg.endMs}ms] ${seg.text}`)
+ }
+
+ engine.cleanup()
+ ```
+
+ ## Audio Format
+
+ | Property    | Requirement                                 |
+ | ----------- | ------------------------------------------- |
+ | Sample Rate | **16,000 Hz** (16 kHz)                      |
+ | Channels    | **Mono** (single channel)                   |
+ | Format      | **Float32Array** with values in [-1.0, 1.0] |
+ | Duration    | **Any length** (auto-chunked internally)    |
+
+ ### Converting Audio Files
+
+ Example with ffmpeg (`-ar 16000` = 16 kHz, `-ac 1` = mono, `-f f32le` = raw 32-bit float PCM):
+
+ ```bash
+ ffmpeg -i input.mp3 -ar 16000 -ac 1 -f f32le output.pcm
+ ```
+
+ Then load the raw PCM file:
+
+ ```typescript
+ import { readFileSync } from "fs"
+
+ const buffer = readFileSync("output.pcm")
+ const samples = new Float32Array(buffer.buffer, buffer.byteOffset, buffer.length / 4)
+ ```
+
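ffmpeg's `f32le` output matches the required format directly, but many sources (WAV files, Web Audio captures) deliver 16-bit integer PCM, often stereo. A minimal sketch of that conversion, assuming interleaved 16-bit stereo input; the helper name is ours, not a package export:

```typescript
// Convert interleaved 16-bit stereo PCM to the mono Float32Array
// (values in [-1.0, 1.0]) that the engine expects.
export function int16StereoToFloat32Mono(pcm: Int16Array): Float32Array {
  const frames = Math.floor(pcm.length / 2)
  const mono = new Float32Array(frames)
  for (let i = 0; i < frames; i++) {
    // Scale each channel to [-1.0, 1.0), then average the stereo pair.
    const left = pcm[2 * i] / 32768
    const right = pcm[2 * i + 1] / 32768
    mono[i] = (left + right) / 2
  }
  return mono
}
```

Note this only changes the sample format, not the rate – resample to 16 kHz first (e.g. ffmpeg's `-ar 16000`).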
+ ## CLI Commands
+
+ ```bash
+ # Download the model (~1.5 GB)
+ npx whisper-coreml download
+
+ # Check status
+ npx whisper-coreml status
+
+ # Run benchmark (requires cloned repo)
+ npx whisper-coreml benchmark
+
+ # Get model directory path
+ npx whisper-coreml path
+ ```
+
+ ## API Reference
+
+ ### `WhisperAsrEngine`
+
+ The main class for speech recognition.
+
+ ```typescript
+ new WhisperAsrEngine(options: WhisperAsrOptions)
+ ```
+
+ #### Options
+
+ | Option      | Type      | Default  | Description                         |
+ | ----------- | --------- | -------- | ----------------------------------- |
+ | `modelPath` | `string`  | required | Path to ggml model file             |
+ | `language`  | `string`  | `"auto"` | Language code or `"auto"` to detect |
+ | `translate` | `boolean` | `false`  | Translate to English                |
+ | `threads`   | `number`  | `0`      | CPU threads (0 = auto)              |
+
+ #### Methods
+
+ | Method                      | Description                    |
+ | --------------------------- | ------------------------------ |
+ | `initialize()`              | Load model (async)             |
+ | `transcribe(samples, rate)` | Transcribe audio               |
+ | `isReady()`                 | Check if engine is initialized |
+ | `cleanup()`                 | Release native resources       |
+ | `getVersion()`              | Get version information        |
+
+ ### `TranscriptionResult`
+
+ ```typescript
+ interface TranscriptionResult {
+   text: string // Full transcription
+   language: string // Detected language (ISO code)
+   durationMs: number // Processing time in milliseconds
+   segments: TranscriptionSegment[]
+ }
+
+ interface TranscriptionSegment {
+   startMs: number // Segment start in milliseconds
+   endMs: number // Segment end in milliseconds
+   text: string // Transcription for this segment
+   confidence: number // Confidence score (0-1)
+ }
+ ```
+
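Segment timestamps map directly onto subtitle formats. As an illustration (the `toSrt` helper below is ours, not a package export), `TranscriptionSegment`-shaped objects can be rendered as SRT like this:

```typescript
interface TranscriptionSegment {
  startMs: number
  endMs: number
  text: string
  confidence: number
}

// Format a millisecond offset as an SRT timestamp (HH:MM:SS,mmm).
function srtTime(ms: number): string {
  const pad = (n: number, width = 2) => String(n).padStart(width, "0")
  const hours = Math.floor(ms / 3_600_000)
  const minutes = Math.floor(ms / 60_000) % 60
  const seconds = Math.floor(ms / 1000) % 60
  return `${pad(hours)}:${pad(minutes)}:${pad(seconds)},${pad(ms % 1000, 3)}`
}

// Render segments as an SRT subtitle document.
export function toSrt(segments: TranscriptionSegment[]): string {
  return segments
    .map((seg, i) => `${i + 1}\n${srtTime(seg.startMs)} --> ${srtTime(seg.endMs)}\n${seg.text.trim()}\n`)
    .join("\n")
}
```

Feeding `result.segments` from `transcribe()` into such a helper yields ready-to-use subtitle files.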
+ ### Helper Functions
+
+ | Function               | Description                            |
+ | ---------------------- | -------------------------------------- |
+ | `isAvailable()`        | Check if running on supported platform |
+ | `getDefaultModelDir()` | Get default model cache path           |
+ | `getModelPath()`       | Get path to the model file             |
+ | `isModelDownloaded()`  | Check if model is downloaded           |
+ | `downloadModel()`      | Download the model                     |
+
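The compiled source in this package resolves the model to `~/.cache/whisper-coreml/models/ggml-large-v3-turbo.bin` by default. For illustration only, a sketch that mirrors that layout (in real code, prefer the package's own `getModelPath()`):

```typescript
import { homedir } from "node:os"
import { join } from "node:path"

// Mirrors the default cache layout used by getDefaultModelDir()/getModelPath():
// ~/.cache/whisper-coreml/models/ggml-large-v3-turbo.bin
function defaultModelPath(): string {
  return join(homedir(), ".cache", "whisper-coreml", "models", "ggml-large-v3-turbo.bin")
}
```

A typical first run would check `isModelDownloaded()` and call `downloadModel()` before constructing the engine.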
+ ## Translation
+
+ Translate any language to English:
+
+ ```typescript
+ const engine = new WhisperAsrEngine({
+   modelPath: getModelPath(),
+   language: "de", // German input
+   translate: true // Output in English
+ })
+ ```
+
+ ## Architecture
+
+ ```
+ ┌─────────────────────────────────┐
+ │ Your Node.js App                │
+ ├─────────────────────────────────┤
+ │ whisper-coreml API              │  TypeScript
+ ├─────────────────────────────────┤
+ │ Native Addon                    │  N-API + C++
+ │ (whisper_engine)                │
+ ├─────────────────────────────────┤
+ │ whisper.cpp                     │  C++
+ ├─────────────────────────────────┤
+ │ CoreML                          │  Apple Framework
+ ├─────────────────────────────────┤
+ │ Apple Neural Engine             │  Dedicated ML Silicon
+ └─────────────────────────────────┘
+ ```
+
+ ## Use Cases
+
+ - **Maximum accuracy** – When Parakeet's quality isn't sufficient
+ - **Rare languages** – Languages not supported by Parakeet
+ - **Translation** – Convert foreign speech to English text
+ - **Accented speech** – Whisper handles accents better
+ - **Noisy audio** – More robust to background noise
+
+ ## Contributing
+
+ Contributions are welcome! Please read our [Contributing Guide](CONTRIBUTING.md) for details.
+
+ ## License
+
+ MIT – see [LICENSE](LICENSE) for details.
+
+ ## Credits
+
+ - [whisper.cpp](https://github.com/ggerganov/whisper.cpp) by Georgi Gerganov
+ - [OpenAI Whisper](https://github.com/openai/whisper) by OpenAI
+
+ ---
+
+ Copyright © 2026 [Sebastian Software GmbH](https://sebastian-software.de), Mainz, Germany
Binary file
@@ -0,0 +1,216 @@
+ var __require = /* @__PURE__ */ ((x) => typeof require !== "undefined" ? require : typeof Proxy !== "undefined" ? new Proxy(x, {
+   get: (a, b) => (typeof require !== "undefined" ? require : a)[b]
+ }) : x)(function(x) {
+   if (typeof require !== "undefined") return require.apply(this, arguments);
+   throw Error('Dynamic require of "' + x + '" is not supported');
+ });
+
+ // src/download.ts
+ import { existsSync, mkdirSync, writeFileSync, rmSync } from "fs";
+ import { homedir } from "os";
+ import { join, dirname } from "path";
+ var WHISPER_MODEL = {
+   name: "large-v3-turbo",
+   size: "1.5 GB",
+   languages: "99 languages",
+   url: "https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-large-v3-turbo.bin"
+ };
+ function getDefaultModelDir() {
+   return join(homedir(), ".cache", "whisper-coreml", "models");
+ }
+ function getModelPath(modelDir) {
+   const dir = modelDir ?? getDefaultModelDir();
+   return join(dir, `ggml-${WHISPER_MODEL.name}.bin`);
+ }
+ function isModelDownloaded(modelDir) {
+   const modelPath = getModelPath(modelDir);
+   return existsSync(modelPath);
+ }
+ async function downloadModel(options = {}) {
+   const modelDir = options.modelDir ?? getDefaultModelDir();
+   const modelPath = getModelPath(modelDir);
+   if (!options.force && existsSync(modelPath)) {
+     return modelPath;
+   }
+   if (existsSync(modelPath)) {
+     rmSync(modelPath);
+   }
+   mkdirSync(dirname(modelPath), { recursive: true });
+   console.log(`Downloading Whisper ${WHISPER_MODEL.name} (${WHISPER_MODEL.size})...`);
+   console.log(`Source: ${WHISPER_MODEL.url}`);
+   console.log(`Target: ${modelPath}`);
+   const response = await fetch(WHISPER_MODEL.url);
+   if (!response.ok) {
+     throw new Error(`Failed to download model: ${response.statusText}`);
+   }
+   const contentLength = response.headers.get("content-length");
+   const totalBytes = contentLength ? parseInt(contentLength, 10) : 0;
+   const reader = response.body?.getReader();
+   if (!reader) {
+     throw new Error("Failed to get response body reader");
+   }
+   const chunks = [];
+   let downloadedBytes = 0;
+   while (true) {
+     const result = await reader.read();
+     if (result.done) {
+       break;
+     }
+     const chunk = result.value;
+     chunks.push(chunk);
+     downloadedBytes += chunk.length;
+     const percent = totalBytes > 0 ? Math.round(downloadedBytes / totalBytes * 100) : 0;
+     if (options.onProgress) {
+       options.onProgress({
+         downloadedBytes,
+         totalBytes,
+         percent
+       });
+     }
+     process.stdout.write(
+       `\rProgress: ${String(percent)}% (${formatBytes(downloadedBytes)}/${formatBytes(totalBytes)})`
+     );
+   }
+   const buffer = Buffer.concat(chunks);
+   writeFileSync(modelPath, buffer);
+   console.log("\n\u2713 Model downloaded successfully!");
+   return modelPath;
+ }
+ function formatBytes(bytes) {
+   if (bytes < 1024) {
+     return `${String(bytes)} B`;
+   }
+   if (bytes < 1024 * 1024) {
+     return `${(bytes / 1024).toFixed(1)} KB`;
+   }
+   if (bytes < 1024 * 1024 * 1024) {
+     return `${(bytes / 1024 / 1024).toFixed(1)} MB`;
+   }
+   return `${(bytes / 1024 / 1024 / 1024).toFixed(2)} GB`;
+ }
+
+ // src/index.ts
+ var bindingsModule = __require("bindings");
+ function loadAddon() {
+   if (process.platform !== "darwin") {
+     throw new Error("whisper-coreml is only supported on macOS");
+   }
+   try {
+     return bindingsModule("whisper_asr");
+   } catch (error) {
+     const message = error instanceof Error ? error.message : String(error);
+     throw new Error(`Failed to load Whisper ASR native addon: ${message}`);
+   }
+ }
+ var addon = null;
+ var loadError = null;
+ function getAddon() {
+   if (!addon) {
+     try {
+       addon = loadAddon();
+     } catch (error) {
+       loadError = error instanceof Error ? error : new Error(String(error));
+       throw error;
+     }
+   }
+   return addon;
+ }
+ function isAvailable() {
+   return process.platform === "darwin" && process.arch === "arm64";
+ }
+ function getLoadError() {
+   return loadError;
+ }
+ var WhisperAsrEngine = class {
+   options;
+   initialized = false;
+   constructor(options) {
+     this.options = options;
+   }
+   /* v8 ignore start - native addon calls, tested via E2E */
+   /**
+    * Initialize the Whisper engine
+    * This loads the model into memory - may take a few seconds.
+    */
+   initialize() {
+     if (this.initialized) {
+       return Promise.resolve();
+     }
+     const nativeAddon = getAddon();
+     const success = nativeAddon.initialize({
+       modelPath: this.options.modelPath,
+       language: this.options.language ?? "auto",
+       translate: this.options.translate ?? false,
+       threads: this.options.threads ?? 0
+     });
+     if (!success) {
+       return Promise.reject(new Error("Failed to initialize Whisper engine"));
+     }
+     this.initialized = true;
+     return Promise.resolve();
+   }
+   /**
+    * Check if the engine is ready for transcription
+    */
+   isReady() {
+     if (!this.initialized) {
+       return false;
+     }
+     try {
+       return getAddon().isInitialized();
+     } catch {
+       return false;
+     }
+   }
+   /**
+    * Transcribe audio samples
+    *
+    * @param samples - Float32Array of audio samples (mono, 16kHz)
+    * @param sampleRate - Sample rate in Hz (default: 16000)
+    * @returns Transcription result with text and segments
+    */
+   transcribe(samples, sampleRate = 16e3) {
+     if (!this.initialized) {
+       return Promise.reject(new Error("Whisper engine not initialized. Call initialize() first."));
+     }
+     const result = getAddon().transcribe(samples, sampleRate);
+     return Promise.resolve({
+       text: result.text,
+       language: result.language,
+       durationMs: result.durationMs,
+       segments: result.segments
+     });
+   }
+   /**
+    * Clean up resources and unload the model
+    */
+   cleanup() {
+     if (this.initialized) {
+       try {
+         getAddon().cleanup();
+       } catch {
+       }
+       this.initialized = false;
+     }
+   }
+   /**
+    * Get version information
+    */
+   getVersion() {
+     return getAddon().getVersion();
+   }
+   /* v8 ignore stop */
+ };
+
+ export {
+   WHISPER_MODEL,
+   getDefaultModelDir,
+   getModelPath,
+   isModelDownloaded,
+   downloadModel,
+   formatBytes,
+   isAvailable,
+   getLoadError,
+   WhisperAsrEngine
+ };
+ //# sourceMappingURL=chunk-MOQMN4DX.js.map
@@ -0,0 +1 @@
+ {"version":3,"sources":["../src/download.ts","../src/index.ts"],"sourcesContent":["/**\n * Model download functionality for whisper-coreml\n *\n * Note: We only support large-v3-turbo as it's the only Whisper model\n * that offers better quality than Parakeet while maintaining reasonable speed.\n */\n\nimport { existsSync, mkdirSync, writeFileSync, rmSync } from \"node:fs\"\nimport { homedir } from \"node:os\"\nimport { join, dirname } from \"node:path\"\n\n/**\n * Whisper large-v3-turbo model info\n * This is the only model we support as it offers the best speed/quality ratio\n * and is the main reason to choose Whisper over Parakeet.\n */\nexport const WHISPER_MODEL = {\n name: \"large-v3-turbo\",\n size: \"1.5 GB\",\n languages: \"99 languages\",\n url: \"https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-large-v3-turbo.bin\"\n} as const\n\n/**\n * Default model directory in user's cache\n */\nexport function getDefaultModelDir(): string {\n return join(homedir(), \".cache\", \"whisper-coreml\", \"models\")\n}\n\n/**\n * Get the path to the model\n */\nexport function getModelPath(modelDir?: string): string {\n const dir = modelDir ?? 
getDefaultModelDir()\n return join(dir, `ggml-${WHISPER_MODEL.name}.bin`)\n}\n\n/**\n * Check if the model is downloaded\n */\nexport function isModelDownloaded(modelDir?: string): boolean {\n const modelPath = getModelPath(modelDir)\n return existsSync(modelPath)\n}\n\ninterface DownloadProgress {\n downloadedBytes: number\n totalBytes: number\n percent: number\n}\n\nexport interface DownloadOptions {\n /** Target directory for model (default: ~/.cache/whisper-coreml/models) */\n modelDir?: string\n\n /** Progress callback */\n onProgress?: (progress: DownloadProgress) => void\n\n /** Force re-download even if model exists */\n force?: boolean\n}\n\n/* v8 ignore start - network I/O */\n\n/**\n * Download the Whisper large-v3-turbo model from Hugging Face\n */\nexport async function downloadModel(options: DownloadOptions = {}): Promise<string> {\n const modelDir = options.modelDir ?? getDefaultModelDir()\n const modelPath = getModelPath(modelDir)\n\n if (!options.force && existsSync(modelPath)) {\n return modelPath\n }\n\n // Clean up partial downloads\n if (existsSync(modelPath)) {\n rmSync(modelPath)\n }\n\n mkdirSync(dirname(modelPath), { recursive: true })\n\n console.log(`Downloading Whisper ${WHISPER_MODEL.name} (${WHISPER_MODEL.size})...`)\n console.log(`Source: ${WHISPER_MODEL.url}`)\n console.log(`Target: ${modelPath}`)\n\n const response = await fetch(WHISPER_MODEL.url)\n if (!response.ok) {\n throw new Error(`Failed to download model: ${response.statusText}`)\n }\n\n const contentLength = response.headers.get(\"content-length\")\n const totalBytes = contentLength ? 
parseInt(contentLength, 10) : 0\n\n const reader = response.body?.getReader()\n if (!reader) {\n throw new Error(\"Failed to get response body reader\")\n }\n\n const chunks: Uint8Array[] = []\n let downloadedBytes = 0\n\n // eslint-disable-next-line @typescript-eslint/no-unnecessary-condition\n while (true) {\n const result = await reader.read()\n if (result.done) {\n break\n }\n\n const chunk = result.value as Uint8Array\n chunks.push(chunk)\n downloadedBytes += chunk.length\n\n const percent = totalBytes > 0 ? Math.round((downloadedBytes / totalBytes) * 100) : 0\n\n if (options.onProgress) {\n options.onProgress({\n downloadedBytes,\n totalBytes,\n percent\n })\n }\n\n // Progress indicator\n process.stdout.write(\n `\\rProgress: ${String(percent)}% (${formatBytes(downloadedBytes)}/${formatBytes(totalBytes)})`\n )\n }\n\n // Combine chunks and write to file\n const buffer = Buffer.concat(chunks)\n writeFileSync(modelPath, buffer)\n\n console.log(\"\\nāœ“ Model downloaded successfully!\")\n return modelPath\n}\n\n/* v8 ignore stop */\n\n/**\n * Format bytes to human readable string\n * @internal Exported for testing\n */\nexport function formatBytes(bytes: number): string {\n if (bytes < 1024) {\n return `${String(bytes)} B`\n }\n if (bytes < 1024 * 1024) {\n return `${(bytes / 1024).toFixed(1)} KB`\n }\n if (bytes < 1024 * 1024 * 1024) {\n return `${(bytes / 1024 / 1024).toFixed(1)} MB`\n }\n return `${(bytes / 1024 / 1024 / 1024).toFixed(2)} GB`\n}\n","/**\n * whisper-coreml\n *\n * OpenAI Whisper ASR for Node.js with CoreML/ANE acceleration on Apple Silicon.\n * Based on whisper.cpp with Apple Neural Engine support.\n *\n * Uses the large-v3-turbo model exclusively, as it offers the best speed/quality\n * ratio and is the main reason to choose Whisper over Parakeet.\n */\n\n// Dynamic require for loading native addon (works in both ESM and CJS)\n// eslint-disable-next-line @typescript-eslint/no-require-imports\nconst bindingsModule = require(\"bindings\") as 
(name: string) => unknown\n\n/**\n * Native addon interface\n */\ninterface NativeAddon {\n initialize(options: {\n modelPath: string\n language?: string\n translate?: boolean\n threads?: number\n }): boolean\n isInitialized(): boolean\n transcribe(samples: Float32Array, sampleRate: number): NativeTranscriptionResult\n cleanup(): void\n getVersion(): { addon: string; whisper: string; coreml: string }\n}\n\ninterface NativeTranscriptionResult {\n text: string\n language: string\n durationMs: number\n segments: {\n startMs: number\n endMs: number\n text: string\n confidence: number\n }[]\n}\n\n/* v8 ignore start - platform checks and native addon loading */\n\n/**\n * Load the native addon\n */\nfunction loadAddon(): NativeAddon {\n if (process.platform !== \"darwin\") {\n throw new Error(\"whisper-coreml is only supported on macOS\")\n }\n\n try {\n return bindingsModule(\"whisper_asr\") as NativeAddon\n } catch (error) {\n const message = error instanceof Error ? error.message : String(error)\n throw new Error(`Failed to load Whisper ASR native addon: ${message}`)\n }\n}\n\n/* v8 ignore stop */\n\nlet addon: NativeAddon | null = null\nlet loadError: Error | null = null\n\nfunction getAddon(): NativeAddon {\n if (!addon) {\n try {\n addon = loadAddon()\n } catch (error) {\n loadError = error instanceof Error ? 
error : new Error(String(error))\n throw error\n }\n }\n return addon\n}\n\n/**\n * Check if Whisper ASR is available on this platform\n */\nexport function isAvailable(): boolean {\n return process.platform === \"darwin\" && process.arch === \"arm64\"\n}\n\n/**\n * Get the load error if the addon failed to load\n */\nexport function getLoadError(): Error | null {\n return loadError\n}\n\n/**\n * Transcription segment with timestamps\n */\nexport interface TranscriptionSegment {\n /** Start time in milliseconds */\n startMs: number\n /** End time in milliseconds */\n endMs: number\n /** Transcribed text for this segment */\n text: string\n /** Confidence score (0-1) */\n confidence: number\n}\n\n/**\n * Transcription result\n */\nexport interface TranscriptionResult {\n /** Full transcribed text */\n text: string\n /** Detected or specified language (ISO code) */\n language: string\n /** Processing time in milliseconds */\n durationMs: number\n /** Individual segments with timestamps */\n segments: TranscriptionSegment[]\n}\n\n/**\n * Whisper ASR engine options\n */\nexport interface WhisperAsrOptions {\n /** Path to the Whisper model file (ggml format) */\n modelPath: string\n /** Language code (e.g., \"en\", \"de\", \"fr\") or \"auto\" for auto-detection */\n language?: string\n /** Translate to English (default: false) */\n translate?: boolean\n /** Number of threads (0 = auto) */\n threads?: number\n}\n\n/**\n * Whisper ASR Engine with CoreML acceleration\n *\n * Uses the large-v3-turbo model for best speed/quality balance.\n *\n * @example\n * ```typescript\n * import { WhisperAsrEngine, getModelPath } from \"whisper-coreml\"\n *\n * const engine = new WhisperAsrEngine({\n * modelPath: getModelPath()\n * })\n *\n * await engine.initialize()\n * const result = await engine.transcribe(audioSamples, 16000)\n * console.log(result.text)\n * ```\n */\nexport class WhisperAsrEngine {\n private options: WhisperAsrOptions\n private initialized = false\n\n 
constructor(options: WhisperAsrOptions) {\n this.options = options\n }\n\n /* v8 ignore start - native addon calls, tested via E2E */\n\n /**\n * Initialize the Whisper engine\n * This loads the model into memory - may take a few seconds.\n */\n initialize(): Promise<void> {\n if (this.initialized) {\n return Promise.resolve()\n }\n\n const nativeAddon = getAddon()\n const success = nativeAddon.initialize({\n modelPath: this.options.modelPath,\n language: this.options.language ?? \"auto\",\n translate: this.options.translate ?? false,\n threads: this.options.threads ?? 0\n })\n\n if (!success) {\n return Promise.reject(new Error(\"Failed to initialize Whisper engine\"))\n }\n\n this.initialized = true\n return Promise.resolve()\n }\n\n /**\n * Check if the engine is ready for transcription\n */\n isReady(): boolean {\n if (!this.initialized) {\n return false\n }\n try {\n return getAddon().isInitialized()\n } catch {\n return false\n }\n }\n\n /**\n * Transcribe audio samples\n *\n * @param samples - Float32Array of audio samples (mono, 16kHz)\n * @param sampleRate - Sample rate in Hz (default: 16000)\n * @returns Transcription result with text and segments\n */\n transcribe(samples: Float32Array, sampleRate = 16000): Promise<TranscriptionResult> {\n if (!this.initialized) {\n return Promise.reject(new Error(\"Whisper engine not initialized. 
Call initialize() first.\"))\n }\n\n const result = getAddon().transcribe(samples, sampleRate)\n\n return Promise.resolve({\n text: result.text,\n language: result.language,\n durationMs: result.durationMs,\n segments: result.segments\n })\n }\n\n /**\n * Clean up resources and unload the model\n */\n cleanup(): void {\n if (this.initialized) {\n try {\n getAddon().cleanup()\n } catch {\n // Ignore cleanup errors\n }\n this.initialized = false\n }\n }\n\n /**\n * Get version information\n */\n getVersion(): { addon: string; whisper: string; coreml: string } {\n return getAddon().getVersion()\n }\n\n /* v8 ignore stop */\n}\n\n// Re-export download utilities\nexport {\n downloadModel,\n formatBytes,\n getDefaultModelDir,\n getModelPath,\n isModelDownloaded,\n WHISPER_MODEL,\n type DownloadOptions\n} from \"./download.js\"\n"],"mappings":";;;;;;;;AAOA,SAAS,YAAY,WAAW,eAAe,cAAc;AAC7D,SAAS,eAAe;AACxB,SAAS,MAAM,eAAe;AAOvB,IAAM,gBAAgB;AAAA,EAC3B,MAAM;AAAA,EACN,MAAM;AAAA,EACN,WAAW;AAAA,EACX,KAAK;AACP;AAKO,SAAS,qBAA6B;AAC3C,SAAO,KAAK,QAAQ,GAAG,UAAU,kBAAkB,QAAQ;AAC7D;AAKO,SAAS,aAAa,UAA2B;AACtD,QAAM,MAAM,YAAY,mBAAmB;AAC3C,SAAO,KAAK,KAAK,QAAQ,cAAc,IAAI,MAAM;AACnD;AAKO,SAAS,kBAAkB,UAA4B;AAC5D,QAAM,YAAY,aAAa,QAAQ;AACvC,SAAO,WAAW,SAAS;AAC7B;AAwBA,eAAsB,cAAc,UAA2B,CAAC,GAAoB;AAClF,QAAM,WAAW,QAAQ,YAAY,mBAAmB;AACxD,QAAM,YAAY,aAAa,QAAQ;AAEvC,MAAI,CAAC,QAAQ,SAAS,WAAW,SAAS,GAAG;AAC3C,WAAO;AAAA,EACT;AAGA,MAAI,WAAW,SAAS,GAAG;AACzB,WAAO,SAAS;AAAA,EAClB;AAEA,YAAU,QAAQ,SAAS,GAAG,EAAE,WAAW,KAAK,CAAC;AAEjD,UAAQ,IAAI,uBAAuB,cAAc,IAAI,KAAK,cAAc,IAAI,MAAM;AAClF,UAAQ,IAAI,WAAW,cAAc,GAAG,EAAE;AAC1C,UAAQ,IAAI,WAAW,SAAS,EAAE;AAElC,QAAM,WAAW,MAAM,MAAM,cAAc,GAAG;AAC9C,MAAI,CAAC,SAAS,IAAI;AAChB,UAAM,IAAI,MAAM,6BAA6B,SAAS,UAAU,EAAE;AAAA,EACpE;AAEA,QAAM,gBAAgB,SAAS,QAAQ,IAAI,gBAAgB;AAC3D,QAAM,aAAa,gBAAgB,SAAS,eAAe,EAAE,IAAI;AAEjE,QAAM,SAAS,SAAS,MAAM,UAAU;AACxC,MAAI,CAAC,QAAQ;AACX,UAAM,IAAI,MAAM,oCAAoC;AAAA,EACtD;AAEA,QAAM,SAAuB,CAAC;AAC9B,MAAI,kBAAkB;AAGtB,SAAO,MAAM;AACX,UAAM,SAAS,MAAM,OAAO,KAAK;AACjC,QAA
I,OAAO,MAAM;AACf;AAAA,IACF;AAEA,UAAM,QAAQ,OAAO;AACrB,WAAO,KAAK,KAAK;AACjB,uBAAmB,MAAM;AAEzB,UAAM,UAAU,aAAa,IAAI,KAAK,MAAO,kBAAkB,aAAc,GAAG,IAAI;AAEpF,QAAI,QAAQ,YAAY;AACtB,cAAQ,WAAW;AAAA,QACjB;AAAA,QACA;AAAA,QACA;AAAA,MACF,CAAC;AAAA,IACH;AAGA,YAAQ,OAAO;AAAA,MACb,eAAe,OAAO,OAAO,CAAC,MAAM,YAAY,eAAe,CAAC,IAAI,YAAY,UAAU,CAAC;AAAA,IAC7F;AAAA,EACF;AAGA,QAAM,SAAS,OAAO,OAAO,MAAM;AACnC,gBAAc,WAAW,MAAM;AAE/B,UAAQ,IAAI,yCAAoC;AAChD,SAAO;AACT;AAQO,SAAS,YAAY,OAAuB;AACjD,MAAI,QAAQ,MAAM;AAChB,WAAO,GAAG,OAAO,KAAK,CAAC;AAAA,EACzB;AACA,MAAI,QAAQ,OAAO,MAAM;AACvB,WAAO,IAAI,QAAQ,MAAM,QAAQ,CAAC,CAAC;AAAA,EACrC;AACA,MAAI,QAAQ,OAAO,OAAO,MAAM;AAC9B,WAAO,IAAI,QAAQ,OAAO,MAAM,QAAQ,CAAC,CAAC;AAAA,EAC5C;AACA,SAAO,IAAI,QAAQ,OAAO,OAAO,MAAM,QAAQ,CAAC,CAAC;AACnD;;;AC/IA,IAAM,iBAAiB,UAAQ,UAAU;AAmCzC,SAAS,YAAyB;AAChC,MAAI,QAAQ,aAAa,UAAU;AACjC,UAAM,IAAI,MAAM,2CAA2C;AAAA,EAC7D;AAEA,MAAI;AACF,WAAO,eAAe,aAAa;AAAA,EACrC,SAAS,OAAO;AACd,UAAM,UAAU,iBAAiB,QAAQ,MAAM,UAAU,OAAO,KAAK;AACrE,UAAM,IAAI,MAAM,4CAA4C,OAAO,EAAE;AAAA,EACvE;AACF;AAIA,IAAI,QAA4B;AAChC,IAAI,YAA0B;AAE9B,SAAS,WAAwB;AAC/B,MAAI,CAAC,OAAO;AACV,QAAI;AACF,cAAQ,UAAU;AAAA,IACpB,SAAS,OAAO;AACd,kBAAY,iBAAiB,QAAQ,QAAQ,IAAI,MAAM,OAAO,KAAK,CAAC;AACpE,YAAM;AAAA,IACR;AAAA,EACF;AACA,SAAO;AACT;AAKO,SAAS,cAAuB;AACrC,SAAO,QAAQ,aAAa,YAAY,QAAQ,SAAS;AAC3D;AAKO,SAAS,eAA6B;AAC3C,SAAO;AACT;AA8DO,IAAM,mBAAN,MAAuB;AAAA,EACpB;AAAA,EACA,cAAc;AAAA,EAEtB,YAAY,SAA4B;AACtC,SAAK,UAAU;AAAA,EACjB;AAAA;AAAA;AAAA;AAAA;AAAA;AAAA,EAQA,aAA4B;AAC1B,QAAI,KAAK,aAAa;AACpB,aAAO,QAAQ,QAAQ;AAAA,IACzB;AAEA,UAAM,cAAc,SAAS;AAC7B,UAAM,UAAU,YAAY,WAAW;AAAA,MACrC,WAAW,KAAK,QAAQ;AAAA,MACxB,UAAU,KAAK,QAAQ,YAAY;AAAA,MACnC,WAAW,KAAK,QAAQ,aAAa;AAAA,MACrC,SAAS,KAAK,QAAQ,WAAW;AAAA,IACnC,CAAC;AAED,QAAI,CAAC,SAAS;AACZ,aAAO,QAAQ,OAAO,IAAI,MAAM,qCAAqC,CAAC;AAAA,IACxE;AAEA,SAAK,cAAc;AACnB,WAAO,QAAQ,QAAQ;AAAA,EACzB;AAAA;AAAA;AAAA;AAAA,EAKA,UAAmB;AACjB,QAAI,CAAC,KAAK,aAAa;AACrB,aAAO;AAAA,IACT;AACA,QAAI;AACF,aAAO,SAAS,EAAE,cAAc;AAAA,IAClC,QAAQ;AACN,aAAO;AAAA,IACT;AAAA,EACF;AAAA;AAAA;AAAA;AAAA;AAAA;AAAA;AAAA;AAAA
,EASA,WAAW,SAAuB,aAAa,MAAqC;AAClF,QAAI,CAAC,KAAK,aAAa;AACrB,aAAO,QAAQ,OAAO,IAAI,MAAM,0DAA0D,CAAC;AAAA,IAC7F;AAEA,UAAM,SAAS,SAAS,EAAE,WAAW,SAAS,UAAU;AAExD,WAAO,QAAQ,QAAQ;AAAA,MACrB,MAAM,OAAO;AAAA,MACb,UAAU,OAAO;AAAA,MACjB,YAAY,OAAO;AAAA,MACnB,UAAU,OAAO;AAAA,IACnB,CAAC;AAAA,EACH;AAAA;AAAA;AAAA;AAAA,EAKA,UAAgB;AACd,QAAI,KAAK,aAAa;AACpB,UAAI;AACF,iBAAS,EAAE,QAAQ;AAAA,MACrB,QAAQ;AAAA,MAER;AACA,WAAK,cAAc;AAAA,IACrB;AAAA,EACF;AAAA;AAAA;AAAA;AAAA,EAKA,aAAiE;AAC/D,WAAO,SAAS,EAAE,WAAW;AAAA,EAC/B;AAAA;AAGF;","names":[]}