npm - prism-mcp-server - Versions diffs - 7.5.0 → 7.6.0 - Mend

prism-mcp-server 7.5.0 → 7.6.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (6)

package/README.md +60 -28
package/dist/config.js +11 -0
package/dist/utils/llm/adapters/anthropic.js +5 -2
package/dist/utils/llm/adapters/voyage.js +129 -0
package/dist/utils/llm/factory.js +12 -5
package/package.json +1 -1

package/README.md CHANGED

@@ -8,7 +8,7 @@
 [![TypeScript](https://img.shields.io/badge/TypeScript-5.0+-3178C6?logo=typescript&logoColor=white)](https://www.typescriptlang.org/)
 [![PRs Welcome](https://img.shields.io/badge/PRs-welcome-brightgreen.svg)](CONTRIBUTING.md)
-![Prism Mind Palace Demo](docs/mind-palace-demo.webp)
+![Prism Mind Palace Dashboard](docs/mind-palace-dashboard.png)
 **Your AI agent forgets everything between sessions. Prism fixes that.**
@@ -23,21 +23,21 @@ Works with **Claude Desktop · Claude Code · Cursor · Windsurf · Cline · Gem
 ## 📖 Table of Contents
 - [Why Prism?](#why-prism)
-- [Quick Start](#-quick-start)
-- [The Magic Moment](#-the-magic-moment)
-- [Setup Guides](#-setup-guides)
-- [Universal Import](#-universal-import-bring-your-history)
-- [What Makes Prism Different](#-what-makes-prism-different)
-- [Data Privacy & Egress](#-data-privacy--egress)
-- [Use Cases](#-use-cases)
-- [What's New](#-whats-new)
-- [How Prism Compares](#-how-prism-compares)
-- [Tool Reference](#-tool-reference)
+- [Quick Start](#quick-start)
+- [The Magic Moment](#the-magic-moment)
+- [Setup Guides](#setup-guides)
+- [Universal Import: Bring Your History](#universal-import-bring-your-history)
+- [What Makes Prism Different](#what-makes-prism-different)
+- [Data Privacy & Egress](#data-privacy--egress)
+- [Use Cases](#use-cases)
+- [What's New](#whats-new)
+- [How Prism Compares](#how-prism-compares)
+- [Tool Reference](#tool-reference)
 - [Environment Variables](#environment-variables)
 - [Architecture](#architecture)
-- [Scientific Foundation](#-scientific-foundation)
-- [Milestones & Roadmap](#-milestones--roadmap)
-- [Troubleshooting FAQ](#-troubleshooting-faq)
+- [Scientific Foundation](#scientific-foundation)
+- [Milestones & Roadmap](#milestones--roadmap)
+- [Troubleshooting FAQ](#troubleshooting-faq)
 ---
@@ -84,6 +84,8 @@ Add to your MCP client config (`claude_desktop_config.json`, `.cursor/mcp.json`,
 **That's it.** Restart your client. All tools are available. The **Mind Palace Dashboard** (the visual UI for your agent's brain) starts automatically at `http://localhost:3000`. You don't need to keep a tab open — the dashboard runs in the background and the MCP tools work with or without it.
+> 🔮 **Pro Tip:** Once installed, open **`http://localhost:3000`** in your browser to view the Mind Palace Dashboard — a beautiful, real-time UI of your agent's brain. Explore the Knowledge Graph, Intent Health gauges, and Session Ledger.
 > 🔄 **Updating Prism:** `npx -y` caches the package locally. To force an update to the latest version, restart your MCP client — `npx -y` will fetch the newest release automatically. If you're stuck on a stale version, run `npx clear-npx-cache` (or `npm cache clean --force`) before restarting.
 <details>
@@ -318,6 +320,32 @@ Then add to your MCP config:
 </details>
+<details>
+<summary><strong>Cloud Deployment (Render)</strong></summary>
+Prism can be deployed natively to cloud platforms like [Render](https://render.com) so your agent's memory is always online and accessible across different machines or teams.
+1. Fork this repository.
+2. In the Render Dashboard, create a new **Web Service** pointing to your repository.
+3. In the setup wizard, select **Docker** as the Runtime.
+4. Set the Dockerfile path to `Dockerfile.smithery`.
+5. Connect your local MCP client to your new cloud endpoint using the `sse` transport:
+```json
+{
+  "mcpServers": {
+    "prism-mcp-cloud": {
+      "command": "npx",
+      "args": ["-y", "supergateway", "--url", "https://your-prism-app.onrender.com/sse"]
+    }
+  }
+}
+```
+> **Note:** The `Dockerfile.smithery` uses an optimized multi-stage build that compiles TypeScript safely in a development environment before booting the server in a stripped-down production image. No npm publishing required!
+</details>
 ### Common Installation Pitfalls
 > **❌ Don't use `npm install -g`:**
@@ -343,11 +371,11 @@ Then add to your MCP config:
 > ```
 > At the start of every conversation, call session_load_context with project "my-project" before doing any work.
 > ```
-> Claude Code users can use the `.clauderules` auto-load hook shown in the [Setup Guides](#-setup-guides). Prism also has a **server-side fallback** (v5.2.1+) that auto-pushes context after 10 seconds if no load is detected.
+> Claude Code users can use the `.clauderules` auto-load hook shown in the [Setup Guides](#setup-guides). Prism also has a **server-side fallback** (v5.2.1+) that auto-pushes context after 10 seconds if no load is detected.
 ---
-## 📥 Universal Import — Bring Your History
+## 📥 Universal Import: Bring Your History
 Switching to Prism? Don't leave months of AI session history behind. Prism can **ingest historical sessions from Claude Code, Gemini, and OpenAI** and give your Mind Palace an instant head start — no manual re-entry required.
@@ -373,7 +401,7 @@ npx -y prism-mcp-server universal-import --format gemini --path ./gemini_history
 **Option 2 — Dashboard:** Open `localhost:3000`, navigate to the **Import** tab, select the format and file, and click Import. Supports dry-run preview.
 ### Why It's Safe to Re-Run
-* **OOM-Safe Streaming:** Processes massive log files line-by-line using `stream-json`.
+* **Memory-Safe Streaming:** Processes massive log files line-by-line using `stream-json` to prevent Out-of-Memory (OOM) crashes.
 * **Idempotent Dedup:** Content-hash prevents duplicate imports on re-run (`skipCount` reported).
 * **Chronological Integrity:** Uses timestamp fallbacks and `requestId` sorting to preserve your memory timeline.
 * **Smart Context Mapping:** Extracts `cwd`, `gitBranch`, and tool usage patterns into searchable metadata.
@@ -403,7 +431,7 @@ A gorgeous glassmorphism UI at `localhost:3000` that lets you see exactly what y
 - **Morning Briefing** — AI-synthesized action plan after 4+ hours away
 - **Brain Health** — memory integrity scan with one-click auto-repair
-![Mind Palace Dashboard](docs/mind-palace-dashboard.png)
 ### 🧬 10× Memory Compression
 Powered by a pure TypeScript port of Google's TurboQuant (inspired by Google's ICLR research), Prism compresses 768-dim embeddings from **3,072 bytes → ~400 bytes** — enabling decades of session history on a standard laptop. No native modules. No vector database required.
@@ -423,7 +451,12 @@ OpenTelemetry spans for every MCP tool call, LLM hop, and background worker. Rou
 ### 🌐 Autonomous Web Scholar
 Prism researches while you sleep. A background pipeline searches the web, scrapes articles, synthesizes findings via LLM, and injects results directly into your semantic memory — fully searchable on your next session. Brave Search → Firecrawl scrape → LLM synthesis → Prism ledger. Task-aware, Hivemind-integrated, and zero-config when API keys are missing (falls back to Yahoo + Readability).
-### 🔒 Data Privacy & Egress
+### 🏭 Dark Factory — Adversarial Autonomous Pipelines
+When you trigger a Dark Factory pipeline, Prism doesn't just run your task — it fights itself to produce high-quality output. A `PLAN_CONTRACT` step locks a machine-parseable rubric before any code is written. After execution, an **Adversarial Evaluator** (in a fully isolated context) scores the output against the rubric. It cannot pass the Generator without providing exact file and line evidence for every failing criterion. Failed evaluations inject the critique directly into the Generator's retry prompt so it's never flying blind. The result: security issues, regressions, and lazy debug logs caught autonomously — before you ever see the PR. → [See it in action](examples/adversarial-eval-demo/README.md)
+---
+## 🔒 Data Privacy & Egress
 **Where is my data stored?**
@@ -446,14 +479,12 @@ Prism will recreate the directory with empty databases on next startup.
 **What leaves your machine?**
 - **Local mode (default):** Nothing. Zero network calls. All data is on-disk SQLite.
 - **With `GOOGLE_API_KEY`:** Text snippets are sent to Gemini for embedding generation, summaries, and Morning Briefings. No session data is stored on Google's servers beyond the API call.
+- **With `VOYAGE_API_KEY` / `OPENAI_API_KEY`:** Text snippets are sent to Voyage AI or OpenAI when that provider is selected as your embedding endpoint.
 - **With `BRAVE_API_KEY` / `FIRECRAWL_API_KEY`:** Web Scholar queries are sent to Brave/Firecrawl for search and scraping.
 - **With Supabase:** Session data syncs to your own Supabase instance (you control the Postgres database).
 **GDPR compliance:** Soft/hard delete (Art. 17), full export in JSON, Markdown, or Obsidian vault `.zip` (Art. 20), API key redaction in exports, per-project TTL retention policies, and immutable audit trail. Enterprise-ready out of the box.
-### 🏭 Dark Factory — Adversarial Autonomous Pipelines
-When you trigger a Dark Factory pipeline, Prism doesn't just run your task — it fights itself to produce high-quality output. A `PLAN_CONTRACT` step locks a machine-parseable rubric before any code is written. After execution, an **Adversarial Evaluator** (in a fully isolated context) scores the output against the rubric. It cannot pass the Generator without providing exact file and line evidence for every failing criterion. Failed evaluations inject the critique directly into the Generator's retry prompt so it's never flying blind. The result: security issues, regressions, and lazy debug logs caught autonomously — before you ever see the PR. → [See it in action](examples/adversarial-eval-demo/README.md)
 ---
 ## 🎯 Use Cases
@@ -461,8 +492,8 @@ When you trigger a Dark Factory pipeline, Prism doesn't just run your task — i
 - **Long-running feature work** — Save state at end of day, restore full context next morning. No re-explaining.
 - **Multi-agent collaboration** — Dev, QA, and PM agents share real-time context without stepping on each other's memory.
 - **Consulting / multi-project** — Switch between client projects with progressive loading: `quick` (~50 tokens), `standard` (~200), or `deep` (~1000+).
-  - **Autonomous execution (v7.4)** — Dark Factory pipeline: `plan → plan_contract → execute → evaluate → verify → finalize`. Generator and evaluator run in isolated roles — the evaluator cannot approve without evidence-bound findings scored against a pre-committed rubric.
-  - **Project health monitoring (v7.5)** — Intent Health Dashboard scores each project 0–100 based on staleness, TODO load, and decision quality — turning silent drift into an actionable signal.
+- **Autonomous execution (v7.4)** — Dark Factory pipeline: `plan → plan_contract → execute → evaluate → verify → finalize`. Generator and evaluator run in isolated roles — the evaluator cannot approve without evidence-bound findings scored against a pre-committed rubric.
+- **Project health monitoring (v7.5)** — Intent Health Dashboard scores each project 0–100 based on staleness, TODO load, and decision quality — turning silent drift into an actionable signal.
 - **Team onboarding** — New team member's agent loads the full project history instantly.
 - **Behavior enforcement** — Agent corrections auto-graduate into permanent `.cursorrules` / `.clauderules` rules.
 - **Offline / air-gapped** — Full SQLite local mode + Ollama LLM adapter. Zero internet dependency.
@@ -485,8 +516,6 @@ Then continue a specific thread with a follow-up message to the selected agent,
 ---
----
 ## ⚔️ Adversarial Evaluation in Action
 > **Split-Brain Anti-Sycophancy** — the signature feature of v7.4.0.
@@ -811,6 +840,8 @@ Requires `PRISM_DARK_FACTORY_ENABLED=true`.
 | `PRISM_ENABLE_HIVEMIND` | No | `"true"` to enable multi-agent tools — restart required |
 | `PRISM_INSTANCE` | No | Instance name for multi-server PID isolation |
 | `GOOGLE_API_KEY` | No | Gemini — enables semantic search, Briefings, compaction |
+| `VOYAGE_API_KEY` | No | Voyage AI — optional premium embedding provider |
+| `OPENAI_API_KEY` | No | OpenAI — optional proxy model and embedding provider |
 | `BRAVE_ANSWERS_API_KEY` | No | Separate Brave Answers key |
 | `SUPABASE_URL` | If cloud | Supabase project URL |
 | `SUPABASE_KEY` | If cloud | Supabase anon/service key |
@@ -982,18 +1013,19 @@ A: Use `session_forget_memory` for targeted soft/hard deletion. For manual clean
 **Q: How do I verify the install quickly?**
 A: Run `npm run build && npm test`, then open the Mind Palace dashboard (`localhost:3000`) and confirm projects load plus Graph Health renders.
 ---
 ### 💡 Known Limitations & Quirks
 - **LLM-dependent features require an API key.** Semantic search, Morning Briefings, auto-compaction, and VLM captioning need a `GOOGLE_API_KEY` (your Gemini API key) or equivalent provider key. Without one, Prism falls back to keyword-only search (FTS5).
-- **Auto-load is model- and client-dependent.** Session auto-loading relies on both the LLM following system prompt instructions *and* the MCP client completing tool registration before the model's first turn. Prism provides platform-specific [Setup Guides](#-setup-guides) and a server-side fallback (v5.2.1) that auto-pushes context after 10 seconds.
+- **Auto-load is model- and client-dependent.** Session auto-loading relies on both the LLM following system prompt instructions *and* the MCP client completing tool registration before the model's first turn. Prism provides platform-specific [Setup Guides](#setup-guides) and a server-side fallback (v5.2.1) that auto-pushes context after 10 seconds.
 - **MCP client race conditions.** Some MCP clients may not finish tool enumeration before the model generates its first response, causing transient `unknown_tool` errors. This is a client-side timing issue — Prism's server completes the MCP handshake in ~60ms. Workaround: the server-side auto-push fallback and the startup skill's retry logic.
 - **No real-time sync without Supabase.** Local SQLite mode is single-machine only. Multi-device or team sync requires a Supabase backend.
 - **Embedding quality varies by provider.** Gemini `text-embedding-004` and OpenAI `text-embedding-3-small` produce high-quality 768-dim vectors. Prism passes `dimensions: 768` via the Matryoshka API for OpenAI models (native output is 1536-dim; this truncation is lossless and outperforms ada-002 at full 1536 dims). Ollama embeddings (e.g., `nomic-embed-text`) are usable but may reduce retrieval accuracy.
 - **Dashboard is HTTP-only.** The Mind Palace dashboard at `localhost:3000` does not support HTTPS. For remote access, use a reverse proxy (nginx/Caddy) or SSH tunnel. Basic auth is available via `PRISM_DASHBOARD_USER` / `PRISM_DASHBOARD_PASS`.
 - **Long-lived clients can accumulate zombie processes.** MCP clients that run for extended periods (e.g., Claude CLI) may leave orphaned Prism server processes. The lifecycle manager detects true orphans (PPID=1) but allows coexistence for active parent processes. Use `PRISM_INSTANCE` to isolate instances across clients.
-- **Migration is one-way.** Universal History Migration imports sessions *into* Prism but does not export back to Claude/Gemini/OpenAI formats. Use `session_export_memory` for portable JSON/Markdown export, or the `vault` format for Obsidian/Logseq-compatible `.zip` archives.
+- **Migration is one-way.** Universal Import ingests sessions *into* Prism but does not export back to Claude/Gemini/OpenAI formats. Use `session_export_memory` for portable JSON/Markdown export, or the `vault` format for Obsidian/Logseq-compatible `.zip` archives.
 - **Export ceiling at 10,000 ledger entries.** The `session_export_memory` tool and the dashboard export button cap vault/JSON exports at 10,000 entries per project as an OOM guard. Projects exceeding this limit should use per-project exports and time-based filtering to stay within the ceiling. This limit does not affect search or context loading.
 - **No Windows CI testing.** Prism is developed and tested on macOS/Linux. It should work on Windows via Node.js, but edge cases (file paths, PID locks) may surface.

package/dist/config.js CHANGED

@@ -15,6 +15,9 @@ import { fileURLToPath } from "node:url";
  *   SUPABASE_KEY           — (optional) Your Supabase anon/service key. Enables session memory tools.
  *   PRISM_USER_ID          — (optional) Unique tenant ID for multi-user Supabase instances.
  *                            Defaults to "default". Set per-user in Claude Desktop config.
+ *   VOYAGE_API_KEY         — (optional) API key for Voyage AI embeddings. Enables embedding_provider=voyage.
+ *                            Voyage AI is the embedding provider recommended by Anthropic for use with
+ *                            Claude. Get a free key at https://dash.voyageai.com.
  *
  * If a required key is missing, the process exits immediately.
  * If an optional key is missing, a warning is logged but the server continues
@@ -60,6 +63,14 @@ export const BRAVE_ANSWERS_API_KEY = process.env.BRAVE_ANSWERS_API_KEY;
 if (!BRAVE_ANSWERS_API_KEY) {
     console.error("Warning: BRAVE_ANSWERS_API_KEY environment variable is missing. Brave Answers tool will be unavailable.");
 }
+// ─── Optional: Voyage AI API Key ──────────────────────────────
+// Used when embedding_provider = "voyage" in the dashboard.
+// Voyage AI is the embedding provider recommended by Anthropic for use
+// alongside Claude. voyage-3 supports 768-dim output via MRL truncation,
+// matching Prism's storage schema for zero-migration drop-in replacement.
+// Without this, VoyageAdapter construction will throw at server start if
+// embedding_provider=voyage is selected.
+export const VOYAGE_API_KEY = process.env.VOYAGE_API_KEY;
 // ─── v2.0: Storage Backend Selection ─────────────────────────
 // REVIEWER NOTE: Step 1 of v2.0 introduces a storage abstraction.
 // Currently only "supabase" is implemented. "local" (SQLite) is
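
The new config comment says that a missing `VOYAGE_API_KEY` only becomes a problem when `embedding_provider=voyage` is selected. A rough sketch of that rule as a pre-flight check is shown here. `checkVoyageConfig` is a hypothetical function, not something the package exports, and it looks at environment variables only, whereas the real adapter also consults the dashboard setting `voyage_api_key`.

```javascript
// Hypothetical pre-flight check (illustrative only, not part of the package).
// Mirrors the rule in the config comment: the key is optional unless the
// Voyage embedding provider is selected.
function checkVoyageConfig(embeddingProvider, env) {
  const hasKey = Boolean(env.VOYAGE_API_KEY && env.VOYAGE_API_KEY.trim());
  if (embeddingProvider === "voyage" && !hasKey) {
    return { ok: false, message: "embedding_provider=voyage requires VOYAGE_API_KEY" };
  }
  return {
    ok: true,
    message: hasKey ? "VOYAGE_API_KEY is set" : "VOYAGE_API_KEY not set (only needed for voyage)",
  };
}

console.log(checkVoyageConfig("voyage", {}).message);
```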

package/dist/utils/llm/adapters/anthropic.js CHANGED

@@ -85,8 +85,11 @@ export class AnthropicAdapter {
         // silent zero-vector or crash.
         throw new Error("AnthropicAdapter does not support text embeddings. " +
             "Anthropic has no native embedding API. " +
-            "In the Mind Palace dashboard, set 'Embedding Provider' to Gemini or OpenAI/Ollama. " +
-            "When using Ollama locally, 'nomic-embed-text' is a free, high-quality option.");
+            "Their official recommendation is Voyage AI (voyage-3, voyage-3-lite). " +
+            "In the Mind Palace dashboard, set 'Embedding Provider' to: " +
+            "'voyage' (Anthropic-recommended, set VOYAGE_API_KEY), " +
+            "'openai' (OpenAI cloud or local Ollama with nomic-embed-text), " +
+            "or 'gemini' (Google AI, set GOOGLE_API_KEY).");
     }
     // ─── Image Description (VLM) ─────────────────────────────────────────────
     /**

package/dist/utils/llm/adapters/voyage.js ADDED

@@ -0,0 +1,129 @@
+/**
+ * Voyage AI Adapter (v1.0)
+ * ─────────────────────────────────────────────────────────────────────────────
+ * PURPOSE:
+ *   Implements LLMProvider using Voyage AI's REST API for text embeddings.
+ *   Voyage AI is the embedding provider officially recommended by Anthropic
+ *   for use alongside Claude — it fills the gap left by Anthropic's lack
+ *   of a native embedding API.
+ *
+ * TEXT GENERATION:
+ *   Voyage AI is an embeddings-only service. generateText() throws an explicit
+ *   error, the same pattern used by AnthropicAdapter.generateEmbedding().
+ *   Set text_provider separately (anthropic, openai, or gemini).
+ *
+ * EMBEDDING DIMENSION PARITY (768 dims):
+ *   Prism's SQLite (sqlite-vec) and Supabase (pgvector) schemas define
+ *   embedding columns as EXACTLY 768 dimensions.
+ *
+ *   Voyage solution: voyage-3 outputs 1024 dims by default; this adapter
+ *   requests 768 via the `output_dimension` parameter (Matryoshka
+ *   Representation Learning) to match the schema. voyage-3-lite outputs
+ *   512 dims natively and cannot reach 768 (see NOTE below).
+ *
+ * MODELS:
+ *   voyage-3           — Highest quality, 1024 dims natively (MRL → 768)
+ *   voyage-3-lite      — Fast & cheap, 512 dims natively (MRL → 768 NOT supported)
+ *   voyage-3-large     — Best quality, use for offline indexing
+ *   voyage-code-3      — Optimised for code (recommended for dev sessions)
+ *
+ *   NOTE: voyage-3-lite natively outputs 512 dims; it does NOT support
+ *   output_dimension truncation to 768. Use voyage-3 for dimension parity.
+ *   Default is voyage-3 for this reason.
+ *
+ * CONFIG KEYS (Prism dashboard "AI Providers" tab OR environment variables):
+ *   voyage_api_key     — Required. Voyage AI API key (pa-...)
+ *   voyage_model       — Embedding model (default: voyage-3)
+ *
+ * USAGE WITH ANTHROPIC TEXT PROVIDER:
+ *   Set text_provider=anthropic, embedding_provider=voyage in the dashboard.
+ *   This pairs Claude for reasoning with Voyage for semantic memory — the
+ *   combination Anthropic recommends in their documentation.
+ *
+ * API REFERENCE:
+ *   https://docs.voyageai.com/reference/embeddings-api
+ */
+import { getSettingSync } from "../../../storage/configStorage.js";
+import { debugLog } from "../../logger.js";
+// ─── Constants ────────────────────────────────────────────────────────────────
+// Must match Prism's DB schema (sqlite-vec and pgvector column sizes).
+const EMBEDDING_DIMS = 768;
+// voyage-3 supports up to 32,000 tokens. Character-based cap (consistent
+// with OpenAI and Gemini adapters) avoids tokenizer dependency.
+// 8000 chars ≈ 1500-2000 tokens for typical session summaries.
+const MAX_EMBEDDING_CHARS = 8000;
+// Default model: voyage-3 (supports output_dimension=768 via MRL)
+// voyage-3-lite is NOT recommended as its native 512 dims < 768.
+const DEFAULT_MODEL = "voyage-3";
+const VOYAGE_API_BASE = "https://api.voyageai.com/v1";
+// ─── Adapter ─────────────────────────────────────────────────────────────────
+export class VoyageAdapter {
+    apiKey;
+    constructor() {
+        const apiKey = getSettingSync("voyage_api_key", process.env.VOYAGE_API_KEY ?? "");
+        if (!apiKey) {
+            throw new Error("VoyageAdapter requires a Voyage AI API key. " +
+                "Get one free at https://dash.voyageai.com — then set VOYAGE_API_KEY " +
+                "or configure it in the Mind Palace dashboard under 'AI Providers'.");
+        }
+        this.apiKey = apiKey;
+        debugLog("[VoyageAdapter] Initialized");
+    }
+    // ─── Text Generation (Not Supported) ────────────────────────────────────
+    async generateText(_prompt, _systemInstruction) {
+        // Voyage AI is an embeddings-only service.
+        // Use text_provider=anthropic, openai, or gemini for text generation.
+        throw new Error("VoyageAdapter does not support text generation. " +
+            "Voyage AI is an embeddings-only service. " +
+            "Set text_provider to 'anthropic', 'openai', or 'gemini' in the dashboard.");
+    }
+    // ─── Embedding Generation ────────────────────────────────────────────────
+    async generateEmbedding(text) {
+        if (!text || !text.trim()) {
+            throw new Error("[VoyageAdapter] generateEmbedding called with empty text");
+        }
+        // Truncate to character limit (consistent with other adapters)
+        const truncated = text.length > MAX_EMBEDDING_CHARS
+            ? text.slice(0, MAX_EMBEDDING_CHARS).replace(/\s+\S*$/, "")
+            : text;
+        const model = getSettingSync("voyage_model", DEFAULT_MODEL);
+        debugLog(`[VoyageAdapter] generateEmbedding — model=${model}, chars=${truncated.length}`);
+        const requestBody = {
+            input: [truncated],
+            model,
+            // Request exactly 768 dims via Matryoshka truncation.
+            // Supported by voyage-3, voyage-3-large, voyage-code-3.
+            // voyage-3-lite (native 512 dims) will ignore this and return 512,
+            // which will be caught by the dimension guard below.
+            output_dimension: EMBEDDING_DIMS,
+        };
+        const response = await fetch(`${VOYAGE_API_BASE}/embeddings`, {
+            method: "POST",
+            headers: {
+                "Authorization": `Bearer ${this.apiKey}`,
+                "Content-Type": "application/json",
+            },
+            body: JSON.stringify(requestBody),
+        });
+        if (!response.ok) {
+            const errorText = await response.text().catch(() => "unknown error");
+            throw new Error(`[VoyageAdapter] API request failed — status=${response.status}: ${errorText}`);
+        }
+        const data = (await response.json());
+        const embedding = data?.data?.[0]?.embedding;
+        if (!Array.isArray(embedding)) {
+            throw new Error("[VoyageAdapter] Unexpected response format — no embedding array found");
+        }
+        // Dimension guard: Prism's DB schema requires exactly 768 dims.
+        // This catches voyage-3-lite (512) or future API changes early instead of failing silently.
+        if (embedding.length !== EMBEDDING_DIMS) {
+            throw new Error(`[VoyageAdapter] Embedding dimension mismatch: expected ${EMBEDDING_DIMS}, ` +
+                `got ${embedding.length}. ` +
+                `Use voyage-3 (not voyage-3-lite) to get 768-dim output via MRL truncation. ` +
+                `Change voyage_model in the Mind Palace dashboard.`);
+        }
+        debugLog(`[VoyageAdapter] Embedding generated — dims=${embedding.length}, ` +
+            `tokens_used=${data.usage?.total_tokens ?? "unknown"}`);
+        return embedding;
+    }
+}
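
The two pure steps in `generateEmbedding` above, character truncation and the dimension guard, can be exercised without calling the Voyage API. The sketch below copies the same expression and check into standalone functions. The function names `truncateForEmbedding` and `assertEmbeddingDims` are made up for illustration and do not exist in the adapter.

```javascript
const EMBEDDING_DIMS = 768;
const MAX_EMBEDDING_CHARS = 8000;

// Same expression as the adapter: cut at maxChars, then drop the trailing
// whitespace run and whatever partial word follows it.
function truncateForEmbedding(text, maxChars = MAX_EMBEDDING_CHARS) {
  if (text.length <= maxChars) return text;
  return text.slice(0, maxChars).replace(/\s+\S*$/, "");
}

// Same guard as the adapter: reject anything that is not a 768-element array.
function assertEmbeddingDims(embedding, expected = EMBEDDING_DIMS) {
  if (!Array.isArray(embedding)) {
    throw new Error("Unexpected response format: no embedding array found");
  }
  if (embedding.length !== expected) {
    throw new Error(`Embedding dimension mismatch: expected ${expected}, got ${embedding.length}`);
  }
  return embedding;
}

console.log(truncateForEmbedding("hello world foo", 13)); // "hello world"
```

Two properties of the regex are worth knowing. When the cut lands exactly on a word boundary, the last complete word is still removed (a limit of 11 on the same input yields `"hello"`), and text with no whitespace at all is cut at the limit with nothing stripped.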

package/dist/utils/llm/factory.js CHANGED

@@ -1,5 +1,5 @@
 /**
- * LLM Provider Factory (v4.4 — Split Provider Architecture)
+ * LLM Provider Factory (v4.5 — Voyage AI Embedding Support)
  * ─────────────────────────────────────────────────────────────────────────────
  * PURPOSE:
  *   Single point of resolution for the active LLMProvider.
@@ -11,19 +11,21 @@
  *   Two independent settings control text and embedding routing:
  *
  *   text_provider      — "gemini" (default) | "openai" | "anthropic"
- *   embedding_provider — "auto" (default)   | "gemini" | "openai"
+ *   embedding_provider — "auto" (default)   | "gemini" | "openai" | "voyage"
  *
  *   When embedding_provider = "auto":
  *     * If text_provider is gemini or openai → use same provider for embeddings
  *     * If text_provider is anthropic → auto-fallback to gemini for embeddings
- *       (Anthropic has no native embedding API)
+ *       (Anthropic has no native embedding API; consider setting
+ *        embedding_provider=voyage for the Anthropic-recommended pairing)
  *
  * EXAMPLE CONFIGURATIONS:
  *   text_provider=gemini,    embedding_provider=auto   → Gemini+Gemini (default)
  *   text_provider=openai,    embedding_provider=auto   → OpenAI+OpenAI
  *   text_provider=anthropic, embedding_provider=auto   → Claude+Gemini (auto-bridge)
+ *   text_provider=anthropic, embedding_provider=voyage → Claude+Voyage (Anthropic-recommended)
  *   text_provider=anthropic, embedding_provider=openai → Claude+Ollama (cost-optimized)
- *   text_provider=gemini,    embedding_provider=openai → Gemini+Ollama (mixed)
+ *   text_provider=gemini,    embedding_provider=voyage → Gemini+Voyage (mixed)
  *
  * SINGLETON + GRACEFUL DEGRADATION:
  *   Same as before — instance cached per process, errors fall back to Gemini.
@@ -41,6 +43,7 @@ import { getSettingSync } from "../../storage/configStorage.js";
 import { GeminiAdapter } from "./adapters/gemini.js";
 import { OpenAIAdapter } from "./adapters/openai.js";
 import { AnthropicAdapter } from "./adapters/anthropic.js";
+import { VoyageAdapter } from "./adapters/voyage.js";
 import { TracingLLMProvider } from "./adapters/traced.js";
 // Module-level singleton — one composed provider per MCP server process.
 let providerInstance = null;
@@ -59,8 +62,10 @@ function buildEmbeddingAdapter(type) {
     // Note: "anthropic" is intentionally absent from this switch.
     // Anthropic has no embedding API, so it can never be an embedding provider.
     // The factory resolves "auto" away from "anthropic" before calling this.
+    // For Anthropic text users, "voyage" is the Anthropic-recommended pairing.
     switch (type) {
         case "openai": return new OpenAIAdapter();
+        case "voyage": return new VoyageAdapter();
         case "gemini":
         default: return new GeminiAdapter();
     }
@@ -90,7 +95,9 @@ export function getLLMProvider() {
         if (textType === "anthropic") {
             console.info("[LLMFactory] text_provider=anthropic with embedding_provider=auto: " +
                 "routing embeddings to GeminiAdapter (Anthropic has no native embedding API). " +
-                "Set embedding_provider=openai in dashboard to use Ollama/OpenAI instead.");
+                "For the Anthropic-recommended pairing, set embedding_provider=voyage in the dashboard " +
+                "(voyage-3 supports 768-dim output via MRL). " +
+                "Alternatively, set embedding_provider=openai to use Ollama/OpenAI.");
         }
     }
     try {
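
The routing rules in the factory header comment can be summarised as a small pure function. This is a sketch based on the comment and on the `switch` in `buildEmbeddingAdapter`. `resolveEmbeddingProvider` is a hypothetical name, and the sketch returns a provider label rather than constructing adapters. It also leaves out the error fallback to Gemini that the header mentions.

```javascript
// Hypothetical resolver (illustrative): returns which embedding provider the
// documented rules would select for a given pair of settings.
function resolveEmbeddingProvider(textProvider, embeddingProvider = "auto") {
  if (embeddingProvider === "auto") {
    // gemini/openai reuse the text provider; anthropic has no embedding API,
    // so "auto" bridges it to gemini.
    return textProvider === "openai" ? "openai" : "gemini";
  }
  switch (embeddingProvider) {
    case "openai":
    case "voyage":
      return embeddingProvider;
    case "gemini":
    default:
      // Mirrors the switch default in buildEmbeddingAdapter.
      return "gemini";
  }
}

console.log(resolveEmbeddingProvider("anthropic", "auto"));   // "gemini"
console.log(resolveEmbeddingProvider("anthropic", "voyage")); // "voyage"
```

Note that under these rules Voyage is never chosen automatically. It has to be selected explicitly, which matches the `console.info` hint in the hunk above.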

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "prism-mcp-server",
-  "version": "7.5.0",
+  "version": "7.6.0",
   "mcpName": "io.github.dcostenco/prism-mcp",
   "description": "The Mind Palace for AI Agents — adversarial evaluation (PLAN_CONTRACT→EVALUATE anti-sycophancy), fail-closed Dark Factory autonomous pipelines (3-gate parse→type→scope), persistent memory (SQLite/Supabase), ACT-R cognitive retrieval, behavioral learning & IDE rules sync, multi-agent Hivemind, time travel, visual dashboard. Zero-config local mode.",
   "module": "index.ts",