npm - prism-mcp-server - Versions diffs - 2.3.11 → 2.5.0 - Mend

prism-mcp-server 2.3.11 → 2.5.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (9) hide show

package/README.md +125 -15
package/dist/server.js +13 -2
package/dist/storage/sqlite.js +70 -3
package/dist/storage/supabase.js +35 -0
package/dist/tools/index.js +2 -2
package/dist/tools/sessionMemoryDefinitions.js +88 -0
package/dist/tools/sessionMemoryHandlers.js +285 -42
package/dist/utils/tracing.js +139 -0
package/package.json +2 -1

package/README.md CHANGED Viewed

@@ -2,19 +2,29 @@
 [![npm version](https://img.shields.io/npm/v/prism-mcp-server?color=cb0000&label=npm)](https://www.npmjs.com/package/prism-mcp-server)
 [![MCP Registry](https://img.shields.io/badge/MCP_Registry-listed-00ADD8?logo=data:image/svg+xml;base64,PHN2ZyB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciIHZpZXdCb3g9IjAgMCAyNCAyNCI+PHBhdGggZmlsbD0id2hpdGUiIGQ9Ik0xMiAyTDIgN2wxMCA1IDEwLTUtMTAtNXpNMiAxN2wxMCA1IDEwLTV2LTJMMTI0djJMMiA5djh6Ii8+PC9zdmc+)](https://registry.modelcontextprotocol.io)
-[![Glama](https://img.shields.io/badge/Glama-listed-FF5601)](https://glama.ai/mcp/servers/dcostenco/BCBA)
+[![Glama](https://img.shields.io/badge/Glama-listed-FF5601)](https://glama.ai/mcp/servers/dcostenco/prism-mcp)
 [![Smithery](https://img.shields.io/badge/Smithery-listed-6B4FBB)](https://smithery.ai/server/prism-mcp-server)
 [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)
 [![TypeScript](https://img.shields.io/badge/TypeScript-5.0+-3178C6?logo=typescript&logoColor=white)](https://www.typescriptlang.org/)
 [![Node.js](https://img.shields.io/badge/Node.js-18+-339933?logo=node.js&logoColor=white)](https://nodejs.org/)
-> **Your AI agent's memory that survives between sessions.** Prism MCP is a Model Context Protocol server that gives Claude Desktop, Cursor, Windsurf, and any MCP client **persistent memory**, **time travel**, **visual context**, **multi-agent sync**, and **multi-engine search** — all running locally with zero cloud dependencies.
+> **Your AI agent's memory that survives between sessions.** Prism MCP is a Model Context Protocol server that gives Claude Desktop, Cursor, Windsurf, and any MCP client **persistent memory**, **time travel**, **visual context**, **multi-agent sync**, **GDPR-compliant deletion**, **memory tracing**, and **LangChain integration** — all running locally with zero cloud dependencies.
 >
-> Built with **SQLite + F32_BLOB vector search**, **optimistic concurrency control**, **MCP Prompts & Resources**, **auto-compaction**, **Gemini-powered Morning Briefings**, and optional **Supabase cloud sync**.
+> Built with **SQLite + F32_BLOB vector search**, **optimistic concurrency control**, **MCP Prompts & Resources**, **auto-compaction**, **Gemini-powered Morning Briefings**, **MemoryTrace explainability**, and optional **Supabase cloud sync**.
 ---
-## What's New in v2.3.10 — Stability & Fixes 🛠️
+## What's New in v2.5.0 — Enterprise Memory 🏗️
+| Feature | Description |
+|---|---|
+| 🔍 **Memory Tracing (Phase 1)** | Every search now returns a structured `MemoryTrace` with latency breakdown (`embedding_ms`, `storage_ms`, `total_ms`), search strategy, and scoring metadata — surfaced as a separate `content[1]` block for LangSmith integration. |
+| 🛡️ **GDPR Memory Deletion (Phase 2)** | New `session_forget_memory` tool with soft-delete (tombstoning via `deleted_at`) and hard-delete. Ownership guards prevent cross-user deletion. `deleted_reason` column captures GDPR Article 17 justification. Top-K Hole solved by filtering inside SQL, not post-query. |
+| 🔗 **LangChain Integration (Phase 3)** | `PrismMemoryRetriever` and `PrismKnowledgeRetriever` — async-first `BaseRetriever` subclasses that wrap Prism MCP's traced search endpoints. Trace metadata flows automatically into `Document.metadata["trace"]` for LangSmith visibility. |
+| 🧩 **LangGraph Research Agent** | Full example in `examples/langgraph-agent/` — a 5-node agentic research loop with MCP bridge, persistent memory, and `EnsembleRetriever` hybrid search. |
+<details>
+<summary><strong>What's in v2.3.12 — Stability & Fixes</strong></summary>
 | Feature | Description |
 |---|---|
@@ -22,6 +32,8 @@
 | 📝 **Debug Logging** | Gated verbose startup logs behind `PRISM_DEBUG_LOGGING` for a cleaner default experience. |
 | ⚡ **Excess Loading Fixes** | Performance improvements to resolve excess loading loops. |
+</details>
 <details>
 <summary><strong>What's in v2.3.8 — LangGraph Research Agent</strong></summary>
@@ -82,6 +94,9 @@
 | **Auto-Compaction** | ✅ Gemini rollups | ❌ | ❌ | ❌ | ❌ |
 | **Morning Briefing** | ✅ Gemini synthesis | ❌ | ❌ | ❌ | ❌ |
 | **OCC (Concurrency)** | ✅ Version-based | ❌ | ❌ | ❌ | ❌ |
+| **GDPR Compliance** | ✅ Soft/hard delete + audit trail | ❌ | ❌ | ❌ | ❌ |
+| **Memory Tracing** | ✅ MemoryTrace with latency breakdown | ❌ | ❌ | ❌ | ❌ |
+| **LangChain Native** | ✅ BaseRetriever adapters | ❌ | ❌ | ❌ | ❌ |
 | **MCP Native** | ✅ stdio (Claude Desktop, Cursor) | ✅ stdio | ❌ Python SDK / REST | ✅ HTTP + MCP | ✅ stdio |
 | **Language** | TypeScript | TypeScript | Python | Python | Python |
@@ -316,32 +331,39 @@ Verification pattern (same for both clients):
 ```mermaid
 graph TB
     Client["AI Client<br/>(Claude Desktop / Cursor / Windsurf)"]
+    LangChain["LangChain / LangGraph<br/>(Python Retrievers)"]
     MCP["Prism MCP Server<br/>(TypeScript)"]
     Client -- "MCP Protocol (stdio)" --> MCP
+    LangChain -- "JSON-RPC via MCP Bridge" --> MCP
+    MCP --> Tracing["MemoryTrace Engine<br/>Latency + Strategy + Scoring"]
     MCP --> Dashboard["Mind Palace Dashboard<br/>localhost:3000"]
     MCP --> Brave["Brave Search API<br/>Web + Local + AI Answers"]
     MCP --> Gemini["Google Gemini API<br/>Analysis + Briefings"]
     MCP --> Sandbox["QuickJS Sandbox<br/>Code-Mode Templates"]
     MCP --> SyncBus["SyncBus<br/>Agent Telepathy"]
+    MCP --> GDPR["GDPR Engine<br/>Soft/Hard Delete + Audit"]
     MCP --> Storage{"Storage Backend"}
     Storage --> SQLite["SQLite (Local)<br/>libSQL + F32_BLOB vectors"]
     Storage --> Supabase["Supabase (Cloud)<br/>PostgreSQL + pgvector"]
-    SQLite --> Ledger["session_ledger"]
+    SQLite --> Ledger["session_ledger<br/>(+ deleted_at tombstoning)"]
     SQLite --> Handoffs["session_handoffs"]
     SQLite --> History["history_snapshots<br/>(Time Travel)"]
     SQLite --> Media["media vault<br/>(Visual Memory)"]
     style Client fill:#4A90D9,color:#fff
+    style LangChain fill:#1C3D5A,color:#fff
     style MCP fill:#2D3748,color:#fff
+    style Tracing fill:#D69E2E,color:#fff
     style Dashboard fill:#9F7AEA,color:#fff
     style Brave fill:#FB542B,color:#fff
     style Gemini fill:#4285F4,color:#fff
     style Sandbox fill:#805AD5,color:#fff
     style SyncBus fill:#ED64A6,color:#fff
+    style GDPR fill:#E53E3E,color:#fff
     style Storage fill:#2D3748,color:#fff
     style SQLite fill:#38B2AC,color:#fff
     style Supabase fill:#3ECF8E,color:#fff
@@ -390,6 +412,14 @@ graph TB
 |------|---------|----------|---------|
 | `session_health_check` | Scan brain for integrity issues (`fsck`) | `auto_fix` (boolean) | Health report & auto-repairs |
+### v2.5 Enterprise Memory Tools
+| Tool | Purpose | Key Args | Returns |
+|------|---------|----------|---------|
+| `session_forget_memory` | GDPR-compliant deletion (soft/hard) | `memory_id`, `hard_delete`, `reason` | Deletion confirmation + audit |
+| `session_search_memory` | Semantic search with `enable_trace` | `query`, `enable_trace` | Results + `MemoryTrace` in `content[1]` |
+| `knowledge_search` | Knowledge search with `enable_trace` | `query`, `enable_trace` | Results + `MemoryTrace` in `content[1]` |
 ### Code Mode Templates (v2.1)
 Instead of writing custom JavaScript, pass a `template` name for instant extraction:
@@ -409,6 +439,53 @@ Instead of writing custom JavaScript, pass a `template` name for instant extract
 ---
+## LangChain / LangGraph Integration
+Prism MCP includes first-class Python adapters for the LangChain ecosystem, located in `examples/langgraph-agent/`:
+| Component | File | Purpose |
+|-----------|------|---------|
+| **MCP Bridge** | `mcp_client.py` | JSON-RPC 2.0 client with `call_tool()` and `call_tool_raw()` (preserves `MemoryTrace`) |
+| **Semantic Retriever** | `prism_retriever.py` | `PrismMemoryRetriever(BaseRetriever)` — async-first vector search |
+| **Keyword Retriever** | `prism_retriever.py` | `PrismKnowledgeRetriever(BaseRetriever)` — FTS5 keyword search |
+| **Forget Tool** | `tools.py` | `forget_memory()` — GDPR deletion bridge |
+| **Research Agent** | `agent.py` | 5-node LangGraph agent (plan→search→analyze→decide→answer→save) |
+### Hybrid Search with EnsembleRetriever
+Combine both retrievers for hybrid (semantic + keyword) search with a single line:
+```python
+from langchain.retrievers import EnsembleRetriever
+from prism_retriever import PrismMemoryRetriever, PrismKnowledgeRetriever
+retriever = EnsembleRetriever(
+    retrievers=[PrismMemoryRetriever(...), PrismKnowledgeRetriever(...)],
+    weights=[0.7, 0.3],  # 70% semantic, 30% keyword
+)
+```
+### MemoryTrace in LangSmith
+When `enable_trace=True`, each `Document.metadata["trace"]` contains:
+```json
+{
+  "strategy": "vector_cosine_similarity",
+  "latency": { "embedding_ms": 45, "storage_ms": 12, "total_ms": 57 },
+  "result_count": 5,
+  "threshold": 0.7
+}
+```
+This metadata flows automatically into LangSmith traces for observability.
+### Async Architecture
+The retrievers use `_aget_relevant_documents` as the primary path with `asyncio.to_thread()` to wrap the synchronous MCP bridge. This prevents the `RuntimeError: This event loop is already running` crash that plagues most LangGraph deployments.
+---
 ## Environment Variables
 | Variable | Required | Description |
@@ -422,6 +499,7 @@ Instead of writing custom JavaScript, pass a `template` name for instant extract
 | `PRISM_USER_ID` | No | Multi-tenant user isolation (default: `"default"`) |
 | `PRISM_AUTO_CAPTURE` | No | Set `"true"` to auto-capture HTML snapshots of dev servers |
 | `PRISM_CAPTURE_PORTS` | No | Comma-separated ports to scan (default: `3000,3001,5173,8080`) |
+| `PRISM_DEBUG_LOGGING` | No | Set `"true"` to enable verbose debug logs (default: quiet) |
 ---
@@ -590,6 +668,29 @@ Every `session_save_ledger` and `session_save_handoff` automatically extracts ke
 | **By age** | `older_than_days: 30` | Forget entries older than 30 days |
 | **Dry run** | `dry_run: true` | Preview what would be deleted |
+### GDPR-Compliant Deletion (v2.5)
+Prism supports surgical, per-entry deletion for GDPR Article 17 compliance:
+```json
+// Soft delete (tombstone — reversible, keeps audit trail)
+{ "name": "session_forget_memory", "arguments": {
+  "memory_id": "abc123",
+  "reason": "User requested data deletion"
+}}
+// Hard delete (permanent — irreversible)
+{ "name": "session_forget_memory", "arguments": {
+  "memory_id": "abc123",
+  "hard_delete": true
+}}
+```
+**How it works:**
+- **Soft delete** sets `deleted_at = NOW()` + `deleted_reason`. The entry stays in the DB for audit but is excluded from ALL search results (vector, FTS5, and context loading).
+- **Hard delete** physically removes the row. FTS5 triggers auto-clean the full-text index.
+- **Top-K Hole Prevention**: `deleted_at IS NULL` filtering happens INSIDE the SQL query, BEFORE the `LIMIT` clause — so `LIMIT 5` always returns 5 live results, never fewer.
 ---
 ## Supabase Setup (Cloud Mode)
@@ -657,12 +758,12 @@ See [`vertex-ai/`](vertex-ai/) for setup and benchmarks.
 ```
 ├── src/
-│   ├── server.ts                        # MCP server core + Mind Palace HTTP server
+│   ├── server.ts                        # MCP server core + tool routing
 │   ├── config.ts                        # Environment management
 │   ├── storage/
-│   │   ├── interface.ts                 # StorageBackend abstraction
-│   │   ├── sqlite.ts                    # SQLite local storage (libSQL + F32_BLOB)
-│   │   ├── supabase.ts                  # Supabase cloud storage
+│   │   ├── interface.ts                 # StorageBackend abstraction (+ GDPR delete methods)
+│   │   ├── sqlite.ts                    # SQLite local storage (libSQL + F32_BLOB + deleted_at migration)
+│   │   ├── supabase.ts                  # Supabase cloud storage (+ soft/hard delete)
 │   │   └── index.ts                     # Backend factory (auto-selects based on PRISM_STORAGE)
 │   ├── sync/
 │   │   ├── interface.ts                 # SyncBus abstraction (Telepathy)
@@ -675,21 +776,30 @@ See [`vertex-ai/`](vertex-ai/) for setup and benchmarks.
 │   ├── templates/
 │   │   └── codeMode.ts                  # 8 pre-built QuickJS extraction templates
 │   ├── tools/
-│   │   ├── definitions.ts               # All tool schemas (JSON Schema + type guards)
+│   │   ├── definitions.ts               # Search & analysis tool schemas
 │   │   ├── handlers.ts                  # Search & analysis handlers
-│   │   ├── sessionMemoryDefinitions.ts  # Memory + knowledge tool schemas
-│   │   ├── sessionMemoryHandlers.ts     # Memory handlers (OCC, Time Travel, Drift, Briefing)
+│   │   ├── sessionMemoryDefinitions.ts  # Memory tools + GDPR + tracing schemas
+│   │   ├── sessionMemoryHandlers.ts     # Memory handlers (OCC, GDPR, Tracing, Time Travel)
+│   │   ├── compactionHandler.ts         # Gemini-powered ledger compaction
 │   │   └── index.ts                     # Tool registration & re-exports
 │   └── utils/
+│       ├── tracing.ts                   # MemoryTrace types + factory (Phase 1)
+│       ├── logger.ts                    # Debug logging (gated by PRISM_DEBUG_LOGGING)
 │       ├── braveApi.ts                  # Brave Search REST client
 │       ├── googleAi.ts                  # Gemini SDK wrapper
 │       ├── executor.ts                  # QuickJS sandbox executor
 │       ├── autoCapture.ts               # Dev server HTML snapshot utility
-│       ├── healthCheck.ts               # Brain integrity engine (v2.2.0) + security scanner (v2.3.0)
-│       ├── factMerger.ts                # Async LLM contradiction resolution (v2.3.0)
+│       ├── healthCheck.ts               # Brain integrity engine + security scanner
+│       ├── factMerger.ts                # Async LLM contradiction resolution
 │       ├── git.ts                       # Git state capture + drift detection
 │       ├── embeddingApi.ts              # Embedding generation (Gemini)
 │       └── keywordExtractor.ts          # Zero-dependency NLP keyword extraction
+├── examples/langgraph-agent/            # LangChain/LangGraph integration
+│   ├── agent.py                         # 5-node LangGraph research agent
+│   ├── mcp_client.py                    # MCP Bridge (call_tool + call_tool_raw)
+│   ├── prism_retriever.py               # PrismMemoryRetriever + PrismKnowledgeRetriever
+│   ├── tools.py                         # Agent tools + GDPR forget_memory
+│   └── demo_retriever.py                # Standalone retriever demo
 ├── supabase/migrations/                 # Cloud mode SQL schemas
 ├── vertex-ai/                           # Vertex AI hybrid search pipeline
 ├── index.ts                             # Server entry point
@@ -704,4 +814,4 @@ MIT
 ---
-<sub>**Keywords:** MCP server, Model Context Protocol, Claude Desktop memory, persistent session memory, AI agent memory, local-first, SQLite MCP, Mind Palace, time travel, visual memory, agent telepathy, multi-agent sync, reality drift detection, morning briefing, code mode templates, cursor MCP server, windsurf MCP server, cline MCP server, pgvector semantic search, progressive context loading, MCP Prompts, MCP Resources, knowledge management AI, Brave Search MCP, Gemini analysis, optimistic concurrency control, zero config</sub>
+<sub>**Keywords:** MCP server, Model Context Protocol, Claude Desktop memory, persistent session memory, AI agent memory, local-first, SQLite MCP, Mind Palace, time travel, visual memory, agent telepathy, multi-agent sync, reality drift detection, morning briefing, code mode templates, cursor MCP server, windsurf MCP server, cline MCP server, pgvector semantic search, progressive context loading, MCP Prompts, MCP Resources, knowledge management AI, Brave Search MCP, Gemini analysis, optimistic concurrency control, zero config, GDPR compliant, memory tracing, LangChain retriever, LangGraph agent, soft delete, memory lineage, explainability, enterprise AI memory</sub>

package/dist/server.js CHANGED Viewed

@@ -79,7 +79,9 @@ MEMORY_HISTORY_TOOL, MEMORY_CHECKOUT_TOOL,
 // ─── v2.0: Visual Memory tool definitions ───
 SESSION_SAVE_IMAGE_TOOL, SESSION_VIEW_IMAGE_TOOL,
 // ─── v2.2.0: Health Check tool definition ───
-SESSION_HEALTH_CHECK_TOOL, sessionSaveLedgerHandler, sessionSaveHandoffHandler, sessionLoadContextHandler, knowledgeSearchHandler, knowledgeForgetHandler,
+SESSION_HEALTH_CHECK_TOOL,
+// ─── Phase 2: GDPR Memory Deletion tool definition ───
+SESSION_FORGET_MEMORY_TOOL, sessionSaveLedgerHandler, sessionSaveHandoffHandler, sessionLoadContextHandler, knowledgeSearchHandler, knowledgeForgetHandler,
 // ─── v0.4.0: New tool handlers ───
 compactLedgerHandler, sessionSearchMemoryHandler,
 // ─── v2.0: Time Travel handlers ───
@@ -87,7 +89,9 @@ memoryHistoryHandler, memoryCheckoutHandler,
 // ─── v2.0: Visual Memory handlers ───
 sessionSaveImageHandler, sessionViewImageHandler,
 // ─── v2.2.0: Health Check handler ───
-sessionHealthCheckHandler, } from "./tools/index.js";
+sessionHealthCheckHandler,
+// ─── Phase 2: GDPR Memory Deletion handler ───
+sessionForgetMemoryHandler, } from "./tools/index.js";
 // ─── Dynamic Tool Registration ───────────────────────────────────
 // Base tools: always available regardless of configuration
 const BASE_TOOLS = [
@@ -118,6 +122,8 @@ const SESSION_MEMORY_TOOLS = [
     SESSION_VIEW_IMAGE_TOOL, // session_view_image — retrieve image from vault (v2.0)
     // ─── v2.2.0: Health Check tool ───
     SESSION_HEALTH_CHECK_TOOL, // session_health_check — brain integrity checker (v2.2.0)
+    // ─── Phase 2: GDPR Memory Deletion tool ───
+    SESSION_FORGET_MEMORY_TOOL, // session_forget_memory — GDPR-compliant memory deletion (Phase 2)
 ];
 // Combine: always list ALL tools so scanners (Glama, Smithery, MCP Registry)
 // can enumerate the full capability set. Runtime guards in the CallTool handler
@@ -493,6 +499,11 @@ export function createServer() {
                     if (!SESSION_MEMORY_ENABLED)
                         throw new Error("Session memory not configured. Set SUPABASE_URL and SUPABASE_KEY.");
                     return await sessionHealthCheckHandler(args);
+                // ─── Phase 2: GDPR Memory Deletion Tool ───
+                case "session_forget_memory":
+                    if (!SESSION_MEMORY_ENABLED)
+                        throw new Error("Session memory not configured. Set SUPABASE_URL and SUPABASE_KEY.");
+                    return await sessionForgetMemoryHandler(args);
                 default:
                     return {
                         content: [{ type: "text", text: `Unknown tool: ${name}` }],

package/dist/storage/sqlite.js CHANGED Viewed

@@ -152,6 +152,36 @@ export class SqliteStorage {
       CREATE INDEX IF NOT EXISTS idx_history_version
         ON session_handoffs_history(project, version);
     `);
+        // ─── Phase 2 Migration: GDPR Soft Delete Columns ──────────
+        //
+        // SQLITE GOTCHA: Unlike CREATE TABLE IF NOT EXISTS, ALTER TABLE
+        // throws a fatal error if the column already exists. We MUST
+        // wrap each ALTER TABLE in a try/catch and only ignore
+        // "duplicate column name" errors.
+        //
+        // This migration runs on every boot but is idempotent — the
+        // try/catch ensures it's safe to run repeatedly.
+        try {
+            await this.db.execute(`ALTER TABLE session_ledger ADD COLUMN deleted_at TEXT DEFAULT NULL`);
+            debugLog("[SqliteStorage] Phase 2 migration: added deleted_at column");
+        }
+        catch (e) {
+            // "duplicate column name" = column already exists from prior boot.
+            // Any other error is a real problem — rethrow it.
+            if (!e.message?.includes("duplicate column name"))
+                throw e;
+        }
+        try {
+            await this.db.execute(`ALTER TABLE session_ledger ADD COLUMN deleted_reason TEXT DEFAULT NULL`);
+            debugLog("[SqliteStorage] Phase 2 migration: added deleted_reason column");
+        }
+        catch (e) {
+            if (!e.message?.includes("duplicate column name"))
+                throw e;
+        }
+        // Index for fast WHERE deleted_at IS NULL queries.
+        // CREATE INDEX IF NOT EXISTS is safe to run repeatedly (no try/catch needed).
+        await this.db.execute(`CREATE INDEX IF NOT EXISTS idx_ledger_deleted ON session_ledger(deleted_at)`);
     }
     // ─── PostgREST Filter Parser ───────────────────────────────
     //
@@ -341,6 +371,37 @@ export class SqliteStorage {
         });
         return entries;
     }
+    // ─── Phase 2: GDPR-Compliant Memory Deletion ──────────────
+    //
+    // These methods are SURGICAL — they operate on a single entry by ID.
+    // They MUST verify user_id ownership to prevent cross-user deletion.
+    //
+    // softDeleteLedger: Sets deleted_at + deleted_reason. Entry stays in
+    //   DB for audit trail. All search queries filter it out via
+    //   "AND deleted_at IS NULL". Reversible.
+    //
+    // hardDeleteLedger: Physical DELETE. Irreversible. FTS5 triggers
+    //   automatically clean up the full-text index.
+    async softDeleteLedger(id, userId, reason) {
+        // UPDATE (not DELETE): sets tombstone fields while preserving the row.
+        // The JS-side datetime('now') matches SQLite's native format.
+        await this.db.execute({
+            sql: `UPDATE session_ledger
+            SET deleted_at = datetime('now'), deleted_reason = ?
+            WHERE id = ? AND user_id = ?`,
+            args: [reason || null, id, userId],
+        });
+        debugLog(`[SqliteStorage] Soft-deleted ledger entry ${id} (reason: ${reason || "none"})`);
+    }
+    async hardDeleteLedger(id, userId) {
+        // Physical DELETE — row is permanently removed.
+        // FTS5 trigger (ledger_fts_delete) automatically cleans up the index.
+        await this.db.execute({
+            sql: `DELETE FROM session_ledger WHERE id = ? AND user_id = ?`,
+            args: [id, userId],
+        });
+        debugLog(`[SqliteStorage] Hard-deleted ledger entry ${id}`);
+    }
     // ─── Handoff Operations (OCC) ──────────────────────────────
     async saveHandoff(handoff, expectedVersion) {
         // CASE 1: No expectedVersion → UPSERT (create or force-update)
@@ -471,10 +532,11 @@ export class SqliteStorage {
         context.key_context = handoff.key_context;
         if (level === "standard") {
             // Add recent ledger entries as summaries
+            // Phase 2: AND deleted_at IS NULL — exclude soft-deleted entries
             const recentLedger = await this.db.execute({
                 sql: `SELECT summary, decisions, session_date, created_at
               FROM session_ledger
-              WHERE project = ? AND user_id = ? AND archived_at IS NULL
+              WHERE project = ? AND user_id = ? AND archived_at IS NULL AND deleted_at IS NULL
               ORDER BY created_at DESC
               LIMIT 5`,
                 args: [project, userId],
@@ -487,10 +549,11 @@ export class SqliteStorage {
             return context;
         }
         // Deep: add full session history
+        // Phase 2: AND deleted_at IS NULL — exclude soft-deleted entries
         const fullLedger = await this.db.execute({
             sql: `SELECT summary, decisions, files_changed, todos, session_date, created_at
             FROM session_ledger
-            WHERE project = ? AND user_id = ? AND archived_at IS NULL
+            WHERE project = ? AND user_id = ? AND archived_at IS NULL AND deleted_at IS NULL
             ORDER BY created_at DESC
             LIMIT 50`,
             args: [project, userId],
@@ -529,6 +592,7 @@ export class SqliteStorage {
           AND l.project = ?
           AND l.user_id = ?
           AND l.archived_at IS NULL
+          AND l.deleted_at IS NULL
         ORDER BY rank
         LIMIT ?
       `;
@@ -544,6 +608,7 @@ export class SqliteStorage {
         WHERE ledger_fts MATCH ?
           AND l.user_id = ?
           AND l.archived_at IS NULL
+          AND l.deleted_at IS NULL
         ORDER BY rank
         LIMIT ?
       `;
@@ -573,7 +638,7 @@ export class SqliteStorage {
     }
     /** Fallback search using LIKE when FTS5 query syntax fails */
     async searchKnowledgeFallback(params) {
-        const conditions = ["user_id = ?", "archived_at IS NULL"];
+        const conditions = ["user_id = ?", "archived_at IS NULL", "deleted_at IS NULL"];
         const args = [params.userId];
         if (params.project) {
             conditions.push("project = ?");
@@ -626,6 +691,7 @@ export class SqliteStorage {
             AND l.user_id = ?
             AND l.project = ?
             AND l.archived_at IS NULL
+            AND l.deleted_at IS NULL
           ORDER BY similarity DESC
           LIMIT ?
         `;
@@ -640,6 +706,7 @@ export class SqliteStorage {
           WHERE l.embedding IS NOT NULL
             AND l.user_id = ?
             AND l.archived_at IS NULL
+            AND l.deleted_at IS NULL
           ORDER BY similarity DESC
           LIMIT ?
         `;

package/dist/storage/supabase.js CHANGED Viewed

@@ -67,6 +67,41 @@ export class SupabaseStorage {
         const result = await supabaseDelete("session_ledger", params);
         return Array.isArray(result) ? result : [];
     }
+    // ─── Phase 2: GDPR-Compliant Memory Deletion ──────────────
+    //
+    // These methods are SURGICAL — they operate on a single entry by ID.
+    // They MUST verify user_id ownership to prevent cross-user deletion.
+    //
+    // softDeleteLedger: Sets deleted_at + deleted_reason. Entry stays in
+    //   DB for audit trail. Supabase RPCs and TypeScript queries filter
+    //   it out via "WHERE deleted_at IS NULL". Reversible.
+    //
+    // hardDeleteLedger: Physical DELETE. Irreversible. For GDPR Article 17
+    //   "right to erasure" when the audit trail must also be removed.
+    async softDeleteLedger(id, userId, reason) {
+        // PATCH (not DELETE): sets tombstone fields while preserving the row.
+        // The deleted_at timestamp is set server-side for consistency.
+        // deleted_reason captures the GDPR justification (e.g., "User requested",
+        // "Data retention policy", "GDPR Article 17 request").
+        await supabasePatch("session_ledger", {
+            deleted_at: new Date().toISOString(),
+            deleted_reason: reason || null,
+        }, {
+            id: `eq.${id}`,
+            user_id: `eq.${userId}`, // Ownership guard — prevents cross-user deletion
+        });
+        debugLog(`[SupabaseStorage] Soft-deleted ledger entry ${id} (reason: ${reason || "none"})`);
+    }
+    async hardDeleteLedger(id, userId) {
+        // Physical DELETE — row is permanently removed from the database.
+        // This is irreversible. The FTS5 index (if any) is cleaned up by
+        // Supabase's built-in trigger handling.
+        await supabaseDelete("session_ledger", {
+            id: `eq.${id}`,
+            user_id: `eq.${userId}`, // Ownership guard
+        });
+        debugLog(`[SupabaseStorage] Hard-deleted ledger entry ${id}`);
+    }
     // ─── Handoff Operations ────────────────────────────────────
     async saveHandoff(handoff, expectedVersion) {
         // Direct mapping from sessionSaveHandoffHandler line 214

package/dist/tools/index.js CHANGED Viewed

@@ -26,8 +26,8 @@ export { webSearchHandler, braveWebSearchCodeModeHandler, localSearchHandler, br
 // This file always exports them — server.ts decides whether to include them in the tool list.
 //
 // v0.4.0: Added SESSION_COMPACT_LEDGER_TOOL and SESSION_SEARCH_MEMORY_TOOL
-export { SESSION_SAVE_LEDGER_TOOL, SESSION_SAVE_HANDOFF_TOOL, SESSION_LOAD_CONTEXT_TOOL, KNOWLEDGE_SEARCH_TOOL, KNOWLEDGE_FORGET_TOOL, SESSION_COMPACT_LEDGER_TOOL, SESSION_SEARCH_MEMORY_TOOL, MEMORY_HISTORY_TOOL, MEMORY_CHECKOUT_TOOL, SESSION_SAVE_IMAGE_TOOL, SESSION_VIEW_IMAGE_TOOL, SESSION_HEALTH_CHECK_TOOL } from "./sessionMemoryDefinitions.js";
-export { sessionSaveLedgerHandler, sessionSaveHandoffHandler, sessionLoadContextHandler, knowledgeSearchHandler, knowledgeForgetHandler, sessionSearchMemoryHandler, backfillEmbeddingsHandler, memoryHistoryHandler, memoryCheckoutHandler, sessionSaveImageHandler, sessionViewImageHandler, sessionHealthCheckHandler } from "./sessionMemoryHandlers.js";
+export { SESSION_SAVE_LEDGER_TOOL, SESSION_SAVE_HANDOFF_TOOL, SESSION_LOAD_CONTEXT_TOOL, KNOWLEDGE_SEARCH_TOOL, KNOWLEDGE_FORGET_TOOL, SESSION_COMPACT_LEDGER_TOOL, SESSION_SEARCH_MEMORY_TOOL, MEMORY_HISTORY_TOOL, MEMORY_CHECKOUT_TOOL, SESSION_SAVE_IMAGE_TOOL, SESSION_VIEW_IMAGE_TOOL, SESSION_HEALTH_CHECK_TOOL, SESSION_FORGET_MEMORY_TOOL } from "./sessionMemoryDefinitions.js";
+export { sessionSaveLedgerHandler, sessionSaveHandoffHandler, sessionLoadContextHandler, knowledgeSearchHandler, knowledgeForgetHandler, sessionSearchMemoryHandler, backfillEmbeddingsHandler, memoryHistoryHandler, memoryCheckoutHandler, sessionSaveImageHandler, sessionViewImageHandler, sessionHealthCheckHandler, sessionForgetMemoryHandler } from "./sessionMemoryHandlers.js";
 // ── Compaction Handler (v0.4.0 — Enhancement #2) ──
 // The compaction handler is in a separate file because it's significantly
 // more complex than the other session memory handlers (chunked Gemini

package/dist/tools/sessionMemoryDefinitions.js CHANGED Viewed

@@ -114,6 +114,10 @@ export const SESSION_LOAD_CONTEXT_TOOL = {
     },
 };
 // ─── Knowledge Search ─────────────────────────────────────────
+// Phase 1 Change: Added `enable_trace` optional boolean.
+// When true, the handler returns a separate content[1] block with a
+// MemoryTrace object (strategy="keyword", latency, result metadata).
+// Default: false — output is identical to pre-Phase 1 behavior.
 export const KNOWLEDGE_SEARCH_TOOL = {
     name: "knowledge_search",
     description: "Search accumulated knowledge across all sessions by keywords, category, or free text. " +
@@ -144,6 +148,14 @@ export const KNOWLEDGE_SEARCH_TOOL = {
                 description: "Maximum results to return (default: 10, max: 50).",
                 default: 10,
             },
+            // Phase 1: Explainability — when true, appends a MemoryTrace JSON
+            // object as content[1] in the response array.
+            // MCP clients can parse content[1] programmatically for debugging.
+            enable_trace: {
+                type: "boolean",
+                description: "If true, returns a separate MEMORY TRACE content block with search strategy, " +
+                    "latency breakdown, and scoring metadata for explainability. Default: false.",
+            },
         },
     },
 };
@@ -265,6 +277,15 @@ export const SESSION_SEARCH_MEMORY_TOOL = {
                 description: "Minimum similarity score 0-1 (default: 0.7). Higher = more relevant, fewer results.",
                 default: 0.7,
             },
+            // Phase 1: Explainability — when true, appends a MemoryTrace JSON
+            // object as content[1] in the response array. For semantic search,
+            // the trace includes embedding_ms (Gemini API time) vs storage_ms
+            // (pgvector query time) to pinpoint performance bottlenecks.
+            enable_trace: {
+                type: "boolean",
+                description: "If true, returns a separate MEMORY TRACE content block with search strategy, " +
+                    "latency breakdown (embedding vs storage), and scoring metadata. Default: false.",
+            },
         },
         required: ["query"],
     },
@@ -306,6 +327,9 @@ export const SESSION_BACKFILL_EMBEDDINGS_TOOL = {
 export function isKnowledgeForgetArgs(args) {
     return typeof args === "object" && args !== null;
 }
+// Phase 1: Added enable_trace to the type guard.
+// Optional boolean — when true, the handler returns a MemoryTrace content block.
+// Default: false, so existing callers see no change in behavior.
 export function isKnowledgeSearchArgs(args) {
     return typeof args === "object" && args !== null;
 }
@@ -328,6 +352,9 @@ export function isSessionSaveHandoffArgs(args) {
         typeof args.project === "string");
 }
 // ─── v0.4.0: Type guard for semantic search ──────────────────
+// Phase 1: Added enable_trace to the type guard.
+// Optional boolean — when true, a MemoryTrace block (with embedding_ms,
+// storage_ms, top_score, etc.) is appended as content[1] in the response.
 export function isSessionSearchMemoryArgs(args) {
     return (typeof args === "object" &&
         args !== null &&
@@ -500,3 +527,64 @@ export const SESSION_HEALTH_CHECK_TOOL = {
 export function isSessionHealthCheckArgs(args) {
     return typeof args === "object" && args !== null; // any object is valid
 }
+// ─── Phase 2: GDPR-Compliant Memory Deletion Tool ────────────
+//
+// This tool enables SURGICAL deletion of individual memory entries by ID.
+// It supports two modes:
+//   1. Soft Delete (default): Sets deleted_at = NOW(). The entry remains
+//      in the database for audit trails but is excluded from ALL search
+//      queries (both FTS5 and vector). This prevents the Top-K Hole
+//      problem where LIMIT N queries return fewer results than expected.
+//   2. Hard Delete: Physical removal from the database. Irreversible.
+//      Use only when GDPR Article 17 requires complete erasure.
+//
+// DESIGN DECISION: This is intentionally separate from knowledge_forget,
+// which operates on bulk filter criteria (project, category, age).
+// session_forget_memory is surgical — one entry at a time — for
+// precise GDPR compliance.
+export const SESSION_FORGET_MEMORY_TOOL = {
+    name: "session_forget_memory",
+    description: "Forget (delete) a specific memory entry by its ID. " +
+        "Supports two modes:\n\n" +
+        "- **Soft delete** (default): Tombstones the entry — it stays in the database " +
+        "for audit trails but is excluded from all search results. Reversible.\n" +
+        "- **Hard delete**: Permanently removes the entry from the database. Irreversible. " +
+        "Use only when GDPR Article 17 requires complete erasure.\n\n" +
+        "⚠️ Soft delete is recommended for most use cases. The entry can be " +
+        "restored in the future if needed.",
+    inputSchema: {
+        type: "object",
+        properties: {
+            memory_id: {
+                type: "string",
+                description: "The UUID of the memory (ledger) entry to forget. " +
+                    "You can find this ID in search results returned by " +
+                    "session_search_memory or knowledge_search.",
+            },
+            hard_delete: {
+                type: "boolean",
+                description: "If true, permanently removes the entry (irreversible). " +
+                    "If false (default), soft-deletes by setting deleted_at timestamp. " +
+                    "Soft-deleted entries are excluded from searches but remain in the database.",
+            },
+            reason: {
+                type: "string",
+                description: "Optional GDPR Article 17 justification for the deletion. " +
+                    "Examples: 'User requested', 'Data retention policy', 'Outdated information'. " +
+                    "Stored alongside the tombstone for audit trail purposes.",
+            },
+        },
+        required: ["memory_id"],
+    },
+};
+/**
+ * Type guard for session_forget_memory arguments.
+ * Validates that memory_id (required) is present and is a string.
+ * hard_delete and reason are optional.
+ */
+export function isSessionForgetMemoryArgs(args) {
+    return (typeof args === "object" &&
+        args !== null &&
+        "memory_id" in args &&
+        typeof args.memory_id === "string");
+}

package/dist/tools/sessionMemoryHandlers.js CHANGED Viewed

@@ -20,9 +20,17 @@ import { getStorage } from "../storage/index.js";
 import { toKeywordArray } from "../utils/keywordExtractor.js";
 import { generateEmbedding } from "../utils/embeddingApi.js";
 import { getCurrentGitState, getGitDrift } from "../utils/git.js";
+// ─── Phase 1: Explainability & Memory Lineage ────────────────
+// These utilities provide structured tracing metadata for search operations.
+// When `enable_trace: true` is passed to session_search_memory or knowledge_search,
+// a separate MCP content block (content[1]) is returned with a MemoryTrace object
+// containing: strategy, scores, latency breakdown (embedding/storage/total), and metadata.
+// See src/utils/tracing.ts for full type definitions and design decisions.
+import { createMemoryTrace, traceToContentBlock } from "../utils/tracing.js";
 import { GOOGLE_API_KEY, PRISM_USER_ID, PRISM_AUTO_CAPTURE, PRISM_CAPTURE_PORTS } from "../config.js";
 import { captureLocalEnvironment } from "../utils/autoCapture.js";
 import { isSessionSaveLedgerArgs, isSessionSaveHandoffArgs, isSessionLoadContextArgs, isKnowledgeSearchArgs, isKnowledgeForgetArgs, isSessionSearchMemoryArgs, isBackfillEmbeddingsArgs, isMemoryHistoryArgs, isMemoryCheckoutArgs, isSessionHealthCheckArgs, // v2.2.0: health check type guard
+isSessionForgetMemoryArgs, // Phase 2: GDPR-compliant memory deletion type guard
  } from "./sessionMemoryDefinitions.js";
 import { notifyResourceUpdate } from "../server.js";
 // ─── Save Ledger Handler ──────────────────────────────────────
@@ -471,15 +479,43 @@ export async function sessionLoadContextHandler(args) {
 // ─── Knowledge Search Handler ─────────────────────────────────
 /**
  * Searches accumulated knowledge across all past sessions.
+ *
+ * ═══════════════════════════════════════════════════════════════════
+ * PHASE 1 CHANGES (Explainability & Memory Lineage):
+ *
+ * Added `enable_trace` optional parameter (default: false).
+ * When enabled, appends a MemoryTrace content block to the response
+ * with strategy="keyword", timing data, and result metadata.
+ *
+ * TIMING INSTRUMENTATION:
+ *   - totalStart: captured before any work begins
+ *   - storageStart/storageMs: isolates database query time
+ *   - embeddingMs: always 0 for keyword search (no embedding needed)
+ *   - totalMs: end-to-end including keyword extraction overhead
+ *
+ * BACKWARD COMPATIBILITY:
+ *   When enable_trace is false (default), the response is identical
+ *   to the pre-Phase 1 implementation. Zero breaking changes.
+ *
+ * MCP OUTPUT ARRAY:
+ *   content[0] = human-readable search results (unchanged)
+ *   content[1] = machine-readable MemoryTrace JSON (only when enable_trace=true)
+ * ═══════════════════════════════════════════════════════════════════
  */
 export async function knowledgeSearchHandler(args) {
     if (!isKnowledgeSearchArgs(args)) {
         throw new Error("Invalid arguments for knowledge_search");
     }
-    const { project, query, category, limit = 10 } = args;
+    // Phase 1: destructure enable_trace (defaults to false for backward compat)
+    const { project, query, category, limit = 10, enable_trace = false } = args;
     debugLog(`[knowledge_search] Searching: project=${project || "all"}, query="${query || ""}", category=${category || "any"}, limit=${limit}`);
+    // Phase 1: Capture total start time for latency measurement
+    const totalStart = performance.now();
     const searchKeywords = query ? toKeywordArray(query) : [];
     const storage = await getStorage();
+    // Phase 1: Capture storage-specific start time to isolate DB latency
+    // from keyword extraction and other overhead
+    const storageStart = performance.now();
     const data = await storage.searchKnowledge({
         project: project || null,
         keywords: searchKeywords,
@@ -488,27 +524,60 @@ export async function knowledgeSearchHandler(args) {
         limit: Math.min(limit, 50),
         userId: PRISM_USER_ID,
     });
+    const storageMs = performance.now() - storageStart;
+    const totalMs = performance.now() - totalStart;
     if (!data) {
-        return {
-            content: [{
-                    type: "text",
-                    text: `🔍 No knowledge found matching your search.\n` +
-                        (query ? `Query: "${query}"\n` : "") +
-                        (category ? `Category: ${category}\n` : "") +
-                        (project ? `Project: ${project}\n` : "") +
-                        `\nTip: Try session_search_memory for semantic (meaning-based) search ` +
-                        `if keyword search doesn't find what you need.`,
-                }],
-            isError: false,
-        };
-    }
-    return {
-        content: [{
+        // Phase 1: Use contentBlocks array instead of inline object
+        // so we can conditionally push the trace block at content[1]
+        const contentBlocks = [{
                 type: "text",
-                text: `🧠 Found ${data.count} knowledge entries:\n\n${JSON.stringify(data, null, 2)}`,
-            }],
-        isError: false,
-    };
+                text: `🔍 No knowledge found matching your search.\n` +
+                    (query ? `Query: "${query}"\n` : "") +
+                    (category ? `Category: ${category}\n` : "") +
+                    (project ? `Project: ${project}\n` : "") +
+                    `\nTip: Try session_search_memory for semantic (meaning-based) search ` +
+                    `if keyword search doesn't find what you need.`,
+            }];
+        // Phase 1: Append trace block even on empty results — this tells
+        // the developer the search DID execute, it just found nothing.
+        // topScore and threshold are null for keyword search (no scoring system).
+        if (enable_trace) {
+            const trace = createMemoryTrace({
+                strategy: "keyword",
+                query: query || "",
+                resultCount: 0,
+                topScore: null, // keyword search doesn't produce similarity scores
+                threshold: null, // keyword search has no threshold concept
+                embeddingMs: 0, // no embedding needed for keyword search
+                storageMs,
+                totalMs,
+                project: project || null,
+            });
+            contentBlocks.push(traceToContentBlock(trace));
+        }
+        return { content: contentBlocks, isError: false };
+    }
+    // Phase 1: Wrap in contentBlocks array for optional trace attachment
+    const contentBlocks = [{
+            type: "text",
+            text: `🧠 Found ${data.count} knowledge entries:\n\n${JSON.stringify(data, null, 2)}`,
+        }];
+    // Phase 1: Attach MemoryTrace with strategy="keyword" and timing data
+    if (enable_trace) {
+        const trace = createMemoryTrace({
+            strategy: "keyword",
+            query: query || "",
+            resultCount: data.count,
+            topScore: null, // keyword search doesn't produce similarity scores
+            threshold: null, // keyword search has no threshold concept
+            embeddingMs: 0, // no embedding needed for keyword search
+            storageMs,
+            totalMs,
+            project: project || null,
+        });
+        contentBlocks.push(traceToContentBlock(trace));
+    }
+    return { content: contentBlocks, isError: false };
 }
 // ─── Knowledge Forget Handler ─────────────────────────────────
 /**
@@ -581,15 +650,55 @@ export async function knowledgeForgetHandler(args) {
 }
 // ─── Semantic Search Handler ──────────────────────────────────
 /**
- * Searches session history semantically using embeddings.
+ * Searches session history semantically using vector embeddings.
+ *
+ * ═══════════════════════════════════════════════════════════════════
+ * PHASE 1 CHANGES (Explainability & Memory Lineage):
+ *
+ * Added `enable_trace` optional parameter (default: false).
+ * When enabled, appends a MemoryTrace content block to the response.
+ *
+ * TIMING INSTRUMENTATION (3 checkpoints):
+ *   1. totalStart: before any work begins
+ *   2. embeddingStart/embeddingMs: isolates Gemini API call latency
+ *      (this is the most variable — 50ms to 2000ms depending on load)
+ *   3. storageStart/storageMs: isolates pgvector/SQLite query time
+ *
+ * WHY SEPARATE EMBEDDING FROM STORAGE:
+ *   A single latency_ms number is misleading. Example:
+ *   - 500ms total could be 480ms Gemini API + 20ms pgvector
+ *     → Fix: cache embeddings or switch to a faster model
+ *   - 500ms total could be 20ms Gemini API + 480ms pgvector
+ *     → Fix: add an index or reduce vector dimensions
+ *
+ * SCORE BUBBLING:
+ *   The `topScore` in the trace comes from results[0].similarity,
+ *   which is the cosine distance returned by SemanticSearchResult
+ *   (see src/storage/interface.ts L104-112). No storage layer
+ *   modifications were needed — the score was already there.
+ *
+ * MCP OUTPUT ARRAY:
+ *   content[0] = human-readable search results (unchanged)
+ *   content[1] = machine-readable MemoryTrace JSON (only when enable_trace=true)
+ *
+ * BACKWARD COMPATIBILITY:
+ *   When enable_trace is false (default), the response is byte-for-byte
+ *   identical to the pre-Phase 1 implementation. Zero breaking changes.
+ *   Existing tests pass without modification.
+ * ═══════════════════════════════════════════════════════════════════
  */
 export async function sessionSearchMemoryHandler(args) {
     if (!isSessionSearchMemoryArgs(args)) {
         throw new Error("Invalid arguments for session_search_memory");
     }
-    const { query, project, limit = 5, similarity_threshold = 0.7, } = args;
+    const { query, project, limit = 5, similarity_threshold = 0.7,
+    // Phase 1: enable_trace defaults to false for full backward compatibility.
+    // When true, a MemoryTrace JSON block is appended as content[1].
+    enable_trace = false, } = args;
     debugLog(`[session_search_memory] Semantic search: query="${query}", ` +
         `project=${project || "all"}, limit=${limit}, threshold=${similarity_threshold}`);
+    // Phase 1: Start total latency timer BEFORE any work (embedding + storage)
+    const totalStart = performance.now();
     // Step 1: Generate embedding for the search query
     if (!GOOGLE_API_KEY) {
         return {
@@ -603,6 +712,9 @@ export async function sessionSearchMemoryHandler(args) {
         };
     }
     let queryEmbedding;
+    // Phase 1: Start embedding latency timer — isolates Gemini API call time.
+    // This is the most variable component: 50ms on a good day, 2000ms under load.
+    const embeddingStart = performance.now();
     try {
         queryEmbedding = await generateEmbedding(query);
     }
@@ -616,9 +728,15 @@ export async function sessionSearchMemoryHandler(args) {
             isError: true,
         };
     }
+    // Phase 1: Capture embedding API latency
+    const embeddingMs = performance.now() - embeddingStart;
     // Step 2: Search via storage backend
     try {
         const storage = await getStorage();
+        // Phase 1: Start storage latency timer — isolates DB query time.
+        // For Supabase: this measures the pgvector cosine distance RPC call.
+        // For SQLite: this measures the local sqlite-vec similarity search.
+        const storageStart = performance.now();
         const results = await storage.searchMemory({
             queryEmbedding: JSON.stringify(queryEmbedding),
             project: project || null,
@@ -626,20 +744,38 @@ export async function sessionSearchMemoryHandler(args) {
             similarityThreshold: similarity_threshold,
             userId: PRISM_USER_ID,
         });
+        // Phase 1: Capture storage query latency and compute total
+        const storageMs = performance.now() - storageStart;
+        const totalMs = performance.now() - totalStart;
         if (results.length === 0) {
-            return {
-                content: [{
-                        type: "text",
-                        text: `🔍 No semantically similar sessions found for: "${query}"\n` +
-                            (project ? `Project: ${project}\n` : "") +
-                            `Similarity threshold: ${similarity_threshold}\n\n` +
-                            `Tips:\n` +
-                            `• Lower the similarity_threshold (e.g., 0.5) for broader results\n` +
-                            `• Try knowledge_search for keyword-based matching\n` +
-                            `• Ensure sessions have been saved with embeddings (requires GOOGLE_API_KEY)`,
-                    }],
-                isError: false,
-            };
+            // Phase 1: Use contentBlocks array so we can optionally push trace at [1]
+            const contentBlocks = [{
+                    type: "text",
+                    text: `🔍 No semantically similar sessions found for: "${query}"\n` +
+                        (project ? `Project: ${project}\n` : "") +
+                        `Similarity threshold: ${similarity_threshold}\n\n` +
+                        `Tips:\n` +
+                        `• Lower the similarity_threshold (e.g., 0.5) for broader results\n` +
+                        `• Try knowledge_search for keyword-based matching\n` +
+                        `• Ensure sessions have been saved with embeddings (requires GOOGLE_API_KEY)`,
+                }];
+            // Phase 1: Trace is still valuable on empty results — it proves the search
+            // executed and reveals whether the bottleneck was embedding or storage.
+            if (enable_trace) {
+                const trace = createMemoryTrace({
+                    strategy: "semantic",
+                    query,
+                    resultCount: 0,
+                    topScore: null, // no results = no top score
+                    threshold: similarity_threshold,
+                    embeddingMs,
+                    storageMs,
+                    totalMs,
+                    project: project || null,
+                });
+                contentBlocks.push(traceToContentBlock(trace));
+            }
+            return { content: contentBlocks, isError: false };
         }
         // Format results with similarity scores
         const formatted = results.map((r, i) => {
@@ -652,13 +788,33 @@ export async function sessionSearchMemoryHandler(args) {
                 (r.decisions?.length ? `  Decisions: ${r.decisions.join("; ")}\n` : "") +
                 (r.files_changed?.length ? `  Files: ${r.files_changed.join(", ")}\n` : "");
         }).join("\n");
-        return {
-            content: [{
-                    type: "text",
-                    text: `🧠 Found ${results.length} semantically similar sessions:\n\n${formatted}`,
-                }],
-            isError: false,
-        };
+        // Phase 1: content[0] = human-readable results (unchanged from pre-Phase 1)
+        const contentBlocks = [{
+                type: "text",
+                text: `🧠 Found ${results.length} semantically similar sessions:\n\n${formatted}`,
+            }];
+        // Phase 1: content[1] = machine-readable MemoryTrace (only when enable_trace=true)
+        // topScore is read from results[0].similarity — this is the cosine distance
+        // already returned by SemanticSearchResult in the storage interface.
+        // No storage layer modifications were needed ("Score Bubbling" reviewer level-up).
+        if (enable_trace) {
+            const topScore = results.length > 0 && typeof results[0].similarity === "number"
+                ? results[0].similarity
+                : null;
+            const trace = createMemoryTrace({
+                strategy: "semantic",
+                query,
+                resultCount: results.length,
+                topScore,
+                threshold: similarity_threshold,
+                embeddingMs,
+                storageMs,
+                totalMs,
+                project: project || null,
+            });
+            contentBlocks.push(traceToContentBlock(trace));
+        }
+        return { content: contentBlocks, isError: false };
     }
     catch (err) {
         const errorMsg = err instanceof Error ? err.message : String(err);
@@ -1194,3 +1350,90 @@ export async function sessionHealthCheckHandler(args) {
         };
     }
 }
+// ═══════════════════════════════════════════════════════════════
+// Phase 2: GDPR-Compliant Memory Deletion Handler
+// ═══════════════════════════════════════════════════════════════
+//
+// This handler implements the session_forget_memory MCP tool.
+// It provides SURGICAL deletion of individual memory entries by ID,
+// supporting both soft-delete (tombstoning) and hard-delete (physical removal).
+//
+// WHY THIS IS SEPARATE FROM knowledgeForgetHandler:
+//   knowledgeForgetHandler operates on BULK criteria (project, category, age).
+//   sessionForgetMemoryHandler operates on a SINGLE entry by ID.
+//   This surgical approach is required for GDPR Article 17 compliance,
+//   where a data subject requests deletion of specific personal data.
+//
+// THE TOP-K HOLE PROBLEM (Solved):
+//   Without deleted_at filtering inside the database queries (both SQL and RPCs),
+//   a LIMIT 5 query might return 5 rows where 4 are soft-deleted. Post-filtering
+//   in TypeScript would strip them, leaving only 1 result. This destroys the
+//   agent's recall capability. By adding "AND deleted_at IS NULL" to ALL
+//   search queries (done in sqlite.ts and Supabase RPCs), the filtering
+//   happens BEFORE the LIMIT is applied, guaranteeing full Top-K results.
+// ═══════════════════════════════════════════════════════════════
+export async function sessionForgetMemoryHandler(args) {
+    try {
+        // ─── Input Validation ───
+        if (!isSessionForgetMemoryArgs(args)) {
+            return {
+                content: [{
+                        type: "text",
+                        text: "Invalid arguments. Required: memory_id (string). Optional: hard_delete (boolean), reason (string).",
+                    }],
+                isError: true,
+            };
+        }
+        const { memory_id, hard_delete = false, reason } = args;
+        // ─── Get Storage Backend ───
+        const storage = await getStorage();
+        // ─── Execute Deletion ───
+        // The storage methods verify user_id ownership internally,
+        // preventing cross-user deletion attacks.
+        if (hard_delete) {
+            // IRREVERSIBLE: Physical removal from the database.
+            // FTS5 triggers (SQLite) or Supabase cascades clean up indexes.
+            await storage.hardDeleteLedger(memory_id, PRISM_USER_ID);
+            debugLog(`[session_forget_memory] Hard-deleted entry ${memory_id}`);
+            return {
+                content: [{
+                        type: "text",
+                        text: `🗑️ **Hard Deleted** memory entry \`${memory_id}\`.\n\n` +
+                            `This entry has been permanently removed from the database. ` +
+                            `It cannot be recovered. All associated embeddings and FTS indexes ` +
+                            `have been cleaned up.`,
+                    }],
+                isError: false,
+            };
+        }
+        else {
+            // REVERSIBLE: Soft-delete (tombstone) — sets deleted_at + deleted_reason.
+            // The entry remains in the database but is excluded from ALL search
+            // queries (vector, FTS5, and context loading).
+            await storage.softDeleteLedger(memory_id, PRISM_USER_ID, reason);
+            debugLog(`[session_forget_memory] Soft-deleted entry ${memory_id} (reason: ${reason || "none"})`);
+            return {
+                content: [{
+                        type: "text",
+                        text: `🔇 **Soft Deleted** memory entry \`${memory_id}\`.\n\n` +
+                            `The entry has been tombstoned (deleted_at = NOW()). ` +
+                            `It will no longer appear in any search results, but remains ` +
+                            `in the database for audit trail purposes.\n\n` +
+                            (reason ? `📋 **Reason**: ${reason}\n\n` : "") +
+                            `To permanently remove this entry, call again with \`hard_delete: true\`.`,
+                    }],
+                isError: false,
+            };
+        }
+    }
+    catch (error) {
+        console.error(`[session_forget_memory] Error: ${error}`);
+        return {
+            content: [{
+                    type: "text",
+                    text: `Error forgetting memory: ${error instanceof Error ? error.message : String(error)}`,
+                }],
+            isError: true,
+        };
+    }
+}

package/dist/utils/tracing.js ADDED Viewed

@@ -0,0 +1,139 @@
+/**
+ * Memory Trace — Phase 1 Explainability & Lineage
+ *
+ * ═══════════════════════════════════════════════════════════════════
+ * PURPOSE:
+ *   Provides structured tracing metadata for every search/recall
+ *   operation in Prism MCP. When `enable_trace: true` is passed to
+ *   `session_search_memory` or `knowledge_search`, the response
+ *   includes a separate MCP content block with a MemoryTrace object.
+ *
+ * WHY THIS EXISTS:
+ *   Without tracing, developers have no visibility into *why* a
+ *   memory was returned — was it a semantic match? A keyword hit?
+ *   How confident was the score? Was the 500ms latency caused by
+ *   the embedding API or the database query?
+ *
+ *   This module answers all of those questions by providing:
+ *   - strategy: "semantic" | "keyword" → which search path was used
+ *   - top_score: the cosine similarity / relevance score of the best result
+ *   - latency: { embedding_ms, storage_ms, total_ms } → pinpoints bottlenecks
+ *   - result_count, threshold, project, query, timestamp → full context
+ *
+ * DESIGN DECISIONS:
+ *
+ *   1. NO OPENTELEMETRY SDK IN PHASE 1
+ *      We get the data structures right in-memory first. OTel
+ *      integration (W3C traceparent headers, span export to
+ *      Datadog/LangSmith) layers on top in a follow-up without
+ *      any code changes to the MemoryTrace types.
+ *
+ *   2. SEPARATE MCP CONTENT BLOCK (The "Output Array Trick")
+ *      Instead of concatenating trace JSON into the human-readable
+ *      text response (content[0]), we return it as content[1].
+ *
+ *      Why?
+ *      - Prevents LLMs from accidentally blending trace JSON into
+ *        their reasoning (they sometimes try to "interpret" inline JSON)
+ *      - Programmatic MCP clients can grab content[1] directly
+ *        without parsing/splitting string output
+ *      - Clean separation of concerns: content[0] = human-readable,
+ *        content[1] = machine-readable trace metadata
+ *
+ *   3. LATENCY BREAKDOWN (Not just total)
+ *      A single `latency_ms` number is misleading. A 500ms total could
+ *      be 480ms embedding API + 20ms DB, or 20ms embedding + 480ms DB.
+ *      These are very different problems requiring different fixes.
+ *
+ *      We capture three timestamps:
+ *        - Before embedding API call → after = embedding_ms
+ *        - Before storage.searchMemory() → after = storage_ms
+ *        - Start to finish = total_ms (includes overhead, serialization, etc.)
+ *
+ *   4. SCORE BUBBLING (No storage layer changes needed)
+ *      The existing SemanticSearchResult interface (interface.ts L104-112)
+ *      already includes `similarity: number`. We read this directly from
+ *      results[0].similarity — no modifications to the storage layer.
+ *      For keyword search, top_score is null since keyword search doesn't
+ *      return relevance scores in the current implementation.
+ *
+ *   5. BACKWARD COMPATIBILITY
+ *      When `enable_trace` is not set (default: false), the response
+ *      is identical to pre-Phase 1 output. Zero breaking changes.
+ *      Existing tests pass without modification.
+ *
+ * USAGE:
+ *   This module is imported by sessionMemoryHandlers.ts. It is NOT
+ *   imported by the storage layer, server.ts, or any other module.
+ *
+ * FILES THAT IMPORT THIS:
+ *   - src/tools/sessionMemoryHandlers.ts (search handlers)
+ *
+ * RELATED FILES:
+ *   - src/tools/sessionMemoryDefinitions.ts (enable_trace param definition)
+ *   - src/storage/interface.ts (SemanticSearchResult with similarity score)
+ *
+ * FUTURE EXTENSIONS (Phase 1.5+):
+ *   - Add OpenTelemetry span creation using these same trace objects
+ *   - Add `reranked_score` field when re-ranking is implemented
+ *   - Add `graph_hops` field when graph-based recall is added
+ *   - Add PII sanitization flags for GDPR-strict deployments
+ * ═══════════════════════════════════════════════════════════════════
+ */
+// ─── Factory ──────────────────────────────────────────────────
+/**
+ * Create a MemoryTrace object from search operation metrics.
+ *
+ * This is a pure factory function — no side effects, no I/O.
+ * Called by the search handlers after both the embedding API call
+ * and storage query have completed.
+ *
+ * Latency values are rounded to nearest integer for cleaner output
+ * (sub-millisecond precision is noise, not signal).
+ *
+ * @param params.strategy      - "semantic" or "keyword"
+ * @param params.query         - Original search query string
+ * @param params.resultCount   - Number of results returned
+ * @param params.topScore      - Best similarity score, or null for keyword
+ * @param params.threshold     - Threshold used, or null for keyword
+ * @param params.embeddingMs   - Time for embedding API call (0 for keyword)
+ * @param params.storageMs     - Time for database query
+ * @param params.totalMs       - Total end-to-end time
+ * @param params.project       - Project filter, or null for all
+ * @returns A complete MemoryTrace object ready for serialization
+ */
+export function createMemoryTrace(params) {
+    return {
+        strategy: params.strategy,
+        query: params.query,
+        result_count: params.resultCount,
+        top_score: params.topScore,
+        threshold: params.threshold,
+        latency: {
+            embedding_ms: Math.round(params.embeddingMs),
+            storage_ms: Math.round(params.storageMs),
+            total_ms: Math.round(params.totalMs),
+        },
+        timestamp: new Date().toISOString(),
+        project: params.project,
+    };
+}
+/**
+ * Format a MemoryTrace into an MCP content block.
+ *
+ * Returns a single content block to push into the content[] array
+ * at index [1]. The "=== MEMORY TRACE ===" header makes it visually
+ * distinct from the human-readable search results at content[0].
+ *
+ * The trace is pretty-printed (2-space indent) for readability in
+ * console output and MCP inspector tools.
+ *
+ * @param trace - A MemoryTrace object from createMemoryTrace()
+ * @returns An MCP content block: { type: "text", text: "..." }
+ */
+export function traceToContentBlock(trace) {
+    return {
+        type: "text",
+        text: `=== MEMORY TRACE ===\n${JSON.stringify(trace, null, 2)}`,
+    };
+}

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "prism-mcp-server",
-  "version": "2.3.11",
+  "version": "2.5.0",
   "mcpName": "io.github.dcostenco/prism-mcp",
   "description": "The Mind Palace for AI Agents — local-first MCP server with persistent memory (SQLite/Supabase), visual dashboard, time travel, multi-agent sync, Morning Briefings, reality drift detection, code mode templates, semantic vector search, and Brave Search + Gemini analysis. Zero-config local mode.",
   "module": "index.ts",
@@ -80,6 +80,7 @@
     "@google/generative-ai": "^0.24.1",
     "@libsql/client": "^0.17.2",
     "@modelcontextprotocol/sdk": "^1.9.0",
+    "@supabase/supabase-js": "^2.99.3",
     "dotenv": "^16.5.0",
     "quickjs-emscripten": "^0.32.0"
   }