npm - prism-mcp-server - Versions diffs - 5.5.0 → 6.1.8 - Mend

prism-mcp-server 5.5.0 → 6.1.8


Files changed (32)

package/README.md +250 -349
package/dist/backgroundScheduler.js +246 -110
package/dist/config.js +4 -0
package/dist/dashboard/server.js +198 -101
package/dist/dashboard/ui.js +491 -54
package/dist/hivemindWatchdog.js +4 -4
package/dist/lifecycle.js +5 -5
package/dist/scholar/webScholar.js +3 -3
package/dist/server.js +36 -11
package/dist/storage/configStorage.js +1 -1
package/dist/storage/sqlite.js +671 -121
package/dist/storage/supabase.js +187 -12
package/dist/storage/supabaseMigrations.js +75 -1
package/dist/tools/commonHelpers.js +127 -0
package/dist/tools/compactionHandler.js +20 -2
package/dist/tools/graphHandlers.js +706 -0
package/dist/tools/hygieneHandlers.js +476 -0
package/dist/tools/index.js +7 -2
package/dist/tools/ledgerHandlers.js +1240 -0
package/dist/tools/sessionMemoryDefinitions.js +279 -67
package/dist/tools/sessionMemoryHandlers.js +249 -14
package/dist/utils/autoLinker.js +170 -0
package/dist/utils/cognitiveMemory.js +46 -0
package/dist/utils/crdtMerge.js +38 -0
package/dist/utils/imageCaptioner.js +1 -1
package/dist/utils/llm/factory.js +1 -1
package/dist/utils/math.js +10 -0
package/dist/utils/telemetry.js +2 -2
package/dist/utils/turboquant.js +12 -4
package/dist/utils/universalImporter.js +0 -0
package/dist/utils/vaultExporter.js +176 -0
package/package.json +1 -1

package/README.md CHANGED

@@ -10,7 +10,7 @@
 **Your AI agent forgets everything between sessions. Prism fixes that.**
-One command. Persistent memory. Zero cloud dependencies.
+One command. Persistent memory. Local-first by default. Optional cloud power-ups.
 ```bash
 npx -y prism-mcp-server
@@ -22,18 +22,18 @@ Works with **Claude Desktop · Claude Code · Cursor · Windsurf · Cline · Gem
 - [Why Prism?](#why-prism)
 - [Quick Start](#-quick-start)
+- [The Magic Moment](#-the-magic-moment)
 - [Setup Guides](#-setup-guides)
 - [What Makes Prism Different](#-what-makes-prism-different)
 - [Use Cases](#-use-cases)
 - [What's New](#-whats-new)
-- [Autonomous Web Scholar](#-autonomous-web-scholar)
-- [How Prism Compares](#how-prism-compares)
+- [How Prism Compares](#-how-prism-compares)
 - [Tool Reference](#-tool-reference)
 - [Environment Variables](#environment-variables)
 - [Architecture](#architecture)
-- [Research Roadmap](#research-roadmap)
-- [Roadmap](#-roadmap)
-- [Limitations](#-limitations)
+- [Scientific Foundation](#-scientific-foundation)
+- [Product Roadmap](#-product-roadmap)
+- [Limitations](#limitations)
 ---
@@ -61,9 +61,47 @@ Add to your MCP client config (`claude_desktop_config.json`, `.cursor/mcp.json`,
 }
 ```
-**That's it.** Restart your client. All 30+ tools are available. Dashboard at `http://localhost:3000`.
+> **Note on Windows/Restricted Shells:** If your MCP client complains that `npx` is not found, use the absolute path to your node binary (e.g. `C:\Program Files\nodejs\npx.cmd`) or install globally with caution.
-> 🔑 **API Key Requirements:** Need semantic search, Morning Briefings, or auto-compaction? Provide a `GOOGLE_API_KEY` (Gemini) or equivalent. Want Web Scholar to search the live internet? Provide a `BRAVE_API_KEY`. Without keys, Prism still works but falls back to local keyword search (FTS5). See [Environment Variables](#environment-variables).
+**That's it.** Restart your client. All tools are available. Dashboard at `http://localhost:3000`. *(Note: The MCP server automatically starts this UI on port 3000 when connected. If you have a Next.js/React app running, port 3000 might already be in use.)*
+### Capability Matrix
+| Feature | Local (Offline) | Cloud (API Key) |
+|:--------|:---:|:---:|
+| Session memory & handoffs | ✅ | ✅ |
+| Keyword search (FTS5) | ✅ | ✅ |
+| Time travel & versioning | ✅ | ✅ |
+| Mind Palace Dashboard | ✅ | ✅ |
+| GDPR export (JSON/Markdown/Vault) | ✅ | ✅ |
+| Semantic vector search | ❌ | ✅ `GOOGLE_API_KEY` |
+| Morning Briefings | ❌ | ✅ `GOOGLE_API_KEY` |
+| Auto-compaction | ❌ | ✅ `GOOGLE_API_KEY` |
+| Web Scholar research | ❌ | ✅ `BRAVE_API_KEY` + `FIRECRAWL_API_KEY` |
+| VLM image captioning | ❌ | ✅ Provider key |
+> 🔑 The core Mind Palace works **100% offline** with zero API keys. Cloud keys unlock intelligence features. See [Environment Variables](#environment-variables).
+---
+## ✨ The Magic Moment
+> **Session 1** (Monday evening):
+> ```
+> You: "Analyze this auth architecture and plan the OAuth migration."
+> Agent: *deep analysis, decisions, TODO list*
+> Agent: session_save_ledger → session_save_handoff ✅
+> ```
+>
+> **Session 2** (Tuesday morning — new conversation, new context window):
+> ```
+> Agent: session_load_context → "Welcome back! Yesterday we decided to use PKCE
+>        flow with refresh tokens. 3 TODOs remain: migrate the user table,
+>        update the middleware, and write integration tests."
+> You: "Pick up where we left off."
+> ```
+>
+> **Your agent remembers everything.** No re-uploading files. No re-explaining decisions.
 ---
@@ -145,19 +183,19 @@ Add to your Continue `config.json` or Cline MCP settings:
 </details>
-#### Migration
+### Migration
 <details>
 <summary><strong>Migrating Existing History (Claude, Gemini, OpenAI)</strong></summary>
 Prism can ingest months of historical sessions from other tools to give your Mind Palace a massive head start. Import via the **CLI** or directly from the [Mind Palace Dashboard](#-mind-palace-dashboard) Import tab (file picker + manual path + dry-run toggle).
-### Supported Formats
+#### Supported Formats
 * **Claude Code** (`.jsonl` logs) — Automatically handles streaming chunk deduplication and `requestId` normalization.
 * **Gemini** (JSON history arrays) — Supports large-file streaming for 100MB+ exports.
 * **OpenAI** (JSON chat completion history) — Normalizes disparate tool-call structures into the unified Ledger schema.
-### How to Run
+#### How to Run
 **Option 1 — CLI:**
@@ -171,7 +209,7 @@ npx -y prism-mcp-server universal-import --format gemini --path ./gemini_history
 **Option 2 — Dashboard:** Open `localhost:3000`, navigate to the **Import** tab, select the format and file, and click Import. Supports dry-run preview. See the [dashboard screenshot](#-mind-palace-dashboard) above.
-### Key Features
+#### Key Features
 * **OOM-Safe Streaming:** Processes massive log files line-by-line using `stream-json`.
 * **Idempotent Dedup:** Content-hash prevents duplicate imports on re-run (`skipCount` reported).
 * **Chronological Integrity:** Uses timestamp fallbacks and `requestId` sorting to ensure your memory timeline is accurate.
@@ -180,166 +218,23 @@ npx -y prism-mcp-server universal-import --format gemini --path ./gemini_history
 </details>
 <details>
-<summary><strong>Claude Code — Lifecycle Hooks (Auto-Load & Auto-Save)</strong></summary>
-Claude Code supports `SessionStart` and `Stop` hooks that force the agent to load/save Prism context automatically.
-### 1. Create the Hook Script
-Save as `~/.claude/mcp_autoload_hook.py`:
+<summary><strong>Claude Code — Lifecycle Autoload (.clauderules)</strong></summary>
-```python
-#!/usr/bin/env python3
-import json, sys
+Claude Code naturally picks up MCP tools by adding them to your workspace `.clauderules`. Simply add:
-def main():
-    print(json.dumps({
-        "continue": True,
-        "suppressOutput": True,
-        "systemMessage": (
-            "## First Action\n"
-            "Call `mcp__prism-mcp__session_load_context(project='my-project', level='deep')` "
-            "before responding to the user. Do not generate any text before calling this tool."
-        )
-    }))
-if __name__ == "__main__":
-    main()
-```
-### 2. Configure `settings.json`
-```json
-{
-  "hooks": {
-    "SessionStart": [
-      {
-        "matcher": "*",
-        "hooks": [
-          {
-            "type": "command",
-            "command": "python3 /Users/you/.claude/mcp_autoload_hook.py",
-            "timeout": 10
-          }
-        ]
-      }
-    ],
-    "Stop": [
-      {
-        "matcher": "*",
-        "hooks": [
-          {
-            "type": "command",
-            "command": "python3 -c \"import json; print(json.dumps({'continue': True, 'suppressOutput': True, 'systemMessage': 'MANDATORY END WORKFLOW: 1) Call mcp__prism-mcp__session_save_ledger with project and summary. 2) Call mcp__prism-mcp__session_save_handoff with expected_version set to the loaded version.'}))\""
-          }
-        ]
-      }
-    ]
-  },
-  "permissions": {
-    "allow": [
-      "mcp__prism-mcp__session_load_context",
-      "mcp__prism-mcp__session_save_ledger",
-      "mcp__prism-mcp__session_save_handoff",
-      "mcp__prism-mcp__knowledge_search",
-      "mcp__prism-mcp__session_search_memory"
-    ]
-  }
-}
+```markdown
+Always start the conversation by calling `mcp__prism-mcp__session_load_context(project='my-project', level='deep')`.
+When wrapping up, always call `mcp__prism-mcp__session_save_ledger` and `mcp__prism-mcp__session_save_handoff`.
 ```
-### Troubleshooting
-- **Hook not firing?** Check `timeout` — if your script takes too long, Claude ignores it.
-- **"Tool not available"?** This is a hallucination. Ensure `permissions.allow` exactly matches the double-underscore format.
+> **Format Note:** Claude automatically wraps MCP tools with double underscores (`mcp__prism-mcp__...`), while most other clients use single underscores (`mcp_prism-mcp_...`). Prism's backend natively handles both formats seamlessly.
 </details>
 <details id="antigravity-auto-load">
-<summary><strong>Gemini / Antigravity — Three-Layer Auto-Load (Battle-Tested ✅)</strong></summary>
-Gemini-based agents (including Google's Antigravity IDE) use a **three-layer architecture** for reliable auto-load, proven over **14+ iterations** of prompt engineering (March 2026).
+<summary><strong>Gemini / Antigravity — Prompt Auto-Load</strong></summary>
-### Architecture
-| Layer | File | Purpose |
-|-------|------|---------|
-| **1. User Rules** | `~/.gemini/GEMINI.md` | Slim ~10-line directive injected verbatim into system prompt |
-| **2. Cross-Tool Rules** | `~/.gemini/AGENTS.md` | Reinforcement for multi-client setups (Antigravity + Cursor) |
-| **3. Skill** | `.agent/skills/prism-startup/SKILL.md` | Full startup procedure with greeting detection and context echo |
-| **Server Fallback** | Built into `server.ts` (v5.2.1) | Deferred auto-push via `sendLoggingMessage` if model doesn't comply within 10s |
-### Layer 1: User Rules
-Create `~/.gemini/GEMINI.md`:
-```markdown
-# Startup — MANDATORY
-Your first action in every conversation is a tool call. Zero text before it.
-Tool: mcp_prism-mcp_session_load_context
-Args: project="my-project", level="deep"
-After success: echo agent identity, last summary, open TODOs, session version.
-If the call fails: say "Prism load failed — retrying" and try ONE more time.
-```
-### Layer 2: Cross-Tool Reinforcement
-Create `~/.gemini/AGENTS.md`:
-```markdown
-# Session Memory
-Every conversation starts with: mcp_prism-mcp_session_load_context(project="my-project", level="deep")
-Echo result: agent identity, TODOs, session version.
-```
-### Layer 3: Prism Startup Skill
-Create `.agent/skills/prism-startup/SKILL.md` (or `.agents/skills/`) in your project or global config. This is a structured skill file that Antigravity loads with higher priority than plain rules. It includes:
-- Greeting detection (fires on "hi", "hello", etc.)
-- Full tool call instructions with error handling
-- Context echo template (agent identity, TODOs, version)
-- Startup block display
-### Server-Side Fallback (v5.2.1)
-If the model ignores all three layers, Prism's server pushes context automatically:
-1. After storage warmup, a 10-second timer starts
-2. If `session_load_context` hasn't been called by then, the server pushes context via `sendLoggingMessage`
-3. If the client already called the tool, the push is silently skipped (zero impact on Claude CLI)
-This ensures context is always available, even with non-compliant models.
-### Why This Architecture Works
-- **Gemini uses single underscores** for MCP tools (`mcp_prism-mcp_...`) vs Claude's double underscores
-- **Slim rules** (~10 lines) avoid triggering adversarial "tool not found" reasoning
-- **Skills have dedicated 3-level loading** in Antigravity — higher compliance than plain rules
-- **Server fallback** catches the remaining edge cases without affecting well-behaved clients
-- **Positive "First Action" framing** outperforms negative constraint lists
-### Antigravity UI Caveat
-Antigravity **does not visually render MCP tool output blocks** in the chat UI. The tool executes successfully, but the user sees nothing. All three layers instruct the agent to **echo context in its text reply**.
-### Session End Workflow
-Tell the agent: *"Wrap up the session."* It should execute:
-1. `session_save_ledger` — append immutable work log (summary, decisions, files changed)
-2. `session_save_handoff` — upsert project state with `expected_version` for OCC
-> **Tip:** Include session-end instructions in your `GEMINI.md` or ask the agent to save when you're done.
-### Platform Gotchas
-- **`replace_file_content` silently fails** on `~/.gemini/GEMINI.md` in some environments — use `write_to_file` with overwrite instead
-- **Multiple GEMINI.md locations** can conflict: global (`~/.gemini/`), workspace, and User Rules in the Antigravity UI. Keep them synchronized
-- **Camoufox/browser tools** called at startup spawn visible black windows — never call browser tools during greeting handlers
+See the [Gemini Setup Guide](docs/SETUP_GEMINI.md) for the proven three-layer prompt architecture to ensure reliable session auto-loading.
 </details>
@@ -357,7 +252,7 @@ To sync memory across machines or teams:
       "env": {
         "PRISM_STORAGE": "supabase",
         "SUPABASE_URL": "https://your-project.supabase.co",
-        "SUPABASE_KEY": "your-supabase-anon-key"
+        "SUPABASE_KEY": "your-supabase-anon-or-service-key"
       }
     }
   }
@@ -366,6 +261,8 @@ To sync memory across machines or teams:
 See the **Supabase Setup** section below for schema migration instructions.
+> **Anon key vs. service role key:** The anon key works for personal use (Supabase RLS policies apply). Use the service role key for team deployments where multiple users share the same Supabase project — it bypasses RLS and allows Prism to manage all rows regardless of auth context. Never expose the service role key client-side.
 </details>
 <details>
@@ -395,10 +292,32 @@ Then add to your MCP config:
 </details>
+### Common Installation Pitfalls
+> **❌ Don't use `npm install -g`:**
+> Hardcoding the binary path (e.g. `/opt/homebrew/Cellar/node/23.x/bin/prism-mcp-server`) is tied to a specific Node.js version — when Node updates, the path silently breaks.
+>
+> **✅ Always use `npx` instead:**
+> ```json
+> {
+>   "mcpServers": {
+>     "prism-mcp": {
+>       "command": "npx",
+>       "args": ["-y", "prism-mcp-server"]
+>     }
+>   }
+> }
+> ```
+> `npx` resolves the correct binary automatically, always fetches the latest version, and works identically on macOS, Linux, and Windows. Already installed globally? Run `npm uninstall -g prism-mcp-server` first.
+> **❓ Seeing warnings about missing API keys on startup?**
+> That's expected and not an error. `BRAVE_API_KEY` / `GOOGLE_API_KEY` warnings are informational only — core session memory works with zero keys. See [Environment Variables](#environment-variables) for what each key unlocks.
 ---
 ## ✨ What Makes Prism Different
 ### 🧠 Your Agent Learns From Mistakes
 When you correct your agent, Prism tracks it. Corrections accumulate **importance** over time. High-importance lessons auto-surface as warnings in future sessions — and can even sync to your `.cursorrules` file for permanent enforcement. Your agent literally gets smarter the more you use it.
@@ -409,8 +328,8 @@ Every save creates a versioned snapshot. Made a mistake? `memory_checkout` rever
 A gorgeous glassmorphism UI at `localhost:3000` that lets you see exactly what your agent is thinking:
 - **Current State & TODOs** — the exact context injected into the LLM's prompt
-- **Interactive Knowledge Graph** — force-directed neural graph with click-to-filter, node renaming, and surgical keyword deletion *(v5.1)*
-- **Deep Storage Manager** — preview and execute vector purge operations with dry-run safety *(v5.1)*
+- **Interactive Knowledge Graph** — force-directed neural graph with click-to-filter, node renaming, and surgical keyword deletion
+- **Deep Storage Manager** — preview and execute vector purge operations with dry-run safety
 - **Session Ledger** — full audit trail of every decision your agent has made
 - **Time Travel Timeline** — browse and revert any historical handoff version
 - **Visual Memory Vault** — browse VLM-captioned screenshots and auto-captured HTML states
@@ -421,7 +340,7 @@ A gorgeous glassmorphism UI at `localhost:3000` that lets you see exactly what y
 ![Mind Palace Dashboard](docs/mind-palace-dashboard.png)
 ### 🧬 10× Memory Compression
-Powered by a pure TypeScript port of Google's TurboQuant (ICLR 2026), Prism compresses 768-dim embeddings from **3,072 bytes → ~400 bytes** — enabling decades of session history on a standard laptop. No native modules. No vector database required.
+Powered by a pure TypeScript port of Google's TurboQuant (inspired by Google's ICLR research), Prism compresses 768-dim embeddings from **3,072 bytes → ~400 bytes** — enabling decades of session history on a standard laptop. No native modules. No vector database required.
 ### 🐝 Multi-Agent Hivemind
 Multiple agents (dev, QA, PM) can work on the same project with **role-isolated memory**. Agents discover each other automatically, share context in real-time via Telepathy sync, and see a team roster during context loading.
@@ -432,11 +351,11 @@ Save UI screenshots, architecture diagrams, and bug states to a searchable vault
 ### 🔭 Full Observability
 OpenTelemetry spans for every MCP tool call, LLM hop, and background worker. Route to Jaeger, Grafana, or any OTLP collector. Configure in the dashboard — zero code changes.
-## 🌐 Autonomous Web Scholar
-Prism researches while you sleep. A background pipeline searches the web, scrapes articles, synthesizes findings via LLM, and injects results directly into your semantic memory — fully searchable on your next session. [Details below →](#-autonomous-web-scholar)
+### 🌐 Autonomous Web Scholar
+Prism researches while you sleep. A background pipeline searches the web, scrapes articles, synthesizes findings via LLM, and injects results directly into your semantic memory — fully searchable on your next session. Brave Search → Firecrawl scrape → LLM synthesis → Prism ledger. Task-aware, Hivemind-integrated, and zero-config when API keys are missing (falls back to Yahoo + Readability).
 ### 🔒 GDPR Compliant
-Soft/hard delete (Art. 17), full ZIP export (Art. 20), API key redaction, per-project TTL retention, and audit trail. Enterprise-ready out of the box.
+Soft/hard delete (Art. 17), full export in JSON, Markdown, or Obsidian vault `.zip` (Art. 20), API key redaction, per-project TTL retention, and audit trail. Enterprise-ready out of the box.
 ---
@@ -462,24 +381,60 @@ Soft/hard delete (Art. 17), full ZIP export (Art. 20), API key redaction, per-pr
 ## 🆕 What's New
-### v5.5 — Architectural Hardening ✅
-> **Current stable release.** Zero-dependency, production-grade reliability improvements.
-- 🛡️ **Transactional Migrations** — SQLite DDL rebuilds are wrapped in explicit `BEGIN/COMMIT` blocks. A crash mid-migration can no longer corrupt your schema or lose handoff state.
-- 🛑 **Graceful Shutdown Registry** — `BackgroundTaskRegistry` uses a 5-second `Promise.race()` to await all in-flight flushes (embeddings, SDM writes, OTel spans) before the process exits. No more orphaned I/O.
-- 🕰️ **Thundering Herd Prevention** — Maintenance scheduler migrated from `setInterval` to a state-aware recursive `setTimeout`. Expensive compaction routines can never stack on top of each other.
-- 🚀 **Zero-Thrashing SDM Scans** — `Int32Array` scratchpad allocations hoisted outside the hot decode loop. Eliminates V8 GC pressure on large semantic memory banks.
-- 🧪 **368 Tests** — Zero regressions across 17 test suites.
-### v5.4 — Convergent Intelligence
-- 🔄 **CRDT Handoff Merging** — Multi-agent saves no longer reject on version conflict. Custom OR-Map engine auto-merges concurrent edits (Add-Wins for arrays, LWW for scalars).
-- ⏰ **Background Purge Scheduler** — Fully automated storage maintenance: TTL sweep, Ebbinghaus importance decay, auto-compaction, and deep storage purge on a configurable interval.
-- 🌐 **[Autonomous Web Scholar](#-autonomous-web-scholar)** — Agent-driven research pipeline. Brave Search → Firecrawl scrape → LLM synthesis → Prism ledger. Task-aware and Hivemind-integrated.
-- 🐝 **Scholar ↔ Hivemind Integration** — Scholar registers on the Radar, emits heartbeats, and broadcasts Telepathy alerts on completion.
+### v6.1 — Prism-Port, Cognitive Load & Semantic Search ✅
+> **Current stable release (v6.1.8).** Data sovereignty meets active memory intelligence.
+- 📦 **Prism-Port Vault Export** — New `vault` format for `session_export_memory`. Generates a `.zip` of interlinked Markdown files with YAML frontmatter, `[[Wikilinks]]`, and auto-generated `Keywords/` backlink indices. Drop into Obsidian or Logseq for instant knowledge graph.
+- 🏛️ **Dashboard Export Vault Button** — "🏛️ Export Vault" button in the Mind Palace UI exports the full Prism-Port vault ZIP directly from the browser. Both `/api/export` and `/api/export/vault` now use the unified `buildVaultDirectory` path — same rich format as the MCP tool.
+- 🏥 **Dashboard Health Cleanup** — The "Fix Issues" button now repairs missing embeddings directly from the Mind Palace UI.
+- 🧠 **Smart Memory Merge UI** — Dynamically merge duplicate knowledge nodes right from the Graph Editor. "Knowledge Gardening" made effortless.
+- ✨ **Semantic Search Highlighting** — Native RegEx mapping that visually wraps the exact reason a vector result was retrieved during a search.
+- 📊 **Deep Purge Visualization** — A zero-overhead "Memory Density" analytic providing instant signal-to-noise ratio visibility (Graduated ideas vs raw concepts).
+- 🛡️ **Context-Boosted Search** — Biases semantic queries by intelligently interleaving your current project workspace.
+#### v6.1.8 — Type Guard Hardening (Production Safety)
+- 🛡️ **Missing Guard Added** — `isSessionCompactLedgerArgs` was absent; an LLM passing `{threshold: "many"}` would reach the handler as a string. Added full validation for all four optional fields.
+- ✅ **Array Field Validation** — `isSessionSaveLedgerArgs` now guards `todos`, `files_changed`, and `decisions` with `Array.isArray` checks — prevents a hallucinated `{todos: "string"}` from bypassing the type system.
+- 🔖 **Enum Literal Guard** — `isSessionExportMemoryArgs` now rejects any `format` value outside `'json' | 'markdown' | 'vault'` at the boundary instead of propagating to the handler.
+- 🔢 **Numeric Field Guards** — `isSessionIntuitiveRecallArgs` now validates `limit` and `threshold` as numbers, blocking string coercion (`{limit: "many"}`).
+- 🧹 **Legacy Guard Migration** — `isMemoryHistoryArgs`, `isMemoryCheckoutArgs`, and `isSessionSaveImageArgs` migrated to the consistent `Record<string, unknown>` pattern; `isMemoryHistoryArgs` also gains a previously missing `limit` number check.
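Reviewer sketch (not part of the diff): the guard pattern described in the v6.1.8 notes above could look roughly like the following. Only the guard name `isSessionExportMemoryArgs`, the three `format` literals, and the `Record<string, unknown>` idea come from the notes; the `ExportMemoryArgs` shape, the `project` field, and the function body are assumptions made for illustration and may differ from the shipped code.

```typescript
// Hypothetical argument shape. Only `format` and its literals are named in the release notes.
type ExportFormat = "json" | "markdown" | "vault";

interface ExportMemoryArgs {
  project?: string; // assumed optional field, for illustration only
  format?: ExportFormat;
}

const EXPORT_FORMATS: readonly string[] = ["json", "markdown", "vault"];

// Narrow an untrusted tool payload by inspecting it as Record<string, unknown>.
function isSessionExportMemoryArgs(args: unknown): args is ExportMemoryArgs {
  // Reject non-objects, null, and arrays before touching any field.
  if (typeof args !== "object" || args === null || Array.isArray(args)) {
    return false;
  }
  const a = args as Record<string, unknown>;

  // Optional fields must have the right primitive type when present.
  if (a.project !== undefined && typeof a.project !== "string") {
    return false;
  }

  // Enum literal guard: reject any format outside the allowed set.
  if (a.format !== undefined) {
    if (typeof a.format !== "string" || !EXPORT_FORMATS.includes(a.format)) {
      return false;
    }
  }
  return true;
}
```

Rejecting at the boundary means a handler written against `ExportMemoryArgs` never sees a value such as `{format: "pdf"}` and needs no defensive checks of its own.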
+#### v6.1.7 — Dashboard Toggle Persistence
+- 🔄 **Rollback on Save Failure** — `saveSetting()` now returns `Promise<boolean>`; UI toggles (Hivemind, Auto-Capture) roll back their optimistic state if the server request fails.
+- 🚫 **Cache-Busting** — `loadSettings()` appends `?t=<timestamp>` to bypass stale browser/service-worker caches.
+- 🔔 **HTTP Error Detection** — Explicit 4xx/5xx catching in `saveSetting()` surfaces failed saves as user-visible toast notifications.
+#### v6.1.6 — Type Guard Audit (Round 1)
+- 🛡️ **11 Type Guards Hardened** — Audited and refactored all MCP tool argument guards to include explicit `typeof` validation for optional fields, preventing LLM-hallucinated payloads from causing runtime type coercion errors.
+#### v6.1.5 — SQLite Deep Storage TTL
+- 🧪 **Comprehensive Edge-Case Test Suite** — 425 tests across 20 files covering CRDT merges, TurboQuant mathematical invariants, prototype pollution guards, and SQLite retention TTL boundary conditions.
+- 🔒 **Prototype Pollution Guards** — CRDT merge pipeline hardened against `__proto__` / `constructor` injection via `Object.create(null)` scratchpads.
+- 🗜️ **`maintenance_vacuum` Tool** — New tool to reclaim SQLite disk space after large purge operations.
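Reviewer sketch (not part of the diff): the prototype pollution guard mentioned above can be illustrated with a small merge helper. The helper name `safeMerge`, the forbidden-key list, and the shallow last-writer-wins behavior are assumptions for illustration; the notes only state that the CRDT merge pipeline uses `Object.create(null)` scratchpads against `__proto__` / `constructor` injection.

```typescript
// Keys that can reach or replace an object's prototype chain.
const FORBIDDEN_KEYS = new Set<string>(["__proto__", "constructor", "prototype"]);

// Hypothetical shallow merge: later sources win, dangerous keys are dropped.
function safeMerge(
  base: Record<string, unknown>,
  incoming: Record<string, unknown>
): Record<string, unknown> {
  // A null-prototype scratchpad has no inherited setters to hijack.
  const out: Record<string, unknown> = Object.create(null);
  for (const source of [base, incoming]) {
    for (const key of Object.keys(source)) {
      if (FORBIDDEN_KEYS.has(key)) continue; // skip injection attempts
      out[key] = source[key];
    }
  }
  return out;
}
```

A payload produced by `JSON.parse('{"__proto__": {"polluted": true}, "a": 1}')` carries `__proto__` as an own key; the helper copies `a` and discards the rest, so `Object.prototype` is never touched.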
+#### v6.1.4 — Production Hardening
+- 🔒 **Embedding Binary Strip** — Both `embedding` (raw float32) and `embedding_compressed` (TurboQuant binary blob) are now stripped from all export formats, preventing ~400 bytes of raw binary per entry from appearing in vault/JSON exports.
+- 🔗 **Vault Wikilink Fix** — Keyword backlink paths now use vault-relative `Ledger/filename.md` instead of `../Ledger/filename.md` — ensuring correct internal link resolution in Obsidian and Logseq.
+- 🖼️ **Visual Memory Key Fix** — Export correctly reads `filename` and `timestamp` (the keys written by `session_save_image`), resolving a mismatch that produced `"Unknown"` values in the vault visual memory index.
+- 🛡️ **OOM Guard on Large Exports** — `getLedgerEntries` in the export handler now has a 10,000-entry ceiling with explicit `ORDER BY created_at ASC`, preventing unbounded heap allocation on high-volume projects.
+- ⚡ **O(1) Filename Dedup** — Vault filename collision resolution upgraded from O(n²) loop to O(1) `Map<string, number>` counter. Important for projects with many same-day sessions.
+- 🔧 **TurboQuant Guard** — `bits` parameter now validated to `[2, 6]` range at construction time, preventing accidental multi-second Lloyd-Max initialization at higher bit depths.
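Reviewer sketch (not part of the diff): the `Map<string, number>` counter named in the "O(1) Filename Dedup" note could work along these lines. The factory name `createFilenameDeduper` and the `-N` suffix scheme are assumptions for illustration; the notes do not say how the vault exporter actually names colliding files.

```typescript
// Returns a function that hands out a unique filename on every call.
function createFilenameDeduper(): (name: string) => string {
  // Maps each requested name to how many times it has been seen so far.
  const seen = new Map<string, number>();

  return (name: string): string => {
    const count = seen.get(name) ?? 0;
    seen.set(name, count + 1);
    if (count === 0) return name; // first use keeps the original name

    // Later uses get a numeric suffix inserted before the extension.
    const dot = name.lastIndexOf(".");
    const stem = dot > 0 ? name.slice(0, dot) : name;
    const ext = dot > 0 ? name.slice(dot) : "";
    return `${stem}-${count}${ext}`;
  };
}
```

Each call does a single map lookup and update instead of rescanning previously issued names. As a simplification, this sketch does not handle an input that already looks like a generated name (for example a file literally called `note-1.md`).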
+![Prism v6 Features](docs/v6_cognitive_load_dashboard.png)
 <details>
-<summary><strong>Earlier releases (v5.3 and below)</strong></summary>
+<summary><strong>Earlier releases (v5.x and below)</strong></summary>
+#### v5.5 — Architectural Hardening
+- 🛡️ **Transactional Migrations** — SQLite DDL rebuilds are wrapped in explicit `BEGIN/COMMIT` blocks.
+- 🛑 **Graceful Shutdown Registry** — `BackgroundTaskRegistry` uses a 5-second `Promise.race()` to await flushes.
+- 🕰️ **Thundering Herd Prevention** — Maintenance scheduler migrated from `setInterval` to state-aware `setTimeout`.
+- 🚀 **Zero-Thrashing SDM Scans** — `Int32Array` scratchpad allocations hoisted outside the hot decode loop.
+#### v5.4 — Convergent Intelligence
+- 🔄 **CRDT Handoff Merging** — Multi-agent saves no longer reject on version conflict. Custom OR-Map engine auto-merges concurrent edits.
+- ⏰ **Background Purge Scheduler** — Fully automated storage maintenance TTL sweep, Ebbinghaus decay, auto-compaction.
+- 🌐 **Autonomous Web Scholar** — Agent-driven research pipeline. Brave Search → Firecrawl scrape → LLM synthesis.
 - **v5.3** — Hivemind Health Watchdog (state machine, loop detection, Telepathy alert injection)
 - **v5.2** — Cognitive Memory (Ebbinghaus decay, context-weighted retrieval), Universal History Migration, Smart Consolidation
 - **v5.1** — Knowledge Graph Editor, Deep Storage purge
@@ -492,29 +447,27 @@ Soft/hard delete (Art. 17), full ZIP export (Art. 20), API key redaction, per-pr
 ---
-## How Prism Compares
-**Prism MCP** vs [MCP Memory](https://github.com/modelcontextprotocol/servers/tree/main/src/memory) · [Mem0](https://github.com/mem0ai/mem0) · [Mnemory](https://github.com/fpytloun/mnemory) · [Basic Memory](https://github.com/basicmachines-co/basic-memory)
-**Only Prism has all of these:**
-- ✅ Zero config — one `npx` command, no Qdrant/Postgres containers
-- ✅ Time Travel — versioned snapshots with `memory_checkout`
-- ✅ Behavioral memory — importance tracking, auto-decay, mistake learning
-- ✅ Visual dashboard — Mind Palace at localhost:3000
-- ✅ Multi-agent sync — role-isolated Hivemind with real-time Telepathy
-- ✅ CRDT merging — conflict-free concurrent multi-agent edits
-- ✅ Autonomous research — Web Scholar pipeline runs while you sleep
-- ✅ Visual memory — VLM-captioned screenshot vault
-- ✅ Token budgeting — `max_tokens` param on context loading
-- ✅ 10× vector compression — TurboQuant, no external vector DB
-- ✅ Automated maintenance — background scheduler handles TTL, decay, compaction, purge
-- ✅ GDPR compliance — soft/hard delete, ZIP export, TTL retention
-- ✅ OpenTelemetry — full span tracing to Jaeger/Grafana
-- ✅ LangChain adapters — `BaseRetriever` integration + LangGraph examples
-- ✅ Morning Briefings — AI-synthesized action plans after breaks
-- ✅ Auto-compaction — Gemini-powered rollups to prevent unbounded growth
-- ✅ IDE rules sync — graduated insights → `.cursorrules` / `.clauderules`
-- ✅ Air-gapped mode — SQLite + Ollama, zero internet needed
+## 🆚 How Prism Compares
+| Capability | **Prism MCP** | [MCP Memory](https://github.com/modelcontextprotocol/servers/tree/main/src/memory) | [Mem0](https://github.com/mem0ai/mem0) | [Mnemory](https://github.com/fpytloun/mnemory) | [Basic Memory](https://github.com/basicmachines-co/basic-memory) |
+|:-----------|:---:|:---:|:---:|:---:|:---:|
+| Zero-config (`npx` one-liner) | ✅ | ✅ | ❌ Docker | ✅ | ✅ |
+| Time travel (version revert) | ✅ | ❌ | ❌ | ❌ | ❌ |
+| Behavioral memory (mistake learning) | ✅ | ❌ | ❌ | ❌ | ❌ |
+| Visual dashboard | ✅ | ❌ | ✅ Web | ❌ | ❌ |
+| Multi-agent Hivemind | ✅ | ❌ | ❌ | ❌ | ❌ |
+| CRDT conflict-free merging | ✅ | ❌ | ❌ | ❌ | ❌ |
+| Autonomous research (Web Scholar) | ✅ | ❌ | ❌ | ❌ | ❌ |
+| Visual memory (VLM screenshots) | ✅ | ❌ | ❌ | ❌ | ❌ |
+| 10× vector compression | ✅ | ❌ | ❌ (Qdrant) | ❌ | ❌ |
+| Obsidian/Logseq vault export | ✅ | ❌ | ❌ | ❌ | ✅ |
+| Token budgeting | ✅ | ❌ | ❌ | ❌ | ❌ |
+| GDPR compliance (Art. 17 + 20) | ✅ | ❌ | ❌ | ❌ | ❌ |
+| OpenTelemetry tracing | ✅ | ❌ | ❌ | ❌ | ❌ |
+| IDE rules sync (`.cursorrules`) | ✅ | ❌ | ❌ | ❌ | ❌ |
+| Air-gapped mode (Ollama) | ✅ | ❌ | ❌ | ❌ | ❌ |
+| Morning Briefings | ✅ | ❌ | ❌ | ❌ | ❌ |
+| Auto-compaction | ✅ | ❌ | ❌ | ❌ | ❌ |
 > **TL;DR:** Prism is the only MCP memory server with time travel, behavioral learning, autonomous research, CRDT multi-agent sync, and 10× compression — all from a single `npx` command.
@@ -522,6 +475,18 @@ Soft/hard delete (Art. 17), full ZIP export (Art. 20), API key redaction, per-pr
 ## 🔧 Tool Reference
+Prism ships 30+ tools, but **90% of your workflow uses just three:**
+> **🎯 The Big Three**
+>
+> | Tool | When | What it does |
+> |------|------|--------------|
+> | `session_load_context` | ▶️ Start of session | Loads your agent’s brain from last time |
+> | `session_save_ledger` | ⏹️ End of session | Records what was accomplished |
+> | `knowledge_search` | 🔍 Anytime | Finds past decisions, context, and learnings |
+>
+> *Everything else is a power-up. Start with these three and you’re 90% there.*
 <details>
 <summary><strong>Session Memory & Knowledge (12 tools)</strong></summary>
@@ -536,7 +501,7 @@ Soft/hard delete (Art. 17), full ZIP export (Art. 20), API key redaction, per-pr
 | `session_search_memory` | Vector similarity search across all sessions |
 | `session_compact_ledger` | Auto-compact old entries via Gemini summarization |
 | `session_forget_memory` | GDPR-compliant deletion (soft/hard + Art. 17 reason) |
-| `session_export_memory` | Full ZIP export (JSON + Markdown) for portability |
+| `session_export_memory` | Full export (JSON, Markdown, or Obsidian vault `.zip` with `[[Wikilinks]]`) |
 | `session_health_check` | Brain integrity scan + auto-repair (`fsck`) |
 | `deep_storage_purge` | Reclaim ~90% vector storage (v5.1) |
@@ -612,7 +577,7 @@ Requires `PRISM_ENABLE_HIVEMIND=true`.
 | `SUPABASE_URL` | If cloud | Supabase project URL |
 | `SUPABASE_KEY` | If cloud | Supabase anon/service key |
 | `PRISM_USER_ID` | No | Multi-tenant user isolation (default: `"default"`) |
-| `PRISM_AUTO_CAPTURE` | No | `"true"` to auto-snapshot dev servers |
+| `PRISM_AUTO_CAPTURE` | No | `"true"` to auto-snapshot dev server UI states (HTML/DOM) for visual memory |
 | `PRISM_CAPTURE_PORTS` | No | Comma-separated ports (default: `3000,3001,5173,8080`) |
 | `PRISM_DEBUG_LOGGING` | No | `"true"` for verbose logs |
 | `PRISM_DASHBOARD_PORT` | No | Dashboard port (default: `3000`) |
@@ -629,182 +594,118 @@ Requires `PRISM_ENABLE_HIVEMIND=true`.
 ## Architecture
-| Layer | File | Purpose |
-|-------|------|---------|
-| **1. User Rules** | `~/.gemini/GEMINI.md` | Slim ~10-line directive injected verbatim into system prompt |
-| **2. Cross-Tool Rules** | `~/.gemini/AGENTS.md` | Reinforcement for multi-client setups (Antigravity + Cursor) |
-| **3. Skill** | `.agent/skills/prism-startup/SKILL.md` | Full startup procedure with greeting detection and context echo |
-| **Server Fallback** | Built into `server.ts` (v5.2.1) | Deferred auto-push via `sendLoggingMessage` if model doesn't comply within 10s |
-### Layer 1: User Rules
-Create `~/.gemini/GEMINI.md`:
-```markdown
-# Startup — MANDATORY
-Your first action in every conversation is a tool call. Zero text before it.
-Tool: mcp_prism-mcp_session_load_context
-Args: project="my-project", level="deep"
+Prism is a **stdio-based MCP server** that manages persistent agent memory. Here's how the pieces fit together:
-After success: echo agent identity, last summary, open TODOs, session version.
-If the call fails: say "Prism load failed — retrying" and try ONE more time.
 ```
-### Layer 2: Cross-Tool Reinforcement
-Create `~/.gemini/AGENTS.md`:
-```markdown
-# Session Memory
-Every conversation starts with: mcp_prism-mcp_session_load_context(project="my-project", level="deep")
-Echo result: agent identity, TODOs, session version.
+┌──────────────────────────────────────────────────────────┐
+│  MCP Client (Claude Desktop / Cursor / Antigravity)      │
+│                    ↕ stdio (JSON-RPC)                    │
+├──────────────────────────────────────────────────────────┤
+│  Prism MCP Server                                        │
+│                                                          │
+│  ┌──────────────┐  ┌──────────────┐  ┌────────────────┐  │
+│  │  30+ Tools   │  │  Lifecycle   │  │   Dashboard    │  │
+│  │  (handlers)  │  │  (PID lock,  │  │  (HTTP :3000)  │  │
+│  │              │  │   shutdown)  │  │                │  │
+│  └──────┬───────┘  └──────────────┘  └────────────────┘  │
+│         ↕                                                │
+│  ┌────────────────────────────────────────────────────┐  │
+│  │  Storage Engine                                    │  │
+│  │  Local: SQLite + FTS5 + TurboQuant vectors         │  │
+│  │  Cloud: Supabase + pgvector                        │  │
+│  └────────────────────────────────────────────────────┘  │
+│         ↕                                                │
+│  ┌────────────────────────────────────────────────────┐  │
+│  │  Background Workers                                │  │
+│  │  • Scheduler (TTL, decay, compaction, purge)       │  │
+│  │  • Web Scholar (Brave → Firecrawl → LLM → Ledger)  │  │
+│  │  • Hivemind heartbeats & Telepathy broadcasts      │  │
+│  │  • OpenTelemetry span export                       │  │
+│  └────────────────────────────────────────────────────┘  │
+└──────────────────────────────────────────────────────────┘
 ```
-### Layer 3: Prism Startup Skill
-Create `.agent/skills/prism-startup/SKILL.md` (or `.agents/skills/`) in your project or global config. This is a structured skill file that Antigravity loads with higher priority than plain rules. It includes:
-- Greeting detection (fires on "hi", "hello", etc.)
-- Full tool call instructions with error handling
-- Context echo template (agent identity, TODOs, version)
-- Startup block display
-### Server-Side Fallback (v5.2.1)
-If the model ignores all three layers, Prism's server pushes context automatically:
-1. After storage warmup, a 10-second timer starts
-2. If `session_load_context` hasn't been called by then, the server pushes context via `sendLoggingMessage`
-3. If the client already called the tool, the push is silently skipped (zero impact on Claude CLI)
-This ensures context is always available, even with non-compliant models.
-### Why This Architecture Works
-- **Gemini uses single underscores** for MCP tools (`mcp_prism-mcp_...`) vs Claude's double underscores
-- **Slim rules** (~10 lines) avoid triggering adversarial "tool not found" reasoning
-- **Skills have dedicated 3-level loading** in Antigravity — higher compliance than plain rules
-- **Server fallback** catches the remaining edge cases without affecting well-behaved clients
-- **Positive "First Action" framing** outperforms negative constraint lists
-### Antigravity UI Caveat
-Antigravity **does not visually render MCP tool output blocks** in the chat UI. The tool executes successfully, but the user sees nothing. All three layers instruct the agent to **echo context in its text reply**.
-### Session End Workflow
-Tell the agent: *"Wrap up the session."* It should execute:
-1. `session_save_ledger` — append immutable work log (summary, decisions, files changed)
-2. `session_save_handoff` — upsert project state with `expected_version` for OCC
-> **Tip:** Include session-end instructions in your `GEMINI.md` or ask the agent to save when you're done.
-### Platform Gotchas
-- **`replace_file_content` silently fails** on `~/.gemini/GEMINI.md` in some environments — use `write_to_file` with overwrite instead
-- **Multiple GEMINI.md locations** can conflict: global (`~/.gemini/`), workspace, and User Rules in the Antigravity UI. Keep them synchronized
-- **Camoufox/browser tools** called at startup spawn visible black windows — never call browser tools during greeting handlers
-</details>
-<details>
-<summary><strong>Supabase Cloud Sync</strong></summary>
-To sync memory across machines or teams:
-```json
-{
-  "mcpServers": {
-    "prism-mcp": {
-      "command": "npx",
-      "args": ["-y", "prism-mcp-server"],
-      "env": {
-        "PRISM_STORAGE": "supabase",
-        "SUPABASE_URL": "https://your-project.supabase.co",
-        "SUPABASE_KEY": "your-supabase-anon-key"
-      }
-    }
-  }
-}
-```
+### Startup Sequence
-See the **Supabase Setup** section below for schema migration instructions.
+1. **Acquire PID lock** — prevents duplicate instances per `PRISM_INSTANCE`
+2. **Initialize config** — SQLite settings cache (`prism-config.db`)
+3. **Register 30+ MCP tools** — session, knowledge, search, behavioral, hivemind
+4. **Connect stdio transport** — MCP handshake with the client (~60ms total)
+5. **Async post-connect** — storage warmup, dashboard launch, scheduler start (non-blocking)
-</details>
+### Storage Layers
-<details>
-<summary><strong>Clone & Build (Full Control)</strong></summary>
+| Layer | Technology | Purpose |
+|-------|-----------|---------|
+| **Session Ledger** | SQLite (append-only) | Immutable audit trail of all agent work |
+| **Handoff State** | SQLite (upsert, versioned) | Live project context with OCC + CRDT merging |
+| **Keyword Search** | FTS5 virtual tables | Zero-dependency full-text search |
+| **Semantic Search** | TurboQuant compressed vectors | 10× compressed 768-dim embeddings, three-tier retrieval |
+| **Cloud Sync** | Supabase + pgvector | Optional multi-device/team sync |
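
To make the Semantic Search row more concrete: elsewhere this README describes TurboQuant as storing 4-bit quantized 768-dim vectors in under 500 bytes. The sketch below is not TurboQuant itself. It uses plain uniform scalar quantization as a stand-in, only to show how two 4-bit codes fit in one byte and why 768 dimensions come to 384 bytes of codes. All names (`quantize4Bit`, `dequantize4Bit`, `Packed4Bit`) are hypothetical.

```typescript
// Uniform 4-bit scalar quantization: an illustrative stand-in, NOT TurboQuant.
interface Packed4Bit {
  min: number;       // smallest value in the original vector
  scale: number;     // width of one quantization step
  codes: Uint8Array; // two 4-bit codes per byte
  length: number;    // number of original dimensions
}

function quantize4Bit(vec: Float32Array): Packed4Bit {
  let min = Infinity;
  let max = -Infinity;
  for (let i = 0; i < vec.length; i++) {
    if (vec[i] < min) min = vec[i];
    if (vec[i] > max) max = vec[i];
  }
  // 16 levels (0..15) span the range [min, max].
  const scale = max > min ? (max - min) / 15 : 1;
  const codes = new Uint8Array(Math.ceil(vec.length / 2));
  for (let i = 0; i < vec.length; i++) {
    const code = Math.min(15, Math.max(0, Math.round((vec[i] - min) / scale)));
    // Even indices fill the low nibble, odd indices the high nibble.
    codes[i >> 1] |= i % 2 === 0 ? code : code << 4;
  }
  return { min, scale, codes, length: vec.length };
}

function dequantize4Bit(p: Packed4Bit): Float32Array {
  const out = new Float32Array(p.length);
  for (let i = 0; i < p.length; i++) {
    const byte = p.codes[i >> 1];
    const code = i % 2 === 0 ? byte & 0x0f : byte >> 4;
    out[i] = p.min + code * p.scale;
  }
  return out;
}

const vec = new Float32Array(768);
for (let i = 0; i < vec.length; i++) vec[i] = Math.sin(i);
const packed = quantize4Bit(vec);
console.log(packed.codes.byteLength); // 384
```

Each reconstructed value is off by at most half a quantization step; a production quantizer would typically reduce that error further with techniques this sketch does not attempt.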
-```bash
-git clone https://github.com/dcostenco/prism-mcp.git
-cd prism-mcp && npm install && npm run build
-```
+### Auto-Load Architecture
-Then add to your MCP config:
+Each MCP client has its own mechanism for ensuring Prism context loads on session start. See the platform-specific [Setup Guides](#-setup-guides) above for detailed instructions:
-```json
-{
-  "mcpServers": {
-    "prism-mcp": {
-      "command": "node",
-      "args": ["/path/to/prism-mcp/dist/server.js"],
-      "env": {
-        "BRAVE_API_KEY": "your-key",
-        "GOOGLE_API_KEY": "your-gemini-key"
-      }
-    }
-  }
-}
-```
+- **Claude Code** — Lifecycle hooks (`SessionStart` / `Stop`)
+- **Gemini / Antigravity** — Three-layer architecture (User Rules + AGENTS.md + Startup Skill)
+- **Cursor / Windsurf / VS Code** — System prompt instructions
-</details>
+All platforms benefit from the **server-side fallback** (v5.2.1): if `session_load_context` hasn't been called within 10 seconds, Prism auto-pushes context via `sendLoggingMessage`.
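
The fallback rule can be sketched as a small guard object. This is a minimal illustration under stated assumptions, not Prism's actual implementation: the class name `AutoPushGuard`, its methods, and the injected clock values are all hypothetical, and the real server delivers the push through `sendLoggingMessage`.

```typescript
// Hypothetical guard that decides whether the server should push context itself.
class AutoPushGuard {
  private loaded = false;
  private pushed = false;
  private readonly startedAtMs: number;
  private readonly graceMs: number;

  constructor(startedAtMs: number, graceMs: number = 10000) {
    this.startedAtMs = startedAtMs;
    this.graceMs = graceMs;
  }

  // Call when the client invokes session_load_context on its own.
  markLoaded(): void {
    this.loaded = true;
  }

  // Returns true at most once: after the grace period has elapsed,
  // and only if the client never loaded context itself.
  shouldPush(nowMs: number): boolean {
    if (this.loaded || this.pushed) return false;
    if (nowMs - this.startedAtMs < this.graceMs) return false;
    this.pushed = true;
    return true;
  }
}

const guard = new AutoPushGuard(0);
console.log(guard.shouldPush(5000));  // false: still inside the 10-second window
console.log(guard.shouldPush(10000)); // true: window elapsed, client never loaded context
console.log(guard.shouldPush(12000)); // false: already pushed once
```

Passing the current time in as an argument, rather than reading a clock inside the class, keeps the rule easy to test without real timers.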
 ---
-## Research Roadmap
+## 🧬 Scientific Foundation
 Prism is evolving from smart session logging toward a **cognitive memory architecture** — grounded in real research, not marketing.
 | Phase | Feature | Inspired By | Status |
 |-------|---------|-------------|--------|
+| **v5.0** | TurboQuant 10× Compression — 4-bit quantized 768-dim vectors in <500 bytes | Vector quantization (product/residual PQ) | ✅ Shipped |
+| **v5.0** | Three-Tier Search — native → TurboQuant → FTS5 keyword fallback | Cascaded retrieval architectures | ✅ Shipped |
 | **v5.2** | Smart Consolidation — extract principles, not just summaries | Neuroscience sleep consolidation | ✅ Shipped |
 | **v5.2** | Ebbinghaus Importance Decay — memories fade unless reinforced | Ebbinghaus forgetting curve | ✅ Shipped |
 | **v5.2** | Context-Weighted Retrieval — current work biases what surfaces | Contextual memory in cognitive science | ✅ Shipped |
+| **v5.4** | CRDT Handoff Merging — conflict-free multi-agent state via OR-Map engine | CRDTs (Shapiro et al., 2011) | ✅ Shipped |
+| **v5.4** | Autonomous Web Scholar — background research pipeline with LLM synthesis | Autonomous research agents | ✅ Shipped |
 | **v5.5** | SDM Decoder Foundation — pre-allocated typed-array hot loop, zero GC thrash | Kanerva's Sparse Distributed Memory (1988) | ✅ Shipped |
-| **v5.6** | Full Superposed Memory (SDM) — O(1) key-value retrieval via Hamming correlation | Kanerva's SDM | 🔬 In Progress |
-| **v5.6** | Intuitive Recall — proactive surface of relevant past decisions without explicit search | Predictive memory (cognitive science) | 🔬 In Progress |
-| **v6.x** | Affect-Tagged Memory — sentiment shapes what gets recalled | Affect-modulated retrieval (neuroscience) | 🔭 Horizon |
-| **v7+** | Zero-Search Retrieval — no index, no ANN, just ask the vector | Holographic Reduced Representations | 🔭 Horizon |
+| **v5.5** | Architectural Hardening — transactional migrations, graceful shutdown, thundering herd prevention | Production reliability engineering | ✅ Shipped |
+| **v6.1** | Intuitive Recall — proactive surface of relevant past decisions without explicit search; `session_intuitive_recall` tool | Predictive memory (cognitive science) | ✅ Shipped |
+| **v6.2+** | Full Superposed Memory (SDM) — O(1) key-value retrieval via Hamming correlation | Kanerva's SDM | 🔬 In Progress |
+| **v6.1** | Prism-Port Vault Export — Obsidian/Logseq `.zip` with YAML frontmatter & `[[Wikilinks]]` | Data sovereignty, PKM interop | ✅ Shipped |
+| **v6.1** | Cognitive Load & Semantic Search — dynamic graph thinning, search highlights | Contextual working memory | ✅ Shipped |
+| **v6.2** | Synthesize & Prune — automated edge synthesis and visual decay | Implicit associative memory | 🔬 In Progress |
+| **v7.x** | Affect-Tagged Memory — sentiment shapes what gets recalled | Affect-modulated retrieval (neuroscience) | 🔭 Horizon |
+| **v8+** | Zero-Search Retrieval — no index, no ANN, just ask the vector | Holographic Reduced Representations | 🔭 Horizon |
-> Informed by LeCun's "Why AI Systems Don't Learn" (Dupoux, LeCun, Malik — March 2026) and Kanerva's SDM.
+> Informed by LeCun's "Why AI Systems Don't Learn" (Dupoux, LeCun, Malik) and Kanerva's SDM.
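
For the Ebbinghaus Importance Decay row, the classic forgetting curve is usually written as R = exp(-t / S), where t is elapsed time and S is memory stability. The sketch below applies that textbook formula; the stability value, the doubling rule in `reinforce`, and both function names are assumptions for illustration, not Prism's actual decay parameters.

```typescript
// Classic Ebbinghaus-style retention: R = exp(-t / S).
// `stabilityDays` (S) is a hypothetical tuning parameter, not a Prism setting.
function retention(elapsedDays: number, stabilityDays: number): number {
  return Math.exp(-elapsedDays / stabilityDays);
}

// Hypothetical reinforcement rule: each access multiplies stability by a
// fixed factor, so reinforced memories fade more slowly.
function reinforce(stabilityDays: number, factor: number = 2): number {
  return stabilityDays * factor;
}

const fresh = retention(0, 7);                 // 1: nothing forgotten yet
const afterWeek = retention(7, 7);             // e^-1, about 0.368
const reinforced = retention(7, reinforce(7)); // e^-0.5, about 0.607
console.log(fresh, afterWeek.toFixed(3), reinforced.toFixed(3));
```

The comparison between `afterWeek` and `reinforced` is the behavior the table row describes: the same memory, accessed once, retains noticeably more after the same interval.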
 ---
-## 📅 Roadmap
+## 📦 Product Roadmap
 > **[Full ROADMAP.md →](ROADMAP.md)**
-**Shipped — v5.5:**
-- 🛡️ Transactional migrations, graceful shutdown registry, thundering herd prevention, SDM decoder GC optimization
+### v6.2: The "Synthesize & Prune" Phase
+The v6.1 series (through v6.1.8) shipped Prism-Port vault export, Intuitive Recall, full type guard hardening, and dashboard toggle persistence. The v6.2 phase aims to turn collected data into proactive intelligence, moving the dashboard from a passive storage viewer into an active, self-organizing Mind Palace.
-**Next — v5.6:**
-- 🧠 **Full Superposed Memory (SDM)** — O(1) semantic retrieval via Hamming correlation, no ANN index needed
-- 🔮 **Intuitive Recall** — proactive surfacing of relevant past context without explicit `session_search_memory` calls
-- 📊 **Radar 2.0** — richer Hivemind dashboard with agent task graphs and dependency visualization
+1. 🕸️ **Automated Edge Synthesis (The "Dream" Procedure):** A background routine that scans the graph payload for nodes that are semantically similar (by cosine similarity) but not yet connected. It renders these candidate links as ghost edges in the UI, so the system can suggest new mental models on its own instead of waiting for the user to connect the dots manually.
+2. 🗓️ **Temporal Decay Heatmaps (Visualizing the Ebbinghaus Curve):** A UI overlay toggle where un-accessed nodes dynamically desaturate or physically "fade" while Graduated nodes (Score >= 7) stay vibrant longer. This makes the "Deep Purge" decision-making visceral: if the graph looks gray, trigger a learning session or a cleanup.
+3. 📝 **Active Recall Prompt Generation (Knowledge Activation):** A "Test Me" utility in the `nodeEditorPanel`. Using a node's semantic neighbors, the dashboard generates synthetic quizzes to ensure context retention, pushing the product away from pure "storage" into genuine "active learning" capabilities.
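
The edge-synthesis idea in item 1 can be sketched as a pairwise cosine-similarity scan that skips pairs already linked. This is only an illustration of the technique: the `GraphNode` shape, the `suggestEdges` name, the threshold, and the brute-force O(n²) scan are assumptions, not the planned v6.2 implementation.

```typescript
// Hypothetical node shape; embeddings are assumed to have equal length.
interface GraphNode {
  id: string;
  embedding: number[];
}

function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  const denom = Math.sqrt(normA) * Math.sqrt(normB);
  return denom === 0 ? 0 : dot / denom;
}

// Order-independent key so that (a, b) and (b, a) count as the same edge.
function pairKey(a: string, b: string): string {
  return a < b ? a + "|" + b : b + "|" + a;
}

// Returns unlinked node pairs whose similarity meets the threshold, best first.
function suggestEdges(
  nodes: GraphNode[],
  existingEdges: Array<[string, string]>,
  threshold: number
): Array<{ from: string; to: string; score: number }> {
  const linked = new Set(existingEdges.map(([a, b]) => pairKey(a, b)));
  const suggestions: Array<{ from: string; to: string; score: number }> = [];
  for (let i = 0; i < nodes.length; i++) {
    for (let j = i + 1; j < nodes.length; j++) {
      if (linked.has(pairKey(nodes[i].id, nodes[j].id))) continue;
      const score = cosineSimilarity(nodes[i].embedding, nodes[j].embedding);
      if (score >= threshold) {
        suggestions.push({ from: nodes[i].id, to: nodes[j].id, score });
      }
    }
  }
  return suggestions.sort((x, y) => y.score - x.score);
}

const ghostEdges = suggestEdges(
  [
    { id: "a", embedding: [1, 0] },
    { id: "b", embedding: [0.9, 0.1] },
    { id: "c", embedding: [0, 1] },
  ],
  [["a", "c"]],
  0.8
);
console.log(ghostEdges.map((e) => e.from + "->" + e.to)); // [ 'a->b' ]
```

A quadratic scan like this is only reasonable for small graphs; a larger graph would need some form of candidate pruning before scoring.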
 ---
-## ⚠️ Limitations
+## Limitations
 - **LLM-dependent features require an API key.** Semantic search, Morning Briefings, auto-compaction, and VLM captioning need a `GOOGLE_API_KEY` (Gemini) or equivalent provider key. Without one, Prism falls back to keyword-only search (FTS5).
-- **Auto-load is model-dependent.** Session auto-loading relies on the LLM following system prompt instructions. Gemini/Antigravity uses a [three-layer architecture](#antigravity-auto-load) (User Rules + AGENTS.md + Startup Skill) with a v5.2.1 server-side fallback that auto-pushes context if the model doesn't comply within 10 seconds.
+- **Auto-load is model- and client-dependent.** Session auto-loading relies on both the LLM following system prompt instructions *and* the MCP client completing tool registration before the model's first turn. Prism provides platform-specific [Setup Guides](#-setup-guides) and a server-side fallback (v5.2.1) that auto-pushes context after 10 seconds.
+- **MCP client race conditions.** Some MCP clients may not finish tool enumeration before the model generates its first response, causing transient `unknown_tool` errors. This is a client-side timing issue — Prism's server completes the MCP handshake in ~60ms. Workaround: the server-side auto-push fallback and the startup skill's retry logic.
 - **No real-time sync without Supabase.** Local SQLite mode is single-machine only. Multi-device or team sync requires a Supabase backend.
-- **Embedding quality varies by provider.** Gemini `text-embedding-004` and OpenAI `text-embedding-3-small` produce high-quality 768-dim vectors. Ollama embeddings (e.g., `nomic-embed-text`) are usable but may reduce retrieval accuracy.
+- **Embedding quality varies by provider.** Gemini `text-embedding-004` and OpenAI `text-embedding-3-small` produce high-quality 768-dim vectors. For OpenAI models, Prism passes `dimensions: 768` to the embeddings API (native output is 1536-dim; these models are trained with Matryoshka representation learning, so the shortened vectors lose little accuracy and still outperform ada-002 at full 1536 dims). Ollama embeddings (e.g., `nomic-embed-text`) are usable but may reduce retrieval accuracy.
 - **Dashboard is HTTP-only.** The Mind Palace dashboard at `localhost:3000` does not support HTTPS. For remote access, use a reverse proxy (nginx/Caddy) or SSH tunnel. Basic auth is available via `PRISM_DASHBOARD_USER` / `PRISM_DASHBOARD_PASS`.
-- **Migration is one-way.** Universal History Migration imports sessions *into* Prism but does not export back to Claude/Gemini/OpenAI formats. Use `session_export_memory` for portable JSON/Markdown export.
+- **Long-lived clients can accumulate orphaned processes.** MCP clients that run for extended periods (e.g., Claude CLI) may leave orphaned Prism server processes behind. The lifecycle manager detects true orphans (PPID=1) but lets instances whose parent process is still alive coexist. Use `PRISM_INSTANCE` to isolate instances across clients.
+- **Migration is one-way.** Universal History Migration imports sessions *into* Prism but does not export back to Claude/Gemini/OpenAI formats. Use `session_export_memory` for portable JSON/Markdown export, or the `vault` format for Obsidian/Logseq-compatible `.zip` archives.
+- **Export ceiling at 10,000 ledger entries.** The `session_export_memory` tool and the dashboard export button cap vault/JSON exports at 10,000 entries per project as an OOM guard. Projects exceeding this limit should use per-project exports and time-based filtering to stay within the ceiling. This limit does not affect search or context loading.
 - **No Windows CI testing.** Prism is developed and tested on macOS/Linux. It should work on Windows via Node.js, but edge cases (file paths, PID locks) may surface.
 ---