prism-mcp-server 9.13.4 → 10.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (2) hide show
  1. package/README.md +36 -24
  2. package/package.json +1 -1
package/README.md CHANGED
@@ -12,7 +12,7 @@
12
12
 
13
13
  **Your AI agent forgets everything between sessions. Prism fixes that — then teaches it to think.**
14
14
 
15
- Prism v9.13 is a true **Cognitive Architecture** inspired by human brain mechanics. Beyond flat vector search, your agent now forms principles from experience, follows causal trains of thought, and possesses the self-awareness to know when it lacks information. **Your agents don't just remember; they learn.** With v9.13, semantic search works **100% offline** no API keys required.
15
+ Prism v10 is a true **Cognitive Architecture** inspired by human brain mechanics. Beyond flat vector search, your agent now forms principles from experience, follows causal trains of thought, and possesses the self-awareness to know when it lacks information. **Your agents don't just remember; they learn.** With v10, the entire cognitive pipeline — including ledger compaction, task routing, and semantic search runs **100% on-device** via `prism-coder:7b`, a HIPAA-hardened local LLM that underwent 3 rounds of adversarial security review. No API keys. No cloud. No data leaves your machine.
16
16
 
17
17
  ```bash
18
18
  npx -y prism-mcp-server
@@ -125,8 +125,9 @@ Then open `http://localhost:3001` instead.
125
125
  | Mind Palace Dashboard | ✅ | ✅ |
126
126
  | GDPR export (JSON/Markdown/Vault) | ✅ | ✅ |
127
127
  | Semantic vector search | ✅ (`embedding_provider=local`) | ✅ (gemini, openai, or voyage) |
128
+ | **Ledger compaction** | ✅ `prism-coder:7b` via Ollama | ✅ Text provider key |
129
+ | **Task routing (LLM tiebreaker)** | ✅ `prism-coder:7b` via Ollama | N/A (heuristic-only) |
128
130
  | Morning Briefings | ❌ | ✅ Text provider key |
129
- | Auto-compaction | ❌ | ✅ Text provider key |
130
131
  | Web Scholar research | ❌ | ✅ [`BRAVE_API_KEY`](#environment-variables) + [`FIRECRAWL_API_KEY`](#environment-variables) (or `TAVILY_API_KEY`) |
131
132
  | VLM image captioning | ❌ | ✅ Provider key |
132
133
  | Autonomous Pipelines (Dark Factory) | ❌ | ✅ Text provider key |
@@ -554,15 +555,32 @@ Built atop Qwen 2.5 Coder 7B using the MLX framework for Apple Silicon, this eng
554
555
 
555
556
  To guarantee zero-hallucination MCP tool use, it was further aligned using **GRPO (Group Relative Policy Optimization)** with a deterministic reward function that deducts points for missing required parameters or misnaming tools.
556
557
 
557
- **Benchmark Test Results (10-iteration proxy test):**
558
- - **Tool-Call Accuracy:** 33.3%
559
- - **JSON Validity:** 100.0%
558
+ **Benchmark Test Results (1000-iteration Phase 5 Model):**
559
+ - **Tool-Call Accuracy:** 33.3% *(Pending GRPO loop over SFT)*
560
+ - **JSON Validity:** 100.0% *(CoT properly mapping schemas)*
560
561
  - **Parameter Accuracy:** 33.3%
561
- - **Average Latency:** 8.0s (Apple M4 Max, 36GB)
562
- - **Tokens/sec:** 43.7
562
+ - **Average Latency:** 5.4s (Apple M4 Max, 36GB)
563
+ - **Generation Speed:** 45.1 Tokens/sec
563
564
 
564
565
  **Integration**: Run via Ollama natively to power autonomous file operations and session routing entirely within the local host environment.
565
566
 
567
+ #### 🛡️ HIPAA-Grade Security Hardening (v10.0)
568
+
569
+ The prism-coder integration underwent **3 rounds of adversarial security review** treating the reviewer as an attacker with HIPAA compliance, data exfiltration, and system stability as threat vectors. **22 findings identified and closed:**
570
+
571
+ | Defense Layer | What It Prevents |
572
+ |---------------|------------------|
573
+ | **`PRISM_STRICT_LOCAL_MODE`** | Silent cloud fallback — when enabled, compaction throws instead of sending ePHI to Gemini/OpenRouter |
574
+ | **`redirect: "error"`** | SSRF via 3xx redirects to AWS IMDS or internal services |
575
+ | **URL credential redaction** | Passwords in `user:pass@host` URLs stripped from all log paths (startup + per-call) |
576
+ | **Entry-boundary truncation** | Prompt injection via mid-tag XML truncation — payload split at `\n\n` boundaries, never mid-tag |
577
+ | **Full XML escaping** | All 5 XML entities (`& < > " '`) escaped on all user-controlled fields including `id` and `session_date` |
578
+ | **`<task>` boundary tags** | Task description XML-escaped and wrapped in delimiters to prevent routing manipulation |
579
+ | **`setTimeout` cap** | Integer overflow (>2³¹) that silently aborted every local LLM call |
580
+ | **Graceful HIPAA errors** | `try/catch` ensures strict mode returns MCP error response, not server crash |
581
+
582
+ > 🔒 **HIPAA deployment:** Set `PRISM_LOCAL_LLM_ENABLED=true` + `PRISM_STRICT_LOCAL_MODE=true`. Session data will **never** leave the device — even if Ollama crashes.
583
+
566
584
  ### 🖼️ Visual Memory
567
585
  Save UI screenshots, architecture diagrams, and bug states to a searchable vault. Images are auto-captioned by a VLM (Claude Vision / GPT-4V / Gemini) and become semantically searchable across sessions.
568
586
 
@@ -1305,31 +1323,25 @@ Prism MCP is open-source and free for individual developers. For teams and enter
1305
1323
 
1306
1324
  ## 📦 Milestones & Roadmap
1307
1325
 
1308
- > **Current: v9.4.1** — Adversarial Security Hardening & Bidirectional Sync ([CHANGELOG](CHANGELOG.md))
1326
+ > **Current: v10.0.0** — HIPAA-Hardened Local LLM Engine + 3-Round Adversarial Security Audit ([CHANGELOG](CHANGELOG.md))
1309
1327
 
1310
1328
  | Release | Headline |
1311
1329
  |---------|----------|
1312
- | **v9.2.4** | 🔄 Cross-Backend Reconciliationautomatic Supabase SQLite sync on startup, two-layer (handoff + ledger), 5s timeout, 13 tests |
1313
- | **v9.2.3** | 🔧 Code Review Hardening 10x faster split-brain detection, variable shadowing fix, resource leak fix |
1314
- | **v9.2.2** | 🚨 Split-Brain Detection & Prevention`--storage` flag, drift detection, session loader hardening |
1315
- | **v9.2.1** | 💻 CLI Full Feature Parity text mode enrichments, agent identity, PATH fix |
1316
- | **v9.1.0** | 🚦 Task Router v2 file-type routing signal, 6-signal heuristics, local agent streaming buffer |
1317
- | **v9.0.5** | 🔒 JWKS Auth Security Hardening — audience/issuer validation, JWT failure logging, typed agent identity |
1330
+ | **v10.0** | 🛡️ **HIPAA-Hardened Local LLM** `prism-coder:7b` powers compaction + task routing 100% on-device; 22-finding adversarial audit, `PRISM_STRICT_LOCAL_MODE`, SSRF/injection/exfiltration hardening. Zero API keys required. |
1331
+ | **v9.14** | 🧬 Dynamic Hardware Routing & Semantic Tool RAG MLX SFT pipeline, Nomic pruning, GRPO alignment |
1332
+ | **v9.13** | 🔬 Local Embeddings & Zero-API-Key Semantic Search — `nomic-embed-text-v1.5` on-device |
1333
+ | **v9.5** | 🛡️ Adversarial Behavioral Hardening24 forbidden openers, XML anti-tag system, sycophancy defense |
1334
+ | **v9.4** | 🔒 Security Sweepcommand injection, path traversal, CORS, fail-closed rate limiter, bidirectional sync |
1318
1335
  | **v9.0** | 🧠 Autonomous Cognitive OS — Surprisal Gate, Cognitive Budget, Affect-Tagged Memory |
1319
- | **v7.8** | 🧠 Cognitive Architecture — Hebbian consolidation, multi-hop reasoning, rejection gate, dynamic decay |
1320
- | **v7.7** | 🌐 Cloud-Native SSE Transport |
1321
- | **v7.5** | 🩺 Intent Health Dashboard + Security Hardening |
1336
+ | **v7.8** | 🧠 Cognitive Architecture — Hebbian consolidation, multi-hop reasoning, rejection gate |
1322
1337
  | **v7.4** | ⚔️ Adversarial Evaluation (anti-sycophancy) |
1323
- | **v7.3** | 🏭 Dark Factory fail-closed execution |
1324
- | **v7.2** | ✅ Verification Harness |
1325
- | **v7.1** | 🚦 Task Router |
1326
1338
  | **v7.0** | 🧬 ACT-R Activation Memory |
1327
- | **v6.5** | 🔮 HDC Cognitive Routing |
1328
- | **v6.2** | 🧩 Synthesize & Prune |
1329
1339
 
1330
1340
  ### Future Tracks
1331
- - **v7.x: Affect-Tagged Memory** — Recall prioritization improves by weighting memories with affective/contextual valence.
1332
- - **v8+: Zero-Search Retrieval** — Direct vector-addressed recall reduces retrieval indirection.
1341
+ - **v10.1: Semantic Routing** — Replace regex-based task classification with lightweight local embedding model (`all-MiniLM-L6-v2`) for intent-based routing.
1342
+ - **v10.2: Background Task Mutex** — Pause background compaction during active user chat streams to prevent resource contention.
1343
+ - **v10.3: Agent Self-Evaluation** — Local LLM scores its own compaction quality and requests re-compaction when output confidence is low.
1344
+ - **v11+: Zero-Search Retrieval** — Direct vector-addressed recall eliminates retrieval indirection entirely.
1333
1345
 
1334
1346
  👉 **[Full ROADMAP.md →](ROADMAP.md)**
1335
1347
 
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "prism-mcp-server",
3
- "version": "9.13.4",
3
+ "version": "10.0.0",
4
4
  "mcpName": "io.github.dcostenco/prism-mcp",
5
5
  "description": "The Mind Palace for AI Agents — a true Cognitive Architecture with Hebbian learning (episodic→semantic consolidation), ACT-R spreading activation (multi-hop causal reasoning), uncertainty-aware rejection gates (agents that know when they don't know), adversarial evaluation (anti-sycophancy), fail-closed Dark Factory pipelines, persistent memory (SQLite/Supabase), multi-agent Hivemind, time travel & visual dashboard. Zero-config local mode.",
6
6
  "module": "index.ts",