prism-mcp-server 17.1.0 โ†’ 18.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -28,12 +28,12 @@ Ask "what did I decide about the auth flow last month?" and get the answer with
28
28
  ### ๐Ÿงฌ Cognitive routing
29
29
  Different memory types live in different stores: episodic (what happened), semantic (what's true), procedural (how to do X). The router picks where to store and where to retrieve.
30
30
 
31
- ### ๐Ÿ”„ Proactive session drift detection *(new in v15)*
31
+ ### ๐Ÿ”„ Proactive session drift detection *(new in v15, HRR-powered in v17)*
32
32
  Your AI agent can now detect when it has drifted from your original goals โ€” mid-session, automatically โ€” and self-correct before you notice the problem.
33
33
 
34
34
  Three direct Prism calls:
35
35
  1. **`session_save_ledger`** โ€” snapshot current state
36
- 2. **`session_cognitive_route`** โ€” compare current work against original goals, returns `on_track / minor_drift / major_drift`
36
+ 2. **`session_detect_drift`** โ€” HRR-powered semantic comparison of current work vs original goals, returns `on_track / minor_drift / major_drift` with domain-specific signals (BCBA/Coding/AAC)
37
37
  3. **`session_compact_ledger`** โ€” if drifted, compress and reload only what matters
38
38
 
39
39
  When major drift is detected, the alert routes to the **Synalux portal** so it's visible across sessions and devices โ€” not just in the current conversation.
@@ -42,6 +42,19 @@ When major drift is detected, the alert routes to the **Synalux portal** so it's
42
42
 
43
43
  No scripts. No cron. No hooks. Three tool calls, Prism handles the rest.
44
44
 
45
+ ### ๐Ÿ›ก PHI Guard *(new in v17)*
46
+ Automatic Protected Health Information detection and redaction in the memory pipeline. Every `session_save_ledger` and `session_save_handoff` call passes through the PHI guard before storage.
47
+
48
+ **What it catches:** Names, DOBs, SSNs, MRNs, phone numbers, email addresses, and 18 HIPAA identifier categories. Redaction is deterministic (regex + pattern matching, no LLM) โ€” zero false negatives on structured identifiers.
49
+
50
+ **Fail-closed:** PHI detection errors log to stderr (never suppressed) and block the save. Metric: `phi_guard.detected` count per category is always emitted for audit compliance.
51
+
52
+ ### โšก Prompt-based skill routing *(new in v17)*
53
+ 114 agent skills auto-load based on prompt keywords. No manual skill selection needed โ€” the MCP server scans the user's prompt and injects the relevant skill instructions into the session context before the AI responds.
54
+
55
+ ### ๐Ÿ’ฐ Tier enforcement *(new in v17.1)*
56
+ `prism_infer` now enforces subscription-tier gates: model ceiling, max tokens, daily limits, and cloud fallback are all gated by your plan. Free users get local-only inference up to 4b; paid tiers unlock higher models, more tokens, and cloud fallback. Flat-rate seat caps via `max_seats` per plan.
57
+
45
58
  ### ๐Ÿ›ก Local-first โ€” security + speed
46
59
  Free tier runs entirely on your machine โ€” SQLite, local embedding model, no API keys, no cloud. Paid tier adds cloud sync via Synalux portal.
47
60
 
@@ -92,7 +105,7 @@ prism-coder:4b โ”€โ”€ verifies claims โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ–ถ grounde
92
105
  prism-coder:32b โ”€โ”€ deep reasoning โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ–ถ serve (~8s, 19GB, FREE)
93
106
  โ”‚
94
107
  โ–ผ (cloud fallback when local insufficient)
95
- Claude Sonnet 4 โ†’ Claude Opus 4.7 โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ–ถ serve (cloud, ~$0.01/req)
108
+ Claude Sonnet 4 โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ–ถ serve (cloud, ~$0.01/req)
96
109
  ```
97
110
 
98
111
  | Tier | Model | Role | RAM | Latency | Cost |
@@ -100,7 +113,7 @@ Claude Sonnet 4 โ†’ Claude Opus 4.7 โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ–ถ serve (clou
100
113
  | **Default** | prism-coder:14b | Router + general inference | 9 GB | ~3s | $0 |
101
114
  | **Verifier** | prism-coder:4b | Grounding claims check | 2.5 GB | <1s | $0 |
102
115
  | **Complex** | prism-coder:32b | Deep reasoning (on-demand) | 19 GB | ~8s | $0 |
103
- | **Cloud** | Sonnet โ†’ Opus | Fallback for max quality | โ€” | ~5-10s | ~$0.01 |
116
+ | **Cloud** | Claude Sonnet 4 | Fallback for max quality | โ€” | ~5-10s | ~$0.01 |
104
117
 
105
118
  **Mobile / offline cascade** (Prism AAC iOS):
106
119
  ```
@@ -155,7 +168,58 @@ Categories: abstention, adversarial traps, cascade, disambiguation, edge cases,
155
168
  **What it does NOT mean**: these scores measure routing precision on a 17-tool taxonomy, not general intelligence. Claude outperforms on everything outside this task. The value is **offline reliability at zero cost**, not replacing Claude. Code and clinical knowledge come from RAG via `knowledge_search`.
156
169
 
157
170
  ### ๐Ÿ” L3 Grounding Verifier
158
- When `prism_infer` receives an `evidence` payload, the grounding verifier automatically checks the model's response against the provided evidence before returning to the caller. Unverified or hallucinated claims are flagged. This is the third layer (L3) of the cascade โ€” after tool routing (L1) and confidence gating (L2).
171
+
172
+ Fail-closed fact-checking layer. When `prism_infer` receives an `evidence` payload, a separate verifier model (default: `prism-coder:4b`) checks every factual claim in the draft against the evidence before serving it. This is the third layer (L3) of the cascade โ€” after tool routing (L1) and confidence gating (L2).
173
+
174
+ **Three-tier pre-check:**
175
+
176
+ | Tier | Condition | Action |
177
+ |---|---|---|
178
+ | **0 โ€” Conversational** | Draft has no numbers, dates, names, codes, or $ amounts | Serve without verification |
179
+ | **0a โ€” No evidence** | Assertive draft + zero evidence snippets | Refuse (fail-closed) |
180
+ | **2 โ€” NLI** | Assertive draft + evidence provided | Verify each claim against evidence |
181
+
182
+ **Per-claim verdicts:**
183
+ - `ENTAILED` โ€” claim matches evidence (including arithmetic identity: "3" โ‰ˆ "three")
184
+ - `CONTRADICTED` โ€” evidence states a different value for the same fact โ†’ **refuse**
185
+ - `NEUTRAL` โ€” claim not covered by evidence โ†’ **refuse** (fail-closed default)
186
+
187
+ **Fail-closed guarantees:** HTTP errors, malformed JSON, timeouts โ†’ all treated as refusal. The caller gets the specific claim that failed and can retry with more evidence or fall back to cloud.
188
+
189
+ **Usage with `prism_infer`:**
190
+ ```json
191
+ {
192
+ "prompt": "What was the patient's last A1C?",
193
+ "evidence": [
194
+ { "source": "lab_2026-05-01", "content": "HbA1c: 6.8% (ref <7.0)" }
195
+ ]
196
+ }
197
+ ```
198
+
199
+ **Structured output:**
200
+ ```json
201
+ {
202
+ "output": "The patient's last A1C was 6.8%.",
203
+ "verification": {
204
+ "action": "served",
205
+ "claims": [{ "text": "A1C was 6.8%", "verdict": "ENTAILED" }],
206
+ "verifierChain": [{ "model": "prism-coder:4b", "verdict": "ENTAILED", "latencyMs": 340 }]
207
+ }
208
+ }
209
+ ```
210
+
211
+ When a claim is contradicted or unsupported:
212
+ ```json
213
+ {
214
+ "output": "โš  Verification failed: claim 'A1C was 7.2%' is CONTRADICTED by evidence.",
215
+ "verification": {
216
+ "action": "refused_fabricated",
217
+ "refusalClaim": "A1C was 7.2%"
218
+ }
219
+ }
220
+ ```
221
+
222
+ The verifier model (`prism-coder:4b`) is intentionally different from the inference model โ€” satisfying the independent-reviewer principle. Requires a paid plan (see [Plans](#plans)). Set `verify: false` to explicitly skip verification even when evidence is provided.
159
223
 
160
224
  ### ๐Ÿง  HRR Semantic Drift Detection (v17.0)
161
225
  Detects when long AI agent sessions drift from their original goal โ€” using Holographic Reduced Representations for temporal trajectory encoding and anomaly detection.
@@ -214,6 +278,74 @@ Multiple AI agents share the same Mind Palace. Each agent has a role (dev / qa /
214
278
 
215
279
  ---
216
280
 
281
+ ## Why Prism Coder
282
+
283
+ ### vs AI coding assistants
284
+
285
+ | Feature | Prism Coder | GitHub Copilot | Cursor | Windsurf | Amazon Q | Tabnine | Devin |
286
+ |---|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
287
+ | Local inference (1.7Bโ€“32B) | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ | โŒ |
288
+ | Works offline (local-only mode) | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ | โŒ |
289
+ | Open-weight models (HuggingFace) | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ | โŒ |
290
+ | Data stays on machine (local tier) | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ | โŒ |
291
+ | Persistent cross-session memory | โœ… | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ |
292
+ | Cognitive routing (episodic/semantic) | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ | โŒ |
293
+ | Session drift detection (HRR) | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ | โŒ |
294
+ | L3 grounding verifier | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ | โŒ |
295
+ | Multi-agent hivemind | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ | โŒ |
296
+ | MCP server (tools + memory for agents) | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ | โŒ |
297
+ | Cloud fallback (14b โ†’ 32b โ†’ Sonnet) | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ | โŒ |
298
+ | Web IDE | โœ… | โœ… | โŒ | โŒ | โœ… | โŒ | โœ… |
299
+ | VS Code extension | โœ… | โœ… | โŒ | โŒ | โœ… | โœ… | โŒ |
300
+ | HIPAA / air-gapped ready | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ | โŒ |
301
+ | Flat-rate pricing (not per-seat) | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ | โŒ |
302
+
303
+ ### vs local AI tools
304
+
305
+ | Feature | Prism Coder | Ollama | LM Studio | Jan.ai | Mem0 | Zep |
306
+ |---|:---:|:---:|:---:|:---:|:---:|:---:|
307
+ | Local inference (1.7Bโ€“32B cascade) | โœ… | โœ… | โœ… | โœ… | โŒ | โŒ |
308
+ | Automatic cloud fallback | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ |
309
+ | Persistent cross-session memory | โœ… | โŒ | โŒ | โŒ | โœ… | โœ… |
310
+ | Knowledge ingestion (MCP + webhook + REST) | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ |
311
+ | Cognitive routing (3-store) | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ |
312
+ | L3 grounding verifier | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ |
313
+ | Session drift detection | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ |
314
+ | Native MCP server | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ |
315
+ | Web IDE + VS Code extension | โœ… | โŒ | โŒ | โŒ | โŒ | โŒ |
316
+ | Analytics dashboard | โœ… | โŒ | โŒ | โŒ | โœ… | โœ… |
317
+
318
+ ### Pricing โ€” flat-rate, not per-seat
319
+
320
+ | | **Prism Coder** | GitHub Copilot | Cursor | Windsurf | Amazon Q | Tabnine |
321
+ |---|:---:|:---:|:---:|:---:|:---:|:---:|
322
+ | **Individual** | **$19/mo** | $10/mo | $20/mo | $15โ€“20/mo | $19/mo | $39/mo |
323
+ | **Team (5 devs)** | **$49/mo flat** | $95/mo | $200/mo | $200/mo | $95/mo | $295/mo |
324
+ | **Enterprise (25 devs)** | **$99/mo flat** | $195/mo | $1,000/mo | Custom | Custom | Custom |
325
+ | **Cost per dev (team)** | **$9.80** | $19 | $40 | $40 | $19 | $59 |
326
+ | **Annual savings (5 devs)** | โ€” | **$552** | **$1,812** | **$1,812** | **$552** | **$2,952** |
327
+
328
+ ---
329
+
330
+ ## Plans
331
+
332
+ | | **Free** | **Standard $19/mo** | **Advanced $49/mo** | **Enterprise $99/mo** |
333
+ |---|---|---|---|---|
334
+ | **Seats included** | 1 | 1 | up to 5 | up to 25 |
335
+ | **Local model ceiling** | up to 4b | up to 14b | up to 32b | up to 32b |
336
+ | **Daily inference limit** | 50 | 200 | 2,000 | 100,000 |
337
+ | **Max output tokens** | 512 | 1,024 | 2,048 | 4,096 |
338
+ | **Cloud fallback** | โ€” | Claude Sonnet 4 | Claude Sonnet 4 | Priority + Sonnet 4 |
339
+ | **L3 grounding verifier** | โ€” | โœ… | โœ… | โœ… |
340
+ | **Knowledge search** | limited | unlimited | unlimited | unlimited |
341
+ | **Session memory** | limited | unlimited | unlimited | unlimited |
342
+ | **Analytics dashboard** | โ€” | โœ… | โœ… | โœ… |
343
+ | **HIPAA BAA** | โ€” | โ€” | โ€” | โœ… |
344
+
345
+ All on-device models are open-weight and free to run locally via Ollama. The subscription gates cloud features, higher model tiers, and increased limits. Need 25+ seats? [Contact sales](https://synalux.ai/contact). 14-day free trial on all paid plans. [Subscribe โ†’](https://synalux.ai/pricing)
346
+
347
+ ---
348
+
217
349
  ## Get started
218
350
 
219
351
  ```bash
@@ -298,6 +430,7 @@ Paid Synalux subscribers get a built-in analytics dashboard at `/app/memory-anal
298
430
  | `knowledge_search` | Semantic + keyword search over all memories |
299
431
  | `query_memory_natural` | Natural-language Q&A over your Mind Palace |
300
432
  | `extract_entities` | Pull people / projects / decisions from text |
433
+ | `session_detect_drift` | HRR-powered semantic drift detection (BCBA/Coding/AAC) |
301
434
  | `session_synthesize_edges` | Auto-link related memories into a graph |
302
435
 
303
436
  (35+ tools total โ€” full TypeScript signatures in `src/tools/`. Architecture overview in [`docs/ARCHITECTURE.md`](docs/ARCHITECTURE.md).)
@@ -502,7 +635,7 @@ ollama pull dcostenco/prism-coder:32b
502
635
 
503
636
  Set `LOCAL_LLM_URL=http://localhost:11434` in your portal config. Routing is automatic:
504
637
 
505
- **Desktop/server**: 14B โ†’ 32B โ†’ Claude Opus fallback ยท **Mobile/offline**: 14B โ†’ 8B โ†’ 1.7B
638
+ **Desktop/server**: 14B โ†’ 32B โ†’ Claude Sonnet 4 fallback ยท **Mobile/offline**: 14B โ†’ 8B โ†’ 1.7B
506
639
 
507
640
  iOS/mobile on same WiFi: `OLLAMA_HOST=0.0.0.0 ollama serve` on the Mac, then point `LOCAL_LLM_URL` at the Mac's IP.
508
641
  Routing accuracy (May 2026, v36/v7 system prompt, 3-seed mean): 32B v7 = **100.0%** ยท 8B v36 = **100.0%** ยท 14B v36 = **100.0%** ยท 1.7B v42 = **100.0%**
@@ -510,104 +643,6 @@ Cascade (14Bโ†’32B): **100.0%** ยท Opus solo: 98.3% ยท Opus engaged: **0% of req
510
643
 
511
644
  ---
512
645
 
513
- ## Plans
514
-
515
- | | **Free** | **Standard $19/mo** | **Advanced $49/mo** | **Enterprise $99/mo** |
516
- |---|---|---|---|---|
517
- | **Local model ceiling** | up to 4b | up to 14b | up to 32b | up to 32b |
518
- | **Daily inference limit** | 50 | 200 | 2,000 | 100,000 |
519
- | **Max output tokens** | 512 | 1,024 | 2,048 | 4,096 |
520
- | **Cloud fallback** | โ€” | Portal cascade (14b โ†’ 32b) | Portal cascade (14b โ†’ 32b โ†’ Claude Opus) | Priority cascade + Claude Opus |
521
- | **L3 grounding verifier** | โ€” | โœ“ | โœ“ | โœ“ |
522
- | **Knowledge search** | limited | unlimited | unlimited | unlimited |
523
- | **Session memory** | limited | unlimited | unlimited | unlimited |
524
- | **Analytics dashboard** | โ€” | โœ“ | โœ“ | โœ“ |
525
- | **HIPAA BAA** | โ€” | โ€” | โ€” | โœ“ |
526
-
527
- ### What free users get
528
-
529
- - Local Ollama inference with models up to 4b (prism-coder:1b7 and prism-coder:4b)
530
- - 50 calls/day, 512 max output tokens per call
531
- - Local SQLite storage for session memory and knowledge
532
- - All open-weight models available to pull via `ollama pull`
533
-
534
- ### What paid users get
535
-
536
- - **Higher model ceilings** โ€” Standard unlocks 14b, Advanced/Enterprise unlock 32b
537
- - **Cloud fallback** โ€” when local Ollama is down or underpowered, inference routes through the Synalux portal cascade (14b โ†’ 32b โ†’ Claude Opus)
538
- - **L3 grounding verifier** โ€” evidence-based claim verification that rejects hallucinated outputs
539
- - **Unlimited knowledge search and session memory** โ€” no caps on stored context
540
- - **Analytics dashboard** โ€” usage metrics, latency tracking, model performance
541
- - **Higher daily limits and token caps** โ€” see table above
542
-
543
- All on-device models are open-weight and free to run locally via Ollama. The subscription gates cloud features, higher model tiers, and increased limits.
544
-
545
- 14-day free trial on all paid plans. [Subscribe โ†’](https://synalux.ai/pricing)
546
-
547
- ### Why Prism MCP
548
-
549
- **Pricing โ€” flat-rate, not per-seat:**
550
-
551
- | | **Prism MCP** | GitHub Copilot | Cursor | Windsurf | Amazon Q | Tabnine |
552
- |---|---|---|---|---|---|---|
553
- | **Individual** | **$19/mo** | $10/mo | $20/mo | $15-20/mo | $19/mo | $39/mo |
554
- | **Team (5 devs)** | **$49/mo flat** | $95/mo | $200/mo | $200/mo | $95/mo | $295/mo |
555
- | **Enterprise** | **$99/mo flat** | $195/mo | $1,000/mo | Custom | Custom | Custom |
556
-
557
- **Features โ€” full stack vs single-purpose:**
558
-
559
- | | **Prism MCP** | GitHub Copilot | Cursor | Windsurf | Amazon Q | Tabnine | Devin |
560
- |---|---|---|---|---|---|---|---|
561
- | **Web IDE** | **Synalux Coder** | github.dev | โ€” | โ€” | Console | โ€” | Browser |
562
- | **VS Code extension** | **Yes** | Yes | N/A (is a fork) | N/A (is a fork) | Yes | Yes | No |
563
- | **MCP server** | **Native** | No | Partial | No | No | No | No |
564
- | **Works with Claude Code** | **Yes** | No | N/A | No | No | No | No |
565
- | **Local inference (Ollama)** | **1.7Bโ€“32B fleet** | No | No | No | No | No | No |
566
- | **Cloud fallback** | **14bโ†’32bโ†’Opus** | Cloud only | Cloud only | Cloud only | Cloud only | Cloud only | Cloud only |
567
- | **Works offline** | **Yes** | No | No | No | No | No | No |
568
- | **Open-weight models** | **HuggingFace** | Proprietary | Proprietary | Proprietary | Proprietary | Proprietary | Proprietary |
569
- | **Persistent memory** | **Cross-session** | No | No | No | No | No | Partial |
570
- | **Cognitive routing** | **Episodic/semantic/procedural** | No | No | No | No | No | No |
571
- | **Session drift detection** | **HRR-based** | No | No | No | No | No | No |
572
- | **Codebase indexing** | **Knowledge ingest (MCP + webhook + REST)** | Partial | Yes | Yes | Yes | Yes | Yes |
573
- | **L3 grounding verifier** | **Evidence-based** | No | No | No | No | No | No |
574
- | **Multi-agent hivemind** | **Shared Mind Palace** | No | No | No | No | No | No |
575
- | **Analytics dashboard** | **Yes** | No | Yes | Yes | Yes | No | Yes |
576
- | **HIPAA / air-gapped** | **On-prem, no BAA needed** | Requires BAA | No | No | Partial | No | No |
577
- | **Data stays local** | **Yes** | No | No | No | No | No | No |
578
-
579
- **vs local AI tools:**
580
-
581
- | | **Prism MCP** | Ollama | LM Studio | Jan.ai | Mem0 | Zep |
582
- |---|---|---|---|---|---|---|
583
- | **Local inference** | 1.7Bโ€“32B cascade | Any GGUF | Any GGUF | Any GGUF | No | No |
584
- | **Cloud fallback** | Automatic | No | No | Partial | Cloud only | Cloud only |
585
- | **Persistent memory** | Cross-session | No | No | No | Yes | Yes |
586
- | **Knowledge ingestion** | MCP + GitHub webhook + REST | No | No | No | Partial | No |
587
- | **Cognitive routing** | 3-store (episodic/semantic/procedural) | No | No | No | No | Temporal graph |
588
- | **Grounding verifier** | L3 evidence-based | No | No | No | No | No |
589
- | **Drift detection** | HRR-based | No | No | No | No | No |
590
- | **MCP server** | Native | No | No | No | No | No |
591
- | **Web IDE** | Synalux Coder | No | No | No | No | No |
592
- | **VS Code extension** | Yes | No | No | No | No | No |
593
- | **Analytics** | Dashboard + Datadog | No | No | No | Yes | Yes |
594
- | **Price** | $0โ€“99/mo flat | Free | Free/$10/user | Free | $249/mo | $99/mo |
595
-
596
- **Why developers choose Prism:**
597
- - **Full IDE experience** โ€” Synalux Coder (web) + VS Code extension + MCP for Claude Code, Cursor, JetBrains
598
- - **Local-first** โ€” your code and context never leave your machine unless you opt in to cloud
599
- - **Flat-rate pricing** โ€” $49/mo for your whole team, not $40/seat/mo
600
- - **Works offline** โ€” airplane, hospital, air-gapped classified environments
601
- - **Open models** โ€” prism-coder weights are on HuggingFace, not locked behind an API
602
- - **Memory that persists** โ€” cognitive routing stores episodic, semantic, and procedural memory across sessions
603
- - **Drift detection** โ€” HRR-based session monitoring catches when your AI agent goes off-track
604
- - **Grounding verification** โ€” L3 verifier rejects hallucinated outputs before they reach you
605
- - **Codebase indexing** โ€” knowledge ingestion via MCP tool, GitHub webhooks, or REST API
606
- - **Multi-agent ready** โ€” Hivemind lets multiple agents share the same Mind Palace with role-scoped context
607
- - **HIPAA without paperwork** โ€” local inference means no BAA required, PHI never leaves the device
608
-
609
- ---
610
-
611
646
  ## What you can build with it
612
647
 
613
648
  - **Persistent coding assistant** that remembers your codebase, your decisions, your team's conventions
@@ -624,17 +659,17 @@ All on-device models are open-weight and free to run locally via Ollama. The sub
624
659
 
625
660
  **[synalux.ai/prism-mcp](https://synalux.ai/prism-mcp)** โ€” full documentation, dashboard, subscription plans, and model downloads.
626
661
 
627
- ### ๐Ÿ’ป Web IDE โ€” Synalux Coder
662
+ ### ๐Ÿ’ป Web IDE โ€” Prism Coder
628
663
 
629
- Use Prism Coder directly in your browser โ€” no install required. Local-first IDE with the prism-coder agent built in. Connects to GitHub repos, Synalux Mail, Drive, and Source for cross-product workflows.
664
+ Use Prism Coder directly in your browser โ€” no install, no desktop app required. Standalone coding IDE with the prism-coder agent built in. Works with any Prism plan (no Synalux health subscription needed).
630
665
 
631
- **[synalux.ai/coder](https://synalux.ai/coder)** ยท also reachable at **[synalux.ai/prism-ide](https://synalux.ai/prism-ide)**
666
+ **[synalux.ai/coder](https://synalux.ai/coder)**
632
667
 
633
668
  | Feature | Detail |
634
669
  |---|---|
635
- | Agent | prism-coder:7b offline ยท Claude Sonnet 4 (Standard+) ยท Claude Opus 4 (Enterprise) |
636
- | Integrations | GitHub repos, Synalux Mail, Drive, Source โ€” same OAuth, no separate accounts |
637
- | Compliance | Audit log on every turn ยท PHI redaction ยท air-gapped offline mode (HIPAA) |
670
+ | Agent | prism-coder:8b offline ยท Claude Sonnet 4 (Standard+) |
671
+ | Integrations | GitHub repos ยท same Prism account, no separate sign-up |
672
+ | Plans | Free (4b) ยท Standard $19/mo (14b) ยท Advanced $49/mo (32b) ยท Enterprise $99/mo |
638
673
 
639
674
  ### ๐Ÿงฉ VS Code Extension โ€” Synalux
640
675
 
@@ -705,7 +740,7 @@ prism register-models # Alias dcostenco/prism-coder:* โ†’ prism-coder:*
705
740
  ## Testing
706
741
 
707
742
  ```bash
708
- npm test # 2,418 test cases across 81 files (vitest)
743
+ npm test # 2,676 test cases across 89 files (vitest)
709
744
  npm test -- --coverage # coverage report
710
745
  python3 tests/benchmarks/prism-routing-100/benchmark.py --models 1b7 14b 32b
711
746
  ```
@@ -801,11 +801,15 @@ export async function sessionLoadContextHandler(args) {
801
801
  const eff = computeEffectiveImportance(s.importance, s.last_accessed_at, s.created_at, Boolean(s.is_rollup));
802
802
  impStr = ` [Imp: ${eff}]`;
803
803
  }
804
- return ` [${s.session_date?.split("T")[0]}]${impStr} ${s.summary}`;
804
+ const dateStr = (s.session_date || s.created_at || s.date || "unknown").split("T")[0];
805
+ return ` [${dateStr}]${impStr} ${s.summary}`;
805
806
  }).join("\n") + `\n`;
806
807
  }
807
808
  if (d.session_history?.length) {
808
- formattedContext += `\n๐Ÿ“‚ Session History (${d.session_history.length} entries):\n` + d.session_history.map((s) => ` [${s.session_date?.split("T")[0]}] ${s.summary}`).join("\n") + `\n`;
809
+ formattedContext += `\n๐Ÿ“‚ Session History (${d.session_history.length} entries):\n` + d.session_history.map((s) => {
810
+ const dateStr = (s.session_date || s.created_at || s.date || "unknown").split("T")[0];
811
+ return ` [${dateStr}] ${s.summary}`;
812
+ }).join("\n") + `\n`;
809
813
  }
810
814
  if (d.recent_validations?.length) {
811
815
  formattedContext += `\n๐Ÿ”ฌ Recent Validations:\n` + d.recent_validations.map((v) => {
@@ -91,12 +91,12 @@ export const PRISM_INFER_TOOL = {
91
91
  type: "boolean",
92
92
  description: "Enable the L3 grounding verifier. Default: true when `evidence` is provided, " +
93
93
  "false otherwise. When enabled, the model's draft is checked by a different model " +
94
- "(prism-coder:1b7 by default) against the supplied `evidence`. Drafts with " +
94
+ "(prism-coder:4b by default) against the supplied `evidence`. Drafts with " +
95
95
  "NEUTRAL or CONTRADICTED claims are refused.",
96
96
  },
97
97
  verifier_model: {
98
98
  type: "string",
99
- description: "Override the verifier model. Default: prism-coder:1b7.",
99
+ description: "Override the verifier model. Default: prism-coder:4b.",
100
100
  },
101
101
  verifier_timeout_ms: {
102
102
  type: "number",
@@ -17,6 +17,7 @@ export const FREE_ENTITLEMENTS = {
17
17
  model_ceiling: "4b",
18
18
  daily_infer_limit: 50,
19
19
  max_tokens: 512,
20
+ max_seats: 1,
20
21
  features: {
21
22
  cloud_fallback: false,
22
23
  grounding_verifier: false,
@@ -27,7 +28,7 @@ export const FREE_ENTITLEMENTS = {
27
28
  upgrade_url: "https://synalux.ai/pricing",
28
29
  };
29
30
  // โ”€โ”€ Cache โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€
30
- const CACHE_TTL_MS = 60 * 60 * 1000; // 1 hour
31
+ const CACHE_TTL_MS = 5 * 60 * 1000; // 5 minutes
31
32
  let cache = null;
32
33
  let inFlight = null;
33
34
  // โ”€โ”€ Model tier ordering for ceiling enforcement โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€
package/package.json CHANGED
@@ -1,8 +1,8 @@
1
1
  {
2
2
  "name": "prism-mcp-server",
3
- "version": "17.1.0",
3
+ "version": "18.0.0",
4
4
  "mcpName": "io.github.dcostenco/prism-coder",
5
- "description": "Prism Coder โ€” Cognitive memory + tool-calling intelligence for AI agents. Mind Palace persistent memory (BFCL Gold Certified, 100% Tool-Call Accuracy, 114 Agent Skills, Zero-Search HDC/HRR retrieval, HRR Semantic Drift Detection across BCBA/Coding/AAC domains, HIPAA-hardened local-first storage, SLERP-optimized GRPO alignment) plus the prism-coder:7b / 14b open-weights LLM fleet.",
5
+ "description": "Prism Coder โ€” Cognitive memory + tool-calling intelligence for AI agents. Mind Palace persistent memory (BFCL Gold Certified, 100% Tool-Call Accuracy, 114 Agent Skills, PHI Guard, Tier Enforcement, Prompt-Based Skill Routing, Zero-Search HDC/HRR retrieval, HRR Semantic Drift Detection across BCBA/Coding/AAC domains, HIPAA-hardened local-first storage, SLERP-optimized GRPO alignment) plus the prism-coder 1.7Bโ€“32B open-weights LLM fleet.",
6
6
  "module": "index.ts",
7
7
  "type": "module",
8
8
  "main": "dist/server.js",