prism-mcp-server 17.1.0 → 17.1.1
- package/README.md +77 -107
- package/dist/utils/entitlements.js +2 -1
- package/package.json +2 -2
package/README.md
CHANGED
@@ -92,7 +92,7 @@ prism-coder:4b ── verifies claims ──────────▶ grounde
 prism-coder:32b ── deep reasoning ──────────▶ serve (~8s, 19GB, FREE)
 │
 ▼ (cloud fallback when local insufficient)
-Claude Sonnet 4
+Claude Sonnet 4 ────────────────────────────▶ serve (cloud, ~$0.01/req)
 ```
 
 | Tier | Model | Role | RAM | Latency | Cost |
@@ -100,7 +100,7 @@ Claude Sonnet 4 → Claude Opus 4.7 ─────────▶ serve (clou
 | **Default** | prism-coder:14b | Router + general inference | 9 GB | ~3s | $0 |
 | **Verifier** | prism-coder:4b | Grounding claims check | 2.5 GB | <1s | $0 |
 | **Complex** | prism-coder:32b | Deep reasoning (on-demand) | 19 GB | ~8s | $0 |
-| **Cloud** | Sonnet
+| **Cloud** | Claude Sonnet 4 | Fallback for max quality | — | ~5-10s | ~$0.01 |
 
 **Mobile / offline cascade** (Prism AAC iOS):
 ```
@@ -214,6 +214,74 @@ Multiple AI agents share the same Mind Palace. Each agent has a role (dev / qa /
 
 ---
 
+## Why Prism Coder
+
+### vs AI coding assistants
+
+| Feature | Prism Coder | GitHub Copilot | Cursor | Windsurf | Amazon Q | Tabnine | Devin |
+|---|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
+| Local inference (1.7B–32B) | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| Works offline (local-only mode) | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| Open-weight models (HuggingFace) | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| Data stays on machine (local tier) | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| Persistent cross-session memory | ✅ | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| Cognitive routing (episodic/semantic) | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| Session drift detection (HRR) | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| L3 grounding verifier | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| Multi-agent hivemind | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| MCP server (tools + memory for agents) | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| Cloud fallback (14b → 32b → Sonnet) | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| Web IDE | ✅ | ✅ | ❌ | ❌ | ✅ | ❌ | ✅ |
+| VS Code extension | ✅ | ✅ | ❌ | ❌ | ✅ | ✅ | ❌ |
+| HIPAA / air-gapped ready | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| Flat-rate pricing (not per-seat) | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
+
+### vs local AI tools
+
+| Feature | Prism Coder | Ollama | LM Studio | Jan.ai | Mem0 | Zep |
+|---|:---:|:---:|:---:|:---:|:---:|:---:|
+| Local inference (1.7B–32B cascade) | ✅ | ✅ | ✅ | ✅ | ❌ | ❌ |
+| Automatic cloud fallback | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| Persistent cross-session memory | ✅ | ❌ | ❌ | ❌ | ✅ | ✅ |
+| Knowledge ingestion (MCP + webhook + REST) | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| Cognitive routing (3-store) | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| L3 grounding verifier | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| Session drift detection | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| Native MCP server | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| Web IDE + VS Code extension | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
+| Analytics dashboard | ✅ | ❌ | ❌ | ❌ | ✅ | ✅ |
+
+### Pricing — flat-rate, not per-seat
+
+| | **Prism Coder** | GitHub Copilot | Cursor | Windsurf | Amazon Q | Tabnine |
+|---|:---:|:---:|:---:|:---:|:---:|:---:|
+| **Individual** | **$19/mo** | $10/mo | $20/mo | $15–20/mo | $19/mo | $39/mo |
+| **Team (5 devs)** | **$49/mo flat** | $95/mo | $200/mo | $200/mo | $95/mo | $295/mo |
+| **Enterprise (25 devs)** | **$99/mo flat** | $195/mo | $1,000/mo | Custom | Custom | Custom |
+| **Cost per dev (team)** | **$9.80** | $19 | $40 | $40 | $19 | $59 |
+| **Annual savings (5 devs)** | — | **$552** | **$1,812** | **$1,812** | **$552** | **$2,952** |
+
+---
+
+## Plans
+
+| | **Free** | **Standard $19/mo** | **Advanced $49/mo** | **Enterprise $99/mo** |
+|---|---|---|---|---|
+| **Seats included** | 1 | 1 | up to 5 | up to 25 |
+| **Local model ceiling** | up to 4b | up to 14b | up to 32b | up to 32b |
+| **Daily inference limit** | 50 | 200 | 2,000 | 100,000 |
+| **Max output tokens** | 512 | 1,024 | 2,048 | 4,096 |
+| **Cloud fallback** | — | Claude Sonnet 4 | Claude Sonnet 4 | Priority + Sonnet 4 |
+| **L3 grounding verifier** | — | ✅ | ✅ | ✅ |
+| **Knowledge search** | limited | unlimited | unlimited | unlimited |
+| **Session memory** | limited | unlimited | unlimited | unlimited |
+| **Analytics dashboard** | — | ✅ | ✅ | ✅ |
+| **HIPAA BAA** | — | — | — | ✅ |
+
+All on-device models are open-weight and free to run locally via Ollama. The subscription gates cloud features, higher model tiers, and increased limits. Need 25+ seats? [Contact sales](https://synalux.ai/contact). 14-day free trial on all paid plans. [Subscribe →](https://synalux.ai/pricing)
+
+---
+
 ## Get started
 
 ```bash
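Reviewer note: the "Cost per dev (team)" and "Annual savings (5 devs)" rows in the pricing table added by this hunk can be re-derived from the "Team (5 devs)" row. The sketch below is only a check of that arithmetic. It assumes cost per developer is the monthly team price divided by 5 and annual savings is the monthly difference multiplied by 12; the function names are illustrative and are not part of the package.

```javascript
// Monthly team prices (5 developers) as listed in the README pricing table.
const teamMonthly = {
  "Prism Coder": 49,
  "GitHub Copilot": 95,
  "Cursor": 200,
  "Windsurf": 200,
  "Amazon Q": 95,
  "Tabnine": 295,
};
const TEAM_SIZE = 5;

// Cost per developer: monthly team price divided by head count.
function costPerDev(monthly, seats = TEAM_SIZE) {
  return monthly / seats;
}

// Annual savings versus a competitor: monthly difference times 12 months.
function annualSavings(competitorMonthly, prismMonthly = teamMonthly["Prism Coder"]) {
  return (competitorMonthly - prismMonthly) * 12;
}

for (const [name, monthly] of Object.entries(teamMonthly)) {
  console.log(name, costPerDev(monthly).toFixed(2), annualSavings(monthly));
}
```

Under these assumptions the computed values agree with the table rows for all six columns.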
@@ -502,7 +570,7 @@ ollama pull dcostenco/prism-coder:32b
 
 Set `LOCAL_LLM_URL=http://localhost:11434` in your portal config. Routing is automatic:
 
-**Desktop/server**: 14B → 32B → Claude
+**Desktop/server**: 14B → 32B → Claude Sonnet 4 fallback · **Mobile/offline**: 14B → 8B → 1.7B
 
 iOS/mobile on same WiFi: `OLLAMA_HOST=0.0.0.0 ollama serve` on the Mac, then point `LOCAL_LLM_URL` at the Mac's IP.
 Routing accuracy (May 2026, v36/v7 system prompt, 3-seed mean): 32B v7 = **100.0%** · 8B v36 = **100.0%** · 14B v36 = **100.0%** · 1.7B v42 = **100.0%**
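Reviewer note: the README line changed in this hunk describes two tier orderings (desktop/server and mobile/offline) but the diff does not show how escalation is decided. The following is a minimal sketch of an ordered fallback, assuming each tier either answers or signals that it cannot. `routeThroughCascade`, `tryTier`, and the `claude-sonnet-4` label are hypothetical names for illustration, not identifiers from the package.

```javascript
// Tier orderings taken from the README line; the labels are illustrative strings.
const CASCADES = {
  desktop: ["prism-coder:14b", "prism-coder:32b", "claude-sonnet-4"],
  mobile: ["prism-coder:14b", "prism-coder:8b", "prism-coder:1b7"],
};

// Hypothetical contract: tryTier(model, prompt) resolves to a string, or to
// null when the tier cannot answer (model unavailable, low confidence, etc.).
async function routeThroughCascade(profile, prompt, tryTier) {
  const tiers = CASCADES[profile];
  if (!tiers) throw new Error(`unknown profile: ${profile}`);
  for (const model of tiers) {
    try {
      const answer = await tryTier(model, prompt);
      if (answer !== null) return { model, answer };
    } catch (err) {
      // A tier that throws is treated like an unavailable one; try the next.
    }
  }
  throw new Error("no tier in the cascade produced an answer");
}
```

Passing the tier runner in as a parameter keeps the sketch independent of any particular Ollama or cloud client.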
@@ -510,104 +578,6 @@ Cascade (14B→32B): **100.0%** · Opus solo: 98.3% · Opus engaged: **0% of req
 
 ---
 
-## Plans
-
-| | **Free** | **Standard $19/mo** | **Advanced $49/mo** | **Enterprise $99/mo** |
-|---|---|---|---|---|
-| **Local model ceiling** | up to 4b | up to 14b | up to 32b | up to 32b |
-| **Daily inference limit** | 50 | 200 | 2,000 | 100,000 |
-| **Max output tokens** | 512 | 1,024 | 2,048 | 4,096 |
-| **Cloud fallback** | — | Portal cascade (14b → 32b) | Portal cascade (14b → 32b → Claude Opus) | Priority cascade + Claude Opus |
-| **L3 grounding verifier** | — | ✓ | ✓ | ✓ |
-| **Knowledge search** | limited | unlimited | unlimited | unlimited |
-| **Session memory** | limited | unlimited | unlimited | unlimited |
-| **Analytics dashboard** | — | ✓ | ✓ | ✓ |
-| **HIPAA BAA** | — | — | — | ✓ |
-
-### What free users get
-
-- Local Ollama inference with models up to 4b (prism-coder:1b7 and prism-coder:4b)
-- 50 calls/day, 512 max output tokens per call
-- Local SQLite storage for session memory and knowledge
-- All open-weight models available to pull via `ollama pull`
-
-### What paid users get
-
-- **Higher model ceilings** — Standard unlocks 14b, Advanced/Enterprise unlock 32b
-- **Cloud fallback** — when local Ollama is down or underpowered, inference routes through the Synalux portal cascade (14b → 32b → Claude Opus)
-- **L3 grounding verifier** — evidence-based claim verification that rejects hallucinated outputs
-- **Unlimited knowledge search and session memory** — no caps on stored context
-- **Analytics dashboard** — usage metrics, latency tracking, model performance
-- **Higher daily limits and token caps** — see table above
-
-All on-device models are open-weight and free to run locally via Ollama. The subscription gates cloud features, higher model tiers, and increased limits.
-
-14-day free trial on all paid plans. [Subscribe →](https://synalux.ai/pricing)
-
-### Why Prism MCP
-
-**Pricing — flat-rate, not per-seat:**
-
-| | **Prism MCP** | GitHub Copilot | Cursor | Windsurf | Amazon Q | Tabnine |
-|---|---|---|---|---|---|---|
-| **Individual** | **$19/mo** | $10/mo | $20/mo | $15-20/mo | $19/mo | $39/mo |
-| **Team (5 devs)** | **$49/mo flat** | $95/mo | $200/mo | $200/mo | $95/mo | $295/mo |
-| **Enterprise** | **$99/mo flat** | $195/mo | $1,000/mo | Custom | Custom | Custom |
-
-**Features — full stack vs single-purpose:**
-
-| | **Prism MCP** | GitHub Copilot | Cursor | Windsurf | Amazon Q | Tabnine | Devin |
-|---|---|---|---|---|---|---|---|
-| **Web IDE** | **Synalux Coder** | github.dev | — | — | Console | — | Browser |
-| **VS Code extension** | **Yes** | Yes | N/A (is a fork) | N/A (is a fork) | Yes | Yes | No |
-| **MCP server** | **Native** | No | Partial | No | No | No | No |
-| **Works with Claude Code** | **Yes** | No | N/A | No | No | No | No |
-| **Local inference (Ollama)** | **1.7B–32B fleet** | No | No | No | No | No | No |
-| **Cloud fallback** | **14b→32b→Opus** | Cloud only | Cloud only | Cloud only | Cloud only | Cloud only | Cloud only |
-| **Works offline** | **Yes** | No | No | No | No | No | No |
-| **Open-weight models** | **HuggingFace** | Proprietary | Proprietary | Proprietary | Proprietary | Proprietary | Proprietary |
-| **Persistent memory** | **Cross-session** | No | No | No | No | No | Partial |
-| **Cognitive routing** | **Episodic/semantic/procedural** | No | No | No | No | No | No |
-| **Session drift detection** | **HRR-based** | No | No | No | No | No | No |
-| **Codebase indexing** | **Knowledge ingest (MCP + webhook + REST)** | Partial | Yes | Yes | Yes | Yes | Yes |
-| **L3 grounding verifier** | **Evidence-based** | No | No | No | No | No | No |
-| **Multi-agent hivemind** | **Shared Mind Palace** | No | No | No | No | No | No |
-| **Analytics dashboard** | **Yes** | No | Yes | Yes | Yes | No | Yes |
-| **HIPAA / air-gapped** | **On-prem, no BAA needed** | Requires BAA | No | No | Partial | No | No |
-| **Data stays local** | **Yes** | No | No | No | No | No | No |
-
-**vs local AI tools:**
-
-| | **Prism MCP** | Ollama | LM Studio | Jan.ai | Mem0 | Zep |
-|---|---|---|---|---|---|---|
-| **Local inference** | 1.7B–32B cascade | Any GGUF | Any GGUF | Any GGUF | No | No |
-| **Cloud fallback** | Automatic | No | No | Partial | Cloud only | Cloud only |
-| **Persistent memory** | Cross-session | No | No | No | Yes | Yes |
-| **Knowledge ingestion** | MCP + GitHub webhook + REST | No | No | No | Partial | No |
-| **Cognitive routing** | 3-store (episodic/semantic/procedural) | No | No | No | No | Temporal graph |
-| **Grounding verifier** | L3 evidence-based | No | No | No | No | No |
-| **Drift detection** | HRR-based | No | No | No | No | No |
-| **MCP server** | Native | No | No | No | No | No |
-| **Web IDE** | Synalux Coder | No | No | No | No | No |
-| **VS Code extension** | Yes | No | No | No | No | No |
-| **Analytics** | Dashboard + Datadog | No | No | No | Yes | Yes |
-| **Price** | $0–99/mo flat | Free | Free/$10/user | Free | $249/mo | $99/mo |
-
-**Why developers choose Prism:**
-- **Full IDE experience** — Synalux Coder (web) + VS Code extension + MCP for Claude Code, Cursor, JetBrains
-- **Local-first** — your code and context never leave your machine unless you opt in to cloud
-- **Flat-rate pricing** — $49/mo for your whole team, not $40/seat/mo
-- **Works offline** — airplane, hospital, air-gapped classified environments
-- **Open models** — prism-coder weights are on HuggingFace, not locked behind an API
-- **Memory that persists** — cognitive routing stores episodic, semantic, and procedural memory across sessions
-- **Drift detection** — HRR-based session monitoring catches when your AI agent goes off-track
-- **Grounding verification** — L3 verifier rejects hallucinated outputs before they reach you
-- **Codebase indexing** — knowledge ingestion via MCP tool, GitHub webhooks, or REST API
-- **Multi-agent ready** — Hivemind lets multiple agents share the same Mind Palace with role-scoped context
-- **HIPAA without paperwork** — local inference means no BAA required, PHI never leaves the device
-
----
-
 ## What you can build with it
 
 - **Persistent coding assistant** that remembers your codebase, your decisions, your team's conventions
@@ -624,17 +594,17 @@ All on-device models are open-weight and free to run locally via Ollama. The sub
 
 **[synalux.ai/prism-mcp](https://synalux.ai/prism-mcp)** — full documentation, dashboard, subscription plans, and model downloads.
 
-### 💻 Web IDE —
+### 💻 Web IDE — Prism Coder
 
-Use Prism Coder directly in your browser — no install required.
+Use Prism Coder directly in your browser — no install, no desktop app required. Standalone coding IDE with the prism-coder agent built in. Works with any Prism plan (no Synalux health subscription needed).
 
-**[synalux.ai/coder](https://synalux.ai/coder)**
+**[synalux.ai/coder](https://synalux.ai/coder)**
 
 | Feature | Detail |
 |---|---|
-| Agent | prism-coder:
-| Integrations | GitHub repos
-|
+| Agent | prism-coder:8b offline · Claude Sonnet 4 (Standard+) |
+| Integrations | GitHub repos · same Prism account, no separate sign-up |
+| Plans | Free (4b) · Standard $19/mo (14b) · Advanced $49/mo (32b) · Enterprise $99/mo |
 
 ### 🧩 VS Code Extension — Synalux
 
package/dist/utils/entitlements.js
CHANGED
@@ -17,6 +17,7 @@ export const FREE_ENTITLEMENTS = {
     model_ceiling: "4b",
     daily_infer_limit: 50,
     max_tokens: 512,
+    max_seats: 1,
     features: {
         cloud_fallback: false,
         grounding_verifier: false,
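Reviewer note: `FREE_ENTITLEMENTS` carries a `model_ceiling` of `"4b"`, and the same file mentions a model tier ordering used for ceiling enforcement, though that code is not part of the diff. The sketch below shows one way such a check could work, assuming tiers are compared by position in an ordered list. `TIER_ORDER` and `isWithinCeiling` are hypothetical names, and the ordering is inferred from the model sizes named in the README.

```javascript
// Assumed ordering, smallest to largest, using sizes named in the README.
const TIER_ORDER = ["1b7", "4b", "8b", "14b", "32b"];

// Hypothetical helper: is the requested tier at or below the plan's ceiling?
function isWithinCeiling(requested, ceiling) {
  const requestedRank = TIER_ORDER.indexOf(requested);
  const ceilingRank = TIER_ORDER.indexOf(ceiling);
  // Unknown tiers are rejected rather than silently allowed.
  if (requestedRank === -1 || ceilingRank === -1) return false;
  return requestedRank <= ceilingRank;
}

// Field values copied from the FREE_ENTITLEMENTS hunk.
const freePlan = { model_ceiling: "4b", daily_infer_limit: 50, max_tokens: 512, max_seats: 1 };

console.log(isWithinCeiling("4b", freePlan.model_ceiling));  // true
console.log(isWithinCeiling("14b", freePlan.model_ceiling)); // false
```

Rejecting unknown tier names is a design choice in this sketch; the real module may handle them differently.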
@@ -27,7 +28,7 @@ export const FREE_ENTITLEMENTS = {
     upgrade_url: "https://synalux.ai/pricing",
 };
 // ── Cache ─────────────────────────────────────────────────────────
-const CACHE_TTL_MS =
+const CACHE_TTL_MS = 5 * 60 * 1000; // 5 minutes
 let cache = null;
 let inFlight = null;
 // ── Model tier ordering for ceiling enforcement ───────────────────
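Reviewer note: the hunk shows a TTL constant next to `cache` and `inFlight` variables, which suggests a time-bounded cache that shares one pending request among concurrent callers. The surrounding code is not in the diff, so the following is only a sketch of that pattern under that assumption. `getCached`, `fetchFresh`, and the injectable `now` clock are hypothetical names.

```javascript
const CACHE_TTL_MS = 5 * 60 * 1000; // 5 minutes, as in the hunk
let cache = null;    // { value, fetchedAt } once populated
let inFlight = null; // pending promise shared by concurrent callers

// Hypothetical: fetchFresh is whatever async function loads the real data.
function getCached(fetchFresh, now = Date.now) {
  // Serve from cache while the entry is younger than the TTL.
  if (cache && now() - cache.fetchedAt < CACHE_TTL_MS) {
    return Promise.resolve(cache.value);
  }
  // A load is already running: reuse it instead of starting another.
  if (inFlight) return inFlight;
  inFlight = Promise.resolve()
    .then(fetchFresh)
    .then((value) => {
      cache = { value, fetchedAt: now() };
      return value;
    })
    .finally(() => {
      // Clear the marker on success or failure so a later call can retry.
      inFlight = null;
    });
  return inFlight;
}
```

Starting the chain from `Promise.resolve()` means the `finally` step always runs after `inFlight` has been assigned, even if `fetchFresh` throws synchronously.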
package/package.json
CHANGED
@@ -1,8 +1,8 @@
 {
   "name": "prism-mcp-server",
-  "version": "17.1.
+  "version": "17.1.1",
   "mcpName": "io.github.dcostenco/prism-coder",
-  "description": "Prism Coder — Cognitive memory + tool-calling intelligence for AI agents. Mind Palace persistent memory (BFCL Gold Certified, 100% Tool-Call Accuracy, 114 Agent Skills, Zero-Search HDC/HRR retrieval, HRR Semantic Drift Detection across BCBA/Coding/AAC domains, HIPAA-hardened local-first storage, SLERP-optimized GRPO alignment) plus the prism-coder
+  "description": "Prism Coder — Cognitive memory + tool-calling intelligence for AI agents. Mind Palace persistent memory (BFCL Gold Certified, 100% Tool-Call Accuracy, 114 Agent Skills, Zero-Search HDC/HRR retrieval, HRR Semantic Drift Detection across BCBA/Coding/AAC domains, HIPAA-hardened local-first storage, SLERP-optimized GRPO alignment) plus the prism-coder 1.7B–32B open-weights LLM fleet.",
   "module": "index.ts",
   "type": "module",
   "main": "dist/server.js",