@neruva/mcp 0.25.0 → 0.25.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +39 -2
- package/package.json +1 -1
package/README.md
CHANGED
|
@@ -1,9 +1,34 @@
|
|
|
1
1
|
# @neruva/mcp
|
|
2
2
|
|
|
3
|
-
MCP server for [Neruva](https://neruva.io) — memory + reasoning substrate for AI agents. Knowledge graph (5 engines), Pearl do-operator, HD analogy, episodic CBR, deterministic replay. Drop into Claude Code / Cursor / Codex / Gemini CLI in one line.
|
|
3
|
+
MCP server for [Neruva](https://neruva.io) — memory + reasoning substrate for AI agents. Knowledge graph (5 engines), Pearl do-operator, HD analogy, episodic CBR, deterministic replay. Drop into Claude Code / Cursor / Codex / Gemini CLI / Goose in one line.
|
|
4
4
|
|
|
5
5
|
> **For Claude Code users:** see [neruva.io/claude-code](https://neruva.io/claude-code) for the 30-second install + first-queries to try.
|
|
6
6
|
|
|
7
|
+
## The 5 benchmark crowns
|
|
8
|
+
|
|
9
|
+
| Benchmark | Score | Position |
|
|
10
|
+
|---|---|---|
|
|
11
|
+
| **LongMemEval** (end-to-end memory QA) | **93.3%** | top-4 globally, +22pp vs Zep |
|
|
12
|
+
| **DFAH compliance** (det × acc) | **100% / 88%** | first system to break the trade-off, 2.75× prior SOTA |
|
|
13
|
+
| **Hi-ToM** (higher-order theory-of-mind) | **84.58%** with **99.58%** byte-identical replay | 15× GPT-4 at order 4 · depth-1000 nested beliefs proven |
|
|
14
|
+
| **AMA-Bench** (causal + memory reasoning) | **63.3%** | new SOTA, +6pp over paper baseline |
|
|
15
|
+
| **Latency** (p95 cache-hit) | **80ms** | 2.5× faster than Mem0 |
|
|
16
|
+
|
|
17
|
+
Full breakdown: [neruva.io/benchmarks](https://neruva.io/benchmarks). **The only memory stack hitting world-class scores on memory, reasoning, AND audit determinism — simultaneously.**
|
|
18
|
+
|
|
19
|
+
## What's new in 0.25.0 — `qa_optimized` + `recency_first` graduated
|
|
20
|
+
|
|
21
|
+
The LongMemEval 93.3% recipe lifted into the substrate. `agent_recall` now accepts two opt-in flags (both default off, backwards compatible):
|
|
22
|
+
|
|
23
|
+
- **`mode="qa_optimized"`** — 3× over-fetches the candidate pool and BM25-RRF-fuses (k=60) over the text. Lifts records whose text contains specific entity tokens (brand names, amounts, dates, IDs) that pure semantic embedding under-weights. Worth **~+20pp** on memory-QA benchmarks.
|
|
24
|
+
- **`recency_first=true`** — re-sorts returned records by `ts` desc after ranking. Use for knowledge-update questions where the most recent statement wins over older contradictory ones.
|
|
25
|
+
|
|
26
|
+
Pair them for the strongest memory-QA recipe. Auto-router in `neruva-record` 0.19+ suggests both automatically when the intent classifier hits `recall_extended`.
|
|
27
|
+
|
|
28
|
+
## What's new in 0.24.0 — Code-graph bare-name resolution
|
|
29
|
+
|
|
30
|
+
`code_kg_module_of` and `code_kg_class_of` now accept short identifiers like `bind` (no module prefix). When the substrate sees an unqualified name, it falls back to a 1-hop `called_by` lookup to find the likely qualified target. Drops the friction of remembering full dotted paths inside Claude Code.
|
|
31
|
+
|
|
7
32
|
## What's new in 0.23.0 — Native hook binary (sub-100ms cache-hit latency)
|
|
8
33
|
|
|
9
34
|
We benchmarked Neruva against the 2026 agent-memory field and made the hook **faster than Mem0** without giving up any of the capability surface.
|
|
@@ -151,7 +176,7 @@ The substrate now exposes the full **9-level cognitive ladder**. Every primitive
|
|
|
151
176
|
| 9 | **Continual learning, zero forgetting** | `agent_continual_train` · `agent_continual_predict` | catastrophic forgetting |
|
|
152
177
|
| + | **Hierarchical chunking (recursive L^K decode)** | `agent_hierarchical_add` · `agent_hierarchical_decode` | not a primitive |
|
|
153
178
|
|
|
154
|
-
**~
|
|
179
|
+
**~80 tools** across Records, KG, Causal, Analogy, CBR, Blend, federated `agent_*`, the 9 cognitive primitives above, self-introspection.
|
|
155
180
|
|
|
156
181
|
### Why this is unique
|
|
157
182
|
|
|
@@ -206,6 +231,18 @@ env = { NERUVA_API_KEY = "..." }
|
|
|
206
231
|
{ "mcpServers": { "neruva": { "command": "npx", "args": ["-y", "@neruva/mcp@latest"], "env": { "NERUVA_API_KEY": "..." } } } }
|
|
207
232
|
```
|
|
208
233
|
|
|
234
|
+
### Goose (`~/.config/goose/config.yaml`)
|
|
235
|
+
```yaml
|
|
236
|
+
extensions:
|
|
237
|
+
neruva:
|
|
238
|
+
type: stdio
|
|
239
|
+
cmd: npx
|
|
240
|
+
args: ["-y", "@neruva/mcp@latest"]
|
|
241
|
+
env:
|
|
242
|
+
NERUVA_API_KEY: nv_...
|
|
243
|
+
```
|
|
244
|
+
For Goose auto-pilot (pattern-C route / reflect / extract via your LLM): `pip install neruva-goose`.
|
|
245
|
+
|
|
209
246
|
## The substrate, in one paragraph
|
|
210
247
|
|
|
211
248
|
Five layers, one API. **Records** = typed agentic events (decisions, mistakes, tool_calls, llm_turns; auto-embedded at D=1024). **Knowledge Graph** = mutable structured state across 5 engines, sub-ms cosine retrieval, matrix-power N-hop derive. **Causal** = Pearl's do-operator (`observation` vs `intervention` arithmetically distinct). **Analogy** = `a:b::c:?` in HD feature space. **Concept Blending** = provenance-preserving merge of multiple memories. **CBR** = factored episode store. The new **federated agent_*** layer (agent_remember / agent_recall / agent_context) routes across all substrates so a single call handles "where does X store, and how do I get it back?"
|
package/package.json
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "@neruva/mcp",
|
|
3
|
-
"version": "0.25.
|
|
3
|
+
"version": "0.25.1",
|
|
4
4
|
"description": "MCP server for Neruva agent memory + reasoning substrate. INSTALL: npx -y -p @neruva/mcp@latest neruva-mcp (the -p ... bin form is required because the package ships two bins: neruva-mcp + neruva-record-hook). v0.23 BUNDLES neruva-record-hook native binary (Rust, ~145-340KB) for all 6 platforms (Win/Mac/Linux x amd64/arm64). Hook talks to long-lived Python daemon over TCP localhost, daemon holds warm httpx pool + TTL cache: cache-hit p95 ~80ms, warm avg 167ms -- faster than Mem0 (200ms p95) with full cognitive surface (5 KG engines + Pearl do-operator + HD analogy + CBR + ToM + EFE planning + replay). v0.22 added AUTO-PILOT (agent_route_intent_prompt + agent_reflect_prompt + auto-extract on records_ingest). Plus typed Records, 5-engine KG, federated agent_remember/recall/context, counterfactual rollouts, rule induction, replay, code_kg_* navigation, .neruva V3 container. Pattern-C throughout: substrate stays $0/call.",
|
|
5
5
|
"license": "MIT",
|
|
6
6
|
"type": "module",
|