moltmind 0.7.1 → 0.7.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -4,6 +4,20 @@ Persistent semantic memory and session continuity for AI agents. One install, ze
4
4
 
5
5
  MoltMind is an [MCP](https://modelcontextprotocol.io) server that gives your AI agent long-term memory across sessions — storing learnings, decisions, error fixes, and handoff context using local SQLite and embeddings. No API keys, no cloud, no accounts needed.
6
6
 
7
+ ## Why MoltMind?
8
+
9
+ Every time your AI agent starts a new conversation, it forgets everything. Re-exploring your codebase costs ~8,000 tokens per session — about **$0.024** on Claude Sonnet. That adds up fast:
10
+
11
+ | Project size | Without MoltMind | With MoltMind | You save |
12
+ |-------------|-----------------|---------------|----------|
13
+ | 5 sessions | $0.12 | $0.02 | **$0.10** |
14
+ | 20 sessions | $0.48 | $0.05 | **$0.43** |
15
+ | Daily use (1 year) | $8.76 | $0.87 | **$7.89** |
16
+
17
+ MoltMind restores your agent's context in ~325 tokens ($0.001) instead of re-exploring from scratch. Your agent picks up right where it left off — same project knowledge, same decisions, same learnings.
18
+
19
+ > Dollar estimates based on Claude Sonnet 4.5 input pricing ($3/1M tokens). Actual savings vary by model and usage.
20
+
7
21
  ## Quick Start
8
22
 
9
23
  ### Claude Code
@@ -12,7 +26,15 @@ MoltMind is an [MCP](https://modelcontextprotocol.io) server that gives your AI
12
26
  claude mcp add moltmind -- npx -y moltmind
13
27
  ```
14
28
 
15
- Restart Claude Code, then run `/mcp` to verify. Add `--moltbook` for social features ([moltbook.com](https://moltbook.com)).
29
+ Restart Claude Code, then run `/mcp` to verify.
30
+
31
+ With moltbook social features:
32
+
33
+ ```bash
34
+ claude mcp add moltmind -- npx -y moltmind --moltbook
35
+ ```
36
+
37
+ See [moltbook.com](https://moltbook.com) for the agent social network.
16
38
 
17
39
  ### Other Clients
18
40
 
@@ -73,11 +95,23 @@ npm uninstall -g moltmind # then let npx handle it
73
95
 
74
96
  ## How It Works
75
97
 
76
- **Memory & Search** — Memories are stored in local SQLite with FTS5. Each has a type (`learning`, `error`, `decision`, `plan`, `raw`), tags, and a tier (`hot`, `warm`, `cold`, `archived`). `mm_recall` runs hybrid search: semantic similarity (0.7 weight) via a local [MiniLM-L6-v2](https://huggingface.co/Xenova/all-MiniLM-L6-v2) embedding model plus FTS5 keyword matching (0.3 weight). If the embedding model isn't available, it falls back to keyword-only.
98
+ **Memory & Search** — Your agent stores memories in a local database. When it needs to find something, MoltMind searches by meaning (not just keywords) so searching for "API port" finds a memory about "our server runs on port 8080". If the search model isn't downloaded yet, it falls back to keyword matching.
99
+
100
+ **Sessions & Handoffs** — Sessions are auto-created on startup and auto-paused on shutdown. Your agent saves where it left off and picks up seamlessly next time. Handoffs let one agent pass context to another with structured goal/state/next-action documents.
101
+
102
+ **Diagnostics** — Every tool call is logged locally with timing and success/failure. `mm_status` shows health, `mm_metrics` shows usage stats and token savings. All data stays on your machine.
77
103
 
78
- **Sessions & Handoffs** Sessions are auto-created on startup and auto-paused on shutdown. `mm_session_save` captures what happened and where you left off; `mm_session_resume` restores full context. `mm_handoff_create` structures goal/state/next-action for agent-to-agent transfers. All tool calls are tagged with session IDs for traceability.
104
+ ## What It Costs (Tokens)
79
105
 
80
- **Diagnostics** Every tool call is logged locally with latency and success/failure. `mm_status` shows health score, `mm_metrics` shows per-tool usage stats, error rates, and token savings. All data stays on your machine.
106
+ Every MCP tool adds a small overhead to each request because the AI needs to know what tools are available. Here's what MoltMind costs and what it saves you:
107
+
108
+ | | Cost per request | In dollars |
109
+ |--|-----------------|------------|
110
+ | MoltMind overhead (14 tools) | ~500 tokens | ~$0.0015 |
111
+ | With prompt caching | ~50 tokens | ~$0.00015 |
112
+ | **Session resume (saves you)** | **~7,675 tokens** | **~$0.023** |
113
+
114
+ **Bottom line:** MoltMind pays for itself after a single session resume. Every conversation after that is pure savings.
81
115
 
82
116
  ## Free vs Pro
83
117
 
@@ -91,27 +125,17 @@ npm uninstall -g moltmind # then let npx handle it
91
125
 
92
126
  Upgrade: `npx moltmind --upgrade`
93
127
 
94
- ## Token Cost
95
-
96
- MCP tools add overhead because descriptions are sent with every request. MoltMind pays for itself quickly:
128
+ ## Search Performance (Pro)
97
129
 
98
- | Mode | Overhead per request |
99
- |------|---------------------|
100
- | Default (14 tools) | ~500 tokens |
101
- | + Moltbook (21 tools) | ~1,000 tokens |
102
- | With prompt caching | ~50 tokens |
130
+ Pro tier uses [Zvec ANN](https://github.com/ariv14/zvec-native) for fast memory search. Here's what that means in practice:
103
131
 
104
- ### Session resume vs cold start
132
+ **Accuracy** At 1,000 memories (a typical heavy user), Zvec finds **98% of the exact same results** as an exhaustive search. Your agent gets the right answer almost every time.
105
133
 
106
- Without MoltMind, re-exploring a codebase costs ~8,000 tokens per session. `mm_session_resume` restores context in ~325 tokens.
134
+ **Speed** Search takes **under 1ms** at 1,000 memories. At 10,000 memories, it's still under 5ms. Your agent won't notice any delay.
107
135
 
108
- | Scenario | Without | With MoltMind | Savings |
109
- |----------|---------|---------------|---------|
110
- | Single resume | ~8,000 | ~825 | 90% |
111
- | 5-session project | ~40,000 | ~7,500 | 81% |
112
- | 20-session project | ~160,000 | ~40,200 | 75% |
136
+ **Reliability** Handles **330+ searches per second** with zero latency spikes. Deleted memories never come back. Results are deterministic.
113
137
 
114
- Run `npm run benchmark` for latency measurements and projected savings. See [RUNBOOK.md](RUNBOOK.md) for detailed results.
138
+ See [BENCHMARK_RESULTS.md](BENCHMARK_RESULTS.md) for the full report, or [RUNBOOK.md](RUNBOOK.md) for how to run benchmarks yourself.
115
139
 
116
140
  ## Data Storage
117
141
 
package/dist/index.js CHANGED
@@ -25,7 +25,7 @@ const moltbookInstructions = isMoltbookEnabled()
25
25
  : "";
26
26
  const server = new McpServer({
27
27
  name: "moltmind",
28
- version: "0.7.1",
28
+ version: "0.7.3",
29
29
  }, {
30
30
  instructions: `MoltMind provides persistent memory and session continuity. On startup, call mm_session_resume to restore context from previous sessions. Before disconnecting or when a task is complete, call mm_session_save to preserve session state. Use mm_handoff_create to checkpoint progress during long tasks.${moltbookInstructions}`,
31
31
  });
@@ -9,7 +9,7 @@ export async function handleMmStatus() {
9
9
  const uptimeSeconds = Math.floor((Date.now() - startTime) / 1000);
10
10
  return {
11
11
  success: true,
12
- version: "0.7.1",
12
+ version: "0.7.3",
13
13
  tier: isProTier() ? "pro" : "free",
14
14
  usage: checkStoreLimits().message,
15
15
  db_stats: stats,
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "moltmind",
3
- "version": "0.7.1",
3
+ "version": "0.7.3",
4
4
  "description": "Agent Memory MCP Server — persistent semantic memory and session continuity for AI agents",
5
5
  "type": "module",
6
6
  "bin": {
@@ -43,7 +43,7 @@
43
43
  "@xenova/transformers": "^2.17.0"
44
44
  },
45
45
  "optionalDependencies": {
46
- "@moltmind/zvec-native": "^0.1.0"
46
+ "@moltmind/zvec-native": "^0.1.1"
47
47
  },
48
48
  "devDependencies": {
49
49
  "typescript": "^5.5.0",