r2mcp 0.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (138)

package/CHANGELOG.md +66 -0
package/LICENSE +21 -0
package/README.md +532 -0
package/dist/breadcrumbs.d.ts +123 -0
package/dist/breadcrumbs.js +135 -0
package/dist/cli/classify-edges.d.ts +2 -0
package/dist/cli/classify-edges.js +130 -0
package/dist/cli/compile-wiki.d.ts +2 -0
package/dist/cli/compile-wiki.js +173 -0
package/dist/cli/dump-edges-json.d.ts +2 -0
package/dist/cli/dump-edges-json.js +21 -0
package/dist/cli/extract-entities.d.ts +17 -0
package/dist/cli/extract-entities.js +166 -0
package/dist/cli/lint-memory.d.ts +16 -0
package/dist/cli/lint-memory.js +94 -0
package/dist/cli/migrate.d.ts +17 -0
package/dist/cli/migrate.js +146 -0
package/dist/cli/setup-helpers.d.ts +7 -0
package/dist/cli/setup-helpers.js +72 -0
package/dist/cli/setup.d.ts +15 -0
package/dist/cli/setup.js +95 -0
package/dist/compiler/clustering.d.ts +29 -0
package/dist/compiler/clustering.js +66 -0
package/dist/compiler/frontmatter.d.ts +35 -0
package/dist/compiler/frontmatter.js +168 -0
package/dist/compiler/manifest.d.ts +32 -0
package/dist/compiler/manifest.js +82 -0
package/dist/compiler/prompts.d.ts +17 -0
package/dist/compiler/prompts.js +82 -0
package/dist/compiler/run.d.ts +52 -0
package/dist/compiler/run.js +186 -0
package/dist/compiler/tier.d.ts +10 -0
package/dist/compiler/tier.js +85 -0
package/dist/compiler/topic.d.ts +16 -0
package/dist/compiler/topic.js +105 -0
package/dist/compiler/types.d.ts +101 -0
package/dist/compiler/types.js +4 -0
package/dist/db.d.ts +10 -0
package/dist/db.js +46 -0
package/dist/edges/candidate-pairs.d.ts +24 -0
package/dist/edges/candidate-pairs.js +35 -0
package/dist/edges/classifier.d.ts +45 -0
package/dist/edges/classifier.js +172 -0
package/dist/edges/signals.d.ts +13 -0
package/dist/edges/signals.js +45 -0
package/dist/edges/stage1-haiku.d.ts +21 -0
package/dist/edges/stage1-haiku.js +33 -0
package/dist/edges/stage2-opus.d.ts +41 -0
package/dist/edges/stage2-opus.js +101 -0
package/dist/edges/state.d.ts +44 -0
package/dist/edges/state.js +79 -0
package/dist/edges/types.d.ts +20 -0
package/dist/edges/types.js +1 -0
package/dist/embeddings.d.ts +13 -0
package/dist/embeddings.js +54 -0
package/dist/entities/db.d.ts +49 -0
package/dist/entities/db.js +109 -0
package/dist/entities/extractor.d.ts +14 -0
package/dist/entities/extractor.js +154 -0
package/dist/entities/normalize.d.ts +5 -0
package/dist/entities/normalize.js +7 -0
package/dist/entities/prompt.d.ts +19 -0
package/dist/entities/prompt.js +100 -0
package/dist/entities/state.d.ts +44 -0
package/dist/entities/state.js +99 -0
package/dist/entities/types.d.ts +62 -0
package/dist/entities/types.js +6 -0
package/dist/env.d.ts +13 -0
package/dist/env.js +32 -0
package/dist/fingerprint.d.ts +2 -0
package/dist/fingerprint.js +12 -0
package/dist/graph-rebuild.d.ts +6 -0
package/dist/graph-rebuild.js +20 -0
package/dist/index.d.ts +4 -0
package/dist/index.js +403 -0
package/dist/instrumentation.d.ts +10 -0
package/dist/instrumentation.js +37 -0
package/dist/lint/checks/contradictions.d.ts +30 -0
package/dist/lint/checks/contradictions.js +52 -0
package/dist/lint/checks/drift.d.ts +5 -0
package/dist/lint/checks/drift.js +34 -0
package/dist/lint/checks/orphans.d.ts +5 -0
package/dist/lint/checks/orphans.js +25 -0
package/dist/lint/checks/stale.d.ts +6 -0
package/dist/lint/checks/stale.js +29 -0
package/dist/lint/checks/superseded-unflagged.d.ts +5 -0
package/dist/lint/checks/superseded-unflagged.js +47 -0
package/dist/lint/run.d.ts +11 -0
package/dist/lint/run.js +95 -0
package/dist/lint/types.d.ts +60 -0
package/dist/lint/types.js +13 -0
package/dist/mcp-response.d.ts +7 -0
package/dist/mcp-response.js +13 -0
package/dist/providers/anthropic.d.ts +13 -0
package/dist/providers/anthropic.js +56 -0
package/dist/providers/claude-code.d.ts +35 -0
package/dist/providers/claude-code.js +175 -0
package/dist/providers/errors.d.ts +12 -0
package/dist/providers/errors.js +19 -0
package/dist/providers/index.d.ts +30 -0
package/dist/providers/index.js +71 -0
package/dist/providers/openrouter.d.ts +19 -0
package/dist/providers/openrouter.js +76 -0
package/dist/providers/semaphore.d.ts +19 -0
package/dist/providers/semaphore.js +51 -0
package/dist/providers/types.d.ts +27 -0
package/dist/providers/types.js +7 -0
package/dist/schema.sql +116 -0
package/dist/server-instructions.d.ts +9 -0
package/dist/server-instructions.js +20 -0
package/dist/telemetry.d.ts +39 -0
package/dist/telemetry.js +130 -0
package/dist/tools/classify.d.ts +44 -0
package/dist/tools/classify.js +121 -0
package/dist/tools/compile.d.ts +31 -0
package/dist/tools/compile.js +132 -0
package/dist/tools/dump-edges-sidecar.d.ts +37 -0
package/dist/tools/dump-edges-sidecar.js +80 -0
package/dist/tools/extract-entities.d.ts +53 -0
package/dist/tools/extract-entities.js +169 -0
package/dist/tools/lint.d.ts +10 -0
package/dist/tools/lint.js +13 -0
package/dist/tools/meditate.d.ts +25 -0
package/dist/tools/meditate.js +128 -0
package/dist/tools/recall.d.ts +66 -0
package/dist/tools/recall.js +409 -0
package/dist/tools/reject.d.ts +10 -0
package/dist/tools/reject.js +24 -0
package/dist/tools/remember.d.ts +26 -0
package/dist/tools/remember.js +140 -0
package/dist/tools/search.d.ts +30 -0
package/dist/tools/search.js +69 -0
package/dist/tools/spawn-cli.d.ts +14 -0
package/dist/tools/spawn-cli.js +41 -0
package/dist/tools/stats.d.ts +31 -0
package/dist/tools/stats.js +88 -0
package/package.json +86 -0
package/skills/remember/SKILL.md +357 -0

package/CHANGELOG.md ADDED

@@ -0,0 +1,66 @@
+# Changelog
+All notable changes to r2mcp. Versions follow [semver](https://semver.org/);
+entries reference the internal spec numbers that shipped them.
+## [0.2.0] — 2026-06-13
+First release packaged for use beyond the original workspace.
+### Added
+- **npm bins**: `r2mcp` (the MCP server) and `r2mcp-setup` (schema provisioner) —
+  `.mcp.json` can now use `npx -y r2mcp` instead of an absolute dist path.
+- **MCP server instructions** sent in the initialize response — a fresh project's
+  agent learns the recall-at-start / remember-as-you-go loop with zero setup.
+- **`warnings[]` on `remember`/`recall`** when embeddings are unavailable,
+  distinguishing *disabled* (no `R2MCP_OPENROUTER_API_KEY`) from *failed*.
+- **Fail-fast configuration**: the server and `npm run setup` refuse to start
+  without `R2MCP_DATABASE_URL` (previously defaulted silently to localhost);
+  startup logs a warning when embeddings are off.
+- **`extract_entities` tool + entity-scoped `recall`** (SPEC-046): light entity
+  extraction (project/person/tool/decision) with alias merging and cost caps.
+- **`next_tools[]` breadcrumbs** on every tool response (SPEC-047): context-aware
+  follow-up suggestions, capped at 3.
+- **MCP-only operation** (SPEC-045): `classify` and `dump_edges_sidecar` promoted
+  to MCP tools; subprocess spawning centralized via `resolveCliCommand`;
+  `R2MCP_CLAUDE_BIN` escape hatch for sanitized-PATH hosts (now documented, and
+  named in ENOENT spawn errors).
+- **MIT LICENSE**, engines field, repository metadata, this changelog.
+### Changed
+- **Single-rootDir build**: CLI drivers moved from `scripts/` to `src/cli/`;
+  the dual compile tree (`dist/` + `dist/src/`, 43 duplicate files) is gone and
+  `schema.sql` is copied once. `npm run` script names are unchanged.
+- **README onboarding overhaul**: Supabase **Session pooler** is the recommended
+  connection (the Direct connection is IPv6-only without a paid add-on; setup
+  classifies `ENETUNREACH` accordingly); `.mcp.json` examples use `${VAR}`
+  expansion with a secret-hygiene warning; the bundled `/remember` skill install
+  corrected to `.claude/skills/`; new Configuration table and Troubleshooting
+  section.
+### Fixed
+- **`.env` loader regex could not match any `R2MCP_*` key** (the digit excluded
+  by `[A-Z_]+`) — every documented fresh-clone setup silently fell back to
+  localhost. All five hand-rolled loaders replaced by a shared `loadEnvFile()`.
+- CLI drivers load `.env` only on their CLI entry path, never at module import
+  (a module-level load leaked the consumer's DB URL into test processes).
+- `lint:memory` previously loaded no environment at all.
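
The `.env` loader bug in the Fixed list can be reproduced in a few lines. This is a minimal sketch, not the package's code: only the `[A-Z_]+` character class comes from the changelog, while the anchors, the capture groups, the `parseEnvLine` helper, and the corrected pattern are assumptions made for illustration.

```typescript
// Hypothetical reconstruction: only the `[A-Z_]+` class is from the changelog;
// the `^...=(.*)$` framing around it is an assumption.
const buggyPattern = /^([A-Z_]+)=(.*)$/;
// One possible fix (assumed, not necessarily what loadEnvFile() does):
// allow digits after the first character of the key.
const fixedPattern = /^([A-Z_][A-Z0-9_]*)=(.*)$/;

// Hypothetical helper: returns the key/value pair, or null when the line does not match.
function parseEnvLine(
  line: string,
  pattern: RegExp,
): { key: string; value: string } | null {
  const match = pattern.exec(line);
  const key = match?.[1];
  const value = match?.[2];
  if (key === undefined || value === undefined) return null;
  return { key, value };
}

const sampleLine =
  "R2MCP_DATABASE_URL=postgresql://r2mcp:r2mcp@localhost:5432/r2mcp";
const buggyResult = parseEnvLine(sampleLine, buggyPattern);
const fixedResult = parseEnvLine(sampleLine, fixedPattern);

console.log(buggyResult); // null, because the "2" in "R2MCP" is outside [A-Z_]
console.log(fixedResult?.key); // "R2MCP_DATABASE_URL"
```

With an anchored pattern the match fails at the second character of every `R2MCP_*` key, which is consistent with the silent fallback to localhost described above.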
+## [0.1.0] — 2026-05-09
+Initial extraction from the ClaudeClaw workspace (SPEC-041 through SPEC-044).
+- 9 MCP tools: `remember`, `recall`, `search`, `meditate`, `reject`, `stats`,
+  `compile`, `lint`, `classify` over PostgreSQL + pgvector (Docker or Supabase
+  free tier).
+- 3-tier memory (preferences / project-context / conversations) with semantic +
+  full-text hybrid retrieval, MMR diversity, progressive tier search,
+  token-budget retrieval (Recall v2).
+- Typed memory edges (SPEC-043) surfaced as `signals[]` on recall.
+- Wiki mode (SPEC-044): regenerable compiled views, SQL-only lint
+  (contradictions / stale / orphans / drift / superseded_unflagged).
+- Multi-provider LLM layer: claude-code (Max plan, $0/call), Anthropic API,
+  OpenRouter — batch jobs only; the MCP server itself never makes LLM calls.

package/LICENSE ADDED

@@ -0,0 +1,21 @@
+MIT License
+Copyright (c) 2026 Dustin Cheng
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.

package/README.md ADDED

@@ -0,0 +1,532 @@
+# r2mcp — Persistent Memory for Claude Code
+Persistent, semantic, tiered memory layer for Claude Code sessions.
+**The problem:** Every Claude Code session starts fresh. Context is lost. You repeat yourself.
+**The fix:** r2mcp gives Claude a structured, searchable memory that survives session boundaries — stored in PostgreSQL with pgvector semantic search.
+## What you get
+- **11 MCP tools:** `remember`, `recall`, `search`, `meditate`, `reject`, `stats`, `compile`, `lint`, `classify`, `dump_edges_sidecar`, `extract_entities`
+- **3-tier memory:** `preferences` (decisions, style) → `project-context` (architecture, state) → `conversations` (relationship, history)
+- **Semantic search:** Progressive tier search with MMR diversity reranking and relevance floor filtering (Recall v2)
+- **Typed memory edges:** `contradicts`, `supersedes`, `supports`, `evolved_into`, `depends_on`, `related_to` — surfaced as signals on `recall()`
+- **Wiki compile:** Regenerable browsable views — `compile()` synthesizes `memory/compiled/` from pgvector
+- **Lint as a first-class op:** SQL-only structural feedback — contradictions, stale, orphans, drift, superseded_unflagged
+- **Multi-provider LLM layer:** Classifier and compile work on a Max plan ($0/call), Anthropic API, or OpenRouter — picked per-invocation
+- **Bundled `/remember` skill:** Client-side judgment pipeline — classify → conflict-check → store
+## Setup
+**Prerequisites:** Node.js 20+. An OpenRouter API key is strongly recommended — it powers semantic-search embeddings. Without one, r2mcp still works but degrades to full-text search (and tells you so via a startup warning and `warnings[]` on tool responses). Docker is optional (Option B only).
+r2mcp works with any PostgreSQL + pgvector backend. The fastest path is Supabase (free tier, no Docker required).
+### Option A: Supabase (no Docker required)
+#### 1. Create a Supabase project
+Create a free project at [supabase.com](https://supabase.com). Once created, click **Connect** (top of the dashboard) and copy the **Session pooler** connection string — port `5432`, host like `aws-0-<region>.pooler.supabase.com`, username `postgres.<project-ref>`.
+> **Why the Session pooler?** The Direct connection (`db.<ref>.supabase.co:5432`) resolves to an IPv6 address, and IPv4 for direct connections is a paid add-on — on an IPv4-only network it fails with `connect ENETUNREACH`. The Session pooler is IPv4-compatible on every tier and fully supports schema setup. (Do **not** use the Transaction pooler on port `6543` — it can't run DDL; setup will refuse it.) If your network has IPv6, the Direct connection works too.
+#### 2. Clone and configure
+```bash
+git clone https://github.com/DMokong/r2mcp.git && cd r2mcp && npm install
+cp .env.example .env
+# Set R2MCP_DATABASE_URL to your Session pooler URL (port 5432, not 6543)
+# Set R2MCP_OPENROUTER_API_KEY to your OpenRouter key (enables semantic search)
+```
+#### 3. Provision schema and build
+```bash
+npm run setup && npm run build
+```
+This creates the `memories` table, pgvector indexes, and full-text search index. **Safe to re-run.**
+#### 4. Register in Claude Code
+Add to your project's `.mcp.json`. Use `${VAR}` expansion so credentials stay in your environment instead of the file — **`.mcp.json` is typically committed, so never paste real credentials into it**:
+```json
+{
+  "mcpServers": {
+    "memory": {
+      "command": "node",
+      "args": ["/path/to/r2mcp/dist/index.js"],
+      "env": {
+        "R2MCP_DATABASE_URL": "${R2MCP_DATABASE_URL}",
+        "R2MCP_OPENROUTER_API_KEY": "${R2MCP_OPENROUTER_API_KEY}"
+      }
+    }
+  }
+}
+```
+Claude Code expands `${VAR}` (and `${VAR:-default}`) from your environment at launch. Inline literal values are fine only for throwaway local experiments — if you go that route, gitignore `.mcp.json` and treat any committed credential as compromised.
+Restart Claude Code, then see [After setup](#after-setup-both-options).
+---
+### Option B: Docker (local dev)
+For local development or air-gapped environments.
+#### 1. Clone
+```bash
+git clone https://github.com/DMokong/r2mcp.git
+cd r2mcp
+npm install
+```
+#### 2. Configure
+```bash
+cp .env.example .env
+# Edit .env — set R2MCP_DATABASE_URL and R2MCP_OPENROUTER_API_KEY
+```
+#### 3. Start Postgres
+```bash
+docker compose up -d
+# Wait ~10s for healthy status
+```
+#### 4. Provision schema
+```bash
+npm run setup
+```
+This creates the `memories` table, pgvector indexes, and full-text search index. **Safe to re-run.**
+#### 5. Build
+```bash
+npm run build
+```
+#### 6. Register in Claude Code
+Add to your project's `.mcp.json`:
+```json
+{
+  "mcpServers": {
+    "memory": {
+      "command": "node",
+      "args": ["/path/to/r2mcp/dist/index.js"],
+      "env": {
+        "R2MCP_DATABASE_URL": "postgresql://r2mcp:r2mcp@localhost:5432/r2mcp",
+        "R2MCP_OPENROUTER_API_KEY": "${R2MCP_OPENROUTER_API_KEY}"
+      }
+    }
+  }
+}
+```
+(The local Docker DB URL contains no real secret; the OpenRouter key does — keep it in your environment via `${VAR}` expansion.)
+Restart Claude Code, then see [After setup](#after-setup-both-options).
+## After setup (both options)
+You now have `mcp__memory__remember`, `mcp__memory__recall`, etc. available. Two optional steps make memory actually get used:
+### Install the /remember skill (recommended)
+The bundled skill gives Claude a judgment pipeline for memory writes — classify → conflict-check → store. Copy it into your **consuming project** (the one whose `.mcp.json` registers r2mcp):
+```bash
+mkdir -p .claude/skills && cp -r /path/to/r2mcp/skills/remember .claude/skills/
+```
+Claude Code auto-discovers project skills from `.claude/skills/<name>/SKILL.md`. (For all your projects at once, use `~/.claude/skills/` instead.) Then `/remember <note>` persists memories through the full pipeline.
+### Teach your agent the session loop (recommended)
+r2mcp ships MCP server instructions that Claude Code loads automatically, so the agent knows the basics. For stronger habits, add a short protocol to your project's `CLAUDE.md`:
+```markdown
+## Memory
+This project has persistent memory via the `memory` MCP server.
+- At session start, `recall` context relevant to the task at hand.
+- When a durable decision, preference, or correction surfaces, `remember` it
+  (tier: preferences = decisions/style, project-context = architecture/state,
+  conversations = session continuity).
+- Run `/remember` before ending a work session to persist anything unsaved.
+```
+**First session on an empty database:** `recall` returning zero results is expected — start `remember`-ing as decisions come up and recall pays off within a session or two.
+## Configuration
+r2mcp reads its configuration from the MCP transport's environment — for
+consumers, **the `.mcp.json` `env` block is the primary config surface**
+(use `${VAR}` expansion for secrets):
+```json
+{
+  "mcpServers": {
+    "memory": {
+      "command": "node",
+      "args": ["./node_modules/r2mcp/dist/index.js"],
+      "env": {
+        "R2MCP_DATABASE_URL": "${R2MCP_DATABASE_URL}",
+        "R2MCP_OPENROUTER_API_KEY": "${R2MCP_OPENROUTER_API_KEY}",
+        "R2MCP_CLASSIFIER_PROVIDER": "claude-code",
+        "R2MCP_EDGE_MAX_USD": "1.00",
+        "R2MCP_COMPILE_MAX_USD": "1.00"
+      }
+    }
+  }
+}
+```
+| Variable | Required | What it does |
+|----------|----------|--------------|
+| `R2MCP_DATABASE_URL` | **Yes** | PostgreSQL + pgvector connection string. The server **fails fast at startup** if unset — it never guesses a database. |
+| `R2MCP_OPENROUTER_API_KEY` | Recommended | Enables semantic-search embeddings. When unset, the server logs a startup warning and `remember`/`recall` responses carry a `warnings[]` field — everything still works full-text. |
+| `R2MCP_CLAUDE_BIN` | Sometimes | Absolute path to the `claude` binary for the $0 Max-plan provider. Needed when the spawning process's PATH doesn't include it — common under launchd jobs and some MCP hosts (e.g. `~/.local/bin/claude`). The spawn error names this variable when it's the fix. |
+| `ANTHROPIC_API_KEY` | Optional | Only for `--provider=anthropic` on classifier/compile runs. |
+| `R2MCP_CLASSIFIER_PROVIDER` | Optional | Pin a provider (`claude-code` \| `anthropic` \| `openrouter`) instead of auto-fallback. |
+| `R2MCP_EDGE_MAX_USD` / `R2MCP_COMPILE_MAX_USD` / `R2MCP_ENTITY_MAX_USD` | Optional | Cost caps for the batch jobs (defaults `$1.00`). |
+The server also loads a `.env` file from its working directory at startup
+(non-clobbering — real environment variables always win). A `.env` at the
+r2mcp source root is the normal path for `npm run` scripts when working from
+a checkout; consumers configuring via `.mcp.json env` don't need one.
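
The non-clobbering rule ("real environment variables always win") can be illustrated with a small merge function. This is a sketch under stated assumptions: `mergeEnvFile` and the `Env` type are hypothetical names, the line parsing is deliberately simplistic (no quoting or `export` handling), and treating only `undefined` as "unset" is an assumption rather than the documented behavior of `loadEnvFile()`.

```typescript
type Env = Record<string, string | undefined>;

// Hypothetical helper: merges KEY=value lines into a copy of `env` without
// overwriting any key that already has a value.
function mergeEnvFile(env: Env, fileText: string): Env {
  const merged: Env = { ...env };
  for (const rawLine of fileText.split(/\r?\n/)) {
    const line = rawLine.trim();
    if (line === "" || line.startsWith("#")) continue; // skip blanks and comments
    const eq = line.indexOf("=");
    if (eq <= 0) continue; // no key before "="
    const key = line.slice(0, eq).trim();
    const value = line.slice(eq + 1).trim();
    // Non-clobbering: only fill keys the real environment left unset.
    if (merged[key] === undefined) merged[key] = value;
  }
  return merged;
}

// The first value stands in for one injected via the `.mcp.json` env block.
const realEnv: Env = { R2MCP_DATABASE_URL: "postgresql://from-mcp-json/db" };
const dotEnvText = [
  "# local checkout defaults",
  "R2MCP_DATABASE_URL=postgresql://r2mcp:r2mcp@localhost:5432/r2mcp",
  "R2MCP_EDGE_MAX_USD=0.25",
].join("\n");

const effectiveEnv = mergeEnvFile(realEnv, dotEnvText);
console.log(effectiveEnv["R2MCP_DATABASE_URL"]); // "postgresql://from-mcp-json/db"
console.log(effectiveEnv["R2MCP_EDGE_MAX_USD"]); // "0.25"
```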
+## Troubleshooting
+| Symptom | Cause & fix |
+|---------|-------------|
+| `connect ENETUNREACH 2406:...` during setup | Supabase Direct connection is IPv6-only (IPv4 is a paid add-on) and your network is IPv4-only. Use the **Session pooler** string instead: Dashboard → Connect → Session pooler (port 5432). Setup classifies this error and says the same. |
+| `Transaction-pooler URL detected (port 6543)` | The transaction pooler can't run DDL or prepared statements. Use the Session pooler (port 5432). |
+| `R2MCP_DATABASE_URL is not set` | Deliberate fail-fast — set it in `.mcp.json env` or `.env`. The error lists both surfaces and the Docker default URL. |
+| `embeddings disabled` warning at startup or in `warnings[]` | `R2MCP_OPENROUTER_API_KEY` is unset (or the embed call failed — the message distinguishes the two). Full-text search still works; set the key to enable semantic search. |
+| `could not spawn 'claude' (ENOENT)` on classifier/compile runs | The claude CLI isn't on the spawning process's PATH. Set `R2MCP_CLAUDE_BIN` to its absolute path. |
+| Fresh credentials rejected right after a Supabase password reset | The pooler caches auth-rejection state for 30–60s. Wait a minute and retry before assuming the rotation failed. |
+## Memory Tiers
+| Tier | What goes here | Auto-archived after |
+|------|---------------|---------------------|
+| `preferences` | Decisions, coding style, tool choices | Never |
+| `project-context` | Architecture, system state, what's built | 180 days |
+| `conversations` | Relationship continuity, session history | 90 days |
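
The tier table above maps naturally onto a lookup. The sketch below is illustrative only: `ARCHIVE_AFTER_DAYS` and `isDueForArchive` are hypothetical names, and both the strict `>` boundary and the choice of which timestamp the age is measured from are assumptions the README does not specify.

```typescript
type Tier = "preferences" | "project-context" | "conversations";

// Values taken from the table; `null` encodes "Never".
const ARCHIVE_AFTER_DAYS: Record<Tier, number | null> = {
  preferences: null,
  "project-context": 180,
  conversations: 90,
};

// Hypothetical check; whether the real cutoff is inclusive is an assumption.
function isDueForArchive(tier: Tier, ageDays: number): boolean {
  const limit = ARCHIVE_AFTER_DAYS[tier];
  return limit !== null && ageDays > limit;
}

console.log(isDueForArchive("preferences", 1000)); // false
console.log(isDueForArchive("project-context", 120)); // false
console.log(isDueForArchive("conversations", 120)); // true
```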
+## Tools Reference
+| Tool | Description |
+|------|-------------|
+| `remember` | Store/update/archive a memory with tier + metadata |
+| `recall` | Semantic + full-text search with progressive tier search; emits `signals[]` from typed edges |
+| `search` | Filter by type, tier, topics, date range |
+| `meditate` | Archive stale entries, find duplicates; pass `include_lint: true` to fold lint findings in |
+| `reject` | Mark a memory as rejected (excluded from future recall) |
+| `stats` | Health check — counts, staleness, embedding status |
+| `compile` | Regenerate browsable wiki views under `memory/compiled/` (SPEC-044, see below) |
+| `classify` | Classify candidate memory pairs into typed edges (supports, contradicts, supersedes, evolved_into, depends_on, related_to). Subprocess-spawned (SPEC-044 invariant). |
+| `dump_edges_sidecar` | In-process JSON dump of memory_edges + memories to a caller-supplied directory. Used by downstream consumers like Memory Explorer. |
+| `lint` | Surface structural feedback: contradictions, stale, orphans, drift, superseded_unflagged (SPEC-044, see below) |
+| `extract_entities` | Extract structured entities (project / person / tool / decision) from memories. Spawns the entity extractor driver via the shared `resolveCliCommand` helper. Inherits cost cap (`R2MCP_ENTITY_MAX_USD`, default $1.00) and resumability from SPEC-043. Top-N known entities (`R2MCP_ENTITY_CONTEXT_TOP_N`, default 100) seed the LLM context. (SPEC-046, see below) |
+| `recall` (extended) | Accepts an optional `entity` parameter that narrows results to memories linked to a named entity (matched by canonical name or any alias). When `entity` is set, `query` is optional. Response gains `entity_resolved: boolean`, optional `entity_id`, and per-result `entity_links[]`. (SPEC-046) |
+## Recall v2 — semantic + budget-aware retrieval
+`recall()` is the workhorse retrieval tool. v2 (xMemory-inspired, 2026-04) layers four retrieval shapes on top of the underlying hybrid semantic + full-text search:
+### 1. Relevance floor — `min_score`
+Filter out low-quality matches before they're returned. Without this, semantic search dumps a long tail of weakly-related results.
+```ts
+recall({ query: "edge classifier cost cap", min_score: 0.3 })
+```
+Suggested defaults: `0.3` for semantic queries, `0.1` for keyword-driven ones.
+### 2. MMR diversity — `diversity` (lambda 0.0–1.0)
+Maximal Marginal Relevance reranks results to balance relevance against redundancy. `1.0` is pure relevance (may return three near-duplicates of the top hit); `0.0` is pure diversity (spreads coverage); the default `0.7` favors relevance with mild diversification.
+```ts
+recall({ query: "memory architecture", diversity: 0.5, top_k: 8 })
+```
+Use lower values when you want broad coverage of a topic, higher when you want the single best answer plus close runners-up.
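
To make the effect of the lambda concrete, here is a minimal sketch of the textbook MMR greedy loop. It is not taken from `dist/tools/recall.js`: the `Candidate` shape, `cosine`, and `mmrRerank` are hypothetical, and the scoring formula (lambda times relevance, minus one-minus-lambda times the highest similarity to anything already selected) is the standard formulation, assumed here rather than confirmed for r2mcp.

```typescript
interface Candidate {
  id: string;
  relevance: number; // similarity to the query, assumed precomputed
  embedding: number[];
}

function cosine(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  const n = Math.min(a.length, b.length);
  for (let i = 0; i < n; i++) {
    const x = a[i] ?? 0;
    const y = b[i] ?? 0;
    dot += x * y;
    normA += x * x;
    normB += y * y;
  }
  return normA === 0 || normB === 0 ? 0 : dot / Math.sqrt(normA * normB);
}

// Greedy MMR: repeatedly pick the candidate with the best
// lambda * relevance - (1 - lambda) * (max similarity to already-selected items).
function mmrRerank(candidates: Candidate[], lambda: number, topK: number): Candidate[] {
  const remaining = [...candidates];
  const selected: Candidate[] = [];
  while (selected.length < topK && remaining.length > 0) {
    let bestIndex = 0;
    let bestScore = -Infinity;
    for (let i = 0; i < remaining.length; i++) {
      const candidate = remaining[i];
      if (candidate === undefined) continue;
      let redundancy = 0;
      for (const chosen of selected) {
        redundancy = Math.max(redundancy, cosine(candidate.embedding, chosen.embedding));
      }
      const score = lambda * candidate.relevance - (1 - lambda) * redundancy;
      if (score > bestScore) {
        bestScore = score;
        bestIndex = i;
      }
    }
    selected.push(...remaining.splice(bestIndex, 1));
  }
  return selected;
}

// "b" is a near-duplicate of "a"; "c" is less relevant but unrelated to both.
const pool: Candidate[] = [
  { id: "a", relevance: 0.9, embedding: [1, 0] },
  { id: "b", relevance: 0.85, embedding: [1, 0.01] },
  { id: "c", relevance: 0.6, embedding: [0, 1] },
];

const pureRelevance = mmrRerank(pool, 1.0, 2).map((c) => c.id).join(",");
const diversified = mmrRerank(pool, 0.5, 2).map((c) => c.id).join(",");
console.log(pureRelevance); // "a,b"
console.log(diversified); // "a,c"
```

At lambda 1.0 the near-duplicate takes the second slot; at 0.5 the redundancy penalty pushes it below the unrelated candidate.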
+### 3. Context budget — `max_tokens`
+Token-budget retrieval: walks MMR-reranked results in score order and stops when adding the next result would exceed the budget. Returns `tokens_used` in the response so you know how much you actually pulled.
+```ts
+recall({ query: "what we learned about classifiers", max_tokens: 4000 })
+// → up to N results, summing to ≤4000 tokens, prioritized by relevance × diversity
+```
+This is the right call when you're stuffing recall results into a downstream prompt and have a hard context limit. `top_k` is ignored when `max_tokens` is set — the budget decides the cut.
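
The budget walk described above can be sketched as a single loop that stops at the first result that would overflow. The names `RankedMemory`, `estimateTokens`, and `takeWithinBudget` are hypothetical, and the four-characters-per-token estimate is a rough stand-in; how r2mcp actually counts tokens is not shown in this README.

```typescript
interface RankedMemory {
  id: string;
  text: string;
}

// Assumption: roughly 4 characters per token. A real tokenizer would differ.
const estimateTokens = (text: string): number => Math.ceil(text.length / 4);

// Walks results in their ranked order and stops at the first one that would
// push the running total past the budget (it does not skip ahead to smaller items).
function takeWithinBudget(
  ranked: RankedMemory[],
  maxTokens: number,
): { results: RankedMemory[]; tokensUsed: number } {
  const results: RankedMemory[] = [];
  let tokensUsed = 0;
  for (const memory of ranked) {
    const cost = estimateTokens(memory.text);
    if (tokensUsed + cost > maxTokens) break;
    results.push(memory);
    tokensUsed += cost;
  }
  return { results, tokensUsed };
}

const rankedMemories: RankedMemory[] = [
  { id: "m1", text: "x".repeat(40) }, // ~10 tokens
  { id: "m2", text: "x".repeat(40) }, // ~10 tokens
  { id: "m3", text: "x".repeat(100) }, // ~25 tokens, would overflow a budget of 30
  { id: "m4", text: "x".repeat(8) }, // ~2 tokens, never reached
];

const budgeted = takeWithinBudget(rankedMemories, 30);
console.log(budgeted.results.map((m) => m.id).join(",")); // "m1,m2"
console.log(budgeted.tokensUsed); // 20
```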
+### 4. Progressive tier search — `progressive` + `confidence_threshold`
+Top-down retrieval through the tier hierarchy (`preferences` → `project-context` → `conversations`). High-confidence matches in `preferences` short-circuit the search before lower tiers are consulted, following the xMemory observation that decisions and preferences usually answer a question before context or history is needed.
+```ts
+recall({ query: "do we use bun or npm", progressive: true, confidence_threshold: 0.82 })
+// → returns immediately if a preferences-tier match scores ≥0.82, else widens to project-context, then conversations
+```
+Progressive search is the default behavior. Set `progressive: false` to force a full sweep across tiers, or pin a single tier with `tier: 'preferences'`.
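
The early-stop behavior can be sketched as a loop over the tier order. This is an illustration, not the package's implementation: `progressiveSearch`, `Hit`, and the injected `searchTier` callback are hypothetical, and two details are assumptions the README does not settle, namely that results accumulate as the search widens and that a confident hit in `project-context` also stops the search before `conversations`.

```typescript
type Tier = "preferences" | "project-context" | "conversations";
const TIER_ORDER: readonly Tier[] = ["preferences", "project-context", "conversations"];

interface Hit {
  id: string;
  tier: Tier;
  score: number;
}

// `searchTier` stands in for the per-tier hybrid search, which is not shown here.
function progressiveSearch(
  searchTier: (tier: Tier) => Hit[],
  confidenceThreshold: number,
): { hits: Hit[]; tiersSearched: Tier[] } {
  const hits: Hit[] = [];
  const tiersSearched: Tier[] = [];
  for (const tier of TIER_ORDER) {
    tiersSearched.push(tier);
    hits.push(...searchTier(tier));
    // Stop widening once any accumulated hit meets the confidence threshold.
    if (hits.some((hit) => hit.score >= confidenceThreshold)) break;
  }
  return { hits, tiersSearched };
}

const fakeStore: Record<Tier, Hit[]> = {
  preferences: [{ id: "p1", tier: "preferences", score: 0.9 }],
  "project-context": [{ id: "c1", tier: "project-context", score: 0.7 }],
  conversations: [{ id: "h1", tier: "conversations", score: 0.5 }],
};

const confident = progressiveSearch((tier) => fakeStore[tier], 0.82);
const widened = progressiveSearch((tier) => fakeStore[tier], 0.95);
console.log(confident.tiersSearched.join(",")); // "preferences"
console.log(widened.tiersSearched.join(",")); // "preferences,project-context,conversations"
```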
+### Composing them
+The four retrieval shapes compose in a single call:
+```ts
+recall({
+  query: "spec-bench cleanup conventions",
+  min_score: 0.3,           // drop weak matches
+  diversity: 0.6,           // some diversification
+  max_tokens: 3000,         // fit in context
+  progressive: true,        // early-stop on prefs hits
+  confidence_threshold: 0.82,
+})
+```
+In addition, the `signals[]` field on the response surfaces typed memory edges (`contradicts`, `superseded_by`) on the returned memories so callers can flag conflicts inline.
+## Cross-Project Memory
+All projects pointing at the same `R2MCP_DATABASE_URL` share a single memory pool. This is intentional — your knowledge travels with you. Namespace isolation is a v2 roadmap item.
+## OpenTelemetry (optional)
+Enable OTel tracing and metrics:
+```bash
+OTEL_ENABLED=true
+OTEL_EXPORTER_OTLP_ENDPOINT=http://localhost:4318
+```
+Metrics use the `r2mcp.memory.*` namespace.
+## Prior Art & Acknowledgements
+r2mcp stands on the shoulders of two projects:
+**[Open Brain](https://github.com/NateBJones-Projects/OB1) by [Nate B. Jones](https://natesnewsletter.substack.com/)**
+The core architectural insight — "one database, any AI plugs in" — comes from Open Brain. The idea that your knowledge layer should be sovereign and portable (not locked inside a specific tool) is the founding premise of r2mcp. Open Brain proved the PostgreSQL + pgvector substrate works for personal AI memory at minimal cost ($0.10–0.30/month). r2mcp narrows the scope to Claude Code's MCP protocol and adds a more opinionated retrieval layer on top of that foundation.
+**[xMemory](https://arxiv.org/abs/2602.02007) — "Beyond RAG for Agent Memory: Retrieval by Decoupling and Aggregation"**
+Hu et al. (2026) established the hierarchical tier approach and showed that progressive top-down retrieval with coverage maximization + redundancy minimization cuts token usage ~50% vs. flat RAG while improving accuracy. r2mcp's 3-tier memory (preferences → project-context → conversations) is a hand-crafted simplification of their 4-level hierarchy (messages → episodes → semantics → themes). The MMR diversity reranking in `recall()` directly implements their redundancy minimization insight.
+## Migrating from ClaudeClaw
+If you're moving from the ClaudeClaw-internal `memory-mcp-server`:
+```bash
+R2MCP_DATABASE_URL=<your-new-url> npx tsx src/cli/migrate.ts /path/to/your/memory/
+```
+The migration script reads `preferences.md`, `project-context.md`, and `conversations.md` from the specified directory and imports them. It's idempotent — safe to re-run.
+## Memory edges (SPEC-043)
+r2mcp supports a typed-relation table (`memory_edges`) that captures structural
+relations between memories — `contradicts`, `supersedes`, `supports`,
+`evolved_into`, `depends_on`, `related_to`. The `recall()` MCP tool surfaces
+`contradicts` / `superseded_by` relations as an optional `signals[]` field on
+the response (additive — existing clients work unchanged).
+### Running the classifier
+The classifier is a manual batch process — it is NOT invoked from the MCP server
+hot path. Provider selection follows the SPEC-044 precedence (see below); on a
+Max plan, no API key is required.
+```bash
+# Estimate cost without making API calls or writing edges
+npm run edges:classify -- --dry-run
+# Auto-fallback: prefers claude-code (Max plan, $0/call)
+npm run edges:classify -- --max-cost=1.00
+# Force a specific provider
+npm run edges:classify -- --provider=anthropic --max-cost=1.00
+npm run edges:classify -- --provider=openrouter --max-cost=1.00
+# Incremental run on memories from the last 7 days
+npm run edges:classify -- --since=7d --max-cost=0.25
+# Resume a prior run that hit its cap (the run_id is printed at exit and stored in
+# data/edges-state.last-run)
+npm run edges:classify -- --resume=<run_id>
+```
+State and run summaries are written under `data/edges-state.*` (JSONL append-log,
+last-run sidecar, per-run JSON summary at `data/edges-state.runs/<run_id>.json`).
+## LLM provider abstraction (SPEC-044)
+The classifier and wiki compiler share a small `LLMProvider` abstraction with
+three adapters. Providers run from standalone Node processes only — the MCP
+server itself never makes LLM calls.
+| Adapter | Auth | Cost per call | Concurrency cap |
+|---------|------|---------------|------------------|
+| `claude-code` | Claude Code OAuth (Max plan) | **$0** (strict equality) | 2 (subprocess overhead) |
+| `anthropic` | `ANTHROPIC_API_KEY` | Per-token (list price) | 10 |
+| `openrouter` | `R2MCP_OPENROUTER_API_KEY` | Per-token (list price) | 10 |
+### Selection precedence
+1. `--provider=<name>` CLI flag — highest priority
+2. `R2MCP_CLASSIFIER_PROVIDER` environment variable
+3. Auto-fallback: `claude-code` if logged in → `anthropic` if API key set →
+   `openrouter` if API key set → fatal error naming all three remediation paths
+The fallback prefers `claude-code` so a Max-plan user pays nothing by default.
+OpenRouter's primary role remains text→vector embeddings. Its classifier /
+compile use is opt-in per invocation, never auto-routed for embeddings.
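
The precedence rules above can be sketched as one resolution function. The function name, the `SelectionInput` shape, and the `claudeLoggedIn` flag are hypothetical; how r2mcp detects a Claude Code login and how it reports an unknown provider name are not described here, so both are assumptions.

```typescript
type ProviderName = "claude-code" | "anthropic" | "openrouter";
const PROVIDERS: readonly string[] = ["claude-code", "anthropic", "openrouter"];

interface SelectionInput {
  cliFlag?: string; // value of --provider=<name>, if given
  env: Record<string, string | undefined>;
  claudeLoggedIn: boolean; // assumed to be detected elsewhere
}

function isProviderName(value: string): value is ProviderName {
  return PROVIDERS.indexOf(value) !== -1;
}

function selectProvider(input: SelectionInput): ProviderName {
  // 1. CLI flag, then 2. environment variable.
  const pinned = input.cliFlag ?? input.env["R2MCP_CLASSIFIER_PROVIDER"];
  if (pinned !== undefined) {
    if (!isProviderName(pinned)) throw new Error(`unknown provider: ${pinned}`);
    return pinned;
  }
  // 3. Auto-fallback, preferring the $0 provider.
  if (input.claudeLoggedIn) return "claude-code";
  if (input.env["ANTHROPIC_API_KEY"]) return "anthropic";
  if (input.env["R2MCP_OPENROUTER_API_KEY"]) return "openrouter";
  throw new Error(
    "no LLM provider available: log in to Claude Code, set ANTHROPIC_API_KEY, " +
      "or set R2MCP_OPENROUTER_API_KEY",
  );
}

const flagWins = selectProvider({
  cliFlag: "openrouter",
  env: { R2MCP_CLASSIFIER_PROVIDER: "anthropic" },
  claudeLoggedIn: true,
});
const envWins = selectProvider({
  env: { R2MCP_CLASSIFIER_PROVIDER: "anthropic" },
  claudeLoggedIn: true,
});
const autoFallback = selectProvider({
  env: { ANTHROPIC_API_KEY: "placeholder", R2MCP_OPENROUTER_API_KEY: "placeholder" },
  claudeLoggedIn: false,
});
console.log(flagWins); // "openrouter"
console.log(envWins); // "anthropic"
console.log(autoFallback); // "anthropic"
```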
+## Wiki compile (SPEC-044)
+`compile()` regenerates browsable markdown views of the memory store from
+pgvector. Output goes to `memory/compiled/` (gitignored, regenerable).
+```bash
+# Compile all three tier files (preferences.md, project-context.md, conversations.md)
+npm run compile-wiki -- --all
+# Compile a single tier
+npm run compile-wiki -- --tier=preferences
+# Compile a per-topic page
+npm run compile-wiki -- --topic=wiki-mode
+# Preview without writing
+npm run compile-wiki -- --all --dry-run
+# Force a provider (otherwise uses auto-fallback)
+npm run compile-wiki -- --all --provider=claude-code
+```
+### Output shape
+Every compiled file carries YAML frontmatter recording `generated_at`,
+`compile_run_id`, `source_count`, `source_memory_ids`, `provider`,
+`source_git_sha`, and `tier` or `topic`. The body is structured prose with
+inline `<m:id>` citations and a `Sources:` line per cluster.
+### Structural stability
+Compile is treated as a regenerable view: across two runs against the same
+input, the set of `## H2` / `### H3` headers and the set of cited memory IDs
+are bit-identical. Prose-level variance is bounded at 5% (Levenshtein ratio
+≥ 0.95) — the only LLM nondeterminism allowance. The compiler controls
+headers and citations; only the prose paragraphs come from the LLM.
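
A prose-variance check of this kind can be sketched with a plain Levenshtein distance. The README does not define "Levenshtein ratio", so the formula used here (one minus distance over the longer length) is an assumption, and `levenshtein` and `similarityRatio` are hypothetical names rather than functions from the package.

```typescript
// Classic two-row dynamic-programming edit distance.
function levenshtein(a: string, b: string): number {
  let prev: number[] = Array.from({ length: b.length + 1 }, (_, j) => j);
  for (let i = 1; i <= a.length; i++) {
    const curr: number[] = [i];
    for (let j = 1; j <= b.length; j++) {
      const substitution = (prev[j - 1] ?? 0) + (a[i - 1] === b[j - 1] ? 0 : 1);
      const deletion = (prev[j] ?? 0) + 1;
      const insertion = (curr[j - 1] ?? 0) + 1;
      curr.push(Math.min(substitution, deletion, insertion));
    }
    prev = curr;
  }
  return prev[b.length] ?? 0;
}

// Assumed definition of the ratio: 1 - distance / length of the longer string.
function similarityRatio(a: string, b: string): number {
  const longest = Math.max(a.length, b.length);
  return longest === 0 ? 1 : 1 - levenshtein(a, b) / longest;
}

const baseline = "a".repeat(40);
const oneEdit = "a".repeat(39) + "b"; // 1 of 40 characters differs
const threeEdits = "a".repeat(37) + "bbb"; // 3 of 40 characters differ

console.log(levenshtein("kitten", "sitting")); // 3
console.log(similarityRatio(baseline, oneEdit) >= 0.95); // true
console.log(similarityRatio(baseline, threeEdits) >= 0.95); // false
```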
+### Cost cap
+`R2MCP_COMPILE_MAX_USD` (default `$1.00`) — when exceeded mid-run, compile
+exits cleanly with `hit_cost_cap: true` and partial files. Same shape as the
+classifier cap-hit behavior.
+### What compile never does
+- Modifies `memory/MEMORY.md` — the human-curated hub stays invariant
+- Writes outside `memory/compiled/`
+- Touches the live `memories` or `memory_edges` tables — read-only at the DB layer
+- Uses any direct Anthropic SDK call — every synthesis routes through `LLMProvider`
+## Lint (SPEC-044)
+`lint()` surfaces five structural checks on the memory store. SQL-only — no
+LLM calls, no cost cap.
+```bash
+# Run all checks against the live DB and produce a human-readable report
+npm run lint:memory
+# Run a single check
+npm run lint:memory -- --check=stale
+# Apply auto-fixes for high-confidence findings
+npm run lint:memory -- --fix
+```
+| Check | What it surfaces |
+|-------|-------------------|
+| `contradictions` | Edges where `relation='contradicts'` between two unarchived memories |
+| `stale` | Memories older than 90d with zero incoming edges, tier ≠ preferences |
+| `orphans` | Memories with zero edges in either direction, older than 30d |
+| `drift` | Pairs sharing ≥2 topics with no edge yet — classifier hasn't run on this pair |
+| `superseded_unflagged` | `contradicts` edge where the temporal pattern says it should be `supersedes` |
+### `--fix` semantics
+`lint --fix` only acts on findings with `confidence ≥ 0.9`:
+- `stale` → memory is archived (`type='archived'`)
+- `superseded_unflagged` → edge type is rewritten from `contradicts` to `supersedes`
+Lower-confidence findings are returned as suggestions only, never auto-acted.
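
The `--fix` rules above amount to a partition of findings into actions and suggestions. The sketch below uses hypothetical types (`Finding`, `FixAction`) and a hypothetical `planFixes` function; treating high-confidence findings from the other three checks as suggestions is an assumption, since the README lists auto-fixes only for `stale` and `superseded_unflagged`.

```typescript
type CheckName =
  | "contradictions"
  | "stale"
  | "orphans"
  | "drift"
  | "superseded_unflagged";

interface Finding {
  check: CheckName;
  confidence: number;
  targetId: string; // memory id or edge id, depending on the check (assumed)
}

type FixAction =
  | { kind: "archive_memory"; memoryId: string }
  | { kind: "rewrite_edge_to_supersedes"; edgeId: string };

const FIX_CONFIDENCE_THRESHOLD = 0.9;

// Splits findings into auto-fix actions and suggestions; nothing is executed here.
function planFixes(findings: Finding[]): { actions: FixAction[]; suggestions: Finding[] } {
  const actions: FixAction[] = [];
  const suggestions: Finding[] = [];
  for (const finding of findings) {
    const confident = finding.confidence >= FIX_CONFIDENCE_THRESHOLD;
    if (confident && finding.check === "stale") {
      actions.push({ kind: "archive_memory", memoryId: finding.targetId });
    } else if (confident && finding.check === "superseded_unflagged") {
      actions.push({ kind: "rewrite_edge_to_supersedes", edgeId: finding.targetId });
    } else {
      suggestions.push(finding); // low confidence, or a check with no listed auto-fix
    }
  }
  return { actions, suggestions };
}

const plan = planFixes([
  { check: "stale", confidence: 0.95, targetId: "mem-1" },
  { check: "stale", confidence: 0.6, targetId: "mem-2" },
  { check: "superseded_unflagged", confidence: 0.9, targetId: "edge-7" },
  { check: "orphans", confidence: 0.99, targetId: "mem-3" },
]);
console.log(plan.actions.map((a) => a.kind).join(",")); // "archive_memory,rewrite_edge_to_supersedes"
console.log(plan.suggestions.map((f) => f.targetId).join(",")); // "mem-2,mem-3"
```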
+### `meditate` integration
+`meditate({include_lint: true})` runs lint first and surfaces findings as a
+`lint_findings` field on the response. The default invocation
+(`meditate({mode: 'full', dry_run: false})`) returns the byte-identical
+pre-spec response shape — backward compatibility for direct callers is
+preserved.
+## Entity extraction (SPEC-046)
+Light entity extraction over the memory store — pulls structured `project` /
+`person` / `tool` / `decision` entities out of memories, persists them to two
+new tables (`entities` for canonical names + aliases, `memory_entities` for the
+M:N link to `memories`), and lets `recall()` filter on entity name or alias.
+The extractor is a subprocess-driven batch process — the MCP server itself
+never makes LLM calls. Provider selection follows the SPEC-044 precedence; on a
+Max plan, no API key is required.
+```bash
+# One-shot batch extraction over the last week, capped at $0.50
+npm run entities:extract -- --since-days=7 --max-cost=0.5
+# Or via MCP tool from any client
+# mcp.callTool('extract_entities', { since_days: 7, max_cost_usd: 0.5 })
+# Then ask for Speculator-scoped recall
+# mcp.callTool('recall', { entity: 'Speculator', query: 'compaction' })
+```
+### Env vars
+| Variable | Default | What it controls |
+|----------|---------|------------------|
+| `R2MCP_ENTITY_MAX_USD` | `1.00` | Cost cap for a single extraction run. On overrun, the run exits cleanly with `hit_cost_cap: true` (same shape as the classifier and compile caps). |
+| `R2MCP_ENTITY_CONTEXT_TOP_N` | `100` | Number of known entities seeded into the LLM context to bias toward canonical names + alias merging. |
+### Scoped recall
+When `recall()` is called with `entity` set:
+- `query` is optional — entity-only recall returns all memories linked to the entity (matched by canonical name or any alias).
+- The response carries `entity_resolved: boolean` and, when resolved, `entity_id`.
+- Each result carries an `entity_links[]` array describing how that memory connects to the named entity.
+## License
+[MIT](LICENSE)