npm - @unerr-ai/unerr - Versions diffs - 0.1.9 → 0.2.1 - Mend

@unerr-ai/unerr 0.1.9 → 0.2.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (5)

package/README.md +156 -333
package/dist/cli.js +14570 -4208
package/dist/rules/developer.mozilla.org.json +9 -0
package/dist/rules/github.com.json +21 -0
package/package.json +7 -2

package/README.md CHANGED

@@ -1,22 +1,24 @@
 <p align="center">
-  <a href="https://www.unerr.dev/"><img src="https://unerr.dev/icon-wordmark.svg" alt="unerr — local intelligence layer for AI coding agents" width="320" /></a>
+  <a href="https://www.unerr.dev/"><img src="https://unerr.dev/icon-wordmark.svg" alt="unerr — operational memory for your codebase" width="320" /></a>
 </p>
 <p align="center">
-  <strong>Lands your AI agent at the right code in fewer turns, tokens, & breakages.</strong>
+  <strong>Stop babysitting your AI.</strong>
 </p>
 <p align="center">
-  A local intelligence layer that sits between your AI agent and your codebase —<br/>
-  indexes every call, remembers every decision, and gets sharper the longer you use it.
+  <strong>unerr is operational memory for your codebase</strong> — one local runtime that sits <em>behind</em> every MCP<br/>
+  your coding agent already speaks, carrying a shared code graph, persistent memory,<br/>
+  drift detection, and the guardrails the protocol itself doesn't.
+</p>
+<p align="center">
+  <sub><strong>Works with</strong> Cursor · Claude Code · Windsurf · Gemini CLI · Antigravity · GitHub Copilot CLI · and every MCP-compatible client.</sub>
 </p>
 <p align="center">
-  <a href="https://www.unerr.dev/"><img src="https://img.shields.io/badge/website-unerr.dev-8B5CF6?style=flat-square&logo=icloud&logoColor=white" alt="Website" /></a>
   <a href="https://www.npmjs.com/package/@unerr-ai/unerr"><img src="https://img.shields.io/badge/install-npm_i_@unerr--ai/unerr-8B5CF6?style=flat-square&logo=npm" alt="Install" /></a>
-  <a href="https://discord.gg/JfZ4pYgb"><img src="https://img.shields.io/badge/community-Discord-5865F2?style=flat-square&logo=discord&logoColor=white" alt="Discord" /></a>
-  <a href="https://x.com/unerr_ai"><img src="https://img.shields.io/badge/follow-@unerr__ai-000000?style=flat-square&logo=x&logoColor=white" alt="X / Twitter" /></a>
-  <a href="https://www.linkedin.com/company/unerr"><img src="https://img.shields.io/badge/linkedin-unerr-0A66C2?style=flat-square&logo=linkedin&logoColor=white" alt="LinkedIn" /></a>
+  <a href="https://www.unerr.dev/"><img src="https://img.shields.io/badge/website-unerr.dev-8B5CF6?style=flat-square&logo=icloud&logoColor=white" alt="Website" /></a>
   <img src="https://img.shields.io/badge/runtime-Node.js_≥20-339933?style=flat-square&logo=node.js&logoColor=white" alt="Node.js" />
   <img src="https://img.shields.io/badge/protocol-MCP-7C3AED?style=flat-square" alt="MCP" />
   <img src="https://img.shields.io/badge/local--first-no_cloud-22D3EE?style=flat-square" alt="Local-first" />
@@ -25,86 +27,81 @@
 <p align="center">
   <code>npm install -g @unerr-ai/unerr</code>
+  <br /><br />
+  <sub>Zero configuration. Install, restart your IDE, and the next prompt is smarter.</sub>
+</p>
+<p align="center">
+  <sub>Measured, not estimated: removes <strong>86–90%</strong> of the tokens an agent spends navigating code —<br/>
+  and wins head-to-head against other code-intelligence tools on the same corpus. <a href="./benchmarks/README.md">See the benchmarks →</a></sub>
 </p>
 ---
-## The agent isn't stupid. It's flying blind.
+## The pains this fixes
-Watch any AI coding session for ten minutes and you'll see the same loop:
+You've felt all four of these in the last 48 hours:
-- It **reads 30 files** to find one function — burning the context window before it writes a line.
-- It **edits something with 40 callers** and never knows it just broke three services.
-- It **re-derives the same conventions** you taught it yesterday, this morning, and an hour ago.
-- It **forgets the entire session** the moment the window closes.
+- Claude is brilliant for 20 minutes, then hallucinates a duplicate component and forgets the styling rules you set five turns ago.
+- More time spent writing `MEMORY.md`, updating `.cursorrules`, and pasting session summaries than writing code.
+- The agent reads a 2,000-line file to find a 5-line function, then still doesn't know that function has 24 callers across three services.
+- You don't trust the agent to refactor anything important. It treats your codebase like a flat string of text — locally correct, globally wrong.
-Every one of these is the same root cause: the agent has **no persistent memory of your code, your team's style, or its own past mistakes**. unerr is that memory. One process, fully local, indexed in seconds — and your agent picks it up automatically through MCP.
+These aren't four problems. They're one: today's agents are incredibly smart but structurally blind and severely amnesiac. They grep when a senior engineer would check the call graph. They forget on Tuesday what they learned on Monday.
 ---
-## What you actually see
+## What changes when you install it
-Run `unerr` and open the dashboard. Four panes, all live:
+| You feel | What unerr does |
+|---|---|
+| **Trust returns.** The agent runs for an hour without you watching. | Every edit is preceded by a graph lookup. All 24 callers are visible *before* it touches the function. Refactors stop rippling silently. |
+| **The babysitter tax disappears.** You delete `MEMORY.md` and `.cursorrules`. | A local fact store remembers what you decided, what failed, and the conventions the team accreted — with decay-adjusted confidence. Open the laptop on Tuesday and the agent already knows what you decided on Monday. |
+| **The agent stays sharp at turn 50.** | `file_read({entity})` returns 200 lines instead of 3,000. Shell output is compressed 93% on average. The context window stays uncluttered, so the model isn't fighting "lost in the middle." |
+| **Tool sprawl dies.** | One graph, one set of tools, project-aware routing. Five MCP servers no longer compete for the agent's attention. |
-| Pane | Answers the question | Powered by |
-|---|---|---|
-| **Token Optimization** | *How much context did unerr save my agent this session?* — saved vs. delivered, compounding multiplier, breakdown by mechanism (compression, graph hits, skipped re-reads, web fetches). | Per-turn ledger of every tool call |
-| **Reasoning Quality** | *Did the agent actually use what it remembered?* — 4-pillar score across exploration, planning, execution, persistent memory. | 5-turn outcome window per fact/convention |
-| **Codebase Map + Code Intelligence** | *What's the call graph and where are the blast-radius landmines?* — entities, edges, fan-in/out chokepoints, cross-module surprise links. | CozoDB graph (in-process, <5ms) |
-| **Project Memory + Activity** | *What did we already learn, and what was I doing last time?* — facts the agent recorded, sessions stitched into intents, open blockers. | Append-only fact store + timeline.db |
+**What it looks like in your chat:**
+> ⚡ unerr · cascade guard: `PaymentGateway` has 8 callers across 3 services. Call `get_references({direction:'callers'})` before the edit — refactor it locally and 7 sites break silently.
+The outcome you get is **agents that behave like senior engineers** — checking dependencies before editing, remembering project history, refusing to thrash on a function they've already failed on three times.
+---
+## See it in action
+Two places unerr shows up so you know it's working — inside the chat, and in a browser.
-The agent reads from the same store through MCP — every claim on the dashboard is also a tool call it just made.
+**Inside the chat.** Every coding turn opens with one line naming what unerr loaded ("loaded a convention you wrote yesterday for `src/proxy/proxy.ts`…") and closes with one line totalling what it saved you ("this turn: 2 catches · ≈ 4.2k tokens saved · +5 turns of headroom this session"). Catches are *named, countable events*, not a ratio.
-### See it in action
+**In a browser.** A live dashboard at `http://localhost:9847` reads from the same store the agent reads from over MCP — the graph it navigates, the facts it remembers, the tokens it didn't have to chew through, and the score showing which of those facts actually shaped the next answer.
 <p align="center">
-  <img src="https://unerr.dev/open-cli/video/unerr_short.gif" alt="unerr in action" width="720" />
+  <img src="https://unerr.dev/open-cli/screenshots/dashboard.png" alt="unerr dashboard — live overview" width="300" />
+  <br/><sub><strong>Dashboard</strong> · live overview — active sessions, recent tool calls, tokens the agent skipped this turn.</sub>
 </p>
-<table align="center">
-  <tr>
-    <td align="center" width="240">
-      <img src="https://unerr.dev/open-cli/screenshots/dashboard.png" alt="unerr dashboard — live overview" width="240" height="150" />
-      <br/><sub><strong>Dashboard</strong><br/>Live overview — active sessions, recent tool calls, tokens saved this turn.</sub>
-    </td>
-    <td align="center" width="240">
-      <img src="https://unerr.dev/open-cli/screenshots/token-trace-main.png" alt="unerr token trace — global" width="240" height="150" />
-      <br/><sub><strong>Token Trace · global</strong><br/>Aggregate savings across every session, broken down by mechanism (graph, file_read, fetch_url, shell, dedup, format).</sub>
-    </td>
-    <td align="center" width="240">
-      <img src="https://unerr.dev/open-cli/screenshots/token-session.png" alt="unerr token trace — session" width="240" height="150" />
-      <br/><sub><strong>Token Trace · session</strong><br/>Single session: per-turn savings, mechanism mix, and the compounding multiplier.</sub>
-    </td>
-    <td align="center" width="240">
-      <img src="https://unerr.dev/open-cli/screenshots/token-turn.png" alt="unerr token trace — turn" width="240" height="150" />
-      <br/><sub><strong>Token Trace · turn</strong><br/>Single turn: which tool calls fired, tokens each would have cost without unerr vs what was delivered.</sub>
-    </td>
-  </tr>
-  <tr>
-    <td align="center" width="240">
-      <img src="https://unerr.dev/open-cli/screenshots/reasoning-quality.png" alt="unerr reasoning quality — global" width="240" height="150" />
-      <br/><sub><strong>Reasoning Quality · global</strong><br/>Four-pillar score across cleaner context, fewer wasted turns, fewer breakages, persistent memory.</sub>
-    </td>
-    <td align="center" width="240">
-      <img src="https://unerr.dev/open-cli/screenshots/reasoning-session.png" alt="unerr reasoning quality — session" width="240" height="150" />
-      <br/><sub><strong>Reasoning Quality · session</strong><br/>Per-session: which facts and conventions were reinforced, acted on, ignored, or corrected.</sub>
-    </td>
-    <td align="center" width="240">
-      <img src="https://unerr.dev/open-cli/screenshots/code-base-intelligence.png" alt="unerr code intelligence" width="240" height="150" />
-      <br/><sub><strong>Code Intelligence</strong><br/>Call graph, fan-in/out chokepoints, cross-module surprise links, and a risk grade per file.</sub>
-    </td>
-    <td align="center" width="240">
-      <img src="https://unerr.dev/open-cli/screenshots/project-memory.png" alt="unerr project memory — facts" width="240" height="150" />
-      <br/><sub><strong>Project Memory</strong><br/>Conventions, anti-patterns, decisions — with decay-adjusted confidence and reinforcement counts.</sub>
-    </td>
-  </tr>
-  <tr>
-    <td align="center" width="240" colspan="4">
-      <img src="https://unerr.dev/open-cli/screenshots/activity.png" alt="unerr activity — timeline + heatmap" width="240" height="150" />
-      <br/><sub><strong>Activity</strong><br/>Turn-grouped timeline with a 30-day heatmap — each row is one burst of agent work (intent → tools → outcome).</sub>
-    </td>
-  </tr>
-</table>
+<p align="center">
+  <img src="https://unerr.dev/open-cli/screenshots/activity.png" alt="unerr activity — session timeline" width="300" />
+  <br/><sub><strong>Activity</strong> · session timeline — every tool call, marker, and catch in order, replayable across sessions.</sub>
+</p>
+<p align="center">
+  <img src="https://unerr.dev/open-cli/screenshots/token-trace-main.png" alt="unerr token trace" width="300" />
+  <br/><sub><strong>Token Trace</strong> · context kept out of the window, broken down by mechanism — graph hits, skipped re-reads, compressed shell output, deduped fetches.</sub>
+</p>
+<p align="center">
+  <img src="https://unerr.dev/open-cli/screenshots/prompt-trace.png" alt="unerr prompt trace" width="300" />
+  <br/><sub><strong>Prompt Trace</strong> · every prompt and the context unerr fed it — what was recalled, and what shaped the response.</sub>
+</p>
+<p align="center">
+  <img src="https://unerr.dev/open-cli/screenshots/reasoning-quality.png" alt="unerr reasoning quality" width="300" />
+  <br/><sub><strong>Reasoning Quality</strong> · which remembered facts actually shaped the next answer — scored, so memory earns its place in context.</sub>
+</p>
+<p align="center"><sub>More views in the <a href="https://www.unerr.dev/">full dashboard tour</a>.</sub></p>
 ---
@@ -118,7 +115,7 @@ Three steps. Step 1 is once per machine; steps 2–3 are per repo.
 npm install -g @unerr-ai/unerr
 ```
-Puts the `unerr` binary on your PATH. If the global `npm` directory isn't already in your shell's PATH (common with nvm, fnm, volta, pnpm), run `unerr doctor` once — it patches your shell config and won't need to run again.
+Puts the `unerr` binary on your PATH. If your shell can't find it (common with nvm, fnm, volta, pnpm), run `unerr doctor` once — it patches your shell config and won't need to run again.
 ### 2. Install for your agent (per repo)
@@ -127,265 +124,106 @@ cd ~/your-project
 unerr install cursor
 ```
-Writes the MCP config, skills, hooks, and instructions for that agent in the current repo. Swap `cursor` for any of the [supported agents](#supported-agents): `claude-code`, `windsurf`, `gemini-cli`, `antigravity`, `github-copilot-cli`.
-### 3. Restart your IDE
-Close and reopen your IDE (or start a new chat session). Your agent picks up unerr through MCP — graph-backed tools, persistent memory, shell compression all available immediately.
-> **Dashboard:** <http://localhost:9847> — open any time to watch token savings, reasoning quality, and the codebase map update as your agent works.
-### Supported agents
-```bash
-unerr install claude-code        # → .mcp.json + CLAUDE.md + .claude/skills/ + hooks
-unerr install cursor             # → .cursor/mcp.json + .cursor/rules/ + hooks
-unerr install antigravity        # → .antigravity/mcp_config.json + .agents/rules/ + .agents/skills/
-unerr install windsurf           # → ~/.codeium/windsurf/mcp_config.json + .windsurf/rules/ + .windsurf/skills/
-unerr install gemini-cli         # → .gemini/settings.json + GEMINI.md + .gemini/skills/
-unerr install github-copilot-cli # → .copilot/mcp-config.json + .github/copilot-instructions.md + .github/skills/
-```
-Install multiple agents in the same repo — each writes its own config:
+Writes the MCP config, skills, hooks, and instructions for that agent in the current repo. Swap `cursor` for any of the supported agents:
 ```bash
-unerr install cursor
 unerr install claude-code
+unerr install cursor
+unerr install antigravity
+unerr install windsurf
+unerr install gemini-cli
+unerr install github-copilot-cli
 ```
-> Need manual setup? `unerr install --show-instructions <agent>` prints copy-pasteable steps.
-<details>
-<summary>Manual MCP config (any MCP-compatible client)</summary>
-```json
-{
-  "mcpServers": {
-    "unerr": {
-      "command": "npx",
-      "args": ["@unerr-ai/unerr", "--mcp"]
-    }
-  }
-}
-```
-</details>
+Install multiple agents in the same repo — each writes its own config. Idempotent: re-running updates if content changed, skips if identical. Remove with `unerr uninstall`.
-<details>
-<summary>What <code>unerr install</code> writes</summary>
+### 3. Restart your IDE
-| Item | File(s) |
-|------|------|
-| MCP config pointing to `unerr --mcp` | `.mcp.json`, `.cursor/mcp.json`, … |
-| Skills — 12 definitions teaching the agent when to use each tool | `.claude/skills/`, `.cursor/rules/`, … |
-| Instructions — tool-routing table injected into the agent's instruction file | `CLAUDE.md`, `.cursor/rules/unerr-instructions.mdc` |
-| Hooks — shell compression + tool-adoption nudging | `.claude/settings.json`, `.cursor/hooks.json` |
-| Gitignore — keeps `.unerr/` out of commits | `.gitignore` |
-| Force tools (Claude Code only) — denies built-in Read/Grep/Glob so the agent uses graph tools. Opt out with `--no-force-tools`. | `.claude/settings.json` |
+Close and reopen your IDE (or start a new chat session). Your agent picks up unerr through MCP — graph-backed tools, persistent memory, shell compression all available immediately.
-Idempotent — re-running updates if content changed, skips if identical. Remove with `unerr uninstall`.
+> **Dashboard:** <http://localhost:9847> — open any time to watch unerr's operational memory at work in real time.
-</details>
+> Need manual setup or any other MCP client? `unerr install --show-instructions <agent>` prints copy-pasteable steps.
 ---
-## What changes the moment you connect
-### First session — instant value
-- **Graph navigation in <5ms** — `get_entity`, `get_references`, `get_imports`, `search_code`. The agent stops reading 30 files to find one function.
-- **Blast radius before edits** — `get_references` returns every caller. No more confident wrong changes that ripple across services.
-- **Targeted file reads** — `file_read({entity: "fnName"})` returns just that function + relevant conventions/facts, not 2000 lines.
-- **Shell compression** — 11 strategies, 645+ command classifiers. Diffs, errors, logs, test runs, YAML — each compressed differently. **93% average compression** across real-world benchmarks (2 MB → 138 KB). Raw output is kept on disk; the agent can recover it on demand.
-- **Web fetches** — `fetch_url` strips page chrome via Defuddle/Readability, converts to markdown, splits into heading-bounded passages, optionally re-ranks with BM25 when a `prompt` is supplied, and caches by content hash. Replaces built-in WebFetch — **5–10× fewer tokens** per page.
-- **Convention awareness** — naming, structure, import patterns auto-detected and injected into the agent's context.
-- **Tool adoption nudging** — five reinforcement layers (exec nudges, hook interception, instruction injection, skill reminders, default-deny of built-ins on Claude Code) push the agent to use the graph instead of grep.
-### Session 2+ — it starts compounding
+## Who it's for
-- **Session persistence** — what the agent learned today is available tomorrow. No more starting from zero.
-- **Fact memory** — `record_fact` persists conventions, decisions, and anti-patterns; `recall_facts` retrieves them with decay-adjusted confidence. Facts also auto-detect from coding sessions.
-- **Episodic narratives** — when you reopen a file, the agent sees what was modified there, when, and why.
-- **Loop prevention** — a circuit breaker fires after repeated failed attempts on the same entity, surfacing the failure mode instead of letting the agent thrash.
-- **Memory-effectiveness scoring** — every fact and convention opens a 5-turn observation window and resolves to a verdict (reinforced / acted_on / caught / ignored / corrected). The Reasoning Quality pane shows the **load-bearing rate** — not just how much the agent remembered, but how much of it actually mattered.
+- **Vibe coders.** The thing that stops your app from breaking on turn 30 when the AI gets confused.
+- **Solo builders.** The continuous thread. Switch from Claude Code in the terminal to Cursor in the IDE — your project memory comes with you.
+- **Senior / staff engineers.** The dependency graph, prior incidents, and team conventions a human engineer would already carry in their head — fed to AI on every edit.
-### Background behaviors
-While unerr is running these activate automatically — no extra commands:
-- **Architecture guard** — flags structural violations before they ship.
-- **Cascade guard** — warns when an edit has wide blast radius.
-- **Convention drift** — detects when new code diverges from established patterns.
-- **Auto-doc** — generates docs for undocumented entities.
-- **Change narrative** — tracks the story behind multi-step refactors.
-- **Loop breaker** — intervenes when the agent is stuck retrying.
-- **Session continuity** — preserves state across restarts.
+---
-### Shell compression benchmarks
+## Why one runtime, not five separate tools
-| Strategy | What it compresses | Avg compression |
-|----------|-------------------|:-:|
-| **diff** | `git diff`, patch output | **99%** |
-| **structured** | JSON APIs, `docker inspect` | **97%** |
-| **progress** | `npm install`, `pip install` | **95%** |
-| **log_text** | Build logs, server logs, `make`, `cargo build` | **89%** |
-| **test_results** | `vitest`, `pytest`, `cargo test`, `playwright` | **80%** |
-| **tabular** | `ps aux`, `docker ps`, `kubectl get` | **77%** |
-| **error_diagnostic** | `tsc`, `eslint`, `rustc`, `shellcheck` | **72%** |
-| **key_value** | `env`, `kubectl describe`, `systemctl status` | **48%** |
-| **tree_paths** | `find`, `tree`, `ls -R` | **42%** |
-| **yaml** | YAML configs, `kubectl get -o yaml`, Helm output | adaptive |
-| **omni** | Fallback for unrecognized output | adaptive |
+**unerr is the layer your agents share — sitting *behind* every MCP they already speak.** Every coding agent on your machine — Claude Code, Cursor, Windsurf, Antigravity — speaks MCP. MCP carries tool calls; it does not carry context. Without unerr, every agent rebuilds your codebase's dependency graph, conventions, and prior decisions from scratch — every session, by reading files blindly. With unerr, all of them read the same per-repo runtime over MCP, so your project's graph, memory, and guardrails carry across sessions *and* across IDEs.
-**Overall: 93% compression** (2 MB → 138 KB across 40 real-world test cases).
+The adjacent space already has strong point tools. unerr's job is not to out-feature any of them in their lane — it's to be the single per-repo runtime that joins them.
-### Language support
+| Layer | Where point tools live | What unerr adds |
+|---|---|---|
+| Memory across sessions | claude-mem, Mem0, Zep, Letta | Memory tied to the *current* state of the code — facts get drift signals when the file they're about moves. |
+| Code-graph navigation | Graphify, CodeGraphContext, Serena | The graph is read *before every file read* — surgical context instead of 3,000-line dumps. |
+| Output compression | RTK, Repomix | Compression is fed through the same MCP runtime as the graph and memory, not a separate tool the agent has to remember to invoke. |
+| Convention enforcement | `.cursorrules`, CLAUDE.md hand-maintained | Conventions auto-detected from ≥70% adherence in the code. No file to maintain. |
-| Language | Tier | Entities | Edges | Tree-sitter | SCIP |
-|----------|:---:|:-:|:-:|:-:|:-:|
-| TypeScript / JavaScript / Python / Go / Java / Kotlin / Scala / Rust / Ruby / C / C++ / C# | 1 | ✓ | ✓ | ✓ | ✓ |
-| PHP / Swift / Lua / Dart / Elixir / Zig | 2 | ✓ | ✓ | ✓ | — |
+We deliberately don't ship a feature-by-feature checkmark matrix against the depth leaders on each lane — that's the trap. Mem0 will out-memory us on memory depth; Graphify will out-graph us on graph aesthetics; RTK will out-compress us on shell compression simplicity. The runtime is the join across all four lanes — not the depth on any one.
-**Tier 1** (12 languages): full tree-sitter AST + dedicated extraction + SCIP compiler-verified call graphs where the toolchain is on PATH.
-**Tier 2** (6 languages): tree-sitter AST + generic extraction. Regex fallback for the rest.
+Three numbers behind the runtime:
-**Tier 3 (search-discoverable):** Markdown, IaC (`.tf`, `.yaml`, `.toml`), schemas (`.proto`, `.graphql`, `.prisma`), SQL, shell, templates, build files, CI configs — indexed for `search_code` only, no entity extraction.
+- **~84%** of an AI coding agent's tokens are tool output, mostly file reads ([JetBrains, NeurIPS 2025](https://blog.jetbrains.com/research/2025/12/efficient-context-management/)) — unerr intercepts at the read layer, so attention isn't diluted.
+- **Tool-selection accuracy collapses 58% → 26% as MCP tools go from 9 to 51** ([LangChain ReAct benchmark](https://blog.langchain.com/react-agent-benchmarking/)) — unerr is one MCP runtime instead of five, freeing the agent's tool-selection budget. Anthropic itself acknowledged this in Jan 2026 by shipping [MCP Tool Search](https://www.anthropic.com/engineering/code-execution-with-mcp) to hide tool definitions until queried.
+- **0** LLM calls per query in the core — facts, conventions, drift signals, and graph lookups are all algorithmic. No API keys, no per-turn inference cost, no telemetry.
+- **86–90%** of an agent's code-navigation tokens removed in head-to-head benchmarks vs grep+read — real tokenizer, fidelity-gated, reproducible on any repo ([benchmarks](./benchmarks/README.md)).
 ---
-## How unerr compares
+## How the runtime works
-Adjacent tools each own one layer — graph navigation, persistent memory, or output compression. unerr integrates all three, plus the drift prevention that keeps the graph tools in active rotation. Peer strengths are real; the table credits them where they win.
+One local process per repo. Four slices, joined deterministically — *the joins are the product, not the slices.* Point tools own one slice each. None of them can ship the joins without becoming a per-repo runtime themselves.
-| Capability | unerr | Graphify (~47K) | Serena (~23K) | claude-mem (~75K) | RTK (~40K) |
-|---|:---:|:---:|:---:|:---:|:---:|
-| **Code intelligence** | | | | | |
-| Pre-hoc file-read intercept — resolves entity via graph, returns ~200 lines + conventions + blast radius instead of a 3,000-line file | ✓ | ✗ | Partial | ✗ | ✗ |
-| Convention auto-detection — naming, structure, import direction from ≥70% adherence; no manual rules file | ✓ | ✗ | ✗ | ✗ | ✗ |
-| Drift / staleness signals — `ur\|dft` fires when code moves under stored memory | ✓ | ✗ | ✗ | ✗ | ✗ |
-| **Memory & continuity** | | | | | |
-| Persistent across sessions — typed facts with per-type decay, contradiction handling | ✓ | ✗ | ✗ | ✓ | ✗ |
-| Per-repo isolation — all state in `.unerr/` inside the repo, no cross-project leakage | ✓ | ✓ | ✓ | ✗ | — |
-| **Runtime** | | | | | |
-| Zero LLM in core — no API keys, no per-turn inference cost | ✓ | ✓ | ✓ | ✗ | ✓ |
-| Keeps MCP tools in active rotation — without enforcement, agents revert to built-in Read/Grep/Glob within 3–5 turns | ✓ | ✗ | ✗ | ✗ | ✗ |
-Three numbers behind the table:
+| Slice | What's inside | What the join enables |
+|---|---|---|
+| **Live code graph** | CozoDB · tree-sitter ASTs · SCIP-verified call graphs · 18+ languages · <5ms queries | Read *before every file read*. The agent opens 50 targeted lines and a caller list — not 3,000 lines and a guess. |
+| **Anchored memory** | Typed facts · conventions auto-detected at ≥70% adherence · decay-adjusted confidence | Every fact is pinned to a file or entity in the graph. When the code moves, the fact gets a **drift signal** — never silent staleness. |
+| **Context delivery** | Shell output compression (93% overall, 645+ command classifiers) · Web fetches (5–10× via Defuddle + BM25) · Entity-targeted file reads | Compression, graph, and memory share one process — the agent doesn't have to remember which tool to invoke for which kind of content. |
+| **Behaviour modules** | cascade guard · convention drift · loop breaker · session continuity · auto-doc · change narrative · architecture guard | Each guardrail fires on a *join* — cascade-guard reads the graph before the edit, convention-drift compares new code against memory, loop-breaker watches the timeline. None of these are reachable from a single point tool. |
-- **~84%** of an AI coding agent's tokens are tool output, mostly file reads (JetBrains, NeurIPS 2025 DL4Code Workshop) — unerr intercepts before the read.
-- **0** LLM calls per query in the Free tier — facts, conventions, and drift signals are algorithmic.
-- **3–5** turns is how long agents take to revert to built-in Read/Grep/Glob without drift prevention.
+**The unifying point.** Drift detection requires memory anchored to a live graph. Cascade-guard requires the graph and the edit-intent ledger on the same process. Convention-drift requires the auto-detected pattern store and the new-code stream in the same memory space. These aren't "features" you can buy individually — they're *emergent properties of the runtime*, only available when all four slices live in one per-repo process.
-Honest acknowledgements: unerr is the new entrant with fewer stars than every peer; the install is heavier than `brew install` (Node + index step); TypeScript is deepest, other languages run on tree-sitter; no semantic vector retrieval and no narrative session resume in the Free tier.
+Five disconnected MCP servers — one for memory, one for graph, one for compression, one for tracing, one for skills — burn ~55K tokens of schemas just to *announce themselves* (Anthropic's own engineering example). They can't reach across each other to fire any of these guardrails. That's the difference between a stack and a runtime.
 ---
-## How it works
+<details>
+<summary><strong>Under the hood — architecture, CLI commands, MCP tools, dev setup</strong></summary>
+### Architecture
 ```
 AI Agent (Claude Code / Cursor / Windsurf / any MCP client)
     │
     ├── stdio MCP ──→ unerr --mcp (bridge, per IDE session)
     │                       │
-    │                       └── UDS ──→ unerrd (lightweight Node process,
-    │                                           one per machine, auto-spawned)
+    │                       └── UDS ──→ unerrd (one lightweight Node process
+    │                                           per machine, auto-spawned,
+    │                                           exits after 30 min idle)
     │                                       │
     │                                       └── per-repo unerr process(es)
-    │                                              │
     │                                              ├── CozoDB graph     (in-process, <5ms)
     │                                              ├── Fact store       (cross-session memory)
     │                                              ├── Timeline + ledger (every tool call)
     │                                              ├── File watcher     (incremental reindex)
     │                                              ├── Convention engine
-    │                                              ├── Compression engine (11 strategies, 645+ classifiers)
-    │                                              └── Behavior modules (cascade-guard, loop-breaker, auto-doc…)
+    │                                              ├── Compression engine
+    │                                              └── Behavior modules
     │
     └── Dashboard ──→ http://localhost:9847 (SSE-streamed live)
 ```
 One local DB per repo. Zero network calls. No API keys. No cloud. Your code never leaves the machine.
----
-## MCP Tools (20)
-### Graph Intelligence (8)
-| Tool | What the agent gets |
-|------|-----|
-| `get_entity` | Any code entity — signature, body, callers, callees, risk |
-| `get_file` | All entities in a file with risk summary |
-| `get_references` | Callers (blast radius) or callees (dependencies) |
-| `get_imports` | Import graph for a file |
-| `search_code` | Graph-ranked full-text search across all entities |
-| `get_conventions` | Detected naming/structure/import patterns + adherence rates |
-| `get_critical_nodes` | High fan-in/fan-out chokepoints |
-| `get_cross_boundary_links` | Unexpected cross-module dependencies, scored by surprise |
-### Structural Analysis (3)
-| Tool | What the agent gets |
-|------|-----|
-| `get_project_stats` | Entity counts, risk distribution, health grade |
-| `file_connections` | Imports + co-change correlations for a file |
-| `get_test_coverage` | Direct + transitive tests for any entity |
-### File Protocol (2)
-| Tool | What the agent gets |
-|------|-----|
-| `file_read` | Context-aware read — auto-injects conventions and facts |
-| `file_outline` | File structure (entities, exports) without reading the body |
-### Persistent Memory (2)
-| Tool | What the agent gets |
-|------|-----|
-| `record_fact` | Persist a convention, decision, or anti-pattern |
-| `recall_facts` | Retrieve facts with hierarchical scope + decay-adjusted confidence |
-### Session Narrative — Markers (4)
-Inline markers the agent emits as it works. Persisted to the shadow ledger and `.unerr/timeline.db` — powers turn titles, cross-session intent stitching, the resume strip, and loop/blocker miners.
-| Tool | What it does |
-|------|-----|
-| `mark_intent` | One-sentence task start (≤80 chars). Becomes the turn title |
-| `mark_decision` | Records a chosen approach + up to 5 alternatives (≤140 chars) |
-| `mark_blocker` | Flags an unresolved obstacle. Carries into the next session's resume strip |
-| `mark_resolution` | Resolves a prior blocker by `marker_id` |
-### Web Fetch (1)
-| Tool | What the agent gets |
-|------|-----|
-| `fetch_url` | DOM-extracted markdown of a web page (Defuddle/Readability), split into heading-bounded passages, optionally re-ranked by BM25 against a `prompt`, cached by content hash. Replaces built-in WebFetch — 5–10× fewer tokens. Optional Playwright SPA fallback. |
-Every response includes `_meta` (latency, risk level, drift status) and inline `ur|<tag>` signals for high-priority guidance (drift, blast-radius warnings, circuit-breaker halts).
----
-<details>
-<summary><strong>CLI commands</strong></summary>
-```bash
-unerr install <agent> # MCP config + skills + hooks + instructions for one agent
-unerr uninstall       # Remove unerr integration from this repo
-unerr doctor          # Check PATH + environment, auto-fix if unerr isn't on all shells
-unerr status          # Proxy health, entity count, graph age
-unerr stats           # Session statistics (tokens, tool calls, compression)
-unerr --mcp           # Stdio bridge — what your IDE invokes via .mcp.json
-unerr                 # Start a standalone per-repo proxy (rare — IDE invocation covers this)
-```
-`unerr pm …` manages the cross-repo `unerrd` process — see the [reference](#process-manager-command-reference) below.
-</details>
-<details>
-<summary><strong>Architecture</strong></summary>
 ```
 src/
   entrypoints/   CLI entry + boot state machine
@@ -399,79 +237,64 @@ src/
   hooks/         Claude Code hook system integration
   skills/        12 bundled skill definitions
   server/ + ui/  HTTP API + React (Vite) dashboard
-  config/        Agent registry, MCP config writer, instruction injector
-  schemas/       Zod schemas
 ```
-**Design principles**
-- Zero network calls — fully local, no API keys.
-- stdout is sacred — MCP JSON-RPC only; everything else to stderr.
-- <5 ms query responses — CozoDB runs in-process (Rust via NAPI).
-- First useful output <5 s — shallow index first, deep enrichment in background.
-- Graceful degradation — the agent still works if unerr is down, you just lose the intelligence layer.
+**Design principles** — zero network calls; stdout is sacred (MCP JSON-RPC only, everything else to stderr); <5 ms query responses; first useful output <5 s (shallow index first, deep enrichment in background); graceful degradation (the agent still works if unerr is down, you just lose the operational memory).
 **Tech stack** TypeScript (ESM) · CozoDB (Rust/NAPI) · web-tree-sitter (WASM) · MCP SDK · Ink (React CLI) · React + Vite (dashboard) · tsup · Vitest
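The "stdout is sacred" principle exists because the stdio MCP transport carries newline-delimited JSON-RPC messages on stdout, so any stray log line there corrupts the stream. A minimal sketch of keeping the two channels apart, assuming hypothetical helper names (`formatLog`, `encodeFrame`, `log`, `sendRpc`) that are not part of unerr's code:

```typescript
// Hypothetical helpers illustrating the stdout/stderr split; not unerr's API.
type Level = "info" | "warn" | "error";

/** Build a human-readable diagnostic line. */
function formatLog(level: Level, message: string): string {
  return `[unerr:${level}] ${message}\n`;
}

/** Serialize one JSON-RPC 2.0 message as a single newline-terminated line. */
function encodeFrame(payload: Record<string, unknown>): string {
  return JSON.stringify({ jsonrpc: "2.0", ...payload }) + "\n";
}

/** Diagnostics always go to stderr. */
function log(level: Level, message: string): void {
  process.stderr.write(formatLog(level, message));
}

/** Protocol frames are the only thing written to stdout. */
function sendRpc(payload: Record<string, unknown>): void {
  process.stdout.write(encodeFrame(payload));
}
```

With this split, `log("info", "index ready")` is visible in a terminal or log file while the MCP client reading stdout only ever sees well-formed JSON-RPC lines.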
-</details>
-<details>
-<summary><strong>Development</strong></summary>
+### CLI commands
 ```bash
-pnpm install
-pnpm run build          # tsup → dist/ (ESM, node20)
-pnpm run dev            # tsx watch
-pnpm run test:run       # full suite
-pnpm run lint           # biome check
-pnpm run typecheck      # tsc --noEmit
-pnpm link --global      # make local `unerr` available globally
+unerr install <agent>   # MCP config + skills + hooks + instructions for one agent
+unerr uninstall         # Remove unerr integration from this repo
+unerr doctor            # Check PATH + environment, auto-fix if unerr isn't on all shells
+unerr status            # Proxy health, entity count, graph age
+unerr stats             # Session statistics (tokens, tool calls, compression)
+unerr --mcp             # Stdio bridge — what your IDE invokes via .mcp.json
+unerr pm status         # Process manager: PID, uptime, repos, memory, idle countdown
+unerr pm logs           # Tail ~/.unerr/logs/unerrd.log
+unerr pm dashboard      # Open http://localhost:9847
 ```
-</details>
-<details>
-<summary><strong>Contributing</strong></summary>
-Contributions welcome — please open an issue first.
-**Before submitting a PR:**
-- `pnpm run typecheck && pnpm run lint && pnpm run test:run`
-- All output to stderr — never stdout (MCP JSON-RPC channel)
-- All CozoDB interactions are async — always `await`
-- Use `.js` extensions in imports (NodeNext resolution)
+`unerrd` is a lightweight Node process that supervises every registered repo. Your IDE invocation auto-spawns it; it exits cleanly after 30 minutes of no MCP activity. `unerr pm --help` lists the rest.
-See [CLAUDE.md](./CLAUDE.md) for full conventions.
+### MCP tools (21)
-</details>
+Grouped by what the agent gets, not by file:
----
+- **Graph intelligence (8)** — `get_entity`, `get_file`, `get_references`, `get_imports`, `search_code`, `get_conventions`, `get_critical_nodes`, `get_cross_boundary_links`.
+- **Structural analysis (3)** — `get_project_stats`, `file_connections`, `get_test_coverage`.
+- **File protocol (2)** — `file_read` (context-aware, auto-injects conventions and facts), `file_outline` (structure without body).
+- **Persistent memory (3)** — `unerr_remember` (user-stated facts with verbatim quote + confidence), `record_fact` (agent-detected conventions / decisions / anti-patterns), `recall_facts` (hierarchical scope + decay-adjusted confidence).
+- **Session markers (4)** — `mark_intent`, `mark_decision`, `mark_blocker`, `mark_resolution`. Inline as the agent works; powers turn titles and the cross-session resume strip.
+- **Web fetch (1)** — `fetch_url` (DOM-extracted markdown, BM25 re-ranking, content-hash cache). Replaces built-in WebFetch.
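The `recall_facts` entry in the list above mentions decay-adjusted confidence, but the README does not say how the decay is computed. One common approach is exponential half-life decay, sketched below. The function name `decayedConfidence` and the 30-day half-life are assumptions made for illustration only.

```typescript
// Hypothetical sketch: exponential half-life decay of a fact's confidence.
// unerr's real formula is not documented in this README.
function decayedConfidence(
  initial: number,      // confidence when the fact was recorded, in [0, 1]
  ageDays: number,      // time since the fact was recorded or last confirmed
  halfLifeDays = 30,    // assumed half-life; confidence halves every 30 days
): number {
  if (ageDays <= 0) return initial; // fresh facts keep their full confidence
  return initial * Math.pow(0.5, ageDays / halfLifeDays);
}
```

Under these assumptions a fact recorded at confidence 0.8 would be recalled at 0.4 after one half-life, so older, unconfirmed facts gradually lose weight against newer ones.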
-## Process Manager Command Reference
+Every response carries inline `ur|<tag>` signals for high-priority guidance — drift, blast-radius warnings, circuit-breaker halts — so the agent acts on what it just learned without burning a turn.
-`unerrd` is a lightweight Node process that supervises every registered repo. Your IDE invocation auto-spawns it; it exits cleanly after 30 minutes of no MCP activity. You rarely run these commands directly, but they're here when you want a look under the hood.
+### Manual MCP config (any MCP-compatible client)
-```bash
-unerr pm status                       # PID, uptime, repos, memory, idle countdown
-unerr pm start                        # Start manually (auto-spawn usually covers this)
-unerr pm stop                         # Graceful shutdown — stops children, flushes state
+```json
+{
+  "mcpServers": {
+    "unerr": {
+      "command": "npx",
+      "args": ["@unerr-ai/unerr", "--mcp"]
+    }
+  }
+}
+```
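If the client already has an MCP config with other servers, the `unerr` entry has to be merged in rather than pasted over the file. A minimal sketch of that merge, assuming the `mcpServers` shape shown in the JSON above; the type names and `withUnerr` are hypothetical, and where the config file lives depends on the client.

```typescript
// Hypothetical helper: adds the unerr entry from the README to an existing config.
type McpServer = { command: string; args: string[] };
type McpConfig = { mcpServers?: Record<string, McpServer> };

// Same command and args as the manual config above.
const UNERR_ENTRY: McpServer = {
  command: "npx",
  args: ["@unerr-ai/unerr", "--mcp"],
};

/** Return a new config with `unerr` added, leaving other servers untouched. */
function withUnerr(existing: McpConfig): McpConfig {
  return {
    ...existing,
    mcpServers: { ...(existing.mcpServers ?? {}), unerr: UNERR_ENTRY },
  };
}

// Example: an existing config with one (made-up) server keeps that server.
const merged = withUnerr({
  mcpServers: { other: { command: "node", args: ["other-server.js"] } },
});
console.log(JSON.stringify(merged, null, 2));
```

The function returns a copy instead of mutating its input, so the original config can be kept as a backup before the merged result is written to disk.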
-unerr pm add <path>                   # Register a repo (auto-registered on first MCP call)
-unerr pm remove <path>                # Unregister a repo
-unerr pm config <path> <key>=<value>  # Per-repo settings (idleTimeout, javaBuildTool, …)
+### Benchmarks
-unerr pm logs                         # Tail ~/.unerr/logs/unerrd.log
-unerr pm logs --repo <path>           # Tail a specific repo's log
-unerr pm logs --bridge --follow       # Stream bridge session logs continuously
-unerr pm logs --boot                  # Most recent spawn sequence only
+unerr removes **86–90% of the tokens** an agent would otherwise spend navigating and reading code — measured, not estimated, with head-to-head runs against other code-intelligence tools on the same questions, same tokenizer, and a fidelity gate that discards any "saving" that lost the answer. Methodology, reproduction commands, and per-repo results: [benchmarks/README.md](./benchmarks/README.md).
-unerr pm dashboard                    # Open http://localhost:9847 in your browser
-```
+### Contributing
-**Dashboard** shows the global overview (registered repos, health, active sessions), a repo switcher into each repo's full intelligence dashboard, and process-manager info (uptime, memory, idle countdown).
+See [CONTRIBUTING.md](./CONTRIBUTING.md) for setup, day-to-day commands, code conventions, and pre-PR checklist.
-**Updates** — `npm i -g @unerr-ai/unerr` and restart the IDE. The next bridge invocation re-spawns the manager on the new version.
+</details>
 ---