@jonathangu/openclawbrain 0.3.0 → 0.3.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (56)
  1. package/README.md +140 -290
  2. package/docs/END_STATE.md +106 -94
  3. package/docs/EVIDENCE.md +71 -23
  4. package/docs/RELEASE_CONTRACT.md +46 -32
  5. package/docs/agent-tools.md +65 -34
  6. package/docs/architecture.md +128 -142
  7. package/docs/configuration.md +62 -25
  8. package/docs/evidence/2026-03-16/1fc8ee6fd7892e3deb27d111434df948bca2a66b/channels-status.txt +20 -0
  9. package/docs/evidence/2026-03-16/1fc8ee6fd7892e3deb27d111434df948bca2a66b/config-snapshot.json +94 -0
  10. package/docs/evidence/2026-03-16/1fc8ee6fd7892e3deb27d111434df948bca2a66b/doctor.json +14 -0
  11. package/docs/evidence/2026-03-16/1fc8ee6fd7892e3deb27d111434df948bca2a66b/gateway-probe.txt +24 -0
  12. package/docs/evidence/2026-03-16/1fc8ee6fd7892e3deb27d111434df948bca2a66b/gateway-status.txt +31 -0
  13. package/docs/evidence/2026-03-16/1fc8ee6fd7892e3deb27d111434df948bca2a66b/init-capture.json +15 -0
  14. package/docs/evidence/2026-03-16/1fc8ee6fd7892e3deb27d111434df948bca2a66b/logs.txt +357 -0
  15. package/docs/evidence/2026-03-16/1fc8ee6fd7892e3deb27d111434df948bca2a66b/status-all.txt +61 -0
  16. package/docs/evidence/2026-03-16/1fc8ee6fd7892e3deb27d111434df948bca2a66b/status.json +275 -0
  17. package/docs/evidence/2026-03-16/1fc8ee6fd7892e3deb27d111434df948bca2a66b/summary.md +18 -0
  18. package/docs/evidence/2026-03-16/1fc8ee6fd7892e3deb27d111434df948bca2a66b/trace.json +222 -0
  19. package/docs/evidence/2026-03-16/1fc8ee6fd7892e3deb27d111434df948bca2a66b/validation-report.json +1515 -0
  20. package/docs/evidence/2026-03-16/1fc8ee6fd7892e3deb27d111434df948bca2a66b/workspace-inventory.json +4 -0
  21. package/docs/evidence/2026-03-16/4ccd71a22418b9170128b8d948f5a95801a10380/channels-status.txt +20 -0
  22. package/docs/evidence/2026-03-16/4ccd71a22418b9170128b8d948f5a95801a10380/config-snapshot.json +94 -0
  23. package/docs/evidence/2026-03-16/4ccd71a22418b9170128b8d948f5a95801a10380/doctor.json +14 -0
  24. package/docs/evidence/2026-03-16/4ccd71a22418b9170128b8d948f5a95801a10380/gateway-probe.txt +24 -0
  25. package/docs/evidence/2026-03-16/4ccd71a22418b9170128b8d948f5a95801a10380/gateway-status.txt +31 -0
  26. package/docs/evidence/2026-03-16/4ccd71a22418b9170128b8d948f5a95801a10380/init-capture.json +15 -0
  27. package/docs/evidence/2026-03-16/4ccd71a22418b9170128b8d948f5a95801a10380/logs.txt +362 -0
  28. package/docs/evidence/2026-03-16/4ccd71a22418b9170128b8d948f5a95801a10380/status-all.txt +61 -0
  29. package/docs/evidence/2026-03-16/4ccd71a22418b9170128b8d948f5a95801a10380/status.json +275 -0
  30. package/docs/evidence/2026-03-16/4ccd71a22418b9170128b8d948f5a95801a10380/summary.md +21 -0
  31. package/docs/evidence/2026-03-16/4ccd71a22418b9170128b8d948f5a95801a10380/trace.json +222 -0
  32. package/docs/evidence/2026-03-16/4ccd71a22418b9170128b8d948f5a95801a10380/validation-report.json +4400 -0
  33. package/docs/evidence/2026-03-16/4ccd71a22418b9170128b8d948f5a95801a10380/workspace-inventory.json +4 -0
  34. package/docs/evidence/2026-03-16/d93f09feea123a08d020fcad8a4523b6c1d26507/channels-status.txt +31 -0
  35. package/docs/evidence/2026-03-16/d93f09feea123a08d020fcad8a4523b6c1d26507/config-snapshot.json +94 -0
  36. package/docs/evidence/2026-03-16/d93f09feea123a08d020fcad8a4523b6c1d26507/doctor.json +14 -0
  37. package/docs/evidence/2026-03-16/d93f09feea123a08d020fcad8a4523b6c1d26507/gateway-probe.txt +34 -0
  38. package/docs/evidence/2026-03-16/d93f09feea123a08d020fcad8a4523b6c1d26507/gateway-status.txt +41 -0
  39. package/docs/evidence/2026-03-16/d93f09feea123a08d020fcad8a4523b6c1d26507/logs.txt +441 -0
  40. package/docs/evidence/2026-03-16/d93f09feea123a08d020fcad8a4523b6c1d26507/status-all.txt +60 -0
  41. package/docs/evidence/2026-03-16/d93f09feea123a08d020fcad8a4523b6c1d26507/status.json +276 -0
  42. package/docs/evidence/2026-03-16/d93f09feea123a08d020fcad8a4523b6c1d26507/summary.md +13 -0
  43. package/docs/evidence/2026-03-16/d93f09feea123a08d020fcad8a4523b6c1d26507/trace.json +4 -0
  44. package/docs/evidence/2026-03-16/d93f09feea123a08d020fcad8a4523b6c1d26507/validation-report.json +387 -0
  45. package/docs/tui.md +11 -4
  46. package/index.ts +194 -1
  47. package/package.json +1 -1
  48. package/src/brain-cli.ts +12 -1
  49. package/src/brain-harvest/scanner.ts +286 -16
  50. package/src/brain-harvest/self.ts +134 -6
  51. package/src/brain-runtime/evidence-detectors.ts +3 -1
  52. package/src/brain-runtime/harvester-extension.ts +3 -0
  53. package/src/brain-runtime/service.ts +2 -0
  54. package/src/brain-store/embedding.ts +29 -8
  55. package/src/brain-worker/worker.ts +40 -0
  56. package/src/engine.ts +1 -0
package/README.md CHANGED
@@ -1,77 +1,90 @@
  # OpenClawBrain v2
 
- OpenClawBrain v2 is a clean rebuild on top of [lossless-claw](https://github.com/Martian-Engineering/lossless-claw). The goal is a production-ready OpenClaw plugin that keeps lossless transcript memory while adding a correctly wired learning layer for retrieval and correction routing.
+ OpenClawBrain is an OpenClaw plugin that keeps the inherited lossless transcript-memory substrate and adds a learned routing layer on top.
 
- This repo is the active v2 codebase.
+ Front doors:
+ - Project site: https://openclawbrain.ai
+ - GitHub repo: https://github.com/jonathangu/openclawbrain
+ - Jonathan Gu's 2016 reinforcement-learning paper: https://openclawbrain.ai/jonathan-gu-2016-reinforcement-learning-paper.pdf
 
- The earlier spike is archived at [jonathangu/openclawbrain-v1-spike-archive](https://github.com/jonathangu/openclawbrain-v1-spike-archive).
+ This repo is the active v2 trunk. The earlier spike lives at [jonathangu/openclawbrain-v1-spike-archive](https://github.com/jonathangu/openclawbrain-v1-spike-archive).
 
  ## Release truth in 30 seconds
 
  | Public label | Status | What it means right now |
  | --- | --- | --- |
- | **paper-faithful core** | true now | finite-horizon traversal, terminal reward, stochastic policy, full-trajectory REINFORCE updates, learned seed routing, and immutable promoted packs are all implemented in the current repo |
- | **live-path implemented** | true now | the OpenClaw runtime already has recurrence gating, explicit skip reasons, shadow mode, correction-first context injection, immediate `brain_teach` retrieval, and replay-gated promotion wired into the live path |
- | **operationally validated** | not yet | child-worker mode is real, but frozen host-surface teach/worker-down proof, bundle-level mutation validation, dated evidence artifacts, and full-repo `npx tsc --noEmit` are still unfinished |
+ | **paper-faithful core** | yes | finite-horizon traversal, stochastic policy, terminal reward, full-trajectory REINFORCE updates, learned seed routing, and immutable promoted packs are implemented in the current repo |
+ | **live-path implemented** | yes | OpenClaw runtime decisioning, shadow mode, correction-first assembly, immediate `brain_teach` retrieval, and replay-gated promotion are wired into the live path |
+ | **operationally validated** | not yet | deterministic runtime proof is real, but the full sterile host-surface harness is still not frozen end to end; bundle-level mutation evaluation, CI-enforced proof gates, and packaging/type hardening remain open |
 
  If you want the exact contract rather than the pitch, read [docs/RELEASE_CONTRACT.md](docs/RELEASE_CONTRACT.md).
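The table's "paper-faithful core" row names full-trajectory REINFORCE updates with a terminal reward. As an editorial illustration of that update family only (hypothetical names and shapes, not OpenClawBrain's actual code), the idea is that every step of one trajectory shares the single terminal reward:

```typescript
// Hypothetical sketch of a full-trajectory REINFORCE update: one terminal
// reward R is shared by every step, and each weight moves along the
// accumulated log-policy gradient. Not the plugin's real types.
type Step = { logitGrad: Map<string, number> }; // per-weight d log pi(a|s) / d w

function reinforceUpdate(
  weights: Map<string, number>,
  trajectory: Step[],
  terminalReward: number,
  baseline = 0,
  lr = 0.1,
): Map<string, number> {
  const advantage = terminalReward - baseline;
  const updated = new Map(weights);
  for (const step of trajectory) {
    for (const [key, grad] of step.logitGrad) {
      // w <- w + lr * (R - b) * grad(log pi), accumulated over the trajectory
      updated.set(key, (updated.get(key) ?? 0) + lr * advantage * grad);
    }
  }
  return updated;
}
```

The baseline term is the standard variance-reduction knob; with baseline 0 this reduces to vanilla REINFORCE.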
 
- ## Table of contents
+ ## What OpenClawBrain does
 
- - [What it does](#what-it-does)
- - [Quick start](#quick-start)
- - [Configuration](#configuration)
- - [Operator Commands](#operator-commands)
- - [Fallback Behavior](#fallback-behavior)
- - [Operational gaps still open](#operational-gaps-still-open)
- - [Finish path to 1.0](#finish-path-to-10)
- - [Documentation](#documentation)
- - [Development](#development)
- - [License](#license)
+ OpenClawBrain has two layers:
 
- ## What it does today
+ 1. **LCM / transcript memory**
+ - persists conversation history in SQLite
+ - compacts older turns into a summary DAG instead of dropping them
+ - assembles summaries plus fresh raw turns back into model context
+ - exposes recall tools like `lcm_grep`, `lcm_describe`, and `lcm_expand_query`
 
- When a conversation grows beyond the model's context window, OpenClaw (just like all of the other agents) normally truncates older messages. LCM instead:
+ 2. **Learned routing layer**
+ - decides whether to use learned retrieval, shadow the route, or skip with an explicit reason
+ - retrieves from immutable promoted packs only
+ - supports immediate `brain_teach` correction retrieval
+ - trains in the background from human/self/scanner/teacher evidence
+ - gates promotion with replay checks before serving new packs
 
- 1. **Persists every message** in a SQLite database, organized by conversation
- 2. **Summarizes chunks** of older messages into summaries using your configured LLM
- 3. **Condenses summaries** into higher-level nodes as they accumulate, forming a DAG (directed acyclic graph)
- 4. **Assembles context** each turn by combining summaries + recent raw messages
- 5. **Provides tools** (`lcm_grep`, `lcm_describe`, `lcm_expand`) so agents can search and recall details from compacted history
+ Nothing in the transcript-memory substrate is supposed to be thrown away casually. The point is to keep lossless recall while adding a learned context-routing layer that can improve over time.
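The routing layer's use / shadow / skip decision described above can be sketched as follows (editor's illustration; the type shapes and reason strings are hypothetical, not the plugin's real API):

```typescript
// Hypothetical sketch of the learned-routing decision: use the learned
// retrieval, shadow it (record the route, inject nothing), or skip with an
// explicit reason. Illustrative only.
type RouteDecision =
  | { mode: "use" }
  | { mode: "shadow" } // record the route and trace, inject nothing
  | { mode: "skip"; reason: string };

function decideRoute(opts: {
  initialized: boolean;
  embeddingsConfigured: boolean;
  shadowMode: boolean;
}): RouteDecision {
  if (!opts.initialized) return { mode: "skip", reason: "brain_not_initialized" };
  if (!opts.embeddingsConfigured) return { mode: "skip", reason: "embeddings_unconfigured" };
  if (opts.shadowMode) return { mode: "shadow" };
  return { mode: "use" };
}
```

The key property is that a bypass is never silent: every non-"use" outcome carries an explicit mode or reason.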
 
- Nothing is lost. Raw messages stay in the database. Summaries link back to their source messages. Agents can drill into any summary to recover the original detail.
+ ## Current reality
 
- Today this repo ships a working hybrid runtime:
+ ### True in the repo now
+ - paper-faithful traversal/update path exists
+ - child-worker mode is a real runtime boundary, with supervisor/protocol/restart truth
+ - shadow mode is a real runtime decision rather than a fake `use_brain` variant
+ - deterministic session-bound `brain_teach` proof exists
+ - deterministic runtime proof exists for immediate teach retrieval and serve-from-last-promoted-pack after worker failure
+ - structured raw evidence and worker-side trust-ordered resolution are real
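Replay-gated promotion, mentioned in the layer description above, can be sketched in miniature (editor's illustration; hypothetical types, not the real promotion code):

```typescript
// Illustrative replay gate: a candidate pack is promoted only if it does not
// regress against the currently promoted pack across replayed episodes.
// All names here are assumptions for the sketch.
type Episode = { id: string };
type Scorer = (ep: Episode) => number; // higher is better

function replayGate(episodes: Episode[], current: Scorer, candidate: Scorer): boolean {
  const total = (score: Scorer) => episodes.reduce((acc, ep) => acc + score(ep), 0);
  return total(candidate) >= total(current);
}
```

The open item the README flags is moving this kind of gate from proposal-by-proposal checks to bundle-level replay decisions.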
 
- 1. The LCM substrate still persists, compacts, and recalls transcript history.
- 2. The brain runtime explicitly decides whether to route through learned retrieval or bypass with a concrete skip reason.
- 3. Learned traversal now includes a seed-head policy over candidate seed regions, not just post-seed edge updates.
- 4. `brain_teach` embeds taught nodes immediately, connects them into the recent route, and promotes a new immutable pack.
- 5. The worker applies full-trajectory REINFORCE updates, decay, scanner/self/human/teacher labels, candidate-graph mutation replay, and replay-gated promotion.
+ ### Implemented but not frozen
+ - the real OpenClaw host-surface validation lane
+ - mutation evaluation at the intended bundle level
+ - CI-enforced proof gates
+ - clean npm/package boundary for outside operators
+
+ ### Honest current blocker
+ The current docs and runtime story should stay aligned with this exact state:
+ - sterile preflight/config seam has been repaired
+ - deterministic runtime proof is repaired and repeatably passing on fresh isolated roots
+ - the full sterile host harness is **still not frozen end to end** because it currently stalls during `openclawbrain init` before the host-turn proof bundle completes
+
+ That means the remaining pain is mainly host/operator/release-boundary work, not another learning-architecture rewrite.
 
  ## Quick start
 
  ### Prerequisites
-
- - OpenClaw with plugin context engine support
+ - OpenClaw
  - Node.js 22+
- - An LLM provider configured in OpenClaw (used for summarization)
+ - an LLM provider for transcript summarization
+ - an embeddings provider for `openclawbrain init`, learned retrieval, and `brain_teach`
 
- ### Install the plugin
+ ### Install
 
- Use OpenClaw's plugin installer once the package is published:
+ Published package:
 
  ```bash
  openclaw plugins install @jonathangu/openclawbrain
  ```
 
- If you're running from a local OpenClaw checkout, use:
+ From a local OpenClaw checkout:
 
  ```bash
  pnpm openclaw plugins install @jonathangu/openclawbrain
  ```
 
- For local plugin development, link your working copy instead of copying files:
+ For local development, link your working tree instead of copying files:
 
  ```bash
  openclaw plugins install --link /path/to/openclawbrain
@@ -79,33 +92,15 @@ openclaw plugins install --link /path/to/openclawbrain
  # pnpm openclaw plugins install --link /path/to/openclawbrain
  ```
 
- The install command records the plugin and enables it.
-
- ### Configure OpenClaw
-
- In most cases, no manual JSON edits are needed after `openclaw plugins install`.
-
- Important current truth: older OpenClaw builds exposed a `plugins.slots.contextEngine` seam, but the current host build used for Phase 1 validation no longer accepts that slot and no longer exposes `api.registerContextEngine`. Until OpenClawBrain adapts to the current plugin/memory seam, do **not** treat old `contextEngine` slot examples as valid host-surface setup guidance.
-
- Restart OpenClaw after configuration changes.
-
- ### Initialize the brain index
-
- The lossless transcript path works immediately. Learned retrieval needs an explicit init pass:
-
- ```bash
- openclawbrain init /path/to/your/workspace
- ```
-
- `openclawbrain init` scans the workspace, chunks source material, computes embeddings, builds the initial graph, writes `state.db`, creates pack `v000001`, and promotes it.
+ ### Important host-seam truth
 
- ## Configuration
+ On current OpenClaw hosts, **do not manually write** `plugins.slots.contextEngine` for OpenClawBrain.
 
- LCM is configured through a combination of plugin config and environment variables. Environment variables take precedence for backward compatibility.
+ That older seam is no longer the stable installation boundary. OpenClawBrain now includes a hook-based compatibility bridge for hosts where `api.registerContextEngine` is gone, and the plugin installer is the supported path.
 
- ### Plugin config
+ If you are debugging an older host build, treat any manual slot/config surgery as version-specific debugging rather than the normal install story.
 
- Add an `openclawbrain` entry under `plugins.entries` in your OpenClaw config:
+ ### Recommended starting config
 
  ```json
  {
@@ -118,8 +113,9 @@ Add an `openclawbrain` entry under `plugins.entries` in your OpenClaw config:
  "contextThreshold": 0.75,
  "incrementalMaxDepth": -1,
  "brainRoot": "~/.openclaw/openclawbrain",
- "brainEmbeddingProvider": "openai",
- "brainEmbeddingModel": "text-embedding-3-large"
+ "brainEmbeddingProvider": "ollama",
+ "brainEmbeddingModel": "bge-large:latest",
+ "brainWorkerMode": "child"
  }
  }
  }
@@ -127,14 +123,33 @@ Add an `openclawbrain` entry under `plugins.entries` in your OpenClaw config:
  }
  ```
 
- For local dogfood or other self-hosted installs, Ollama is now a first-class embedding option too:
+ Why these defaults:
+ - `freshTailCount=32` keeps recent turns raw
+ - `contextThreshold=0.75` leaves response headroom
+ - `incrementalMaxDepth=-1` lets compaction keep cascading when needed
+ - `brainWorkerMode=child` is the practical operator boundary
+
+ ### Initialize the graph
+
+ The transcript-memory layer works immediately after install. The learned layer needs an explicit init pass:
+
+ ```bash
+ openclawbrain init /path/to/workspace
+ ```
+
+ That creates the initial graph, writes `state.db`, creates pack `v000001`, and promotes it.
+
+ ## Embeddings
+
+ OpenClawBrain currently targets tested OpenAI-compatible `/v1/embeddings` APIs. That includes local Ollama endpoints and remote OpenAI-compatible services.
+
+ ### Local Ollama
 
  ```json
  {
  "plugins": {
  "entries": {
  "openclawbrain": {
- "enabled": true,
  "config": {
  "brainEmbeddingProvider": "ollama",
  "brainEmbeddingModel": "bge-large:latest"
@@ -145,267 +160,102 @@ For local dogfood or other self-hosted installs, Ollama is now a first-class emb
  }
  ```
 
- That defaults to Ollama's local OpenAI-compatible embeddings endpoint at `http://127.0.0.1:11434/v1`.
-
- ### Environment variables
-
- | Variable | Default | Description |
- |----------|---------|-------------|
- | `LCM_ENABLED` | `true` | Enable/disable the plugin |
- | `LCM_DATABASE_PATH` | `~/.openclaw/lcm.db` | Path to the SQLite database |
- | `LCM_CONTEXT_THRESHOLD` | `0.75` | Fraction of context window that triggers compaction (0.0–1.0) |
- | `LCM_FRESH_TAIL_COUNT` | `32` | Number of recent messages protected from compaction |
- | `LCM_LEAF_MIN_FANOUT` | `8` | Minimum raw messages per leaf summary |
- | `LCM_CONDENSED_MIN_FANOUT` | `4` | Minimum summaries per condensed node |
- | `LCM_CONDENSED_MIN_FANOUT_HARD` | `2` | Relaxed fanout for forced compaction sweeps |
- | `LCM_INCREMENTAL_MAX_DEPTH` | `0` | How deep incremental compaction goes (0 = leaf only, -1 = unlimited) |
- | `LCM_LEAF_CHUNK_TOKENS` | `20000` | Max source tokens per leaf compaction chunk |
- | `LCM_LEAF_TARGET_TOKENS` | `1200` | Target token count for leaf summaries |
- | `LCM_CONDENSED_TARGET_TOKENS` | `2000` | Target token count for condensed summaries |
- | `LCM_MAX_EXPAND_TOKENS` | `4000` | Token cap for sub-agent expansion queries |
- | `LCM_LARGE_FILE_TOKEN_THRESHOLD` | `25000` | File blocks above this size are intercepted and stored separately |
- | `LCM_LARGE_FILE_SUMMARY_PROVIDER` | `""` | Provider override for large-file summarization |
- | `LCM_LARGE_FILE_SUMMARY_MODEL` | `""` | Model override for large-file summarization |
- | `LCM_SUMMARY_MODEL` | *(from OpenClaw)* | Model for summarization (e.g. `anthropic/claude-sonnet-4-20250514`) |
- | `LCM_SUMMARY_PROVIDER` | *(from OpenClaw)* | Provider override for summarization |
- | `LCM_AUTOCOMPACT_DISABLED` | `false` | Disable automatic compaction after turns |
- | `LCM_PRUNE_HEARTBEAT_OK` | `false` | Retroactively delete `HEARTBEAT_OK` turn cycles from LCM storage |
- | `OPENCLAWBRAIN_ENABLED` | `true` | Enable/disable the learning layer |
- | `OPENCLAWBRAIN_ROOT` | `~/.openclaw/openclawbrain` | Root directory for `state.db` and immutable packs |
- | `OPENCLAWBRAIN_EMBEDDING_PROVIDER` | `openai` | Embedding provider (`openai`, `openai-resp`, or `ollama`) |
- | `OPENCLAWBRAIN_EMBEDDING_MODEL` | `""` | Embedding model required for `init`, retrieval, and `brain_teach` |
- | `OPENCLAWBRAIN_EMBEDDING_BASE_URL` | `""` | Optional embeddings API base URL override; `ollama` defaults to `http://127.0.0.1:11434/v1` |
- | `OPENCLAWBRAIN_EMBEDDING_API_KEY` | `""` | Optional explicit API key for authenticated embedding proxies / nonstandard OpenAI-compatible endpoints |
- | `OPENCLAWBRAIN_MAX_HOPS` | `8` | Hard traversal cap |
- | `OPENCLAWBRAIN_MAX_SEEDS` | `10` | Max seed nodes per query |
- | `OPENCLAWBRAIN_SEMANTIC_THRESHOLD` | `0.7` | Minimum seed similarity |
- | `OPENCLAWBRAIN_SHADOW_MODE` | `false` | Record brain routes and traces without injecting learned context into the prompt |
- | `OPENCLAWBRAIN_TRAINER_INTERVAL_MS` | `30000` | Background worker interval |
-
- ## Operator Commands
-
- ```bash
- openclawbrain init [workspace]
- openclawbrain status
- openclawbrain trace [traceId]
- openclawbrain replay
- openclawbrain promote
- openclawbrain rollback [version]
- openclawbrain disable
- openclawbrain enable
- openclawbrain doctor
- ```
-
- ## Fallback Behavior
-
- - If the brain has not been initialized, the plugin serves LCM-only context.
- - If embeddings are not configured, learned retrieval and `brain_teach` stay disabled.
- - Local loopback embedding endpoints (for example Ollama on `127.0.0.1` / `localhost`) do not require a bearer token; remote OpenAI-compatible endpoints still do unless you provide `OPENCLAWBRAIN_EMBEDDING_API_KEY`.
- - `openclawbrain status` and `openclawbrain doctor` expose the resolved embedding provider / model / base URL / auth mode so operator truth stays visible.
- - If the background worker is unavailable, serving still uses the last promoted pack.
- - `brain_teach` now binds taught corrections to the active conversation when invoked from a live tool session.
- - Seed learning is persisted as explicit per-node seed weights and exposed in traces.
-
- ## Operational gaps still open
-
- This repo is already beyond “foundation only,” but it is **not** yet operationally validated end to end.
-
- - Embedding support currently targets tested OpenAI-compatible `/v1/embeddings` APIs, including local Ollama-style endpoints.
- - Child-worker mode now runs behind a dedicated supervisor with explicit protocol messages, restart accounting, reload acknowledgements, and stronger status/doctor truth; the remaining operator-proof gap is on the host-surface seam, not the learner boundary itself.
- - Structured evidence harvesting now exists end to end (raw evidence → resolved labels with explicit episode attribution), harvested completions can now persist multiple concurrent raw signals with extractor metadata before worker resolution, and structured tool-result/function-output parts now feed self-evidence detection before regex fallback; the remaining gap is that source detection still leans on heuristics more than the intended richer human/self/scanner evidence flow.
- - Deterministic session-bound `brain_teach` proof now exists, but the current raw host lane is blocked by stale OpenClaw seam drift (`plugins.slots.contextEngine` / `api.registerContextEngine`) and the final narrow worker-down host claim is still unfrozen.
- - Replay-gated promotion exists, but mutation evaluation has not yet reached the intended bundle-level replay contract.
- - Upstream `openclaw/plugin-sdk` type drift still affects full-repo `npx tsc --noEmit`.
-
- ## Finish path to 1.0
-
- 1. **Align repo truth with repo reality** so the README and canonical docs cleanly separate what is true now, implemented-but-not-frozen, and not done yet.
- 2. **Finish the real OpenClaw host-surface validation harness** by adapting the stale current-OpenClaw plugin/config seam first, then freezing recurrent routing, static bypass, shadow mode, worker-down fail-open, and explicit skip modes on that repaired boundary.
- 3. **Finish the evidence pipeline** so structured evidence tied to exact episodes outruns heuristic-only harvesting.
- 4. **Upgrade mutation evaluation to replay-gated bundles** instead of proposal-by-proposal promotion.
- 5. **Freeze the proof ladder** with dated artifact bundles under `docs/evidence/`.
- 6. **Clean the packaging and type surface** until another OpenClaw operator can install, validate, and recover the plugin without tribal knowledge.
-
- ### Recommended starting configuration
+ Default base URL:
 
+ ```text
+ http://127.0.0.1:11434/v1
  ```
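The OpenAI-compatible `/v1/embeddings` shape this section relies on can be sketched as a plain request builder (editor's illustration; the model name and base URL are examples, and the helper is hypothetical, not part of the package):

```typescript
// Builds an OpenAI-compatible /v1/embeddings request of the kind both local
// Ollama and remote endpoints accept. Illustrative helper only.
function embeddingsRequest(baseUrl: string, model: string, input: string[]) {
  return {
    url: `${baseUrl.replace(/\/+$/, "")}/embeddings`,
    init: {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({ model, input }),
    },
  };
}

// Usage sketch (not run here):
// const { url, init } = embeddingsRequest("http://127.0.0.1:11434/v1", "bge-large:latest", ["hello"]);
// const data = await (await fetch(url, init)).json(); // data.data[0].embedding holds the vector
```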
- LCM_FRESH_TAIL_COUNT=32
- LCM_INCREMENTAL_MAX_DEPTH=-1
- LCM_CONTEXT_THRESHOLD=0.75
- ```
-
- - **freshTailCount=32** protects the last 32 messages from compaction, giving the model enough recent context for continuity.
- - **incrementalMaxDepth=-1** enables unlimited automatic condensation after each compaction pass — the DAG cascades as deep as needed. Set to `0` (default) for leaf-only, or a positive integer for a specific depth cap.
- - **contextThreshold=0.75** triggers compaction when context reaches 75% of the model's window, leaving headroom for the model's response.
-
- ### OpenClaw session reset settings
 
- LCM preserves history through compaction, but it does **not** change OpenClaw's core session reset policy. If sessions are resetting sooner than you want, increase OpenClaw's `session.reset.idleMinutes` or use a channel/type-specific override.
+ ### Remote OpenAI-compatible endpoint
 
  ```json
  {
- "session": {
- "reset": {
- "mode": "idle",
- "idleMinutes": 10080
+ "plugins": {
+ "entries": {
+ "openclawbrain": {
+ "config": {
+ "brainEmbeddingProvider": "openai",
+ "brainEmbeddingModel": "text-embedding-3-large",
+ "brainEmbeddingBaseUrl": "https://your-endpoint.example/v1"
+ }
+ }
  }
  }
  }
  ```
 
- - `session.reset.mode: "idle"` keeps a session alive until the idle window expires.
- - `session.reset.idleMinutes` is the actual reset interval in minutes.
- - OpenClaw does **not** currently enforce a maximum `idleMinutes`; in source it is validated only as a positive integer.
- - If you also use daily reset mode, `idleMinutes` acts as a secondary guard and the session resets when **either** the daily boundary or the idle window is reached first.
- - Legacy `session.idleMinutes` still works, but OpenClaw prefers `session.reset.idleMinutes`.
-
- Useful values:
-
- - `1440` = 1 day
- - `10080` = 7 days
- - `43200` = 30 days
- - `525600` = 365 days
+ If the remote endpoint needs auth, set `OPENCLAWBRAIN_EMBEDDING_API_KEY`.
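The auth rule stated here (loopback endpoints skip the bearer token, remote ones need a key) can be sketched in a few lines (editor's illustration; a hypothetical helper, not the plugin's actual resolution code):

```typescript
// Hedged sketch of the README's stated auth rule: local loopback embedding
// endpoints need no bearer token, remote ones do unless a key is supplied.
function needsBearerToken(baseUrl: string, apiKey?: string): boolean {
  const host = new URL(baseUrl).hostname; // WHATWG URL keeps IPv6 brackets
  const loopback = host === "127.0.0.1" || host === "localhost" || host === "[::1]";
  return !loopback && !apiKey;
}
```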
 
- For most long-lived LCM setups, a good starting point is:
+ ## Operator commands
 
- ```json
- {
- "session": {
- "reset": {
- "mode": "idle",
- "idleMinutes": 10080
- }
- }
- }
+ ```bash
+ openclawbrain init [workspace]
+ openclawbrain status
+ openclawbrain trace [traceId]
+ openclawbrain replay
+ openclawbrain promote
+ openclawbrain rollback [version]
+ openclawbrain disable
+ openclawbrain enable
+ openclawbrain doctor
  ```
 
- ## Documentation
+ ## Validation commands
 
- Canonical starting docs:
-
- - [Release contract](docs/RELEASE_CONTRACT.md)
- - [Definitive end-state guide](docs/END_STATE.md)
- - [Evidence ladder](docs/EVIDENCE.md)
-
- Supporting docs:
-
- - [Configuration guide](docs/configuration.md)
- - [LCM architecture](docs/architecture.md)
- - [Agent tools](docs/agent-tools.md)
- - [TUI Reference](docs/tui.md)
- - [lcm-tui](tui/README.md)
- - [Optional: enable FTS5 for fast full-text search](docs/fts5.md)
-
- ## Development
+ Deterministic runtime proof harness:
 
  ```bash
- # Run tests
- npx vitest
-
- # Type check
- npx tsc --noEmit
-
- # Run a specific test file
- npx vitest test/engine.test.ts
+ pnpm exec tsx scripts/validate-brain-runtime-behavior.ts
  ```
 
- ### Validation harness (Phase 1 scaffold)
-
- A disposable host-app validation scaffold now lives at:
+ Disposable host-surface harness:
 
  ```bash
  node scripts/validate-openclaw-install.mjs --setup-only
- ```
-
- For Phase 1 close-out work, prefer the explicit sterile validation lane instead of a mixed live-machine layout:
 
- ```bash
- OPENCLAWBRAIN_VALIDATION_LANE_NAME=ocbphase1 \
- OPENCLAWBRAIN_VALIDATION_GATEWAY_PORT=19031 \
- node scripts/validate-openclaw-install.mjs --sterile-lane --setup-only
+ OPENCLAWBRAIN_VALIDATION_EMBEDDING_PROVIDER=ollama \
+ OPENCLAWBRAIN_VALIDATION_EMBEDDING_MODEL=bge-large:latest \
+ OPENCLAWBRAIN_VALIDATION_MODEL=ollama/qwen2.5:7b-instruct \
+ node scripts/validate-openclaw-install.mjs
  ```
 
- Full init + host-app routing checks require explicit embedding/model env:
+ Current honest boundary: the runtime proof harness is a real release signal today; the full sterile host harness is still not a frozen end-to-end release gate.
 
- ```bash
- OPENCLAWBRAIN_VALIDATION_LANE_NAME=ocbphase1 \
- OPENCLAWBRAIN_VALIDATION_GATEWAY_PORT=19031 \
- OPENCLAWBRAIN_VALIDATION_EMBEDDING_MODEL=text-embedding-3-small \
- OPENCLAWBRAIN_VALIDATION_MODEL=openai/gpt-4.1-mini \
- node scripts/validate-openclaw-install.mjs --sterile-lane
- ```
+ ## Fallback behavior
 
- Each serious run now writes a predictable artifact bundle under `docs/evidence/YYYY-MM-DD/<git-sha>/`, including the pre-run diagnostic ladder (`openclaw status`, `status --all`, `gateway probe`, `gateway status`, `doctor`, and `channels status --probe`).
+ - if the brain has not been initialized, the plugin serves transcript-memory context only
+ - if embeddings are not configured, learned retrieval and `brain_teach` stay disabled
+ - local loopback embedding endpoints do not require a bearer token by default
+ - if the background worker is unavailable, serving still uses the last promoted pack
+ - `openclawbrain status` and `openclawbrain doctor` surface resolved embedding and worker truth so operator state is visible
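The fail-open rule in the list above (worker down, keep serving the last promoted pack) can be sketched as follows (editor's illustration; hypothetical shapes, not the plugin's service code):

```typescript
// Illustrative fail-open serving: a freshly promoted pack is only served
// while the worker is up; otherwise serving falls back to the last
// promoted pack rather than failing the turn.
type Pack = { version: string };

function packToServe(workerUp: boolean, freshlyPromoted: Pack | null, lastPromoted: Pack): Pack {
  if (workerUp && freshlyPromoted) return freshlyPromoted;
  return lastPromoted; // worker down: fail open on the last promoted pack
}
```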
339
231
 
340
- The deterministic session-bound `brain_teach` harness now lives at:
232
+ ## What is still open
341
233
 
342
- ```bash
343
- pnpm exec tsx scripts/validate-brain-teach-session-bound.ts \
344
- --state-dir "$HOME/.openclaw-ocbphase1" \
345
- --workspace "$HOME/.openclaw/workspace-ocbphase1" \
346
- --repetitions 20
347
- ```
234
+ 1. freeze the host-surface proof boundary honestly
235
+ 2. move mutation evaluation from proposal-level gating to bundle-level replay decisions
236
+ 3. turn the evidence ladder into a real CI/release gate
237
+ 4. clean the npm/package boundary and type surface
238
+ 5. keep pushing evidence sourcing away from heuristics toward structured signals
239
+
240
+ ## Documentation map

- It binds the real registered `brain_teach` tool to a deterministic `ctx.sessionKey`, proves the teach action is recorded against the warmup episode, runs follow-up runtime assembly through the real retrieval path, and writes artifacts under `docs/evidence/YYYY-MM-DD/<git-sha>/brain-teach-session-bound/`.
+ - [docs/RELEASE_CONTRACT.md](docs/RELEASE_CONTRACT.md) — exact truth contract: true now vs not frozen vs not done
+ - [docs/EVIDENCE.md](docs/EVIDENCE.md) — proof ladder and artifact contract
+ - [docs/configuration.md](docs/configuration.md) — practical operator setup
+ - [docs/END_STATE.md](docs/END_STATE.md) — maintainer execution guide
+ - [docs/architecture.md](docs/architecture.md) — inherited LCM substrate plus product architecture context
+ - [docs/agent-tools.md](docs/agent-tools.md) — recall tools vs live brain tools
+ - [docs/tui.md](docs/tui.md) — TUI reference
 
- The current short-static host-path classifier now lives at:
+ ## Development

  ```bash
- pnpm exec tsx scripts/validate-short-static-classification.ts \
- --state-dir "$HOME/.openclaw-ocbphase1-short-static" \
- --workspace "$HOME/.openclaw/workspace-ocbphase1"
+ npm test
+ npm pack --dry-run
+ npx tsc --noEmit
  ```
 
- On the current OpenClaw host build, that classifier truthfully freezes the remaining “short-static drift” question as **stale host seam drift first**: the host config rejects `plugins.slots.contextEngine`, and plugin register fails because `api.registerContextEngine` is gone. Until that seam is adapted, raw host-path short-static probing is not a valid semantic proof boundary.
-
- Current state: install/config wiring + fixture workspace + `openclawbrain init/status/doctor` are wired, and the harness can now target either temp-home isolation or a named sterile lane. The deterministic runtime layer already proves immediate `brain_teach` retrieval plus worker-down fail-open serving, and the session-bound harness now proves `brain_teach` deterministically at the correct seam with 20/20 identical passes. Raw prompt-driven `openclaw agent --local` is **not** the release proof boundary for `brain_teach`; on the current OpenClaw host, the remaining host work is to adapt the stale plugin/config seam and then freeze recurrent/shadow/skip-mode plus the narrow worker-down serving claim on that repaired boundary.
-
- ### Project structure
-
- ```
- index.ts # Plugin entry point and registration
- src/
- engine.ts # LcmContextEngine — implements ContextEngine interface
- assembler.ts # Context assembly (summaries + messages → model context)
- compaction.ts # CompactionEngine — leaf passes, condensation, sweeps
- summarize.ts # Depth-aware prompt generation and LLM summarization
- retrieval.ts # RetrievalEngine — grep, describe, expand operations
- expansion.ts # DAG expansion logic for lcm_expand_query
- expansion-auth.ts # Delegation grants for sub-agent expansion
- expansion-policy.ts # Depth/token policy for expansion
- large-files.ts # File interception, storage, and exploration summaries
- integrity.ts # DAG integrity checks and repair utilities
- transcript-repair.ts # Tool-use/result pairing sanitization
- types.ts # Core type definitions (dependency injection contracts)
- openclaw-bridge.ts # Bridge utilities
- db/
- config.ts # LcmConfig resolution from env vars
- connection.ts # SQLite connection management
- migration.ts # Schema migrations
- store/
- conversation-store.ts # Message persistence and retrieval
- summary-store.ts # Summary DAG persistence and context item management
- fts5-sanitize.ts # FTS5 query sanitization
- tools/
- lcm-grep-tool.ts # lcm_grep tool implementation
- lcm-describe-tool.ts # lcm_describe tool implementation
- lcm-expand-tool.ts # lcm_expand tool (sub-agent only)
- lcm-expand-query-tool.ts # lcm_expand_query tool (main agent wrapper)
- lcm-conversation-scope.ts # Conversation scoping utilities
- common.ts # Shared tool utilities
- test/ # Vitest test suite
- scripts/ # Validation harnesses and operator helpers
- openclaw.plugin.json # Plugin manifest with config schema and UI hints
- tui/ # Interactive terminal UI (Go)
- main.go # Entry point and bubbletea app
- data.go # Data loading and SQLite queries
- dissolve.go # Summary dissolution
- repair.go # Corrupted summary repair
- rewrite.go # Summary re-summarization
- transplant.go # Cross-conversation DAG copy
- prompts/ # Depth-aware prompt templates
- .goreleaser.yml # GoReleaser config for TUI binary releases
- ```
+ Current repo truth: `npm test` and targeted runtime validation pass, while full-repo `npx tsc --noEmit` still has known type drift outside the latest runtime slices.
 
  ## License