npm - brainclaw - Versions diffs - 1.8.0 → 1.9.0 - Mend

brainclaw 1.8.0 → 1.9.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (140) hide show

package/README.md +12 -11
package/dist/brainclaw-vscode.vsix +0 -0
package/dist/cli.js +138 -13
package/dist/commands/add-step.js +1 -1
package/dist/commands/bootstrap.js +2 -26
package/dist/commands/check-security-mcp.js +50 -33
package/dist/commands/check-security.js +86 -43
package/dist/commands/claim.js +22 -21
package/dist/commands/confirm.js +26 -0
package/dist/commands/context-diff.js +1 -1
package/dist/commands/dispatch-watch.js +142 -0
package/dist/commands/doctor.js +113 -2
package/dist/commands/estimation-report.js +115 -16
package/dist/commands/harvest.js +285 -22
package/dist/commands/init.js +123 -21
package/dist/commands/loops-handlers.js +4 -0
package/dist/commands/mcp-read-handlers.js +198 -29
package/dist/commands/mcp.js +588 -92
package/dist/commands/memory.js +21 -17
package/dist/commands/migrate.js +81 -17
package/dist/commands/prune.js +78 -4
package/dist/commands/reflect.js +26 -20
package/dist/commands/register-agent.js +57 -1
package/dist/commands/repair.js +20 -0
package/dist/commands/session-end.js +15 -6
package/dist/commands/session-start.js +18 -1
package/dist/commands/setup-security.js +39 -18
package/dist/commands/setup.js +26 -27
package/dist/commands/stale.js +16 -2
package/dist/commands/uninstall.js +126 -34
package/dist/commands/update-step.js +6 -0
package/dist/commands/worktree.js +60 -0
package/dist/core/actions.js +12 -3
package/dist/core/agent-capability.js +11 -13
package/dist/core/agent-files.js +844 -547
package/dist/core/agent-integrations.js +0 -3
package/dist/core/agent-inventory.js +67 -0
package/dist/core/agent-registry.js +163 -29
package/dist/core/agentrun-reconciler.js +33 -2
package/dist/core/agentruns.js +7 -1
package/dist/core/ai-agent-detection.js +31 -44
package/dist/core/archival.js +15 -9
package/dist/core/assignment-reconciler.js +56 -0
package/dist/core/assignment-sweeper.js +127 -4
package/dist/core/assignments.js +69 -11
package/dist/core/bootstrap.js +233 -67
package/dist/core/brainclaw-version.js +22 -0
package/dist/core/candidates.js +21 -1
package/dist/core/claims.js +313 -150
package/dist/core/config.js +6 -1
package/dist/core/context-diff.js +148 -20
package/dist/core/context.js +129 -8
package/dist/core/coordination.js +22 -3
package/dist/core/dispatch-status.js +79 -5
package/dist/core/dispatcher.js +64 -11
package/dist/core/entity-operations.js +45 -24
package/dist/core/entity-registry.js +31 -5
package/dist/core/event-log.js +138 -21
package/dist/core/events/checkpoint.js +258 -0
package/dist/core/events/genesis.js +220 -0
package/dist/core/events/journal.js +507 -0
package/dist/core/events/materialize.js +126 -0
package/dist/core/events/registry-post-image.js +110 -0
package/dist/core/events/verify.js +109 -0
package/dist/core/execution-adapters.js +23 -0
package/dist/core/facade-schema.js +38 -0
package/dist/core/gc-semantic.js +130 -5
package/dist/core/handoff-snapshot.js +68 -0
package/dist/core/ids.js +19 -8
package/dist/core/instruction-templates.js +34 -115
package/dist/core/io.js +39 -3
package/dist/core/json-store.js +10 -1
package/dist/core/lock.js +153 -28
package/dist/core/loops/bootstrap-acquire.js +25 -1
package/dist/core/loops/facade-schema.js +2 -0
package/dist/core/loops/hooks/survey-signals-baseline.js +36 -0
package/dist/core/loops/index.js +1 -0
package/dist/core/loops/presets/bootstrap.js +7 -0
package/dist/core/loops/store.js +17 -0
package/dist/core/loops/verbs.js +24 -1
package/dist/core/markdown.js +8 -76
package/dist/core/mcp-command-resolution.js +245 -0
package/dist/core/memory-compactor.js +5 -3
package/dist/core/memory-lifecycle.js +282 -0
package/dist/core/merge-risk.js +150 -0
package/dist/core/messaging.js +8 -1
package/dist/core/migration.js +11 -1
package/dist/core/observer-mode.js +26 -0
package/dist/core/operations/memory-mutation.js +90 -65
package/dist/core/operations/plan.js +27 -1
package/dist/core/protocol-skills.js +210 -0
package/dist/core/reflection-safety.js +6 -7
package/dist/core/reputation.js +84 -2
package/dist/core/runtime-signals.js +71 -9
package/dist/core/runtime.js +84 -1
package/dist/core/schema.js +114 -0
package/dist/core/security-detectors.js +125 -0
package/dist/core/security-extract.js +189 -0
package/dist/core/security-guard.js +107 -29
package/dist/core/security-packages.js +121 -0
package/dist/core/security-scoring.js +76 -9
package/dist/core/security.js +34 -2
package/dist/core/sequence.js +11 -2
package/dist/core/setup-flow.js +141 -13
package/dist/core/staleness.js +72 -1
package/dist/core/state.js +250 -54
package/dist/core/store-resolution.js +19 -5
package/dist/core/worktree.js +72 -8
package/dist/facts.js +8 -8
package/dist/facts.json +7 -7
package/docs/PROTOCOL.md +223 -0
package/docs/cli.md +11 -10
package/docs/concepts/coordinator-runbook.md +129 -0
package/docs/concepts/event-log-store-critique-A.md +333 -0
package/docs/concepts/event-log-store-critique-B.md +353 -0
package/docs/concepts/event-log-store-phase0-measurements.md +58 -0
package/docs/concepts/event-log-store-proposal-A.md +365 -0
package/docs/concepts/event-log-store-proposal-B.md +404 -0
package/docs/concepts/event-log-store.md +928 -0
package/docs/concepts/identity-model-proposal.md +371 -0
package/docs/concepts/memory.md +5 -4
package/docs/concepts/observer-protocol.md +361 -0
package/docs/concepts/parallel-merge-protocol.md +71 -0
package/docs/concepts/plans-and-claims.md +43 -0
package/docs/concepts/skills.md +78 -0
package/docs/concepts/workspace-bootstrapping.md +61 -0
package/docs/integrations/agents.md +4 -4
package/docs/integrations/cline.md +10 -11
package/docs/integrations/codex.md +2 -2
package/docs/integrations/continue.md +5 -5
package/docs/integrations/copilot.md +14 -12
package/docs/integrations/openclaw.md +7 -6
package/docs/integrations/overview.md +7 -7
package/docs/integrations/roo.md +3 -3
package/docs/integrations/windsurf.md +6 -6
package/docs/mcp-schema-changelog.md +29 -2
package/docs/quickstart.md +48 -47
package/docs/security.md +174 -15
package/docs/storage.md +4 -2
package/package.json +8 -6

package/docs/concepts/observer-protocol.md ADDED Viewed

@@ -0,0 +1,361 @@
+# Observer Protocol — language-agnostic read-only surfaces
+Status: spec (pln#560 step 1). Pivot deliverable: it serves the VS Code
+extension, the JetBrains plugin, and any future surface identically. Companion
+to `event-log-store.md` (the journal this protocol consumes) and the VS Code
+vision §5 (the UX it powers).
+## 1. The one rule
+**An observability surface is a pure consumer of the event journal. It never
+acquires a store lock, never writes inside `.brainclaw/`, never runs a polling
+timer against the MCP server for display.** It tails the append-only journal,
+projects board state in memory, and refreshes the affected section when a new
+record arrives. The MCP server is reserved for *actions* (accept / release /
+dispatch / transition) through a separate, lazily-created client.
+Why this exists (2026-06-10 calibration): the prior extension "read" the board
+by calling a path that mutated and git-committed the entire store under the
+lock (`autoAcknowledge → persistState`, held >5s), ran the agent-run reconciler
+twice per poll with locked writes, created ~120 locked "unverified" event files
+per hour per run, and impersonated the parent shell's agent identity —
+consuming that agent's cursor. A dashboard is not an agent. This protocol makes
+that class of bug unrepresentable: a conforming observer *cannot* write.
+## 2. What the observer reads
+The journal defined in `event-log-store.md`:
+```
+.brainclaw/events/
+  meta.json                # { next_seq, active_segment, entity_revs } — a cache
+  seg-<firstSeq>.jsonl     # immutable once rolled; named by first seq it holds
+  seg-<firstSeq>.jsonl     # active = lexicographically-last segment
+  checkpoints/             # NOT YET EMITTED by the writer (see §5) — directory
+    ckpt-<seq>.json        # may be absent today; self-contained state manifest
+```
+**Segment naming is normative.** `firstSeq` is encoded as a **decimal,
+zero-padded to 8 digits** (e.g. `seg-00018342.jsonl`) so directory-listing
+lex-sort matches numeric seq order with no parsing. Implementations MUST emit
+and accept exactly this format; any other padding (or none) breaks the
+binary-search-by-filename in §5. The 8-digit field overflows at `seq > 1e8`;
+at the historical write rate (~17k events to date) this is decades away, but
+a journal that crosses the boundary requires a coordinated widen-and-pad
+migration in writer + every observer (deferred until needed; flagged here so
+no implementer assumes the pad width is incidental).
+`meta.json` is advisory only (§4): the observer reads it to cheaply detect
+"did anything change" but never trusts its `entity_revs` or `next_seq` for
+correctness — those are derived from the journal tail itself.
+Record envelope (v2), one JSON object per line:
+```jsonc
+{ "v": 2, "seq": 18342, "ts": "…", "writer": "w_…", "agent": "claude-code",
+  "action": "update", "item_type": "plan", "item_id": "pln_…",
+  "entity_rev": 7, "summary": "…", "payload": { /* full post-image */ } }
+```
+Action → class mapping (the observer needs the class, not a hardcoded verb
+list — it is `ACTION_CLASS_BY_ACTION` in `event-log.ts`, and MUST be mirrored
+in every implementation or fetched from a shared manifest):
+| Class | Effect on the projection |
+|---|---|
+| `entity-state` (`create`,`update`,`accept`,`reject`,`claim`,`release_claim`,`rollback`,`upgrade`,`backfill`) | upsert `payload` at `(item_type,item_id)` |
+| `tombstone` (`delete`) | remove `(item_type,item_id)` |
+| `journal-meta` (`checkpoint_ref`,`journal_note`,`seq_repair`,`federation_apply`) | ignore for state; `checkpoint_ref` is a bootstrap hint (§5) |
+| `observability` (`session_start`,`session_end`,`assignment_offered`,`assignment_progress`,`run_progress`) | activity feed only — never a state upsert |
+| `registry-lifecycle` (`assignment_*`,`run_*`) | upsert when `payload` present (phase 1.5+), else a status/activity signal |
+The observer is **forward-compatible**: an unknown `action` whose class it
+cannot resolve is applied as `entity-state` iff it carries a `payload` and an
+`item_id`, else treated as an activity signal. Never crash on an unknown verb.
+## 3. The cursor lives OUTSIDE the store
+The observer's read position is a **seq watermark** persisted in *client*
+storage, never in `.brainclaw/`:
+- VS Code: `ExtensionContext.workspaceState`, key `bclaw.observer.cursor.<project_id>`.
+- JetBrains: `PropertiesComponent` / project-scoped state, same key shape.
+- Generic: any client-private kv keyed by `project_id`.
+Shape: `{ seq: number, checkpoint_seq: number }`. `seq` = highest record seq
+applied; `checkpoint_seq` = the checkpoint the in-memory projection was last
+seeded from (for fast re-bootstrap). The store's own `.cursors/` directory is
+the AGENTS' read position and is **off-limits** to observers — touching it is
+the identity-leak bug this protocol forbids.
+Rationale: a watermark survives segment rotation, compaction, and archival
+(byte offsets do not). It is private to the surface, so N observers never
+interfere with each other or with agents.
+## 4. Change detection — a file watch, not a poll, not a lock
+The observer watches the journal directory for growth and reacts:
+1. Watch `.brainclaw/events/` (the active segment's size/mtime, and creation of
+   new `seg-*.jsonl`). VS Code: `FileSystemWatcher` on `events/seg-*.jsonl` +
+   `meta.json`. JetBrains: `VirtualFileListener` / NIO `WatchService`. Generic
+   fallback: stat the active segment on a *long* interval (≥10 s) — this is a
+   stat, not an MCP call, and acquires no lock.
+2. On a growth signal, **tail forward** from `cursor.seq` (§5) and apply records
+   to the in-memory projection.
+3. `meta.json` is advisory only; never trust it for correctness — the journal
+   tail is the truth (it may be a stale cache mid-write). Use it only to detect
+   "did anything change" cheaply.
+There is no MCP server process for display. The watcher is OS-level; the read
+is a file read. Under the 2026-06-10 load (3 workers + open surface) this yields
+zero lock acquisitions by the surface — the validation gate (step 3).
+## 5. Bootstrap and tail algorithm
+**Status of checkpoint emission (2026-06-12, pln#543 step 4 landed):** the
+writer does NOT yet produce `checkpoints/ckpt-*.json` files — checkpoint
+emission ships with step 3/5 of pln#543. Until then, the empty-seed + full
+tail path below is the **primary** cold-start path in production, not a
+degenerate fallback. The checkpoint-first path is the spec the consumer must
+implement for forward-compatibility; an observer that hard-requires a
+checkpoint at activation is broken against today's store. The perf targets
+in §10 assume checkpoint emission; until it ships, "activation → first
+summary" is bounded by the full tail length instead (~10 MB of segments in
+the typical case, sub-second in practice, but the budget no longer has
+slack).
+**Cold start (no cursor, or cursor below the oldest live segment):**
+1. **If a verified checkpoint exists**, load the newest `ckpt-<S>.json` →
+   seed the in-memory projection (full post-image set at head `S`). Set
+   `cursor.checkpoint_seq = S`. (Today: this branch is dead until the
+   writer ships checkpoint emission.)
+2. **Otherwise** (today's primary path), seed from the empty projection
+   with `cursor.checkpoint_seq = 0`.
+3. Tail every record with `seq > checkpoint_seq` across segments in
+   (segment, file-line) order; apply by class (§2). With no checkpoint
+   this is a full replay from seq 1, bounded by retention (the
+   `events/archive/` floor — segments are park-don't-deleted past the
+   second-newest verified checkpoint; with no checkpoint, no segment is
+   ever eligible for archive, so "the journal" = "every segment ever
+   written" until checkpoint emission ships).
+4. Set `cursor.seq` to the last applied seq. Render.
+If the cursor's `seq` is **below the oldest non-archived segment's first
+seq** (gap — segments archived past the watermark), discard the cursor and
+cold-start: notifications degrade, state never does. Today no segment is
+ever archived (gc requires a verified checkpoint floor, see §2.3 of the
+store spec), so this branch is unreachable in production until checkpoint
+emission ships — but the rule is normative regardless: an observer that
+crashes on a gap is broken against any future store.
+**Warm tail (cursor present, within live segments):**
+1. Binary-search the segment whose name (`seg-<firstSeq>`) contains
+   `cursor.seq + 1` (filenames sort by first seq).
+2. Stream forward from that point across segments; apply by class.
+3. A **torn tail** (final line unparseable or missing trailing `\n`) is expected
+   crash residue mid-write by an agent — skip it; it reappears complete on the
+   next growth signal. Never block on it.
+4. A mid-file unparseable line is logged and skipped (do not halt the tail).
+5. Advance `cursor.seq` only over records actually applied.
+Replay order is always (segment order, then file-line order) — never sorted by
+seq (matches the store's own reducer; a dup `seq` from a lock-steal applies
+later-line-wins, harmlessly, in a read-only projection).
+## 6. Board projection — which records touch which section
+The in-memory projection is `Map<item_type, Map<item_id, payload>>` plus a
+bounded recent-activity ring (observability + registry signals, last N). The
+board sections are derived; a record invalidates only the sections its
+`item_type` feeds, and only those re-render (push-by-affected-section, §5.3):
+| `item_type` | Invalidates sections |
+|---|---|
+| `plan` | IN_PROGRESS, SPRINTS, BACKLOG, ATTENTION (badge), SYSTEM (counts) |
+| `claim` | IN_PROGRESS, AGENTS (roster freshness) |
+| `assignment` | IN_PROGRESS, ATTENTION (blocked/failed), "Recently terminal" |
+| `agent_run` | IN_PROGRESS (worker rows), AGENTS, "Recently terminal" |
+| `candidate` | ATTENTION (human-review), CANDIDATES |
+| `action` | ATTENTION (the dominant attention input) |
+| `constraint`/`decision`/`trap` | SYSTEM (counts), TRAPS |
+| `handoff` | ACTIVITY, SYSTEM (counts) |
+| `sequence` | SPRINTS |
+| `session`/`*_progress` (observability) | ACTIVITY feed only — never a section state change |
+`attention_required` is computed by the observer from the projection (actions +
+human candidates + blocked/failed assignments + failed runs + evidence-
+contradicted terminals), matching what the server-side composite returns — the
+surface must not under-count by reading "actions only" (the pln#559 fix, now in
+the projection rule).
+### 6.1 Dual-mode coverage gap (CLOSED by pln#568 phase 1.5)
+> **Status (pln#568):** the writer-side gap below is **closed**. The
+> registry / coordination families (claim, assignment, agent_run,
+> action_required [journaled under item_type `state`], candidate, sequence,
+> and SHARED runtime_note) now emit full entity-state **post-images** on their
+> persist chokepoint (`src/core/events/registry-post-image.ts`), and the
+> observer materializer projects them (`board-projection.ts` ARRAY_SLOT).
+>
+> **Cutover signal (O2, resolved):** an observer switches a registry family
+> from the MCP `board_summary` seed to the journal only once the journal
+> carries the `journal_note` kind **`registry_genesis`** marker — emitted by
+> `runRegistryGenesisSupplement` (run via `brainclaw migrate --enable-journal`)
+> after it backfills every pre-existing registry entity. The marker is the
+> safety gate: without a complete backfill a partially-journaled store would
+> undercount the attention badge (trp#559). `BoardObserver.registryAuthoritative()`
+> tracks the marker (sticky, re-derived on cold start by replaying from the
+> checkpoint floor); `mergeCounts(journal, seed, journalActive, registryAuthoritative)`
+> takes claims/assignments/runs/actions from the journal when it is set, and
+> from the seed otherwise. `agents`/`sessions` are never journaled → always seed.
+> A store that has NOT run the supplement keeps the seed (no regression).
+>
+> The historical (pre-pln#568) description below is kept for context.
+The journal classifies records into five classes (§2). In phase 1 / `dual`
+mode — what runs today after pln#543 step 4 — **registry-lifecycle records
+are payload-OPTIONAL** (event-log-store.md §2.1.1, J4); the dual-write path
+in `src/core/event-log.ts:152` forwards `assignment_*` and `run_*` events to
+the journal with `item_id` only, no `payload`. The §2 rule "upsert when
+payload present, else a status/activity signal" means today's journal carries
+**no post-images for `assignment` or `agent_run`**: the in-memory projection
+has zero rows for those item_types, and the materializer
+(`src/core/events/materialize.ts`) only enumerates the 5 memory families
+(constraint/decision/trap/handoff/plan).
+Consequence for the §6 mapping table: until phase 1.5 ships, the rows that
+the table claims `assignment` / `agent_run` / `claim` populate (IN_PROGRESS
+worker rows, ATTENTION blocked/failed, Recently terminal under IN_PROGRESS)
+cannot be drawn from the journal alone. A conforming observer in dual mode
+MUST:
+- Seed those sections at activation from a **single observer-flagged
+  `bclaw_context(kind: "board_summary")`** call (no timer, no poll) — that
+  read is lock-free under the §8 observer contract (validated against
+  `getDispatchStatus` and `loadAssignment`/`loadAgentRun`, which are pure
+  projection reads — `mcp-read-handlers.ts:1916`, `json-store.ts:47`); and
+- Mark those sections "live-view degraded" in the tooltip until phase 1.5,
+  so the operator can tell journal-driven sections (memory entities,
+  attention badges) from MCP-seeded sections (workers, lifecycle).
+Memory-entity sections (plan / constraint / decision / trap / handoff /
+sequence / handoff-derived ACTIVITY) ARE journal-driven today via the
+per-entity diff in `persistState` (`src/core/state.ts:400`) — the protocol
+delivers its full value for them.
+### 6.2 Section ID glossary
+The §6 table uses display names; the canonical IDs in
+`vscode-extension/src/board-tree.ts:321` are `attention | in-progress |
+sprints | backlog | system | agents | candidates | activity | plans | claims
+| assignments | runs | actions | handoffs | sprint | traps | cross-project`.
+"Recently terminal" is **not** a top-level section — it is a sub-node
+rendered under `in-progress`. The board also surfaces `cross-project`
+(federation incoming signals) and `linked_projects`, neither of which has a
+single journal `item_type` today: cross-project signals arrive via the
+handoff/candidate streams (already covered), `linked_projects` is derived
+from project config and is intentionally NOT a journal concern.
+The projection is **state**, not administrative belief: a worker row's health
+comes from evidence in the records (commits/fs signals carried on
+registry-lifecycle payloads when present), not from a bare status field that the
+2026-06-10 log proved lies. Where richer evidence requires it, the surface MAY
+call `bclaw_dispatch_status` through the actions client (§7) — that is a
+read-only MCP call, used sparingly (per visible terminal row), not a poll.
+## 7. Actions go through a separate, lazy MCP client
+Mutations (accept candidate, release claim, dispatch, transition, complete step)
+are the *only* reason an observer talks to the MCP server. Rules:
+- One lazily-created MCP client per project, spun up on first action, idle-timed
+  out after inactivity. Never created just to display.
+- Distinct from any agent session: the client identifies as an **observer
+  principal** (see §8), so its calls never adopt an agent's claim/cursor.
+- After an action, the observer does NOT optimistically mutate its projection;
+  it waits for the resulting journal record(s) to arrive via the tail (§5) and
+  re-projects. Single source of truth, no split-brain. (A short-lived "pending"
+  affordance on the clicked item is a UI concern, not projection state.)
+- `bclaw_dispatch_status` and other read-only facades are permitted through this
+  client for on-demand evidence enrichment, but are never on a timer.
+## 8. Observer identity (no impersonation, no side effects)
+The surface declares itself an observer so the server suppresses every write a
+read would otherwise trigger:
+- Transport signal: `BRAINCLAW_OBSERVER=1` in the action client's env, and/or
+  MCP `clientInfo.name = "brainclaw-observer/<surface>"`.
+- Server contract (already implemented, pln#558): observer reads do not
+  `autoAcknowledge`, do not run agent-run reconciliation, do not advance
+  `readUnseenEvents` cursors, do not implicit-heartbeat or auto-register an
+  identity. This protocol is the client half of that contract: even the read-
+  only facade calls in §7 carry the observer flag.
+- The observer never presents an agent name as the actor of anything. Actions
+  the human triggers are attributed to the human operator principal, not to a
+  spawned agent.
+## 9. Failure modes and degradation
+| Condition | Behavior |
+|---|---|
+| `events/` absent (journal off / not migrated) | Fall back to a single MCP `board_summary` read at activation (no timer); show a "journal off — limited live view" hint. The surface still works, just not push-driven. |
+| Cursor gap (archived past watermark) | Cold-start from newest checkpoint (§5); silent — state is correct, only missed-activity history is lost. |
+| Checkpoint missing/corrupt | Fall back to the previous checkpoint, replay more segments (the two-checkpoint floor guarantees one exists); if none, seed empty + full tail. |
+| Torn / unparseable line | Skip, keep tailing (§5). |
+| Active segment shrinks / meta regresses | Trust the journal tail, re-derive; never write a "repair" (that is an agent/doctor job). |
+| Watch unavailable (network FS, sandbox) | Degrade to a long-interval stat of the active segment; still zero locks. |
+## 10. Performance budget (vision §5.3, restated as observer obligations)
+| Operation | Target | Hard limit | How the protocol meets it |
+|---|---|---|---|
+| Activation → first summary | 500 ms | 2 s | seed from newest checkpoint, no full replay |
+| Summary refresh | 300 ms | 1 s | apply only the new tail records |
+| Section expand (warm) | 50 ms | 200 ms | projection is in memory; expand reads the map |
+| Section expand (cold) | 500 ms | 2 s | first projection build from checkpoint+tail |
+| Action round-trip | 500 ms | 2 s | lazy MCP client; result observed via tail |
+Out of budget → surface in tooltip + a "performance degraded" status-bar
+indicator (never escalate by calling a heavier path — that is the contention-
+breeds-contention bug this protocol exists to kill).
+## 11. Language-agnostic conformance checklist
+A surface in any language conforms iff:
+1. It reads only files under `.brainclaw/events/` (+ checkpoints) and writes
+   nothing under `.brainclaw/`.
+2. Its cursor is a seq watermark in client-private storage, keyed by
+   `project_id`, never in the store's `.cursors/`.
+3. It seeds from the newest verified checkpoint and tails by (segment, line)
+   order, applying records by action *class*, tolerant of unknown verbs and torn
+   tails.
+4. Change detection is an OS file watch (or long-interval stat) — never an MCP
+   poll, never a lock.
+5. Mutations go through a separate lazy MCP client flagged as an observer
+   principal; the projection updates only from the resulting journal records.
+6. `attention_required` and worker health are computed from journal evidence,
+   not from administrative status fields alone.
+Reference implementation: the VS Code extension (pln#560 step 2). The JetBrains
+plugin (next plan) implements this same checklist in Kotlin — its existence is
+the cross-language validation that this protocol, not the TypeScript code, is
+the contract.
+## 12. OPEN QUESTIONS
+Carried from the 2026-06-12 symmetric review (pln#560 step 1, this branch).
+Each is something the spec text cannot close on its own; one or more must
+be answered before the JetBrains plugin (Kotlin) ships.
+| # | Sev | Question |
+|---|---|---|
+| O1 | MED | **Shared `ACTION_CLASS_BY_ACTION` manifest.** §2 says implementations MUST mirror the table "or fetch it from a shared manifest." Today only the TS version exists (`src/core/events/journal.ts:66`); a Kotlin implementer would re-type 42 entries by hand and silently drift on the 43rd. Should this ship as a generated JSON next to `event-log-store.md` (single source of truth, both runtimes load it) or as part of a versioned schema bundle? Recommend the generated JSON — the table is small and changes per spec revision, not per release. |
+| O2 | RESOLVED (pln#568) | **Phase-1.5 cutover signal for §6.1.** Resolved with a `journal_note` kind **`registry_genesis`** marker emitted by `runRegistryGenesisSupplement` after it backfills every pre-existing registry entity (`brainclaw migrate --enable-journal`). Observers detect it (`BoardObserver.registryAuthoritative()`, sticky + re-derived on cold start) and switch the registry counts from the MCP seed to the journal via `mergeCounts(..., registryAuthoritative)`. Chosen over a `meta.json` version bump because meta is a rebuildable cache (§2.3) — the marker is a durable journal record, the source of truth, and survives a meta rebuild. Open follow-up: when checkpoints start emitting (today `checkpoint_seq=0`), the checkpoint must encode the capability so a cold start past the marker's segment still re-derives authority. |
+| O3 | LOW | **`bclaw_dispatch_status` enrichment scope.** §6 + §7 allow it "per visible terminal row" but the wording is ambiguous between "terminal-state row" (Recently terminal) and "row currently visible in the terminal UI" (every IN_PROGRESS row). Settle: probably the first (only failed/silent_death rows want the evidence digest) — but the contract must say so, otherwise an implementor renders an O(workers) burst on every refresh. |
+| O4 | LOW | **Segment pad-width upgrade path.** §2 pins 8-digit decimal padding; the writer is 8-digit too (`src/core/events/journal.ts:214 SEGMENT_PAD=8`). At ~17k events historical, the 1e8 ceiling is decades out — but a future widen would require coordinated writer + every observer roll-out. Carry the migration recipe (pad-width in `meta.json`?) here so a future maintainer doesn't have to rediscover it. |
+| O5 | LOW | **File watch semantics on Windows network mounts.** §4 falls back to a "long-interval stat" when the watcher is unavailable. VS Code's `FileSystemWatcher` on a junction-linked worktree (the brainclaw dispatch substrate) may fire on the link target's mtime but not the source-of-truth segment writes from another process; verify against the dispatch worktree machinery (`pln#498` junctions) before declaring the watch path universal. |

package/docs/concepts/parallel-merge-protocol.md ADDED Viewed

@@ -0,0 +1,71 @@
+# Parallel-lane merge protocol
+Status: operational (pln#396). When a sequence runs multiple lanes in parallel
+worktrees, their branches must land back on the base without silently
+overwriting each other. This is the minimum protocol; `brainclaw worktree
+check` is its tool.
+## The risk
+Each lane is a worktree branched from the base (master) at dispatch time. Two
+lanes editing the same file will, on the second merge, either conflict
+(visible, recoverable) or — worse — produce a clean-but-wrong merge where one
+lane's change is silently dropped or a file is parasitically deleted
+(trp_merge_wipes_node_modules class). File-level overlap between lanes is the
+predictor; `worktree check` surfaces it before `git merge`, not after.
+## Before merging any lane
+```
+brainclaw worktree check
+```
+It reports, per live worktree lane: the files it changes (committed since base
++ uncommitted tracked), the owning claim / session / agent, and — the payload —
+**which files are touched by more than one lane**. Exit code:
+- `0` — lanes are disjoint. Merge in any order; no cross-lane conflict possible.
+- `3` — overlapping files exist. Follow the ordering rule below.
+`brainclaw worktree merge <branch>` runs the same check inline and prints an
+advisory warning if the branch overlaps another live lane (it never blocks —
+the operator decides).
+## Ordering rule when lanes overlap
+1. **Merge the overlapping lanes one at a time**, never as a batch.
+2. After merging lane A, **rebase (or re-create) the still-pending overlapping
+   lanes onto the new base** so they see A's change, then re-run `worktree
+   check`. The overlap on the merged file should now be gone (the pending lane
+   either already has A's change or will conflict explicitly on rebase, where
+   it is resolvable in isolation).
+3. Disjoint lanes (no shared files with anything merged) can merge freely at any
+   point — `check` confirms they carry no risk.
+4. Prefer merging the **smallest / most foundational** overlapping lane first
+   (fewer files, or the one others build on), so the rebases that follow are
+   the cheap direction.
+## When automatic reconciliation is not possible
+`worktree check` is a *predictor*, and `worktree merge` auto-restores parasitic
+deletions but does **not** resolve real content conflicts. Escalate to a human
+(or a single coordinator session doing the merge serially) when:
+- Two lanes changed overlapping *hunks* of the same file (not just the same
+  file) — git will conflict; resolve in the main worktree, do not `--force`.
+- A lane's worktree carries **uncommitted** changes to a shared file (it spawned
+  from HEAD; those edits never reach the merge). `check` flags these as
+  `(+N uncommitted)` — harvest or discard them deliberately before merging
+  (a dead worker's stranded edits are the feedback_review_loop_symmetric_fixer
+  / trp#545 case).
+- The overlap is in a generated / high-churn file (lockfiles, `dist/`) — prefer
+  regenerating post-merge over merging both sides.
+## Invariants
+- The check is pure-read: only `git diff` / `git status`, no store lock, no
+  mutation — safe to run anytime, even mid-dispatch with workers live.
+- `.brainclaw/` and `.gitignore` are never counted as conflict surface (store-
+  internal + birth noise).
+- A flagged overlap that turns out disjoint at the hunk level costs one glance;
+  the protocol deliberately over-reports rather than miss a real conflict.

package/docs/concepts/plans-and-claims.md CHANGED Viewed

@@ -75,6 +75,49 @@ task with estimate      est:30min  actual:45min  [ratio:1.5x]
 A ratio below 1.0 means the task finished faster than expected (early). Above 1.0 means it took longer (over).
+### Step-level estimation (pln#495)
+Estimation can be captured per **step**, not just per plan — which removes a real
+source of noise. A plan's wall-clock span (`created_at`→`completed_at`) counts
+the idle time *between* steps as if it were work: if steps 1–5 finish in a
+morning and step 6 lands the next day, the plan-level elapsed smears an 18h gap
+that was never effort. Summing per-step durations excludes those gaps.
+Set step estimates and actuals via `add-step` / `update-step`:
+```bash
+brainclaw add-step <plan-id> "write unit tests" --estimate 30
+brainclaw update-step <plan-id> <step-id> --status in_progress   # stamps started_at
+brainclaw update-step <plan-id> <step-id> --status done           # stamps completed_at
+# or record an explicit actual:
+brainclaw update-step <plan-id> <step-id> --actual-effort 45m
+```
+`estimated_effort` accepts the same forms as the plan-level `--estimate` (an
+integer of minutes, or a legacy duration string like `2h` / `30m`).
+`estimation-report` then prefers step-level data and tags each plan with its
+**measurement source**:
+- **`step`** — the highest quality: `estimated_minutes` is the sum of step
+  estimates (used only when *every* step has one), and `elapsed_minutes` is the
+  sum of per-step durations (used only when *every* step is measurable, via an
+  explicit `actual_effort` or both `started_at`+`completed_at`). Idle gaps
+  between steps are excluded.
+- **`plan_string`** — fell back to the plan-level `actual_effort` string.
+- **`plan_wallclock`** — fell back to the plan's `created_at`→`completed_at`
+  span (the noisiest; what older plans use).
+The report's summary breaks the median ratio down per source
+(`step-derived: 1.0x · plan-wallclock: 0.4x …`) so you can see how much
+calibration error was wall-clock contamination vs real estimation drift, and the
+chart tags each line (`✓step` / `~wall`).
+**Migration:** none required. A plan whose steps carry no estimation data — or a
+plan with no steps at all — keeps working exactly as before via the fallback
+chain. Mixed plans (some steps estimated, some not) fall back to plan-level
+entirely rather than reporting a misleading partial sum.
 ## Claims
 Claims make current ownership explicit.

package/docs/concepts/skills.md ADDED Viewed

@@ -0,0 +1,78 @@
+# Skills — agent-profile vs workflow-protocol
+Brainclaw writes two orthogonal kinds of agent skill. The same agent can load
+both; they answer different questions.
+## Agent-profile skills
+The agent's **landing page** for brainclaw — one per agent profile
+(`openclaw`, `nanoclaw`, `nemoclaw`, `picoclaw`, `zeroclaw`) plus the universal
+`.agents/skills/brainclaw/SKILL.md` discovered by every agent that honors the
+shared `.agents/skills/` convention (Cursor, Copilot, Roo, OpenCode, Codex,
+Kilo, Mistral…). It answers *"what is brainclaw and how do I load context here?"*
+and points at `bclaw_work` as the entry verb.
+Source: `src/core/agent-files.ts` (`ensureUniversalBrainclawSkill`, the per-profile
+writers). These files are **generated, not committed** — they are gitignored and
+materialized by `brainclaw export` / setup, so the source of truth is the code,
+never a checked-in `SKILL.md`.
+## Workflow-protocol skills (pln#519)
+**Workflow-decomposed** skills that package a critical brainclaw protocol so the
+agent loads *the right one at the right moment* instead of skimming a monolithic
+AGENTS.md. Three ship today, namespaced `brainclaw-*`:
+| Skill | Trigger |
+|---|---|
+| `brainclaw-session` | starting / resuming / closing a session; before claiming a scope |
+| `brainclaw-memory-capture` | recording a decision / constraint / trap / handoff at the right type |
+| `brainclaw-multi-agent` | delegating, reviewing, dispatching, driving a loop |
+They carry `metadata.protocol: true` in frontmatter so skill-loader UIs can list
+protocols separately from profile skills, and `metadata.brainclaw_version` (set
+to `package.json:version` at write time) so a loader can detect a cached skill
+that predates a brainclaw upgrade. Each follows the "process not prose" shape:
+**When to use → Workflow → Anti-rationalizations → Red flags → Verification**.
+Source: `src/core/protocol-skills.ts` (single source of truth; content is
+embedded, not read from a repo file, so it installs identically from source or
+an npm install). Written to `.agents/skills/<id>/SKILL.md` by
+`ensureProtocolSkills`, wired into `writeDetectedAgentAutoConfig` for every agent
+whose capability profile declares `hasSkills: true`. Generated + gitignored, like
+the profile skills.
+### Design invariants
+- **No dynamic state.** A protocol-skill never embeds a concrete `claim_id` /
+  `loop_id` / `plan_id` — it tells the agent *when* to call a facade and *how to
+  read* live state, never *what the current state is*. (Enforced by a test.)
+- **Facade-only.** Skills reference canonical-grammar / facade verbs by name;
+  they never re-implement `mcp.ts` logic. A protocol change updates the skill,
+  not the other way round.
+- **Both MCP and CLI.** Each workflow shows the MCP call AND the `brainclaw …`
+  CLI fallback, because MCP is not always wired (cold start; a dispatched worker
+  in a worktree without `.brainclaw/`, trp#336).
+- **Capped at 3.** Setup and troubleshooting protocols are deferred until an
+  empirical friction shows the three are insufficient (design §E.2). A 4th
+  protocol needs a runtime_note documenting the gap it closes.
+## Namespace claim
+Brainclaw owns the `brainclaw-*` prefix under `.agents/skills/`. Other tools that
+install skills into the shared directory must avoid `brainclaw-` ids to prevent
+collisions.
+## Supply chain
+Protocol-skills ship **inside the npm package**. Brainclaw does **not** currently
+support installing external skills (no `brainclaw skill install <url>`, no loading
+from arbitrary paths). If external installs are ever added they will require a
+deliberate trust boundary (local-only or brainclaw-signed); that is out of scope
+today.
+## Staleness
+If `brainclaw --version` reports newer than a skill's `brainclaw_version`,
+re-run `brainclaw export --all --write` to regenerate. The version tag lets a
+caching loader detect drift; it does not self-heal.

package/docs/concepts/workspace-bootstrapping.md CHANGED Viewed

@@ -25,6 +25,26 @@ It establishes the first shared memory foundation for the workspace:
 - writes to the detected agent's native instruction file (Cursor, Claude Code, Windsurf, etc.)
 - creates `AGENTS.md` and `.github/copilot-instructions.md`
+## Empty memory: one rule
+"Bootstrap" historically named three different systems (init scaffolding, the
+`bclaw_bootstrap` brownfield extractor, and the bootstrap ideation loop). When
+the memory store is empty, every surface — the `bclaw_work` hint, the
+`bclaw_setup` quick-init preview, and the `brainclaw init` preflight — now
+emits the same decision rule (`resolveEmptyMemoryRecommendation`):
+- **Repo with existing content** → run `bclaw_bootstrap` (CLI: `brainclaw bootstrap`)
+  to extract initial context from docs, manifests, native agent files, and git
+  history.
+- **Greenfield repo** (nothing to extract) → open a bootstrap loop to ideate
+  the project vision: `bclaw_coordinate(intent='ideate', preset='bootstrap')`
+  (CLI: `brainclaw bootstrap-loop`).
+The two routes are chainable in either order: extract first, then open a loop
+for whatever vision the docs could not provide — or ideate first, then extract
+once content exists. On greenfield, the brownfield preflight scan is skipped
+entirely (there is nothing to harvest yet).
 ## Good integration pattern
 1. check whether the workspace is initialized
@@ -39,6 +59,47 @@ It may simply mean the workspace has not been onboarded yet.
 This lets a single machine support multiple very different workspaces without forcing one static instruction layer to fit all of them equally well.
+## init = single project entry point
+`brainclaw init` is the single code path for turning a project into a
+brainclaw-aware workspace, whether invoked from a terminal, from the
+`bclaw_setup` MCP tool's quick-init step, or from a multi-repo
+`brainclaw setup`. After detecting the local AI agent, init runs the
+per-agent slice of the machine prerequisites (the same writes `setup`
+performs at machine scope, scoped to the detected agent) so an agent
+landing in the carte-blanche / fresh-repo case does not need a separate
+shell-out + session reload. The slice is idempotent — each `ensure*`
+function returns "skipped" when the agent's user-scope config doesn't
+exist, and the writes are short-circuited in `BRAINCLAW_TEST_MODE` or
+when `--skip-agent-bootstrap` is passed. `setup` is rescoped to
+multi-repo / machine-bootstrap; `setup-machine` is the explicit
+machine-only path.
+### `init --force`
+`--force` rebuilds managed identity fields (project_id, current_agent,
+storage_dir, topology) but **merges through the existing config** so
+curator personalisations (redaction patterns, sensitive paths,
+governance overrides, claim TTL, cross-project links, custom markdown
+caps) survive the reset. Before any write, a sibling backup is taken at
+`.brainclaw.bak-<timestamp>/` — the standard recovery-backups pattern
+used by `brainclaw upgrade`. Recovery: `brainclaw upgrade --rollback`.
+## Solo-agent fresh defaults
+A fresh `brainclaw init` seeds `governance.curators` with the human
+running init. Without this, the default `approval_policy: 'review'`
+combined with `curators: []` trapped every reflective note in pending
+forever — a surprise that doesn't show up until enough memory has
+accumulated to notice. The merge logic preserves any explicit curator
+list on an existing store, so this only takes effect on fresh installs.
+On an empty store, `bclaw_work` carries an explicit
+`bclaw_create(entity='plan')` hint in `next_actions` alongside the
+bootstrap recommendation: the bootstrap covers *vision*, the plan
+affordance covers *work* itself. The two are independent — both can
+appear simultaneously.
 ## Multi-project workspaces
 A workspace may contain multiple brainclaw-initialized child projects (each with its own `.brainclaw/` store). In this topology:

package/docs/integrations/agents.md CHANGED Viewed

@@ -80,10 +80,10 @@ The agent can use `bclaw_setup` to walk through the process interactively.
 ### Per-agent manual setup
 ```bash
-brainclaw enable-agent claude-code
-brainclaw enable-agent cursor
-brainclaw export --format claude-md --write
-brainclaw export --detect --write          # auto-detect and write all formats
+brainclaw enable-agent claude-code
+brainclaw enable-agent cursor
+brainclaw export --format claude-md --write
+brainclaw export --detect --write          # auto-detect and write the current agent format
 ```
 ### Regenerating after changes