npm - brainclaw - Versions diffs - 1.12.0 → 1.13.0 - Mend

brainclaw 1.12.0 → 1.13.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (22) hide show

package/README.md +32 -0
package/dist/brainclaw-vscode.vsix +0 -0
package/dist/cli.js +9 -2
package/dist/commands/claim-resource.js +1 -0
package/dist/commands/estimation-report.js +1 -1
package/dist/commands/harvest.js +30 -22
package/dist/commands/mcp.js +279 -44
package/dist/commands/release-claim.js +21 -1
package/dist/core/agent-capability.js +15 -4
package/dist/core/agent-registry.js +7 -1
package/dist/core/claims.js +160 -1
package/dist/core/context.js +11 -4
package/dist/core/dispatch-status.js +113 -7
package/dist/core/entity-operations.js +51 -3
package/dist/core/loops/store.js +33 -0
package/dist/core/reputation.js +18 -0
package/dist/core/schema.js +7 -0
package/dist/core/worktree.js +146 -22
package/dist/facts.js +36 -3
package/dist/facts.json +35 -2
package/docs/mcp-schema-changelog.md +7 -2
package/package.json +6 -4

package/README.md CHANGED Viewed

@@ -66,6 +66,20 @@ Capable agents use the MCP equivalents `bclaw_code_find` / `bclaw_code_brief`, e
 ---
+## Measured agent experience
+The onboarding path a fresh agent takes — init, first `bclaw_work`, first `code_find` — is exercised on every CI run by a reproducible bench (`scripts/bench.mjs`) against a synthetic store calibrated on the shape of a real production project (~200 plans, ~500 handoffs, ~450 claims). Budgets (`bench-budgets.json`) live in the repo alongside the coverage gate; a regression beyond a per-scenario tolerance fails the build. The latest report ships as `dist/facts.json` under `facts.bench` so the site can render current numbers.
+The bench covers three scenarios:
+- **Cold onboard** — fresh machine → init → first useful context. Measures the baseline time-to-first-value.
+- **Warm work** — `bclaw_work` consult over a real-shaped store. Tracks the cost of building context when the store is non-empty (the surface `pln#578`/`pln#566` optimise).
+- **First edit** — `code_find` + `code_brief` on the fresh-agent path. Tracks payload compactness (`pln#598`) and match relevance (`pln#601`).
+Reproduce locally with `npm run bench` (writes `dist/bench-report.json`); check budgets with `npm run bench:check`. The bench runs in-process against synthetic fixtures, so it is deterministic per seed and safe to run on any machine.
+---
 ## Agent Surfaces
 brainclaw exposes the same collaboration state through three surfaces, but they do not have the same role in an agent-first workflow.
@@ -417,6 +431,24 @@ npm run test:coverage      # with coverage report
 For older releases (v0.x and the early v1.0 launch series), `git log` on `master` is the source of truth — every release commit follows the `chore(release): bump version to <semver>` convention, and the matching feature/fix commits reference their plan id (e.g. `feat(mcp): self-heal ... (pln#478)`).
+### v1.13.0
+Operator-maturity batch from two days of heavy multi-agent dogfooding — dispatch/worktree lifecycle, claim parity, write-path auto-repair, model routing, benchmark gate, 2× faster context reads:
+- **Model selection for codex and copilot spawns** (pln#606) — `dispatch run --model <m>` now reaches codex (`exec -m`) and copilot argv, verified on the installed binaries. The codex `--add-dir` writable-roots spike closed negative on Windows: file-first stays the codex transport.
+- **Auto-repair identity & session on canonical writes** (pln#608) — `bclaw_create`/`update`/`remove`/`transition` auto-register + auto-session instead of throwing "Start a session first", with an explicit `session auto-created` warning; the trust boundary for unknown identities is unchanged.
+- **Time-to-first-value benchmark with blocking CI budgets** (pln#604) — seeded synthetic stores, three scenarios (cold_onboard / warm_work / first_edit), budgets versioned in `bench-budgets.json`.
+- **Claim lifecycle parity** (trp#928) — `bclaw_transition(entity='claim')` wired, `coordinator_override` (trusted+) on release/transition, cascade release with per-claim logging on plan-done / loop close / assignment-completed / harvest, entity-scoped `bclaw_find` filter rejections.
+- **Squash-aware worktree GC + junction-safe removal** (trp#926) — squash-merged lanes are finally collected (content-based detection), and the Windows `node_modules`-junction wipe class is closed for good; `dispatch_status` compares against the worktree's creation ref.
+- **VS Code extension** — Backlog pagination (recent plans were invisible, trp#925) and probe/spawn parity with classified, actionable resolver failures (trp#927).
+- **Context read 2× faster on large stores** (pln#578) — 3 of the 4 full-store read passes per context build eliminated (disabled-reputation sweep ×2, estimation reload, triple candidate scan): 23.9 s → 11.9 s, 196 MB → 49 MB parsed, byte-identical output.
+### v1.12.0
+Auto-localized execution writes for multi-project workspaces (pln#597), from DGX-Spark dogfooding:
+- **Execution writes auto-localize into a workspace sibling named by `project=X`** — `bclaw_create`/`bclaw_transition` (plan & claim), `bclaw_claim`, the step tools and `bclaw_delete_plan` open a session + sticky switch into a workspace store-chain child instead of rejecting with "limited to signaling entities", echoing `auto_switched`. The signaling-vs-execution boundary is re-scoped to federation (`cross_project_links`): federated links and unknown names stay blocked.
 ### v1.11.1
 Agent-identity & session-hook resilience, from a fresh-CLI dogfood on a monorepo:

package/dist/brainclaw-vscode.vsix CHANGED Viewed

Binary file

package/dist/cli.js CHANGED Viewed

@@ -1179,10 +1179,16 @@ program
     .option('--all', 'Include released claims in list')
     .option('--json', 'Output as JSON for list')
     .option('--plan-status <status>', 'Optional linked plan status when releasing: todo, in_progress, blocked, done, dropped')
+    .option('--coordinator-override', 'Trusted+ only: release a claim owned by another agent')
     .option('--store <target>', 'Target store level: local (default), repo, workspace')
     .option('--local-only', 'Read from local store only for list (skip parent stores in chain)')
     .action((subcommand, args, options) => {
-    runClaimResource(subcommand, args, { ...options, planStatus: options.planStatus, localOnly: options.localOnly });
+    runClaimResource(subcommand, args, {
+        ...options,
+        planStatus: options.planStatus,
+        coordinatorOverride: options.coordinatorOverride,
+        localOnly: options.localOnly,
+    });
 });
 // --- assignment ---
 program
@@ -1222,8 +1228,9 @@ program
     .command('release-claim <id>')
     .description('Release a work claim')
     .option('--plan-status <status>', 'Optional linked plan status: todo, in_progress, blocked, done, dropped')
+    .option('--coordinator-override', 'Trusted+ only: release a claim owned by another agent')
     .action((id, options) => {
-    runReleaseClaim(id, options);
+    runReleaseClaim(id, { ...options, coordinatorOverride: options.coordinatorOverride });
 });
 // --- release-claims ---
 program

package/dist/commands/claim-resource.js CHANGED Viewed

@@ -36,6 +36,7 @@ export function runClaimResource(subcommand, args, options) {
         }
         runReleaseClaim(id, {
             planStatus: options.planStatus,
+            coordinatorOverride: options.coordinatorOverride,
             cwd: options.cwd,
         });
         return;

package/dist/commands/estimation-report.js CHANGED Viewed

@@ -104,7 +104,7 @@ export function renderRatioBar(ratio, width = 40) {
     return bar;
 }
 export function buildEstimationReport(options = {}) {
-    const state = loadState(options.cwd);
+    const state = options.state ?? loadState(options.cwd);
     const done = state.plan_items.filter((p) => p.status === 'done' && (!options.agent || p.author === options.agent));
     const entries = done.map((p) => {
         // Estimate: prefer the sum of per-step estimates when ALL steps carry one,

package/dist/commands/harvest.js CHANGED Viewed

@@ -19,7 +19,7 @@ import { listCandidates, listArchivedCandidates, saveCandidate } from '../core/c
 import { createRuntimeEvent } from '../core/events.js';
 import { memoryExists } from '../core/io.js';
 import { loadAssignment, transitionAssignment } from '../core/assignments.js';
-import { releaseClaimWithCascade, loadClaim } from '../core/claims.js';
+import { loadClaim, releaseClaimsCascade, logCascadeReleaseResult } from '../core/claims.js';
 import { getCapabilityProfile, dispatchCanCommit } from '../core/agent-capability.js';
 import { commitWorktreeOnBehalf, worktreesBaseDir } from '../core/worktree.js';
 /**
@@ -438,12 +438,16 @@ export function integrateLaneResults(options = {}) {
                     ...entry.files_changed.slice(0, 50).map((f) => ({ type: 'file', ref: f })),
                 ];
                 entry.assignment_completed = forceCompleteAssignment(lane.assignment_id, artifacts, `pln#534 on-behalf integration: ${lane.summary.slice(0, 120)}`, actor, cwd);
-                try {
-                    const rel = releaseClaimWithCascade(assignment.claim_id, { planStatus: 'done', cwd });
-                    entry.claim_released = rel.claim.status === 'released';
-                }
-                catch (err) {
-                    reasons.push(`claim release failed: ${err instanceof Error ? err.message : String(err)}`);
+                // trp#928 — use the cascade helper (was releaseClaimWithCascade — same
+                // logic for the last-claim rule but the cascade wrapper LOGS per-claim,
+                // so a silent ownership failure is observable in the runtime event log
+                // rather than only in this in-memory `reasons` string).
+                const cascade = releaseClaimsCascade([assignment.claim_id], { cwd, planStatus: 'done' });
+                logCascadeReleaseResult({ actor, trigger: 'harvest_integrate', assignment_id: lane.assignment_id, claim_id: assignment.claim_id, cascade, cwd });
+                const claimEntry = cascade.entries[0];
+                entry.claim_released = claimEntry?.released === true;
+                if (claimEntry && !claimEntry.released) {
+                    reasons.push(`claim release ${claimEntry.reason}${claimEntry.error ? `: ${claimEntry.error}` : ''}`);
                 }
             }
             else {
@@ -455,15 +459,15 @@ export function integrateLaneResults(options = {}) {
                 catch (err) {
                     reasons.push(`assignment ${target} transition rejected: ${err instanceof Error ? err.message : String(err)}`);
                 }
-                try {
-                    const rel = releaseClaimWithCascade(assignment.claim_id, {
-                        planStatus: lane.status === 'blocked' ? 'blocked' : undefined,
-                        cwd,
-                    });
-                    entry.claim_released = rel.claim.status === 'released';
-                }
-                catch (err) {
-                    reasons.push(`claim release failed: ${err instanceof Error ? err.message : String(err)}`);
+                const cascade = releaseClaimsCascade([assignment.claim_id], {
+                    cwd,
+                    planStatus: lane.status === 'blocked' ? 'blocked' : undefined,
+                });
+                logCascadeReleaseResult({ actor, trigger: 'harvest_integrate', assignment_id: lane.assignment_id, claim_id: assignment.claim_id, cascade, cwd });
+                const claimEntry = cascade.entries[0];
+                entry.claim_released = claimEntry?.released === true;
+                if (claimEntry && !claimEntry.released) {
+                    reasons.push(`claim release ${claimEntry.reason}${claimEntry.error ? `: ${claimEntry.error}` : ''}`);
                 }
             }
             // Durable trace of the integration.
@@ -615,12 +619,16 @@ export function harvestOrphaned(options) {
                 ...report.files_changed.slice(0, 50).map((f) => ({ type: 'file', ref: f })),
             ];
             report.assignment_completed = forceCompleteAssignment(options.assignmentId, artifacts, 'pln#554 harvest --orphaned: worker died before delivering; work recovered from worktree', actor, cwd);
-            try {
-                const rel = releaseClaimWithCascade(assignment.claim_id, { planStatus: 'done', cwd });
-                report.claim_released = rel.claim.status === 'released';
-            }
-            catch (err) {
-                report.errors.push(`claim release failed: ${err instanceof Error ? err.message : String(err)}`);
+            // trp#928 — log per-claim via releaseClaimsCascade instead of the raw
+            // releaseClaimWithCascade so an ownership_denied outcome is visible in the
+            // runtime event log (previously trapped into report.errors only, which
+            // dies with the CLI invocation).
+            const cascade = releaseClaimsCascade([assignment.claim_id], { cwd, planStatus: 'done' });
+            logCascadeReleaseResult({ actor, trigger: 'harvest_integrate', assignment_id: options.assignmentId, claim_id: assignment.claim_id, cascade, cwd });
+            const claimEntry = cascade.entries[0];
+            report.claim_released = claimEntry?.released === true;
+            if (claimEntry && !claimEntry.released) {
+                report.errors.push(`claim release ${claimEntry.reason}${claimEntry.error ? `: ${claimEntry.error}` : ''}`);
             }
         }
         else {