npm - @exaudeus/workrail - Versions diffs - 3.66.0 → 3.68.0 - Mend

@exaudeus/workrail 3.66.0 → 3.68.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (150) hide show

package/dist/application/services/compiler/template-registry.js +10 -1
package/dist/application/validation.js +1 -1
package/dist/cli/commands/worktrain-init.js +1 -1
package/dist/console/standalone-console.js +4 -1
package/dist/console-ui/assets/{index-BynU38Vu.js → index-CyzltI6D.js} +1 -1
package/dist/console-ui/index.html +1 -1
package/dist/coordinators/modes/full-pipeline.js +4 -4
package/dist/coordinators/modes/implement-shared.js +5 -5
package/dist/coordinators/modes/implement.js +4 -4
package/dist/coordinators/pr-review.js +4 -4
package/dist/daemon/workflow-runner.d.ts +1 -0
package/dist/daemon/workflow-runner.js +1 -0
package/dist/infrastructure/storage/schema-validating-workflow-storage.d.ts +21 -2
package/dist/infrastructure/storage/schema-validating-workflow-storage.js +48 -0
package/dist/manifest.json +41 -41
package/dist/mcp/handlers/v2-workflow.js +24 -7
package/dist/mcp/output-schemas.d.ts +36 -0
package/dist/mcp/output-schemas.js +11 -1
package/dist/mcp/workflow-protocol-contracts.js +2 -2
package/dist/v2/projections/session-metrics.d.ts +1 -1
package/dist/v2/projections/session-metrics.js +16 -35
package/dist/v2/usecases/console-routes.d.ts +2 -2
package/docs/authoring-v2.md +4 -4
package/docs/changelog-recent.md +3 -3
package/docs/configuration.md +1 -1
package/docs/design/adaptive-coordinator-context-candidates.md +1 -1
package/docs/design/adaptive-coordinator-context.md +1 -1
package/docs/design/adaptive-coordinator-routing-candidates.md +18 -18
package/docs/design/adaptive-coordinator-routing-review.md +1 -1
package/docs/design/adaptive-coordinator-routing.md +34 -34
package/docs/design/agent-cascade-protocol.md +2 -2
package/docs/design/console-daemon-separation-discovery.md +323 -0
package/docs/design/context-assembly-design-candidates.md +1 -1
package/docs/design/context-assembly-implementation-plan.md +1 -1
package/docs/design/context-assembly-layer.md +2 -2
package/docs/design/context-assembly-review-findings.md +1 -1
package/docs/design/coordinator-access-audit.md +293 -0
package/docs/design/coordinator-architecture-audit.md +62 -0
package/docs/design/coordinator-error-handling-audit.md +240 -0
package/docs/design/coordinator-testability-audit.md +426 -0
package/docs/design/daemon-architecture-discovery.md +1 -1
package/docs/design/daemon-console-separation-discovery.md +242 -0
package/docs/design/daemon-memory-audit.md +203 -0
package/docs/design/design-candidates-console-daemon-separation.md +256 -0
package/docs/design/design-candidates-discovery-loop-fix.md +141 -0
package/docs/design/design-review-findings-console-daemon-separation.md +106 -0
package/docs/design/design-review-findings-discovery-loop-fix.md +81 -0
package/docs/design/discovery-loop-fix-candidates.md +161 -0
package/docs/design/discovery-loop-fix-design-review.md +106 -0
package/docs/design/discovery-loop-fix-validation.md +258 -0
package/docs/design/discovery-loop-investigation-A.md +188 -0
package/docs/design/discovery-loop-investigation-B.md +287 -0
package/docs/design/exploration-workflow-candidates.md +205 -0
package/docs/design/exploration-workflow-design-review.md +166 -0
package/docs/design/exploration-workflow-discovery.md +443 -0
package/docs/design/ide-context-files-candidates.md +231 -0
package/docs/design/ide-context-files-design-review.md +85 -0
package/docs/design/ide-context-files.md +615 -0
package/docs/design/implementation-plan-discovery-loop-fix.md +199 -0
package/docs/design/implementation-plan-queue-poll-rotation.md +102 -0
package/docs/design/in-process-http-audit.md +190 -0
package/docs/design/layer3b-ghost-nodes-design-candidates.md +2 -2
package/docs/design/loadSessionNotes-candidates.md +108 -0
package/docs/design/loadSessionNotes-test-coverage-discovery.md +297 -0
package/docs/design/loadSessionNotes-test-coverage-session4.md +209 -0
package/docs/design/loadSessionNotes-test-coverage-v3.md +321 -0
package/docs/design/probe-session-design-candidates.md +261 -0
package/docs/design/probe-session-phase0.md +490 -0
package/docs/design/routines-guide.md +7 -7
package/docs/design/session-metrics-attribution-candidates.md +250 -0
package/docs/design/session-metrics-attribution-design-review.md +115 -0
package/docs/design/session-metrics-attribution-discovery.md +319 -0
package/docs/design/session-metrics-candidates.md +227 -0
package/docs/design/session-metrics-design-review.md +104 -0
package/docs/design/session-metrics-discovery.md +454 -0
package/docs/design/spawn-session-debug.md +202 -0
package/docs/design/trigger-validator-candidates.md +214 -0
package/docs/design/trigger-validator-review.md +109 -0
package/docs/design/trigger-validator-shaping-phase0.md +239 -0
package/docs/design/trigger-validator.md +454 -0
package/docs/design/v2-core-design-locks.md +2 -2
package/docs/design/workflow-extension-points.md +15 -15
package/docs/design/workflow-id-validation-at-startup.md +1 -1
package/docs/design/workflow-id-validation-implementation-plan.md +2 -2
package/docs/design/workflow-trigger-lifecycle-audit.md +175 -0
package/docs/design/worktrain-task-queue-candidates.md +5 -5
package/docs/design/worktrain-task-queue.md +4 -4
package/docs/discovery/coordinator-script-design.md +1 -1
package/docs/discovery/coordinator-ux-discovery.md +3 -3
package/docs/discovery/simulation-report.md +1 -1
package/docs/discovery/workflow-modernization-discovery.md +326 -0
package/docs/discovery/workflow-selection-for-discovery-tasks.md +33 -33
package/docs/discovery/worktrain-status-briefing.md +1 -1
package/docs/discovery/wr-discovery-goal-reframing.md +1 -1
package/docs/docker.md +1 -1
package/docs/ideas/backlog.md +227 -0
package/docs/ideas/third-party-workflow-setup-design-thinking.md +1 -1
package/docs/integrations/claude-code.md +5 -5
package/docs/integrations/firebender.md +1 -1
package/docs/plans/agentic-orchestration-roadmap.md +2 -2
package/docs/plans/mr-review-workflow-redesign.md +9 -9
package/docs/plans/ui-ux-workflow-design-candidates.md +4 -4
package/docs/plans/ui-ux-workflow-discovery.md +2 -2
package/docs/plans/workflow-categories-candidates.md +8 -8
package/docs/plans/workflow-categories-discovery.md +4 -4
package/docs/plans/workflow-modernization-design.md +430 -0
package/docs/plans/workflow-staleness-detection-candidates.md +11 -11
package/docs/plans/workflow-staleness-detection-review.md +4 -4
package/docs/plans/workflow-staleness-detection.md +9 -9
package/docs/plans/workrail-platform-vision.md +3 -3
package/docs/reference/agent-context-cleaner-snippet.md +1 -1
package/docs/reference/agent-context-guidance.md +4 -4
package/docs/reference/context-optimization.md +2 -2
package/docs/roadmap/now-next-later.md +2 -2
package/docs/roadmap/open-work-inventory.md +16 -16
package/docs/workflows.md +31 -31
package/package.json +1 -1
package/spec/workflow-tags.json +47 -47
package/workflows/adaptive-ticket-creation.json +16 -16
package/workflows/architecture-scalability-audit.json +22 -22
package/workflows/bug-investigation.agentic.v2.json +3 -3
package/workflows/classify-task-workflow.json +1 -1
package/workflows/coding-task-workflow-agentic.json +6 -6
package/workflows/cross-platform-code-conversion.v2.json +8 -8
package/workflows/document-creation-workflow.json +8 -8
package/workflows/documentation-update-workflow.json +8 -8
package/workflows/intelligent-test-case-generation.json +2 -2
package/workflows/learner-centered-course-workflow.json +2 -2
package/workflows/mr-review-workflow.agentic.v2.json +4 -4
package/workflows/personal-learning-materials-creation-branched.json +8 -8
package/workflows/presentation-creation.json +5 -5
package/workflows/production-readiness-audit.json +1 -1
package/workflows/relocation-workflow-us.json +31 -31
package/workflows/routines/context-gathering.json +1 -1
package/workflows/routines/design-review.json +1 -1
package/workflows/routines/execution-simulation.json +1 -1
package/workflows/routines/feature-implementation.json +3 -3
package/workflows/routines/final-verification.json +1 -1
package/workflows/routines/hypothesis-challenge.json +1 -1
package/workflows/routines/ideation.json +1 -1
package/workflows/routines/parallel-work-partitioning.json +3 -3
package/workflows/routines/philosophy-alignment.json +2 -2
package/workflows/routines/plan-analysis.json +1 -1
package/workflows/routines/plan-generation.json +1 -1
package/workflows/routines/tension-driven-design.json +6 -6
package/workflows/scoped-documentation-workflow.json +26 -26
package/workflows/ui-ux-design-workflow.json +14 -14
package/workflows/workflow-diagnose-environment.json +1 -1
package/workflows/workflow-for-workflows.json +32 -77
package/workflows/workflow-for-workflows.v2.json +0 -788

package/docs/design/coordinator-access-audit.md ADDED Viewed

@@ -0,0 +1,293 @@
+# Coordinator Access Audit: HTTP vs. In-Process Dep Functions
+**Date:** 2026-04-19
+**Scope:** All `coordinatorDeps` functions in `src/trigger/trigger-listener.ts` (lines 410-752),
+covering `CoordinatorDeps` and `AdaptiveCoordinatorDeps` interfaces.
+**TL;DR:** Two dep functions (`awaitSessions`, `getAgentResult`) make HTTP calls to the daemon's
+own console API (port 3456) from inside the daemon process. Both have direct in-process
+replacements via `ConsoleService`. All other deps are correctly implemented.
+---
+## 1. Dep-by-Dep Audit Table
+| Dep function | Lines | What it does | Transport | In-process access exists? | Breaks when HTTP fails? | Status |
+|---|---|---|---|---|---|---|
+| `spawnSession` | 411-476 | Allocate a session, dispatch to agent loop | **In-process** (`executeStartWorkflow` + `router.dispatch`) | n/a (already in-process) | No -- no HTTP | FIXED |
+| `contextAssembler` | 478-497 | Assemble git diff + prior session notes | **CLI** (`git`, `gh` subprocesses) | No in-process alternative for `git`/`gh` | Only if `git`/`gh` binary unavailable -- acceptable | CORRECT |
+| `awaitSessions` | 499-540 | Poll for session terminal status | **HTTP** to `GET /api/v2/sessions/:id` on port 3456 | Yes -- `ConsoleService.getSessionDetail()` | **Yes** -- ECONNREFUSED if port unavailable | **BUG** |
+| `getAgentResult` | 542-609 | Fetch recap + artifacts from completed session | **HTTP** to `GET /api/v2/sessions/:id` + `/nodes/:id` on port 3456 | Yes -- `ConsoleService.getSessionDetail()` + `.getNodeDetail()` | **Yes** -- returns empty result silently | **BUG** |
+| `listOpenPRs` | 611-622 | List open PRs via `gh pr list` | **CLI** (`gh` subprocess) | No in-process alternative | Only if `gh` unavailable -- acceptable | CORRECT |
+| `mergePR` | 624-635 | Merge PR via `gh pr merge --squash` | **CLI** (`gh` subprocess) | No in-process alternative | Only if `gh` unavailable -- acceptable | CORRECT |
+| `writeFile` | 637-639 | Write a file to disk | **Filesystem** (`fs.promises.writeFile`) | n/a | Only on disk I/O error -- acceptable | CORRECT |
+| `readFile` | 641 | Read a file from disk | **Filesystem** (`fs.promises.readFile`) | n/a | Only on disk I/O error -- acceptable | CORRECT |
+| `appendFile` | 643-644 | Append content to a file | **Filesystem** (`fs.promises.appendFile`) | n/a | Only on disk I/O error -- acceptable | CORRECT |
+| `mkdir` | 646-647 | Create directory (recursive) | **Filesystem** (`fs.promises.mkdir`) | n/a | Only on disk I/O error -- acceptable | CORRECT |
+| `homedir` | 649 | Return home directory path | **OS** (`os.homedir()`) | n/a | Never -- pure OS call | CORRECT |
+| `joinPath` | 650 | Join path segments | **Pure** (`path.join`) | n/a | Never -- pure function | CORRECT |
+| `nowIso` | 651 | Return ISO timestamp | **Pure** (`new Date().toISOString()`) | n/a | Never -- pure function | CORRECT |
+| `generateId` | 652 | Generate UUID | **Pure** (`randomUUID()`) | n/a | Never -- pure function | CORRECT |
+| `stderr` | 654 | Write to stderr | **Process** (`process.stderr.write`) | n/a | Never -- process I/O | CORRECT |
+| `now` | 655 | Return current ms timestamp | **Pure** (`Date.now()`) | n/a | Never -- pure function | CORRECT |
+| `port` | 656 | Resolved console port number | Constant (3456) | Not a function -- used by CLI context | n/a | REMOVE after fix |
+| `fileExists` | 660 | Check file existence (sync) | **Filesystem** (`fs.existsSync`) | n/a | Only on unusual FS errors | CORRECT |
+| `archiveFile` | 662-663 | Move a file (archive) | **Filesystem** (`fs.promises.rename`) | n/a | Only on disk I/O error -- acceptable | CORRECT |
+| `pollForPR` | 665-692 | Poll `gh pr list` for a matching PR | **CLI** (`gh` subprocess in a loop) | No in-process alternative | Only if `gh` unavailable -- acceptable | CORRECT |
+| `postToOutbox` | 694-705 | Append entry to `~/.workrail/outbox.jsonl` | **Filesystem** (direct JSONL append) | n/a | Only on disk I/O error -- acceptable | CORRECT |
+| `pollOutboxAck` | 707-751 | Poll `~/.workrail/inbox-cursor.json` for ack | **Filesystem** (polling loop) | n/a | Only on disk I/O error -- acceptable | CORRECT |
+**Summary:** 2 bugs (`awaitSessions`, `getAgentResult`). 20 deps correct. 1 constant to remove.
+---
+## 2. Priority-Ranked Fix List
+### Priority 1 -- `awaitSessions` (all pipelines)
+**File:** `src/trigger/trigger-listener.ts:499-540`
+**Impact:** Every pipeline phase that spawns child sessions uses `awaitSessions`:
+- `full-pipeline.ts`: discovery, shaping, ux-gate, coding sessions
+- `implement.ts`: ux-gate, coding
+- `implement-shared.ts`: review, fix-agent, audit, re-review
+- `pr-review.ts`: review, fix, re-review
+**Failure modes:**
+1. Port unavailable (startup race, port conflict, test environment): `ECONNREFUSED` -- all handles returned as `outcome: 'failed'`. The coordinator interprets this as every spawned session having crashed, triggering cascading escalations.
+2. HTTP server not yet bound: sessions created in-process by `spawnSession()` may be polled before the HTTP server starts. First poll returns 404 (session not found).
+**Why high priority:** Failing `awaitSessions` terminates the entire coordinator pipeline. It is called at every await point across all pipeline modes.
+---
+### Priority 2 -- `getAgentResult` (verdict and handoff artifacts)
+**File:** `src/trigger/trigger-listener.ts:542-609`
+**Impact:** Called after every `awaitSessions` resolves with `outcome: 'success'` to extract recap markdown and artifacts. Used by:
+- Review phases: reads `wr.review_verdict` artifact for typed verdict extraction
+- Discovery phase (FULL pipeline): reads handoff artifact with shaping context
+**Failure modes:**
+1. Port unavailable: returns `{ recapMarkdown: null, artifacts: [] }` silently (no error thrown). The coordinator falls back to keyword-scan on `null` notes, producing `severity: 'unknown'` which routes to escalation.
+2. 30-second per-fetch timeout: for sessions with many nodes, the N sequential node fetches can add minutes of latency.
+3. Silent degradation: the `getAgentResult` fallback makes it look like the session succeeded but produced no usable output, causing the coordinator to escalate with opaque reasons.
+**Why priority 2 (not 1):** Only fires after sessions actually complete. A broken `awaitSessions` prevents reaching `getAgentResult` entirely.
+---
+### Priority 3 -- Remove `port: DAEMON_CONSOLE_PORT` from `coordinatorDeps`
+**File:** `src/trigger/trigger-listener.ts:656`
+The `port` field is referenced in `pr-review.ts`'s `discoverConsolePort()` helper which is only called from the CLI entry point (`src/cli-worktrain.ts`), not from the in-process coordinator. After the two bugs above are fixed, this constant and the `DAEMON_CONSOLE_PORT = 3456` declaration (line 402) and its associated comment (lines 394-401) can be removed from `trigger-listener.ts`.
+---
+## 3. Recommended In-Process Architecture
+### Root cause
+`awaitSessions` and `getAgentResult` were implemented when the coordinator ran as an out-of-process CLI command (`worktrain run pr-review`). HTTP was the correct transport there. When the coordinator was moved into the daemon (TriggerRouter in `trigger-listener.ts`), only `spawnSession` was migrated to in-process. The other two deps were deferred. The comment at line 394 acknowledges this:
+> `// WHY port=3456 (DAEMON_CONSOLE_PORT): still used by awaitSessions and getAgentResult which poll the console HTTP API`
+### Current (broken) wiring
+```
+TriggerListener (daemon process)
+  ├── coordinatorDeps (wired in startTriggerListener)
+  │   ├── spawnSession:     executeStartWorkflow + router.dispatch  [in-process, CORRECT]
+  │   ├── awaitSessions:    executeWorktrainAwaitCommand
+  │   │                     └── fetch http://127.0.0.1:3456/api/v2/sessions/:id  [HTTP-to-self, BUG]
+  │   └── getAgentResult:   globalThis.fetch
+  │                         └── http://127.0.0.1:3456/api/v2/sessions/:id/nodes/...  [HTTP-to-self, BUG]
+  └── startDaemonConsole() (separate call, after coordinatorDeps)
+      └── consoleService = new ConsoleService({ ctx.v2.sessionStore, ... })
+          ├── getSessionDetail()  [in-process, NOT wired to coordinatorDeps]
+          └── getNodeDetail()     [in-process, NOT wired to coordinatorDeps]
+```
+### Target (correct) wiring
+```
+TriggerListener (daemon process)
+  └── startTriggerListener()
+      ├── consoleService = new ConsoleService({   [constructed BEFORE coordinatorDeps]
+      │     directoryListing: ctx.v2.directoryListing,
+      │     dataDir:          ctx.v2.dataDir,
+      │     sessionStore:     ctx.v2.sessionStore,
+      │     snapshotStore:    ctx.v2.snapshotStore,
+      │     pinnedWorkflowStore: ctx.v2.pinnedStore,
+      │   })
+      └── coordinatorDeps
+          ├── awaitSessions:    in-process polling loop
+          │   └── consoleService.getSessionDetail(handle)  [no HTTP, no port]
+          │       └── runs[0].status: ConsoleRunStatus
+          └── getAgentResult:   in-process node reading
+              ├── consoleService.getSessionDetail(handle)  [no HTTP, no port]
+              │   └── runs[0]: { nodes, preferredTipNodeId }
+              └── consoleService.getNodeDetail(handle, nodeId)  [no HTTP, no port]
+                  └── { recapMarkdown, artifacts }
+```
+### `awaitSessions` in-process design
+```typescript
+awaitSessions: async (handles: readonly string[], timeoutMs: number) => {
+  const startMs = Date.now();
+  const pending = new Set(handles);
+  const results = new Map<string, SessionResult>();
+  while (pending.size > 0) {
+    const elapsed = Date.now() - startMs;
+    if (elapsed >= timeoutMs) {
+      // timeout: remaining handles marked as 'timeout'
+      break;
+    }
+    for (const handle of [...pending]) {
+      const detail = await consoleService.getSessionDetail(handle);
+      if (detail.isErr()) {
+        // 'SESSION_LOAD_FAILED' or 'NODE_NOT_FOUND': session not yet visible or corrupt
+        continue; // retry on next poll cycle
+      }
+      const run = detail.value.runs[0];
+      if (!run) continue; // session started but no run yet
+      const status: ConsoleRunStatus = run.status;
+      // Terminal: complete, complete_with_gaps (success), blocked (failed)
+      // in_progress: still running
+      // ConsoleRunStatus does not include 'dormant' -- the poll timeout covers dormant.
+      if (status === 'complete' || status === 'complete_with_gaps') {
+        results.set(handle, { handle, outcome: 'success', status, durationMs: Date.now() - startMs });
+        pending.delete(handle);
+      } else if (status === 'blocked') {
+        results.set(handle, { handle, outcome: 'failed', status, durationMs: Date.now() - startMs });
+        pending.delete(handle);
+      }
+    }
+    if (pending.size > 0) {
+      await new Promise<void>((resolve) => setTimeout(resolve, 3000));
+    }
+  }
+  // Any remaining pending handles: timeout
+  for (const handle of pending) {
+    results.set(handle, { handle, outcome: 'timeout', status: null, durationMs: timeoutMs });
+  }
+  const resultsArray = [...results.values()];
+  return {
+    results: resultsArray,
+    allSucceeded: resultsArray.every((r) => r.outcome === 'success'),
+  };
+},
+```
+Key design decisions:
+- `SESSION_LOAD_FAILED` on `getSessionDetail` is treated as "not ready yet" (retry), not as failure. This handles the case where the session event log has just been created by `spawnSession` but is not yet complete enough to project.
+- `dormant` is not a `ConsoleRunStatus` -- it is a `ConsoleSessionStatus` computed by `ConsoleService` using mtime + `DORMANCY_THRESHOLD_MS`. The poll timeout (`timeoutMs`) handles sessions that go quiet.
+- Poll interval of 3000ms matches the current `executeWorktrainAwaitCommand` default.
+### `getAgentResult` in-process design
+```typescript
+getAgentResult: async (sessionHandle: string) => {
+  const emptyResult = { recapMarkdown: null, artifacts: [] as readonly unknown[] };
+  const detailResult = await consoleService.getSessionDetail(sessionHandle);
+  if (detailResult.isErr()) return emptyResult;
+  const run = detailResult.value.runs[0];
+  if (!run) return emptyResult;
+  const allNodeIds = run.nodes.map((n) => n.nodeId);
+  const tipNodeId = run.preferredTipNodeId;
+  if (!tipNodeId) return emptyResult;
+  const nodeIdsToFetch = allNodeIds.length > 0 ? allNodeIds : [tipNodeId];
+  let recap: string | null = null;
+  const collectedArtifacts: unknown[] = [];
+  for (const nodeId of nodeIdsToFetch) {
+    const nodeResult = await consoleService.getNodeDetail(sessionHandle, nodeId);
+    if (nodeResult.isErr()) continue;
+    if (nodeId === tipNodeId) {
+      recap = nodeResult.value.recapMarkdown;
+    }
+    if (nodeResult.value.artifacts.length > 0) {
+      collectedArtifacts.push(...nodeResult.value.artifacts);
+    }
+  }
+  return { recapMarkdown: recap, artifacts: collectedArtifacts };
+},
+```
+Key design decisions:
+- Mirrors the existing HTTP logic exactly: all nodes for artifacts, tip node only for recap. The field names on `ConsoleNodeDetail` (`recapMarkdown`, `artifacts`) match what the current HTTP JSON parsing extracts.
+- `ConsoleArtifact` has `{ sha256, contentType, byteLength, content }`. The `content` field carries the artifact payload. `readVerdictArtifact()` in `pr-review.ts` receives this array and searches for `kind: 'wr.review_verdict'` -- the shape must match what `projectArtifactsV2` produces. Verify `ConsoleArtifact.content` matches the artifact schema expected by `ReviewVerdictArtifactV1Schema`.
+### `ConsoleService` injection
+`ConsoleService` is already constructed in `daemon-console.ts:123-129` from `ctx.v2` ports. The simplest approach for `trigger-listener.ts` is to construct a second instance locally -- there is no correctness issue since the summary cache is instance-scoped and mtime-invalidated. The instance is cheap to construct.
+Alternatively, `ConsoleService` could be threaded from `daemon-console.ts` through the call chain to `trigger-listener.ts`. This avoids a duplicate instance but requires a parameter change in `startTriggerListener()`. The local construction approach is recommended as the lower-risk first step.
+`ConsoleService` does NOT need to be added to `CoordinatorDeps` or `AdaptiveCoordinatorDeps` interfaces. It is an implementation detail of the specific concrete dep functions, not a contract concern.
+### What must NOT change
+The CLI `run pr-review` command (`src/cli-worktrain.ts:1265+`) runs in a **separate process** from the daemon. Its `spawnSession`, `awaitSessions`, and `getAgentResult` deps correctly use HTTP to communicate with a running daemon. These are not bugs and must not be changed.
+---
+## 4. `ConsoleService` Testability Audit
+**Question:** Is `ConsoleService` properly abstracted for injection, or is it constructed inline making testing hard?
+**Finding: Properly abstracted.**
+`ConsoleService` takes `ConsoleServicePorts` in its constructor:
+```typescript
+// src/v2/usecases/console-service.ts:314-315
+export class ConsoleService {
+  constructor(private readonly ports: ConsoleServicePorts) {}
+```
+The `ConsoleServicePorts` interface:
+```typescript
+export interface ConsoleServicePorts {
+  readonly directoryListing: DirectoryListingPortV2;
+  readonly dataDir: DataDirPortV2;
+  readonly sessionStore: SessionEventLogReadonlyStorePortV2;
+  readonly snapshotStore: SnapshotStorePortV2;
+  readonly pinnedWorkflowStore: PinnedWorkflowStorePortV2;
+  readonly daemonRegistry?: DaemonRegistry; // optional
+}
+```
+All ports are pure interfaces -- entirely fakeable in tests. No construction-inline issues. The same pattern as `WorktrainSpawnCommandDeps` / `WorktrainAwaitCommandDeps`.
+**For testing the in-process coordinator deps:** A test can construct `ConsoleService` with a fake `sessionStore` that returns pre-seeded session events at specific statuses. This is far simpler than the current situation where tests must either start a real HTTP server or mock `globalThis.fetch`.
+---
+## 5. Evidence of Known Tech Debt
+The source code explicitly acknowledges both bugs. `src/trigger/trigger-listener.ts:394-401`:
+```typescript
+// WHY port=3456 (DAEMON_CONSOLE_PORT): still used by awaitSessions and getAgentResult
+// which poll the console HTTP API. This constant is kept for those paths.
+```
+`spawnSession` was already migrated (see lines 418-476 and its `WHY in-process (not HTTP)` comment). The comment at line 394 confirms that `awaitSessions` and `getAgentResult` are the remaining two deferred items from that migration.

package/docs/design/coordinator-architecture-audit.md ADDED Viewed

@@ -0,0 +1,62 @@
+# Coordinator Architecture Audit
+**Status:** In progress
+**Date:** 2026-04-19
+**Scope:** `src/coordinators/`, `src/trigger/trigger-listener.ts`, `src/daemon/` (NOT `src/mcp/`)
+---
+## Context / Ask
+**Stated goal (solution-shaped):** Produce a dep-by-dep analysis of `CoordinatorDeps` and `AdaptiveCoordinatorDeps`, identify indirect access anti-patterns, find missing abstractions, and produce a priority-ranked fix list.
+**Reframed problem:** WorkTrain coordinators cross the HTTP/shell boundary for data already available in-process via `ctx` (V2ToolContext), and `CoordinatorDeps` lacks the sub-interface structure needed to make these boundaries independently testable. The real risk: if the HTTP console is slow or unavailable, coordinator sessions degrade silently with misleading error messages.
+**Anti-goals:**
+- Do not audit `src/mcp/` (out of scope)
+- Do not propose rewrites of coordinator logic -- only interface/wiring changes
+- Do not change public CLI commands (`worktrain await` is correct for external callers)
+**Primary uncertainty:** How many indirect-access sites exist beyond the two known ones (`awaitSessions`, `getAgentResult`)?
+**Known approaches:**
+- Replace HTTP polling in `awaitSessions` with in-process session store reads + DaemonRegistry
+- Replace HTTP calls in `getAgentResult` with `projectNodeOutputsV2` projection on the session store
+- Introduce a `SessionStatusPort` interface so coordinators can inject a fake for testing
+**Path recommendation:** `landscape_first` -- the landscape (dep-by-dep analysis of what each function actually does) is the dominant need. The solution direction is already known; what's missing is the complete catalog of all anti-patterns and the exact recommended interface designs.
+---
+## Artifact Strategy
+This document is the human-readable output of the audit. It is NOT execution truth for the workflow -- notes and context variables in the WorkRail session are the durable record. If the session is rewound, this file may be stale; regenerate from notes.
+**Capabilities available:**
+- Delegation: YES (mcp__nested-subagent__Task available)
+- Web browsing: Not needed (codebase-only audit)
+- File reads: YES (main agent reads source files directly)
+---
+## Landscape Packet
+*(Populated during research phase)*
+---
+## Problem Frame Packet
+*(Populated during analysis phase)*
+---
+## Candidate Directions
+*(Populated during design phase)*
+---
+## Final Summary
+*(Populated when audit is complete)*

package/docs/design/coordinator-error-handling-audit.md ADDED Viewed

@@ -0,0 +1,240 @@
+# Coordinator Error Handling Audit
+**Date:** 2026-04-19
+**Scope:** `src/coordinators/` coordinator layer
+**Audited files:**
+1. `src/coordinators/adaptive-pipeline.ts` - `PipelineOutcome` type and main entry point
+2. `src/coordinators/modes/full-pipeline.ts` - FULL pipeline phase failure handling
+3. `src/coordinators/modes/implement.ts` - IMPLEMENT mode executor
+4. `src/coordinators/modes/implement-shared.ts` - shared review + verdict cycle
+5. `src/coordinators/pr-review.ts` - reference coordinator
+6. `src/runtime/result.ts` - the `Result<T,E>` type (baseline reference)
+**Excluded:** `src/mcp/`, `src/v2/durable-core/`
+---
+## Executive Summary
+The coordinator layer is broadly sound. The "errors are data" principle is followed: no bare `throw` statements exist in any coordinator file, all session spawn failures return `PipelineOutcome { kind: 'escalated' }`, and `Result<T,string>` is used correctly for the `spawnSession` / `mergePR` boundaries. `WorkflowRunResult` is guarded with `assertNever` in `trigger-router.ts`. The `PipelineOutcome` discriminated union is well-formed.
+One genuine runtime bug and two design inconsistencies are present. The bug (missing null-guard) is the only finding that can cause incorrect behavior at runtime.
+---
+## Findings
+### Finding F1 - Critical
+**Title:** Missing zombie-detection null-guard on `uxHandle` in `implement.ts`
+**File:line:** `src/coordinators/modes/implement.ts:144-145`
+**Description:**
+```typescript
+const uxHandle = uxSpawnResult.value;
+const uxAwait = await deps.awaitSessions([uxHandle], REVIEW_TIMEOUT_MS);
+```
+`uxHandle` is passed directly to `deps.awaitSessions` without checking for an empty/null value. Every other session handle in the coordinator layer has an explicit zombie-detection guard before being passed to `awaitSessions`:
+- `implement.ts:189-195` (`codingHandle`)
+- `full-pipeline.ts:222-228` (`discoveryHandle`)
+- `full-pipeline.ts:300-305` (`shapingHandle`)
+- `full-pipeline.ts:348-354` (`uxHandle` - the equivalent check in FULL mode, which this file is missing)
+- `full-pipeline.ts:428-433` (`codingHandle`)
+- `implement-shared.ts:68-74` (`reviewHandle`)
+- `implement-shared.ts:148-153` (`fixHandle`)
+- `implement-shared.ts:233-243` (`auditHandle`)
+- `implement-shared.ts:286-296` (`reReviewHandle`)
+**Risk:** If `deps.spawnSession` returns `ok('')` (empty string), an empty handle is passed to `awaitSessions`. Downstream behavior depends on the `awaitSessions` implementation, but at minimum it could return a result for a non-existent session, causing the UX gate to be treated as successfully completed when it was not. The coding session would then spawn without UX review having occurred.
+**Recommended fix:**
+```typescript
+const uxHandle = uxSpawnResult.value;
+if (!uxHandle) {
+  return {
+    kind: 'escalated',
+    escalationReason: { phase: 'ux-gate', reason: 'UX design session returned empty handle' },
+  };
+}
+const uxAwait = await deps.awaitSessions([uxHandle], REVIEW_TIMEOUT_MS);
+```
+This matches the pattern used at `implement.ts:189-195` and `full-pipeline.ts:348-354`.
+---
+### Finding F2 - Major
+**Title:** Non-exhaustive `switch` on `ReviewSeverity` in `implement-shared.ts` - missing `assertNever`
+**File:line:** `src/coordinators/modes/implement-shared.ts:110-175`
+**Description:**
+```typescript
+switch (findings.severity) {
+  case 'clean':
+    return { kind: 'merged', prUrl };
+  case 'minor': {
+    // ... fix loop ...
+  }
+  case 'blocking':
+  case 'unknown': {
+    return runAuditChain(...);
+  }
+  // no default
+}
+```
+`ReviewSeverity = 'clean' | 'minor' | 'blocking' | 'unknown'` is a 4-variant union. The switch covers all four, but there is no `default: assertNever(findings.severity)`. TypeScript does not produce a compile error for switches missing a `default` branch - it only enforces exhaustiveness when the fallthrough type is narrowed to `never`.
+**Risk:** If `ReviewSeverity` is widened with a new variant (e.g. `'critical'`), the `switch` will compile successfully and the new variant will fall through without being routed. Depending on TypeScript's control-flow analysis, this may return `undefined` (typed as `PipelineOutcome`) and corrupt the coordinator's return value silently.
+**Note:** The same switch pattern exists in `runFixAgentLoop` in `pr-review.ts` at several points, but `pr-review.ts` uses `PrOutcome` (not `PipelineOutcome`) and the consequences are less severe.
+**Recommended fix:**
+```typescript
+switch (findings.severity) {
+  case 'clean':
+    ...
+  case 'minor': {
+    ...
+  }
+  case 'blocking':
+  case 'unknown': {
+    return runAuditChain(...);
+  }
+  default:
+    return assertNever(findings.severity);
+}
+```
+Import `assertNever` from `../../runtime/assert-never.js`. The same fix applies to the corresponding switch at `implement-shared.ts:334` (`if (reFindings.severity === 'clean' || reFindings.severity === 'minor')`) - that one uses `if/else if` rather than `switch`, and the `else` arm covers the remaining cases. Consider converting to a `switch` with `assertNever` for consistency.
+---
+### Finding F3 - Major
+**Title:** `process.stderr.write` bypasses injected `deps.stderr` in `pr-review.ts`
+**File:line:** `src/coordinators/pr-review.ts:445-449` inside `readVerdictArtifact()`
+**Description:**
+```typescript
+process.stderr.write(
+  `[WARN coord:reason=artifact_parse_failed handle=${handlePrefix}] readVerdictArtifact: wr.review_verdict schema validation failed: ${issues}\n`,
+);
+```
+All other log calls in `pr-review.ts` use the injected `deps.stderr()` function or the local `log()` closure. This single call uses the Node.js global `process.stderr.write` directly, bypassing the injected dependency contract.
+**Risk:** Test fakes that inject a no-op or recording `stderr` will miss this warning silently. When a schema-invalid verdict artifact is emitted by an agent, the warning will appear in process stderr even in test environments that redirect all deps-based logging.
+**Recommended fix:**
+`readVerdictArtifact` currently accepts no `deps` parameter. Two options:
+**Option A** (preferred - consistent with `full-pipeline.ts:readDiscoveryHandoffArtifact`): Add an optional `stderrFn` parameter:
+```typescript
+export function readVerdictArtifact(
+  artifacts: readonly unknown[],
+  sessionHandle?: string,
+  stderrFn?: (line: string) => void,
+): ReviewFindings | null {
+  // ...
+  (stderrFn ?? ((s) => process.stderr.write(s + '\n')))(
+    `[WARN coord:reason=artifact_parse_failed handle=${handlePrefix}] ...`
+  );
+}
+```
+**Option B** (minimal): Accept the hardcoded `process.stderr.write` as intentional for this utility function and document it explicitly. This is the lower-effort option but does not fix the test isolation gap.
+---
+### Finding F4 - Minor
+**Title:** `checkSpawnCutoff()` returns `PipelineOutcome | null` rather than an `Option` type
+**File:line:** `src/coordinators/adaptive-pipeline.ts:245-260`
+**Description:**
+```typescript
+export function checkSpawnCutoff(
+  coordinatorStartMs: number,
+  now: number,
+  phase: string,
+): PipelineOutcome | null {
+  if (now - coordinatorStartMs > COORDINATOR_SPAWN_CUTOFF_MS) {
+    return { kind: 'escalated', ... };
+  }
+  return null;
+}
+```
+`null` is used as a sentinel for "safe to spawn." Callers check `if (cutoffCheck) return cutoffCheck;`. This diverges slightly from the "errors are data" principle (null as absence) but is scoped to one helper with a clear usage pattern.
+**Risk:** None at runtime. Callers immediately check the return. The naming (`checkSpawnCutoff`) clearly implies a nullable return. This is a stylistic inconsistency, not a functional issue.
+**Recommended fix (optional):** If the project ever adopts an `Option<T>` type, this is a good candidate for `Option<PipelineOutcome>`. As-is, the current implementation is clear and the null is well-understood at each call site.
+---
+### Finding F5 - Minor
+**Title:** `dispatchAdaptivePipeline` returns `{ kind: 'escalated' }` for dedup-skip, which is semantically impure
+**File:line:** `src/trigger/trigger-router.ts:1003-1030`
+**Description:**
+When a duplicate adaptive dispatch is detected within the 30-second dedup window, `dispatchAdaptivePipeline` returns:
+```typescript
+return {
+  kind: 'escalated',
+  escalationReason: { phase: 'dispatch', reason: 'duplicate ...' },
+};
+```
+The code comment at line 1003-1007 explicitly acknowledges this: *"WHY 'escalated' as the return kind: PipelineOutcome has no 'skipped' variant."*
+**Risk:** None at runtime. The calling path (GitHub queue poller) is fire-and-forget and does not branch on `outcome.kind`. This is acknowledged technical debt.
+**Recommended fix (optional):** Add `{ readonly kind: 'skipped'; readonly reason: string }` to `PipelineOutcome` in `adaptive-pipeline.ts`. Update `dispatchAdaptivePipeline` to use it. Update any caller that switches on `outcome.kind` to add a `case 'skipped':` arm (with `assertNever` in the default).
+---
+## Non-Issues
+The following patterns were audited and are **not** problems:
+- **No bare `throw` in coordinator files.** Archive failures in `implement.ts` and `full-pipeline.ts` `finally` blocks catch exceptions and log them without re-throwing - correct.
+- **`PipelineOutcome.escalationReason.reason` is `string`.** String is appropriate at the coordinator boundary (external-facing). Internal domain types use discriminated unions; coordinator escalation reasons are human-readable strings for operator consumption.
+- **`WorkflowRunResult` -> `PipelineOutcome` mapping.** These types operate at different architectural layers and are never mapped to each other. `assertNever` is correctly present in `trigger-router.ts` for `WorkflowRunResult` switches (lines 811-814 and 918-921).
+- **`outcome !== 'success'` string comparisons.** These compare against session result outcomes from the daemon layer - the correct approach at a typed interface boundary.
+- **`readDiscoveryHandoffArtifact` returning `null`.** The null is not leaked outside the function boundary; the function is a local helper returning an optional domain object.
+- **`parseFindingsFromNotes` catching JSON parse errors silently.** The `try/catch` in the JSON block scanner (`pr-review.ts:322-337`) is correct - JSON.parse throws for malformed input and the catch correctly advances to the next block.
+---
+## Severity Summary
+| ID | File | Severity | Title |
+|----|------|----------|-------|
+| F1 | `src/coordinators/modes/implement.ts:144` | **Critical** | Missing null-guard on `uxHandle` (zombie detection) |
+| F2 | `src/coordinators/modes/implement-shared.ts:110` | **Major** | Non-exhaustive `ReviewSeverity` switch - no `assertNever` |
+| F3 | `src/coordinators/pr-review.ts:445` | **Major** | `process.stderr.write` bypasses injected `deps.stderr` |
+| F4 | `src/coordinators/adaptive-pipeline.ts:248` | Minor | `checkSpawnCutoff` returns `null` as sentinel |
+| F5 | `src/trigger/trigger-router.ts:1026` | Minor | `escalated` used for dedup-skip (acknowledged in comments) |