npm - @exaudeus/workrail - Versions diffs - 3.28.0 → 3.30.0 - Mend

@exaudeus/workrail 3.28.0 → 3.30.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (160) hide show

package/dist/console/assets/{index-C146q2kN.js → index-Bl5-Ghuu.js} +1 -1
package/dist/console/index.html +1 -1
package/dist/manifest.json +3 -3
package/docs/README.md +57 -0
package/docs/adrs/001-hybrid-storage-backend.md +38 -0
package/docs/adrs/002-four-layer-context-classification.md +38 -0
package/docs/adrs/003-checkpoint-trigger-strategy.md +35 -0
package/docs/adrs/004-opt-in-encryption-strategy.md +36 -0
package/docs/adrs/005-agent-first-workflow-execution-tokens.md +105 -0
package/docs/adrs/006-append-only-session-run-event-log.md +76 -0
package/docs/adrs/007-resume-and-checkpoint-only-sessions.md +51 -0
package/docs/adrs/008-blocked-nodes-architectural-upgrade.md +178 -0
package/docs/adrs/009-bridge-mode-single-instance-mcp.md +195 -0
package/docs/adrs/010-release-pipeline.md +89 -0
package/docs/architecture/README.md +7 -0
package/docs/architecture/refactor-audit.md +364 -0
package/docs/authoring-v2.md +527 -0
package/docs/authoring.md +873 -0
package/docs/changelog-recent.md +201 -0
package/docs/configuration.md +505 -0
package/docs/ctc-mcp-proposal.md +518 -0
package/docs/design/README.md +22 -0
package/docs/design/agent-cascade-protocol.md +96 -0
package/docs/design/autonomous-console-design-candidates.md +253 -0
package/docs/design/autonomous-console-design-review.md +111 -0
package/docs/design/autonomous-platform-mvp-discovery.md +525 -0
package/docs/design/claude-code-source-deep-dive.md +713 -0
package/docs/design/console-cyberpunk-ui-discovery.md +504 -0
package/docs/design/console-execution-trace-candidates-final.md +160 -0
package/docs/design/console-execution-trace-candidates.md +211 -0
package/docs/design/console-execution-trace-design-candidates-v2.md +113 -0
package/docs/design/console-execution-trace-design-review.md +74 -0
package/docs/design/console-execution-trace-discovery.md +394 -0
package/docs/design/console-execution-trace-final-review.md +77 -0
package/docs/design/console-execution-trace-review.md +92 -0
package/docs/design/console-performance-discovery.md +415 -0
package/docs/design/console-ui-backlog.md +280 -0
package/docs/design/daemon-architecture-discovery.md +853 -0
package/docs/design/daemon-design-candidates.md +318 -0
package/docs/design/daemon-design-review-findings.md +119 -0
package/docs/design/daemon-engine-design-candidates.md +210 -0
package/docs/design/daemon-engine-design-review.md +131 -0
package/docs/design/daemon-execution-engine-discovery.md +280 -0
package/docs/design/daemon-gap-analysis.md +554 -0
package/docs/design/daemon-owns-console-plan.md +168 -0
package/docs/design/daemon-owns-console-review.md +91 -0
package/docs/design/daemon-owns-console.md +195 -0
package/docs/design/data-model-erd.md +11 -0
package/docs/design/design-candidates-consolidate-dev-staleness.md +98 -0
package/docs/design/design-candidates-walk-cache-depth-limit.md +80 -0
package/docs/design/design-review-consolidate-dev-staleness.md +54 -0
package/docs/design/design-review-walk-cache-depth-limit.md +48 -0
package/docs/design/implementation-plan-consolidate-dev-staleness.md +142 -0
package/docs/design/implementation-plan-walk-cache-depth-limit.md +141 -0
package/docs/design/layer3b-ghost-nodes-design-candidates.md +229 -0
package/docs/design/layer3b-ghost-nodes-design-review.md +93 -0
package/docs/design/layer3b-ghost-nodes-implementation-plan.md +219 -0
package/docs/design/list-workflows-latency-fix-plan.md +128 -0
package/docs/design/list-workflows-latency-fix-review.md +55 -0
package/docs/design/list-workflows-latency-fix.md +109 -0
package/docs/design/native-context-management-api.md +11 -0
package/docs/design/performance-sweep-2026-04.md +96 -0
package/docs/design/routines-guide.md +219 -0
package/docs/design/sequence-diagrams.md +11 -0
package/docs/design/subagent-design-principles.md +220 -0
package/docs/design/temporal-patterns-design-candidates.md +312 -0
package/docs/design/temporal-patterns-design-review-findings.md +163 -0
package/docs/design/test-isolation-from-config-file.md +335 -0
package/docs/design/v2-core-design-locks.md +2746 -0
package/docs/design/v2-lock-registry.json +734 -0
package/docs/design/workflow-authoring-v2.md +1044 -0
package/docs/design/workflow-docs-spec.md +218 -0
package/docs/design/workflow-extension-points.md +687 -0
package/docs/design/workrail-auto-trigger-system.md +359 -0
package/docs/design/workrail-config-file-discovery.md +513 -0
package/docs/docker.md +110 -0
package/docs/generated/v2-lock-closure-plan.md +26 -0
package/docs/generated/v2-lock-coverage.json +797 -0
package/docs/generated/v2-lock-coverage.md +177 -0
package/docs/ideas/backlog.md +3927 -0
package/docs/ideas/design-candidates-mcp-resilience.md +208 -0
package/docs/ideas/design-review-findings-mcp-resilience.md +119 -0
package/docs/ideas/implementation_plan.md +249 -0
package/docs/ideas/third-party-workflow-setup-design-thinking.md +1948 -0
package/docs/implementation/02-architecture.md +316 -0
package/docs/implementation/04-testing-strategy.md +124 -0
package/docs/implementation/09-simple-workflow-guide.md +835 -0
package/docs/implementation/13-advanced-validation-guide.md +874 -0
package/docs/implementation/README.md +21 -0
package/docs/integrations/claude-code.md +300 -0
package/docs/integrations/firebender.md +315 -0
package/docs/migration/v0.1.0.md +147 -0
package/docs/naming-conventions.md +45 -0
package/docs/planning/README.md +104 -0
package/docs/planning/github-ticketing-playbook.md +195 -0
package/docs/plans/README.md +24 -0
package/docs/plans/agent-managed-ticketing-design.md +605 -0
package/docs/plans/agentic-orchestration-roadmap.md +112 -0
package/docs/plans/assessment-gates-engine-handoff.md +536 -0
package/docs/plans/content-coherence-and-references.md +151 -0
package/docs/plans/library-extraction-plan.md +340 -0
package/docs/plans/mr-review-workflow-redesign.md +1451 -0
package/docs/plans/native-context-management-epic.md +11 -0
package/docs/plans/perf-fixes-design-candidates.md +225 -0
package/docs/plans/perf-fixes-design-review-findings.md +61 -0
package/docs/plans/perf-fixes-new-issues-candidates.md +264 -0
package/docs/plans/perf-fixes-new-issues-review.md +110 -0
package/docs/plans/prompt-fragments.md +53 -0
package/docs/plans/ui-ux-workflow-design-candidates.md +120 -0
package/docs/plans/ui-ux-workflow-discovery.md +100 -0
package/docs/plans/ui-ux-workflow-review.md +48 -0
package/docs/plans/v2-followup-enhancements.md +587 -0
package/docs/plans/workflow-categories-candidates.md +105 -0
package/docs/plans/workflow-categories-discovery.md +110 -0
package/docs/plans/workflow-categories-review.md +51 -0
package/docs/plans/workflow-discovery-model-candidates.md +94 -0
package/docs/plans/workflow-discovery-model-discovery.md +74 -0
package/docs/plans/workflow-discovery-model-review.md +48 -0
package/docs/plans/workflow-source-setup-phase-1.md +245 -0
package/docs/plans/workflow-source-setup-phase-2.md +361 -0
package/docs/plans/workflow-staleness-detection-candidates.md +104 -0
package/docs/plans/workflow-staleness-detection-review.md +58 -0
package/docs/plans/workflow-staleness-detection.md +80 -0
package/docs/plans/workflow-v2-design.md +69 -0
package/docs/plans/workflow-v2-roadmap.md +74 -0
package/docs/plans/workflow-validation-design.md +98 -0
package/docs/plans/workflow-validation-roadmap.md +108 -0
package/docs/plans/workrail-platform-vision.md +420 -0
package/docs/reference/agent-context-cleaner-snippet.md +94 -0
package/docs/reference/agent-context-guidance.md +140 -0
package/docs/reference/context-optimization.md +284 -0
package/docs/reference/example-workflow-repository-template/.github/workflows/validate.yml +125 -0
package/docs/reference/example-workflow-repository-template/README.md +268 -0
package/docs/reference/example-workflow-repository-template/workflows/example-workflow.json +80 -0
package/docs/reference/external-workflow-repositories.md +916 -0
package/docs/reference/feature-flags-architecture.md +472 -0
package/docs/reference/feature-flags.md +349 -0
package/docs/reference/god-tier-workflow-validation.md +272 -0
package/docs/reference/loop-optimization.md +209 -0
package/docs/reference/loop-validation.md +176 -0
package/docs/reference/loops.md +465 -0
package/docs/reference/mcp-platform-constraints.md +59 -0
package/docs/reference/recovery.md +88 -0
package/docs/reference/releases.md +177 -0
package/docs/reference/troubleshooting.md +105 -0
package/docs/reference/workflow-execution-contract.md +998 -0
package/docs/roadmap/README.md +22 -0
package/docs/roadmap/legacy-planning-status.md +103 -0
package/docs/roadmap/now-next-later.md +70 -0
package/docs/roadmap/open-work-inventory.md +389 -0
package/docs/tickets/README.md +39 -0
package/docs/tickets/next-up.md +76 -0
package/docs/workflow-management.md +317 -0
package/docs/workflow-templates.md +423 -0
package/docs/workflow-validation.md +184 -0
package/docs/workflows.md +254 -0
package/package.json +4 -1
package/spec/authoring-spec.json +61 -16
package/workflows/workflow-for-workflows.json +3 -3
package/workflows/workflow-for-workflows.v2.json +3 -3

package/docs/plans/v2-followup-enhancements.md ADDED Viewed

@@ -0,0 +1,587 @@
+# WorkRail v2 Follow-up Enhancements
+> **Active follow-up initiative**
+>
+> This file is still useful for the detailed open v2 follow-up work, but it is no longer the canonical high-level entrypoint.
+>
+> Prefer:
+> - `docs/plans/workflow-v2-roadmap.md`
+> - `docs/plans/workflow-v2-design.md`
+**Status**: In Progress — detailed follow-up initiative after core v2 delivery
+**Date**: 2026-02-17
+**Updated**: 2026-02-18
+**Context**: Post-v2 core completion. All functional slices shipped, 2628 tests passing. This doc captures enhancement opportunities discovered during manual testing and production usage.
+---
+## Priority 1: MCP Roots Protocol Integration (Critical Bug Fix)
+### Problem
+`resume_session` fails to find sessions across Firebender workspaces because WorkRail detects git context from the MCP server process's CWD (`process.cwd()`), not the client's workspace.
+**Scenario**:
+- E1: Agent creates session in Firebender workspace A (zillow repo)
+- Server: Detects git context from server CWD (workrail repo)
+- Session observations: `git_branch: "main"`, `git_head_sha: "b419857..."` (workrail's main)
+- E2: Agent searches for session in Firebender workspace A (zillow repo)
+- `resume_session`: Filters by git context → no match (workrail main ≠ zillow branch)
+- Result: Session not found despite being in the same client workspace
+**Impact**: Cross-chat resumption is broken for multi-workspace users.
+---
+### Solution: Use MCP Roots Protocol
+MCP provides `notifications/roots/list_changed` to notify servers when client workspace changes. Firebender sends this on workspace switch.
+**Architecture**:
+```
+Client → notifications/roots/list_changed → Server stores latest roots
+Server → start_workflow → resolves git from roots[0].uri (client workspace)
+Server → resume_session → matches sessions by stored git observations
+```
+**Key invariant**: Workspace anchor is resolved **per-request** from current client roots, not once at server startup.
+---
+### Implementation Plan
+#### 1. Immutable roots state manager
+**File**: `src/mcp/workspace-roots-manager.ts`
+Split read and write capabilities at the type level so handler code can only read — no
+mutation surface leaks into consumers via `V2Dependencies`.
+```typescript
+/** Read-only view — passed into V2Dependencies. */
+export interface RootsReader {
+  getCurrentRootUris(): readonly string[];
+}
+/** Write capability — only the MCP notification handler holds this. */
+export interface RootsWriter {
+  updateRootUris(uris: readonly string[]): void;
+}
+export class WorkspaceRootsManager implements RootsReader, RootsWriter {
+  private rootUris: readonly string[] = Object.freeze([]);
+  updateRootUris(uris: readonly string[]): void {
+    this.rootUris = Object.freeze([...uris]);
+  }
+  getCurrentRootUris(): readonly string[] {
+    return this.rootUris;
+  }
+}
+```
+**Philosophy alignment**:
+- Mutable cell is minimal, confined behind an explicit `RootsWriter` interface
+- Handlers receive `RootsReader` — cannot call `updateRootUris`
+- Single-writer (MCP notification handler on Node.js event loop)
+---
+#### 2. Add roots notification handler
+**File**: `src/mcp/server.ts`
+Two important protocol details:
+1. `notifications/roots/list_changed` is a **signal only** — it carries no roots payload. After
+   receiving it, the server must call `server.listRoots()` (which sends a `roots/list` request
+   to the client) to get the updated list.
+2. Initial roots must be fetched **after** `server.connect(transport)`. Some clients don't support
+   `roots/list`; wrap in try/catch and degrade gracefully to CWD fallback.
+```typescript
+const rootsManager = new WorkspaceRootsManager();
+// rootsWriter stays local — never passed to handlers
+const rootsWriter: RootsWriter = rootsManager;
+// Register before connect. Notification is signal-only; re-fetch via listRoots().
+server.setNotificationHandler(RootsListChangedNotificationSchema, async () => {
+  try {
+    const result = await server.listRoots();
+    rootsWriter.updateRootUris(result.roots.map((r) => r.uri));
+    console.error(`[Roots] Updated: ${result.roots.map((r) => r.uri).join(', ') || '(none)'}`);
+  } catch {
+    console.error('[Roots] Failed to fetch updated roots after change notification');
+  }
+});
+// After server.connect(transport): fetch initial roots.
+// Graceful: clients that don't support roots/list will throw; fall back to CWD.
+try {
+  const result = await server.listRoots();
+  rootsWriter.updateRootUris(result.roots.map((r) => r.uri));
+  console.error(`[Roots] Initial: ${result.roots.map((r) => r.uri).join(', ') || '(none)'}`);
+} catch {
+  console.error('[Roots] Client does not support roots/list; CWD fallback active');
+}
+```
+---
+#### 3. Make workspace anchor resolver per-request
+**File**: `src/v2/infra/local/workspace-anchor/index.ts`
+**Before**:
+```typescript
+export class LocalWorkspaceAnchorV2 implements WorkspaceAnchorPortV2 {
+  constructor(private readonly cwd: string) {}
+  resolveAnchors(): RA<readonly WorkspaceAnchor[], WorkspaceAnchorError> {
+    // uses this.cwd (singleton)
+  }
+}
+```
+**After**:
+```typescript
+export interface WorkspaceContextResolverPortV2 {
+  resolveFromUri(rootUri: string): ResultAsync<readonly WorkspaceAnchor[], WorkspaceAnchorError>;
+  resolveFromCwd(): ResultAsync<readonly WorkspaceAnchor[], WorkspaceAnchorError>;
+}
+export class LocalWorkspaceAnchorV2 implements WorkspaceContextResolverPortV2 {
+  resolveFromUri(rootUri: string): RA<readonly WorkspaceAnchor[], WorkspaceAnchorError> {
+    const fsPath = this.uriToPath(rootUri);
+    if (!fsPath) return okAsync([]); // Not file:// URI, graceful empty
+    return this.resolveFromPath(fsPath);
+  }
+  resolveFromCwd(): RA<readonly WorkspaceAnchor[], WorkspaceAnchorError> {
+    return this.resolveFromPath(process.cwd());
+  }
+  private resolveFromPath(cwd: string): RA<readonly WorkspaceAnchor[], WorkspaceAnchorError> {
+    // run git commands in specified cwd (existing logic)
+  }
+  private uriToPath(uri: string): string | null {
+    if (!uri.startsWith('file://')) return null;
+    // Use fileURLToPath (node:url) — handles Windows drive letters and percent-encoding correctly.
+    // decodeURIComponent(slice(7)) is wrong on Windows: file:///C:/foo → /C:/foo (leading slash).
+    try { return fileURLToPath(uri); } catch { return null; }
+  }
+}
+```
+**Philosophy**: Pure functions, no constructor state, explicit about file:// URIs only.
+---
+#### 4. Update V2Dependencies
+**File**: `src/mcp/types.ts`
+```typescript
+export interface V2Dependencies {
+  readonly gate: ExecutionSessionGateV2;
+  readonly sessionStore: ...;
+  // Remove: readonly workspaceAnchor?: WorkspaceAnchorPortV2;
+  // Add:
+  readonly workspaceResolver?: WorkspaceContextResolverPortV2;
+  // Per-request snapshot of client root URIs, injected at the CallTool boundary.
+  // Optional: absent when client doesn't support roots/list (degrades to CWD).
+  readonly resolvedRootUris?: readonly string[];
+}
+```
+At the `CallToolRequestSchema` handler, snapshot roots once and spread into `V2Dependencies`:
+```typescript
+const requestCtx: ToolContext = ctx.v2
+  ? { ...ctx, v2: { ...ctx.v2, resolvedRootUris: rootsManager.getCurrentRootUris() } }
+  : ctx;
+return handler(args ?? {}, requestCtx);
+```
+**Why `resolvedRootUris` as a value, not a `getCurrentRoots` thunk**: a function that reads
+ambient state at call-time is not deterministic from the handler's perspective — the roots could
+change between calls. Snapshotting at the request boundary gives handlers an immutable value for
+their entire duration, consistent with the determinism-over-cleverness principle.
+---
+#### 5. Update start_workflow to use primary root
+**File**: `src/mcp/handlers/v2-execution/start.ts`, around line 331
+**Before**:
+```typescript
+const workspaceAnchor = ctx.v2?.workspaceAnchor;
+const anchorsRA = workspaceAnchor
+  ? workspaceAnchor.resolveAnchors()
+  : okAsync([]);
+```
+**After**:
+```typescript
+const workspaceResolver = ctx.v2.workspaceResolver;
+const primaryRootUri = ctx.v2.resolvedRootUris?.[0]; // snapshotted at CallTool boundary
+const anchorsRA = workspaceResolver
+  ? (primaryRootUri
+      ? workspaceResolver.resolveFromUri(primaryRootUri)
+      : workspaceResolver.resolveFromCwd()
+    ).orElse(() => okAsync([]))
+  : okAsync([]);
+```
+**Why**: Uses client's workspace URI if available (snapshotted at request boundary — deterministic
+for this call), falls back to server CWD for clients that don't support roots/list.
+---
+#### 6. Tests (pending)
+**Unit tests** (`tests/unit/v2/workspace-roots-manager.test.ts`):
+- `updateRootUris` stores immutable copy; subsequent mutations don't affect the returned slice
+- `getCurrentRootUris` returns frozen array
+- `RootsWriter` interface is separate from `RootsReader` — consumers cannot call `updateRootUris`
+**Unit tests** (`tests/unit/v2/workspace-anchor-resolver.test.ts`):
+- `resolveFromUri` with valid `file://` URI
+- `resolveFromUri` with non-`file://` URI (e.g., `http://`, `vscode-vfs://`) → returns empty (graceful)
+- `resolveFromUri` with malformed URI → returns empty (graceful)
+- `resolveFromCwd` uses the adapter's default CWD
+- Windows path handling: `file:///C:/foo` → `C:\foo` (via `fileURLToPath`)
+**Integration test** (`tests/integration/v2/resume-session-workspace-filtering.test.ts`):
+- Create session in workspace A (mock `resolvedRootUris` pointing at a temp git repo on branch `feat-a`)
+- Create session in workspace B (mock pointing at a different temp git repo)
+- `resume_session` from workspace A → finds only workspace A session via git branch/SHA match
+- `resume_session` with no roots → finds both via recency fallback
+---
+### Status
+**Complete** (2026-02-18, tests tightened 2026-03-26 in #147). Full implementation shipped including integration and unit tests covering all workspace resolution variants and edge cases (26+ tests).
+---
+## Priority 2: MCP Progress Notifications for Workflow Execution
+### Problem
+Long workflows (10+ steps, loops, subagents) take minutes to complete. Agents have no visibility into progress — they call `continue_workflow` and wait.
+### Solution: Send `notifications/progress`
+When a `continue_workflow` advance completes, send a progress notification to the client:
+```json
+{
+  "method": "notifications/progress",
+  "params": {
+    "progressToken": "...", // from original request._meta.progressToken
+    "progress": 3,
+    "total": 10,
+    "message": "Completed step 3/10: Hypothesis Development"
+  }
+}
+```
+**Agent UX**: The client UI shows "WorkRail: Step 3/10 (Hypothesis Development)" while the tool call is in-flight.
+---
+### Implementation
+**File**: `src/mcp/handlers/v2-execution/advance.ts`
+**After** successful append, before returning:
+```typescript
+// Send progress notification if client requested it.
+// progressToken must be threaded from CallToolRequestSchema handler
+// through ToolContext (or via a server reference passed to V2Dependencies).
+if (progressToken) {
+  const dag = projectRunDagV2(truthAfter.events);
+  if (dag.isOk()) {
+    const run = dag.value.runsById[runId];
+    // Count only 'step' nodes — not 'blocked_attempt' or 'checkpoint' nodes.
+    // Post-ADR 008, nodesById includes blocked_attempt nodes; counting all of them
+    // would inflate 'total' and make progress percentages wrong.
+    const stepNodes = Object.values(run?.nodesById ?? {}).filter(n => n.nodeKind === 'step');
+    const totalSteps = stepNodes.length;
+    const completedSteps = stepNodes.filter(n => n.isComplete).length;
+    // Correct SDK API is sendNotification, not notification.
+    await server.sendNotification({
+      method: 'notifications/progress',
+      params: {
+        progressToken,
+        progress: completedSteps,
+        total: totalSteps,
+        message: `Completed step ${completedSteps}/${totalSteps}: ${currentStep.title}`,
+      },
+    });
+  }
+}
+```
+**Three implementation details to resolve before building**:
+1. **`progressToken` plumbing**: `request._meta?.progressToken` is available in the raw
+   `CallToolRequestSchema` handler, not in `executeAdvance`. Thread it through `ToolContext`
+   (or a dedicated `RequestMeta` field) before calling the advance logic.
+2. **`server` reference**: `advance.ts` has no access to the MCP `Server` instance today.
+   Pass it via `V2Dependencies` or a `NotificationSender` port (interface segregation — expose
+   only `sendNotification`, not the full server).
+3. **`completedSteps` count**: See inline note above — filter by `nodeKind === 'step'` to
+   exclude `blocked_attempt` and `checkpoint` nodes, which are in the same DAG post-ADR 008.
+**Philosophy**:
+- Pure projection (DAG -> progress count)
+- Side effect at edge (notification send)
+- Opt-in (only if client provides progressToken)
+---
+### Open Question
+Should progress be:
+- **Step-granular** (1 notification per step) — simple, but may spam for 50-step workflows
+- **Percentage-based** (notify on 10%, 20%, ..., 100%) — fewer notifications, but requires more logic
+- **Time-based** (notify every 5 seconds) — smooth UX, but requires background timers
+**Recommendation**: Start with step-granular (simplest, matches the execution model). Add throttling later if needed.
+---
+## Priority 3: Session State Change Notifications (Console/Dashboard Integration)
+### Problem
+When Console/Dashboard UI exists, users may have multiple views open:
+- Session list showing all sessions
+- Session detail showing a specific session's DAG
+- Workflow execution view
+When an agent advances a workflow, these views become stale. Currently they'd need manual refresh or polling.
+### Solution: `notifications/resources/updated`
+After durable events are written, notify clients watching that session:
+```json
+{
+  "method": "notifications/resources/updated",
+  "params": {
+    "uri": "workrail://session/sess_abc123",
+    "changes": {
+      "lastEventIndex": 42,
+      "preferredTipNodeId": "node_xyz",
+      "isComplete": false
+    }
+  }
+}
+```
+**Console benefit**: Auto-refresh session views when new events are written.
+---
+### Implementation
+**Requires**:
+1. Resource URI schema for sessions (`workrail://session/{sessionId}`)
+2. Subscription tracking (which clients are watching which sessions)
+3. Notification dispatch after `sessionStore.append()`
+**Defer until**: Console UI exists (YAGNI — no UI to refresh yet)
+---
+## Priority 4: Logging Notifications (Server Diagnostics)
+### Problem
+When session health issues occur (lock contention, corruption detected, validation errors), agents see tool errors but operators have no server-side visibility.
+### Solution: `notifications/logging/message`
+Structured server logs sent to clients:
+```json
+{
+  "method": "notifications/logging/message",
+  "params": {
+    "level": "warning",
+    "logger": "workrail.session.gate",
+    "data": "Session lock held for >5s — another process may be stuck"
+  }
+}
+```
+**Operator benefit**: Real-time server diagnostics visible in Firebender console.
+**When to send**:
+- Lock timeout warnings (held >5s)
+- Session corruption detected
+- Keyring initialization failures
+- Feature flag changes
+**Philosophy**: Errors as data, observability at edges
+---
+## Priority 5: Dynamic Tool List Updates
+### Problem
+Feature flags control which tools are available. Changing a flag requires agent reconnect to see new tools. (Note: `WORKRAIL_ENABLE_V2_TOOLS` has been removed -- v2 is default-on. This priority applies to any future feature flags.)
+### Solution: `notifications/tools/list_changed`
+When feature flags change:
+```json
+{
+  "method": "notifications/tools/list_changed",
+  "params": {}
+}
+```
+Client re-fetches tool list via `tools/list`.
+**Complexity**: Requires runtime feature flag mutation (currently environment variables, immutable after boot).
+**Defer until**: Feature flags become mutable via Console UI.
+---
+## Priority 6: Async Workflow Execution via MCP Tasks
+### Problem
+Long workflows block the agent's tool call. A 50-step workflow might take 10+ minutes, during which the agent is waiting on a single `continue_workflow` call.
+### Solution: MCP Tasks for async workflows
+**Flow**:
+1. Agent: `start_workflow` with `task: { ttl: 600000, pollInterval: 5000 }`
+2. Server: Returns `taskId`, begins async execution
+3. Client: Polls `tasks/get` every 5s to check status
+4. Server: Sends `notifications/tasks/status` when steps complete
+5. Agent: Sees progress, continues other work
+6. Server: Workflow completes, task result available
+7. Agent: `tasks/get` returns final result
+**Benefits**:
+- Agent can do other work while workflow runs
+- Progress via notifications instead of blocking
+- Timeout-friendly (long workflows don't need infinite tool call timeout)
+**Complexity**:
+- Requires background workflow executor thread
+- Task result storage + TTL management
+- Cancellation support
+**Defer until**: Workflows routinely take >60s (YAGNI for current 2-10 step workflows).
+---
+## Summary Table
+| Enhancement | Priority | Status | Blocks | Philosophy Aligned |
+|-------------|----------|--------|--------|-------------------|
+| MCP Roots Protocol | P1 (bug fix) | Complete (2026-02-18, #75/#78/#147) | Cross-workspace resume | Yes -- pure functions, immutable |
+| Progress Notifications | P2 | Planned (3 open design issues) | Agent UX for long workflows | Yes -- side effects at edges |
+| Resource Update Notifications | P3 | Deferred (no UI) | Console auto-refresh | Yes -- event-driven |
+| Logging Notifications | P4 | Deferred | Operator visibility | Yes -- errors as data |
+| Tool List Change Notifications | P5 | Deferred | Runtime flag changes | Partial -- requires mutable flags |
+| Async Workflows via Tasks | P6 | Deferred (YAGNI) | 10min+ workflows | Partial -- requires background threads |
+---
+## Related Work from Earlier Session
+From the "unfleshed v2 ideas" inventory:
+### Already Addressed This Session
+- **MCP Roots Protocol** -- Per-request workspace anchor resolution; `RootsReader`/`RootsWriter` capability split; `fileURLToPath` URI handling; `resolvedRootUris` snapshot at CallTool boundary (2026-02-18)
+- **Workflow migration** -- All while-loops migrated to `wr.contracts.loop_control` (PR #69)
+- **ADR 008 completion** -- Terminal block path + projection query (this session)
+- **Deprecated path removal** -- `advance_recorded.outcome.kind='blocked'` removed from builder (this session, PR #70)
+- **SessionManager Result refactoring** -- All methods return `Result`, no throws (this session, PR #70)
+- **V2ToolContext + requireV2 guard** -- Eliminated `ctx.v2!` assertions (this session, PR #70)
+- **Branded contractRef** -- `ArtifactContractRef` type instead of `string` (this session, PR #70)
+- **Compiler contract validation** -- Compile-time check for unknown contract refs (this session, PR #70)
+- **Manual test plan** -- 23 scenarios for slices 4b, 4c, ADR 008, loop artifacts (this session)
+- **Optimistic pre-lock dedup** -- Checkpoint replay skips gate (this session, PR #73)
+### Still Open
+1. ~~**Unflag v2 tools**~~ (done -- v2 is default-on, feature flag gate removed)
+2. **Console/Dashboard UI** — Zero UI exists, substrate complete
+3. **Agent Cascade Protocol** — Cross-IDE delegation model, design complete
+4. **Enforceable verification contracts** — `verify` block is instructional-only
+5. **Parallel forEach execution** — Concurrent loop iterations
+6. **Subagent composition** — Chained outputs (researcher → challenger → analyzer)
+7. **Evidence validation contracts** — Replace prose `validationCriteria` with structured artifacts
+---
+## Decision: What to Do Next
+### Done: MCP Roots Protocol
+Implemented 2026-02-18. Per-request workspace anchor resolution, correct `listRoots()` flow,
+`fileURLToPath` URI handling, `resolvedRootUris` snapshot at CallTool boundary.
+### Next: Complete manual test plan validation
+Run the E1+E2 cross-workspace resume scenarios to verify the roots fix works end-to-end.
+Manual test plans from the v2 phases have been archived; verification is now covered by the
+26+ automated tests added with the roots fix.
+### Done: Unflag v2 tools (Production Readiness)
+V2 is default-on. Feature flag gate removed.
+### Later: Progress Notifications (UX Improvement)
+- **Impact**: Better agent feedback for long workflows
+- **Effort**: Moderate — three design issues must be resolved first (see P2 above):
+  1. `progressToken` threading through `ToolContext`
+  2. `NotificationSender` port to give advance handler access to `sendNotification`
+  3. Node counting: filter `step` nodes only, exclude `blocked_attempt` + `checkpoint`
+- **Risk**: Low (opt-in via progressToken)
+### Recommended Sequence
+1. ~~**MCP Roots**~~ (done)
+2. **Complete manual test plan validation** (run all 23 scenarios with roots fix)
+3. ~~**Unflag v2 tools**~~ (done)
+4. **Resolve P2 design issues** then implement progress notifications
+---
+## Open Questions
+1. **Should resume_session support multi-root matching?** Current plan uses only `roots[0]`. If a client has 3 workspace roots, should sessions from any of them be eligible?
+   - **Recommendation**: No (YAGNI). Use primary root only.
+2. **What if client sends roots but they're all non-file:// URIs?** (e.g., `vscode-vfs://github/...`)
+   - **Recommendation**: Graceful fallback to server CWD with a warning log.
+3. **Should workspace anchor resolution be cached per root URI?** Git commands are expensive (fork + exec).
+   - **Recommendation**: No for v1. Add caching later if profiling shows it's a bottleneck.
+---
+## References
+- MCP Roots Spec: `https://modelcontextprotocol.io/specification/draft/client/roots`
+- MCP SDK Types: `@modelcontextprotocol/sdk/types` (v1.24.0)
+- Design Locks: `docs/design/v2-core-design-locks.md` §15 (single-writer), §1.3 (rehydrate separation)

package/docs/plans/workflow-categories-candidates.md ADDED Viewed

@@ -0,0 +1,105 @@
+# Workflow Categories Design Candidates
+## Problem Understanding
+**Core tensions:**
+1. Hash stability: category metadata cannot go in workflow JSON without breaking workflowHash when recategorized
+2. Default behavior: making summary the default changes the implicit contract of list_workflows (agents expecting full list must adapt)
+3. Overlay freshness: a separate categories file can drift from the actual workflow registry
+**Likely seam**: `handleV2ListWorkflows` in `src/mcp/handlers/v2-workflow.ts` + `V2ListWorkflowsInput` in `src/mcp/v2/tools.ts` + `V2WorkflowListOutputSchema` in `src/mcp/output-schemas.ts`
+**What makes it hard**: The overlay must be authoritative metadata about the registry without being part of the compilation pipeline. Junior devs would put it in workflow JSON or infer it dynamically — both approaches break for different reasons.
+## Philosophy Constraints
+- **Determinism**: category assignment must be explicit, not inferred
+- **Make illegal states unrepresentable**: uncategorized workflows should be a validation warning, not silent
+- **YAGNI**: don't add compiler complexity for C when A solves it more simply
+- **Explicit domain types**: category should be a typed enum, not a free string
+## Impact Surface
+- `spec/workflow-categories.json` (new file)
+- `V2ListWorkflowsInput` (new optional `category` field)
+- `V2WorkflowListOutputSchema` (new optional `categorySummary` field)
+- `handleV2ListWorkflows` (response branching logic)
+- `validate-workflows-registry.ts` (new uncategorized workflow warning)
+- `workflow-for-workflows.v2.json` Phase 7 (should stamp category when authoring)
+## Candidates
+### A: Spec overlay file + `category` filter param ✓ RECOMMENDED
+**Summary**: `spec/workflow-categories.json` maps workflow IDs to domain categories. `list_workflows` without `category` returns compact `categorySummary`. With `category`, returns full filtered list.
+- **Tensions resolved**: hash stability, backwards compatibility, token reduction
+- **Tensions accepted**: overlay can drift (mitigated by validate:registry check)
+- **Boundary**: spec/ directory + V2ListWorkflowsInput + output schema + handler
+- **Failure mode**: new workflow added but not categorized — shows as uncategorized, validator warns
+- **Repo pattern**: adapts `includeSources` pattern directly
+- **Gains**: clean separation, CI-checkable, zero workflow file changes
+- **Losses**: extra file to maintain
+- **Scope**: best-fit
+- **Philosophy**: honors determinism, make-illegal-states-unrepresentable
+### B: Naming convention inference (no overlay)
+**Summary**: Infer category from workflow ID prefix at runtime. `routine-*` → routines, `test-*` → testing, everything else guessed from description keywords.
+- **Failure mode**: ~70% of workflows mis-categorized (only routine-* and test-* have reliable prefixes)
+- **Repo pattern**: departs
+- **Scope**: too narrow — doesn't work for most of the catalog
+- **Philosophy**: conflicts with determinism
+### C: `category` field in workflow JSON with hash isolation
+**Summary**: Add `category` to workflow JSON but strip it from the compiled snapshot before hashing.
+- **Failure mode**: compiler regression accidentally includes `category` in hash, silently invalidating sessions
+- **Repo pattern**: departs — no existing field excluded from compilation this way
+- **Scope**: too broad — adds significant compiler complexity
+- **Philosophy**: violates YAGNI
+## Comparison and Recommendation
+**A wins on every axis**: hash stability, backwards compatibility, clean boundary, CI-checkable, follows includeSources pattern, minimal code change.
+B covers ~30% of workflows. C adds compiler complexity for a problem A already solves.
+**Implementation shape for A:**
+1. `spec/workflow-categories.json` — `{ categories: [...], workflows: { workflowId: { category, hidden? } } }`
+2. `V2ListWorkflowsInput`: add `category?: string`
+3. `V2WorkflowListOutputSchema`: add `categorySummary?: { category, displayName, count, representatives }[]`
+4. `handleV2ListWorkflows`: when no `category`, return `categorySummary`; when `category` present, return filtered full list
+5. `validate:registry`: warn on uncategorized non-hidden workflows
+6. Token budget: summary ~500 tokens; per-category full list ~800 tokens for 3-5 workflows
+**Natural taxonomy (10 categories):**
+| Category | Count | Examples |
+|---|---|---|
+| coding | 3 | coding-task, cross-platform-code-conversion |
+| review_audit | 3 | mr-review, production-readiness-audit, architecture-scalability-audit |
+| investigation | 2 | bug-investigation, workflow-diagnose |
+| design | 2 | ui-ux-design, wr.discovery |
+| documentation | 3 | document-creation, scoped-documentation, documentation-update |
+| tickets | 4 | adaptive-ticket-creation, ticket-grooming, intelligent-test-case-generation |
+| learning | 4 | personal-learning-*, presentation-creation, relocation |
+| routines | ~10 | all routine-* |
+| authoring | 1-2 | workflow-for-workflows |
+| testing | 3 | test-* (hidden from default summary) |
+## Self-Critique
+**Strongest counter-argument**: two-file maintenance burden (workflow JSON + overlay). Mitigated by: validate:registry warning on uncategorized workflows makes omission loud; workflow-for-workflows can be updated to prompt for category at authoring time.
+**Pivot condition**: if teams want per-workspace custom categories, A needs extension (workspace-level categories.json overlay). Defer to v2.
+## Open Questions for Main Agent
+1. Should `testing` workflows be `hidden: true` (excluded from summary) or shown in their own testing category?
+2. Should routines be surfaced in summary mode at all, or hidden by default (they're internal, not user-invoked)?
+3. Should the `categorySummary` include a short description per category (e.g., "Review code changes, audit systems") or just names + counts?
+4. What's the right `displayName` for `review_audit`? "Review & Audit"?
+5. Should `workflow-for-workflows` Phase 7 be updated to stamp the category, or is that a separate ticket?