npm - opencode-swarm-plugin - Versions diffs - 0.44.0 → 0.44.2 - Mend

opencode-swarm-plugin 0.44.0 → 0.44.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (215)

package/bin/swarm.serve.test.ts +6 -4
package/bin/swarm.ts +18 -12
package/dist/compaction-prompt-scoring.js +139 -0
package/dist/eval-capture.js +12811 -0
package/dist/hive.d.ts.map +1 -1
package/dist/hive.js +14834 -0
package/dist/index.d.ts +18 -0
package/dist/index.d.ts.map +1 -1
package/dist/index.js +7743 -62593
package/dist/plugin.js +24052 -78907
package/dist/swarm-orchestrate.d.ts.map +1 -1
package/dist/swarm-prompts.d.ts.map +1 -1
package/dist/swarm-prompts.js +39407 -0
package/dist/swarm-review.d.ts.map +1 -1
package/dist/swarm-validation.d.ts +127 -0
package/dist/swarm-validation.d.ts.map +1 -0
package/dist/validators/index.d.ts +7 -0
package/dist/validators/index.d.ts.map +1 -0
package/dist/validators/schema-validator.d.ts +58 -0
package/dist/validators/schema-validator.d.ts.map +1 -0
package/package.json +17 -5
package/.changeset/swarm-insights-data-layer.md +0 -63
package/.hive/analysis/eval-failure-analysis-2025-12-25.md +0 -331
package/.hive/analysis/session-data-quality-audit.md +0 -320
package/.hive/eval-results.json +0 -483
package/.hive/issues.jsonl +0 -138
package/.hive/memories.jsonl +0 -729
package/.opencode/eval-history.jsonl +0 -327
package/.turbo/turbo-build.log +0 -9
package/CHANGELOG.md +0 -2286
package/SCORER-ANALYSIS.md +0 -598
package/docs/analysis/subagent-coordination-patterns.md +0 -902
package/docs/analysis-socratic-planner-pattern.md +0 -504
package/docs/planning/ADR-001-monorepo-structure.md +0 -171
package/docs/planning/ADR-002-package-extraction.md +0 -393
package/docs/planning/ADR-003-performance-improvements.md +0 -451
package/docs/planning/ADR-004-message-queue-features.md +0 -187
package/docs/planning/ADR-005-devtools-observability.md +0 -202
package/docs/planning/ADR-007-swarm-enhancements-worktree-review.md +0 -168
package/docs/planning/ADR-008-worker-handoff-protocol.md +0 -293
package/docs/planning/ADR-009-oh-my-opencode-patterns.md +0 -353
package/docs/planning/ADR-010-cass-inhousing.md +0 -1215
package/docs/planning/ROADMAP.md +0 -368
package/docs/semantic-memory-cli-syntax.md +0 -123
package/docs/swarm-mail-architecture.md +0 -1147
package/docs/testing/context-recovery-test.md +0 -470
package/evals/ARCHITECTURE.md +0 -1189
package/evals/README.md +0 -768
package/evals/compaction-prompt.eval.ts +0 -149
package/evals/compaction-resumption.eval.ts +0 -289
package/evals/coordinator-behavior.eval.ts +0 -307
package/evals/coordinator-session.eval.ts +0 -154
package/evals/evalite.config.ts.bak +0 -15
package/evals/example.eval.ts +0 -31
package/evals/fixtures/cass-baseline.ts +0 -217
package/evals/fixtures/compaction-cases.ts +0 -350
package/evals/fixtures/compaction-prompt-cases.ts +0 -311
package/evals/fixtures/coordinator-sessions.ts +0 -328
package/evals/fixtures/decomposition-cases.ts +0 -105
package/evals/lib/compaction-loader.test.ts +0 -248
package/evals/lib/compaction-loader.ts +0 -320
package/evals/lib/data-loader.evalite-test.ts +0 -289
package/evals/lib/data-loader.test.ts +0 -345
package/evals/lib/data-loader.ts +0 -281
package/evals/lib/llm.ts +0 -115
package/evals/scorers/compaction-prompt-scorers.ts +0 -145
package/evals/scorers/compaction-scorers.ts +0 -305
package/evals/scorers/coordinator-discipline.evalite-test.ts +0 -539
package/evals/scorers/coordinator-discipline.ts +0 -325
package/evals/scorers/index.test.ts +0 -146
package/evals/scorers/index.ts +0 -328
package/evals/scorers/outcome-scorers.evalite-test.ts +0 -27
package/evals/scorers/outcome-scorers.ts +0 -349
package/evals/swarm-decomposition.eval.ts +0 -121
package/examples/commands/swarm.md +0 -745
package/examples/plugin-wrapper-template.ts +0 -2515
package/examples/skills/hive-workflow/SKILL.md +0 -212
package/examples/skills/skill-creator/SKILL.md +0 -223
package/examples/skills/swarm-coordination/SKILL.md +0 -292
package/global-skills/cli-builder/SKILL.md +0 -344
package/global-skills/cli-builder/references/advanced-patterns.md +0 -244
package/global-skills/learning-systems/SKILL.md +0 -644
package/global-skills/skill-creator/LICENSE.txt +0 -202
package/global-skills/skill-creator/SKILL.md +0 -352
package/global-skills/skill-creator/references/output-patterns.md +0 -82
package/global-skills/skill-creator/references/workflows.md +0 -28
package/global-skills/swarm-coordination/SKILL.md +0 -995
package/global-skills/swarm-coordination/references/coordinator-patterns.md +0 -235
package/global-skills/swarm-coordination/references/strategies.md +0 -138
package/global-skills/system-design/SKILL.md +0 -213
package/global-skills/testing-patterns/SKILL.md +0 -430
package/global-skills/testing-patterns/references/dependency-breaking-catalog.md +0 -586
package/opencode-swarm-plugin-0.30.7.tgz +0 -0
package/opencode-swarm-plugin-0.31.0.tgz +0 -0
package/scripts/cleanup-test-memories.ts +0 -346
package/scripts/init-skill.ts +0 -222
package/scripts/migrate-unknown-sessions.ts +0 -349
package/scripts/validate-skill.ts +0 -204
package/src/agent-mail.ts +0 -1724
package/src/anti-patterns.test.ts +0 -1167
package/src/anti-patterns.ts +0 -448
package/src/compaction-capture.integration.test.ts +0 -257
package/src/compaction-hook.test.ts +0 -838
package/src/compaction-hook.ts +0 -1204
package/src/compaction-observability.integration.test.ts +0 -139
package/src/compaction-observability.test.ts +0 -187
package/src/compaction-observability.ts +0 -324
package/src/compaction-prompt-scorers.test.ts +0 -475
package/src/compaction-prompt-scoring.ts +0 -300
package/src/contributor-tools.test.ts +0 -133
package/src/contributor-tools.ts +0 -201
package/src/dashboard.test.ts +0 -611
package/src/dashboard.ts +0 -462
package/src/error-enrichment.test.ts +0 -403
package/src/error-enrichment.ts +0 -219
package/src/eval-capture.test.ts +0 -1015
package/src/eval-capture.ts +0 -929
package/src/eval-gates.test.ts +0 -306
package/src/eval-gates.ts +0 -218
package/src/eval-history.test.ts +0 -508
package/src/eval-history.ts +0 -214
package/src/eval-learning.test.ts +0 -378
package/src/eval-learning.ts +0 -360
package/src/eval-runner.test.ts +0 -223
package/src/eval-runner.ts +0 -402
package/src/export-tools.test.ts +0 -476
package/src/export-tools.ts +0 -257
package/src/hive.integration.test.ts +0 -2241
package/src/hive.ts +0 -1628
package/src/index.ts +0 -940
package/src/learning.integration.test.ts +0 -1815
package/src/learning.ts +0 -1079
package/src/logger.test.ts +0 -189
package/src/logger.ts +0 -135
package/src/mandate-promotion.test.ts +0 -473
package/src/mandate-promotion.ts +0 -239
package/src/mandate-storage.integration.test.ts +0 -601
package/src/mandate-storage.test.ts +0 -578
package/src/mandate-storage.ts +0 -794
package/src/mandates.ts +0 -540
package/src/memory-tools.test.ts +0 -195
package/src/memory-tools.ts +0 -344
package/src/memory.integration.test.ts +0 -334
package/src/memory.test.ts +0 -158
package/src/memory.ts +0 -527
package/src/model-selection.test.ts +0 -188
package/src/model-selection.ts +0 -68
package/src/observability-tools.test.ts +0 -359
package/src/observability-tools.ts +0 -871
package/src/output-guardrails.test.ts +0 -438
package/src/output-guardrails.ts +0 -381
package/src/pattern-maturity.test.ts +0 -1160
package/src/pattern-maturity.ts +0 -525
package/src/planning-guardrails.test.ts +0 -491
package/src/planning-guardrails.ts +0 -438
package/src/plugin.ts +0 -23
package/src/post-compaction-tracker.test.ts +0 -251
package/src/post-compaction-tracker.ts +0 -237
package/src/query-tools.test.ts +0 -636
package/src/query-tools.ts +0 -324
package/src/rate-limiter.integration.test.ts +0 -466
package/src/rate-limiter.ts +0 -774
package/src/replay-tools.test.ts +0 -496
package/src/replay-tools.ts +0 -240
package/src/repo-crawl.integration.test.ts +0 -441
package/src/repo-crawl.ts +0 -610
package/src/schemas/cell-events.test.ts +0 -347
package/src/schemas/cell-events.ts +0 -807
package/src/schemas/cell.ts +0 -257
package/src/schemas/evaluation.ts +0 -166
package/src/schemas/index.test.ts +0 -199
package/src/schemas/index.ts +0 -286
package/src/schemas/mandate.ts +0 -232
package/src/schemas/swarm-context.ts +0 -115
package/src/schemas/task.ts +0 -161
package/src/schemas/worker-handoff.test.ts +0 -302
package/src/schemas/worker-handoff.ts +0 -131
package/src/sessions/agent-discovery.test.ts +0 -137
package/src/sessions/agent-discovery.ts +0 -112
package/src/sessions/index.ts +0 -15
package/src/skills.integration.test.ts +0 -1192
package/src/skills.test.ts +0 -643
package/src/skills.ts +0 -1549
package/src/storage.integration.test.ts +0 -341
package/src/storage.ts +0 -884
package/src/structured.integration.test.ts +0 -817
package/src/structured.test.ts +0 -1046
package/src/structured.ts +0 -762
package/src/swarm-decompose.test.ts +0 -188
package/src/swarm-decompose.ts +0 -1302
package/src/swarm-deferred.integration.test.ts +0 -157
package/src/swarm-deferred.test.ts +0 -38
package/src/swarm-insights.test.ts +0 -214
package/src/swarm-insights.ts +0 -459
package/src/swarm-mail.integration.test.ts +0 -970
package/src/swarm-mail.ts +0 -739
package/src/swarm-orchestrate.integration.test.ts +0 -282
package/src/swarm-orchestrate.test.ts +0 -548
package/src/swarm-orchestrate.ts +0 -3084
package/src/swarm-prompts.test.ts +0 -1270
package/src/swarm-prompts.ts +0 -2077
package/src/swarm-research.integration.test.ts +0 -701
package/src/swarm-research.test.ts +0 -698
package/src/swarm-research.ts +0 -472
package/src/swarm-review.integration.test.ts +0 -285
package/src/swarm-review.test.ts +0 -879
package/src/swarm-review.ts +0 -709
package/src/swarm-strategies.ts +0 -407
package/src/swarm-worktree.test.ts +0 -501
package/src/swarm-worktree.ts +0 -575
package/src/swarm.integration.test.ts +0 -2377
package/src/swarm.ts +0 -38
package/src/tool-adapter.integration.test.ts +0 -1221
package/src/tool-availability.ts +0 -461
package/tsconfig.json +0 -28

package/docs/planning/ADR-005-devtools-observability.md DELETED

@@ -1,202 +0,0 @@
-# ADR-005: DevTools + Observability
-## Status
-Proposed
-## Context
-Swarm Mail currently has no visibility:
-- No UI to inspect events, messages, locks
-- No metrics on latency, queue depth, throughput
-- No distributed tracing across agents
-- Hard to debug coordination issues
-Need both developer tools (UI + CLI) and production observability (metrics + tracing).
-## Decision
-Build layered observability:
-### 1. DevTools UI (SvelteKit)
-**Stack:**
-- SvelteKit for SSR + static export
-- Vite for dev server + build
-- Server-Sent Events (SSE) for real-time updates
-- Embeddable static build
-**Features:**
-- Event stream viewer (filterable, searchable)
-- Message inbox/outbox per agent
-- File reservation timeline
-- Saga instance tracker (future)
-**Build:**
-```bash
-cd apps/devtools
-bun run build  # Static export to apps/devtools/build
-```
-**Embed in plugin:**
-```typescript
-// Serve static UI at /_swarm/devtools
-const server = serve({
-  port: 4000,
-  fetch: (req) => {
-    if (new URL(req.url).pathname.startsWith("/_swarm/devtools")) {
-      return serveStatic("apps/devtools/build");
-    }
-  },
-});
-```
-### 2. CLI (@effect/cli)
-**Commands:**
-```bash
-swarm events [--project <key>] [--type <type>] [--tail]
-swarm messages [--agent <name>] [--unread]
-swarm locks [--agent <name>]
-swarm replay --from <sequence> [--to <sequence>]
-swarm metrics
-```
-**Implementation:**
-```typescript
-import { Command } from "@effect/cli";
-const eventsCommand = Command.make(
-  "events",
-  {
-    project: Options.string("project").optional,
-    type: Options.string("type").optional,
-    tail: Options.boolean("tail"),
-  },
-  ({ project, type, tail }) => {
-    // Query events table, optionally --tail with live query
-  },
-);
-```
-### 3. Metrics (Prometheus)
-**Histograms:**
-- `swarm_message_latency_seconds` - Send to receive time
-- `swarm_lock_contention_seconds` - Time waiting for lock
-- `swarm_queue_depth` - Unread messages per agent
-**Counters:**
-- `swarm_events_total{type}` - Events by type
-- `swarm_messages_sent_total{sender, recipient}`
-- `swarm_locks_acquired_total{agent}`
-**Example:**
-```typescript
-import { Registry, Histogram } from 'prom-client'
-const messageLatency = new Histogram({
-  name: 'swarm_message_latency_seconds',
-  help: 'Message delivery latency',
-  buckets: [0.01, 0.05, 0.1, 0.5, 1.0, 5.0]
-})
-// Record latency
-const start = Date.now()
-await sendMessage(msg)
-const latency = (Date.now() - start) / 1000
-messageLatency.observe(latency)
-```
-### 4. Distributed Tracing (OpenTelemetry)
-**Integration:**
-```typescript
-import { NodeSdk } from '@effect/opentelemetry'
-import { NodeTracerProvider } from '@opentelemetry/sdk-trace-node'
-const provider = new NodeTracerProvider()
-const tracer = provider.getTracer('swarm-mail')
-// Trace message send
-const span = tracer.startSpan('sendMessage', {
-  attributes: {
-    'swarm.sender': 'AgentA',
-    'swarm.recipient': 'AgentB',
-    'swarm.thread_id': 'bd-123'
-  }
-})
-await sendMessage(msg)
-span.end()
-```
-**Trace Propagation:**
-- Add trace_id to message metadata
-- Worker agents continue traces from parents
-- Visualize full swarm execution flow
-## Consequences
-### Easier
-- **Visibility** - See all events, messages, locks in real-time
-- **Debugging** - Trace issues across agents via distributed tracing
-- **Performance** - Identify slow operations via histograms
-- **Operations** - CLI for prod debugging without UI
-### More Difficult
-- **Maintenance** - Another app to maintain (DevTools UI)
-- **Bundle size** - Metrics/tracing deps increase plugin size
-- **Performance overhead** - Instrumentation adds latency
-- **Configuration** - Metrics exporters, trace backends
-## Implementation Notes
-### Phase 1: CLI (Week 1)
-- Add @effect/cli dependency
-- Implement events, messages, locks commands
-- Test with real swarm sessions
-### Phase 2: DevTools UI (Week 2-3)
-- Scaffold SvelteKit app
-- Build event stream viewer
-- Add SSE endpoint for real-time updates
-- Static export + embed in plugin
-### Phase 3: Metrics (Week 4)
-- Add prom-client dependency
-- Instrument send/receive latency
-- Add queue depth gauge
-- Expose /metrics endpoint
-### Phase 4: Tracing (Week 5)
-- Add @effect/opentelemetry
-- Instrument message send/receive
-- Propagate trace context
-- Test with Jaeger/Zipkin
-### Success Criteria
-- [ ] CLI can tail events in real-time
-- [ ] DevTools UI shows live message stream
-- [ ] Metrics exposed at /metrics endpoint
-- [ ] Traces visible in Jaeger UI
-- [ ] Documentation for all observability tools
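
Note on the Prometheus section of the deleted ADR-005: its example records latency only when `sendMessage` succeeds, and it does not show how the listed bucket boundaries are counted. The sketch below illustrates both points under stated assumptions. `BucketHistogram` and `timed` are hypothetical names written for this note; they are not the prom-client API and not code from the package. The only things taken from the ADR are the bucket boundaries and the milliseconds-to-seconds conversion.

```typescript
// Hypothetical stand-in for a Prometheus-style histogram (NOT prom-client).
class BucketHistogram {
  private readonly upperBounds: number[];
  private readonly counts: number[];
  private total = 0;

  constructor(upperBounds: number[]) {
    // Keep bounds sorted so the snapshot reads from smallest to largest.
    this.upperBounds = [...upperBounds].sort((a, b) => a - b);
    this.counts = this.upperBounds.map(() => 0);
  }

  observe(seconds: number): void {
    this.total += 1;
    // Buckets are cumulative: a value counts toward every bucket
    // whose upper bound is greater than or equal to it.
    this.upperBounds.forEach((bound, i) => {
      if (seconds <= bound) this.counts[i] += 1;
    });
  }

  snapshot(): { le: string; count: number }[] {
    const rows = this.upperBounds.map((bound, i) => ({
      le: String(bound),
      count: this.counts[i],
    }));
    // The implicit +Inf bucket always equals the number of observations.
    rows.push({ le: "+Inf", count: this.total });
    return rows;
  }
}

// Times an async operation and records the duration even if it throws.
// `now` is injectable so the wrapper can be tested with a fake clock.
async function timed<T>(
  histogram: BucketHistogram,
  fn: () => Promise<T>,
  now: () => number = Date.now,
): Promise<T> {
  const start = now();
  try {
    return await fn();
  } finally {
    histogram.observe((now() - start) / 1000);
  }
}

// Bucket boundaries copied from the ADR example.
const messageLatencySketch = new BucketHistogram([0.01, 0.05, 0.1, 0.5, 1.0, 5.0]);
messageLatencySketch.observe(0.03);
messageLatencySketch.observe(0.2);
messageLatencySketch.observe(7);
```

With real instrumentation the same `try`/`finally` shape would wrap the actual send call, so that failed or slow sends still show up in the latency distribution.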

package/docs/planning/ADR-007-swarm-enhancements-worktree-review.md DELETED

@@ -1,168 +0,0 @@
-# ADR-007: Swarm Enhancements - Worktree Isolation + Structured Review
-## Status
-Proposed
-## Context
-After reviewing [nexxeln/opencode-config](https://github.com/nexxeln/opencode-config), we identified several patterns that would strengthen our swarm coordination:
-1. **Git worktree isolation** - Each worker gets a complete isolated copy of the repo
-2. **Structured review loop** - Workers must pass review before completion
-3. **Retry options on abort** - Clean recovery paths when things go wrong
-Currently our swarm uses:
-- **File reservations** via Swarm Mail for conflict prevention
-- **UBS scan** on completion for bug detection
-- **Manual cleanup** on abort
-## Decision
-### 1. Optional Worktree Isolation Mode
-Add `isolation` parameter to swarm initialization:
-```typescript
-swarm_init({
-  task: "Large refactor across 50 files",
-  isolation: "worktree"  // or "reservation" (default)
-})
-```
-**When to use worktrees:**
-- Large refactors touching many files
-- High risk of merge conflicts
-- Need complete isolation (different node_modules, etc.)
-**When to use reservations (default):**
-- Most swarm tasks
-- Quick parallel work
-- Lower overhead
-**Worktree lifecycle:**
-```
-swarm_worktree_create(task_id) → /path/to/worktree
-  ↓
-worker does work in worktree
-  ↓
-swarm_worktree_merge(task_id)  → cherry-pick commit to main
-  ↓
-swarm_worktree_cleanup(task_id) → remove worktree
-```
-**On abort:** Hard reset main to start commit, delete all worktrees.
-### 2. Structured Review Step
-The coordinator reviews worker output before marking complete. This replaces the current "trust but verify with UBS" approach.
-**Review flow:**
-```
-worker completes → coordinator reviews → approved/needs_changes
-                                              ↓
-                                    if needs_changes: worker fixes (max 3 attempts)
-                                              ↓
-                                    if approved: mark complete
-```
-**Review prompt includes:**
-- Epic goal (the big picture)
-- Task requirements
-- What completed tasks this builds on (dependency context)
-- What future tasks depend on this (downstream context)
-- The actual code changes
-**Why coordinator reviews (not separate reviewer agent):**
-- Coordinator already has full epic context loaded
-- Avoids spawning another agent just for review
-- Keeps the feedback loop tight
-- Coordinator can make judgment calls about "good enough"
-**Review criteria:**
-1. Does it fulfill the task requirements?
-2. Does it serve the epic goal?
-3. Will downstream tasks be able to use it?
-4. Are there critical bugs? (UBS scan still runs)
-### 3. Retry Options on Abort
-When a swarm aborts (user request or failure), provide clear recovery paths:
-```json
-{
-  "retry_options": {
-    "same_plan": "/swarm --retry",
-    "edit_plan": "/swarm --retry --edit",
-    "fresh_start": "/swarm \"original task\""
-  }
-}
-```
-**`--retry`**: Resume with same plan, skip completed tasks
-**`--retry --edit`**: Show plan for modification before resuming
-**Fresh start**: Decompose from scratch
-This requires persisting swarm session state (already have this via Hive cells).
-## Implementation
-### Phase 1: Structured Review (Priority)
-1. Add review step to `swarm_complete`
-2. Create review prompt with epic context injection
-3. Handle needs_changes → worker retry loop (max 3)
-4. Keep UBS scan as additional safety net
-### Phase 2: Worktree Isolation
-1. Add `isolation` mode to `swarm_init`
-2. Implement worktree lifecycle tools
-3. Update worker prompts to work in worktree path
-4. Add cherry-pick merge on completion
-5. Add cleanup on abort
-### Phase 3: Retry Options
-1. Persist session state for recovery
-2. Add `--retry` and `--retry --edit` flags
-3. Skip completed tasks on retry
-4. Show plan editor for `--edit` mode
-## Consequences
-### Positive
-- **Better quality**: Structured review catches issues before integration
-- **Safer large refactors**: Worktree isolation eliminates merge conflicts
-- **Cleaner recovery**: Retry options reduce friction after failures
-- **Coordinator stays in control**: Review keeps human-in-the-loop feel
-### Negative
-- **More complexity**: Two isolation modes to maintain
-- **Slower completion**: Review step adds latency
-- **Disk usage**: Worktrees consume space (mitigated by cleanup)
-### Neutral
-- **Credit**: Patterns inspired by nexxeln/opencode-config - should acknowledge in docs
-## Alternatives Considered
-### Separate Reviewer Agent
-nexxeln uses a dedicated reviewer subagent. We chose coordinator-as-reviewer because:
-- Avoids context duplication (coordinator already has epic context)
-- Faster feedback loop
-- Coordinator can make "ship it" judgment calls
-### Staged Changes on Finalize
-nexxeln soft-resets to leave changes staged for user review. We're skipping this because:
-- Our flow already has explicit commit step
-- Hive tracks what changed
-- User can always `git diff` before committing
-### Always Use Worktrees
-Could simplify by always using worktrees. Rejected because:
-- Overkill for most tasks
-- Slower setup/teardown
-- File reservations work fine for typical parallel work
-## References
-- [nexxeln/opencode-config](https://github.com/nexxeln/opencode-config) - Source of inspiration
-- Epic: `bd-lf2p4u-mjaja96b9da` - Swarm Enhancements
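
Note on the review flow in the deleted ADR-007: the diagram describes a bounded loop (review, fix on `needs_changes`, at most 3 attempts) without showing the control flow. A minimal sketch follows, assuming that "max 3 attempts" counts review rounds; the ADR does not say whether it counts reviews or fixes. `ReviewVerdict`, `ReviewLoopResult`, and `runReviewLoop` are hypothetical names for illustration only and do not appear in the package.

```typescript
// Hypothetical types; the two statuses mirror the ADR's approved/needs_changes.
type ReviewVerdict =
  | { status: "approved" }
  | { status: "needs_changes"; feedback: string };

interface ReviewLoopResult<T> {
  approved: boolean;
  attempts: number;   // review rounds actually run
  output: T;          // last version of the worker output
  feedback: string[]; // feedback from every rejected round
}

async function runReviewLoop<T>(
  initial: T,
  review: (output: T) => Promise<ReviewVerdict>,
  fix: (output: T, feedback: string) => Promise<T>,
  maxAttempts: number = 3, // assumption: the limit applies to review rounds
): Promise<ReviewLoopResult<T>> {
  let output = initial;
  const feedback: string[] = [];

  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const verdict = await review(output);
    if (verdict.status === "approved") {
      return { approved: true, attempts: attempt, output, feedback };
    }
    feedback.push(verdict.feedback);
    // Only request a fix when another review round remains.
    if (attempt < maxAttempts) {
      output = await fix(output, verdict.feedback);
    }
  }

  // Attempts exhausted: the caller decides whether to abort or escalate.
  return { approved: false, attempts: maxAttempts, output, feedback };
}
```

Returning the collected feedback rather than throwing leaves the coordinator free to choose between aborting and offering the retry options described in section 3 of the ADR.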

package/docs/planning/ADR-008-worker-handoff-protocol.md DELETED

@@ -1,293 +0,0 @@
-# ADR-008: Worker Handoff Protocol - Structured Contracts Over Prose
-## Status
-Proposed
-## Context
-The current `SUBTASK_PROMPT_V2` is a **280-line prose instruction manual** that gets injected into every swarm worker's context. This approach has fundamental problems:
-### Current Problems
-1. **Workers ignore prose** - Long text instructions get skimmed or missed entirely
-2. **No validation** - Can't programmatically verify workers followed protocol
-3. **Context bloat** - 280 lines * N workers burns tokens fast
-4. **Drift and violations** - Workers modify files outside their scope, no automatic detection
-5. **Manual error recovery** - Coordinator can't auto-detect contract violations
-**Concrete example of failure:**
-```
-Worker assigned: ["src/auth/service.ts"]
-Worker actually touched: ["src/auth/service.ts", "src/lib/jwt.ts", "src/types/user.ts"]
-                         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
-                         Scope creep undetected until swarm_complete
-```
-Current `swarm_complete` validates `files_touched ⊆ files_owned`, but the **contract** was never machine-readable to begin with.
-### Research & Inspirations
-From "Patterns for Building AI Agents" and production event-driven systems:
-**mdflow adapter pattern:**
-- Convention-based behavior inference
-- Template variables define expectations
-- Minimal configuration, maximum clarity
-**Bellemare's event-driven orchestration:**
-- Explicit contracts between services
-- Commands vs Events distinction
-- Contract violations fail fast with clear errors
-**Key insight:** Agents need **two channels**:
-1. **Contract** (machine-readable, validated) - WHAT to do, WHERE to do it
-2. **Context** (human-readable, advisory) - WHY it matters, HOW it fits together
-## Decision
-Replace 280-line prose with **WorkerHandoff envelope** that separates contract from context.
-### WorkerHandoff Structure
-```typescript
-interface WorkerHandoff {
-  // Machine-readable - enforced by tools
-  contract: {
-    task_id: string;              // Cell ID for tracking
-    files_owned: string[];        // Exclusive write access (validated)
-    files_readonly: string[];     // Can read, MUST NOT modify (validated)
-    dependencies_completed: string[];  // Tasks that finished before this
-    success_criteria: string[];   // Exit conditions (checkable)
-  };
-  // Human-readable - advisory context
-  context: {
-    epic_summary: string;         // Big picture goal
-    your_role: string;            // What this subtask accomplishes
-    what_others_did: string;      // Dependency outputs
-    what_comes_next: string;      // Downstream task expectations
-  };
-  // Escalation paths - when things go wrong
-  escalation: {
-    blocked_contact: string;      // "coordinator" or agent name
-    scope_change_protocol: string; // "swarmmail_send + await approval"
-  };
-}
-```
-### Example Handoff
-```json
-{
-  "contract": {
-    "task_id": "bd-123.2",
-    "files_owned": ["src/auth/service.ts", "src/auth/service.test.ts"],
-    "files_readonly": ["src/types/user.ts", "src/lib/jwt.ts"],
-    "dependencies_completed": ["bd-123.1"],
-    "success_criteria": [
-      "AuthService.login() returns JWT token",
-      "Tests pass: bun test src/auth/service.test.ts",
-      "Type check passes: tsc --noEmit"
-    ]
-  },
-  "context": {
-    "epic_summary": "Add OAuth authentication to user service",
-    "your_role": "Implement AuthService with JWT token generation",
-    "what_others_did": "bd-123.1 created User schema with email/password fields",
-    "what_comes_next": "bd-123.3 will integrate this service into API routes"
-  },
-  "escalation": {
-    "blocked_contact": "coordinator",
-    "scope_change_protocol": "swarmmail_send(subject='Scope Change', ack_required=true)"
-  }
-}
-```
-### Validation in swarm_complete
-```typescript
-// swarm_complete now validates against contract
-function validateCompletion(handoff: WorkerHandoff, result: CompletionReport) {
-  const violations: string[] = [];
-  // 1. File scope violations
-  const unauthorized = result.files_touched.filter(
-    f => !handoff.contract.files_owned.includes(f)
-  );
-  if (unauthorized.length > 0) {
-    violations.push(`Touched unauthorized files: ${unauthorized.join(", ")}`);
-  }
-  // 2. Success criteria (checkable ones)
-  for (const criterion of handoff.contract.success_criteria) {
-    if (criterion.startsWith("Tests pass:")) {
-      // Run the test command, validate exit 0
-    }
-    if (criterion.startsWith("Type check passes:")) {
-      // Run tsc --noEmit, validate exit 0
-    }
-  }
-  // 3. Learning signals from violations
-  if (violations.length > 0) {
-    recordLearningSignal({
-      task_id: handoff.contract.task_id,
-      violation_type: "scope_creep",
-      details: violations,
-      impact: "negative"  // Penalize decomposition strategy
-    });
-  }
-  return { valid: violations.length === 0, violations };
-}
-```
-### Integration with Existing Tools
-**swarm_spawn_subtask generates handoffs:**
-```typescript
-export const swarm_spawn_subtask = tool(/* ... */)
-  .handler(async ({ input, context }) => {
-    const handoff: WorkerHandoff = {
-      contract: {
-        task_id: input.bead_id,
-        files_owned: input.files,
-        files_readonly: inferReadonlyFiles(input.files, epicContext),
-        dependencies_completed: input.dependencies_completed || [],
-        success_criteria: generateSuccessCriteria(input.subtask_description)
-      },
-      context: {
-        epic_summary: epicContext.summary,
-        your_role: input.subtask_title,
-        what_others_did: summarizeDependencies(input.dependencies_completed),
-        what_comes_next: summarizeDownstream(input.bead_id)
-      },
-      escalation: {
-        blocked_contact: "coordinator",
-        scope_change_protocol: "swarmmail_send(subject='Scope Change', ack_required=true)"
-      }
-    };
-    return formatHandoff(handoff); // Compact JSON + minimal prose wrapper
-  });
-```
-**swarm_complete validates contract:**
-```typescript
-export const swarm_complete = tool(/* ... */)
-  .handler(async ({ input, context }) => {
-    const handoff = getStoredHandoff(input.bead_id);
-    const validation = validateCompletion(handoff, {
-      files_touched: input.files_touched,
-      summary: input.summary
-    });
-    if (!validation.valid) {
-      throw new Error(
-        `Contract violations detected:\n${validation.violations.join("\n")}`
-      );
-    }
-    // Proceed with UBS scan, reservation release, etc.
-  });
-```
-## Consequences
-### Positive
-- **Validation enforced** - Can't complete with contract violations
-- **Clear boundaries** - Workers know exactly what's in/out of scope
-- **Better learning** - Scope creep violations feed back into strategy selection
-- **Context efficiency** - Contract is ~30 lines JSON vs 280 lines prose
-- **Fail fast** - Violations detected immediately, not during merge
-- **Programmatic recovery** - Coordinator can auto-detect and reassign work
-### Negative
-- **Requires storage** - Handoffs must persist (already have event store)
-- **Success criteria limited** - Can't validate all criteria automatically
-- **Migration cost** - Existing `SUBTASK_PROMPT_V2` users need update
-- **More upfront work** - Coordinator must generate better contracts
-### Neutral
-- **Prose still exists** - `context` field provides human explanation, just smaller
-- **Not eliminating checklist** - 9-step survival checklist stays, but moves to tool enforcement
-## Implementation Notes
-### Phase 1: Storage & Schema
-1. Add `WorkerHandoff` schema to swarm-mail event types
-2. Store handoffs in event log when spawning subtasks
-3. Retrieve handoffs in `swarm_complete` for validation
-### Phase 2: Generation Logic
-1. Implement `inferReadonlyFiles()` - analyze imports/dependencies
-2. Implement `generateSuccessCriteria()` - parse task description for checkable conditions
-3. Implement `summarizeDependencies()` and `summarizeDownstream()` - build context from epic graph
-### Phase 3: Validation
-1. Add contract validation to `swarm_complete`
-2. Implement checkable criteria runners (test commands, type checks)
-3. Record learning signals for violations
-### Phase 4: Migration
-1. Update `formatSubtaskPromptV2` to generate handoff JSON
-2. Deprecate 280-line prose template
-3. Update tests for new handoff format
-### Phase 5: Enhanced Features (Future)
-1. **Readonly enforcement** - Detect modifications to `files_readonly` via git diff
-2. **Dependency validation** - Verify `dependencies_completed` actually ran first
-3. **Auto-generated success criteria** - Parse test files, infer criteria from code
-## Alternatives Considered
-### Keep Prose, Add Validation
-Keep `SUBTASK_PROMPT_V2` but add validation after-the-fact. **Rejected** because:
-- Still burns 280 lines of context per worker
-- Workers still ignore prose
-- Validation happens too late (after work done)
-### Minimal Contract Only
-Remove context entirely, pure machine contract. **Rejected** because:
-- Workers need WHY to make good judgment calls
-- Context helps with edge cases not in contract
-- Loss of human readability hurts debugging
-### Command Pattern (Bellemare Style)
-Full event-sourcing with Command objects. **Rejected** because:
-- Over-engineered for current needs
-- Already have event store for coordination
-- Contract + context is simpler and sufficient
-## References
-- **"Patterns for Building AI Agents"** - Subagent context sharing patterns
-- **mdflow** - Convention-based adapter design, template variable contracts
-- **Bellemare's "Building Event-Driven Microservices"** - Explicit contracts, fail-fast validation
-- **Current implementation:** `src/swarm-prompts.ts` (SUBTASK_PROMPT_V2, lines 253-530)
-- **Related:** ADR-007 (Structured Review), ADR-002 (Package Extraction)
-## Success Criteria
-- [ ] `WorkerHandoff` schema defined and validated with Zod
-- [ ] `swarm_spawn_subtask` generates handoffs instead of raw prose
-- [ ] `swarm_complete` validates contract before accepting completion
-- [ ] Scope violations trigger learning signals (negative feedback)
-- [ ] Workers receive handoff as JSON + compact context wrapper (<50 lines)
-- [ ] Test suite validates contract enforcement catches violations
-- [ ] Migration path documented for existing swarm users
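
Note on contract validation in the deleted ADR-008: `validateCompletion` reports every file outside `files_owned` as one kind of violation, while Phase 5 lists readonly enforcement as future work. The sketch below shows one way the two cases could be told apart. It assumes paths are compared as exact strings with no normalization or globbing. `HandoffContractSketch`, `ScopeCheck`, and `checkFileScope` are hypothetical names; only the `task_id`, `files_owned`, and `files_readonly` fields come from the ADR.

```typescript
// Subset of the ADR's contract fields needed for a file-scope check.
interface HandoffContractSketch {
  task_id: string;
  files_owned: string[];
  files_readonly: string[];
}

interface ScopeCheck {
  valid: boolean;
  readonlyViolations: string[]; // touched files the contract marked read-only
  outOfScope: string[];         // touched files the contract never mentioned
}

function checkFileScope(
  contract: HandoffContractSketch,
  filesTouched: string[],
): ScopeCheck {
  const owned = new Set(contract.files_owned);
  const readonly = new Set(contract.files_readonly);
  const readonlyViolations: string[] = [];
  const outOfScope: string[] = [];

  // Deduplicate so a file reported twice yields a single violation.
  for (const file of Array.from(new Set(filesTouched))) {
    if (owned.has(file)) continue;
    if (readonly.has(file)) {
      readonlyViolations.push(file);
    } else {
      outOfScope.push(file);
    }
  }

  return {
    valid: readonlyViolations.length === 0 && outOfScope.length === 0,
    readonlyViolations,
    outOfScope,
  };
}

// Contract values taken from the ADR's example handoff.
const exampleScope = checkFileScope(
  {
    task_id: "bd-123.2",
    files_owned: ["src/auth/service.ts", "src/auth/service.test.ts"],
    files_readonly: ["src/types/user.ts", "src/lib/jwt.ts"],
  },
  ["src/auth/service.ts", "src/lib/jwt.ts", "README.md"],
);
```

Separating the two lists would let a learning signal distinguish "modified a declared dependency" from "wandered outside the task entirely", which the ADR groups together as scope creep.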