npm - @chllming/wave-orchestration - Versions diffs - 0.6.1 → 0.6.2 - Mend

@chllming/wave-orchestration 0.6.1 → 0.6.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (26)

package/CHANGELOG.md +9 -0
package/README.md +75 -30
package/docs/README.md +15 -3
package/docs/concepts/context7-vs-skills.md +24 -0
package/docs/concepts/runtime-agnostic-orchestration.md +17 -2
package/docs/concepts/what-is-a-wave.md +28 -0
package/docs/evals/README.md +2 -0
package/docs/guides/terminal-surfaces.md +2 -0
package/docs/plans/wave-orchestrator.md +11 -3
package/docs/reference/runtime-config/README.md +4 -4
package/docs/reference/runtime-config/claude.md +6 -1
package/docs/reference/runtime-config/codex.md +2 -2
package/docs/reference/runtime-config/opencode.md +1 -1
package/docs/research/agent-context-sources.md +2 -0
package/docs/research/coordination-failure-review.md +37 -13
package/package.json +1 -1
package/releases/manifest.json +18 -0
package/scripts/wave-orchestrator/agent-state.mjs +10 -3
package/scripts/wave-orchestrator/config.mjs +19 -0
package/scripts/wave-orchestrator/dashboard-renderer.mjs +150 -20
package/scripts/wave-orchestrator/dashboard-state.mjs +8 -0
package/scripts/wave-orchestrator/executors.mjs +67 -4
package/scripts/wave-orchestrator/launcher-runtime.mjs +1 -0
package/scripts/wave-orchestrator/launcher.mjs +245 -10
package/scripts/wave-orchestrator/terminals.mjs +25 -0
package/scripts/wave-orchestrator/wave-files.mjs +31 -0

package/CHANGELOG.md CHANGED

@@ -2,6 +2,15 @@
 ## Unreleased
+## 0.6.2 - 2026-03-22
+- Added first-class `claude.effort` support across config profiles, lane overrides, and per-agent `### Executor` blocks, and now emit `--effort` in Claude launch previews and live runs.
+- Clarified operator runtime visibility with additive `launch-preview.json` `limits` metadata, including explicit known turn ceilings for Claude/OpenCode and explicit Codex opacity when Wave does not emit a turn-limit flag.
+- Clarified dashboard and terminal UX: global wave counts now distinguish done, active, pending, and failed agents; the current-wave dashboard keeps a stable terminal name; and TTY dashboards use simple color cues for faster scanning.
+- Pruned stale dry-run executor preview directories when wave agent sets shrink, so manual inspection of `.tmp/.../dry-run/executors/` matches the current manifest.
+- Preserved already-landed implementation slices for shared promoted components by retrying only the sibling owners that still owe closure proof instead of blindly replaying the landed owner.
+- Added release-surface alignment regression coverage and updated the shipped docs so README, runtime-config references, changelog, and release metadata match the `0.6.2` package surface.
 ## 0.6.1 - 2026-03-22
 - Published the post-merge `main` source as `0.6.1` so the default branch, tagged source, and package docs all agree on the current release.
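
Note on the `claude.effort` entry in the hunk above: the changelog says Wave now emits `--effort` for Claude launches, and the `claude.md` reference in this diff lists the accepted values as `low|medium|high|max`. The mapping could look roughly like the sketch below. It is not the code shipped in `executors.mjs`: the helper name `buildClaudeArgs` and the shape of the `claude` config object are assumptions made for illustration. Only the flags `-p`, `--no-session-persistence`, `--effort`, and `--max-turns` are taken from the diff.

```javascript
// Hypothetical helper for illustration; not the package's actual implementation.
const EFFORT_LEVELS = ["low", "medium", "high", "max"];

function buildClaudeArgs(claude = {}) {
  // Base headless invocation as described in the claude.md reference.
  const args = ["-p", "--no-session-persistence"];

  if (claude.effort !== undefined) {
    // Validate the enum only; model-specific compatibility is left to Claude Code.
    if (!EFFORT_LEVELS.includes(claude.effort)) {
      throw new Error(`claude.effort must be one of: ${EFFORT_LEVELS.join(", ")}`);
    }
    args.push("--effort", claude.effort);
  }

  if (Number.isInteger(claude.maxTurns)) {
    args.push("--max-turns", String(claude.maxTurns));
  }

  return args;
}

console.log(buildClaudeArgs({ effort: "high", maxTurns: 12 }).join(" "));
// -p --no-session-persistence --effort high --max-turns 12
```

Rejecting unknown values early keeps a typo in a profile or `### Executor` block from reaching a live launch, while leaving per-model checks (for example whether `max` is supported) to the runtime.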

package/README.md CHANGED

@@ -1,48 +1,91 @@
 # Wave Orchestration
-Wave Orchestration is a repository harness for running multi-agent work in bounded waves. You define shared plan docs plus per-wave markdown, the launcher validates the wave, compiles prompts and inboxes, runs implementation agents first, then performs staged closure. Every run writes durable state under `.tmp/<lane>-wave-launcher/` so humans can inspect progress, replay outcomes, and intervene only when needed.
+Wave Orchestration is my framework for "vibe-coding." It keeps the speed of agentic coding, but makes the runtime, coordination, and context model explicit enough to inspect, replay, and improve.
+The framework does three things:
+1. It abstracts the agent runtime away without flattening everything to the lowest common denominator. The same waves, skills, planning, evaluation, proof, and traces can run across Claude, Codex, and OpenCode while still preserving runtime-native features through executor adapters.
+2. It runs work as a blackboard-style multi-agent system. Agents do not just exchange chat messages; they work against shared state, generated inboxes, explicit ownership, and staged closure, and a wave keeps going until the declared goals, proof, production-live criteria, or eval targets are actually satisfied.
+3. It compiles context dynamically for the task at hand. Shared memory, generated runtime files, project defaults, skills, Context7, and cached external docs are assembled at runtime so you do not have to hand-maintain separate Claude, Codex, or other context files.
+## Core Ideas
+- `One orchestrator, many runtimes.`
+  Planning, skills, evals, proof, and traces stay constant while the executor adapter changes.
+- `A blackboard-style multi-agent system.`
+  The coordination log is canonical shared state; the rolling board, shared summary, inboxes, ledger, and integration views are generated projections over that state.
+- `Completion is goal-driven and proof-bounded.`
+  Waves close only when deliverables, proof artifacts, eval targets, dependencies, and closure stewards agree.
+- `Context is compiled, not hand-maintained.`
+  Wave builds runtime context from repo state, project memory, skills, Context7, and generated overlays.
+- `The system is inspectable and replayable.`
+  Dry-run previews, logs, dashboards, ledgers, traces, and replay make the system debuggable instead of mysterious.
+## How The Architecture Works
+1. Define shared docs plus `docs/plans/waves/wave-<n>.md` files, or generate them with `wave draft`.
+2. Run `wave launch --dry-run` to validate the wave and materialize prompts, shared summaries, inboxes, dashboards, and executor previews before any live execution.
+3. During live execution, implementation agents write claims, evidence, requests, and decisions into the canonical coordination log instead of relying on ad hoc terminal narration.
+4. The launcher compiles blackboard projections from that state: rolling board, shared summary, per-agent inboxes, ledger, docs queue, dependency views, and integration summaries.
+5. Closure runs only when the integrated state is ready: optional `cont-EVAL` (`E0`), optional security review, integration (`A8`), documentation (`A9`), and `cont-QA` (`A0`).
+## Architecture Surfaces
+- `Wave contract`
+  Shared plan docs, wave markdown, deliverables, proof artifacts, and eval targets define the goal.
+- `Shared state`
+  The coordination log is the source of truth; the board is for humans, not the scheduler.
+- `Runtime abstraction`
+  Executor adapters preserve Codex, Claude, and OpenCode-specific launch features without changing the higher-level wave contract.
+- `Compiled context`
+  Project profile memory, shared summary, inboxes, skills, Context7, and runtime overlays are generated for the chosen executor.
+- `Proof and closure`
+  Exit contracts, proof artifacts, eval markers, and closure stewards stop waves from closing on narrative-only PASS.
+- `Replay and audit`
+  Traces capture the attempt so failures can be inspected and replayed instead of guessed from screenshots.
-## How It Works
+## Example Output
-1. Write shared docs and one or more `docs/plans/waves/wave-<n>.md` files.
-2. Run `wave launch --dry-run` to validate the wave and materialize prompts, inboxes, dashboards, and executor previews.
-3. A real launch runs implementation agents first. Agents post claims, evidence, requests, and decisions into the coordination log and rolling message board.
-4. When implementation gates pass, closure runs in order: optional `cont-EVAL` (`E0`), integration (`A8`), documentation (`A9`), and `cont-QA` (`A0`).
-5. Operators use the generated ledgers, inboxes, feedback queue, dependency views, and traces instead of guessing from raw terminal output.
+Representative rolling message board output from a real wave run:
-## Features
+<img src="./docs/image.png" alt="Example rolling message board output showing claims, evidence, requests, and cont-QA closure for a wave run" width="100%" />
-- Planner foundation with saved project profile memory, draft specs, and rendered wave markdown
-- Implementation-first execution with staged closure and retry support
-- Durable coordination log, rolling message board, compiled inboxes, and per-wave ledger
-- Dry-run prompt and executor preview mode before any real agent launch
-- Context7 bundle selection, caching, and prompt injection
-- Multi-executor support for Codex, Claude Code, OpenCode, and a local smoke executor
-- Cross-runtime skill packs loaded from `skills/` and resolved by lane, role, runtime, deploy kind, and per-agent attachment
-- Human feedback routing, clarification triage, helper assignment, and cross-lane dependencies
-- Replayable trace bundles for regression and release verification
+## Common MAS Failure Cases
-## Example Output
+Recent multi-agent research keeps returning to the same failure modes:
-Representative rolling message board output from a real wave run:
+- `Cosmetic board, no canonical state`
+  Agents appear coordinated, but there is no machine-trustable source of truth underneath the conversation.
+- `Hidden evidence never gets pooled`
+  One agent has the critical fact, but it never reaches shared state before closure.
+- `Communication without global-state reconstruction`
+  Agents exchange information, but nobody reconstructs the correct cross-agent picture.
+- `Simultaneous coordination collapse`
+  A team that looks fine in serial work falls apart when multiple owners, blockers, or resources must move together.
+- `Expert signal gets averaged away`
+  The strongest specialist view is diluted into a weaker compromise.
+- `Contradictions get smoothed over`
+  Conflicts are narrated away instead of being turned into explicit repair work.
+- `Premature closure`
+  Agents say they are done before proof, evals, or integrated state actually support PASS.
-<img src="./docs/image.png" alt="Example rolling message board output showing claims, evidence, requests, and cont-QA closure for a wave run" width="100%" />
+Wave is built to mitigate those failures with canonical shared state, generated blackboard projections, explicit ownership, goal-driven, proof-bounded closure, and replayable traces. For the research framing and the current gaps, see [docs/research/coordination-failure-review.md](./docs/research/coordination-failure-review.md).
 ## Quick Start
 Current release:
-- `@chllming/wave-orchestration@0.6.1`
-- Release tag: [`v0.6.1`](https://github.com/chllming/wave-orchestration/releases/tag/v0.6.1)
+- `@chllming/wave-orchestration@0.6.2`
+- Release tag: [`v0.6.2`](https://github.com/chllming/wave-orchestration/releases/tag/v0.6.2)
 - Public install path: npmjs
 - Authenticated fallback: GitHub Packages
-Highlights in `0.6.1`:
+Highlights in `0.6.2`:
-- `cont-EVAL` (`E0`) is now a first-class optional eval stage before integration, separate from final `cont-QA` closure.
-- Optional security review now has a dedicated role, report path, and `[wave-security]` closure marker.
-- `wave adhoc plan|run|show|promote` now supports transient operator requests on the same launcher substrate.
-- Starter docs and skills now cover the current `0.6.1` closure, benchmark, security, and provider surfaces.
+- Runtime previews and docs now expose first-class Claude effort plus structured limit metadata, making known Claude/OpenCode ceilings explicit and Codex opacity explicit.
+- The global dashboard and VS Code terminal surfaces are easier to read: active vs pending counts are distinct, the current-wave dashboard keeps a stable terminal name, and TTY dashboards now use simple color cues.
+- Dry-run executor preview directories now prune stale agent folders when a wave shrinks.
+- Shared promoted-component retries now preserve already-landed owner slices and relaunch only the sibling owners still needed for closure.
 Requirements:
@@ -59,7 +102,7 @@ pnpm add -D @chllming/wave-orchestration
 pnpm exec wave init
 pnpm exec wave doctor
 pnpm exec wave launch --lane main --dry-run --no-dashboard
-pnpm exec wave coord show --lane main --wave 0 --dry-run
+pnpm exec wave coord show --lane main --wave 0 --dry-run --json
 ```
 If the repo already has Wave config, plans, or waves you want to keep:
@@ -99,14 +142,16 @@ node scripts/wave.mjs launch --lane main --dry-run --no-dashboard
 ## Learn More
 - [docs/README.md](./docs/README.md): docs map and suggested structure
-- [docs/concepts/what-is-a-wave.md](./docs/concepts/what-is-a-wave.md): wave anatomy, lifecycle, and closure model
+- [docs/concepts/what-is-a-wave.md](./docs/concepts/what-is-a-wave.md): wave anatomy, blackboard execution model, and proof-bounded closure
+- [docs/concepts/runtime-agnostic-orchestration.md](./docs/concepts/runtime-agnostic-orchestration.md): how one orchestration substrate spans Claude, Codex, OpenCode, and local execution
+- [docs/concepts/context7-vs-skills.md](./docs/concepts/context7-vs-skills.md): compiled context, external truth, and repo-owned operating knowledge
 - [docs/guides/planner.md](./docs/guides/planner.md): `wave project` and `wave draft` workflow
-- [docs/concepts/context7-vs-skills.md](./docs/concepts/context7-vs-skills.md): when to use external docs vs repo-owned skills
 - [docs/guides/terminal-surfaces.md](./docs/guides/terminal-surfaces.md): tmux, VS Code terminal registry, and dry-run surfaces
 - [docs/plans/wave-orchestrator.md](./docs/plans/wave-orchestrator.md): operator runbook
 - [docs/plans/context7-wave-orchestrator.md](./docs/plans/context7-wave-orchestrator.md): Context7 setup and bundle authoring
 - [docs/reference/runtime-config/README.md](./docs/reference/runtime-config/README.md): executor, runtime, and skill-projection configuration
 - [docs/reference/skills.md](./docs/reference/skills.md): skill bundle format, resolution order, and runtime projection
+- [docs/research/coordination-failure-review.md](./docs/research/coordination-failure-review.md): MAS failure modes from the research and how Wave responds
 - [CHANGELOG.md](./CHANGELOG.md): release history
 ## Research Sources

package/docs/README.md CHANGED

@@ -1,6 +1,12 @@
 # Wave Documentation
-This repository now uses a layered docs structure, but the useful path is journey-first:
+These docs are organized around three core ideas:
+- one orchestrator, many runtimes across Claude, Codex, OpenCode, and local execution
+- a blackboard-style multi-agent system with goal-driven, proof-bounded closure
+- compiled context from shared state, skills, runtime files, and Context7 instead of hand-maintained per-runtime context files
+The useful path is journey-first:
 - start with one core concept doc
 - then use one end-to-end workflow guide
@@ -22,7 +28,11 @@ This repository now uses a layered docs structure, but the useful path is journe
 ## Start Here
 - New to Wave:
-  Read [concepts/what-is-a-wave.md](./concepts/what-is-a-wave.md). It now covers the core execution model, runtime posture, closure, and state model in one place.
+  Read [concepts/what-is-a-wave.md](./concepts/what-is-a-wave.md). It covers the blackboard execution model, proof-bounded closure, runtime posture, and durable state model in one place.
+- Want the runtime abstraction story:
+  Read [concepts/runtime-agnostic-orchestration.md](./concepts/runtime-agnostic-orchestration.md) to see how planning, skills, evals, proof, and traces stay stable across Claude, Codex, OpenCode, and local execution.
+- Want the context story:
+  Read [concepts/context7-vs-skills.md](./concepts/context7-vs-skills.md) for the compiled-context model: shared summary, inboxes, project defaults, skills, Context7, and runtime overlays.
 - Drafting or revising waves:
   Read [guides/author-and-run-waves.md](./guides/author-and-run-waves.md), then use [plans/wave-orchestrator.md](./plans/wave-orchestrator.md) as the operator runbook.
 - Adding a security review pass:
@@ -37,8 +47,10 @@ This repository now uses a layered docs structure, but the useful path is journe
   Start with [guides/author-and-run-waves.md](./guides/author-and-run-waves.md), then use [plans/wave-orchestrator.md](./plans/wave-orchestrator.md) for the live operator flow.
 - Tuning runtime behavior:
   Read [reference/runtime-config/README.md](./reference/runtime-config/README.md) and [reference/skills.md](./reference/skills.md).
+- Want the research framing behind the design:
+  Read [research/coordination-failure-review.md](./research/coordination-failure-review.md) for the common MAS failure modes and how Wave tries to mitigate them, then use [research/agent-context-sources.md](./research/agent-context-sources.md) as the bibliography.
 - Looking for supporting concept pages:
-  Use [concepts/runtime-agnostic-orchestration.md](./concepts/runtime-agnostic-orchestration.md), [concepts/operating-modes.md](./concepts/operating-modes.md), and [concepts/context7-vs-skills.md](./concepts/context7-vs-skills.md) after the main concept and workflow docs.
+  Use [concepts/operating-modes.md](./concepts/operating-modes.md) after the main concept, runtime, and context docs.
 ## Package vs Repo-Owned Material

package/docs/concepts/context7-vs-skills.md CHANGED

@@ -4,6 +4,30 @@ Context7 and skills solve different problems.
 Use Context7 for external library truth. Use skills for repo-owned, reusable operating knowledge.
+That comparison matters because Wave treats context as something to compile at runtime, not something humans should maintain separately for Claude, Codex, OpenCode, and every other executor.
+## Compiled Context, Not Hand-Maintained Context Files
+The active context for an agent is assembled from multiple layers:
+- repository source and the wave's owned files
+- wave markdown and shared plan docs
+- generated shared summary and per-agent inbox
+- saved project defaults such as `.wave/project-profile.json`
+- resolved repo-owned skills
+- selected Context7 snippets for external library truth
+- generated runtime overlays and launch artifacts
+Because of that, the question is not "which hand-written context file does this runtime use?" The question is "which context sources does this wave compile for the selected runtime right now?"
+Runtime-specific context is still real, but it is mostly generated:
+- Claude gets merged system-prompt and settings overlays
+- Codex gets executor flags plus runtime-projected skills
+- OpenCode gets generated config, attachments, and runtime instructions
+That keeps the context model unified even when the transport layer differs.
 ## Short Version
 - Context7

package/docs/concepts/runtime-agnostic-orchestration.md CHANGED

@@ -1,15 +1,22 @@
 # Runtime-Agnostic Orchestration
+In short: one orchestrator, many runtimes.
 Wave is runtime agnostic at the orchestration layer.
-That means planning, coordination, closure, and traces do not depend on whether the selected executor is Codex, Claude Code, OpenCode, or the local smoke executor.
+That means planning, skills, evaluation, proof, coordination, closure, and traces do not depend on whether the selected executor is Codex, Claude Code, OpenCode, or the local smoke executor.
+Wave abstracts the runtime away without flattening everything to the lowest common denominator. The wave contract stays stable while the executor adapter preserves the useful runtime-native features.
 ## What Stays The Same Across Runtimes
 These layers are runtime-neutral:
 - wave parsing and validation
+- planner-produced wave specs and authored wave markdown
+- eval targets, deliverables, and proof artifacts
 - component and closure gates
+- skill resolution and attachment policy
 - compiled shared summaries and per-agent inboxes
 - coordination log and rendered message board
 - helper assignments and dependency handling
@@ -34,11 +41,19 @@ Runtime-specific behavior is isolated to the executor adapter layer:
 The orchestration substrate above those adapters does not need to know how the runtime transports prompts.
+This is the important distinction:
+- the orchestration layer owns goals, ownership, proof, and shared state
+- the executor adapter owns prompt transport, runtime-native flags, files, and settings
+That split is what lets Wave stay portable without giving up runtime-specific leverage.
 ## Why This Matters
 Runtime agnosticism gives you:
-- the same plan and closure model across vendors
+- the same plan, skill, and closure model across vendors
+- the same eval and proof model across vendors
 - replay and audit surfaces that do not care which runtime produced the work
 - per-role runtime choice without rewriting authoring conventions
 - retry-time fallback without inventing a second planning model

package/docs/concepts/what-is-a-wave.md CHANGED

@@ -2,6 +2,8 @@
 A wave is the main planning and execution unit in Wave Orchestration.
+It turns free-form agent runs into a bounded blackboard-style work package with shared state, explicit ownership, dynamic context, goal-driven execution, and proof-bounded closure.
 It is not just a prompt file. A wave is a bounded slice of repository work with:
 - explicit scope
@@ -34,6 +36,16 @@ Waves force a higher planning bar than ad hoc prompts. A good wave answers:
 - What evidence closes the wave?
 - Which dependencies, helper requests, or escalations can still block completion?
+## Why This Is A Blackboard-Style Model
+Wave is blackboard-style because agents work against shared state instead of treating chat output as the system of record.
+- the canonical coordination log is the machine-readable source of truth
+- the rolling board is a human projection over that state, not the scheduler's authority
+- shared summaries and per-agent inboxes are compiled views over the same state
+- helper assignments, clarification flow, dependencies, and integration all operate on that shared state
+- closure depends on the integrated state, not on whether an agent says "done"
 ## Wave Anatomy
 Wave markdown is the authored execution surface today. A typical wave can include:
@@ -136,6 +148,22 @@ Current live waves are strict about closure artifacts:
 - `cont-QA` must emit both a final `Verdict:` line and a final `[wave-gate]` marker.
 - Replay keeps read-only compatibility with older traces and older evaluator-era artifacts, but live waves do not pass on verdict-only or underspecified closure markers.
+## Context Is Compiled At Runtime
+Wave also treats context as something to compile for the current task, not something humans should hand-maintain separately for each runtime.
+The active context for an agent is assembled from:
+- repository source and owned files
+- wave markdown and shared plan docs
+- saved project defaults such as `.wave/project-profile.json`
+- the generated shared summary and the agent's inbox
+- resolved skills and runtime-specific skill projections
+- selected Context7 snippets for external library truth
+- generated executor overlays and launch artifacts
+That is why switching an agent between Codex, Claude, or OpenCode does not require maintaining separate parallel context files. The orchestrator recomputes the context package for the selected runtime and the current wave state.
 ## What Makes A Wave "Done"
 A wave is not done because an agent said so. It is done only when the runtime surfaces agree:
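
Note on the blackboard-style section added in the hunks above: the doc describes per-agent inboxes as compiled views over one canonical coordination log. A minimal sketch of that projection idea follows. The record fields (`kind`, `from`, `to`, `summary`) and the function `compileInbox` are hypothetical; the diff does not show the real coordination log schema.

```javascript
// Hypothetical log records; the real coordination log schema is not shown in this diff.
const coordinationLog = [
  { kind: "claim", from: "A1", to: [], summary: "Owns parser refactor" },
  { kind: "request", from: "A2", to: ["A1"], summary: "Need parser API frozen" },
  { kind: "evidence", from: "A1", to: ["A8"], summary: "Parser tests pass" },
];

// Project the shared log into a read-only inbox for one agent.
// The log stays the source of truth; the inbox is derived and can be regenerated.
function compileInbox(log, agentId) {
  return log
    .filter((entry) => entry.to.includes(agentId))
    .map((entry) => `[${entry.kind}] ${entry.from}: ${entry.summary}`);
}

console.log(compileInbox(coordinationLog, "A1"));
// [ '[request] A2: Need parser API frozen' ]
```

Because the inbox is a pure function of the log, it can be rebuilt at any time, which is the property that makes the board and inboxes projections rather than a second source of truth.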

package/docs/evals/README.md CHANGED

@@ -16,6 +16,8 @@ The catalog is reference metadata, not a run-history database. It tells the wave
 For a full authored wave example that uses these patterns, see [docs/reference/sample-waves.md](../reference/sample-waves.md).
+These benchmark families are also Wave's operator-facing vocabulary for common MAS failure modes. For the research-side framing and the current architectural gaps, see [docs/research/coordination-failure-review.md](../research/coordination-failure-review.md).
 ## Migrating From Legacy Evaluator Waves
 If your `0.5.4`-era repo still talks about a single `evaluator` role, split that surface before adopting `0.6.1`:

package/docs/guides/terminal-surfaces.md CHANGED

@@ -45,6 +45,8 @@ Use `tmux` when:
 By default the launcher can start per-wave dashboard sessions in tmux.
+When `--terminal-surface vscode` is active, Wave also maintains a stable current-wave dashboard terminal entry instead of creating a new wave-numbered dashboard attach target for every wave transition.
 Important flags:
 - `--no-dashboard`

package/docs/plans/wave-orchestrator.md CHANGED

@@ -4,6 +4,14 @@ The Wave Orchestrator coordinates repository work as bounded execution waves.
 For the broader docs map, concept pages, and workflow guides, start at [docs/README.md](../README.md).
+This runbook is the operational view of the architecture:
+- one wave contract defines goals, ownership, proof, and closure
+- one canonical coordination log acts as the shared blackboard state
+- generated board, shared summary, inboxes, ledger, and integration outputs are projections over that state
+- executor adapters preserve Claude, Codex, and OpenCode-specific runtime features at the edge
+- closure makes completion depend on integrated proof and shared state, not on free-form agent narration
 ## What It Does
 - parses wave plans from `docs/plans/waves/`
@@ -260,7 +268,7 @@ The launcher entrypoint in `scripts/wave-orchestrator/launcher.mjs` now delegate
 - Skills resolve only after that executor choice is known. Runtime-specific skill overlays are regenerated whenever retry-time fallback changes the selected executor.
 - Runtime mix targets are enforced before launch and again before any retry-time fallback reassignment.
 - Fallbacks are declared in profiles or lane policy, can be applied automatically on retry when the next executor is available and still satisfies mix targets, and are recorded in the ledger, integration summary, and traces when used.
-- Generic `budget.minutes` caps per-agent attempt timeouts. Generic `budget.turns` seeds `claude.maxTurns` and `opencode.steps` when executor-specific values are not set.
+- Generic `budget.minutes` caps per-agent attempt timeouts. Generic `budget.turns` seeds `claude.maxTurns` and `opencode.steps` when executor-specific values are not set; Codex turn ceilings remain external to Wave and show up in preview metadata as opaque when Wave cannot inspect them.
 - The launcher writes runtime overlay files under `.tmp/<lane>-wave-launcher/executors/`; these should stay ignored and local.
 Runtime authoring examples:
@@ -294,7 +302,7 @@ Runtime authoring examples:
 - opencode.config_json: {"instructions":["Keep shared-plan edits concise."]}
 ````
-Dry-run is the intended validation path for these runtime surfaces. `wave launch --dry-run --no-dashboard` now writes compiled prompts, merged runtime overlays, and `launch-preview.json` files under `.tmp/<lane>-wave-launcher/dry-run/` so the harness can verify invocation shape without requiring the executor binaries to run.
+Dry-run is the intended validation path for these runtime surfaces. `wave launch --dry-run --no-dashboard` now writes compiled prompts, merged runtime overlays, and `launch-preview.json` files under `.tmp/<lane>-wave-launcher/dry-run/` so the harness can verify invocation shape, attempt budgets, and known or opaque turn-limit metadata without requiring the executor binaries to run.
 ## Human Feedback Queue
@@ -308,7 +316,7 @@ pnpm exec wave feedback respond --id <request-id> --response "..."
 ## Closure Sweep
-If implementation agents ran, the launcher does not stop at `exit 0`. It checks implementation exit contracts, promoted component proof, helper assignments, required dependencies, and the integration recommendation first. When present, `cont-EVAL` must satisfy its declared eval targets before integration can close. Optional security review then runs before integration so the reviewer can publish findings and approval-sensitive actions while the wave is still active. In the default planner shape `E0` is report-only; if a wave explicitly assigns `E0` non-report files, the launcher also applies the normal implementation proof gates to that role. Security reviewers stay report-only by default. Documentation and cont-QA closure only run after integration is explicitly ready for doc closure; if `cont-EVAL`, security review, or integration reports more work, or if helper assignments or required dependency tickets remain open, the wave stops there and retries only the implicated owners plus the relevant closure steward.
+If implementation agents ran, the launcher does not stop at `exit 0`. It checks implementation exit contracts, promoted component proof, helper assignments, required dependencies, and the integration recommendation first. When present, `cont-EVAL` must satisfy its declared eval targets before integration can close. Optional security review then runs before integration so the reviewer can publish findings and approval-sensitive actions while the wave is still active. In the default planner shape `E0` is report-only; if a wave explicitly assigns `E0` non-report files, the launcher also applies the normal implementation proof gates to that role. Security reviewers stay report-only by default. Documentation and cont-QA closure only run after integration is explicitly ready for doc closure; if `cont-EVAL`, security review, or integration reports more work, or if helper assignments or required dependency tickets remain open, the wave stops there and retries only the implicated owners plus the relevant closure steward. When multiple implementation agents share a promoted component, owners that already landed valid proof stay reusable while the launcher retries only the sibling owners that still owe closure evidence.
 Live closure is fail-closed:
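
Note on the Closure Sweep change in the hunk above: the added sentence says owners of a shared promoted component that already landed valid proof stay reusable, and only sibling owners that still owe closure evidence are retried. The selection rule could be sketched as below. The record shape (`agentId`, `proofValid`) and the function name `selectOwnersToRetry` are assumptions for illustration, not the logic in `launcher.mjs`.

```javascript
// Hypothetical data shape: one record per implementation agent that owns the shared component.
function selectOwnersToRetry(owners) {
  // Keep landed owners untouched; relaunch only those without valid proof.
  return owners
    .filter((owner) => !owner.proofValid)
    .map((owner) => owner.agentId);
}

const owners = [
  { agentId: "A1", proofValid: true },  // already landed, stays reusable
  { agentId: "A2", proofValid: false }, // still owes closure evidence
  { agentId: "A3", proofValid: false },
];

console.log(selectOwnersToRetry(owners));
// [ 'A2', 'A3' ]
```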

package/docs/reference/runtime-config/README.md CHANGED

@@ -1,6 +1,6 @@
 # Runtime Configuration Reference
-This directory is the canonical reference for executor configuration in Wave `0.6.1`.
+This directory is the canonical reference for executor configuration in the packaged Wave release.
 Use it when you need the full supported surface for:
@@ -65,7 +65,7 @@ These fields are shared across runtimes:
 | Model | `model` in profile, `executors.claude.model`, `executors.opencode.model` | `model` | Codex uses shared `model` from profile or agent only |
 | Fallbacks | `fallbacks` in profile | `fallbacks` | Runtime ids used for retry-time reassignment |
 | Tags | `tags` in profile | `tags` | Stored in resolved executor state for policy and traces |
-| Budget turns | `budget.turns` in profile | `budget.turns` | Seeds Claude `maxTurns` and OpenCode `steps` when runtime-specific values are absent |
+| Budget turns | `budget.turns` in profile | `budget.turns` | Seeds Claude `maxTurns` and OpenCode `steps` when runtime-specific values are absent; it does not set a Codex turn limit |
 | Budget minutes | `budget.minutes` in profile | `budget.minutes` | Caps attempt timeout |
 ## Runtime Pages
@@ -83,7 +83,7 @@ Wave writes runtime artifacts here:
 Common files:
-- `launch-preview.json`: resolved invocation lines, env vars, and retry mode
+- `launch-preview.json`: resolved invocation lines, env vars, retry mode, and structured attempt/turn-limit metadata
 - `skills.resolved.md`: compact metadata-first skill catalog for the selected agent and runtime
 - `skills.expanded.md`: full canonical/debug skill payload with `SKILL.md` bodies and adapters
 - `skills.metadata.json`: resolved skill ids, activation metadata, permissions, hashes, and generated artifact paths
@@ -100,7 +100,7 @@ Runtime-specific delivery:
 - OpenCode injects the compact catalog into `opencode.json` and attaches `skill.json`, `SKILL.md`, the selected adapter, and recursive `references/**` files through `--file`.
 - Local keeps skills prompt-only.
-`launch-preview.json` also records the resolved skill metadata so dry-run can verify the exact runtime plus skill combination before any live launch.
+`launch-preview.json` also records the resolved skill metadata plus a `limits` section. For Claude and OpenCode, that section reports the known turn ceiling and whether it came from the runtime-specific setting or generic `budget.turns`. For Codex, it explicitly records that Wave emitted no turn-limit flag and that any effective ceiling may come from the selected Codex profile or upstream runtime.
 ## Recommended Validation Path
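
The turn-limit rule described in the hunks above can be sketched as follows. This is a minimal illustration, not Wave's actual implementation: the function name `resolveTurnLimit`, its argument shape, and the source labels `"runtime-specific"`, `"budget.turns"`, and `"none"` are assumptions made for this example. Only `"not-set-by-wave"` appears in the diffed docs.

```javascript
// Hypothetical helper; names and most labels are illustrative, not Wave's API.
function resolveTurnLimit(runtime, { runtimeTurns, budgetTurns } = {}) {
  // Codex: the docs state that Wave emits no turn-limit flag,
  // so generic budget.turns is ignored for this runtime.
  if (runtime === "codex") {
    return { turnLimit: null, turnLimitSource: "not-set-by-wave" };
  }
  if (runtime === "claude" || runtime === "opencode") {
    // A runtime-specific value (Claude maxTurns, OpenCode steps) wins.
    if (Number.isInteger(runtimeTurns)) {
      return { turnLimit: runtimeTurns, turnLimitSource: "runtime-specific" };
    }
    // Otherwise generic budget.turns seeds the ceiling.
    if (Number.isInteger(budgetTurns)) {
      return { turnLimit: budgetTurns, turnLimitSource: "budget.turns" };
    }
  }
  // No known ceiling (assumed label).
  return { turnLimit: null, turnLimitSource: "none" };
}
```

Under these assumptions, `resolveTurnLimit("opencode", { budgetTurns: 12 })` reports a ceiling of 12 seeded from `budget.turns`, while the same call for `"codex"` reports no Wave-set ceiling.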

package/docs/reference/runtime-config/claude.md CHANGED

@@ -12,6 +12,7 @@ Wave launches Claude headlessly with `claude -p --no-session-persistence`.
 | Prompt mode | `executors.claude.appendSystemPromptMode` | n/a | Uses `--append-system-prompt-file` or `--system-prompt-file` |
 | Permission mode | `executors.claude.permissionMode`, `executors.profiles.<name>.claude.permissionMode` | `claude.permission_mode` | Adds `--permission-mode <mode>` |
 | Permission prompt tool | `executors.claude.permissionPromptTool`, `executors.profiles.<name>.claude.permissionPromptTool` | `claude.permission_prompt_tool` | Adds `--permission-prompt-tool <tool>` |
+| Effort | `executors.claude.effort`, `executors.profiles.<name>.claude.effort` | `claude.effort` | Adds `--effort low|medium|high|max` |
 | Max turns | `executors.claude.maxTurns`, `executors.profiles.<name>.claude.maxTurns` | `claude.max_turns` | Adds `--max-turns <n>` |
 | MCP config | `executors.claude.mcpConfig`, `executors.profiles.<name>.claude.mcpConfig` | `claude.mcp_config` | Adds repeated `--mcp-config <path>` |
 | Strict MCP mode | `executors.claude.strictMcpConfig`, `executors.profiles.<name>.claude.strictMcpConfig` | n/a | Adds `--strict-mcp-config` |
@@ -27,6 +28,8 @@ Wave launches Claude headlessly with `claude -p --no-session-persistence`.
 Wave always writes `claude-system-prompt.txt` for the harness runtime instructions.
+Wave validates the effort enum only. Model-specific compatibility for values such as `max` remains enforced by Claude Code itself.
 Wave writes `claude-settings.json` only when at least one inline overlay input is present:
 - `settingsJson`
@@ -57,6 +60,7 @@ If no inline overlay data is present, Wave passes the base `claude.settings` fil
         },
         "claude": {
           "agent": "reviewer",
+          "effort": "high",
           "permissionMode": "plan",
           "allowedTools": ["Read"],
           "disallowedTools": ["Edit"]
@@ -84,6 +88,7 @@ If no inline overlay data is present, Wave passes the base `claude.settings` fil
 - id: claude
 - model: claude-sonnet-4-6
+- claude.effort: high
 - claude.permission_mode: plan
 - claude.max_turns: 4
 - claude.settings_json: {"permissions":{"allow":["Read"]}}
@@ -102,4 +107,4 @@ For a dry run, inspect:
 - `claude-settings.json`, when generated
 - `launch-preview.json`
-`launch-preview.json` shows the final `claude -p` invocation and whether `--settings`, `--allowedTools`, `--disallowedTools`, `--mcp-config`, or `--system-prompt-file` were included.
+`launch-preview.json` shows the final `claude -p` invocation, whether `--effort`, `--settings`, `--allowedTools`, `--disallowedTools`, `--mcp-config`, or `--system-prompt-file` were included, and the resolved `limits` block for attempt timeout plus known turn ceiling.
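
The enum-only validation for `claude.effort` described in this file's hunks might look roughly like the sketch below. The names `CLAUDE_EFFORT_VALUES` and `buildEffortArgs` are hypothetical and do not come from the package. The four allowed values and the `--effort` flag are taken from the diffed table row.

```javascript
// Allowed values as listed in the diffed docs: low|medium|high|max.
const CLAUDE_EFFORT_VALUES = ["low", "medium", "high", "max"];

// Hypothetical helper: returns the CLI args for an effort setting.
function buildEffortArgs(effort) {
  // No effort configured: emit no flag.
  if (effort === undefined || effort === null) return [];
  // Enum check only; model-specific compatibility (e.g. for "max")
  // is left to Claude Code itself, as the docs state.
  if (!CLAUDE_EFFORT_VALUES.includes(effort)) {
    throw new Error(
      `Invalid claude.effort "${effort}"; expected one of ${CLAUDE_EFFORT_VALUES.join(", ")}`
    );
  }
  return ["--effort", effort];
}
```

In this sketch an unset value yields no flag at all, so launches that do not configure effort are unchanged.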

package/docs/reference/runtime-config/codex.md CHANGED

@@ -20,6 +20,7 @@ Wave launches Codex with `codex exec` and pipes the generated task prompt throug
 ## Notes
 - There is no `executors.codex.model` key today. Use profile `model` or per-agent `model`.
+- Generic `budget.turns` does not set a Codex turn limit. If Codex stops on a turn ceiling, that limit came from the selected Codex profile or upstream Codex runtime, not from a Wave-emitted CLI flag.
 - `codex.images`, `codex.add_dirs`, and `codex.config` accept either a string array in `wave.config.json` or a comma-separated list in a wave file.
 - Relative paths are passed to Codex relative to the repository root because Wave launches the executor from the repo workspace.
@@ -35,7 +36,6 @@ Wave launches Codex with `codex exec` and pipes the generated task prompt throug
         "model": "gpt-5-codex",
         "fallbacks": ["claude", "opencode"],
         "budget": {
-          "turns": 12,
           "minutes": 45
         },
         "codex": {
@@ -78,4 +78,4 @@ For a dry run, inspect:
 - `launch-preview.json` for the final `codex exec` command
 - any referenced prompt file under `.tmp/<lane>-wave-launcher/dry-run/prompts/`
-The preview records the exact `--profile`, repeated `-c`, `--image`, and `--add-dir` flags that Wave would use in a live launch.
+The preview records the exact `--profile`, repeated `-c`, `--image`, and `--add-dir` flags that Wave would use in a live launch. It also includes a `limits` block that makes Wave's Codex visibility explicit: `turnLimitSource: "not-set-by-wave"` means Wave emitted no Codex turn-limit flag, so any effective ceiling is external to the Wave CLI invocation.
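
A reader of a parsed `launch-preview.json` could interpret the Codex `limits` block along these lines. This is a sketch under stated assumptions: the function name `describeTurnLimit` and the returned messages are invented for illustration, and the only schema details taken from the diff are the `limits` key and the `turnLimitSource: "not-set-by-wave"` value.

```javascript
// Hypothetical reader for an already-parsed launch-preview.json object.
function describeTurnLimit(preview) {
  const limits = preview && preview.limits;
  if (!limits) return "no limits block in preview";
  // Documented value: Wave emitted no Codex turn-limit flag.
  if (limits.turnLimitSource === "not-set-by-wave") {
    return "Wave set no turn limit; any ceiling comes from the Codex profile or upstream runtime";
  }
  // Any other source label is reported as-is (other values are not shown in the diff).
  return `turn limit source: ${limits.turnLimitSource}`;
}
```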

package/docs/reference/runtime-config/opencode.md CHANGED

@@ -90,4 +90,4 @@ For a dry run, inspect:
 - `opencode.json`
 - `launch-preview.json`
-`launch-preview.json` shows the final `opencode run` command and the exported `OPENCODE_CONFIG` path.
+`launch-preview.json` shows the final `opencode run` command, the exported `OPENCODE_CONFIG` path, and the resolved `limits` block for attempt timeout plus known step ceiling.

package/docs/research/agent-context-sources.md CHANGED

@@ -7,6 +7,8 @@ summary: "Primary external sources used as inspiration for planning, harness des
 This repository does not commit converted paper/article caches. Keep any hydrated local copies under `docs/research/agent-context-cache/` or another ignored cache directory.
+For a narrative synthesis of the most relevant MAS failure modes and how Wave responds to them, start with [coordination-failure-review.md](./coordination-failure-review.md) and then use this page as the bibliography.
 ## Practice Articles
 - [Harness engineering: leveraging Codex in an agent-first world](https://openai.com/index/harness-engineering/)

package/docs/research/coordination-failure-review.md CHANGED

@@ -17,7 +17,28 @@ The Wave orchestrator addresses several coordination failure modes constructivel
 That is materially stronger than the common "agents talk in a shared channel and we hope that was enough" pattern criticized by recent multi-agent papers.
-The main weakness is empirical, not architectural. The repo does not yet contain a benchmark family that proves the blackboard actually helps agents reconstruct distributed state under HiddenBench or Silo-Bench style pressure, or that it handles DPBench-style simultaneous coordination reliably.
+The main weakness is empirical, not architectural. The repo now carries coordination-oriented benchmark vocabulary, but it does not yet present enough hard evidence that the blackboard reconstructs distributed state under HiddenBench or Silo-Bench style pressure, or that it handles DPBench-style simultaneous coordination reliably.
+## Common MAS Failure Cases
+The research cited in this repo keeps returning to a fairly stable set of failure modes. In Wave language, the common ones are:
+- `Cosmetic board, no canonical state`
+  Agents appear coordinated because they share a board or chat, but there is no machine-trustable source of truth underneath. Wave responds with a canonical coordination log and treats the board as a projection.
+- `Hidden evidence never gets pooled`
+  One agent has the decision-changing fact, but it never reaches the shared state before closure. Wave responds with shared summaries, per-agent inboxes, and integration gating, but this still needs stronger empirical validation.
+- `Communication without global-state reconstruction`
+  Agents exchange information, yet nobody reconstructs the correct cross-agent picture. Wave responds with integration summaries and barrier-based closure so the final decision depends on integrated state rather than message volume.
+- `Simultaneous coordination collapse`
+  A team that looks competent in serial work falls apart when multiple owners, blockers, or resources must move together. Wave responds with helper assignments, dependency barriers, and staged closure, but still lacks a stronger contention benchmark story.
+- `Expert signal gets averaged away`
+  The strongest specialist view is diluted into a weaker compromise. Wave responds with explicit ownership, named stewards, and capability routing instead of free-form consensus, though expertise weighting is still shallow.
+- `Blackboard projection drift`
+  The raw shared state may be right, but summaries, inboxes, ledgers, or integration artifacts lose the important fact. Wave responds by compiling those surfaces from canonical state and by adding `blackboard-fidelity` to the eval vocabulary.
+- `Contradictions get smoothed over`
+  Conflicting claims look resolved in prose, but the system never turns them into bounded repair work. Wave responds with clarification flow, integration barriers, and contradiction-oriented eval vocabulary, though subtle semantic conflicts can still leak through.
+- `Premature closure`
+  Agents say they are done before proof, evals, or integrated state actually support PASS. Wave responds with structured proof markers, exit contracts, eval gates, closure stewards, and replay-visible traces.
 ## What The Papers Warn About
@@ -175,23 +196,26 @@ That alignment matters. In many MAS projects the docs promise a blackboard, but
 ## What Is Still Missing To Make The Claim Credible
-### 1. No distributed-information benchmark family yet
+### 1. The benchmark vocabulary exists, but the empirical proof is still thin
-The biggest gap is in [docs/evals/benchmark-catalog.json](../evals/benchmark-catalog.json). The current families are:
+[docs/evals/benchmark-catalog.json](../evals/benchmark-catalog.json) and [docs/evals/README.md](../evals/README.md) now define coordination-oriented benchmark families such as:
-- `service-output`
-- `latency`
-- `quality-regression`
+- `hidden-profile-pooling`
+- `silo-escape`
+- `simultaneous-coordination`
+- `expertise-leverage`
+- `blackboard-fidelity`
+- `contradiction-recovery`
-There is nothing yet for:
+That is a real improvement because the repo now has a vocabulary for the exact MAS failures the research highlights.
-- hidden-profile reconstruction
-- silo escape under partial information
-- blackboard consistency across raw log, summary, inboxes, ledger, and integration state
-- contradiction injection and recovery
-- simultaneous coordination under contention
+The remaining gap is not the absence of categories. The gap is still empirical proof:
-So the repo can reasonably claim "we built mechanisms intended to mitigate these failures," but it cannot yet claim "we demonstrated that these mechanisms overcome the failures highlighted by HiddenBench, Silo-Bench, or DPBench."
+- not enough published results showing those families are exercised systematically
+- not enough evidence that the blackboard actually improves hidden-state reconstruction
+- not enough stress data showing simultaneous coordination remains reliable under contention
+So the repo can reasonably claim "we built mechanisms and eval categories intended to mitigate these failures," but it still cannot claim "we demonstrated that those mechanisms consistently overcome the failures highlighted by HiddenBench, Silo-Bench, or DPBench."
 ### 2. Information integration is supported, but not measured directly

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "@chllming/wave-orchestration",
-  "version": "0.6.1",
+  "version": "0.6.2",
   "license": "MIT",
   "description": "Generic wave-based multi-agent orchestration for repository work.",
   "repository": {

package/releases/manifest.json CHANGED

@@ -2,6 +2,24 @@
   "schemaVersion": 1,
   "packageName": "@chllming/wave-orchestration",
   "releases": [
+    {
+      "version": "0.6.2",
+      "date": "2026-03-22",
+      "summary": "Runtime preview visibility, dashboard/operator UX fixes, dry-run cleanup, and safer shared-component retries.",
+      "features": [
+        "Claude runtime config now exposes first-class `claude.effort`, and runtime previews now include structured `limits` metadata for known attempt and turn ceilings.",
+        "Codex previews and docs now make turn-limit visibility explicit: Wave records when it emitted no Codex turn-limit flag and warns that any effective ceiling may come from the selected Codex profile or upstream runtime.",
+        "The dashboard surface now distinguishes done, active, pending, and failed counts, keeps a stable `Current Wave Dashboard` terminal target, and adds simple TTY color cues for faster scanning.",
+        "Dry-run executor preview directories are pruned when wave agent sets shrink, so stale overlay folders no longer linger under `.tmp/.../dry-run/executors/`.",
+        "Shared promoted-component retries now preserve already-landed owner slices and relaunch only the sibling owners still required for closure proof."
+      ],
+      "manualSteps": [
+        "After upgrading, rerun `pnpm exec wave doctor` and `pnpm exec wave launch --lane main --dry-run --no-dashboard` to inspect the new preview `limits` metadata and confirm your repo runtime config still resolves as expected.",
+        "If you relied on a local Claude wrapper only to inject `--effort`, move that setting into `wave.config.json` or the agent `### Executor` block and retire the wrapper when convenient.",
+        "If you document Codex turn ceilings in repo-local guidance, update that guidance to reflect that Wave now reports Codex ceiling visibility as opaque unless the limit is surfaced by the selected Codex profile or runtime."
+      ],
+      "breaking": false
+    },
     {
       "version": "0.6.1",
       "date": "2026-03-22",