@chllming/wave-orchestration 0.6.1 → 0.6.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (34)
  1. package/CHANGELOG.md +16 -0
  2. package/README.md +79 -30
  3. package/docs/README.md +15 -3
  4. package/docs/concepts/context7-vs-skills.md +24 -0
  5. package/docs/concepts/runtime-agnostic-orchestration.md +17 -2
  6. package/docs/concepts/what-is-a-wave.md +28 -0
  7. package/docs/evals/README.md +2 -0
  8. package/docs/guides/terminal-surfaces.md +2 -0
  9. package/docs/plans/current-state.md +2 -1
  10. package/docs/plans/wave-orchestrator.md +22 -3
  11. package/docs/reference/runtime-config/README.md +4 -4
  12. package/docs/reference/runtime-config/claude.md +6 -1
  13. package/docs/reference/runtime-config/codex.md +2 -2
  14. package/docs/reference/runtime-config/opencode.md +1 -1
  15. package/docs/research/agent-context-sources.md +2 -0
  16. package/docs/research/coordination-failure-review.md +37 -13
  17. package/package.json +1 -1
  18. package/releases/manifest.json +33 -0
  19. package/scripts/wave-autonomous.mjs +2 -4
  20. package/scripts/wave-orchestrator/adhoc.mjs +32 -11
  21. package/scripts/wave-orchestrator/agent-state.mjs +10 -3
  22. package/scripts/wave-orchestrator/autonomous.mjs +20 -6
  23. package/scripts/wave-orchestrator/config.mjs +19 -0
  24. package/scripts/wave-orchestrator/dashboard-renderer.mjs +150 -20
  25. package/scripts/wave-orchestrator/dashboard-state.mjs +8 -0
  26. package/scripts/wave-orchestrator/executors.mjs +67 -4
  27. package/scripts/wave-orchestrator/install.mjs +198 -25
  28. package/scripts/wave-orchestrator/launcher-runtime.mjs +1 -0
  29. package/scripts/wave-orchestrator/launcher.mjs +249 -10
  30. package/scripts/wave-orchestrator/package-update-notice.mjs +230 -0
  31. package/scripts/wave-orchestrator/package-version.mjs +32 -0
  32. package/scripts/wave-orchestrator/terminals.mjs +25 -0
  33. package/scripts/wave-orchestrator/wave-files.mjs +31 -0
  34. package/scripts/wave.mjs +12 -2
package/CHANGELOG.md CHANGED
@@ -2,6 +2,22 @@
2
2
 
3
3
  ## Unreleased
4
4
 
5
+ ## 0.6.3 - 2026-03-22
6
+
7
+ - Added a best-effort npmjs update notice on `wave launch`, `wave autonomous`, and `wave adhoc run`, with cached lookup state under `.wave/package-update-check.json` and opt-out via `WAVE_SKIP_UPDATE_CHECK=1`.
8
+ - Added `wave self-update`, which detects the workspace package manager, updates `@chllming/wave-orchestration`, prints the changelog delta since the recorded install, and then runs `wave upgrade`.
9
+ - Suppressed duplicate notices for nested launcher calls so autonomous and ad-hoc runs announce at most once, while keeping JSON-oriented stdout surfaces clean by emitting notices on stderr.
10
+ - Documented the new update flow and added regression coverage for notice caching, package-manager-aware self-update, and nested-launch suppression.
11
+
12
+ ## 0.6.2 - 2026-03-22
13
+
14
+ - Added first-class `claude.effort` support across config profiles, lane overrides, and per-agent `### Executor` blocks, and now emit `--effort` in Claude launch previews and live runs.
15
+ - Clarified operator runtime visibility with additive `launch-preview.json` `limits` metadata, including explicit known turn ceilings for Claude/OpenCode and explicit Codex opacity when Wave does not emit a turn-limit flag.
16
+ - Clarified dashboard and terminal UX: global wave counts now distinguish done, active, pending, and failed agents; the current-wave dashboard keeps a stable terminal name; and TTY dashboards use simple color cues for faster scanning.
17
+ - Pruned stale dry-run executor preview directories when wave agent sets shrink, so manual inspection of `.tmp/.../dry-run/executors/` matches the current manifest.
18
+ - Preserved already-landed implementation slices for shared promoted components by retrying only the sibling owners that still owe closure proof instead of blindly replaying the landed owner.
19
+ - Added release-surface alignment regression coverage and updated the shipped docs so README, runtime-config references, changelog, and release metadata match the `0.6.2` package surface.
20
+
5
21
  ## 0.6.1 - 2026-03-22
6
22
 
7
23
  - Published the post-merge `main` source as `0.6.1` so the default branch, tagged source, and package docs all agree on the current release.
package/README.md CHANGED
@@ -1,48 +1,90 @@
1
1
  # Wave Orchestration
2
2
 
3
- Wave Orchestration is a repository harness for running multi-agent work in bounded waves. You define shared plan docs plus per-wave markdown, the launcher validates the wave, compiles prompts and inboxes, runs implementation agents first, then performs staged closure. Every run writes durable state under `.tmp/<lane>-wave-launcher/` so humans can inspect progress, replay outcomes, and intervene only when needed.
3
+ Wave Orchestration is my framework for "vibe-coding." It keeps the speed of agentic coding, but makes the runtime, coordination, and context model explicit enough to inspect, replay, and improve.
4
+
5
+ The framework does three things:
6
+
7
+ 1. It abstracts the agent runtime away without flattening everything to the lowest common denominator. The same waves, skills, planning, evaluation, proof, and traces can run across Claude, Codex, and OpenCode while still preserving runtime-native features through executor adapters.
8
+ 2. It runs work as a blackboard-style multi-agent system. Agents do not just exchange chat messages; they work against shared state, generated inboxes, explicit ownership, and staged closure, and a wave keeps going until the declared goals, proof, production-live criteria, or eval targets are actually satisfied.
9
+ 3. It compiles context dynamically for the task at hand. Shared memory, generated runtime files, project defaults, skills, Context7, and cached external docs are assembled at runtime so you do not have to hand-maintain separate Claude, Codex, or other context files.
10
+
11
+ ## Core Ideas
12
+
13
+ - `One orchestrator, many runtimes.`
14
+ Planning, skills, evals, proof, and traces stay constant while the executor adapter changes.
15
+ - `A blackboard-style multi-agent system.`
16
+ The coordination log is canonical shared state; the rolling board, shared summary, inboxes, ledger, and integration views are generated projections over that state.
17
+ - `Completion is goal-driven and proof-bounded.`
18
+ Waves close only when deliverables, proof artifacts, eval targets, dependencies, and closure stewards agree.
19
+ - `Context is compiled, not hand-maintained.`
20
+ Wave builds runtime context from repo state, project memory, skills, Context7, and generated overlays.
21
+ - `The system is inspectable and replayable.`
22
+ Dry-run previews, logs, dashboards, ledgers, traces, and replay make the system debuggable instead of mysterious.
23
+
24
+ ## How The Architecture Works
25
+
26
+ 1. Define shared docs plus `docs/plans/waves/wave-<n>.md` files, or generate them with `wave draft`.
27
+ 2. Run `wave launch --dry-run` to validate the wave and materialize prompts, shared summaries, inboxes, dashboards, and executor previews before any live execution.
28
+ 3. During live execution, implementation agents write claims, evidence, requests, and decisions into the canonical coordination log instead of relying on ad hoc terminal narration.
29
+ 4. The launcher compiles blackboard projections from that state: rolling board, shared summary, per-agent inboxes, ledger, docs queue, dependency views, and integration summaries.
30
+ 5. Closure runs only when the integrated state is ready: optional `cont-EVAL` (`E0`), optional security review, integration (`A8`), documentation (`A9`), and `cont-QA` (`A0`).
31
+
32
+ ## Architecture Surfaces
33
+
34
+ - `Wave contract`
35
+ Shared plan docs, wave markdown, deliverables, proof artifacts, and eval targets define the goal.
36
+ - `Shared state`
37
+ The coordination log is the source of truth; the board is for humans, not the scheduler.
38
+ - `Runtime abstraction`
39
+ Executor adapters preserve Codex, Claude, and OpenCode-specific launch features without changing the higher-level wave contract.
40
+ - `Compiled context`
41
+ Project profile memory, shared summary, inboxes, skills, Context7, and runtime overlays are generated for the chosen executor.
42
+ - `Proof and closure`
43
+ Exit contracts, proof artifacts, eval markers, and closure stewards stop waves from closing on narrative-only PASS.
44
+ - `Replay and audit`
45
+ Traces capture the attempt so failures can be inspected and replayed instead of guessed from screenshots.
4
46
 
5
- ## How It Works
47
+ ## Example Output
6
48
 
7
- 1. Write shared docs and one or more `docs/plans/waves/wave-<n>.md` files.
8
- 2. Run `wave launch --dry-run` to validate the wave and materialize prompts, inboxes, dashboards, and executor previews.
9
- 3. A real launch runs implementation agents first. Agents post claims, evidence, requests, and decisions into the coordination log and rolling message board.
10
- 4. When implementation gates pass, closure runs in order: optional `cont-EVAL` (`E0`), integration (`A8`), documentation (`A9`), and `cont-QA` (`A0`).
11
- 5. Operators use the generated ledgers, inboxes, feedback queue, dependency views, and traces instead of guessing from raw terminal output.
49
+ Representative rolling message board output from a real wave run:
12
50
 
13
- ## Features
51
+ <img src="./docs/image.png" alt="Example rolling message board output showing claims, evidence, requests, and cont-QA closure for a wave run" width="100%" />
14
52
 
15
- - Planner foundation with saved project profile memory, draft specs, and rendered wave markdown
16
- - Implementation-first execution with staged closure and retry support
17
- - Durable coordination log, rolling message board, compiled inboxes, and per-wave ledger
18
- - Dry-run prompt and executor preview mode before any real agent launch
19
- - Context7 bundle selection, caching, and prompt injection
20
- - Multi-executor support for Codex, Claude Code, OpenCode, and a local smoke executor
21
- - Cross-runtime skill packs loaded from `skills/` and resolved by lane, role, runtime, deploy kind, and per-agent attachment
22
- - Human feedback routing, clarification triage, helper assignment, and cross-lane dependencies
23
- - Replayable trace bundles for regression and release verification
53
+ ## Common MAS Failure Cases
24
54
 
25
- ## Example Output
55
+ Recent multi-agent research keeps returning to the same failure modes:
26
56
 
27
- Representative rolling message board output from a real wave run:
57
+ - `Cosmetic board, no canonical state`
58
+ Agents appear coordinated, but there is no machine-trustable source of truth underneath the conversation.
59
+ - `Hidden evidence never gets pooled`
60
+ One agent has the critical fact, but it never reaches shared state before closure.
61
+ - `Communication without global-state reconstruction`
62
+ Agents exchange information, but nobody reconstructs the correct cross-agent picture.
63
+ - `Simultaneous coordination collapse`
64
+ A team that looks fine in serial work falls apart when multiple owners, blockers, or resources must move together.
65
+ - `Expert signal gets averaged away`
66
+ The strongest specialist view is diluted into a weaker compromise.
67
+ - `Contradictions get smoothed over`
68
+ Conflicts are narrated away instead of being turned into explicit repair work.
69
+ - `Premature closure`
70
+ Agents say they are done before proof, evals, or integrated state actually support PASS.
28
71
 
29
- <img src="./docs/image.png" alt="Example rolling message board output showing claims, evidence, requests, and cont-QA closure for a wave run" width="100%" />
72
+ Wave is built to mitigate those failures with canonical shared state, generated blackboard projections, explicit ownership, goal-driven, proof-bounded closure, and replayable traces. For the research framing and the current gaps, see [docs/research/coordination-failure-review.md](./docs/research/coordination-failure-review.md).
30
73
 
31
74
  ## Quick Start
32
75
 
33
76
  Current release:
34
77
 
35
- - `@chllming/wave-orchestration@0.6.1`
36
- - Release tag: [`v0.6.1`](https://github.com/chllming/wave-orchestration/releases/tag/v0.6.1)
78
+ - `@chllming/wave-orchestration@0.6.3`
79
+ - Release tag: [`v0.6.3`](https://github.com/chllming/wave-orchestration/releases/tag/v0.6.3)
37
80
  - Public install path: npmjs
38
81
  - Authenticated fallback: GitHub Packages
39
82
 
40
- Highlights in `0.6.1`:
83
+ Highlights in `0.6.3`:
41
84
 
42
- - `cont-EVAL` (`E0`) is now a first-class optional eval stage before integration, separate from final `cont-QA` closure.
43
- - Optional security review now has a dedicated role, report path, and `[wave-security]` closure marker.
44
- - `wave adhoc plan|run|show|promote` now supports transient operator requests on the same launcher substrate.
45
- - Starter docs and skills now cover the current `0.6.1` closure, benchmark, security, and provider surfaces.
85
+ - Runtime launch entrypoints now check npmjs for a newer published package in the background, cache the result under `.wave/package-update-check.json`, and warn on stderr when the workspace is behind.
86
+ - `wave self-update` now gives downstream repos a one-command update path that detects the workspace package manager, updates the dependency, shows the changelog delta, and records the workspace upgrade report.
87
+ - Autonomous and ad-hoc flows suppress nested notices so operators see at most one update banner per top-level run, and structured stdout remains clean for JSON consumers.
46
88
 
47
89
  Requirements:
48
90
 
@@ -59,7 +101,7 @@ pnpm add -D @chllming/wave-orchestration
59
101
  pnpm exec wave init
60
102
  pnpm exec wave doctor
61
103
  pnpm exec wave launch --lane main --dry-run --no-dashboard
62
- pnpm exec wave coord show --lane main --wave 0 --dry-run
104
+ pnpm exec wave coord show --lane main --wave 0 --dry-run --json
63
105
  ```
64
106
 
65
107
  If the repo already has Wave config, plans, or waves you want to keep:
@@ -70,6 +112,8 @@ pnpm exec wave init --adopt-existing
70
112
 
71
113
  Fresh init also seeds a starter `skills/` library plus `docs/evals/benchmark-catalog.json`. The launcher projects those skill bundles into Codex, Claude, OpenCode, and local executor overlays after the final runtime for each agent is resolved, and waves that include `cont-EVAL` can declare `## Eval targets` against that catalog.
72
114
 
115
+ When runtime launch commands detect a newer npmjs release, Wave prints a non-blocking update notice on stderr. The fast path is `pnpm exec wave self-update`, which updates the dependency, prints the changelog delta, and then records the workspace upgrade report.
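
As an illustration of the update-notice behavior described above and in the 0.6.3 changelog entry, the following is a minimal sketch and not the code shipped in `package-update-notice.mjs`. The function names, the cache fields (`latestVersion`, `checkedAt`), and the six-hour refresh interval are assumptions made for this example; only the `WAVE_SKIP_UPDATE_CHECK=1` opt-out, the cached lookup, and the stderr notice come from the release notes.

```javascript
// Hypothetical sketch: names, cache fields, and TTL are illustrative assumptions.
const CACHE_TTL_MS = 6 * 60 * 60 * 1000; // assumed refresh interval, not documented

// Compare two plain "major.minor.patch" versions; returns -1, 0, or 1.
function compareVersions(a, b) {
  const pa = a.split(".").map(Number);
  const pb = b.split(".").map(Number);
  for (let i = 0; i < 3; i += 1) {
    const diff = (pa[i] || 0) - (pb[i] || 0);
    if (diff !== 0) return diff < 0 ? -1 : 1;
  }
  return 0;
}

// Decide whether to refresh the cached lookup and whether to print a notice.
function planUpdateNotice({ env, installedVersion, cache, now }) {
  // Documented opt-out: skip both the lookup and the notice.
  if (env.WAVE_SKIP_UPDATE_CHECK === "1") {
    return { refresh: false, notice: null };
  }
  const cacheIsFresh = Boolean(cache) && now - cache.checkedAt < CACHE_TTL_MS;
  const latest = cache ? cache.latestVersion : null;
  const behind = latest !== null && compareVersions(installedVersion, latest) < 0;
  return {
    refresh: !cacheIsFresh,
    notice: behind
      ? `wave ${latest} is available (installed ${installedVersion}); run: pnpm exec wave self-update`
      : null,
  };
}
```

A caller would print `plan.notice` with `console.error` so the message lands on stderr and JSON-oriented stdout stays clean, which is the property the release notes call out.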
116
+
73
117
  ## Common Commands
74
118
 
75
119
  ```bash
@@ -86,6 +130,9 @@ pnpm exec wave dep show --lane main --wave 0 --json
86
130
 
87
131
  # Run autonomous mode after the wave set is stable
88
132
  pnpm exec wave autonomous --lane main --executor codex --codex-sandbox danger-full-access
133
+
134
+ # Pull the latest published package and record the workspace upgrade
135
+ pnpm exec wave self-update
89
136
  ```
90
137
 
91
138
  ## Develop This Package
@@ -99,14 +146,16 @@ node scripts/wave.mjs launch --lane main --dry-run --no-dashboard
99
146
  ## Learn More
100
147
 
101
148
  - [docs/README.md](./docs/README.md): docs map and suggested structure
102
- - [docs/concepts/what-is-a-wave.md](./docs/concepts/what-is-a-wave.md): wave anatomy, lifecycle, and closure model
149
+ - [docs/concepts/what-is-a-wave.md](./docs/concepts/what-is-a-wave.md): wave anatomy, blackboard execution model, and proof-bounded closure
150
+ - [docs/concepts/runtime-agnostic-orchestration.md](./docs/concepts/runtime-agnostic-orchestration.md): how one orchestration substrate spans Claude, Codex, OpenCode, and local execution
151
+ - [docs/concepts/context7-vs-skills.md](./docs/concepts/context7-vs-skills.md): compiled context, external truth, and repo-owned operating knowledge
103
152
  - [docs/guides/planner.md](./docs/guides/planner.md): `wave project` and `wave draft` workflow
104
- - [docs/concepts/context7-vs-skills.md](./docs/concepts/context7-vs-skills.md): when to use external docs vs repo-owned skills
105
153
  - [docs/guides/terminal-surfaces.md](./docs/guides/terminal-surfaces.md): tmux, VS Code terminal registry, and dry-run surfaces
106
154
  - [docs/plans/wave-orchestrator.md](./docs/plans/wave-orchestrator.md): operator runbook
107
155
  - [docs/plans/context7-wave-orchestrator.md](./docs/plans/context7-wave-orchestrator.md): Context7 setup and bundle authoring
108
156
  - [docs/reference/runtime-config/README.md](./docs/reference/runtime-config/README.md): executor, runtime, and skill-projection configuration
109
157
  - [docs/reference/skills.md](./docs/reference/skills.md): skill bundle format, resolution order, and runtime projection
158
+ - [docs/research/coordination-failure-review.md](./docs/research/coordination-failure-review.md): MAS failure modes from the research and how Wave responds
110
159
  - [CHANGELOG.md](./CHANGELOG.md): release history
111
160
 
112
161
  ## Research Sources
package/docs/README.md CHANGED
@@ -1,6 +1,12 @@
1
1
  # Wave Documentation
2
2
 
3
- This repository now uses a layered docs structure, but the useful path is journey-first:
3
+ These docs are organized around three core ideas:
4
+
5
+ - one orchestrator, many runtimes across Claude, Codex, OpenCode, and local execution
6
+ - a blackboard-style multi-agent system with goal-driven, proof-bounded closure
7
+ - compiled context from shared state, skills, runtime files, and Context7 instead of hand-maintained per-runtime context files
8
+
9
+ The useful path is journey-first:
4
10
 
5
11
  - start with one core concept doc
6
12
  - then use one end-to-end workflow guide
@@ -22,7 +28,11 @@ This repository now uses a layered docs structure, but the useful path is journe
22
28
  ## Start Here
23
29
 
24
30
  - New to Wave:
25
- Read [concepts/what-is-a-wave.md](./concepts/what-is-a-wave.md). It now covers the core execution model, runtime posture, closure, and state model in one place.
31
+ Read [concepts/what-is-a-wave.md](./concepts/what-is-a-wave.md). It covers the blackboard execution model, proof-bounded closure, runtime posture, and durable state model in one place.
32
+ - Want the runtime abstraction story:
33
+ Read [concepts/runtime-agnostic-orchestration.md](./concepts/runtime-agnostic-orchestration.md) to see how planning, skills, evals, proof, and traces stay stable across Claude, Codex, OpenCode, and local execution.
34
+ - Want the context story:
35
+ Read [concepts/context7-vs-skills.md](./concepts/context7-vs-skills.md) for the compiled-context model: shared summary, inboxes, project defaults, skills, Context7, and runtime overlays.
26
36
  - Drafting or revising waves:
27
37
  Read [guides/author-and-run-waves.md](./guides/author-and-run-waves.md), then use [plans/wave-orchestrator.md](./plans/wave-orchestrator.md) as the operator runbook.
28
38
  - Adding a security review pass:
@@ -37,8 +47,10 @@ This repository now uses a layered docs structure, but the useful path is journe
37
47
  Start with [guides/author-and-run-waves.md](./guides/author-and-run-waves.md), then use [plans/wave-orchestrator.md](./plans/wave-orchestrator.md) for the live operator flow.
38
48
  - Tuning runtime behavior:
39
49
  Read [reference/runtime-config/README.md](./reference/runtime-config/README.md) and [reference/skills.md](./reference/skills.md).
50
+ - Want the research framing behind the design:
51
+ Read [research/coordination-failure-review.md](./research/coordination-failure-review.md) for the common MAS failure modes and how Wave tries to mitigate them, then use [research/agent-context-sources.md](./research/agent-context-sources.md) as the bibliography.
40
52
  - Looking for supporting concept pages:
41
- Use [concepts/runtime-agnostic-orchestration.md](./concepts/runtime-agnostic-orchestration.md), [concepts/operating-modes.md](./concepts/operating-modes.md), and [concepts/context7-vs-skills.md](./concepts/context7-vs-skills.md) after the main concept and workflow docs.
53
+ Use [concepts/operating-modes.md](./concepts/operating-modes.md) after the main concept, runtime, and context docs.
42
54
 
43
55
  ## Package vs Repo-Owned Material
44
56
 
package/docs/concepts/context7-vs-skills.md CHANGED
@@ -4,6 +4,30 @@ Context7 and skills solve different problems.
4
4
 
5
5
  Use Context7 for external library truth. Use skills for repo-owned, reusable operating knowledge.
6
6
 
7
+ That comparison matters because Wave treats context as something to compile at runtime, not something humans should maintain separately for Claude, Codex, OpenCode, and every other executor.
8
+
9
+ ## Compiled Context, Not Hand-Maintained Context Files
10
+
11
+ The active context for an agent is assembled from multiple layers:
12
+
13
+ - repository source and the wave's owned files
14
+ - wave markdown and shared plan docs
15
+ - generated shared summary and per-agent inbox
16
+ - saved project defaults such as `.wave/project-profile.json`
17
+ - resolved repo-owned skills
18
+ - selected Context7 snippets for external library truth
19
+ - generated runtime overlays and launch artifacts
20
+
21
+ Because of that, the question is not "which hand-written context file does this runtime use?" The question is "which context sources does this wave compile for the selected runtime right now?"
22
+
23
+ Runtime-specific context is still real, but it is mostly generated:
24
+
25
+ - Claude gets merged system-prompt and settings overlays
26
+ - Codex gets executor flags plus runtime-projected skills
27
+ - OpenCode gets generated config, attachments, and runtime instructions
28
+
29
+ That keeps the context model unified even when the transport layer differs.
30
+
7
31
  ## Short Version
8
32
 
9
33
  - Context7
package/docs/concepts/runtime-agnostic-orchestration.md CHANGED
@@ -1,15 +1,22 @@
1
1
  # Runtime-Agnostic Orchestration
2
2
 
3
+ In short: one orchestrator, many runtimes.
4
+
3
5
  Wave is runtime agnostic at the orchestration layer.
4
6
 
5
- That means planning, coordination, closure, and traces do not depend on whether the selected executor is Codex, Claude Code, OpenCode, or the local smoke executor.
7
+ That means planning, skills, evaluation, proof, coordination, closure, and traces do not depend on whether the selected executor is Codex, Claude Code, OpenCode, or the local smoke executor.
8
+
9
+ Wave abstracts the runtime away without flattening everything to the lowest common denominator. The wave contract stays stable while the executor adapter preserves the useful runtime-native features.
6
10
 
7
11
  ## What Stays The Same Across Runtimes
8
12
 
9
13
  These layers are runtime-neutral:
10
14
 
11
15
  - wave parsing and validation
16
+ - planner-produced wave specs and authored wave markdown
17
+ - eval targets, deliverables, and proof artifacts
12
18
  - component and closure gates
19
+ - skill resolution and attachment policy
13
20
  - compiled shared summaries and per-agent inboxes
14
21
  - coordination log and rendered message board
15
22
  - helper assignments and dependency handling
@@ -34,11 +41,19 @@ Runtime-specific behavior is isolated to the executor adapter layer:
34
41
 
35
42
  The orchestration substrate above those adapters does not need to know how the runtime transports prompts.
36
43
 
44
+ This is the important distinction:
45
+
46
+ - the orchestration layer owns goals, ownership, proof, and shared state
47
+ - the executor adapter owns prompt transport, runtime-native flags, files, and settings
48
+
49
+ That split is what lets Wave stay portable without giving up runtime-specific leverage.
50
+
37
51
  ## Why This Matters
38
52
 
39
53
  Runtime agnosticism gives you:
40
54
 
41
- - the same plan and closure model across vendors
55
+ - the same plan, skill, and closure model across vendors
56
+ - the same eval and proof model across vendors
42
57
  - replay and audit surfaces that do not care which runtime produced the work
43
58
  - per-role runtime choice without rewriting authoring conventions
44
59
  - retry-time fallback without inventing a second planning model
package/docs/concepts/what-is-a-wave.md CHANGED
@@ -2,6 +2,8 @@
2
2
 
3
3
  A wave is the main planning and execution unit in Wave Orchestration.
4
4
 
5
+ It turns free-form agent runs into a bounded blackboard-style work package with shared state, explicit ownership, dynamic context, goal-driven execution, and proof-bounded closure.
6
+
5
7
  It is not just a prompt file. A wave is a bounded slice of repository work with:
6
8
 
7
9
  - explicit scope
@@ -34,6 +36,16 @@ Waves force a higher planning bar than ad hoc prompts. A good wave answers:
34
36
  - What evidence closes the wave?
35
37
  - Which dependencies, helper requests, or escalations can still block completion?
36
38
 
39
+ ## Why This Is A Blackboard-Style Model
40
+
41
+ Wave is blackboard-style because agents work against shared state instead of treating chat output as the system of record.
42
+
43
+ - the canonical coordination log is the machine-readable source of truth
44
+ - the rolling board is a human projection over that state, not the scheduler's authority
45
+ - shared summaries and per-agent inboxes are compiled views over the same state
46
+ - helper assignments, clarification flow, dependencies, and integration all operate on that shared state
47
+ - closure depends on the integrated state, not on whether an agent says "done"
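
To make the "projection over shared state" idea in the list above concrete, here is a minimal sketch. The record shape (`kind`, `from`, `to`, `summary`) and both function names are hypothetical, since this diff does not show the real coordination-log schema. The point is only that inboxes and the board are derived views and never a second source of truth.

```javascript
// Hypothetical record shape: { kind, from, to: [agentId, ...], summary }.
// The actual coordination-log schema is not shown in this diff.

// Build one inbox per agent by filtering the canonical log.
function compileInboxes(log, agentIds) {
  const inboxes = new Map(agentIds.map((id) => [id, []]));
  for (const record of log) {
    for (const target of record.to ?? []) {
      if (inboxes.has(target)) inboxes.get(target).push(record);
    }
  }
  return inboxes;
}

// Render a human-readable board from the same log; nothing reads it back.
function renderBoard(log) {
  return log.map((r) => `[${r.kind}] ${r.from}: ${r.summary}`).join("\n");
}

const log = [
  { kind: "claim", from: "A1", to: ["A8"], summary: "Owns launcher changes" },
  { kind: "request", from: "A2", to: ["A1", "A8"], summary: "Need proof artifact path" },
];
const inboxes = compileInboxes(log, ["A1", "A2", "A8"]);
const board = renderBoard(log);
```

Because both views are recomputed from the log, deleting the board or an inbox loses nothing; regenerating them from the same records yields the same result.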
48
+
37
49
  ## Wave Anatomy
38
50
 
39
51
  Wave markdown is the authored execution surface today. A typical wave can include:
@@ -136,6 +148,22 @@ Current live waves are strict about closure artifacts:
136
148
  - `cont-QA` must emit both a final `Verdict:` line and a final `[wave-gate]` marker.
137
149
  - Replay keeps read-only compatibility with older traces and older evaluator-era artifacts, but live waves do not pass on verdict-only or underspecified closure markers.
138
150
 
151
+ ## Context Is Compiled At Runtime
152
+
153
+ Wave also treats context as something to compile for the current task, not something humans should hand-maintain separately for each runtime.
154
+
155
+ The active context for an agent is assembled from:
156
+
157
+ - repository source and owned files
158
+ - wave markdown and shared plan docs
159
+ - saved project defaults such as `.wave/project-profile.json`
160
+ - the generated shared summary and the agent's inbox
161
+ - resolved skills and runtime-specific skill projections
162
+ - selected Context7 snippets for external library truth
163
+ - generated executor overlays and launch artifacts
164
+
165
+ That is why switching an agent between Codex, Claude, or OpenCode does not require maintaining separate parallel context files. The orchestrator recomputes the context package for the selected runtime and the current wave state.
166
+
139
167
  ## What Makes A Wave "Done"
140
168
 
141
169
  A wave is not done because an agent said so. It is done only when the runtime surfaces agree:
package/docs/evals/README.md CHANGED
@@ -16,6 +16,8 @@ The catalog is reference metadata, not a run-history database. It tells the wave
16
16
 
17
17
  For a full authored wave example that uses these patterns, see [docs/reference/sample-waves.md](../reference/sample-waves.md).
18
18
 
19
+ These benchmark families are also Wave's operator-facing vocabulary for common MAS failure modes. For the research-side framing and the current architectural gaps, see [docs/research/coordination-failure-review.md](../research/coordination-failure-review.md).
20
+
19
21
  ## Migrating From Legacy Evaluator Waves
20
22
 
21
23
  If your `0.5.4`-era repo still talks about a single `evaluator` role, split that surface before adopting `0.6.1`:
package/docs/guides/terminal-surfaces.md CHANGED
@@ -45,6 +45,8 @@ Use `tmux` when:
45
45
 
46
46
  By default the launcher can start per-wave dashboard sessions in tmux.
47
47
 
48
+ When `--terminal-surface vscode` is active, Wave also maintains a stable current-wave dashboard terminal entry instead of creating a new wave-numbered dashboard attach target for every wave transition.
49
+
48
50
  Important flags:
49
51
 
50
52
  - `--no-dashboard`
package/docs/plans/current-state.md CHANGED
@@ -1,8 +1,9 @@
1
1
  # Current State
2
2
 
3
- - The starter workspace in this source repo reflects the `0.6.1` package release surface.
3
+ - The starter workspace in this source repo reflects the `0.6.3` package release surface.
4
4
  - The repository contains the published `@chllming/wave-orchestration` package plus the starter scaffold used by `wave init`.
5
5
  - The runtime is package-first and non-destructive for adopting repos: `wave init --adopt-existing` records existing repo-owned plans, waves, prompts, and config without overwriting them, and `wave upgrade` writes only `.wave/install-state.json` plus `.wave/upgrade-history/`.
6
+ - Runtime launch entrypoints now perform a best-effort npmjs version check, cache the result under `.wave/package-update-check.json`, and point operators at `pnpm exec wave self-update` when a newer published package exists.
6
7
  - This source repo is itself kept as an adopted Wave workspace, so `node scripts/wave.mjs doctor --json` should pass from the repo root.
7
8
  - The default lane is `main`.
8
9
  - Planner foundation is now shipped:
package/docs/plans/wave-orchestrator.md CHANGED
@@ -4,6 +4,14 @@ The Wave Orchestrator coordinates repository work as bounded execution waves.
4
4
 
5
5
  For the broader docs map, concept pages, and workflow guides, start at [docs/README.md](../README.md).
6
6
 
7
+ This runbook is the operational view of the architecture:
8
+
9
+ - one wave contract defines goals, ownership, proof, and closure
10
+ - one canonical coordination log acts as the shared blackboard state
11
+ - generated board, shared summary, inboxes, ledger, and integration outputs are projections over that state
12
+ - executor adapters preserve Claude, Codex, and OpenCode-specific runtime features at the edge
13
+ - closure makes completion depend on integrated proof and shared state, not on free-form agent narration
14
+
7
15
  ## What It Does
8
16
 
9
17
  - parses wave plans from `docs/plans/waves/`
@@ -46,6 +54,7 @@ For the broader docs map, concept pages, and workflow guides, start at [docs/REA
46
54
  - `pnpm exec wave dep show --lane main --wave 0 --json`
47
55
  - `pnpm exec wave dep post --owner-lane main --requester-lane release --owner-wave 0 --requester-wave 2 --agent launcher --summary "Need shared-plan reconciliation" --target capability:docs-shared-plan --required`
48
56
  - `pnpm exec wave upgrade`
57
+ - `pnpm exec wave self-update`
49
58
 
50
59
  ## Configuration
51
60
 
@@ -143,6 +152,16 @@ Required inbound dependencies block autonomous next-wave start and lane finaliza
143
152
 
144
153
  ## Upgrade Flow
145
154
 
155
+ Fast path:
156
+
157
+ ```bash
158
+ pnpm exec wave self-update
159
+ ```
160
+
161
+ That command updates the dependency through the workspace package manager, prints the changelog delta since the recorded install, and then runs `wave upgrade` to record the new install-state and upgrade report.
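
The "workspace package manager" step could plausibly be driven by lockfile detection. The sketch below is an assumption about one way to do that and is not taken from the package source. The function names, the npm default, and the choice to install as a dev dependency at `@latest` are all hypothetical; the lockfile names and CLI flags are the standard ones for each tool.

```javascript
// Hypothetical sketch: lockfile-based detection is an assumed strategy.
const LOCKFILES = [
  ["pnpm-lock.yaml", "pnpm"],
  ["yarn.lock", "yarn"],
  ["bun.lockb", "bun"],
  ["package-lock.json", "npm"],
];

// Pick a package manager from the file names found in the workspace root.
function detectPackageManager(rootFiles) {
  const present = new Set(rootFiles);
  const match = LOCKFILES.find(([lockfile]) => present.has(lockfile));
  return match ? match[1] : "npm"; // assumed default when no lockfile exists
}

// Build the argv for updating the dependency with the detected manager.
function buildUpdateCommand(manager, pkg = "@chllming/wave-orchestration") {
  const spec = `${pkg}@latest`;
  switch (manager) {
    case "pnpm":
      return ["pnpm", "add", "-D", spec];
    case "yarn":
      return ["yarn", "add", "-D", spec];
    case "bun":
      return ["bun", "add", "-d", spec];
    default:
      return ["npm", "install", "--save-dev", spec];
  }
}
```

Returning an argv array instead of a shell string keeps the command safe to pass to a process spawner without quoting concerns.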
162
+
163
+ Manual path:
164
+
146
165
  1. Upgrade the package version:
147
166
 
148
167
  ```bash
@@ -260,7 +279,7 @@ The launcher entrypoint in `scripts/wave-orchestrator/launcher.mjs` now delegate
  - Skills resolve only after that executor choice is known. Runtime-specific skill overlays are regenerated whenever retry-time fallback changes the selected executor.
  - Runtime mix targets are enforced before launch and again before any retry-time fallback reassignment.
  - Fallbacks are declared in profiles or lane policy, can be applied automatically on retry when the next executor is available and still satisfies mix targets, and are recorded in the ledger, integration summary, and traces when used.
- - Generic `budget.minutes` caps per-agent attempt timeouts. Generic `budget.turns` seeds `claude.maxTurns` and `opencode.steps` when executor-specific values are not set.
+ - Generic `budget.minutes` caps per-agent attempt timeouts. Generic `budget.turns` seeds `claude.maxTurns` and `opencode.steps` when executor-specific values are not set; Codex turn ceilings remain external to Wave and show up in preview metadata as opaque when Wave cannot inspect them.
  - The launcher writes runtime overlay files under `.tmp/<lane>-wave-launcher/executors/`; these should stay ignored and local.
 
  Runtime authoring examples:
@@ -294,7 +313,7 @@ Runtime authoring examples:
  - opencode.config_json: {"instructions":["Keep shared-plan edits concise."]}
  ````
 
- Dry-run is the intended validation path for these runtime surfaces. `wave launch --dry-run --no-dashboard` now writes compiled prompts, merged runtime overlays, and `launch-preview.json` files under `.tmp/<lane>-wave-launcher/dry-run/` so the harness can verify invocation shape without requiring the executor binaries to run.
+ Dry-run is the intended validation path for these runtime surfaces. `wave launch --dry-run --no-dashboard` now writes compiled prompts, merged runtime overlays, and `launch-preview.json` files under `.tmp/<lane>-wave-launcher/dry-run/` so the harness can verify invocation shape, attempt budgets, and known or opaque turn-limit metadata without requiring the executor binaries to run.
 
  ## Human Feedback Queue
 
@@ -308,7 +327,7 @@ pnpm exec wave feedback respond --id <request-id> --response "..."
 
  ## Closure Sweep
 
- If implementation agents ran, the launcher does not stop at `exit 0`. It checks implementation exit contracts, promoted component proof, helper assignments, required dependencies, and the integration recommendation first. When present, `cont-EVAL` must satisfy its declared eval targets before integration can close. Optional security review then runs before integration so the reviewer can publish findings and approval-sensitive actions while the wave is still active. In the default planner shape `E0` is report-only; if a wave explicitly assigns `E0` non-report files, the launcher also applies the normal implementation proof gates to that role. Security reviewers stay report-only by default. Documentation and cont-QA closure only run after integration is explicitly ready for doc closure; if `cont-EVAL`, security review, or integration reports more work, or if helper assignments or required dependency tickets remain open, the wave stops there and retries only the implicated owners plus the relevant closure steward.
+ If implementation agents ran, the launcher does not stop at `exit 0`. It checks implementation exit contracts, promoted component proof, helper assignments, required dependencies, and the integration recommendation first. When present, `cont-EVAL` must satisfy its declared eval targets before integration can close. Optional security review then runs before integration so the reviewer can publish findings and approval-sensitive actions while the wave is still active. In the default planner shape `E0` is report-only; if a wave explicitly assigns `E0` non-report files, the launcher also applies the normal implementation proof gates to that role. Security reviewers stay report-only by default. Documentation and cont-QA closure only run after integration is explicitly ready for doc closure; if `cont-EVAL`, security review, or integration reports more work, or if helper assignments or required dependency tickets remain open, the wave stops there and retries only the implicated owners plus the relevant closure steward. When multiple implementation agents share a promoted component, owners that already landed valid proof stay reusable while the launcher retries only the sibling owners that still owe closure evidence.
 
  Live closure is fail-closed:
 
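Reviewer note on the closure-sweep change above: the added sentence says owners with valid proof on a shared promoted component stay reusable, and only the sibling owners that still owe closure evidence are retried. A minimal sketch of that selection rule follows. The function name and the `{ agent, component, proofValid }` record shape are hypothetical and are not taken from the Wave source.

```javascript
// Hypothetical record shape: { agent, component, proofValid }.
// Illustrative only; Wave's real launcher state is not shown in this diff.
function selectClosureRetries(owners) {
  const retry = [];
  const reusable = [];
  for (const owner of owners) {
    // Owners with valid proof are kept as-is; siblings that still owe
    // closure evidence are the only ones queued for another attempt.
    (owner.proofValid ? reusable : retry).push(owner.agent);
  }
  return { retry, reusable };
}

// Two agents share one promoted component; only A2 still owes proof.
const closurePlan = selectClosureRetries([
  { agent: "A1", component: "docs-shared-plan", proofValid: true },
  { agent: "A2", component: "docs-shared-plan", proofValid: false },
]);
console.log(closurePlan);
```

Under these assumptions, `A1` is left alone and only `A2` is retried.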
@@ -1,6 +1,6 @@
  # Runtime Configuration Reference
 
- This directory is the canonical reference for executor configuration in Wave `0.6.1`.
+ This directory is the canonical reference for executor configuration in the packaged Wave release.
 
  Use it when you need the full supported surface for:
 
@@ -65,7 +65,7 @@ These fields are shared across runtimes:
  | Model | `model` in profile, `executors.claude.model`, `executors.opencode.model` | `model` | Codex uses shared `model` from profile or agent only |
  | Fallbacks | `fallbacks` in profile | `fallbacks` | Runtime ids used for retry-time reassignment |
  | Tags | `tags` in profile | `tags` | Stored in resolved executor state for policy and traces |
- | Budget turns | `budget.turns` in profile | `budget.turns` | Seeds Claude `maxTurns` and OpenCode `steps` when runtime-specific values are absent |
+ | Budget turns | `budget.turns` in profile | `budget.turns` | Seeds Claude `maxTurns` and OpenCode `steps` when runtime-specific values are absent; it does not set a Codex turn limit |
  | Budget minutes | `budget.minutes` in profile | `budget.minutes` | Caps attempt timeout |
 
  ## Runtime Pages
@@ -83,7 +83,7 @@ Wave writes runtime artifacts here:
 
  Common files:
 
- - `launch-preview.json`: resolved invocation lines, env vars, and retry mode
+ - `launch-preview.json`: resolved invocation lines, env vars, retry mode, and structured attempt/turn-limit metadata
  - `skills.resolved.md`: compact metadata-first skill catalog for the selected agent and runtime
  - `skills.expanded.md`: full canonical/debug skill payload with `SKILL.md` bodies and adapters
  - `skills.metadata.json`: resolved skill ids, activation metadata, permissions, hashes, and generated artifact paths
@@ -100,7 +100,7 @@ Runtime-specific delivery:
  - OpenCode injects the compact catalog into `opencode.json` and attaches `skill.json`, `SKILL.md`, the selected adapter, and recursive `references/**` files through `--file`.
  - Local keeps skills prompt-only.
 
- `launch-preview.json` also records the resolved skill metadata so dry-run can verify the exact runtime plus skill combination before any live launch.
+ `launch-preview.json` also records the resolved skill metadata plus a `limits` section. For Claude and OpenCode, that section reports the known turn ceiling and whether it came from the runtime-specific setting or generic `budget.turns`. For Codex, it explicitly records that Wave emitted no turn-limit flag and that any effective ceiling may come from the selected Codex profile or upstream runtime.
 
  ## Recommended Validation Path
 
@@ -12,6 +12,7 @@ Wave launches Claude headlessly with `claude -p --no-session-persistence`.
  | Prompt mode | `executors.claude.appendSystemPromptMode` | n/a | Uses `--append-system-prompt-file` or `--system-prompt-file` |
  | Permission mode | `executors.claude.permissionMode`, `executors.profiles.<name>.claude.permissionMode` | `claude.permission_mode` | Adds `--permission-mode <mode>` |
  | Permission prompt tool | `executors.claude.permissionPromptTool`, `executors.profiles.<name>.claude.permissionPromptTool` | `claude.permission_prompt_tool` | Adds `--permission-prompt-tool <tool>` |
+ | Effort | `executors.claude.effort`, `executors.profiles.<name>.claude.effort` | `claude.effort` | Adds `--effort low|medium|high|max` |
  | Max turns | `executors.claude.maxTurns`, `executors.profiles.<name>.claude.maxTurns` | `claude.max_turns` | Adds `--max-turns <n>` |
  | MCP config | `executors.claude.mcpConfig`, `executors.profiles.<name>.claude.mcpConfig` | `claude.mcp_config` | Adds repeated `--mcp-config <path>` |
  | Strict MCP mode | `executors.claude.strictMcpConfig`, `executors.profiles.<name>.claude.strictMcpConfig` | n/a | Adds `--strict-mcp-config` |
@@ -27,6 +28,8 @@ Wave launches Claude headlessly with `claude -p --no-session-persistence`.
 
  Wave always writes `claude-system-prompt.txt` for the harness runtime instructions.
 
+ Wave validates the effort enum only. Model-specific compatibility for values such as `max` remains enforced by Claude Code itself.
+
  Wave writes `claude-settings.json` only when at least one inline overlay input is present:
 
  - `settingsJson`
@@ -57,6 +60,7 @@ If no inline overlay data is present, Wave passes the base `claude.settings` fil
  },
  "claude": {
  "agent": "reviewer",
+ "effort": "high",
  "permissionMode": "plan",
  "allowedTools": ["Read"],
  "disallowedTools": ["Edit"]
@@ -84,6 +88,7 @@ If no inline overlay data is present, Wave passes the base `claude.settings` fil
 
  - id: claude
  - model: claude-sonnet-4-6
+ - claude.effort: high
  - claude.permission_mode: plan
  - claude.max_turns: 4
  - claude.settings_json: {"permissions":{"allow":["Read"]}}
@@ -102,4 +107,4 @@ For a dry run, inspect:
  - `claude-settings.json`, when generated
  - `launch-preview.json`
 
- `launch-preview.json` shows the final `claude -p` invocation and whether `--settings`, `--allowedTools`, `--disallowedTools`, `--mcp-config`, or `--system-prompt-file` were included.
+ `launch-preview.json` shows the final `claude -p` invocation, whether `--effort`, `--settings`, `--allowedTools`, `--disallowedTools`, `--mcp-config`, or `--system-prompt-file` were included, and the resolved `limits` block for attempt timeout plus known turn ceiling.
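Reviewer note on the `claude.effort` additions in this file: the table row lists the enum as `low|medium|high|max`, and the prose says Wave validates only the enum while Claude Code enforces model compatibility. A minimal sketch of that check follows. The function name is hypothetical, and emitting no flag when the value is unset is an assumption rather than something this diff confirms.

```javascript
// Enum values come from the "Effort" table row; everything else is illustrative.
const CLAUDE_EFFORT_VALUES = ["low", "medium", "high", "max"];

function buildEffortArgs(effort) {
  // Assumption: an unset effort adds no flag to the invocation.
  if (effort === undefined || effort === null) return [];
  if (!CLAUDE_EFFORT_VALUES.includes(effort)) {
    throw new Error(
      `Invalid claude.effort "${effort}"; expected one of ${CLAUDE_EFFORT_VALUES.join(", ")}`
    );
  }
  // Model-specific compatibility (for example "max") is deliberately not
  // checked here; the docs leave that to Claude Code itself.
  return ["--effort", effort];
}

console.log(buildEffortArgs("high"));
```

Keeping the check to enum membership mirrors the stated split of responsibility: the harness rejects malformed configuration early, and the runtime decides whether a given model accepts the value.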
@@ -20,6 +20,7 @@ Wave launches Codex with `codex exec` and pipes the generated task prompt throug
  ## Notes
 
  - There is no `executors.codex.model` key today. Use profile `model` or per-agent `model`.
+ - Generic `budget.turns` does not set a Codex turn limit. If Codex stops on a turn ceiling, that limit came from the selected Codex profile or upstream Codex runtime, not from a Wave-emitted CLI flag.
  - `codex.images`, `codex.add_dirs`, and `codex.config` accept either a string array in `wave.config.json` or a comma-separated list in a wave file.
  - Relative paths are passed to Codex relative to the repository root because Wave launches the executor from the repo workspace.
 
@@ -35,7 +36,6 @@ Wave launches Codex with `codex exec` and pipes the generated task prompt throug
  "model": "gpt-5-codex",
  "fallbacks": ["claude", "opencode"],
  "budget": {
- "turns": 12,
  "minutes": 45
  },
  "codex": {
@@ -78,4 +78,4 @@ For a dry run, inspect:
  - `launch-preview.json` for the final `codex exec` command
  - any referenced prompt file under `.tmp/<lane>-wave-launcher/dry-run/prompts/`
 
- The preview records the exact `--profile`, repeated `-c`, `--image`, and `--add-dir` flags that Wave would use in a live launch.
+ The preview records the exact `--profile`, repeated `-c`, `--image`, and `--add-dir` flags that Wave would use in a live launch. It also includes a `limits` block that makes Wave's Codex visibility explicit: `turnLimitSource: "not-set-by-wave"` means Wave emitted no Codex turn-limit flag, so any effective ceiling is external to the Wave CLI invocation.
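Reviewer note on the `limits` block: taken together, these doc changes describe a precedence rule. For Claude and OpenCode, the runtime-specific value wins, generic `budget.turns` is the fallback, and for Codex Wave sets nothing. A minimal sketch of that rule follows. Only `turnLimitSource: "not-set-by-wave"` is a value named in the diff; the function name, the `turnLimit` field, and the other source labels are assumptions for illustration.

```javascript
// Hypothetical resolver; only "not-set-by-wave" is a label taken from the docs.
function resolveTurnLimit(runtime, config = {}) {
  if (runtime === "codex") {
    // Wave emits no Codex turn-limit flag, so any ceiling is external.
    return { turnLimit: null, turnLimitSource: "not-set-by-wave" };
  }
  const readers = {
    claude: (c) => c.claude?.maxTurns,
    opencode: (c) => c.opencode?.steps,
  };
  const read = readers[runtime];
  if (!read) return { turnLimit: null, turnLimitSource: "unknown-runtime" };

  const specific = read(config);
  if (specific !== undefined) {
    return { turnLimit: specific, turnLimitSource: "runtime-specific" };
  }
  const generic = config.budget?.turns;
  if (generic !== undefined) {
    return { turnLimit: generic, turnLimitSource: "budget.turns" };
  }
  return { turnLimit: null, turnLimitSource: "none" };
}

console.log(resolveTurnLimit("claude", { budget: { turns: 12 } }));
console.log(resolveTurnLimit("codex", { budget: { turns: 12 } }));
```

In this sketch a Codex profile that carries `budget.turns` still resolves to `not-set-by-wave`, which matches the removal of `"turns": 12` from the Codex example profile earlier in this file.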
@@ -90,4 +90,4 @@ For a dry run, inspect:
  - `opencode.json`
  - `launch-preview.json`
 
- `launch-preview.json` shows the final `opencode run` command and the exported `OPENCODE_CONFIG` path.
+ `launch-preview.json` shows the final `opencode run` command, the exported `OPENCODE_CONFIG` path, and the resolved `limits` block for attempt timeout plus known step ceiling.
@@ -7,6 +7,8 @@ summary: "Primary external sources used as inspiration for planning, harness des
 
  This repository does not commit converted paper/article caches. Keep any hydrated local copies under `docs/research/agent-context-cache/` or another ignored cache directory.
 
+ For a narrative synthesis of the most relevant MAS failure modes and how Wave responds to them, start with [coordination-failure-review.md](./coordination-failure-review.md) and then use this page as the bibliography.
+
  ## Practice Articles
 
  - [Harness engineering: leveraging Codex in an agent-first world](https://openai.com/index/harness-engineering/)