@chllming/wave-orchestration 0.6.1 → 0.6.3
- package/CHANGELOG.md +16 -0
- package/README.md +79 -30
- package/docs/README.md +15 -3
- package/docs/concepts/context7-vs-skills.md +24 -0
- package/docs/concepts/runtime-agnostic-orchestration.md +17 -2
- package/docs/concepts/what-is-a-wave.md +28 -0
- package/docs/evals/README.md +2 -0
- package/docs/guides/terminal-surfaces.md +2 -0
- package/docs/plans/current-state.md +2 -1
- package/docs/plans/wave-orchestrator.md +22 -3
- package/docs/reference/runtime-config/README.md +4 -4
- package/docs/reference/runtime-config/claude.md +6 -1
- package/docs/reference/runtime-config/codex.md +2 -2
- package/docs/reference/runtime-config/opencode.md +1 -1
- package/docs/research/agent-context-sources.md +2 -0
- package/docs/research/coordination-failure-review.md +37 -13
- package/package.json +1 -1
- package/releases/manifest.json +33 -0
- package/scripts/wave-autonomous.mjs +2 -4
- package/scripts/wave-orchestrator/adhoc.mjs +32 -11
- package/scripts/wave-orchestrator/agent-state.mjs +10 -3
- package/scripts/wave-orchestrator/autonomous.mjs +20 -6
- package/scripts/wave-orchestrator/config.mjs +19 -0
- package/scripts/wave-orchestrator/dashboard-renderer.mjs +150 -20
- package/scripts/wave-orchestrator/dashboard-state.mjs +8 -0
- package/scripts/wave-orchestrator/executors.mjs +67 -4
- package/scripts/wave-orchestrator/install.mjs +198 -25
- package/scripts/wave-orchestrator/launcher-runtime.mjs +1 -0
- package/scripts/wave-orchestrator/launcher.mjs +249 -10
- package/scripts/wave-orchestrator/package-update-notice.mjs +230 -0
- package/scripts/wave-orchestrator/package-version.mjs +32 -0
- package/scripts/wave-orchestrator/terminals.mjs +25 -0
- package/scripts/wave-orchestrator/wave-files.mjs +31 -0
- package/scripts/wave.mjs +12 -2
package/CHANGELOG.md
CHANGED
@@ -2,6 +2,22 @@
 
 ## Unreleased
 
+## 0.6.3 - 2026-03-22
+
+- Added a best-effort npmjs update notice on `wave launch`, `wave autonomous`, and `wave adhoc run`, with cached lookup state under `.wave/package-update-check.json` and opt-out via `WAVE_SKIP_UPDATE_CHECK=1`.
+- Added `wave self-update`, which detects the workspace package manager, updates `@chllming/wave-orchestration`, prints the changelog delta since the recorded install, and then runs `wave upgrade`.
+- Suppressed duplicate notices for nested launcher calls so autonomous and ad-hoc runs announce at most once, while keeping JSON-oriented stdout surfaces clean by emitting notices on stderr.
+- Documented the new update flow and added regression coverage for notice caching, package-manager-aware self-update, and nested-launch suppression.
+
+## 0.6.2 - 2026-03-22
+
+- Added first-class `claude.effort` support across config profiles, lane overrides, and per-agent `### Executor` blocks, and now emit `--effort` in Claude launch previews and live runs.
+- Clarified operator runtime visibility with additive `launch-preview.json` `limits` metadata, including explicit known turn ceilings for Claude/OpenCode and explicit Codex opacity when Wave does not emit a turn-limit flag.
+- Clarified dashboard and terminal UX: global wave counts now distinguish done, active, pending, and failed agents; the current-wave dashboard keeps a stable terminal name; and TTY dashboards use simple color cues for faster scanning.
+- Pruned stale dry-run executor preview directories when wave agent sets shrink, so manual inspection of `.tmp/.../dry-run/executors/` matches the current manifest.
+- Preserved already-landed implementation slices for shared promoted components by retrying only the sibling owners that still owe closure proof instead of blindly replaying the landed owner.
+- Added release-surface alignment regression coverage and updated the shipped docs so README, runtime-config references, changelog, and release metadata match the `0.6.2` package surface.
+
 ## 0.6.1 - 2026-03-22
 
 - Published the post-merge `main` source as `0.6.1` so the default branch, tagged source, and package docs all agree on the current release.
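The 0.6.3 entries above describe an update check that is cached and can be switched off. A minimal sketch of that gating logic follows. Only `WAVE_SKIP_UPDATE_CHECK=1` and the cache path `.wave/package-update-check.json` come from the changelog; the cache fields (`checkedAt`, `latestVersion`), the six-hour TTL, and both function names are assumptions made for illustration, and the shipped `package-update-notice.mjs` may work differently.

```javascript
// Hypothetical TTL: the changelog does not say how long a cached lookup is trusted.
const CACHE_TTL_MS = 6 * 60 * 60 * 1000;

// Compare two "major.minor.patch" strings numerically. Pre-release tags are ignored.
function isNewer(candidate, current) {
  const a = candidate.split(".").map(Number);
  const b = current.split(".").map(Number);
  for (let i = 0; i < 3; i += 1) {
    const left = a[i] || 0;
    const right = b[i] || 0;
    if (left !== right) return left > right;
  }
  return false;
}

// Decide what to do from the environment, the cached lookup (or null), and the clock.
function planUpdateCheck(env, cache, installedVersion, nowMs) {
  if (env.WAVE_SKIP_UPDATE_CHECK === "1") return { action: "skip" };
  const fresh = cache !== null && nowMs - cache.checkedAt < CACHE_TTL_MS;
  if (!fresh) return { action: "refresh" }; // a best-effort registry lookup would go here
  return isNewer(cache.latestVersion, installedVersion)
    ? { action: "notify", latest: cache.latestVersion }
    : { action: "quiet" };
}

// Hypothetical cache contents, as they might be read from the cache file.
const cachedLookup = { checkedAt: 1000, latestVersion: "0.6.3" };
console.log(planUpdateCheck({}, cachedLookup, "0.6.1", 2000));
```

With a fresh cache that records `0.6.3` and an installed `0.6.1`, the sketch reports a `notify` action; a stale or missing cache yields `refresh` instead, so a failed lookup never blocks the launch.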
package/README.md
CHANGED
@@ -1,48 +1,90 @@
 # Wave Orchestration
 
-Wave Orchestration is
+Wave Orchestration is my framework for "vibe-coding." It keeps the speed of agentic coding, but makes the runtime, coordination, and context model explicit enough to inspect, replay, and improve.
+
+The framework does three things:
+
+1. It abstracts the agent runtime away without flattening everything to the lowest common denominator. The same waves, skills, planning, evaluation, proof, and traces can run across Claude, Codex, and OpenCode while still preserving runtime-native features through executor adapters.
+2. It runs work as a blackboard-style multi-agent system. Agents do not just exchange chat messages; they work against shared state, generated inboxes, explicit ownership, and staged closure, and a wave keeps going until the declared goals, proof, production-live criteria, or eval targets are actually satisfied.
+3. It compiles context dynamically for the task at hand. Shared memory, generated runtime files, project defaults, skills, Context7, and cached external docs are assembled at runtime so you do not have to hand-maintain separate Claude, Codex, or other context files.
+
+## Core Ideas
+
+- `One orchestrator, many runtimes.`
+Planning, skills, evals, proof, and traces stay constant while the executor adapter changes.
+- `A blackboard-style multi-agent system.`
+The coordination log is canonical shared state; the rolling board, shared summary, inboxes, ledger, and integration views are generated projections over that state.
+- `Completion is goal-driven and proof-bounded.`
+Waves close only when deliverables, proof artifacts, eval targets, dependencies, and closure stewards agree.
+- `Context is compiled, not hand-maintained.`
+Wave builds runtime context from repo state, project memory, skills, Context7, and generated overlays.
+- `The system is inspectable and replayable.`
+Dry-run previews, logs, dashboards, ledgers, traces, and replay make the system debuggable instead of mysterious.
+
+## How The Architecture Works
+
+1. Define shared docs plus `docs/plans/waves/wave-<n>.md` files, or generate them with `wave draft`.
+2. Run `wave launch --dry-run` to validate the wave and materialize prompts, shared summaries, inboxes, dashboards, and executor previews before any live execution.
+3. During live execution, implementation agents write claims, evidence, requests, and decisions into the canonical coordination log instead of relying on ad hoc terminal narration.
+4. The launcher compiles blackboard projections from that state: rolling board, shared summary, per-agent inboxes, ledger, docs queue, dependency views, and integration summaries.
+5. Closure runs only when the integrated state is ready: optional `cont-EVAL` (`E0`), optional security review, integration (`A8`), documentation (`A9`), and `cont-QA` (`A0`).
+
+## Architecture Surfaces
+
+- `Wave contract`
+Shared plan docs, wave markdown, deliverables, proof artifacts, and eval targets define the goal.
+- `Shared state`
+The coordination log is the source of truth; the board is for humans, not the scheduler.
+- `Runtime abstraction`
+Executor adapters preserve Codex, Claude, and OpenCode-specific launch features without changing the higher-level wave contract.
+- `Compiled context`
+Project profile memory, shared summary, inboxes, skills, Context7, and runtime overlays are generated for the chosen executor.
+- `Proof and closure`
+Exit contracts, proof artifacts, eval markers, and closure stewards stop waves from closing on narrative-only PASS.
+- `Replay and audit`
+Traces capture the attempt so failures can be inspected and replayed instead of guessed from screenshots.
 
-##
+## Example Output
 
-
-2. Run `wave launch --dry-run` to validate the wave and materialize prompts, inboxes, dashboards, and executor previews.
-3. A real launch runs implementation agents first. Agents post claims, evidence, requests, and decisions into the coordination log and rolling message board.
-4. When implementation gates pass, closure runs in order: optional `cont-EVAL` (`E0`), integration (`A8`), documentation (`A9`), and `cont-QA` (`A0`).
-5. Operators use the generated ledgers, inboxes, feedback queue, dependency views, and traces instead of guessing from raw terminal output.
+Representative rolling message board output from a real wave run:
 
-
+<img src="./docs/image.png" alt="Example rolling message board output showing claims, evidence, requests, and cont-QA closure for a wave run" width="100%" />
 
-
-- Implementation-first execution with staged closure and retry support
-- Durable coordination log, rolling message board, compiled inboxes, and per-wave ledger
-- Dry-run prompt and executor preview mode before any real agent launch
-- Context7 bundle selection, caching, and prompt injection
-- Multi-executor support for Codex, Claude Code, OpenCode, and a local smoke executor
-- Cross-runtime skill packs loaded from `skills/` and resolved by lane, role, runtime, deploy kind, and per-agent attachment
-- Human feedback routing, clarification triage, helper assignment, and cross-lane dependencies
-- Replayable trace bundles for regression and release verification
+## Common MAS Failure Cases
 
-
+Recent multi-agent research keeps returning to the same failure modes:
 
-
+- `Cosmetic board, no canonical state`
+Agents appear coordinated, but there is no machine-trustable source of truth underneath the conversation.
+- `Hidden evidence never gets pooled`
+One agent has the critical fact, but it never reaches shared state before closure.
+- `Communication without global-state reconstruction`
+Agents exchange information, but nobody reconstructs the correct cross-agent picture.
+- `Simultaneous coordination collapse`
+A team that looks fine in serial work falls apart when multiple owners, blockers, or resources must move together.
+- `Expert signal gets averaged away`
+The strongest specialist view is diluted into a weaker compromise.
+- `Contradictions get smoothed over`
+Conflicts are narrated away instead of being turned into explicit repair work.
+- `Premature closure`
+Agents say they are done before proof, evals, or integrated state actually support PASS.
 
-
+Wave is built to mitigate those failures with canonical shared state, generated blackboard projections, explicit ownership, goal-driven, proof-bounded closure, and replayable traces. For the research framing and the current gaps, see [docs/research/coordination-failure-review.md](./docs/research/coordination-failure-review.md).
 
 ## Quick Start
 
 Current release:
 
-- `@chllming/wave-orchestration@0.6.
-- Release tag: [`v0.6.
+- `@chllming/wave-orchestration@0.6.3`
+- Release tag: [`v0.6.3`](https://github.com/chllming/wave-orchestration/releases/tag/v0.6.3)
 - Public install path: npmjs
 - Authenticated fallback: GitHub Packages
 
-Highlights in `0.6.
+Highlights in `0.6.3`:
 
---
---
---
-- Starter docs and skills now cover the current `0.6.1` closure, benchmark, security, and provider surfaces.
+- Runtime launch entrypoints now check npmjs for a newer published package in the background, cache the result under `.wave/package-update-check.json`, and warn on stderr when the workspace is behind.
+- `wave self-update` now gives downstream repos a one-command update path that detects the workspace package manager, updates the dependency, shows the changelog delta, and records the workspace upgrade report.
+- Autonomous and ad-hoc flows suppress nested notices so operators see at most one update banner per top-level run, and structured stdout remains clean for JSON consumers.
 
 Requirements:
 
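Step 5 of "How The Architecture Works" in the hunk above lists the closure stages in a fixed order, two of them optional. A minimal sketch of that ordering follows. The ids `E0`, `A8`, `A9`, and `A0` are taken from the README text; the wave fields `hasContEval` and `hasSecurityReview`, the `security-review` role label, and both function names are hypothetical and are not the launcher's actual API.

```javascript
// Build the ordered closure stages for a wave. Optional stages are included
// only when the wave declares them (field names are assumptions).
function closureStages(wave) {
  const stages = [];
  if (wave.hasContEval) stages.push({ id: "E0", role: "cont-EVAL" });
  // The README names no agent id for the security review, so none is invented here.
  if (wave.hasSecurityReview) stages.push({ id: null, role: "security-review" });
  stages.push({ id: "A8", role: "integration" });
  stages.push({ id: "A9", role: "documentation" });
  stages.push({ id: "A0", role: "cont-QA" });
  return stages;
}

// Run the stages in order and stop at the first one that does not pass,
// so later stages never see an unready integrated state.
function runClosure(wave, runStage) {
  for (const stage of closureStages(wave)) {
    if (!runStage(stage)) return { closed: false, blockedAt: stage.role };
  }
  return { closed: true, blockedAt: null };
}

const demoWave = { hasContEval: true, hasSecurityReview: false };
console.log(closureStages(demoWave).map((stage) => stage.role).join(" -> "));
console.log(runClosure(demoWave, (stage) => stage.role !== "documentation"));
```

Stopping at the first failing stage is the design point: `cont-QA` is never asked for a verdict while integration or documentation is still blocked.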
@@ -59,7 +101,7 @@ pnpm add -D @chllming/wave-orchestration
 pnpm exec wave init
 pnpm exec wave doctor
 pnpm exec wave launch --lane main --dry-run --no-dashboard
-pnpm exec wave coord show --lane main --wave 0 --dry-run
+pnpm exec wave coord show --lane main --wave 0 --dry-run --json
 ```
 
 If the repo already has Wave config, plans, or waves you want to keep:
@@ -70,6 +112,8 @@ pnpm exec wave init --adopt-existing
 
 Fresh init also seeds a starter `skills/` library plus `docs/evals/benchmark-catalog.json`. The launcher projects those skill bundles into Codex, Claude, OpenCode, and local executor overlays after the final runtime for each agent is resolved, and waves that include `cont-EVAL` can declare `## Eval targets` against that catalog.
 
+When runtime launch commands detect a newer npmjs release, Wave prints a non-blocking update notice on stderr. The fast path is `pnpm exec wave self-update`, which updates the dependency, prints the changelog delta, and then records the workspace upgrade report.
+
 ## Common Commands
 
 ```bash
@@ -86,6 +130,9 @@ pnpm exec wave dep show --lane main --wave 0 --json
 
 # Run autonomous mode after the wave set is stable
 pnpm exec wave autonomous --lane main --executor codex --codex-sandbox danger-full-access
+
+# Pull the latest published package and record the workspace upgrade
+pnpm exec wave self-update
 ```
 
 ## Develop This Package
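The README hunks above say the update notice is written to stderr and that nested launches stay quiet. One way this could work is sketched below, assuming the top-level run marks the environment it hands to child processes. The marker name `WAVE_UPDATE_NOTICE_SHOWN` and the function `announceOnce` are hypothetical; the diff shown here does not reveal the mechanism the package actually uses.

```javascript
// Hypothetical marker variable passed from a parent launch to its children.
const NOTICE_MARKER = "WAVE_UPDATE_NOTICE_SHOWN";

// Print the notice unless a parent run already did. Returns the environment
// that should be handed to nested launcher calls.
function announceOnce(env, message, writeErr) {
  if (env[NOTICE_MARKER] === "1") return env; // a parent already announced
  writeErr(message + "\n"); // stderr only, so JSON written to stdout stays parseable
  return { ...env, [NOTICE_MARKER]: "1" };
}

// Demo with an in-memory sink standing in for stderr.
const stderrLines = [];
const sink = (text) => stderrLines.push(text);
const childEnv = announceOnce({}, "Update available: run `pnpm exec wave self-update`", sink);
announceOnce(childEnv, "Update available: run `pnpm exec wave self-update`", sink);
console.log(stderrLines.length);
```

In a real CLI the sink would be `(text) => process.stderr.write(text)`. The nested call receives `childEnv`, sees the marker, and writes nothing, which matches the "at most one banner per top-level run" behavior described above.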
@@ -99,14 +146,16 @@ node scripts/wave.mjs launch --lane main --dry-run --no-dashboard
 ## Learn More
 
 - [docs/README.md](./docs/README.md): docs map and suggested structure
-- [docs/concepts/what-is-a-wave.md](./docs/concepts/what-is-a-wave.md): wave anatomy,
+- [docs/concepts/what-is-a-wave.md](./docs/concepts/what-is-a-wave.md): wave anatomy, blackboard execution model, and proof-bounded closure
+- [docs/concepts/runtime-agnostic-orchestration.md](./docs/concepts/runtime-agnostic-orchestration.md): how one orchestration substrate spans Claude, Codex, OpenCode, and local execution
+- [docs/concepts/context7-vs-skills.md](./docs/concepts/context7-vs-skills.md): compiled context, external truth, and repo-owned operating knowledge
 - [docs/guides/planner.md](./docs/guides/planner.md): `wave project` and `wave draft` workflow
-- [docs/concepts/context7-vs-skills.md](./docs/concepts/context7-vs-skills.md): when to use external docs vs repo-owned skills
 - [docs/guides/terminal-surfaces.md](./docs/guides/terminal-surfaces.md): tmux, VS Code terminal registry, and dry-run surfaces
 - [docs/plans/wave-orchestrator.md](./docs/plans/wave-orchestrator.md): operator runbook
 - [docs/plans/context7-wave-orchestrator.md](./docs/plans/context7-wave-orchestrator.md): Context7 setup and bundle authoring
 - [docs/reference/runtime-config/README.md](./docs/reference/runtime-config/README.md): executor, runtime, and skill-projection configuration
 - [docs/reference/skills.md](./docs/reference/skills.md): skill bundle format, resolution order, and runtime projection
+- [docs/research/coordination-failure-review.md](./docs/research/coordination-failure-review.md): MAS failure modes from the research and how Wave responds
 - [CHANGELOG.md](./CHANGELOG.md): release history
 
 ## Research Sources
package/docs/README.md
CHANGED
@@ -1,6 +1,12 @@
 # Wave Documentation
 
-
+These docs are organized around three core ideas:
+
+- one orchestrator, many runtimes across Claude, Codex, OpenCode, and local execution
+- a blackboard-style multi-agent system with goal-driven, proof-bounded closure
+- compiled context from shared state, skills, runtime files, and Context7 instead of hand-maintained per-runtime context files
+
+The useful path is journey-first:
 
 - start with one core concept doc
 - then use one end-to-end workflow guide
@@ -22,7 +28,11 @@ This repository now uses a layered docs structure, but the useful path is journe
 ## Start Here
 
 - New to Wave:
-Read [concepts/what-is-a-wave.md](./concepts/what-is-a-wave.md). It
+Read [concepts/what-is-a-wave.md](./concepts/what-is-a-wave.md). It covers the blackboard execution model, proof-bounded closure, runtime posture, and durable state model in one place.
+- Want the runtime abstraction story:
+Read [concepts/runtime-agnostic-orchestration.md](./concepts/runtime-agnostic-orchestration.md) to see how planning, skills, evals, proof, and traces stay stable across Claude, Codex, OpenCode, and local execution.
+- Want the context story:
+Read [concepts/context7-vs-skills.md](./concepts/context7-vs-skills.md) for the compiled-context model: shared summary, inboxes, project defaults, skills, Context7, and runtime overlays.
 - Drafting or revising waves:
 Read [guides/author-and-run-waves.md](./guides/author-and-run-waves.md), then use [plans/wave-orchestrator.md](./plans/wave-orchestrator.md) as the operator runbook.
 - Adding a security review pass:
@@ -37,8 +47,10 @@ This repository now uses a layered docs structure, but the useful path is journe
 Start with [guides/author-and-run-waves.md](./guides/author-and-run-waves.md), then use [plans/wave-orchestrator.md](./plans/wave-orchestrator.md) for the live operator flow.
 - Tuning runtime behavior:
 Read [reference/runtime-config/README.md](./reference/runtime-config/README.md) and [reference/skills.md](./reference/skills.md).
+- Want the research framing behind the design:
+Read [research/coordination-failure-review.md](./research/coordination-failure-review.md) for the common MAS failure modes and how Wave tries to mitigate them, then use [research/agent-context-sources.md](./research/agent-context-sources.md) as the bibliography.
 - Looking for supporting concept pages:
-Use [concepts/
+Use [concepts/operating-modes.md](./concepts/operating-modes.md) after the main concept, runtime, and context docs.
 
 ## Package vs Repo-Owned Material
 
package/docs/concepts/context7-vs-skills.md
CHANGED
@@ -4,6 +4,30 @@ Context7 and skills solve different problems.
 
 Use Context7 for external library truth. Use skills for repo-owned, reusable operating knowledge.
 
+That comparison matters because Wave treats context as something to compile at runtime, not something humans should maintain separately for Claude, Codex, OpenCode, and every other executor.
+
+## Compiled Context, Not Hand-Maintained Context Files
+
+The active context for an agent is assembled from multiple layers:
+
+- repository source and the wave's owned files
+- wave markdown and shared plan docs
+- generated shared summary and per-agent inbox
+- saved project defaults such as `.wave/project-profile.json`
+- resolved repo-owned skills
+- selected Context7 snippets for external library truth
+- generated runtime overlays and launch artifacts
+
+Because of that, the question is not "which hand-written context file does this runtime use?" The question is "which context sources does this wave compile for the selected runtime right now?"
+
+Runtime-specific context is still real, but it is mostly generated:
+
+- Claude gets merged system-prompt and settings overlays
+- Codex gets executor flags plus runtime-projected skills
+- OpenCode gets generated config, attachments, and runtime instructions
+
+That keeps the context model unified even when the transport layer differs.
+
 ## Short Version
 
 - Context7
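The layered assembly described in the hunk above can be pictured as a small pure function: shared sources are the same for every runtime, and only the final overlay changes. This is a sketch under stated assumptions. The layer names, the `compileContext` signature, and the overlay descriptions are illustrative labels for the bullet list above, and the layer order is not confirmed as the launcher's actual order.

```javascript
// Hypothetical layer names, one per context source listed in the hunk above.
const LAYER_ORDER = [
  "repoFiles", "wavePlan", "sharedSummary", "inbox",
  "projectProfile", "skills", "context7", "runtimeOverlay",
];

// Build the ordered context package for one agent on one runtime.
// Layers that are absent are skipped rather than emitted as empty entries.
function compileContext(runtime, shared, overlays) {
  const layers = { ...shared, runtimeOverlay: overlays[runtime] ?? null };
  return LAYER_ORDER
    .filter((name) => layers[name] !== null && layers[name] !== undefined)
    .map((name) => ({ layer: name, content: layers[name] }));
}

// Demo inputs. The paths follow patterns named in the docs; the values are made up.
const sharedSources = {
  wavePlan: "docs/plans/waves/wave-0.md",
  projectProfile: ".wave/project-profile.json",
  skills: ["example-skill"],
};
const runtimeOverlays = {
  claude: "merged system-prompt and settings overlay",
  codex: "executor flags plus projected skills",
};

console.log(compileContext("claude", sharedSources, runtimeOverlays).map((entry) => entry.layer));
```

Switching the first argument from `"claude"` to `"codex"` changes only the `runtimeOverlay` entry, which is the point the doc makes about keeping the context model unified while the transport differs.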
package/docs/concepts/runtime-agnostic-orchestration.md
CHANGED
@@ -1,15 +1,22 @@
 # Runtime-Agnostic Orchestration
 
+In short: one orchestrator, many runtimes.
+
 Wave is runtime agnostic at the orchestration layer.
 
-That means planning, coordination, closure, and traces do not depend on whether the selected executor is Codex, Claude Code, OpenCode, or the local smoke executor.
+That means planning, skills, evaluation, proof, coordination, closure, and traces do not depend on whether the selected executor is Codex, Claude Code, OpenCode, or the local smoke executor.
+
+Wave abstracts the runtime away without flattening everything to the lowest common denominator. The wave contract stays stable while the executor adapter preserves the useful runtime-native features.
 
 ## What Stays The Same Across Runtimes
 
 These layers are runtime-neutral:
 
 - wave parsing and validation
+- planner-produced wave specs and authored wave markdown
+- eval targets, deliverables, and proof artifacts
 - component and closure gates
+- skill resolution and attachment policy
 - compiled shared summaries and per-agent inboxes
 - coordination log and rendered message board
 - helper assignments and dependency handling
@@ -34,11 +41,19 @@ Runtime-specific behavior is isolated to the executor adapter layer:
 
 The orchestration substrate above those adapters does not need to know how the runtime transports prompts.
 
+This is the important distinction:
+
+- the orchestration layer owns goals, ownership, proof, and shared state
+- the executor adapter owns prompt transport, runtime-native flags, files, and settings
+
+That split is what lets Wave stay portable without giving up runtime-specific leverage.
+
 ## Why This Matters
 
 Runtime agnosticism gives you:
 
-- the same plan and closure model across vendors
+- the same plan, skill, and closure model across vendors
+- the same eval and proof model across vendors
 - replay and audit surfaces that do not care which runtime produced the work
 - per-role runtime choice without rewriting authoring conventions
 - retry-time fallback without inventing a second planning model
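The split described above, where the orchestration layer owns goals and ownership while the adapter owns runtime-native flags, can be sketched as a neutral base record plus a per-runtime adapter function. The `--effort` flag for Claude is named in the 0.6.2 changelog; everything else here (`baseLaunch`, `adapters`, `buildLaunch`, the agent fields, and the empty Codex and OpenCode adapters) is a hypothetical illustration and not the shape of the shipped `executors.mjs`.

```javascript
// Runtime-neutral part of an agent launch: identical for every executor.
function baseLaunch(agent) {
  return { agentId: agent.id, promptFile: agent.promptFile, owns: agent.owns };
}

// Hypothetical adapters: each one returns only the runtime-native extras.
const adapters = {
  claude: (cfg) => (cfg.effort ? ["--effort", cfg.effort] : []),
  codex: () => [], // runtime-native flags omitted in this sketch
  opencode: () => [], // runtime-native flags omitted in this sketch
};

// Combine the neutral record with whatever the selected adapter contributes.
function buildLaunch(agent, runtime, runtimeConfig) {
  const adapter = adapters[runtime];
  if (!adapter) throw new Error(`unknown runtime: ${runtime}`);
  return { ...baseLaunch(agent), runtime, flags: adapter(runtimeConfig || {}) };
}

// Made-up agent record for the demo.
const demoAgent = { id: "A1", promptFile: "prompts/a1.md", owns: ["src/example.mjs"] };
console.log(buildLaunch(demoAgent, "claude", { effort: "high" }));
console.log(buildLaunch(demoAgent, "codex", {}));
```

Retry-time fallback then amounts to calling `buildLaunch` again with a different runtime: the agent record, and therefore the wave contract, is untouched.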
package/docs/concepts/what-is-a-wave.md
CHANGED
@@ -2,6 +2,8 @@
 
 A wave is the main planning and execution unit in Wave Orchestration.
 
+It turns free-form agent runs into a bounded blackboard-style work package with shared state, explicit ownership, dynamic context, goal-driven execution, and proof-bounded closure.
+
 It is not just a prompt file. A wave is a bounded slice of repository work with:
 
 - explicit scope
@@ -34,6 +36,16 @@ Waves force a higher planning bar than ad hoc prompts. A good wave answers:
 - What evidence closes the wave?
 - Which dependencies, helper requests, or escalations can still block completion?
 
+## Why This Is A Blackboard-Style Model
+
+Wave is blackboard-style because agents work against shared state instead of treating chat output as the system of record.
+
+- the canonical coordination log is the machine-readable source of truth
+- the rolling board is a human projection over that state, not the scheduler's authority
+- shared summaries and per-agent inboxes are compiled views over the same state
+- helper assignments, clarification flow, dependencies, and integration all operate on that shared state
+- closure depends on the integrated state, not on whether an agent says "done"
+
 ## Wave Anatomy
 
 Wave markdown is the authored execution surface today. A typical wave can include:
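The blackboard bullets in the hunk above separate one canonical log from the views derived from it. A minimal sketch of that relationship follows, with the board and the inbox both computed from the same records. The record fields (`seq`, `kind`, `from`, `to`, `text`), the agent ids `A1` and `A2`, and the function names are hypothetical; the real coordination log format is not shown in this diff.

```javascript
// Canonical state: an append-only list of records (field names are assumptions).
const coordinationLog = [
  { seq: 1, kind: "claim", from: "A1", to: null, text: "Owns the example module" },
  { seq: 2, kind: "request", from: "A2", to: "A1", text: "Need the exported type" },
  { seq: 3, kind: "evidence", from: "A1", to: null, text: "Unit tests pass" },
];

// Projection for humans: one readable line per record, in log order.
function renderBoard(records) {
  return records.map((r) => `#${r.seq} [${r.kind}] ${r.from}: ${r.text}`);
}

// Projection for one agent: the requests addressed to it.
function buildInbox(records, agentId) {
  return records.filter((r) => r.kind === "request" && r.to === agentId);
}

// Both views are recomputed from the log, so they can never disagree with it.
console.log(renderBoard(coordinationLog).join("\n"));
console.log(buildInbox(coordinationLog, "A1"));
```

Because neither projection stores anything of its own, deleting and regenerating the board loses no information, which is what makes the log, not the board, the scheduler's authority.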
@@ -136,6 +148,22 @@ Current live waves are strict about closure artifacts:
 - `cont-QA` must emit both a final `Verdict:` line and a final `[wave-gate]` marker.
 - Replay keeps read-only compatibility with older traces and older evaluator-era artifacts, but live waves do not pass on verdict-only or underspecified closure markers.
 
+## Context Is Compiled At Runtime
+
+Wave also treats context as something to compile for the current task, not something humans should hand-maintain separately for each runtime.
+
+The active context for an agent is assembled from:
+
+- repository source and owned files
+- wave markdown and shared plan docs
+- saved project defaults such as `.wave/project-profile.json`
+- the generated shared summary and the agent's inbox
+- resolved skills and runtime-specific skill projections
+- selected Context7 snippets for external library truth
+- generated executor overlays and launch artifacts
+
+That is why switching an agent between Codex, Claude, or OpenCode does not require maintaining separate parallel context files. The orchestrator recomputes the context package for the selected runtime and the current wave state.
+
 ## What Makes A Wave "Done"
 
 A wave is not done because an agent said so. It is done only when the runtime surfaces agree:
package/docs/evals/README.md
CHANGED
@@ -16,6 +16,8 @@ The catalog is reference metadata, not a run-history database. It tells the wave
 
 For a full authored wave example that uses these patterns, see [docs/reference/sample-waves.md](../reference/sample-waves.md).
 
+These benchmark families are also Wave's operator-facing vocabulary for common MAS failure modes. For the research-side framing and the current architectural gaps, see [docs/research/coordination-failure-review.md](../research/coordination-failure-review.md).
+
 ## Migrating From Legacy Evaluator Waves
 
 If your `0.5.4`-era repo still talks about a single `evaluator` role, split that surface before adopting `0.6.1`:
package/docs/guides/terminal-surfaces.md
CHANGED
@@ -45,6 +45,8 @@ Use `tmux` when:
 
 By default the launcher can start per-wave dashboard sessions in tmux.
 
+When `--terminal-surface vscode` is active, Wave also maintains a stable current-wave dashboard terminal entry instead of creating a new wave-numbered dashboard attach target for every wave transition.
+
 Important flags:
 
 - `--no-dashboard`
package/docs/plans/current-state.md
CHANGED
@@ -1,8 +1,9 @@
 # Current State
 
-- The starter workspace in this source repo reflects the `0.6.
+- The starter workspace in this source repo reflects the `0.6.3` package release surface.
 - The repository contains the published `@chllming/wave-orchestration` package plus the starter scaffold used by `wave init`.
 - The runtime is package-first and non-destructive for adopting repos: `wave init --adopt-existing` records existing repo-owned plans, waves, prompts, and config without overwriting them, and `wave upgrade` writes only `.wave/install-state.json` plus `.wave/upgrade-history/`.
+- Runtime launch entrypoints now perform a best-effort npmjs version check, cache the result under `.wave/package-update-check.json`, and point operators at `pnpm exec wave self-update` when a newer published package exists.
 - This source repo is itself kept as an adopted Wave workspace, so `node scripts/wave.mjs doctor --json` should pass from the repo root.
 - The default lane is `main`.
 - Planner foundation is now shipped:
package/docs/plans/wave-orchestrator.md
CHANGED
@@ -4,6 +4,14 @@ The Wave Orchestrator coordinates repository work as bounded execution waves.
 
 For the broader docs map, concept pages, and workflow guides, start at [docs/README.md](../README.md).
 
+This runbook is the operational view of the architecture:
+
+- one wave contract defines goals, ownership, proof, and closure
+- one canonical coordination log acts as the shared blackboard state
+- generated board, shared summary, inboxes, ledger, and integration outputs are projections over that state
+- executor adapters preserve Claude, Codex, and OpenCode-specific runtime features at the edge
+- closure makes completion depend on integrated proof and shared state, not on free-form agent narration
+
 ## What It Does
 
 - parses wave plans from `docs/plans/waves/`
@@ -46,6 +54,7 @@ For the broader docs map, concept pages, and workflow guides, start at [docs/REA
 - `pnpm exec wave dep show --lane main --wave 0 --json`
 - `pnpm exec wave dep post --owner-lane main --requester-lane release --owner-wave 0 --requester-wave 2 --agent launcher --summary "Need shared-plan reconciliation" --target capability:docs-shared-plan --required`
 - `pnpm exec wave upgrade`
+- `pnpm exec wave self-update`
 
 ## Configuration
 
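The `wave self-update` command added to the list above is described elsewhere in this diff as detecting the workspace package manager before updating the dependency. One common way to do that is to look for a lockfile, sketched below. The lockfile names and the install commands are the standard ones for each tool; the detection order, the npm fallback, the `@latest` target, and the function names are assumptions, and the shipped `install.mjs` may detect the manager differently.

```javascript
// Conventional lockfile written by each package manager, checked in this order.
const LOCKFILES = [
  ["pnpm-lock.yaml", "pnpm"],
  ["yarn.lock", "yarn"],
  ["bun.lockb", "bun"],
  ["package-lock.json", "npm"],
];

// Pick a package manager from the file names found at the workspace root.
function detectPackageManager(rootFileNames) {
  const present = new Set(rootFileNames);
  const hit = LOCKFILES.find(([file]) => present.has(file));
  return hit ? hit[1] : "npm"; // assumed fallback when no lockfile exists
}

// Build the argv for updating a dev dependency to the latest published version.
function updateCommand(manager, pkg) {
  const target = `${pkg}@latest`;
  switch (manager) {
    case "pnpm": return ["pnpm", "add", "-D", target];
    case "yarn": return ["yarn", "add", "--dev", target];
    case "bun": return ["bun", "add", "--dev", target];
    default: return ["npm", "install", "--save-dev", target];
  }
}

const detected = detectPackageManager(["package.json", "pnpm-lock.yaml"]);
console.log(updateCommand(detected, "@chllming/wave-orchestration").join(" "));
```

Returning an argv array rather than a shell string keeps the sketch safe to pass to a process spawner without quoting concerns.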
@@ -143,6 +152,16 @@ Required inbound dependencies block autonomous next-wave start and lane finaliza
|
|
|
143
152
|
|
|
144
153
|
## Upgrade Flow
|
|
145
154
|
|
|
155
|
+
Fast path:
|
|
156
|
+
|
|
157
|
+
```bash
|
|
158
|
+
pnpm exec wave self-update
|
|
159
|
+
```
|
|
160
|
+
|
|
161
|
+
That command updates the dependency through the workspace package manager, prints the changelog delta since the recorded install, and then runs `wave upgrade` to record the new install-state and upgrade report.
|
|
162
|
+
|
|
163
|
+
Manual path:
|
|
164
|
+
|
|
146
165
|
1. Upgrade the package version:
|
|
147
166
|
|
|
148
167
|
```bash
|
|
@@ -260,7 +279,7 @@ The launcher entrypoint in `scripts/wave-orchestrator/launcher.mjs` now delegate
|
|
|
260
279
|
- Skills resolve only after that executor choice is known. Runtime-specific skill overlays are regenerated whenever retry-time fallback changes the selected executor.
|
|
261
280
|
- Runtime mix targets are enforced before launch and again before any retry-time fallback reassignment.
|
|
262
281
|
- Fallbacks are declared in profiles or lane policy, can be applied automatically on retry when the next executor is available and still satisfies mix targets, and are recorded in the ledger, integration summary, and traces when used.
|
|
263
|
-
- Generic `budget.minutes` caps per-agent attempt timeouts. Generic `budget.turns` seeds `claude.maxTurns` and `opencode.steps` when executor-specific values are not set.
|
|
282
|
+
- Generic `budget.minutes` caps per-agent attempt timeouts. Generic `budget.turns` seeds `claude.maxTurns` and `opencode.steps` when executor-specific values are not set; Codex turn ceilings remain external to Wave and show up in preview metadata as opaque when Wave cannot inspect them.
|
|
264
283
|
- The launcher writes runtime overlay files under `.tmp/<lane>-wave-launcher/executors/`; these should stay ignored and local.
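
The budget rule in the bullets above can be read as a small resolution function. This is a minimal sketch, not the launcher's actual code: `resolveTurnLimit`, its return shape, and the `"none"` source label are hypothetical. Only `budget.turns`, `claude.maxTurns`, `opencode.steps`, and the `not-set-by-wave` label appear in the documentation.

```javascript
// Hypothetical sketch: how a generic budget could seed runtime turn limits.
// Only budget.turns, claude.maxTurns, and opencode.steps come from the docs;
// the function name and return shape are illustrative assumptions.
function resolveTurnLimit(executorId, profile) {
  // Runtimes whose turn ceiling Wave is documented to set.
  const runtimeKeys = { claude: "maxTurns", opencode: "steps" };
  const key = Object.hasOwn(runtimeKeys, executorId) ? runtimeKeys[executorId] : null;

  // Codex (and anything else not listed): Wave emits no turn-limit flag,
  // so any effective ceiling is external and opaque to Wave.
  if (key === null) {
    return { limit: null, source: "not-set-by-wave" };
  }

  // An executor-specific value wins when present.
  const specific = profile[executorId]?.[key];
  if (Number.isInteger(specific)) {
    return { limit: specific, source: `${executorId}.${key}` };
  }

  // Otherwise the generic budget seeds the runtime value.
  const generic = profile.budget?.turns;
  if (Number.isInteger(generic)) {
    return { limit: generic, source: "budget.turns" };
  }

  return { limit: null, source: "none" };
}
```

Under these assumptions, a profile with `budget.turns` set and no `claude.maxTurns` resolves to the generic value for Claude, while the same profile resolves to no Wave-set limit for Codex.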
|
|
265
284
|
|
|
266
285
|
Runtime authoring examples:
|
|
@@ -294,7 +313,7 @@ Runtime authoring examples:
|
|
|
294
313
|
- opencode.config_json: {"instructions":["Keep shared-plan edits concise."]}
|
|
295
314
|
````
|
|
296
315
|
|
|
297
|
-
Dry-run is the intended validation path for these runtime surfaces. `wave launch --dry-run --no-dashboard` now writes compiled prompts, merged runtime overlays, and `launch-preview.json` files under `.tmp/<lane>-wave-launcher/dry-run/` so the harness can verify invocation shape without requiring the executor binaries to run.
|
|
316
|
+
Dry-run is the intended validation path for these runtime surfaces. `wave launch --dry-run --no-dashboard` now writes compiled prompts, merged runtime overlays, and `launch-preview.json` files under `.tmp/<lane>-wave-launcher/dry-run/` so the harness can verify invocation shape, attempt budgets, and known or opaque turn-limit metadata without requiring the executor binaries to run.
|
|
298
317
|
|
|
299
318
|
## Human Feedback Queue
|
|
300
319
|
|
|
@@ -308,7 +327,7 @@ pnpm exec wave feedback respond --id <request-id> --response "..."
|
|
|
308
327
|
|
|
309
328
|
## Closure Sweep
|
|
310
329
|
|
|
311
|
-
If implementation agents ran, the launcher does not stop at `exit 0`. It checks implementation exit contracts, promoted component proof, helper assignments, required dependencies, and the integration recommendation first. When present, `cont-EVAL` must satisfy its declared eval targets before integration can close. Optional security review then runs before integration so the reviewer can publish findings and approval-sensitive actions while the wave is still active. In the default planner shape `E0` is report-only; if a wave explicitly assigns `E0` non-report files, the launcher also applies the normal implementation proof gates to that role. Security reviewers stay report-only by default. Documentation and cont-QA closure only run after integration is explicitly ready for doc closure; if `cont-EVAL`, security review, or integration reports more work, or if helper assignments or required dependency tickets remain open, the wave stops there and retries only the implicated owners plus the relevant closure steward.
|
|
330
|
+
If implementation agents ran, the launcher does not stop at `exit 0`. It checks implementation exit contracts, promoted component proof, helper assignments, required dependencies, and the integration recommendation first. When present, `cont-EVAL` must satisfy its declared eval targets before integration can close. Optional security review then runs before integration so the reviewer can publish findings and approval-sensitive actions while the wave is still active. In the default planner shape `E0` is report-only; if a wave explicitly assigns `E0` non-report files, the launcher also applies the normal implementation proof gates to that role. Security reviewers stay report-only by default. Documentation and cont-QA closure only run after integration is explicitly ready for doc closure; if `cont-EVAL`, security review, or integration reports more work, or if helper assignments or required dependency tickets remain open, the wave stops there and retries only the implicated owners plus the relevant closure steward. When multiple implementation agents share a promoted component, owners that already landed valid proof stay reusable while the launcher retries only the sibling owners that still owe closure evidence.
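
The last sentence of the paragraph above describes a partition of a shared component's owners into those whose proof can be reused and those that still need a retry. A rough sketch of that selection follows. The record shape (`agentId`, `proofValid`) and the function name are hypothetical, since the diff does not show the launcher's real data structures.

```javascript
// Hypothetical sketch: split the owners of one promoted component into
// reusable owners (valid proof already landed) and owners to retry.
// Field names agentId and proofValid are illustrative assumptions.
function partitionComponentOwners(owners) {
  const reusable = [];
  const retry = [];
  for (const owner of owners) {
    if (owner.proofValid === true) {
      reusable.push(owner.agentId);
    } else {
      // Missing or invalid proof both mean the owner still owes evidence.
      retry.push(owner.agentId);
    }
  }
  return { reusable, retry };
}
```

The point of the split is that a retry pass only relaunches the `retry` list, so sibling owners that already closed their part are not rerun.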
|
|
312
331
|
|
|
313
332
|
Live closure is fail-closed:
|
|
314
333
|
|
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
# Runtime Configuration Reference
|
|
2
2
|
|
|
3
|
-
This directory is the canonical reference for executor configuration in Wave
|
|
3
|
+
This directory is the canonical reference for executor configuration in the packaged Wave release.
|
|
4
4
|
|
|
5
5
|
Use it when you need the full supported surface for:
|
|
6
6
|
|
|
@@ -65,7 +65,7 @@ These fields are shared across runtimes:
|
|
|
65
65
|
| Model | `model` in profile, `executors.claude.model`, `executors.opencode.model` | `model` | Codex uses shared `model` from profile or agent only |
|
|
66
66
|
| Fallbacks | `fallbacks` in profile | `fallbacks` | Runtime ids used for retry-time reassignment |
|
|
67
67
|
| Tags | `tags` in profile | `tags` | Stored in resolved executor state for policy and traces |
|
|
68
|
-
| Budget turns | `budget.turns` in profile | `budget.turns` | Seeds Claude `maxTurns` and OpenCode `steps` when runtime-specific values are absent |
|
|
68
|
+
| Budget turns | `budget.turns` in profile | `budget.turns` | Seeds Claude `maxTurns` and OpenCode `steps` when runtime-specific values are absent; it does not set a Codex turn limit |
|
|
69
69
|
| Budget minutes | `budget.minutes` in profile | `budget.minutes` | Caps attempt timeout |
|
|
70
70
|
|
|
71
71
|
## Runtime Pages
|
|
@@ -83,7 +83,7 @@ Wave writes runtime artifacts here:
|
|
|
83
83
|
|
|
84
84
|
Common files:
|
|
85
85
|
|
|
86
|
-
- `launch-preview.json`: resolved invocation lines, env vars, and
|
|
86
|
+
- `launch-preview.json`: resolved invocation lines, env vars, retry mode, and structured attempt/turn-limit metadata
|
|
87
87
|
- `skills.resolved.md`: compact metadata-first skill catalog for the selected agent and runtime
|
|
88
88
|
- `skills.expanded.md`: full canonical/debug skill payload with `SKILL.md` bodies and adapters
|
|
89
89
|
- `skills.metadata.json`: resolved skill ids, activation metadata, permissions, hashes, and generated artifact paths
|
|
@@ -100,7 +100,7 @@ Runtime-specific delivery:
|
|
|
100
100
|
- OpenCode injects the compact catalog into `opencode.json` and attaches `skill.json`, `SKILL.md`, the selected adapter, and recursive `references/**` files through `--file`.
|
|
101
101
|
- Local keeps skills prompt-only.
|
|
102
102
|
|
|
103
|
-
`launch-preview.json` also records the resolved skill metadata
|
|
103
|
+
`launch-preview.json` also records the resolved skill metadata plus a `limits` section. For Claude and OpenCode, that section reports the known turn ceiling and whether it came from the runtime-specific setting or generic `budget.turns`. For Codex, it explicitly records that Wave emitted no turn-limit flag and that any effective ceiling may come from the selected Codex profile or upstream runtime.
|
|
104
104
|
|
|
105
105
|
## Recommended Validation Path
|
|
106
106
|
|
|
@@ -12,6 +12,7 @@ Wave launches Claude headlessly with `claude -p --no-session-persistence`.
|
|
|
12
12
|
| Prompt mode | `executors.claude.appendSystemPromptMode` | n/a | Uses `--append-system-prompt-file` or `--system-prompt-file` |
|
|
13
13
|
| Permission mode | `executors.claude.permissionMode`, `executors.profiles.<name>.claude.permissionMode` | `claude.permission_mode` | Adds `--permission-mode <mode>` |
|
|
14
14
|
| Permission prompt tool | `executors.claude.permissionPromptTool`, `executors.profiles.<name>.claude.permissionPromptTool` | `claude.permission_prompt_tool` | Adds `--permission-prompt-tool <tool>` |
|
|
15
|
+
| Effort | `executors.claude.effort`, `executors.profiles.<name>.claude.effort` | `claude.effort` | Adds `--effort low|medium|high|max` |
|
|
15
16
|
| Max turns | `executors.claude.maxTurns`, `executors.profiles.<name>.claude.maxTurns` | `claude.max_turns` | Adds `--max-turns <n>` |
|
|
16
17
|
| MCP config | `executors.claude.mcpConfig`, `executors.profiles.<name>.claude.mcpConfig` | `claude.mcp_config` | Adds repeated `--mcp-config <path>` |
|
|
17
18
|
| Strict MCP mode | `executors.claude.strictMcpConfig`, `executors.profiles.<name>.claude.strictMcpConfig` | n/a | Adds `--strict-mcp-config` |
|
|
@@ -27,6 +28,8 @@ Wave launches Claude headlessly with `claude -p --no-session-persistence`.
|
|
|
27
28
|
|
|
28
29
|
Wave always writes `claude-system-prompt.txt` for the harness runtime instructions.
|
|
29
30
|
|
|
31
|
+
Wave validates the effort enum only. Model-specific compatibility for values such as `max` remains enforced by Claude Code itself.
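
The enum-only validation described above can be pictured as follows. This is a sketch under stated assumptions: `claudeEffortArgs` is a hypothetical helper, not a function from the package. The allowed values and the `--effort` flag are taken from the Effort table row.

```javascript
// Hypothetical sketch of enum-only validation for claude.effort.
// The allowed values mirror the documented `--effort low|medium|high|max`.
const CLAUDE_EFFORT_VALUES = ["low", "medium", "high", "max"];

function claudeEffortArgs(effort) {
  // No effort configured: emit no flag at all.
  if (effort === undefined || effort === null) {
    return [];
  }
  if (!CLAUDE_EFFORT_VALUES.includes(effort)) {
    throw new Error(
      `Invalid claude.effort "${effort}"; expected one of ${CLAUDE_EFFORT_VALUES.join(", ")}`
    );
  }
  // Whether the selected model accepts this value (for example "max")
  // is deliberately not checked here; that is left to Claude Code.
  return ["--effort", effort];
}
```

The check rejects a value outside the enum but does not look at the model, which matches the division of responsibility the note describes.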
|
|
32
|
+
|
|
30
33
|
Wave writes `claude-settings.json` only when at least one inline overlay input is present:
|
|
31
34
|
|
|
32
35
|
- `settingsJson`
|
|
@@ -57,6 +60,7 @@ If no inline overlay data is present, Wave passes the base `claude.settings` fil
|
|
|
57
60
|
},
|
|
58
61
|
"claude": {
|
|
59
62
|
"agent": "reviewer",
|
|
63
|
+
"effort": "high",
|
|
60
64
|
"permissionMode": "plan",
|
|
61
65
|
"allowedTools": ["Read"],
|
|
62
66
|
"disallowedTools": ["Edit"]
|
|
@@ -84,6 +88,7 @@ If no inline overlay data is present, Wave passes the base `claude.settings` fil
|
|
|
84
88
|
|
|
85
89
|
- id: claude
|
|
86
90
|
- model: claude-sonnet-4-6
|
|
91
|
+
- claude.effort: high
|
|
87
92
|
- claude.permission_mode: plan
|
|
88
93
|
- claude.max_turns: 4
|
|
89
94
|
- claude.settings_json: {"permissions":{"allow":["Read"]}}
|
|
@@ -102,4 +107,4 @@ For a dry run, inspect:
|
|
|
102
107
|
- `claude-settings.json`, when generated
|
|
103
108
|
- `launch-preview.json`
|
|
104
109
|
|
|
105
|
-
`launch-preview.json` shows the final `claude -p` invocation
|
|
110
|
+
`launch-preview.json` shows the final `claude -p` invocation, whether `--effort`, `--settings`, `--allowedTools`, `--disallowedTools`, `--mcp-config`, or `--system-prompt-file` were included, and the resolved `limits` block for attempt timeout plus known turn ceiling.
|
|
@@ -20,6 +20,7 @@ Wave launches Codex with `codex exec` and pipes the generated task prompt throug
|
|
|
20
20
|
## Notes
|
|
21
21
|
|
|
22
22
|
- There is no `executors.codex.model` key today. Use profile `model` or per-agent `model`.
|
|
23
|
+
- Generic `budget.turns` does not set a Codex turn limit. If Codex stops on a turn ceiling, that limit came from the selected Codex profile or upstream Codex runtime, not from a Wave-emitted CLI flag.
|
|
23
24
|
- `codex.images`, `codex.add_dirs`, and `codex.config` accept either a string array in `wave.config.json` or a comma-separated list in a wave file.
|
|
24
25
|
- Relative paths are passed to Codex relative to the repository root because Wave launches the executor from the repo workspace.
|
|
25
26
|
|
|
@@ -35,7 +36,6 @@ Wave launches Codex with `codex exec` and pipes the generated task prompt throug
|
|
|
35
36
|
"model": "gpt-5-codex",
|
|
36
37
|
"fallbacks": ["claude", "opencode"],
|
|
37
38
|
"budget": {
|
|
38
|
-
"turns": 12,
|
|
39
39
|
"minutes": 45
|
|
40
40
|
},
|
|
41
41
|
"codex": {
|
|
@@ -78,4 +78,4 @@ For a dry run, inspect:
|
|
|
78
78
|
- `launch-preview.json` for the final `codex exec` command
|
|
79
79
|
- any referenced prompt file under `.tmp/<lane>-wave-launcher/dry-run/prompts/`
|
|
80
80
|
|
|
81
|
-
The preview records the exact `--profile`, repeated `-c`, `--image`, and `--add-dir` flags that Wave would use in a live launch.
|
|
81
|
+
The preview records the exact `--profile`, repeated `-c`, `--image`, and `--add-dir` flags that Wave would use in a live launch. It also includes a `limits` block that makes Wave's Codex visibility explicit: `turnLimitSource: "not-set-by-wave"` means Wave emitted no Codex turn-limit flag, so any effective ceiling is external to the Wave CLI invocation.
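
A harness that consumes the preview might interpret the `limits` block along these lines. Treat this as a sketch: only `limits` and `turnLimitSource: "not-set-by-wave"` are named in the text above. The `turnLimit` field and the `describeTurnLimit` helper are assumptions made for illustration, so check a generated `launch-preview.json` for the real field names.

```javascript
// Hypothetical sketch: summarize turn-limit visibility from a parsed
// launch-preview.json object. `turnLimit` is an assumed field name;
// `turnLimitSource` and "not-set-by-wave" come from the documentation.
function describeTurnLimit(preview) {
  const limits = preview.limits ?? {};
  if (limits.turnLimitSource === "not-set-by-wave") {
    // Wave emitted no turn-limit flag; any ceiling is external to Wave.
    return "opaque";
  }
  if (Number.isInteger(limits.turnLimit)) {
    return `known:${limits.turnLimit}`;
  }
  return "unknown";
}
```

A dry-run check could then treat `opaque` as expected for Codex agents rather than as a missing budget.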
|
|
@@ -90,4 +90,4 @@ For a dry run, inspect:
|
|
|
90
90
|
- `opencode.json`
|
|
91
91
|
- `launch-preview.json`
|
|
92
92
|
|
|
93
|
-
`launch-preview.json` shows the final `opencode run` command
|
|
93
|
+
`launch-preview.json` shows the final `opencode run` command, the exported `OPENCODE_CONFIG` path, and the resolved `limits` block for attempt timeout plus known step ceiling.
|
|
@@ -7,6 +7,8 @@ summary: "Primary external sources used as inspiration for planning, harness des
|
|
|
7
7
|
|
|
8
8
|
This repository does not commit converted paper/article caches. Keep any hydrated local copies under `docs/research/agent-context-cache/` or another ignored cache directory.
|
|
9
9
|
|
|
10
|
+
For a narrative synthesis of the most relevant MAS failure modes and how Wave responds to them, start with [coordination-failure-review.md](./coordination-failure-review.md) and then use this page as the bibliography.
|
|
11
|
+
|
|
10
12
|
## Practice Articles
|
|
11
13
|
|
|
12
14
|
- [Harness engineering: leveraging Codex in an agent-first world](https://openai.com/index/harness-engineering/)
|