@open-agent-toolkit/cli 0.1.6 → 0.1.7

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -1,6 +1,6 @@
1
1
  ---
2
2
  name: oat-reviewer
3
- version: 1.0.1
3
+ version: 1.0.2
4
4
  description: Unified reviewer for OAT projects - mode-aware verification of requirements/design alignment and code quality. Writes review artifact to disk.
5
5
  tools: Read, Bash, Grep, Glob, Write
6
6
  color: yellow
@@ -32,6 +32,8 @@ Reviews catch issues before they ship:
32
32
 
33
33
  Your review artifact feeds into `oat-project-review-receive`, which converts findings into plan tasks for systematic gap closure.
34
34
 
35
+ Some findings are artifact drift rather than implementation defects. If shipped implementation is defensible but `spec.md`, `design.md`, or `plan.md` is stale, frame the issue as artifact alignment and say which artifact should change. Do not require a code fix solely because the design artifact lagged behind implementation.
36
+
35
37
  ## Inputs
36
38
 
37
39
  You will be given a "Review Scope" block including:
@@ -161,6 +163,11 @@ For each design decision relevant to scope:
161
163
  - Do endpoints match the design?
162
164
  - Are error responses as specified?
163
165
 
166
+ 4. **Artifact drift classification**
167
+ - If implementation diverges from design/spec/plan, decide whether the implementation is wrong or the artifact is stale.
168
+ - When the implementation is defensible, write the finding as stale-artifact alignment guidance instead of a code defect.
169
+ - Include enough rationale for `oat-project-review-receive` to convert the finding into an artifact-alignment task or explicit deferral.
170
+
164
171
  ### Step 6: Verify Code Quality
165
172
 
166
173
  This step applies to **code reviews** only.
@@ -204,6 +211,7 @@ Group findings by severity:
204
211
  - Missing error handling
205
212
  - Significant maintainability issues
206
213
  - Missing tests for important paths
214
+ - Stale spec/design/plan artifact that conflicts with a defensible implementation and should be aligned before closeout
207
215
 
208
216
  **Minor** (fix if time permits)
209
217
 
@@ -211,6 +219,7 @@ Group findings by severity:
211
219
  - Style issues
212
220
  - Minor refactoring opportunities
213
221
  - Documentation gaps
222
+ - Low-impact artifact wording drift where implementation is defensible and the stale wording is unlikely to mislead near-term work
214
223
 
215
224
  ### Step 8: Write Review Artifact
216
225
 
@@ -1,6 +1,6 @@
1
1
  {
2
- "cli": "0.1.6",
3
- "docs-config": "0.1.6",
4
- "docs-theme": "0.1.6",
5
- "docs-transforms": "0.1.6"
2
+ "cli": "0.1.7",
3
+ "docs-config": "0.1.7",
4
+ "docs-theme": "0.1.7",
5
+ "docs-transforms": "0.1.7"
6
6
  }
@@ -1,6 +1,6 @@
1
1
  ---
2
2
  name: oat-project-implement
3
- version: 2.0.16
3
+ version: 2.0.17
4
4
  description: Use when plan.md is ready for execution. Dispatches phase-level subagents with bounded fix loops; supports plan-declared parallel phase groups with worktree-isolated execution and ordered fan-in.
5
5
  argument-hint: '[--retry-limit <N>] [--dry-run]'
6
6
  disable-model-invocation: true
@@ -28,6 +28,9 @@ After every code commit and after every phase/review-fix completion, you MUST co
28
28
  **CRITICAL — Review boundaries require a committed artifact baseline.**
29
29
  Do not enter checkpoint review, final review, revise, or PR-final handoff with dirty core project artifacts (`discovery.md`, `spec.md`, `design.md`, `plan.md`, `implementation.md`, `state.md`, plus `.oat/state.md` when refreshed). If one of those boundaries is next and artifact bookkeeping is still uncommitted, stop and create the bookkeeping commit first.
30
30
 
31
+ **CRITICAL — Intentional artifact divergence must be recorded.**
32
+ If implementation intentionally diverges from `spec.md`, `design.md`, or `plan.md`, record the delta in `implementation.md` before the next phase/review boundary. Include what diverged, why it diverged, whether the implementation or original artifact is now source of truth, and any follow-up artifact updates or explicit deferral. Do not leave accepted design drift only in chat, a review artifact, or code comments; final summary generation depends on `implementation.md` preserving the delta.
33
+
31
34
  ## Progress Indicators (User-Facing)
32
35
 
33
36
  When executing this skill, provide lightweight progress feedback so the user can tell what's happening after they confirm.
@@ -557,6 +560,7 @@ For each phase `pNN` in the plan (or each phase in the current parallel group),
557
560
  spec: {PROJECT_PATH}/spec.md
558
561
  implementation: {PROJECT_PATH}/implementation.md
559
562
  discovery: {PROJECT_PATH}/discovery.md
563
+ delta_recording: record any intentional divergence from spec/design/plan in implementation.md with rationale, source of truth, and follow-up artifact disposition
560
564
  commit_convention: {from plan.md header}
561
565
  workflow_mode: {from state.md or plan.md frontmatter}
562
566
  model_axis: {selected:<value> | inherited | not-applicable | host-auto; omit if unknown}
@@ -793,6 +797,14 @@ Append a new entry to the `## Orchestration Runs` section between the `<!-- orch
793
797
  #### Outstanding Items
794
798
 
795
799
  - {None | list of excluded phases with review paths and worktree paths}
800
+
801
+ #### Artifact / Design Deltas
802
+
803
+ Run-scoped snapshot only. The durable record is `## Deviations from Plan / Design`; consolidate any non-`None` entries there at the next phase boundary.
804
+
805
+ | Task / Review | Source Artifact | Planned / Documented | Actual / Accepted | Reason | Source of Truth | Follow-up |
806
+ | ----------------------------- | ----------------------------------- | ------------------------------- | -------------------------------------- | ---------------------------- | ------------------------- | ------------------------------------------- |
807
+ | {task_id/review_id or `None`} | {spec.md/design.md/plan.md section} | {planned behavior/taxonomy/API} | {actual shipped behavior/taxonomy/API} | {why divergence is accepted} | {implementation/artifact} | {artifact update task or explicit deferral} |
796
808
  ```
797
809
 
798
810
  Append only — never overwrite prior run entries.
@@ -887,6 +899,14 @@ When pausing:
887
899
  - Verification run
888
900
  - Notable decisions/deviations
889
901
 
902
+ **Design/artifact deltas (required when present):**
903
+
904
+ - If a completed task intentionally diverged from `spec.md`, `design.md`, or `plan.md`, update the `## Deviations from Plan / Design` table in `implementation.md`.
905
+ - For existing project artifacts, treat any `## Deviations...` heading as the deviations section; migrate to the preferred `## Deviations from Plan / Design` heading and table shape when already touching the section.
906
+ - Each delta must include: the affected source artifact/section, the planned/documented expectation, the actual shipped implementation, the reason the divergence is accepted, the current source of truth, and any follow-up artifact update task or explicit deferral.
907
+ - If the implementation is now source of truth and the design/spec/plan is stale, write that directly. Do not treat the stale artifact as a no-op just because code is correct.
908
+ - If no deltas exist for the phase, do not invent one; leave the table unchanged.
909
+
890
910
  **Bookkeeping commit (required):**
891
911
 
892
912
  **DO NOT SKIP.** This commit prevents state drift across sessions.
@@ -1,6 +1,6 @@
1
1
  ---
2
2
  name: oat-project-review-provide
3
- version: 1.3.3
3
+ version: 1.3.4
4
4
  description: Use when completed work in an active OAT project needs a quality gate before merge. Performs a lifecycle-scoped review after a task, phase, or full implementation, unlike oat-review-provide.
5
5
  disable-model-invocation: true
6
6
  user-invocable: true
@@ -15,6 +15,8 @@ Request and execute a code or artifact review for the current project scope.
15
15
 
16
16
  Produce an independent review artifact that verifies requirements/design alignment (mode-aware) and code quality.
17
17
 
18
+ Reviewers should distinguish implementation defects from artifact drift. If code is defensible but `spec.md`, `design.md`, or `plan.md` is stale, frame the finding as artifact alignment rather than a required code change.
19
+
18
20
  ## Prerequisites
19
21
 
20
22
  **Required:** Active project with at least one completed task.
@@ -481,6 +483,12 @@ Build the "Review Scope" metadata for the reviewer:
481
483
  - Deferred Medium count: {DEFERRED_MEDIUM_COUNT}
482
484
  - Deferred Minor count: {DEFERRED_MINOR_COUNT}
483
485
  {DEFERRED_LEDGER}
486
+
487
+ **Design Drift Review Guidance:**
488
+
489
+ - If implementation differs from `spec.md`, `design.md`, or `plan.md`, decide whether the code should change or whether the artifact is stale.
490
+ - Use artifact-alignment framing when shipped implementation is defensible and the lifecycle artifact should be updated.
491
+ - Do not force a code-defect framing for accepted design drift; `oat-project-review-receive` can convert artifact drift into alignment tasks or explicit deferrals.
484
492
  ```
485
493
 
486
494
  ### Step 6: Execute Review (3-Tier Capability Model)
@@ -1,6 +1,6 @@
1
1
  ---
2
2
  name: oat-project-review-receive
3
- version: 1.4.1
3
+ version: 1.4.2
4
4
  description: Use when review findings from oat-project-review-provide need closure. Converts review artifacts into actionable plan tasks.
5
5
  disable-model-invocation: true
6
6
  user-invocable: true
@@ -52,6 +52,7 @@ When executing this skill, provide lightweight progress feedback so the user can
52
52
  - No re-reviewing
53
53
  - For `artifact` reviews: no converting findings into plan tasks
54
54
  - For `artifact` reviews: no deferring findings by default
55
+ - No treating accepted design/code drift as a no-op; accepted drift must be converted to an artifact-alignment task or explicitly deferred, with an `implementation.md` note
55
56
 
56
57
  **ALLOWED Activities:**
57
58
 
@@ -167,6 +168,10 @@ For each finding, build a structured register entry:
167
168
  - Agent analysis (agree/disagree + why)
168
169
  - Recommendation (convert to task now vs defer with rationale)
169
170
  - Task Scope (`Large` | `Moderate` | `Minor` | `Negligible`)
171
+ - Drift disposition, when applicable:
172
+ - `code_fix_required` — implementation should change
173
+ - `artifact_alignment_required` — shipped implementation is defensible; design/docs/plan should be aligned
174
+ - `explicit_deferral` — drift is accepted for now with a concrete rationale and follow-up trigger
170
175
  - For `artifact` reviews, use dispositions:
171
176
  - `resolve_in_artifact`
172
177
  - `rejected_with_rationale` (invalid/not applicable)
@@ -386,6 +391,10 @@ Add a note to implementation.md:
386
391
 
387
392
  **New tasks added:** {task_ids}
388
393
 
394
+ **Design drift / artifact alignment notes:**
395
+
396
+ - {finding_id}: {review found stale design/spec/plan relative to shipped implementation; why the implementation is accepted; source of truth; artifact-alignment task ID or explicit deferral}
397
+
389
398
  **Next:** Execute fix tasks via the `oat-project-implement` skill.
390
399
 
391
400
  After the fix tasks are complete:
@@ -399,6 +408,10 @@ After the fix tasks are complete:
399
408
  - If `{PROJECT_PATH}/implementation.md` exists, ensure it will resume correctly after this skill:
400
409
  - If `oat_current_task_id` is `null` (or points at already-completed work), set it to the **first newly-added review-fix task ID** (or the next incomplete task in plan order).
401
410
  - Update the Progress Overview table totals (tasks + completed) if they are present and depend on task counts.
411
+ - If any finding is resolved by accepting the shipped implementation and aligning stale artifacts instead of changing code, add an explicit review note under the current "Review Received" section and update `## Deviations from Plan / Design` when that table exists.
412
+ - For existing project artifacts, treat any `## Deviations...` heading as the deviations section; migrate to the preferred `## Deviations from Plan / Design` heading and table shape when already touching the section.
413
+ - The note must say the review found design/spec/plan drift, why the shipped implementation is accepted, which source is now authoritative, and which artifact-alignment task will update the stale artifact.
414
+ - If the artifact update is intentionally deferred, record the deferral rationale and follow-up trigger in `implementation.md`.
402
415
  - Update `{PROJECT_PATH}/state.md` frontmatter so routing/UI is accurate:
403
416
  - `oat_phase: implement`
404
417
  - `oat_phase_status: in_progress`
@@ -503,6 +516,13 @@ If any Medium is proposed for deferral:
503
516
  - If user declines deferral, convert that Medium to a fix task now.
504
517
  - If user approves deferral, record rationale in `implementation.md` under "Deferred Findings (Medium)".
505
518
 
519
+ Design drift handling applies before Medium/Minor convenience deferrals:
520
+
521
+ - If a review finding reveals that the design artifact is stale relative to a defensible implementation, do not treat this as a no-op.
522
+ - Either convert the finding to an artifact-alignment task or record an explicit deferral.
523
+ - In both cases, add an `implementation.md` review note so final summary generation can preserve the design delta.
524
+ - The note must include what drift was found, why the implementation is accepted, whether implementation or artifact is source of truth, and the artifact task or deferral that will align the lifecycle record.
525
+
506
526
  Minor findings handling is scope-aware:
507
527
 
508
528
  - If `scope != final`:
@@ -1,6 +1,6 @@
1
1
  ---
2
2
  name: oat-project-summary
3
- version: 1.1.1
3
+ version: 1.1.2
4
4
  description: Use when the user requests or confirms summarizing an active OAT project — e.g. "summarize the project", "generate the summary", "run oat-project-summary", or confirms a previously offered summary run. Do NOT auto-invoke when implementation completes. Generates summary.md from project artifacts as institutional memory.
5
5
  disable-model-invocation: false
6
6
  user-invocable: true
@@ -149,6 +149,8 @@ For each section, synthesize content from the relevant artifacts. Apply these ru
149
149
 
150
150
  **Grounding rule:** Prefer implementation.md outcomes over design.md plans. If the implementation diverged from the design, reflect what actually happened.
151
151
 
152
+ **Design delta rule:** Populate `Design Deltas` from both direct implementation deviations and review-received design drift decisions recorded in `implementation.md`. A review finding may decide that shipped implementation is defensible and the artifact is stale; when `implementation.md` records that acceptance, carry it forward as a design delta with the rationale and follow-up artifact disposition.
153
+
152
154
  **Section omission rule:** If a section would have no meaningful content, omit it entirely (remove the heading). Do not leave empty sections or "N/A" placeholders.
153
155
 
154
156
  **Conciseness constraint (NFR3):** Target under 200 lines total. If a draft exceeds this, trim narrative sections (What Was Implemented, Notable Challenges) to essential points. Revision History entries: 2-3 sentences per round max.
@@ -157,18 +159,18 @@ For each section, synthesize content from the relevant artifacts. Apply these ru
157
159
 
158
160
  **Section sources:**
159
161
 
160
- | Section | Primary Sources |
161
- | -------------------- | ----------------------------------------------------------- |
162
- | Overview | discovery.md initial request, spec.md problem statement |
163
- | What Was Implemented | implementation.md task outcomes, plan.md phase structure |
164
- | Key Decisions | design.md decisions, implementation.md notes/decisions |
165
- | Design Deltas | implementation.md deviations table |
166
- | Notable Challenges | implementation.md issues/blockers in task notes |
167
- | Tradeoffs Made | implementation.md decisions, design.md tradeoff sections |
168
- | Integration Notes | implementation.md notes about cross-cutting concerns |
169
- | Revision History | plan.md p-revN phases, implementation.md revision notes |
170
- | Follow-up Items | implementation.md deferred findings, plan.md deferred items |
171
- | Associated Issues | state.md `associated_issues` field |
162
+ | Section | Primary Sources |
163
+ | -------------------- | ---------------------------------------------------------------------- |
164
+ | Overview | discovery.md initial request, spec.md problem statement |
165
+ | What Was Implemented | implementation.md task outcomes, plan.md phase structure |
166
+ | Key Decisions | design.md decisions, implementation.md notes/decisions |
167
+ | Design Deltas | implementation.md deviations table; review-received design drift notes |
168
+ | Notable Challenges | implementation.md issues/blockers in task notes |
169
+ | Tradeoffs Made | implementation.md decisions, design.md tradeoff sections |
170
+ | Integration Notes | implementation.md notes about cross-cutting concerns |
171
+ | Revision History | plan.md p-revN phases, implementation.md revision notes |
172
+ | Follow-up Items | implementation.md deferred findings, plan.md deferred items |
173
+ | Associated Issues | state.md `associated_issues` field |
172
174
 
173
175
  **For incremental updates (re-run):**
174
176
 
@@ -165,13 +165,13 @@ Chronological log of implementation progress.
165
165
 
166
166
  ---
167
167
 
168
- ## Deviations from Plan
168
+ ## Deviations from Plan / Design
169
169
 
170
- Document any deviations from the original plan.
170
+ Document any intentional deviations from the original plan, spec, or design. Include accepted review findings where the shipped implementation is source of truth and a lifecycle artifact needs alignment.
171
171
 
172
- | Task | Planned | Actual | Reason |
173
- | ---- | ------- | ------ | ------ |
174
- | - | - | - | - |
172
+ | Task / Review | Source Artifact | Planned / Documented | Actual / Accepted | Reason | Source of Truth | Follow-up |
173
+ | ------------- | --------------- | -------------------- | ----------------- | ------ | --------------- | --------- |
174
+ | - | - | - | - | - | - | - |
175
175
 
176
176
  ## Test Results
177
177
 
@@ -37,7 +37,8 @@ where decisions were made or changed during implementation.}
37
37
  <!-- Omit this section if there were no deviations from the original design. -->
38
38
 
39
39
  {Where the final result diverged from the original design and why.
40
- Pull from implementation.md deviations table.}
40
+ Pull from implementation.md deviations table and review-received design drift notes.
41
+ Include accepted cases where shipped implementation became source of truth and a stale lifecycle artifact needs alignment or was explicitly deferred.}
41
42
 
42
43
  ## Notable Challenges
43
44
 
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "@open-agent-toolkit/cli",
3
- "version": "0.1.6",
3
+ "version": "0.1.7",
4
4
  "private": false,
5
5
  "description": "Open Agent Toolkit CLI",
6
6
  "homepage": "https://github.com/voxmedia/open-agent-toolkit/tree/main/packages/cli",
@@ -33,7 +33,7 @@
33
33
  "ora": "^9.0.0",
34
34
  "yaml": "2.8.2",
35
35
  "zod": "^3.25.76",
36
- "@open-agent-toolkit/control-plane": "0.1.6"
36
+ "@open-agent-toolkit/control-plane": "0.1.7"
37
37
  },
38
38
  "devDependencies": {
39
39
  "@types/node": "^22.10.0",