codebyplan 1.13.28 → 1.13.30
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/dist/cli.js +728 -18
- package/package.json +1 -1
- package/templates/README.md +16 -13
- package/templates/agents/cbp-cc-executor.md +6 -9
- package/templates/agents/cbp-improve-round.md +1 -1
- package/templates/agents/cbp-round-executor.md +1 -2
- package/templates/agents/cbp-task-check.md +12 -8
- package/templates/hooks/cbp-mcp-round-sync.sh +9 -0
- package/templates/rules/README.md +13 -8
- package/templates/rules/cbp-operating-gotchas.md +64 -0
- package/templates/settings.project.base.json +3 -3
- package/templates/skills/cbp-build-cc-agent/SKILL.md +3 -4
- package/templates/skills/cbp-build-cc-agent/examples/with-skills-preload.md +2 -3
- package/templates/skills/cbp-build-cc-agent/reference/frontmatter-fields.md +0 -1
- package/templates/skills/cbp-build-cc-agent/scripts/validate-agent.sh +0 -6
- package/templates/skills/cbp-build-cc-agent/templates/agent.md +1 -2
- package/templates/skills/cbp-build-cc-claude-file/SKILL.md +16 -2
- package/templates/skills/cbp-build-cc-claude-file/reference/what-belongs.md +1 -1
- package/templates/skills/cbp-build-cc-mode/SKILL.md +5 -4
- package/templates/skills/cbp-build-cc-rule/SKILL.md +2 -2
- package/templates/skills/cbp-build-cc-settings/reference/cbp-permission-policy.md +3 -2
- package/templates/skills/cbp-build-cc-skill/reference/cbp-quality.md +1 -1
- package/templates/skills/cbp-merge-main/SKILL.md +1 -1
- package/templates/skills/cbp-round-complete/SKILL.md +164 -0
- package/templates/skills/cbp-round-end/SKILL.md +16 -14
- package/templates/skills/cbp-round-end/reference/findings-presentation.md +7 -17
- package/templates/skills/cbp-round-execute/SKILL.md +4 -0
- package/templates/skills/cbp-round-input/SKILL.md +6 -6
- package/templates/skills/cbp-round-start/SKILL.md +12 -15
- package/templates/skills/cbp-round-update/SKILL.md +31 -143
- package/templates/skills/cbp-standalone-task-check/SKILL.md +2 -2
- package/templates/skills/cbp-standalone-task-complete/SKILL.md +4 -3
- package/templates/skills/cbp-standalone-task-testing/SKILL.md +4 -4
- package/templates/skills/cbp-task-check/SKILL.md +3 -3
- package/templates/skills/cbp-task-complete/SKILL.md +7 -6
- package/templates/skills/cbp-task-testing/SKILL.md +3 -5
- package/templates/skills/cbp-todo/SKILL.md +1 -1
- package/templates/skills/cbp-build-cc-memory/SKILL.md +0 -201
- package/templates/skills/cbp-build-cc-memory/examples/feedback-memory.md +0 -11
- package/templates/skills/cbp-build-cc-memory/examples/project-memory.md +0 -11
- package/templates/skills/cbp-build-cc-memory/examples/reference-memory.md +0 -13
- package/templates/skills/cbp-build-cc-memory/examples/user-memory.md +0 -14
- package/templates/skills/cbp-build-cc-memory/reference/memory-types.md +0 -59
- package/templates/skills/cbp-build-cc-memory/reference/when-to-save.md +0 -62
- package/templates/skills/cbp-build-cc-memory/templates/MEMORY-index.md +0 -4
- package/templates/skills/cbp-build-cc-memory/templates/memory-entry.md +0 -15
|
@@ -0,0 +1,164 @@
|
|
|
1
|
+
---
|
|
2
|
+
scope: org-shared
|
|
3
|
+
name: cbp-round-complete
|
|
4
|
+
description: Reconcile user git-add approvals, complete the round, and route to the next step
|
|
5
|
+
argument-hint: [chk-task-round | task-round]
|
|
6
|
+
triggers: [cbp-task-check, cbp-standalone-task-check, cbp-round-input]
|
|
7
|
+
effort: low
|
|
8
|
+
---
|
|
9
|
+
|
|
10
|
+
## Kind Detection
|
|
11
|
+
|
|
12
|
+
Inspect the resolved identifier from argument parsing to determine the task kind:
|
|
13
|
+
|
|
14
|
+
| Identifier shape | KIND |
|
|
15
|
+
|-----------------|------|
|
|
16
|
+
| `{task}-{round}` (2-segment, e.g. `45-2`) | `standalone` |
|
|
17
|
+
| `{chk}-{task}-{round}` (3-segment, e.g. `141-3-1`) | `checkpoint` |
|
|
18
|
+
| _(empty / free-text)_ | Check `get_current_standalone_task` first; if found → `standalone`. Else → `checkpoint` via `get_current_task`. |
|
|
19
|
+
|
|
20
|
+
Set `KIND` for the rest of this skill. MCP tool names vary by KIND:
|
|
21
|
+
|
|
22
|
+
| Operation | `checkpoint` KIND | `standalone` KIND |
|
|
23
|
+
|-----------|------------------|-------------------|
|
|
24
|
+
| Get task | `get_current_task(repo_id)` | `get_current_standalone_task(repo_id)` |
|
|
25
|
+
| Get rounds | `get_rounds(task_id)` | `get_standalone_rounds(standalone_task_id)` |
|
|
26
|
+
| Update round | `update_round(round_id, ...)` | `update_standalone_round(standalone_round_id, ...)` |
|
|
27
|
+
| Complete round | `complete_round(round_id, duration_minutes?)` | `complete_standalone_round(standalone_round_id, duration_minutes?, caller_worktree_id)` ⚠️ `caller_worktree_id` is REQUIRED for standalone |
|
|
28
|
+
|
|
29
|
+
# Round Complete Command
|
|
30
|
+
|
|
31
|
+
The **permission-gated finalizer** for a round that `/cbp-round-update` triaged as clean. It reconciles which files the **user** approved via `git add`, completes the round, and routes to the next step.
|
|
32
|
+
|
|
33
|
+
This skill is gated by an `ask`-tier `Skill(cbp-round-complete)` permission rule in `settings.json`. **The permission prompt IS the user confirmation** — there is NO AskUserQuestion inside this skill. If the user declines the permission, the skill does not run: nothing is synced, no round is completed, and the user can stage files and re-invoke (directly or by re-running `/cbp-round-update`) when ready.
|
|
34
|
+
|
|
35
|
+
## HARD GATE — Every Step Must Execute
|
|
36
|
+
|
|
37
|
+
Step 2 (sync-approvals CLI) MUST exit 0. If it fails, do NOT proceed to Step 3. Before completing the round, verify:
|
|
38
|
+
|
|
39
|
+
- [ ] `codebyplan round sync-approvals` exited 0
|
|
40
|
+
|
|
41
|
+
If this is false: DO NOT proceed to Step 3.
|
|
42
|
+
|
|
43
|
+
## Instructions
|
|
44
|
+
|
|
45
|
+
### Step 1: Parse `$ARGUMENTS`
|
|
46
|
+
|
|
47
|
+
Parse the argument using the canonical chk-task-round notation (see `cbp-round-start` Step 0 "CHK / TASK / ROUND Identifier Notation Vocabulary"):
|
|
48
|
+
|
|
49
|
+
| Shape | Regex | Resolves to |
|
|
50
|
+
|-------|-------|-------------|
|
|
51
|
+
| `{chk}-{task}-{round}` (e.g. `108-1-2`) | `^[0-9]+-[0-9]+-[0-9]+$` | Checkpoint-bound: CHK-{chk} TASK-{task} ROUND-{round} |
|
|
52
|
+
| `{task}-{round}` (e.g. `45-2`) | `^[0-9]+-[0-9]+$` | Standalone: standalone TASK-{task} ROUND-{round} |
|
|
53
|
+
| _(empty)_ | — | Use Kind Detection to find active task and latest round |
|
|
54
|
+
|
|
55
|
+
Anything else is malformed — surface this error and stop:
|
|
56
|
+
|
|
57
|
+
```
|
|
58
|
+
round-complete: invalid argument `{value}`. Expected:
|
|
59
|
+
108-1-2 → CHK-108 TASK-1 ROUND-2 (checkpoint-bound)
|
|
60
|
+
45-2 → standalone TASK-45 ROUND-2
|
|
61
|
+
(empty) → active task and latest round
|
|
62
|
+
```
|
|
63
|
+
|
|
64
|
+
Note that `108-1` is **valid** here — it resolves to standalone TASK-108 ROUND-1 per the 2-segment task-round form. To target a checkpoint-bound round, use the 3-segment form `108-1-2`.
|
|
65
|
+
|
|
66
|
+
### Step 1.5: Get Current Task and Round
|
|
67
|
+
|
|
68
|
+
Given the parse from Step 1:
|
|
69
|
+
|
|
70
|
+
| Parse | Resolution path |
|
|
71
|
+
|-------|-----------------|
|
|
72
|
+
| `{chk}-{task}-{round}` | MCP `get_checkpoints(repo_id)` → filter `number === {chk}`. MCP `get_tasks(checkpoint_id)` → filter `number === {task}`. MCP `get_rounds(task_id)` → filter `number === {round}`. |
|
|
73
|
+
| `{task}-{round}` | MCP `get_standalone_rounds` via `get_current_standalone_task` or direct task lookup → filter `number === {round}`. |
|
|
74
|
+
| _(empty)_ | Use Kind Detection: checkpoint KIND → MCP `get_current_task(repo_id)` + `get_rounds(task_id)`; standalone KIND → MCP `get_current_standalone_task(repo_id)` + `get_standalone_rounds(standalone_task_id)`. |
|
|
75
|
+
|
|
76
|
+
If no task found: `No active task. Nothing to complete.`
|
|
77
|
+
|
|
78
|
+
### Step 2: Sync git diff + approvals via CLI
|
|
79
|
+
|
|
80
|
+
Reconcile which files the user has approved by staging them. Run:
|
|
81
|
+
|
|
82
|
+
```
|
|
83
|
+
npx codebyplan round sync-approvals --round-id <round_id> --task-id <task_id>
|
|
84
|
+
```
|
|
85
|
+
|
|
86
|
+
The CLI auto-resolves the caller worktree id with the following precedence:
|
|
87
|
+
1. `--caller-worktree-id <uuid>` override (if passed — skips all resolution)
|
|
88
|
+
2. Per-device branch-keyed cache (`.codebyplan/worktree.local.json`)
|
|
89
|
+
3. In-process tuple API call: `POST /worktrees/resolve` using `(device_id, repo_path, branch)`
|
|
90
|
+
|
|
91
|
+
On the write path (non `--dry-run`), if the worktree id cannot be resolved the CLI **hard-fails with exit 1** and prints an actionable message. To pre-populate the cache:
|
|
92
|
+
|
|
93
|
+
```
|
|
94
|
+
npx codebyplan resolve-worktree --cache
|
|
95
|
+
```
|
|
96
|
+
|
|
97
|
+
If this worktree is not yet registered, run `npx codebyplan setup` first, then re-run `/cbp-round-complete`.
|
|
98
|
+
|
|
99
|
+
The CLI parses `git status --short`, merges drift + staging + web-UI flag, and writes both round and task (forwarding `caller_worktree_id` on both writes so the server honors the feat-worktree lock). A **cleanly staged** file (`git add`-ed, no further unstaged changes) becomes `user_approved: true`.
|
|
100
|
+
|
|
101
|
+
Read the stdout JSON: `{ added, stale_marked, reactivated, total_files }`.
|
|
102
|
+
|
|
103
|
+
If the command exits non-zero, surface the stderr and STOP. Do NOT proceed to Step 3.
|
|
104
|
+
|
|
105
|
+
This is the **single** explicit reconcile owned by this skill. (The `cbp-mcp-round-sync.sh` PostToolUse hook fires again right after Step 3's `complete_round` — see the note below — but that is the existing post-complete safety net, not a duplicate run to schedule here.)
|
|
106
|
+
|
|
107
|
+
### Step 3: Complete the Round
|
|
108
|
+
|
|
109
|
+
Calculate duration from the round's `started_at` to now in minutes.
|
|
110
|
+
|
|
111
|
+
- **checkpoint KIND**: MCP `complete_round(round_id, duration_minutes)`.
|
|
112
|
+
- **standalone KIND**: MCP `complete_standalone_round(standalone_round_id, duration_minutes, caller_worktree_id)`. ⚠️ `caller_worktree_id` is REQUIRED — resolve via `CALLER_WT=$(npx codebyplan resolve-worktree 2>/dev/null)`. If `CALLER_WT` is empty, surface this warning and ask the user to confirm before proceeding:
|
|
113
|
+
|
|
114
|
+
```
|
|
115
|
+
Warning: could not resolve caller_worktree_id (npx codebyplan resolve-worktree returned empty).
|
|
116
|
+
The complete_standalone_round call may be rejected by the pre-guard. Proceed anyway? (yes / no)
|
|
117
|
+
```
|
|
118
|
+
|
|
119
|
+
If the user confirms yes, proceed with `caller_worktree_id: ""`. If no, stop.
|
|
120
|
+
|
|
121
|
+
`complete_round` / `complete_standalone_round` sets the round `completed`, locks all `file_changes` for the round (`approval_locked: true`), and returns `unapproved_files[]` + `unapproved_count`. Hold those for routing.
|
|
122
|
+
|
|
123
|
+
> **PostToolUse hook note**: completing the round fires the `cbp-mcp-round-sync.sh` PostToolUse hook (matcher `mcp__codebyplan__complete_round`), which runs `sync-approvals` once more as a post-complete safety net for any approval drift between Step 2 and the lock. This is **expected** and is not double-processing — Step 2 is the pre-complete reconcile that makes `unapproved_count` accurate for routing; the hook is the existing catch-up. Note the hook matches `complete_round` only — `complete_standalone_round` is **not** covered by it (a pre-existing gap), so standalone rounds rely solely on this skill's Step 2 reconcile.
|
|
124
|
+
|
|
125
|
+
### Step 4: Route
|
|
126
|
+
|
|
127
|
+
**4a — Count files** — Display: `"Round N complete — Files: X total, Y approved, Z pending"`.
|
|
128
|
+
|
|
129
|
+
**4b — Route on `unapproved_count`** (from Step 3's `complete_round` response):
|
|
130
|
+
|
|
131
|
+
- **`unapproved_count === 0`** (every file user-approved): the user has signed off on the whole round.
|
|
132
|
+
- checkpoint KIND → auto-trigger `/cbp-task-check`.
|
|
133
|
+
- standalone KIND → auto-trigger `/cbp-standalone-task-check`.
|
|
134
|
+
- **`unapproved_count > 0`** (user withheld approval on some files): the unstaged files are the signal that more work is wanted on them. Auto-trigger `/cbp-round-input` — its Step 2 deep analysis reads exactly those `user_approved === false` files and formulates the next round's requirements. This route is **independent of how many files are staged**; round-input is reachable even when zero files were staged.
|
|
135
|
+
|
|
136
|
+
- **Degenerate auto-loop guard**: if the just-completed round had `round.context.auto_loop_mode === true` AND it was a clean exit (no `improve_round_findings[]`, no hard-fail — which is why `/cbp-round-update` triaged it to round-complete in the first place), do NOT auto-trigger `/cbp-round-input`. Its auto-loop path transcribes the prior round's findings verbatim, and a clean round has none — auto-triggering would spin on an empty input. Instead surface the clean-exit note below and STOP; the user stages the pending files and re-invokes (or runs `/cbp-round-input` manually). Persist `round.context.round_complete.degenerate_auto_loop_exit = true`.
|
|
137
|
+
|
|
138
|
+
```
|
|
139
|
+
## Round N Complete — Auto-loop finished clean
|
|
140
|
+
|
|
141
|
+
**Files**: X total, Y approved, Z pending
|
|
142
|
+
|
|
143
|
+
Pending files passed all checks; they are just not staged. Stage them
|
|
144
|
+
(`git add <path>`) to finish the task, or run /cbp-round-input to start
|
|
145
|
+
another round.
|
|
146
|
+
```
|
|
147
|
+
|
|
148
|
+
Persist a breadcrumb on the round via `update_round` / `update_standalone_round` per KIND: `round.context.round_complete = { staged_count, unstaged_count, route, decided_at }`.
|
|
149
|
+
|
|
150
|
+
## Key Rules
|
|
151
|
+
|
|
152
|
+
- **Permission prompt = confirmation** — gated by `ask`-tier `Skill(cbp-round-complete)`. NEVER add an AskUserQuestion to confirm running; the harness prompt is the gate. A declined permission is a clean no-op.
|
|
153
|
+
- **Step 2 (CLI) must exit 0** — if it fails, STOP before `complete_round`. The merge semantics are enforced by the CLI.
|
|
154
|
+
- **NEVER ask the user to git add files** — Step 2 only reads staging status. **NEVER stage files** — Claude does not touch the git staging area; the user's `git add` is the approval signal.
|
|
155
|
+
- **standalone KIND Step 3**: `caller_worktree_id` is REQUIRED for `complete_standalone_round` — always resolve and pass it.
|
|
156
|
+
- **Auto-triggered by `/cbp-round-update`** (clean triage), or run manually by the user.
|
|
157
|
+
|
|
158
|
+
## Integration
|
|
159
|
+
|
|
160
|
+
- **Gates**: `ask`-tier `Skill(cbp-round-complete)` permission prompt — the harness confirms before the skill runs; a decline makes NO writes. There is no in-skill AskUserQuestion.
|
|
161
|
+
- **Triggered by**: `/cbp-round-update` (auto, clean triage), or user manually
|
|
162
|
+
- **Reads**: MCP `get_current_task` / `get_current_standalone_task`, `get_rounds` / `get_standalone_rounds` (per KIND); delegates git+approval sync to `npx codebyplan round sync-approvals`
|
|
163
|
+
- **Writes**: MCP `complete_round` / `complete_standalone_round` (per KIND); `update_round` / `update_standalone_round` (`round_complete` breadcrumb); round+task `files_changed` written by the CLI
|
|
164
|
+
- **Triggers**: `/cbp-task-check` (checkpoint KIND, all files approved), `/cbp-standalone-task-check` (standalone KIND, all files approved), `/cbp-round-input` (some files unapproved — fires independent of staging count)
|
|
@@ -16,7 +16,7 @@ See `reference/inline-fallback.md` for full trigger table, procedure, and covera
|
|
|
16
16
|
## Pipeline
|
|
17
17
|
|
|
18
18
|
```
|
|
19
|
-
/cbp-round-execute → /cbp-round-end → [code review +
|
|
19
|
+
/cbp-round-execute → /cbp-round-end → [code review + auto-apply in-scope] → /cbp-round-update
|
|
20
20
|
```
|
|
21
21
|
|
|
22
22
|
## Identifier Notation
|
|
@@ -126,9 +126,13 @@ Wait for agent to complete. If the spawn fails for any reason, apply the inline-
|
|
|
126
126
|
|
|
127
127
|
**If `status: 'no_findings'`:** show `### Code Review\nNo issues found. Code looks good.` and skip to Step 8.
|
|
128
128
|
|
|
129
|
-
**If findings exist**, present them grouped by severity (table + per-finding details)
|
|
129
|
+
**If findings exist**, present them grouped by severity (table + per-finding details).
|
|
130
130
|
|
|
131
|
-
|
|
131
|
+
**Under `auto_loop_mode === true`**: do NOT auto-apply here — Step 8's auto-loop path accepts all findings into `improve_round_findings[]` and defers the fixes to the next loop round. Skip straight to Step 8.
|
|
132
|
+
|
|
133
|
+
**Manual mode**: **auto-apply all in-scope findings inline**. A finding is *in-scope* when every file it references is within the round's `files_changed[]`. The round-end orchestrator (main context — it has Edit/Write) applies these fixes directly; the `cbp-improve-round` agent stays read-only/advisory and never writes. Record each applied fix in `round.context.inline_fix_log` (findings indices, rationale, `fixes[]`, applied_at). After applying, re-run the verification scoped to the modified files (hook syntax check for `.sh`; `cbp-testing-qa-agent` for code) per `reference/findings-presentation.md`; if it fails, do NOT record the fix — treat the finding as out-of-scope instead. Findings that reference files OUTSIDE `files_changed[]` are **out-of-scope** — do NOT apply them; save them to `improve_round_findings[]` so Step 8 routes them to `/cbp-round-input` or a new task. There is no findings-decision AskUserQuestion — the round was already approved at the `/cbp-round-execute` permission prompt. The baseline-regression gate above is the ONLY user decision in this step.
|
|
134
|
+
|
|
135
|
+
Example tables and the in-scope/out-of-scope classification: see `reference/findings-presentation.md`.
|
|
132
136
|
|
|
133
137
|
### Step 8: Route Based on Decisions
|
|
134
138
|
|
|
@@ -136,33 +140,31 @@ Example tables and the `inline` option gating spec: see `reference/findings-pres
|
|
|
136
140
|
|
|
137
141
|
- Auto-accept ALL findings into `improve_round_findings[]` regardless of severity (the user opted into the loop).
|
|
138
142
|
- Skip the polish-spiral stop-gate (auto-loop has its own cap-exhausted termination).
|
|
139
|
-
- Skip
|
|
143
|
+
- Skip Step 7's inline auto-apply (findings are deferred to the next loop round, not applied this round).
|
|
140
144
|
- Save findings via `update_round` exactly as in manual mode.
|
|
141
|
-
- Auto-trigger `/cbp-round-update` immediately. round-update
|
|
145
|
+
- Auto-trigger `/cbp-round-update` immediately. round-update triages the round and either routes to `/cbp-round-input` (spawn another round) or `/cbp-round-complete` (clean exit) — see cbp-round-update SKILL.md Step 2/3.
|
|
142
146
|
|
|
143
147
|
**Else (manual mode — flag absent or false):**
|
|
144
148
|
|
|
145
|
-
|
|
149
|
+
Step 7 already auto-applied in-scope findings and logged them to `round.context.inline_fix_log`. Now record any out-of-scope findings and route:
|
|
146
150
|
|
|
147
|
-
1.
|
|
148
|
-
2.
|
|
149
|
-
3. Save accepted/rejected findings to round context via MCP `update_round`:
|
|
151
|
+
1. **Polish-spiral stop-gate** (round 2+ only): if this is round 2 or later AND the prior round also ended with code-review fixes, surface a one-line stop-gate via AskUserQuestion — *defer remaining polish to a follow-up task* vs *continue with another round*. This is a genuine user decision about scope (it guards against endless low-value polish loops), not a flow-control prompt. Skip on round 1.
|
|
152
|
+
2. Save out-of-scope findings (those NOT auto-applied in Step 7) to round context via MCP `update_round`:
|
|
150
153
|
```json
|
|
151
154
|
{
|
|
152
155
|
"context": {
|
|
153
|
-
"improve_round_findings": [
|
|
154
|
-
"improve_round_rejected": [rejected findings with user reasons]
|
|
156
|
+
"improve_round_findings": [out-of-scope findings]
|
|
155
157
|
}
|
|
156
158
|
}
|
|
157
159
|
```
|
|
158
|
-
|
|
160
|
+
3. Auto-trigger `/cbp-round-update`. round-update triages the round: if out-of-scope findings (or a hard-fail) remain it routes to `/cbp-round-input` (which picks up the findings from round context and includes them in the new round's requirements automatically); if the round is clean it routes to `/cbp-round-complete` (the permission-gated finalizer that reconciles the user's `git add`s and completes the round).
|
|
159
161
|
|
|
160
162
|
## Key Rules
|
|
161
163
|
|
|
162
|
-
- Claude NEVER git adds files — user
|
|
164
|
+
- Claude NEVER git adds files — user approval is via git staging at `/cbp-round-complete`
|
|
163
165
|
- Auto-triggers `/cbp-round-update` after findings are handled
|
|
164
166
|
- `/cbp-round-end` is auto-triggered by `/cbp-round-execute` (user does not call it directly)
|
|
165
|
-
-
|
|
167
|
+
- In-scope findings are **auto-applied inline** by the round-end orchestrator (the round was already approved at the `/cbp-round-execute` permission); out-of-scope findings route to `/cbp-round-input`. `cbp-improve-round` stays read-only/advisory. Baseline-regression accept (Step 7 gate) stays a user decision — baselines are NEVER auto-accepted.
|
|
166
168
|
|
|
167
169
|
## Integration
|
|
168
170
|
|
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
# Findings Presentation in `/cbp-round-end` Step 7
|
|
2
2
|
|
|
3
|
-
When `improve-round` returns findings, Step 7 presents them grouped by severity
|
|
3
|
+
When `improve-round` returns findings, Step 7 presents them grouped by severity, then **auto-applies in-scope findings inline** (manual mode) or defers them to the next loop round (auto-loop mode). There is no findings-decision prompt.
|
|
4
4
|
|
|
5
5
|
## Example output
|
|
6
6
|
|
|
@@ -22,26 +22,16 @@ When `improve-round` returns findings, Step 7 presents them grouped by severity
|
|
|
22
22
|
[description + suggested fix from agent]
|
|
23
23
|
```
|
|
24
24
|
|
|
25
|
-
##
|
|
25
|
+
## Auto-apply model (manual mode)
|
|
26
26
|
|
|
27
|
-
|
|
28
|
-
Which findings should be fixed?
|
|
29
|
-
- "all" — fix all findings in a new round
|
|
30
|
-
- "1,2" — fix specific findings by number
|
|
31
|
-
- "none" — skip all, proceed to round-update
|
|
32
|
-
- "inline" — fix in THIS round before proceeding (only offered when all findings qualify under the Trivial-Resolution Exception below)
|
|
33
|
-
- Or explain why specific findings are not issues
|
|
34
|
-
```
|
|
35
|
-
|
|
36
|
-
## "inline" option gating
|
|
27
|
+
Step 7 auto-applies all **in-scope** findings inline — no user prompt. A finding is *in-scope* when every file it references is within the round's `files_changed[]`; it is *out-of-scope* otherwise.
|
|
37
28
|
|
|
38
|
-
|
|
29
|
+
- **In-scope** → the round-end orchestrator (main context, has Edit/Write) applies the fix directly via `Edit` / `Write`, re-runs the verification commands (hook syntax check + `cbp-testing-qa-agent` scoped to modified files), and records it in `round.context.inline_fix_log = { findings: [ids], rationale, fixes: [...], applied_at: <ISO> }`. The `cbp-improve-round` agent stays read-only/advisory and never writes.
|
|
30
|
+
- **Out-of-scope** → saved to `round.context.improve_round_findings[]`; Step 8 routes them to `/cbp-round-input` (next round) or a new task per the Infra Issue Absorption Contract below.
|
|
39
31
|
|
|
40
|
-
|
|
41
|
-
2. Each fix is under ~5 minutes of executor time
|
|
42
|
-
3. Verification is automatic — the existing test/lint/audit pipeline confirms the change
|
|
32
|
+
The only user decision in Step 7 is the **baseline-regression accept** gate (baselines are NEVER auto-accepted). Under `auto_loop_mode`, Step 7 does not auto-apply — all findings are accepted into `improve_round_findings[]` and deferred to the next loop round.
|
|
43
33
|
|
|
44
|
-
|
|
34
|
+
The **Trivial-Resolution Exception** below still governs the deeper bypass cases (skipping executor / testing-qa / improve-round for ≤5-line non-logic corrective rounds); it is referenced by `/cbp-round-execute` and `/cbp-task-testing` for infra-issue absorption.
|
|
45
35
|
|
|
46
36
|
---
|
|
47
37
|
|
|
@@ -15,6 +15,10 @@ Execution and validation phase. Receives the approved plan from `/cbp-round-star
|
|
|
15
15
|
/cbp-round-start → /cbp-round-execute → /cbp-round-end (auto)
|
|
16
16
|
```
|
|
17
17
|
|
|
18
|
+
## Approval Model
|
|
19
|
+
|
|
20
|
+
The `ask`-tier `Skill(cbp-round-execute)` permission prompt (configured in `settings.json`) is the **plan-approval gate** handed off from `/cbp-round-start`: confirming the permission approves the plan; declining it returns control to `/cbp-round-start` (re-plan with feedback) or `/cbp-round-input` (wrong direction). Once execution begins, the executors (`cbp-round-executor`, `cbp-mechanical-edits`) and the 3-INLINE / 3-SURVEY paths apply edits **automatically** — there is NO in-skill AskUserQuestion for approval. The only downstream user decisions are genuine ones: the dev-server start prompt (Step 4) and the baseline-regression accept gate (`/cbp-round-end` Step 7).
|
|
21
|
+
|
|
18
22
|
## Identifier Notation
|
|
19
23
|
|
|
20
24
|
This skill operates on the **active** task/round resolved via MCP `get_current_task` / `get_rounds` and does not accept a positional identifier argument. Canonical chk-task-round notation is defined in `cbp-round-start` Step 0 "CHK / TASK / ROUND Identifier Notation Vocabulary".
|
|
@@ -24,7 +24,6 @@ Set `KIND` for the rest of this skill. MCP tool names vary by KIND:
|
|
|
24
24
|
| Get rounds | `get_rounds(task_id)` | `get_standalone_rounds(standalone_task_id)` |
|
|
25
25
|
| Add round | `add_round(task_id, ...)` | `add_standalone_round(standalone_task_id, ...)` |
|
|
26
26
|
| Update round | `update_round(round_id, ...)` | `update_standalone_round(standalone_round_id, ...)` |
|
|
27
|
-
| Complete round | `complete_round(round_id, duration_minutes?)` | `complete_standalone_round(standalone_round_id, duration_minutes?, caller_worktree_id)` ⚠️ `caller_worktree_id` is REQUIRED for standalone |
|
|
28
27
|
| Update task | `update_task(task_id, ...)` | `update_standalone_task(standalone_task_id, ...)` |
|
|
29
28
|
|
|
30
29
|
# Round Input Command
|
|
@@ -33,7 +32,7 @@ Gathers input for a new round. Performs deep analysis of unapproved files, requi
|
|
|
33
32
|
|
|
34
33
|
## When Used
|
|
35
34
|
|
|
36
|
-
- After `/cbp-round-update` routes here (unapproved
|
|
35
|
+
- After `/cbp-round-update` triages a round as not-clean and routes here, or `/cbp-round-complete` routes here (files left unapproved after completing the round)
|
|
37
36
|
- After `/cbp-round-execute` Step 6 routes here (structural failure or retry-exhausted hard-fail)
|
|
38
37
|
- After `/clear` + `/cbp-todo` reloads context and triggers this
|
|
39
38
|
- When user wants to start a new round with specific changes
|
|
@@ -78,8 +77,9 @@ If the argument matches the numeric regex, resolve the target task/round from DB
|
|
|
78
77
|
**2f:** Extract testing-qa failures from latest round context (`context.testing_qa_output`)
|
|
79
78
|
|
|
80
79
|
**2g:** Extract code review findings from latest round context (`context.improve_round_findings`).
|
|
81
|
-
These are
|
|
82
|
-
that
|
|
80
|
+
These are out-of-scope findings from the `improve-round` agent — bugs, logic errors, edge cases
|
|
81
|
+
that round-end could not auto-apply inline (they reference files outside the prior round's
|
|
82
|
+
`files_changed[]`). Include them as high-priority requirements.
|
|
83
83
|
|
|
84
84
|
**2h:** Identify root causes — not "file X is wrong" but "requirement Y was not met because Z"
|
|
85
85
|
|
|
@@ -175,12 +175,12 @@ If this command is triggered **directly** (not via `/cbp-todo`) and no context i
|
|
|
175
175
|
- **Deep analysis is MANDATORY** — always runs, even if arguments provided (for context)
|
|
176
176
|
- **Analysis reads from DB (MCP)**, not conversation history
|
|
177
177
|
- **Follow-up rounds get same depth as round 1** — no quick-fix behavior
|
|
178
|
-
- **Never ask to git add** — file approval is
|
|
178
|
+
- **Never ask to git add** — user file approval (git staging) is reconciled by `/cbp-round-complete`
|
|
179
179
|
- **Update all context locations** — task, checkpoint, and round should all have consistent information
|
|
180
180
|
|
|
181
181
|
## Integration
|
|
182
182
|
|
|
183
|
-
- **Triggered by**: `/cbp-round-update` (auto, unapproved
|
|
183
|
+
- **Triggered by**: `/cbp-round-update` (auto, not-clean triage), `/cbp-round-complete` (auto, files left unapproved after completing the round), `/cbp-round-execute` (auto, on hard-fail after retry exhausted), `/cbp-todo` (after /clear), user manually
|
|
184
184
|
- **Reads**: MCP `get_current_task` / `get_current_standalone_task`, `get_rounds` / `get_standalone_rounds` (per KIND), file contents (Read tool)
|
|
185
185
|
- **Writes**: MCP `update_task` / `update_standalone_task` (context), `update_checkpoint` (context, if checkpoint KIND and needed)
|
|
186
186
|
- **Triggers**: `/cbp-round-start` (auto)
|
|
@@ -30,7 +30,7 @@ Set `KIND` for the rest of this skill. MCP tool names vary by KIND:
|
|
|
30
30
|
|
|
31
31
|
# Round Start Command
|
|
32
32
|
|
|
33
|
-
Planning phase for a new round. Analyzes context, creates plan,
|
|
33
|
+
Planning phase for a new round. Analyzes context, creates a plan, then auto-triggers `/cbp-round-execute` — the `ask`-tier permission prompt on that skill IS the user's plan approval. NO execution or testing — those are separate commands.
|
|
34
34
|
|
|
35
35
|
## Inline-Fallback for Planner Spawn Failure
|
|
36
36
|
|
|
@@ -42,17 +42,17 @@ Procedure summary (pointer back to canonical):
|
|
|
42
42
|
2. Walk the planner's documented Phase 0-8 checklist inline using `Read` / `Grep` / `Bash` / MCP `get_*` — `agents/cbp-task-planner.md` is the inline script. Phase 1.5 (Requirement Premise Verification) and Phase 4.7 (Migration Shape-Distribution Pre-Flight) are MANDATORY in fallback mode — these are the gates the agent uniquely enforces; skipping them produces unverified plans.
|
|
43
43
|
3. Populate the planner's output contract (`approved_plan` shape: `files_to_modify[]`, `deliverables`, `specialist_needs`, `round_type`, `shape_distribution` if applicable, `context_summary`) with `mode: 'inline_fallback'`.
|
|
44
44
|
4. Apply the pre-emptive-skip rule: when the same failure class fired in the previous spawn of this session, skip the spawn attempt entirely and go straight to inline.
|
|
45
|
-
5. Continue the skill — do NOT abort.
|
|
45
|
+
5. Continue the skill — do NOT abort. Step 9 auto-triggers `/cbp-round-execute`; the `ask`-tier permission prompt on that skill is the user's plan approval (see Step 8).
|
|
46
46
|
|
|
47
47
|
Inline-fallback is NOT a quality downgrade trapdoor — Phase 1.5 row-by-row verification is mandatory. A fallback plan that skipped premise verification is a regression caught by the next session's cbp-improve-round.
|
|
48
48
|
|
|
49
49
|
## Pipeline
|
|
50
50
|
|
|
51
51
|
```
|
|
52
|
-
/cbp-round-start (planning) →
|
|
52
|
+
/cbp-round-start (planning) → /cbp-round-execute (ask-tier permission = plan approval)
|
|
53
53
|
```
|
|
54
54
|
|
|
55
|
-
**Auto-loop mode**: when `round.context.auto_loop_mode === true` flows in from `/cbp-round-input`, Step 6 (Q&A) and Step 8
|
|
55
|
+
**Auto-loop mode**: when `round.context.auto_loop_mode === true` flows in from `/cbp-round-input`, Step 6 (Q&A) is skipped and Step 8's `/cbp-round-execute` permission is auto-approved. See cbp-round-update SKILL.md Step 3b (auto-loop decision) and cbp-round-end SKILL.md Step 8 for the full contract.
|
|
56
56
|
|
|
57
57
|
## Instructions
|
|
58
58
|
|
|
@@ -176,7 +176,7 @@ input:
|
|
|
176
176
|
|
|
177
177
|
Wait for planner output.
|
|
178
178
|
|
|
179
|
-
### Step 8:
|
|
179
|
+
### Step 8: Present Plan
|
|
180
180
|
|
|
181
181
|
Present the plan to user:
|
|
182
182
|
|
|
@@ -208,24 +208,21 @@ Present the plan to user:
|
|
|
208
208
|
|
|
209
209
|
Single-wave plans present the existing flat plan view (no wave table) — backward compatible.
|
|
210
210
|
|
|
211
|
-
**
|
|
211
|
+
**Plan approval is the `ask`-tier `Skill(cbp-round-execute)` permission prompt** — there is NO approve/needs-changes/wrong AskUserQuestion here. After presenting the plan, proceed to Step 9, which auto-triggers `/cbp-round-execute`; the harness then shows the `ask`-tier permission prompt, and confirming it IS the user's go-ahead on the plan.
|
|
212
212
|
|
|
213
|
-
**
|
|
213
|
+
**Denied-execute handling** — if the user declines the `/cbp-round-execute` permission, the plan does not run. Treat the decline as "the plan must change":
|
|
214
214
|
|
|
215
|
-
|
|
216
|
-
|
|
217
|
-
3. **No — totally wrong** — discard plan, return to `/cbp-round-input` for new requirements
|
|
215
|
+
- **Minor changes**: collect the user's feedback, re-spawn `cbp-task-planner` with it as a constraint (re-run Step 7), present the revised plan, and re-trigger `/cbp-round-execute`.
|
|
216
|
+
- **Wrong direction**: save the rejection reason to round context and auto-trigger `/cbp-round-input` for new requirements.
|
|
218
217
|
|
|
219
|
-
**If "
|
|
220
|
-
**If "Needs changes"**: collect user feedback, re-spawn `cbp-task-planner` with feedback as constraint, present revised plan, ask again.
|
|
221
|
-
**If "Totally wrong"**: save rejection reason to round context, auto-trigger `/cbp-round-input`.
|
|
218
|
+
**If `auto_loop_mode === true`**: the loop auto-approves — log `round.context.plan_approval = { mode: "auto_loop", auto_approved_at: <ISO> }`, surface a one-line note `"Auto-approved under auto_loop_mode (round N of cap C)"`, and proceed to Step 9 (the `/cbp-round-execute` permission is auto-approved under the loop).
|
|
222
219
|
|
|
223
220
|
### Step 9: Auto-trigger Round Execute
|
|
224
221
|
|
|
225
|
-
|
|
222
|
+
Save planner output to round context via MCP `update_round` / `update_standalone_round` per KIND, then trigger `/cbp-round-execute`. The `ask`-tier permission prompt on `/cbp-round-execute` is the user's plan approval (see Step 8).
|
|
226
223
|
|
|
227
224
|
```
|
|
228
|
-
|
|
225
|
+
Starting execution phase...
|
|
229
226
|
```
|
|
230
227
|
|
|
231
228
|
## Key Rules
|