codebyplan 1.13.27 → 1.13.29
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/dist/cli.js +1 -1
- package/package.json +1 -1
- package/templates/agents/cbp-improve-round.md +1 -1
- package/templates/agents/cbp-task-check.md +12 -8
- package/templates/hooks/README.md +21 -2
- package/templates/hooks/cbp-context-window-notify.sh +43 -0
- package/templates/hooks/cbp-mcp-round-sync.sh +9 -0
- package/templates/hooks/cbp-test-hooks.sh +119 -0
- package/templates/hooks/hooks.json +10 -0
- package/templates/settings.project.base.json +3 -2
- package/templates/skills/cbp-build-cc-mode/SKILL.md +4 -3
- package/templates/skills/cbp-build-cc-settings/reference/cbp-permission-policy.md +3 -2
- package/templates/skills/cbp-build-cc-skill/reference/cbp-quality.md +1 -1
- package/templates/skills/cbp-merge-main/SKILL.md +1 -1
- package/templates/skills/cbp-round-complete/SKILL.md +164 -0
- package/templates/skills/cbp-round-end/SKILL.md +16 -14
- package/templates/skills/cbp-round-end/reference/findings-presentation.md +7 -17
- package/templates/skills/cbp-round-execute/SKILL.md +4 -0
- package/templates/skills/cbp-round-input/SKILL.md +6 -6
- package/templates/skills/cbp-round-start/SKILL.md +12 -15
- package/templates/skills/cbp-round-update/SKILL.md +31 -143
- package/templates/skills/cbp-standalone-task-check/SKILL.md +2 -2
- package/templates/skills/cbp-standalone-task-complete/SKILL.md +4 -3
- package/templates/skills/cbp-standalone-task-testing/SKILL.md +4 -4
- package/templates/skills/cbp-task-check/SKILL.md +3 -3
- package/templates/skills/cbp-task-complete/SKILL.md +7 -6
- package/templates/skills/cbp-task-testing/SKILL.md +3 -5
- package/templates/skills/cbp-todo/SKILL.md +1 -1
|
@@ -10,6 +10,8 @@ effort: xhigh
|
|
|
10
10
|
|
|
11
11
|
Complete the current task. Auto-triggered by `/cbp-task-testing` when all tests pass. Can also be run manually.
|
|
12
12
|
|
|
13
|
+
This skill is gated by an `ask`-tier `Skill(cbp-task-complete)` permission rule in `settings.json`. **The permission prompt IS the user confirmation** — there is NO AskUserQuestion inside this skill. A declined permission is a clean no-op (nothing committed, merged, pushed, or completed).
|
|
14
|
+
|
|
13
15
|
## Instructions
|
|
14
16
|
|
|
15
17
|
### Step 1: Parse `$ARGUMENTS`
|
|
@@ -90,12 +92,10 @@ Stop here.
|
|
|
90
92
|
|
|
91
93
|
Load `task.qa` and `task.files_changed`:
|
|
92
94
|
|
|
93
|
-
1. **QA**: count items by status (pass / fail / pending / skipped) across all types.
|
|
94
|
-
2. **Files**: list any file with `user_approved === false
|
|
95
|
-
|
|
96
|
-
**If issues exist**, AskUserQuestion: `Complete anyway` / `Run QA first` (suggest `/cbp-task-check`) / `Cancel`. On `Run QA first` or `Cancel`, stop. On `Complete anyway`, continue.
|
|
95
|
+
1. **QA**: count items by status (pass / fail / pending / skipped) across all types.
|
|
96
|
+
2. **Files**: list any file with `user_approved === false`.
|
|
97
97
|
|
|
98
|
-
|
|
98
|
+
If any QA item is `fail`/`pending` or any file is unapproved, **surface the warnings in the output and continue** — record them for the Step 9 summary. There is NO confirmation AskUserQuestion here: `Skill(cbp-task-complete)` is `ask`-tier, so the harness permission prompt that gated this skill IS the user's confirmation to complete. The hard gates in Steps 2–2.6 (all rounds completed, ≥1 round has `testing_qa_output`, `check_verdict` READY, `task_testing_output.all_passed`) already block completion when prerequisites are unmet; these QA / file-approval items are warnings, not blockers.
|
|
99
99
|
|
|
100
100
|
### Step 4: Aggregate Files Changed
|
|
101
101
|
|
|
@@ -142,7 +142,7 @@ Call `complete_task(task_id)`. The server resolves the caller's worktree identit
|
|
|
142
142
|
|
|
143
143
|
Apply the `cleanup` skill inline to remove orphan references to deleted/modified files. Then apply `migration` to propagate renames/moves to consumers. Both run without sub-agent spawns. Skip cleanup if no deletions/modifications; skip migration if cleanup handled everything.
|
|
144
144
|
|
|
145
|
-
### Step 9: Show Result and Route
|
|
145
|
+
### Step 9: Show Result and Route
|
|
146
146
|
|
|
147
147
|
Show the completion summary:
|
|
148
148
|
|
|
@@ -153,6 +153,7 @@ Show the completion summary:
|
|
|
153
153
|
**Rounds**: [N] completed
|
|
154
154
|
**Files**: [N] changed
|
|
155
155
|
**Commit**: [hash]
|
|
156
|
+
**Warnings**: [any QA / file-approval warnings from Step 3, or "none"]
|
|
156
157
|
```
|
|
157
158
|
|
|
158
159
|
Then route. Same-context transitions (next task in this checkpoint) auto-trigger via the Skill tool. Cross-context transitions (checkpoint done → /cbp-checkpoint-check, session end) surface as a single directive 'Next: /clear, then /cbp-X' for the user to invoke after refreshing context.
|
|
@@ -9,7 +9,7 @@ effort: xhigh
|
|
|
9
9
|
|
|
10
10
|
# Task Testing Command
|
|
11
11
|
|
|
12
|
-
Comprehensive task-level testing —
|
|
12
|
+
Comprehensive task-level testing — the **cross-round double-check** run once after all rounds complete. Per-round QA (per-app build/lint/types, the `console.log`/debug scan, the OWASP/secret grep, `pnpm audit`) is owned by each round's `testing-qa-agent`; this skill does NOT re-run it. Instead it tests the **entire delivered feature holistically** across the full task diff — catching cross-package and cross-round problems no single round can see. Runs inline — no sub-agent.
|
|
13
13
|
|
|
14
14
|
## When Used
|
|
15
15
|
|
|
@@ -19,7 +19,7 @@ Comprehensive task-level testing — runs all automated tests and walks the user
|
|
|
19
19
|
|
|
20
20
|
## Scope vs Round-Level Validation
|
|
21
21
|
|
|
22
|
-
Per-wave `testing-qa-agent` runs inside `/cbp-round-execute` Step 5. This skill adds the cross-
|
|
22
|
+
Per-wave `testing-qa-agent` runs inside `/cbp-round-execute` Step 5 and **owns per-round QA**: per-app build/lint/types, the `console.log`/debug scan, the OWASP/secret full-diff grep, and `pnpm audit`. This skill does NOT repeat them. It adds only the cross-round layer invisible within a single round: workspace-wide lint, workspace tsc, and the full test suite (which catch cross-package breakage), plus the cross-round code review (Step 6.5), the autonomous sim screenshot loop (Step 6.x), and the user manual walkthrough (Step 8).
|
|
23
23
|
|
|
24
24
|
## Instructions
|
|
25
25
|
|
|
@@ -109,11 +109,9 @@ Capture stdout and stderr for each check.
|
|
|
109
109
|
| Full-repo lint | `pnpm -w lint` | Always |
|
|
110
110
|
| Full-repo types | `pnpm exec tsc --noEmit` | Source files changed |
|
|
111
111
|
| Full-repo unit tests | `pnpm test --run` | Source files in aggregated_files |
|
|
112
|
-
| Full-repo audit | `pnpm audit` | Always |
|
|
113
112
|
| Per-package E2E | `pnpm --filter <pkg> e2e:test` | UI files in aggregated_files |
|
|
114
|
-
| Full-diff security scan | inline grep or `security-agent` | Always |
|
|
115
113
|
|
|
116
|
-
Per-file lint + format are enforced by `lint-format-on-edit.sh`
|
|
114
|
+
These are the workspace-wide / cross-package checks only — per-app build/lint/types, the `console.log`/debug scan, the OWASP/secret grep, and `pnpm audit` already ran per-round inside `testing-qa-agent` and are NOT repeated here. Per-file lint + format are enforced by `lint-format-on-edit.sh` per edit. This step catches cross-package issues invisible to per-wave checks.
|
|
117
115
|
|
|
118
116
|
**Soft tests** (report, don't block):
|
|
119
117
|
|
|
@@ -133,7 +133,7 @@ Once the gates pass, load the context the head command needs. This ensures `/cle
|
|
|
133
133
|
| `/cbp-checkpoint-start` | Load checkpoint via MCP `get_checkpoints` + `get_tasks(checkpoint_id)`. Display checkpoint title, status, claim state, first pending task |
|
|
134
134
|
| `/cbp-task-start [N]` | Load via MCP `get_current_task`. Display checkpoint title + task title/requirements summary |
|
|
135
135
|
| `/cbp-round-start` | Load via MCP `get_current_task` + `get_rounds(task_id)`. Display checkpoint + task + round count + last round summary |
|
|
136
|
-
| `/cbp-round-update` | Load via MCP `get_current_task` + `get_rounds(task_id)`. Display checkpoint + task + files_changed
|
|
136
|
+
| `/cbp-round-update` | Load via MCP `get_current_task` + `get_rounds(task_id)`. Display checkpoint + task + files_changed triage summary (claude_approved, findings, hard_fail) |
|
|
137
137
|
| `/cbp-round-input` | **Full context load** (see Step 2b) |
|
|
138
138
|
| `/cbp-task-check` | Load via MCP `get_current_task`. Display checkpoint + task + files summary |
|
|
139
139
|
| `/cbp-task-testing` | Load via MCP `get_current_task` + `get_rounds(task_id)`. Display checkpoint + task + testing status summary |
|