@skilly-hand/skilly-hand 0.29.2 → 0.29.3
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/CHANGELOG.md +16 -0
- package/catalog/README.md +2 -2
- package/catalog/skills/spec-driven-development/SKILL.md +95 -144
- package/catalog/skills/spec-driven-development/agents/apply.md +30 -15
- package/catalog/skills/spec-driven-development/agents/orchestrate.md +23 -14
- package/catalog/skills/spec-driven-development/agents/plan.md +19 -17
- package/catalog/skills/spec-driven-development/agents/verify.md +40 -19
- package/catalog/skills/spec-driven-development/assets/delta-spec-template.md +50 -15
- package/catalog/skills/spec-driven-development/assets/design-template.md +20 -14
- package/catalog/skills/spec-driven-development/assets/spec-template.md +41 -20
- package/catalog/skills/spec-driven-development/assets/validation-checklist.md +28 -21
- package/catalog/skills/spec-driven-development/manifest.json +4 -4
- package/catalog/skills/test-driven-development/SKILL.md +92 -117
- package/catalog/skills/test-driven-development/assets/tdd-cycle.md +63 -447
- package/catalog/skills/test-driven-development/manifest.json +5 -5
- package/package.json +1 -1
- package/packages/catalog/package.json +1 -1
- package/packages/cli/package.json +1 -1
- package/packages/core/package.json +1 -1
- package/packages/detectors/package.json +1 -1
package/CHANGELOG.md
CHANGED
|
@@ -16,6 +16,22 @@ All notable changes to this project are documented in this file.
|
|
|
16
16
|
### Removed
|
|
17
17
|
- _None._
|
|
18
18
|
|
|
19
|
+
## [0.29.3] - 2026-06-20
|
|
20
|
+
[View on npm](https://www.npmjs.com/package/@skilly-hand/skilly-hand/v/0.29.3)
|
|
21
|
+
|
|
22
|
+
### Added
|
|
23
|
+
- _None._
|
|
24
|
+
|
|
25
|
+
### Changed
|
|
26
|
+
- Reworked `spec-driven-development` around a portable lifecycle with repository-native capability discovery, evidence-backed task state, explicit change control, and archive invariants.
|
|
27
|
+
- Rebuilt `test-driven-development` guidance around evidence-based RED, GREEN, and REFACTOR cycles without fixed framework or test-runner assumptions.
|
|
28
|
+
|
|
29
|
+
### Fixed
|
|
30
|
+
- _None._
|
|
31
|
+
|
|
32
|
+
### Removed
|
|
33
|
+
- _None._
|
|
34
|
+
|
|
19
35
|
## [0.29.2] - 2026-06-20
|
|
20
36
|
[View on npm](https://www.npmjs.com/package/@skilly-hand/skilly-hand/v/0.29.2)
|
|
21
37
|
|
package/catalog/README.md
CHANGED
|
@@ -20,8 +20,8 @@ Published portable skills consumed by the `skilly-hand` CLI.
|
|
|
20
20
|
| `react-guidelines` | Guide React and Next.js code generation, review, and performance tuning using latest stable React verification and modern framework best practices. Trigger: generating, reviewing, refactoring, or optimizing React code artifacts in React projects. | react, frontend, workflow, best-practices | all |
|
|
21
21
|
| `review-rangers` | Review code, decisions, and artifacts through a multi-perspective committee and a domain expert safety guard, then synthesize a structured verdict. | core, workflow, review, quality | all |
|
|
22
22
|
| `roaster` | Challenge plans with constructive roast-style critique that exposes weak assumptions, missing angles, shallow sequencing, and unclear success criteria. Trigger: when the user proposes, requests, or evaluates a plan of any kind. | core, workflow, planning, quality | all |
|
|
23
|
-
| `spec-driven-development` | Plan, execute, and verify multi-step work through versioned specs with small, testable tasks. | core, workflow, planning | all |
|
|
24
|
-
| `test-driven-development` | Guide implementation
|
|
23
|
+
| `spec-driven-development` | Plan, execute, and verify multi-step work through versioned specs with small, testable tasks. Trigger: planning or executing feature work, bug fixes, and multi-phase implementation. | core, workflow, planning | all |
|
|
24
|
+
| `test-driven-development` | Guide implementation through evidence-based RED, GREEN, and REFACTOR cycles without assuming a language, framework, or test runner. Trigger: implementing testable behavior or reproducing a regression with tests first. | testing, workflow, quality, core | all |
|
|
25
25
|
| `token-optimizer` | Classify task complexity and right-size reasoning depth, context gathering, and response detail to reduce wasted tokens. | core, workflow, efficiency | all |
|
|
26
26
|
| `user-story-crafting` | Create and refine user stories with structured quality gates, splitting heuristics, and lightweight story mapping for release slicing. Trigger: writing, restructuring, splitting, or sequencing user stories for delivery-ready backlog work. | product, workflow, planning, quality | all |
|
|
27
27
|
|
|
@@ -1,12 +1,12 @@
|
|
|
1
1
|
---
|
|
2
2
|
name: "spec-driven-development"
|
|
3
|
-
description: "Plan, execute, and verify multi-step work through versioned specs with small, testable tasks."
|
|
3
|
+
description: "Plan, execute, and verify multi-step work through versioned specs with small, testable tasks. Trigger: planning or executing feature work, bug fixes, and multi-phase implementation."
|
|
4
4
|
skillMetadata:
|
|
5
5
|
author: "skilly-hand"
|
|
6
|
-
last-edit: "2026-
|
|
6
|
+
last-edit: "2026-06-20"
|
|
7
7
|
license: "Apache-2.0"
|
|
8
|
-
version: "1.0
|
|
9
|
-
changelog: "Added
|
|
8
|
+
version: "1.1.0"
|
|
9
|
+
changelog: "Added a portable SDD lifecycle with capability-based routing, task evidence, change control, and archive invariants; prevents fixed tool dependencies and duplicated task state; affects planning, apply, verify, orchestrate, and spec templates"
|
|
10
10
|
auto-invoke: "Planning or executing feature work, bug fixes, and multi-phase implementation"
|
|
11
11
|
allowed-tools:
|
|
12
12
|
- "Read"
|
|
@@ -22,192 +22,143 @@ skillMetadata:
|
|
|
22
22
|
|
|
23
23
|
## When to Use
|
|
24
24
|
|
|
25
|
-
Use this skill when
|
|
25
|
+
Use this skill when work spans multiple steps, requirements need written boundaries, or progress must survive across contributors or sessions.
|
|
26
26
|
|
|
27
|
-
|
|
28
|
-
- Requirements are easy to misinterpret without written constraints.
|
|
29
|
-
- You need a repeatable plan that can be reviewed before coding.
|
|
30
|
-
- Several contributors or sessions may touch the same feature.
|
|
27
|
+
Skip it for trivial edits, urgent recovery work, and tasks with no meaningful verification path.
|
|
31
28
|
|
|
32
|
-
|
|
29
|
+
## Portable Contract
|
|
33
30
|
|
|
34
|
-
|
|
35
|
-
- Emergency fixes where immediate response matters more than process.
|
|
36
|
-
- Tasks with no meaningful verification path.
|
|
31
|
+
The workflow MUST remain executable with this skill alone.
|
|
37
32
|
|
|
38
|
-
|
|
39
|
-
|
|
40
|
-
|
|
41
|
-
|
|
42
|
-
|
|
43
|
-
2. Review and refine scope, constraints, and tasks.
|
|
44
|
-
3. Execute one small task at a time.
|
|
45
|
-
4. Verify each task and the end-to-end outcome, ending with a required `review-rangers` final gate.
|
|
46
|
-
5. Archive to `.sdd/archive/` when complete.
|
|
47
|
-
|
|
48
|
-
Recommended task size:
|
|
49
|
-
|
|
50
|
-
- Up to 3 files per task.
|
|
51
|
-
- Around 30 minutes of implementation effort.
|
|
52
|
-
- Clear, concrete verify step.
|
|
33
|
+
- Treat integrations as optional capabilities, never required product names.
|
|
34
|
+
- Discover available tools, commands, and repository conventions before selecting them.
|
|
35
|
+
- When a capability is unavailable, use a local structured fallback or record a blocker.
|
|
36
|
+
- Keep requirements, tasks, progress, evidence, and changes in `spec.md` as the single source of truth.
|
|
37
|
+
- Do not create a second task list that can drift from the spec.
|
|
53
38
|
|
|
54
|
-
|
|
55
|
-
|
|
56
|
-
## OpenSpec Complementary Support
|
|
39
|
+
## Lifecycle
|
|
57
40
|
|
|
58
|
-
|
|
41
|
+
```text
|
|
42
|
+
DRAFT -> APPROVED -> IN_PROGRESS -> VERIFYING -> COMPLETE -> ARCHIVED
|
|
43
|
+
| | |
|
|
44
|
+
+----------> BLOCKED <------+
|
|
45
|
+
```
|
|
59
46
|
|
|
60
|
-
|
|
47
|
+
Rules:
|
|
61
48
|
|
|
62
|
-
|
|
63
|
-
|
|
64
|
-
|
|
49
|
+
1. Planning creates or updates `.sdd/active/<work-name>/spec.md`.
|
|
50
|
+
2. Implementation begins after the approval policy is satisfied.
|
|
51
|
+
3. Only one task should normally be `IN_PROGRESS` at a time.
|
|
52
|
+
4. A task becomes `DONE` only after its verify step passes and evidence is recorded.
|
|
53
|
+
5. Changed requirements return affected tasks to planning before implementation continues.
|
|
54
|
+
6. Archive only after feature validation passes and no task remains open or blocked.
|
|
65
55
|
|
|
66
|
-
|
|
56
|
+
Valid task states: `TODO`, `IN_PROGRESS`, `BLOCKED`, `DONE`.
|
|
67
57
|
|
|
68
|
-
|
|
69
|
-
- If OpenSpec is unavailable, continue in `.sdd` and document assumptions directly in the active spec.
|
|
58
|
+
## Approval Policy
|
|
70
59
|
|
|
71
|
-
|
|
72
|
-
| --- | --- |
|
|
73
|
-
| Single-session or straightforward work with clear requirements | Work spans multiple sessions and needs persistent planning context |
|
|
74
|
-
| Existing `.sdd` artifacts already provide enough review clarity | Team needs proposal/design/tasks artifacts for async review |
|
|
75
|
-
| Requirement changes are small and easy to track in-place | Requirement deltas are complex and need explicit change framing |
|
|
60
|
+
Use an explicit human checkpoint when the user requests one, requirements remain ambiguous, risk is material, or the next action is difficult to reverse. Otherwise, a documented self-review may satisfy approval.
|
|
76
61
|
|
|
77
|
-
|
|
78
|
-
|
|
79
|
-
---
|
|
62
|
+
Record the chosen approval policy in the spec. Do not assume every environment supports interactive checkpoints.
|
|
80
63
|
|
|
81
64
|
## Spec Structure
|
|
82
65
|
|
|
83
|
-
A practical spec
|
|
84
|
-
|
|
85
|
-
- `Why`: problem and urgency.
|
|
86
|
-
- `What`: concrete deliverable.
|
|
87
|
-
- `Constraints`: `MUST`, `MUST NOT`, out-of-scope boundaries.
|
|
88
|
-
- `Current State`: relevant code context.
|
|
89
|
-
- `Tasks`: small implementation units with verify steps.
|
|
90
|
-
- `Validation`: full feature checks after all tasks.
|
|
91
|
-
|
|
92
|
-
For existing features with behavior changes, use a delta format (`ADDED`, `MODIFIED`, `REMOVED`) instead of rewriting everything.
|
|
93
|
-
|
|
94
|
-
---
|
|
66
|
+
A practical spec contains:
|
|
95
67
|
|
|
96
|
-
|
|
68
|
+
- `Why`: problem and value.
|
|
69
|
+
- `What`: concrete, testable deliverable.
|
|
70
|
+
- `Constraints`: enforceable `MUST`, `SHOULD`, `MAY`, and `MUST NOT` statements.
|
|
71
|
+
- `Out of Scope`: explicit boundaries.
|
|
72
|
+
- `Current State`: verified context and integration points.
|
|
73
|
+
- `Approval Policy`: checkpoint or self-review rule.
|
|
74
|
+
- `Tasks`: small units with scenarios, capabilities, files, verify steps, and done definitions.
|
|
75
|
+
- `Progress`: task state and evidence.
|
|
76
|
+
- `Validation`: end-to-end checks.
|
|
77
|
+
- `Change Log`: requirement or scope changes that affect execution.
|
|
97
78
|
|
|
98
|
-
|
|
99
|
-
| --- | --- |
|
|
100
|
-
| New feature with no previous spec | Behavior change to an existing feature |
|
|
101
|
-
| Greenfield implementation | Bug fix or requirement adjustment |
|
|
102
|
-
| No requirement baseline exists | Existing requirements already exist |
|
|
79
|
+
### Task Contract
|
|
103
80
|
|
|
104
|
-
|
|
81
|
+
Each task MUST define:
|
|
105
82
|
|
|
106
|
-
|
|
83
|
+
```markdown
|
|
84
|
+
### T1: Title
|
|
107
85
|
|
|
108
|
-
|
|
109
|
-
|
|
110
|
-
|
|
111
|
-
|
|
112
|
-
|
|
113
|
-
|
|
114
|
-
|
|
115
|
-
|
|
116
|
-
- Keep task scope small: if a task touches more than 3 files or needs more than about 30 minutes, split it.
|
|
117
|
-
- Keep verification fast: each task should be verifiable in 2 minutes or less.
|
|
118
|
-
- Keep completion explicit: each task must have a one-sentence definition of done.
|
|
86
|
+
**What:** Observable outcome.
|
|
87
|
+
**Required Capabilities:** Semantic needs, or `none`.
|
|
88
|
+
**Files:** Expected scope, or `discover` when not yet known.
|
|
89
|
+
**Scenario:** GIVEN / WHEN / THEN, when behavior is involved.
|
|
90
|
+
**Verify:** Project-discovered command or concrete manual check.
|
|
91
|
+
**Done:** One sentence describing completion.
|
|
92
|
+
```
|
|
119
93
|
|
|
120
|
-
|
|
94
|
+
Capabilities describe needs such as test design, accessibility review, or security analysis. They MUST NOT require a particular skill, agent, vendor, or service. Resolve them against what is actually available at execution time.
|
|
121
95
|
|
|
122
|
-
|
|
123
|
-
Does the task touch > 3 files?
|
|
124
|
-
YES -> split it
|
|
96
|
+
## Full vs Delta Spec
|
|
125
97
|
|
|
126
|
-
|
|
127
|
-
YES -> split it
|
|
98
|
+
Use a full spec for new work without an existing requirement baseline. Use a delta spec for changes to established behavior.
|
|
128
99
|
|
|
129
|
-
|
|
130
|
-
|
|
100
|
+
- `ADDED`: new requirement and scenarios.
|
|
101
|
+
- `MODIFIED`: complete replacement requirement plus previous behavior reference.
|
|
102
|
+
- `REMOVED`: removed requirement plus reason.
|
|
131
103
|
|
|
132
|
-
|
|
133
|
-
NO -> task is too vague; split it
|
|
134
|
-
```
|
|
104
|
+
Before archiving a delta, reconcile it with the maintained requirement baseline when one exists. If no baseline exists, archive the delta as the historical record.
|
|
135
105
|
|
|
136
|
-
##
|
|
106
|
+
## Task Sizing
|
|
137
107
|
|
|
138
|
-
|
|
108
|
+
Prefer tasks that:
|
|
139
109
|
|
|
140
|
-
|
|
141
|
-
|
|
142
|
-
|
|
110
|
+
- Have one observable outcome.
|
|
111
|
+
- Touch a small, related file set.
|
|
112
|
+
- Can be completed without hidden dependencies.
|
|
113
|
+
- Have a fast, deterministic verify step.
|
|
114
|
+
- Have a one-sentence definition of done.
|
|
143
115
|
|
|
144
|
-
|
|
145
|
-
MUST use existing auth middleware.
|
|
146
|
-
MUST NOT add new runtime dependencies.
|
|
147
|
-
```
|
|
116
|
+
Split a task when its concerns, dependencies, or verification cannot be explained independently. File counts and time estimates are heuristics, not universal gates.
|
|
148
117
|
|
|
149
|
-
|
|
118
|
+
## Change Control
|
|
150
119
|
|
|
151
|
-
|
|
152
|
-
WRONG:
|
|
153
|
-
T1: Build the whole authentication feature.
|
|
120
|
+
When requirements change during execution:
|
|
154
121
|
|
|
155
|
-
|
|
156
|
-
|
|
157
|
-
|
|
158
|
-
|
|
159
|
-
|
|
122
|
+
1. Stop the affected task at a stable point.
|
|
123
|
+
2. Record the change and reason in `Change Log`.
|
|
124
|
+
3. Update affected constraints, scenarios, tasks, and validation.
|
|
125
|
+
4. Mark invalidated evidence as superseded.
|
|
126
|
+
5. Reapply the approval policy before continuing.
|
|
160
127
|
|
|
161
|
-
|
|
128
|
+
Do not silently stretch a task to absorb new behavior.
|
|
162
129
|
|
|
163
|
-
|
|
164
|
-
WRONG:
|
|
165
|
-
Verify: It works.
|
|
130
|
+
## Verification and Review
|
|
166
131
|
|
|
167
|
-
|
|
168
|
-
Verify: npm test -- src/auth/login.test.ts
|
|
169
|
-
```
|
|
132
|
+
Verification checks behavior against the spec, not against implementation intent.
|
|
170
133
|
|
|
171
|
-
|
|
134
|
+
- Run every task verify step using project-discovered commands.
|
|
135
|
+
- Check every `MUST` and `MUST NOT` constraint explicitly.
|
|
136
|
+
- Separate automated evidence, manual evidence, warnings, and blockers.
|
|
137
|
+
- Perform a final structured review using an available review capability or the fallback checklist in `agents/verify.md`.
|
|
138
|
+
- A missing optional integration is not a failure when the local fallback was completed.
|
|
172
139
|
|
|
173
|
-
|
|
174
|
-
WRONG:
|
|
175
|
-
T1: Create component and migrate all pages to it.
|
|
140
|
+
## Archive Invariants
|
|
176
141
|
|
|
177
|
-
|
|
178
|
-
T1: Create component.
|
|
179
|
-
T2: Migrate page A.
|
|
180
|
-
T3: Migrate page B.
|
|
181
|
-
```
|
|
142
|
+
Archive to `.sdd/archive/<YYYY-MM-DD>-<work-name>/` only when:
|
|
182
143
|
|
|
183
|
-
|
|
144
|
+
- All tasks are `DONE`.
|
|
145
|
+
- Validation passes or approved manual checks are recorded.
|
|
146
|
+
- No blocker is unresolved.
|
|
147
|
+
- Constraint and final-review evidence is present.
|
|
148
|
+
- Delta reconciliation is complete when applicable.
|
|
184
149
|
|
|
185
|
-
|
|
150
|
+
Generate the ISO date from the current environment; do not assume a particular shell command or VCS.
|
|
186
151
|
|
|
187
152
|
## Modes
|
|
188
153
|
|
|
189
|
-
|
|
190
|
-
|
|
191
|
-
-
|
|
192
|
-
-
|
|
193
|
-
- Verification mode: [agents/verify.md](agents/verify.md)
|
|
194
|
-
- Orchestrator mode: [agents/orchestrate.md](agents/orchestrate.md)
|
|
195
|
-
|
|
196
|
-
---
|
|
154
|
+
- Planning: [agents/plan.md](agents/plan.md)
|
|
155
|
+
- Implementation: [agents/apply.md](agents/apply.md)
|
|
156
|
+
- Verification: [agents/verify.md](agents/verify.md)
|
|
157
|
+
- Orchestration: [agents/orchestrate.md](agents/orchestrate.md)
|
|
197
158
|
|
|
198
159
|
## Templates
|
|
199
160
|
|
|
200
|
-
-
|
|
161
|
+
- Full spec: [assets/spec-template.md](assets/spec-template.md)
|
|
201
162
|
- Delta spec: [assets/delta-spec-template.md](assets/delta-spec-template.md)
|
|
202
163
|
- Design decisions: [assets/design-template.md](assets/design-template.md)
|
|
203
164
|
- Validation checklist: [assets/validation-checklist.md](assets/validation-checklist.md)
|
|
204
|
-
|
|
205
|
-
---
|
|
206
|
-
|
|
207
|
-
## Commands
|
|
208
|
-
|
|
209
|
-
```bash
|
|
210
|
-
mkdir -p .sdd/active/<feature-name>
|
|
211
|
-
cp .skilly-hand/catalog/spec-driven-development/assets/spec-template.md .sdd/active/<feature-name>/spec.md
|
|
212
|
-
cp .skilly-hand/catalog/spec-driven-development/assets/design-template.md .sdd/active/<feature-name>/design.md
|
|
213
|
-
```
|
|
@@ -2,25 +2,40 @@
|
|
|
2
2
|
|
|
3
3
|
## Purpose
|
|
4
4
|
|
|
5
|
-
Implement approved spec
|
|
5
|
+
Implement approved tasks one at a time and keep the spec synchronized with execution evidence.
|
|
6
6
|
|
|
7
|
-
##
|
|
7
|
+
## Procedure
|
|
8
8
|
|
|
9
|
-
|
|
10
|
-
|
|
9
|
+
1. Read `spec.md`, relevant design decisions, and repository instructions.
|
|
10
|
+
2. Confirm the approval policy is satisfied and dependencies are `DONE`.
|
|
11
|
+
3. Resolve required capabilities from available local skills, tools, or structured self-review.
|
|
12
|
+
4. Mark the selected task `IN_PROGRESS`.
|
|
13
|
+
5. Implement only the assigned outcome.
|
|
14
|
+
6. When the task declares test-driven work, follow the available TDD guidance or the portable RED/GREEN/REFACTOR contract described in the task.
|
|
15
|
+
7. Run the task verify step using commands discovered from the project.
|
|
16
|
+
8. Record concise evidence: command or check, result, and relevant output summary.
|
|
17
|
+
9. Mark the task `DONE` only after verification passes; otherwise mark it `BLOCKED` with the cause.
|
|
18
|
+
10. Append a change-log entry only for requirement, scope, or design changes, not routine progress.
|
|
11
19
|
|
|
12
|
-
##
|
|
20
|
+
## Scope Change Rule
|
|
21
|
+
|
|
22
|
+
If implementation reveals new behavior, conflicting constraints, or a wider blast radius, stop the affected task and return it to planning. Do not silently add work or rewrite acceptance criteria to match the implementation.
|
|
23
|
+
|
|
24
|
+
## Evidence Contract
|
|
25
|
+
|
|
26
|
+
Evidence must be reproducible and proportionate:
|
|
27
|
+
|
|
28
|
+
```text
|
|
29
|
+
Check: <command or manual procedure>
|
|
30
|
+
Result: PASS | FAIL | NOT_RUN
|
|
31
|
+
Summary: <short factual result>
|
|
32
|
+
```
|
|
13
33
|
|
|
14
|
-
|
|
15
|
-
2. Execute one task at a time.
|
|
16
|
-
3. Keep changes scoped to the assigned task.
|
|
17
|
-
4. Run the task verify step before marking done.
|
|
18
|
-
5. Record progress against task IDs.
|
|
19
|
-
6. Stop and report blockers when constraints conflict.
|
|
34
|
+
Do not claim a command ran when it did not. Record unavailable or human-only checks explicitly.
|
|
20
35
|
|
|
21
36
|
## Quality Bar
|
|
22
37
|
|
|
23
|
-
-
|
|
24
|
-
-
|
|
25
|
-
-
|
|
26
|
-
-
|
|
38
|
+
- Changes remain inside task scope.
|
|
39
|
+
- Verification evidence supports the `DONE` state.
|
|
40
|
+
- Optional integrations have a documented local fallback.
|
|
41
|
+
- No automatic commit, issue update, or remote action is assumed.
|
|
@@ -2,24 +2,33 @@
|
|
|
2
2
|
|
|
3
3
|
## Purpose
|
|
4
4
|
|
|
5
|
-
Coordinate planning, implementation, and
|
|
5
|
+
Coordinate planning, implementation, verification, and archive while remaining usable without delegation or external integrations.
|
|
6
6
|
|
|
7
|
-
##
|
|
7
|
+
## Workflow
|
|
8
8
|
|
|
9
|
-
|
|
10
|
-
|
|
9
|
+
1. PLAN: create or update the active spec.
|
|
10
|
+
2. APPROVE: apply the spec's checkpoint or self-review policy.
|
|
11
|
+
3. APPLY: execute one ready task at a time.
|
|
12
|
+
4. VERIFY TASK: require evidence before selecting the next task.
|
|
13
|
+
5. REPLAN: return changed requirements or blocked dependencies to planning.
|
|
14
|
+
6. VERIFY FEATURE: run validation and the portable final review gate.
|
|
15
|
+
7. ARCHIVE: enforce archive invariants and move the completed work once.
|
|
11
16
|
|
|
12
|
-
##
|
|
17
|
+
## Capability Resolution
|
|
18
|
+
|
|
19
|
+
- Delegate a phase only when a suitable capability is available and delegation improves the outcome.
|
|
20
|
+
- Otherwise execute the corresponding local mode directly.
|
|
21
|
+
- Never invent resource names, identifiers, issue keys, commands, or service availability.
|
|
22
|
+
- Remote writes, commits, and comments require their own discovered workflow and user authorization.
|
|
23
|
+
|
|
24
|
+
## Checkpoint Policy
|
|
13
25
|
|
|
14
|
-
|
|
15
|
-
2. REVIEW CHECKPOINT: Confirm the plan is approved.
|
|
16
|
-
3. APPLY: Execute agreed task batch.
|
|
17
|
-
4. VERIFY CHECKPOINT: Validate outputs against the spec and run the required final `review-rangers` gate.
|
|
18
|
-
5. REPEAT: Continue by phase or task batch.
|
|
19
|
-
6. ARCHIVE: Move completed work from `.sdd/active/` to `.sdd/archive/`.
|
|
26
|
+
Pause for explicit user approval when required by the spec, risk, ambiguity, or an irreversible action. For low-risk autonomous work, record a self-review decision and continue.
|
|
20
27
|
|
|
21
28
|
## Coordination Rules
|
|
22
29
|
|
|
23
|
-
-
|
|
24
|
-
- Keep
|
|
25
|
-
-
|
|
30
|
+
- Keep `spec.md` authoritative.
|
|
31
|
+
- Keep at most one task `IN_PROGRESS` unless independence is explicit.
|
|
32
|
+
- Do not advance past failed verification.
|
|
33
|
+
- Surface blockers with evidence and the smallest decision needed.
|
|
34
|
+
- Do not declare completion while open tasks or unresolved blockers remain.
|
|
@@ -2,26 +2,28 @@
|
|
|
2
2
|
|
|
3
3
|
## Purpose
|
|
4
4
|
|
|
5
|
-
Turn a request into an executable,
|
|
6
|
-
|
|
7
|
-
## Inputs
|
|
8
|
-
|
|
9
|
-
- Goal or problem statement.
|
|
10
|
-
- Known constraints.
|
|
11
|
-
- Current state references (files, systems, behaviors).
|
|
5
|
+
Turn a request into an executable spec without assuming a framework, toolchain, or external service.
|
|
12
6
|
|
|
13
7
|
## Procedure
|
|
14
8
|
|
|
15
|
-
1.
|
|
16
|
-
2.
|
|
17
|
-
3.
|
|
18
|
-
4.
|
|
19
|
-
5.
|
|
20
|
-
6.
|
|
9
|
+
1. Inspect the repository for applicable instructions, conventions, commands, and existing behavior.
|
|
10
|
+
2. Clarify only missing information that cannot be discovered safely.
|
|
11
|
+
3. Choose a full or delta spec.
|
|
12
|
+
4. Define scope, constraints, non-goals, approval policy, and validation.
|
|
13
|
+
5. Create tasks using the task contract from the parent skill.
|
|
14
|
+
6. Add behavioral scenarios where outcomes can be observed.
|
|
15
|
+
7. Describe required capabilities semantically and use `none` when no special capability is needed.
|
|
16
|
+
8. Initialize one progress row per task with `TODO`.
|
|
17
|
+
9. Self-review with the validation checklist and apply the approval policy.
|
|
21
18
|
|
|
22
19
|
## Quality Bar
|
|
23
20
|
|
|
24
|
-
-
|
|
25
|
-
-
|
|
26
|
-
-
|
|
27
|
-
-
|
|
21
|
+
- Another implementer can execute without guessing intent.
|
|
22
|
+
- No task requires a named agent, skill, vendor, framework, or command that has not been discovered locally.
|
|
23
|
+
- Tasks, progress rows, and validation are mutually consistent.
|
|
24
|
+
- Risks and meaningful alternatives live in `design.md`; routine choices stay in the spec.
|
|
25
|
+
- `spec.md` is the only task source of truth.
|
|
26
|
+
|
|
27
|
+
## Blockers
|
|
28
|
+
|
|
29
|
+
Mark planning `BLOCKED` only when a required decision or fact cannot be discovered and a reasonable assumption would create material risk. Record the missing input and its impact.
|
|
@@ -2,31 +2,52 @@
|
|
|
2
2
|
|
|
3
3
|
## Purpose
|
|
4
4
|
|
|
5
|
-
Validate
|
|
5
|
+
Validate implementation against the approved spec using reproducible evidence and a portable final review.
|
|
6
6
|
|
|
7
|
-
##
|
|
7
|
+
## Procedure
|
|
8
8
|
|
|
9
|
-
|
|
10
|
-
|
|
9
|
+
1. Read the complete spec and relevant design decisions.
|
|
10
|
+
2. Confirm every task has a terminal state and `DONE` tasks have evidence.
|
|
11
|
+
3. Run or inspect each task verify step.
|
|
12
|
+
4. Run feature-level validation discovered from the repository and listed in the spec.
|
|
13
|
+
5. Evaluate every `MUST` and `MUST NOT` constraint independently.
|
|
14
|
+
6. Identify manual checks that still require human confirmation.
|
|
15
|
+
7. Perform the final review gate below.
|
|
16
|
+
8. Report blockers, warnings, and evidence separately.
|
|
11
17
|
|
|
12
|
-
##
|
|
18
|
+
## Portable Final Review Gate
|
|
19
|
+
|
|
20
|
+
Use an installed review capability when one is available and applicable. Otherwise perform this local review:
|
|
21
|
+
|
|
22
|
+
- Correctness: implementation satisfies every scenario and constraint.
|
|
23
|
+
- Regression risk: affected existing behavior has appropriate checks.
|
|
24
|
+
- Scope: no unapproved behavior or dependency was introduced.
|
|
25
|
+
- Maintainability: changes follow discovered project conventions.
|
|
26
|
+
- Safety: security, privacy, accessibility, data, and destructive-operation risks were considered when relevant.
|
|
27
|
+
- Evidence: results are reproducible and unsupported claims are absent.
|
|
28
|
+
|
|
29
|
+
Any unresolved correctness, constraint, or safety blocker fails verification. Missing optional tooling does not fail verification when this fallback is completed.
|
|
30
|
+
|
|
31
|
+
## Report Contract
|
|
32
|
+
|
|
33
|
+
```markdown
|
|
34
|
+
## Verification Report
|
|
13
35
|
|
|
14
|
-
|
|
15
|
-
|
|
16
|
-
3. Run feature-level validation commands.
|
|
17
|
-
4. Confirm constraints (`MUST`, `MUST NOT`) were respected.
|
|
18
|
-
5. Run a final structured `review-rangers` pass over the full change set.
|
|
19
|
-
6. Report pass/fail per area with concrete evidence.
|
|
36
|
+
### Task Evidence
|
|
37
|
+
| Task | Check | Result | Evidence |
|
|
20
38
|
|
|
21
|
-
###
|
|
39
|
+
### Constraints
|
|
40
|
+
| Constraint | Result | Evidence |
|
|
22
41
|
|
|
23
|
-
|
|
24
|
-
-
|
|
25
|
-
-
|
|
26
|
-
-
|
|
42
|
+
### Final Review
|
|
43
|
+
- Result: PASS | FAIL
|
|
44
|
+
- Blockers: none | list
|
|
45
|
+
- Warnings: none | list
|
|
46
|
+
- Manual checks: none | list
|
|
47
|
+
```
|
|
27
48
|
|
|
28
49
|
## Quality Bar
|
|
29
50
|
|
|
30
|
-
-
|
|
31
|
-
-
|
|
32
|
-
-
|
|
51
|
+
- Results distinguish `PASS`, `FAIL`, and `NOT_RUN`.
|
|
52
|
+
- Manual validation is never reported as automated evidence.
|
|
53
|
+
- Archive readiness is an explicit conclusion, not an implication.
|