@skilly-hand/skilly-hand 0.29.2 → 0.29.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/CHANGELOG.md CHANGED
@@ -16,6 +16,22 @@ All notable changes to this project are documented in this file.
16
16
  ### Removed
17
17
  - _None._
18
18
 
19
+ ## [0.29.3] - 2026-06-20
20
+ [View on npm](https://www.npmjs.com/package/@skilly-hand/skilly-hand/v/0.29.3)
21
+
22
+ ### Added
23
+ - _None._
24
+
25
+ ### Changed
26
+ - Reworked `spec-driven-development` around a portable lifecycle with repository-native capability discovery, evidence-backed task state, explicit change control, and archive invariants.
27
+ - Rebuilt `test-driven-development` guidance around evidence-based RED, GREEN, and REFACTOR cycles without fixed framework or test-runner assumptions.
28
+
29
+ ### Fixed
30
+ - _None._
31
+
32
+ ### Removed
33
+ - _None._
34
+
19
35
  ## [0.29.2] - 2026-06-20
20
36
  [View on npm](https://www.npmjs.com/package/@skilly-hand/skilly-hand/v/0.29.2)
21
37
 
package/catalog/README.md CHANGED
@@ -20,8 +20,8 @@ Published portable skills consumed by the `skilly-hand` CLI.
20
20
  | `react-guidelines` | Guide React and Next.js code generation, review, and performance tuning using latest stable React verification and modern framework best practices. Trigger: generating, reviewing, refactoring, or optimizing React code artifacts in React projects. | react, frontend, workflow, best-practices | all |
21
21
  | `review-rangers` | Review code, decisions, and artifacts through a multi-perspective committee and a domain expert safety guard, then synthesize a structured verdict. | core, workflow, review, quality | all |
22
22
  | `roaster` | Challenge plans with constructive roast-style critique that exposes weak assumptions, missing angles, shallow sequencing, and unclear success criteria. Trigger: when the user proposes, requests, or evaluates a plan of any kind. | core, workflow, planning, quality | all |
23
- | `spec-driven-development` | Plan, execute, and verify multi-step work through versioned specs with small, testable tasks. | core, workflow, planning | all |
24
- | `test-driven-development` | Guide implementation using the RED GREEN REFACTOR TDD cycle: write a failing test first, write the minimum code to pass, then refactor while tests stay green. | testing, workflow, quality, core | all |
23
+ | `spec-driven-development` | Plan, execute, and verify multi-step work through versioned specs with small, testable tasks. Trigger: planning or executing feature work, bug fixes, and multi-phase implementation. | core, workflow, planning | all |
24
+ | `test-driven-development` | Guide implementation through evidence-based RED, GREEN, and REFACTOR cycles without assuming a language, framework, or test runner. Trigger: implementing testable behavior or reproducing a regression with tests first. | testing, workflow, quality, core | all |
25
25
  | `token-optimizer` | Classify task complexity and right-size reasoning depth, context gathering, and response detail to reduce wasted tokens. | core, workflow, efficiency | all |
26
26
  | `user-story-crafting` | Create and refine user stories with structured quality gates, splitting heuristics, and lightweight story mapping for release slicing. Trigger: writing, restructuring, splitting, or sequencing user stories for delivery-ready backlog work. | product, workflow, planning, quality | all |
27
27
 
@@ -1,12 +1,12 @@
1
1
  ---
2
2
  name: "spec-driven-development"
3
- description: "Plan, execute, and verify multi-step work through versioned specs with small, testable tasks."
3
+ description: "Plan, execute, and verify multi-step work through versioned specs with small, testable tasks. Trigger: planning or executing feature work, bug fixes, and multi-phase implementation."
4
4
  skillMetadata:
5
5
  author: "skilly-hand"
6
- last-edit: "2026-04-03"
6
+ last-edit: "2026-06-20"
7
7
  license: "Apache-2.0"
8
- version: "1.0.3"
9
- changelog: "Added OpenSpec complementary support routing guidance to spec-driven-development instructions; improves planning continuity and review clarity when local SDD needs reinforcement; affects spec-driven-development SKILL guidance and manifest metadata"
8
+ version: "1.1.0"
9
+ changelog: "Added a portable SDD lifecycle with capability-based routing, task evidence, change control, and archive invariants; prevents fixed tool dependencies and duplicated task state; affects planning, apply, verify, orchestrate, and spec templates"
10
10
  auto-invoke: "Planning or executing feature work, bug fixes, and multi-phase implementation"
11
11
  allowed-tools:
12
12
  - "Read"
@@ -22,192 +22,143 @@ skillMetadata:
22
22
 
23
23
  ## When to Use
24
24
 
25
- Use this skill when:
25
+ Use this skill when work spans multiple steps, requirements need written boundaries, or progress must survive across contributors or sessions.
26
26
 
27
- - Work spans multiple commits or phases.
28
- - Requirements are easy to misinterpret without written constraints.
29
- - You need a repeatable plan that can be reviewed before coding.
30
- - Several contributors or sessions may touch the same feature.
27
+ Skip it for trivial edits, urgent recovery work, and tasks with no meaningful verification path.
31
28
 
32
- Do not use this skill for:
29
+ ## Portable Contract
33
30
 
34
- - Trivial one-file edits.
35
- - Emergency fixes where immediate response matters more than process.
36
- - Tasks with no meaningful verification path.
31
+ The workflow MUST remain executable with this skill alone.
37
32
 
38
- ---
39
-
40
- ## Core Workflow
41
-
42
- 1. Define the spec in `.sdd/active/<feature-name>/spec.md`.
43
- 2. Review and refine scope, constraints, and tasks.
44
- 3. Execute one small task at a time.
45
- 4. Verify each task and the end-to-end outcome, ending with a required `review-rangers` final gate.
46
- 5. Archive to `.sdd/archive/` when complete.
47
-
48
- Recommended task size:
49
-
50
- - Up to 3 files per task.
51
- - Around 30 minutes of implementation effort.
52
- - Clear, concrete verify step.
33
+ - Treat integrations as optional capabilities, never required product names.
34
+ - Discover available tools, commands, and repository conventions before selecting them.
35
+ - When a capability is unavailable, use a local structured fallback or record a blocker.
36
+ - Keep requirements, tasks, progress, evidence, and changes in `spec.md` as the single source of truth.
37
+ - Do not create a second task list that can drift from the spec.
53
38
 
54
- ---
55
-
56
- ## OpenSpec Complementary Support
39
+ ## Lifecycle
57
40
 
58
- Default execution SHOULD remain the local `.sdd` workflow.
41
+ ```text
42
+ DRAFT -> APPROVED -> IN_PROGRESS -> VERIFYING -> COMPLETE -> ARCHIVED
43
+ | | |
44
+ +----------> BLOCKED <------+
45
+ ```
59
46
 
60
- Recur to OpenSpec support when the task needs complementary structure for:
47
+ Rules:
61
48
 
62
- - Multi-session continuity where planning context must persist across chats.
63
- - Shareable planning artifacts for review before implementation.
64
- - Requirement-delta clarity that benefits from explicit change proposals.
49
+ 1. Planning creates or updates `.sdd/active/<work-name>/spec.md`.
50
+ 2. Implementation begins after the approval policy is satisfied.
51
+ 3. Only one task should normally be `IN_PROGRESS` at a time.
52
+ 4. A task becomes `DONE` only after its verify step passes and evidence is recorded.
53
+ 5. Changed requirements return affected tasks to planning before implementation continues.
54
+ 6. Archive only after feature validation passes and no task remains open or blocked.
65
55
 
66
- Routing rules:
56
+ Valid task states: `TODO`, `IN_PROGRESS`, `BLOCKED`, `DONE`.
67
57
 
68
- - Keep the local `.sdd/active/<feature-name>/spec.md` as the execution source of truth unless the team explicitly standardizes on OpenSpec paths.
69
- - If OpenSpec is unavailable, continue in `.sdd` and document assumptions directly in the active spec.
58
+ ## Approval Policy
70
59
 
71
- | Use local SDD only | Use OpenSpec support |
72
- | --- | --- |
73
- | Single-session or straightforward work with clear requirements | Work spans multiple sessions and needs persistent planning context |
74
- | Existing `.sdd` artifacts already provide enough review clarity | Team needs proposal/design/tasks artifacts for async review |
75
- | Requirement changes are small and easy to track in-place | Requirement deltas are complex and need explicit change framing |
60
+ Use an explicit human checkpoint when the user requests one, requirements remain ambiguous, risk is material, or the next action is difficult to reverse. Otherwise, a documented self-review may satisfy approval.
76
61
 
77
- Reference (informational): [https://openspec.dev/](https://openspec.dev/)
78
-
79
- ---
62
+ Record the chosen approval policy in the spec. Do not assume every environment supports interactive checkpoints.
80
63
 
81
64
  ## Spec Structure
82
65
 
83
- A practical spec includes:
84
-
85
- - `Why`: problem and urgency.
86
- - `What`: concrete deliverable.
87
- - `Constraints`: `MUST`, `MUST NOT`, out-of-scope boundaries.
88
- - `Current State`: relevant code context.
89
- - `Tasks`: small implementation units with verify steps.
90
- - `Validation`: full feature checks after all tasks.
91
-
92
- For existing features with behavior changes, use a delta format (`ADDED`, `MODIFIED`, `REMOVED`) instead of rewriting everything.
93
-
94
- ---
66
+ A practical spec contains:
95
67
 
96
- ## When to Use Delta vs Full Spec
68
+ - `Why`: problem and value.
69
+ - `What`: concrete, testable deliverable.
70
+ - `Constraints`: enforceable `MUST`, `SHOULD`, `MAY`, and `MUST NOT` statements.
71
+ - `Out of Scope`: explicit boundaries.
72
+ - `Current State`: verified context and integration points.
73
+ - `Approval Policy`: checkpoint or self-review rule.
74
+ - `Tasks`: small units with scenarios, capabilities, files, verify steps, and done definitions.
75
+ - `Progress`: task state and evidence.
76
+ - `Validation`: end-to-end checks.
77
+ - `Change Log`: requirement or scope changes that affect execution.
97
78
 
98
- | Use Full Spec | Use Delta Spec |
99
- | --- | --- |
100
- | New feature with no previous spec | Behavior change to an existing feature |
101
- | Greenfield implementation | Bug fix or requirement adjustment |
102
- | No requirement baseline exists | Existing requirements already exist |
79
+ ### Task Contract
103
80
 
104
- ## Archive Behavior
81
+ Each task MUST define:
105
82
 
106
- When archiving a delta spec, apply changes to the base specification:
83
+ ```markdown
84
+ ### T1: Title
107
85
 
108
- - `ADDED`: append new requirements.
109
- - `MODIFIED`: replace the previous requirement text.
110
- - `REMOVED`: delete the requirement and keep a short reason in commit history.
111
-
112
- Then move work from `.sdd/active/<feature-name>/` to `.sdd/archive/<feature-name>/`.
113
-
114
- ## Task Design Principles
115
-
116
- - Keep task scope small: if a task touches more than 3 files or needs more than about 30 minutes, split it.
117
- - Keep verification fast: each task should be verifiable in 2 minutes or less.
118
- - Keep completion explicit: each task must have a one-sentence definition of done.
86
+ **What:** Observable outcome.
87
+ **Required Capabilities:** Semantic needs, or `none`.
88
+ **Files:** Expected scope, or `discover` when not yet known.
89
+ **Scenario:** GIVEN / WHEN / THEN, when behavior is involved.
90
+ **Verify:** Project-discovered command or concrete manual check.
91
+ **Done:** One sentence describing completion.
92
+ ```
119
93
 
120
- ## Decision Tree: When to Break Tasks Smaller
94
+ Capabilities describe needs such as test design, accessibility review, or security analysis. They MUST NOT require a particular skill, agent, vendor, or service. Resolve them against what is actually available at execution time.
121
95
 
122
- ```text
123
- Does the task touch > 3 files?
124
- YES -> split it
96
+ ## Full vs Delta Spec
125
97
 
126
- Will the task take > 30 minutes?
127
- YES -> split it
98
+ Use a full spec for new work without an existing requirement baseline. Use a delta spec for changes to established behavior.
128
99
 
129
- Can the task be verified in <= 2 minutes?
130
- NO -> add a tighter verify step
100
+ - `ADDED`: new requirement and scenarios.
101
+ - `MODIFIED`: complete replacement requirement plus previous behavior reference.
102
+ - `REMOVED`: removed requirement plus reason.
131
103
 
132
- Can "done" be described in one sentence?
133
- NO -> task is too vague; split it
134
- ```
104
+ Before archiving a delta, reconcile it with the maintained requirement baseline when one exists. If no baseline exists, archive the delta as the historical record.
135
105
 
136
- ## Common Mistakes to Avoid
106
+ ## Task Sizing
137
107
 
138
- ### 1) Vague Constraints
108
+ Prefer tasks that:
139
109
 
140
- ```text
141
- WRONG:
142
- Must use best practices.
110
+ - Have one observable outcome.
111
+ - Touch a small, related file set.
112
+ - Can be completed without hidden dependencies.
113
+ - Have a fast, deterministic verify step.
114
+ - Have a one-sentence definition of done.
143
115
 
144
- RIGHT:
145
- MUST use existing auth middleware.
146
- MUST NOT add new runtime dependencies.
147
- ```
116
+ Split a task when its concerns, dependencies, or verification cannot be explained independently. File counts and time estimates are heuristics, not universal gates.
148
117
 
149
- ### 2) Oversized Tasks
118
+ ## Change Control
150
119
 
151
- ```text
152
- WRONG:
153
- T1: Build the whole authentication feature.
120
+ When requirements change during execution:
154
121
 
155
- RIGHT:
156
- T1: Add token verification middleware.
157
- T2: Add login endpoint behavior.
158
- T3: Add integration test for login flow.
159
- ```
122
+ 1. Stop the affected task at a stable point.
123
+ 2. Record the change and reason in `Change Log`.
124
+ 3. Update affected constraints, scenarios, tasks, and validation.
125
+ 4. Mark invalidated evidence as superseded.
126
+ 5. Reapply the approval policy before continuing.
160
127
 
161
- ### 3) Missing Verification
128
+ Do not silently stretch a task to absorb new behavior.
162
129
 
163
- ```text
164
- WRONG:
165
- Verify: It works.
130
+ ## Verification and Review
166
131
 
167
- RIGHT:
168
- Verify: npm test -- src/auth/login.test.ts
169
- ```
132
+ Verification checks behavior against the spec, not against implementation intent.
170
133
 
171
- ### 4) Mixed Concerns in One Task
134
+ - Run every task verify step using project-discovered commands.
135
+ - Check every `MUST` and `MUST NOT` constraint explicitly.
136
+ - Separate automated evidence, manual evidence, warnings, and blockers.
137
+ - Perform a final structured review using an available review capability or the fallback checklist in `agents/verify.md`.
138
+ - A missing optional integration is not a failure when the local fallback was completed.
172
139
 
173
- ```text
174
- WRONG:
175
- T1: Create component and migrate all pages to it.
140
+ ## Archive Invariants
176
141
 
177
- RIGHT:
178
- T1: Create component.
179
- T2: Migrate page A.
180
- T3: Migrate page B.
181
- ```
142
+ Archive to `.sdd/archive/<YYYY-MM-DD>-<work-name>/` only when:
182
143
 
183
- Use the full preflight and pre-archive checks in [assets/validation-checklist.md](assets/validation-checklist.md).
144
+ - All tasks are `DONE`.
145
+ - Validation passes or approved manual checks are recorded.
146
+ - No blocker is unresolved.
147
+ - Constraint and final-review evidence is present.
148
+ - Delta reconciliation is complete when applicable.
184
149
 
185
- ---
150
+ Generate the ISO date from the current environment; do not assume a particular shell command or VCS.
186
151
 
187
152
  ## Modes
188
153
 
189
- Use these mode guides for role-focused execution:
190
-
191
- - Planning mode: [agents/plan.md](agents/plan.md)
192
- - Implementation mode: [agents/apply.md](agents/apply.md)
193
- - Verification mode: [agents/verify.md](agents/verify.md)
194
- - Orchestrator mode: [agents/orchestrate.md](agents/orchestrate.md)
195
-
196
- ---
154
+ - Planning: [agents/plan.md](agents/plan.md)
155
+ - Implementation: [agents/apply.md](agents/apply.md)
156
+ - Verification: [agents/verify.md](agents/verify.md)
157
+ - Orchestration: [agents/orchestrate.md](agents/orchestrate.md)
197
158
 
198
159
  ## Templates
199
160
 
200
- - Feature spec: [assets/spec-template.md](assets/spec-template.md)
161
+ - Full spec: [assets/spec-template.md](assets/spec-template.md)
201
162
  - Delta spec: [assets/delta-spec-template.md](assets/delta-spec-template.md)
202
163
  - Design decisions: [assets/design-template.md](assets/design-template.md)
203
164
  - Validation checklist: [assets/validation-checklist.md](assets/validation-checklist.md)
204
-
205
- ---
206
-
207
- ## Commands
208
-
209
- ```bash
210
- mkdir -p .sdd/active/<feature-name>
211
- cp .skilly-hand/catalog/spec-driven-development/assets/spec-template.md .sdd/active/<feature-name>/spec.md
212
- cp .skilly-hand/catalog/spec-driven-development/assets/design-template.md .sdd/active/<feature-name>/design.md
213
- ```
@@ -2,25 +2,40 @@
2
2
 
3
3
  ## Purpose
4
4
 
5
- Implement approved spec tasks in small, verifiable increments.
5
+ Implement approved tasks one at a time and keep the spec synchronized with execution evidence.
6
6
 
7
- ## Inputs
7
+ ## Procedure
8
8
 
9
- - Spec name/path under `.sdd/active/`.
10
- - Task range or specific task IDs.
9
+ 1. Read `spec.md`, relevant design decisions, and repository instructions.
10
+ 2. Confirm the approval policy is satisfied and dependencies are `DONE`.
11
+ 3. Resolve required capabilities from available local skills, tools, or structured self-review.
12
+ 4. Mark the selected task `IN_PROGRESS`.
13
+ 5. Implement only the assigned outcome.
14
+ 6. When the task declares test-driven work, follow the available TDD guidance or the portable RED/GREEN/REFACTOR contract described in the task.
15
+ 7. Run the task verify step using commands discovered from the project.
16
+ 8. Record concise evidence: command or check, result, and relevant output summary.
17
+ 9. Mark the task `DONE` only after verification passes; otherwise mark it `BLOCKED` with the cause.
18
+ 10. Append a change-log entry only for requirement, scope, or design changes, not routine progress.
11
19
 
12
- ## Procedure
20
+ ## Scope Change Rule
21
+
22
+ If implementation reveals new behavior, conflicting constraints, or a wider blast radius, stop the affected task and return it to planning. Do not silently add work or rewrite acceptance criteria to match the implementation.
23
+
24
+ ## Evidence Contract
25
+
26
+ Evidence must be reproducible and proportionate:
27
+
28
+ ```text
29
+ Check: <command or manual procedure>
30
+ Result: PASS | FAIL | NOT_RUN
31
+ Summary: <short factual result>
32
+ ```
13
33
 
14
- 1. Read `spec.md` and `design.md` (if present).
15
- 2. Execute one task at a time.
16
- 3. Keep changes scoped to the assigned task.
17
- 4. Run the task verify step before marking done.
18
- 5. Record progress against task IDs.
19
- 6. Stop and report blockers when constraints conflict.
34
+ Do not claim a command ran when it did not. Record unavailable or human-only checks explicitly.
20
35
 
21
36
  ## Quality Bar
22
37
 
23
- - Behavior matches task intent.
24
- - Verify steps pass before moving on.
25
- - No hidden scope creep.
26
- - Progress summary lists completed tasks and changed files.
38
+ - Changes remain inside task scope.
39
+ - Verification evidence supports the `DONE` state.
40
+ - Optional integrations have a documented local fallback.
41
+ - No automatic commit, issue update, or remote action is assumed.
@@ -2,24 +2,33 @@
2
2
 
3
3
  ## Purpose
4
4
 
5
- Coordinate planning, implementation, and verification through explicit checkpoints.
5
+ Coordinate planning, implementation, verification, and archive while remaining usable without delegation or external integrations.
6
6
 
7
- ## Inputs
7
+ ## Workflow
8
8
 
9
- - High-level request.
10
- - Context and constraints.
9
+ 1. PLAN: create or update the active spec.
10
+ 2. APPROVE: apply the spec's checkpoint or self-review policy.
11
+ 3. APPLY: execute one ready task at a time.
12
+ 4. VERIFY TASK: require evidence before selecting the next task.
13
+ 5. REPLAN: return changed requirements or blocked dependencies to planning.
14
+ 6. VERIFY FEATURE: run validation and the portable final review gate.
15
+ 7. ARCHIVE: enforce archive invariants and move the completed work once.
11
16
 
12
- ## Workflow
17
+ ## Capability Resolution
18
+
19
+ - Delegate a phase only when a suitable capability is available and delegation improves the outcome.
20
+ - Otherwise execute the corresponding local mode directly.
21
+ - Never invent resource names, identifiers, issue keys, commands, or service availability.
22
+ - Remote writes, commits, and comments require their own discovered workflow and user authorization.
23
+
24
+ ## Checkpoint Policy
13
25
 
14
- 1. PLAN: Produce or update the spec.
15
- 2. REVIEW CHECKPOINT: Confirm the plan is approved.
16
- 3. APPLY: Execute agreed task batch.
17
- 4. VERIFY CHECKPOINT: Validate outputs against the spec and run the required final `review-rangers` gate.
18
- 5. REPEAT: Continue by phase or task batch.
19
- 6. ARCHIVE: Move completed work from `.sdd/active/` to `.sdd/archive/`.
26
+ Pause for explicit user approval when required by the spec, risk, ambiguity, or an irreversible action. For low-risk autonomous work, record a self-review decision and continue.
20
27
 
21
28
  ## Coordination Rules
22
29
 
23
- - Pause at checkpoints before continuing.
24
- - Keep phase summaries short and decision-oriented.
25
- - Surface blockers with options, not ambiguity.
30
+ - Keep `spec.md` authoritative.
31
+ - Keep at most one task `IN_PROGRESS` unless independence is explicit.
32
+ - Do not advance past failed verification.
33
+ - Surface blockers with evidence and the smallest decision needed.
34
+ - Do not declare completion while open tasks or unresolved blockers remain.
@@ -2,26 +2,28 @@
2
2
 
3
3
  ## Purpose
4
4
 
5
- Turn a request into an executable, reviewable spec before implementation starts.
6
-
7
- ## Inputs
8
-
9
- - Goal or problem statement.
10
- - Known constraints.
11
- - Current state references (files, systems, behaviors).
5
+ Turn a request into an executable spec without assuming a framework, toolchain, or external service.
12
6
 
13
7
  ## Procedure
14
8
 
15
- 1. Clarify scope and success criteria.
16
- 2. Decide full spec vs delta spec.
17
- 3. Fill spec sections: Why, What, Constraints, Current State, Tasks, Validation.
18
- 4. Break tasks into small units with explicit verify commands.
19
- 5. Add `design.md` if architecture decisions or trade-offs are non-obvious.
20
- 6. Return the spec summary for review and approval.
9
+ 1. Inspect the repository for applicable instructions, conventions, commands, and existing behavior.
10
+ 2. Clarify only missing information that cannot be discovered safely.
11
+ 3. Choose a full or delta spec.
12
+ 4. Define scope, constraints, non-goals, approval policy, and validation.
13
+ 5. Create tasks using the task contract from the parent skill.
14
+ 6. Add behavioral scenarios where outcomes can be observed.
15
+ 7. Describe required capabilities semantically and use `none` when no special capability is needed.
16
+ 8. Initialize one progress row per task with `TODO`.
17
+ 9. Self-review with the validation checklist and apply the approval policy.
21
18
 
22
19
  ## Quality Bar
23
20
 
24
- - Constraints are enforceable, not vague.
25
- - Every task has files and a verify step.
26
- - Out-of-scope items are explicit.
27
- - Another engineer can execute without guessing intent.
21
+ - Another implementer can execute without guessing intent.
22
+ - No task requires a named agent, skill, vendor, framework, or command that has not been discovered locally.
23
+ - Tasks, progress rows, and validation are mutually consistent.
24
+ - Risks and meaningful alternatives live in `design.md`; routine choices stay in the spec.
25
+ - `spec.md` is the only task source of truth.
26
+
27
+ ## Blockers
28
+
29
+ Mark planning `BLOCKED` only when a required decision or fact cannot be discovered and a reasonable assumption would create material risk. Record the missing input and its impact.
@@ -2,31 +2,52 @@
2
2
 
3
3
  ## Purpose
4
4
 
5
- Validate that implementation matches the approved spec and passes quality checks.
5
+ Validate implementation against the approved spec using reproducible evidence and a portable final review.
6
6
 
7
- ## Inputs
7
+ ## Procedure
8
8
 
9
- - Spec path.
10
- - Implementation result from apply mode.
9
+ 1. Read the complete spec and relevant design decisions.
10
+ 2. Confirm every task has a terminal state and `DONE` tasks have evidence.
11
+ 3. Run or inspect each task verify step.
12
+ 4. Run feature-level validation discovered from the repository and listed in the spec.
13
+ 5. Evaluate every `MUST` and `MUST NOT` constraint independently.
14
+ 6. Identify manual checks that still require human confirmation.
15
+ 7. Perform the final review gate below.
16
+ 8. Report blockers, warnings, and evidence separately.
11
17
 
12
- ## Procedure
18
+ ## Portable Final Review Gate
19
+
20
+ Use an installed review capability when one is available and applicable. Otherwise perform this local review:
21
+
22
+ - Correctness: implementation satisfies every scenario and constraint.
23
+ - Regression risk: affected existing behavior has appropriate checks.
24
+ - Scope: no unapproved behavior or dependency was introduced.
25
+ - Maintainability: changes follow discovered project conventions.
26
+ - Safety: security, privacy, accessibility, data, and destructive-operation risks were considered when relevant.
27
+ - Evidence: results are reproducible and unsupported claims are absent.
28
+
29
+ Any unresolved correctness, constraint, or safety blocker fails verification. Missing optional tooling does not fail verification when this fallback is completed.
30
+
31
+ ## Report Contract
32
+
33
+ ```markdown
34
+ ## Verification Report
13
35
 
14
- 1. Read acceptance intent from `spec.md` and `design.md` (if present).
15
- 2. Run task-level verification evidence checks.
16
- 3. Run feature-level validation commands.
17
- 4. Confirm constraints (`MUST`, `MUST NOT`) were respected.
18
- 5. Run a final structured `review-rangers` pass over the full change set.
19
- 6. Report pass/fail per area with concrete evidence.
36
+ ### Task Evidence
37
+ | Task | Check | Result | Evidence |
20
38
 
21
- ### Required Final Gate (`review-rangers`)
39
+ ### Constraints
40
+ | Constraint | Result | Evidence |
22
41
 
23
- - Validate selected agent targets vs actual instruction files/symlinks written.
24
- - Validate stale managed target cleanup after re-install/reselection.
25
- - Validate backup and restore safety (including uninstall restore behavior).
26
- - Any unresolved `review-rangers` blocker keeps verification in failed state.
42
+ ### Final Review
43
+ - Result: PASS | FAIL
44
+ - Blockers: none | list
45
+ - Warnings: none | list
46
+ - Manual checks: none | list
47
+ ```
27
48
 
28
49
  ## Quality Bar
29
50
 
30
- - End-to-end validation is explicit.
31
- - Gaps are tied to exact tasks or constraints.
32
- - Report separates blockers from non-blocking warnings.
51
+ - Results distinguish `PASS`, `FAIL`, and `NOT_RUN`.
52
+ - Manual validation is never reported as automated evidence.
53
+ - Archive readiness is an explicit conclusion, not an implication.