@zigrivers/scaffold 2.1.2 → 2.28.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +272 -59
- package/knowledge/core/adr-craft.md +53 -0
- package/knowledge/core/ai-memory-management.md +246 -0
- package/knowledge/core/api-design.md +4 -0
- package/knowledge/core/claude-md-patterns.md +254 -0
- package/knowledge/core/coding-conventions.md +246 -0
- package/knowledge/core/database-design.md +4 -0
- package/knowledge/core/design-system-tokens.md +465 -0
- package/knowledge/core/dev-environment.md +223 -0
- package/knowledge/core/domain-modeling.md +4 -0
- package/knowledge/core/eval-craft.md +1008 -0
- package/knowledge/core/multi-model-review-dispatch.md +250 -0
- package/knowledge/core/operations-runbook.md +37 -226
- package/knowledge/core/project-structure-patterns.md +231 -0
- package/knowledge/core/review-step-template.md +247 -0
- package/knowledge/core/{security-review.md → security-best-practices.md} +5 -1
- package/knowledge/core/task-decomposition.md +57 -34
- package/knowledge/core/task-tracking.md +225 -0
- package/knowledge/core/tech-stack-selection.md +214 -0
- package/knowledge/core/testing-strategy.md +63 -70
- package/knowledge/core/user-stories.md +69 -60
- package/knowledge/core/user-story-innovation.md +57 -0
- package/knowledge/core/ux-specification.md +5 -148
- package/knowledge/finalization/apply-fixes-and-freeze.md +165 -14
- package/knowledge/product/prd-craft.md +55 -34
- package/knowledge/review/review-adr.md +32 -0
- package/knowledge/review/{review-api-contracts.md → review-api-design.md} +34 -1
- package/knowledge/review/{review-database-schema.md → review-database-design.md} +27 -1
- package/knowledge/review/review-domain-modeling.md +33 -0
- package/knowledge/review/review-implementation-tasks.md +50 -0
- package/knowledge/review/review-operations.md +55 -0
- package/knowledge/review/review-prd.md +33 -0
- package/knowledge/review/review-security.md +53 -0
- package/knowledge/review/review-system-architecture.md +28 -0
- package/knowledge/review/review-testing-strategy.md +51 -0
- package/knowledge/review/review-user-stories.md +54 -0
- package/knowledge/review/{review-ux-spec.md → review-ux-specification.md} +37 -1
- package/methodology/custom-defaults.yml +32 -3
- package/methodology/deep.yml +32 -3
- package/methodology/mvp.yml +32 -3
- package/package.json +2 -1
- package/pipeline/architecture/review-architecture.md +18 -6
- package/pipeline/architecture/system-architecture.md +14 -2
- package/pipeline/consolidation/claude-md-optimization.md +73 -0
- package/pipeline/consolidation/workflow-audit.md +73 -0
- package/pipeline/decisions/adrs.md +14 -2
- package/pipeline/decisions/review-adrs.md +18 -5
- package/pipeline/environment/ai-memory-setup.md +70 -0
- package/pipeline/environment/automated-pr-review.md +70 -0
- package/pipeline/environment/design-system.md +73 -0
- package/pipeline/environment/dev-env-setup.md +65 -0
- package/pipeline/environment/git-workflow.md +71 -0
- package/pipeline/finalization/apply-fixes-and-freeze.md +1 -1
- package/pipeline/finalization/developer-onboarding-guide.md +1 -1
- package/pipeline/finalization/implementation-playbook.md +3 -3
- package/pipeline/foundation/beads.md +68 -0
- package/pipeline/foundation/coding-standards.md +68 -0
- package/pipeline/foundation/project-structure.md +69 -0
- package/pipeline/foundation/tdd.md +60 -0
- package/pipeline/foundation/tech-stack.md +74 -0
- package/pipeline/integration/add-e2e-testing.md +65 -0
- package/pipeline/modeling/domain-modeling.md +14 -2
- package/pipeline/modeling/review-domain-modeling.md +18 -5
- package/pipeline/parity/platform-parity-review.md +70 -0
- package/pipeline/planning/implementation-plan-review.md +56 -0
- package/pipeline/planning/{implementation-tasks.md → implementation-plan.md} +29 -9
- package/pipeline/pre/create-prd.md +13 -4
- package/pipeline/pre/innovate-prd.md +37 -8
- package/pipeline/pre/innovate-user-stories.md +38 -7
- package/pipeline/pre/review-prd.md +18 -6
- package/pipeline/pre/review-user-stories.md +23 -6
- package/pipeline/pre/user-stories.md +12 -2
- package/pipeline/quality/create-evals.md +102 -0
- package/pipeline/quality/operations.md +38 -13
- package/pipeline/quality/review-operations.md +17 -5
- package/pipeline/quality/review-security.md +17 -5
- package/pipeline/quality/review-testing.md +20 -8
- package/pipeline/quality/security.md +25 -3
- package/pipeline/quality/story-tests.md +73 -0
- package/pipeline/specification/api-contracts.md +17 -2
- package/pipeline/specification/database-schema.md +17 -2
- package/pipeline/specification/review-api.md +18 -6
- package/pipeline/specification/review-database.md +18 -6
- package/pipeline/specification/review-ux.md +19 -7
- package/pipeline/specification/ux-spec.md +29 -10
- package/pipeline/validation/critical-path-walkthrough.md +34 -7
- package/pipeline/validation/cross-phase-consistency.md +34 -7
- package/pipeline/validation/decision-completeness.md +34 -7
- package/pipeline/validation/dependency-graph-validation.md +34 -7
- package/pipeline/validation/implementability-dry-run.md +34 -7
- package/pipeline/validation/scope-creep-check.md +34 -7
- package/pipeline/validation/traceability-matrix.md +34 -7
- package/skills/multi-model-dispatch/SKILL.md +326 -0
- package/skills/scaffold-pipeline/SKILL.md +195 -0
- package/skills/scaffold-runner/SKILL.md +465 -0
- package/pipeline/planning/review-tasks.md +0 -38
- package/pipeline/quality/testing-strategy.md +0 -42
|
@@ -2,11 +2,11 @@
|
|
|
2
2
|
name: review-ux
|
|
3
3
|
description: Review UX specification for completeness and usability
|
|
4
4
|
phase: "specification"
|
|
5
|
-
order:
|
|
5
|
+
order: 860
|
|
6
6
|
dependencies: [ux-spec]
|
|
7
|
-
outputs: [docs/reviews/review-ux.md]
|
|
7
|
+
outputs: [docs/reviews/review-ux.md, docs/reviews/ux/review-summary.md, docs/reviews/ux/codex-review.json, docs/reviews/ux/gemini-review.json]
|
|
8
8
|
conditional: "if-needed"
|
|
9
|
-
knowledge-base: [review-methodology, review-ux-
|
|
9
|
+
knowledge-base: [review-methodology, review-ux-specification]
|
|
10
10
|
---
|
|
11
11
|
|
|
12
12
|
## Purpose
|
|
@@ -14,14 +14,20 @@ Review UX specification targeting UX-specific failure modes: user journey gaps,
|
|
|
14
14
|
accessibility issues, incomplete interaction states, design system inconsistencies,
|
|
15
15
|
and missing error states.
|
|
16
16
|
|
|
17
|
+
At depth 4+, dispatches to external AI models (Codex, Gemini) for
|
|
18
|
+
independent review validation.
|
|
19
|
+
|
|
17
20
|
## Inputs
|
|
18
21
|
- docs/ux-spec.md (required) — spec to review
|
|
19
|
-
- docs/
|
|
22
|
+
- docs/plan.md (required) — for journey coverage
|
|
20
23
|
- docs/api-contracts.md (optional) — for data shape alignment
|
|
21
24
|
|
|
22
25
|
## Expected Outputs
|
|
23
26
|
- docs/reviews/review-ux.md — findings and resolution log
|
|
24
27
|
- docs/ux-spec.md — updated with fixes
|
|
28
|
+
- docs/reviews/ux/review-summary.md (depth 4+) — multi-model review synthesis
|
|
29
|
+
- docs/reviews/ux/codex-review.json (depth 4+, if available) — raw Codex findings
|
|
30
|
+
- docs/reviews/ux/gemini-review.json (depth 4+, if available) — raw Gemini findings
|
|
25
31
|
|
|
26
32
|
## Quality Criteria
|
|
27
33
|
- User journey coverage verified against PRD
|
|
@@ -29,10 +35,16 @@ and missing error states.
|
|
|
29
35
|
- All interaction states covered
|
|
30
36
|
- Design system consistency verified
|
|
31
37
|
- Error states present for all failure-capable actions
|
|
38
|
+
- (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
|
|
32
39
|
|
|
33
40
|
## Methodology Scaling
|
|
34
|
-
- **deep**: Full multi-pass review.
|
|
35
|
-
|
|
41
|
+
- **deep**: Full multi-pass review. Multi-model review dispatched to Codex and
|
|
42
|
+
Gemini if available, with graceful fallback to Claude-only enhanced review.
|
|
43
|
+
**mvp**: Journey coverage only.
|
|
44
|
+
- **custom:depth(1-5)**: Depth 1-3: scale passes with depth. Depth 4: full
|
|
45
|
+
review + one external model (if CLI available). Depth 5: full review +
|
|
46
|
+
multi-model with reconciliation.
|
|
36
47
|
|
|
37
48
|
## Mode Detection
|
|
38
|
-
Re-review mode if previous review exists.
|
|
49
|
+
Re-review mode if previous review exists. If multi-model review artifacts exist
|
|
50
|
+
under docs/reviews/ux/, preserve prior findings still valid.
|
|
@@ -1,8 +1,8 @@
|
|
|
1
1
|
---
|
|
2
2
|
name: ux-spec
|
|
3
|
-
description: Specify
|
|
3
|
+
description: Specify user flows, interaction states, component architecture, accessibility, and responsive behavior
|
|
4
4
|
phase: "specification"
|
|
5
|
-
order:
|
|
5
|
+
order: 850
|
|
6
6
|
dependencies: [review-architecture]
|
|
7
7
|
outputs: [docs/ux-spec.md]
|
|
8
8
|
conditional: "if-needed"
|
|
@@ -10,25 +10,28 @@ knowledge-base: [ux-specification]
|
|
|
10
10
|
---
|
|
11
11
|
|
|
12
12
|
## Purpose
|
|
13
|
-
Define the user experience specification: user flows,
|
|
14
|
-
|
|
15
|
-
This is the
|
|
13
|
+
Define the user experience specification: user flows, interaction state machines,
|
|
14
|
+
component architecture (hierarchy and data flow), accessibility requirements, and
|
|
15
|
+
responsive behavior. This is the interaction and behavior blueprint for the frontend.
|
|
16
|
+
Visual tokens and component appearance are defined in `docs/design-system.md` — this
|
|
17
|
+
step consumes those tokens, it does not redefine them.
|
|
16
18
|
|
|
17
19
|
## Inputs
|
|
18
|
-
- docs/
|
|
20
|
+
- docs/plan.md (required) — user requirements and personas
|
|
19
21
|
- docs/system-architecture.md (required) — frontend architecture
|
|
20
22
|
- docs/api-contracts.md (optional) — data shapes for UI components
|
|
21
23
|
- docs/user-stories.md (required) — user journeys driving flow design
|
|
24
|
+
- docs/design-system.md (optional) — design tokens and component visual specs to reference
|
|
22
25
|
|
|
23
26
|
## Expected Outputs
|
|
24
27
|
- docs/ux-spec.md — UX specification with flows, components, design system
|
|
25
28
|
|
|
26
29
|
## Quality Criteria
|
|
27
|
-
- Every PRD user journey has a corresponding flow
|
|
30
|
+
- Every PRD user journey has a corresponding flow with all states documented
|
|
28
31
|
- Component hierarchy covers all UI states (loading, error, empty, populated)
|
|
29
|
-
-
|
|
32
|
+
- References design tokens from docs/design-system.md (does not redefine them)
|
|
30
33
|
- Accessibility requirements documented (WCAG level, keyboard nav, screen readers)
|
|
31
|
-
- Responsive breakpoints defined with behavior per breakpoint
|
|
34
|
+
- Responsive breakpoints defined with layout behavior per breakpoint
|
|
32
35
|
- Error states documented for every user action that can fail
|
|
33
36
|
|
|
34
37
|
## Methodology Scaling
|
|
@@ -40,4 +43,20 @@ This is the visual and interaction blueprint for the frontend.
|
|
|
40
43
|
system. Depth 4-5: full specification with accessibility.
|
|
41
44
|
|
|
42
45
|
## Mode Detection
|
|
43
|
-
|
|
46
|
+
Check for docs/ux-spec.md. If it exists, operate in update mode: read existing
|
|
47
|
+
flows and component hierarchy, diff against updated user stories and system
|
|
48
|
+
architecture. Preserve existing interaction patterns, state machines, and
|
|
49
|
+
component data flow definitions. Add new flows for new user stories or features.
|
|
50
|
+
Update component hierarchy if architecture changed frontend structure. Never
|
|
51
|
+
remove documented accessibility requirements.
|
|
52
|
+
|
|
53
|
+
## Update Mode Specifics
|
|
54
|
+
- **Detect prior artifact**: docs/ux-spec.md exists
|
|
55
|
+
- **Preserve**: existing user flows, interaction state machines, component
|
|
56
|
+
hierarchy, accessibility requirements, responsive breakpoint definitions
|
|
57
|
+
- **Triggers for update**: user stories added or changed, architecture changed
|
|
58
|
+
frontend components, design system tokens updated, API contracts changed
|
|
59
|
+
data shapes available to UI
|
|
60
|
+
- **Conflict resolution**: if a user story was rewritten, update its flow
|
|
61
|
+
in-place rather than creating a duplicate; reconcile component hierarchy
|
|
62
|
+
changes with existing state machine definitions
|
|
@@ -2,9 +2,9 @@
|
|
|
2
2
|
name: critical-path-walkthrough
|
|
3
3
|
description: Walk critical user journeys end-to-end across all specs
|
|
4
4
|
phase: "validation"
|
|
5
|
-
order:
|
|
6
|
-
dependencies: [review
|
|
7
|
-
outputs: [docs/validation/critical-path-walkthrough.md]
|
|
5
|
+
order: 1340
|
|
6
|
+
dependencies: [implementation-plan-review, review-security]
|
|
7
|
+
outputs: [docs/validation/critical-path-walkthrough.md, docs/validation/critical-path-walkthrough/review-summary.md, docs/validation/critical-path-walkthrough/codex-review.json, docs/validation/critical-path-walkthrough/gemini-review.json]
|
|
8
8
|
conditional: null
|
|
9
9
|
knowledge-base: [critical-path-analysis]
|
|
10
10
|
---
|
|
@@ -16,22 +16,49 @@ architecture components, database operations, and implementation tasks.
|
|
|
16
16
|
Use story acceptance criteria as the definition of "correct behavior" when
|
|
17
17
|
verifying completeness and consistency at every layer.
|
|
18
18
|
|
|
19
|
+
At depth 4+, dispatches to external AI models (Codex, Gemini) for
|
|
20
|
+
independent journey walkthroughs — different models catch different
|
|
21
|
+
spec gaps along the critical path.
|
|
22
|
+
|
|
19
23
|
## Inputs
|
|
20
|
-
- All phase output artifacts (docs/
|
|
24
|
+
- All phase output artifacts (docs/plan.md, docs/domain-models/, docs/adrs/,
|
|
21
25
|
docs/system-architecture.md, etc.)
|
|
22
26
|
|
|
23
27
|
## Expected Outputs
|
|
24
28
|
- docs/validation/critical-path-walkthrough.md — findings report
|
|
29
|
+
- docs/validation/critical-path-walkthrough/review-summary.md (depth 4+) — multi-model validation synthesis
|
|
30
|
+
- docs/validation/critical-path-walkthrough/codex-review.json (depth 4+, if available) — raw Codex findings
|
|
31
|
+
- docs/validation/critical-path-walkthrough/gemini-review.json (depth 4+, if available) — raw Gemini findings
|
|
25
32
|
|
|
26
33
|
## Quality Criteria
|
|
27
34
|
- Analysis is comprehensive (not superficial)
|
|
28
35
|
- Findings are actionable (specific file, section, and issue)
|
|
29
36
|
- Severity categorization (P0-P3)
|
|
37
|
+
- (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
|
|
38
|
+
|
|
39
|
+
## Finding Disposition
|
|
40
|
+
- **P0 (blocking)**: Must be resolved before proceeding to implementation. Create
|
|
41
|
+
fix tasks and re-run affected upstream steps.
|
|
42
|
+
- **P1 (critical)**: Should be resolved; proceeding requires explicit risk acceptance
|
|
43
|
+
documented in an ADR. Flag to project lead.
|
|
44
|
+
- **P2 (medium)**: Document in implementation plan as tech debt. May defer to
|
|
45
|
+
post-launch with tracking issue.
|
|
46
|
+
- **P3 (minor)**: Log for future improvement. No action required before implementation.
|
|
47
|
+
|
|
48
|
+
Findings are reported in the validation output file with severity, affected artifact,
|
|
49
|
+
and recommended resolution. P0/P1 findings block the implementation-plan step from
|
|
50
|
+
proceeding without acknowledgment.
|
|
30
51
|
|
|
31
52
|
## Methodology Scaling
|
|
32
|
-
- **deep**: Exhaustive analysis with all sub-checks.
|
|
53
|
+
- **deep**: Exhaustive analysis with all sub-checks. Multi-model validation
|
|
54
|
+
dispatched to Codex and Gemini if available, with graceful fallback to
|
|
55
|
+
Claude-only enhanced validation.
|
|
33
56
|
- **mvp**: High-level scan for blocking issues only.
|
|
34
|
-
- **custom:depth(1-5)**:
|
|
57
|
+
- **custom:depth(1-5)**: Depth 1-3: scale thoroughness with depth. Depth 4:
|
|
58
|
+
full analysis + one external model (if CLI available). Depth 5: full
|
|
59
|
+
analysis + multi-model with reconciliation.
|
|
35
60
|
|
|
36
61
|
## Mode Detection
|
|
37
|
-
Not applicable — validation always runs fresh against current artifacts.
|
|
62
|
+
Not applicable — validation always runs fresh against current artifacts. If
|
|
63
|
+
multi-model artifacts exist under docs/validation/critical-path-walkthrough/,
|
|
64
|
+
they are regenerated each run.
|
|
@@ -2,9 +2,9 @@
|
|
|
2
2
|
name: cross-phase-consistency
|
|
3
3
|
description: Audit naming, assumptions, data flows, interface contracts across all phases
|
|
4
4
|
phase: "validation"
|
|
5
|
-
order:
|
|
6
|
-
dependencies: [review
|
|
7
|
-
outputs: [docs/validation/cross-phase-consistency.md]
|
|
5
|
+
order: 1310
|
|
6
|
+
dependencies: [implementation-plan-review, review-security]
|
|
7
|
+
outputs: [docs/validation/cross-phase-consistency.md, docs/validation/cross-phase-consistency/review-summary.md, docs/validation/cross-phase-consistency/codex-review.json, docs/validation/cross-phase-consistency/gemini-review.json]
|
|
8
8
|
conditional: null
|
|
9
9
|
knowledge-base: [cross-phase-consistency]
|
|
10
10
|
---
|
|
@@ -14,22 +14,49 @@ Audit naming, assumptions, data flows, interface contracts across all phases.
|
|
|
14
14
|
Ensure consistent terminology, compatible assumptions, and aligned interfaces
|
|
15
15
|
between every pair of phase artifacts.
|
|
16
16
|
|
|
17
|
+
At depth 4+, dispatches to external AI models (Codex, Gemini) for
|
|
18
|
+
independent consistency validation — different models catch different
|
|
19
|
+
drift patterns.
|
|
20
|
+
|
|
17
21
|
## Inputs
|
|
18
|
-
- All phase output artifacts (docs/
|
|
22
|
+
- All phase output artifacts (docs/plan.md, docs/domain-models/, docs/adrs/,
|
|
19
23
|
docs/system-architecture.md, etc.)
|
|
20
24
|
|
|
21
25
|
## Expected Outputs
|
|
22
26
|
- docs/validation/cross-phase-consistency.md — findings report
|
|
27
|
+
- docs/validation/cross-phase-consistency/review-summary.md (depth 4+) — multi-model validation synthesis
|
|
28
|
+
- docs/validation/cross-phase-consistency/codex-review.json (depth 4+, if available) — raw Codex findings
|
|
29
|
+
- docs/validation/cross-phase-consistency/gemini-review.json (depth 4+, if available) — raw Gemini findings
|
|
23
30
|
|
|
24
31
|
## Quality Criteria
|
|
25
32
|
- Analysis is comprehensive (not superficial)
|
|
26
33
|
- Findings are actionable (specific file, section, and issue)
|
|
27
34
|
- Severity categorization (P0-P3)
|
|
35
|
+
- (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
|
|
36
|
+
|
|
37
|
+
## Finding Disposition
|
|
38
|
+
- **P0 (blocking)**: Must be resolved before proceeding to implementation. Create
|
|
39
|
+
fix tasks and re-run affected upstream steps.
|
|
40
|
+
- **P1 (critical)**: Should be resolved; proceeding requires explicit risk acceptance
|
|
41
|
+
documented in an ADR. Flag to project lead.
|
|
42
|
+
- **P2 (medium)**: Document in implementation plan as tech debt. May defer to
|
|
43
|
+
post-launch with tracking issue.
|
|
44
|
+
- **P3 (minor)**: Log for future improvement. No action required before implementation.
|
|
45
|
+
|
|
46
|
+
Findings are reported in the validation output file with severity, affected artifact,
|
|
47
|
+
and recommended resolution. P0/P1 findings block the implementation-plan step from
|
|
48
|
+
proceeding without acknowledgment.
|
|
28
49
|
|
|
29
50
|
## Methodology Scaling
|
|
30
|
-
- **deep**: Exhaustive analysis with all sub-checks.
|
|
51
|
+
- **deep**: Exhaustive analysis with all sub-checks. Multi-model validation
|
|
52
|
+
dispatched to Codex and Gemini if available, with graceful fallback to
|
|
53
|
+
Claude-only enhanced validation.
|
|
31
54
|
- **mvp**: High-level scan for blocking issues only.
|
|
32
|
-
- **custom:depth(1-5)**:
|
|
55
|
+
- **custom:depth(1-5)**: Depth 1-3: scale thoroughness with depth. Depth 4:
|
|
56
|
+
full analysis + one external model (if CLI available). Depth 5: full
|
|
57
|
+
analysis + multi-model with reconciliation.
|
|
33
58
|
|
|
34
59
|
## Mode Detection
|
|
35
|
-
Not applicable — validation always runs fresh against current artifacts.
|
|
60
|
+
Not applicable — validation always runs fresh against current artifacts. If
|
|
61
|
+
multi-model artifacts exist under docs/validation/cross-phase-consistency/,
|
|
62
|
+
they are regenerated each run.
|
|
@@ -2,9 +2,9 @@
|
|
|
2
2
|
name: decision-completeness
|
|
3
3
|
description: Verify all decisions are recorded, justified, non-contradictory
|
|
4
4
|
phase: "validation"
|
|
5
|
-
order:
|
|
6
|
-
dependencies: [review
|
|
7
|
-
outputs: [docs/validation/decision-completeness.md]
|
|
5
|
+
order: 1330
|
|
6
|
+
dependencies: [implementation-plan-review, review-security]
|
|
7
|
+
outputs: [docs/validation/decision-completeness.md, docs/validation/decision-completeness/review-summary.md, docs/validation/decision-completeness/codex-review.json, docs/validation/decision-completeness/gemini-review.json]
|
|
8
8
|
conditional: null
|
|
9
9
|
knowledge-base: [decision-completeness]
|
|
10
10
|
---
|
|
@@ -15,22 +15,49 @@ significant architectural and technology decision has a corresponding ADR,
|
|
|
15
15
|
that no two ADRs contradict each other, and that all decisions have clear
|
|
16
16
|
rationale.
|
|
17
17
|
|
|
18
|
+
At depth 4+, dispatches to external AI models (Codex, Gemini) for
|
|
19
|
+
independent decision audit — different models surface different implicit
|
|
20
|
+
decisions.
|
|
21
|
+
|
|
18
22
|
## Inputs
|
|
19
|
-
- All phase output artifacts (docs/
|
|
23
|
+
- All phase output artifacts (docs/plan.md, docs/domain-models/, docs/adrs/,
|
|
20
24
|
docs/system-architecture.md, etc.)
|
|
21
25
|
|
|
22
26
|
## Expected Outputs
|
|
23
27
|
- docs/validation/decision-completeness.md — findings report
|
|
28
|
+
- docs/validation/decision-completeness/review-summary.md (depth 4+) — multi-model validation synthesis
|
|
29
|
+
- docs/validation/decision-completeness/codex-review.json (depth 4+, if available) — raw Codex findings
|
|
30
|
+
- docs/validation/decision-completeness/gemini-review.json (depth 4+, if available) — raw Gemini findings
|
|
24
31
|
|
|
25
32
|
## Quality Criteria
|
|
26
33
|
- Analysis is comprehensive (not superficial)
|
|
27
34
|
- Findings are actionable (specific file, section, and issue)
|
|
28
35
|
- Severity categorization (P0-P3)
|
|
36
|
+
- (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
|
|
37
|
+
|
|
38
|
+
## Finding Disposition
|
|
39
|
+
- **P0 (blocking)**: Must be resolved before proceeding to implementation. Create
|
|
40
|
+
fix tasks and re-run affected upstream steps.
|
|
41
|
+
- **P1 (critical)**: Should be resolved; proceeding requires explicit risk acceptance
|
|
42
|
+
documented in an ADR. Flag to project lead.
|
|
43
|
+
- **P2 (medium)**: Document in implementation plan as tech debt. May defer to
|
|
44
|
+
post-launch with tracking issue.
|
|
45
|
+
- **P3 (minor)**: Log for future improvement. No action required before implementation.
|
|
46
|
+
|
|
47
|
+
Findings are reported in the validation output file with severity, affected artifact,
|
|
48
|
+
and recommended resolution. P0/P1 findings block the implementation-plan step from
|
|
49
|
+
proceeding without acknowledgment.
|
|
29
50
|
|
|
30
51
|
## Methodology Scaling
|
|
31
|
-
- **deep**: Exhaustive analysis with all sub-checks.
|
|
52
|
+
- **deep**: Exhaustive analysis with all sub-checks. Multi-model validation
|
|
53
|
+
dispatched to Codex and Gemini if available, with graceful fallback to
|
|
54
|
+
Claude-only enhanced validation.
|
|
32
55
|
- **mvp**: High-level scan for blocking issues only.
|
|
33
|
-
- **custom:depth(1-5)**:
|
|
56
|
+
- **custom:depth(1-5)**: Depth 1-3: scale thoroughness with depth. Depth 4:
|
|
57
|
+
full analysis + one external model (if CLI available). Depth 5: full
|
|
58
|
+
analysis + multi-model with reconciliation.
|
|
34
59
|
|
|
35
60
|
## Mode Detection
|
|
36
|
-
Not applicable — validation always runs fresh against current artifacts.
|
|
61
|
+
Not applicable — validation always runs fresh against current artifacts. If
|
|
62
|
+
multi-model artifacts exist under docs/validation/decision-completeness/,
|
|
63
|
+
they are regenerated each run.
|
|
@@ -2,9 +2,9 @@
|
|
|
2
2
|
name: dependency-graph-validation
|
|
3
3
|
description: Verify task dependency graphs are acyclic, complete, correctly ordered
|
|
4
4
|
phase: "validation"
|
|
5
|
-
order:
|
|
6
|
-
dependencies: [review
|
|
7
|
-
outputs: [docs/validation/dependency-graph-validation.md]
|
|
5
|
+
order: 1360
|
|
6
|
+
dependencies: [implementation-plan-review, review-security]
|
|
7
|
+
outputs: [docs/validation/dependency-graph-validation.md, docs/validation/dependency-graph-validation/review-summary.md, docs/validation/dependency-graph-validation/codex-review.json, docs/validation/dependency-graph-validation/gemini-review.json]
|
|
8
8
|
conditional: null
|
|
9
9
|
knowledge-base: [dependency-validation]
|
|
10
10
|
---
|
|
@@ -15,22 +15,49 @@ Validate that the implementation task dependency graph forms a valid DAG,
|
|
|
15
15
|
that all dependencies are satisfied before dependent tasks, and that no
|
|
16
16
|
critical tasks are missing from the graph.
|
|
17
17
|
|
|
18
|
+
At depth 4+, dispatches to external AI models (Codex, Gemini) for
|
|
19
|
+
independent graph validation — different models catch different ordering
|
|
20
|
+
and completeness issues.
|
|
21
|
+
|
|
18
22
|
## Inputs
|
|
19
|
-
- All phase output artifacts (docs/
|
|
23
|
+
- All phase output artifacts (docs/plan.md, docs/domain-models/, docs/adrs/,
|
|
20
24
|
docs/system-architecture.md, etc.)
|
|
21
25
|
|
|
22
26
|
## Expected Outputs
|
|
23
27
|
- docs/validation/dependency-graph-validation.md — findings report
|
|
28
|
+
- docs/validation/dependency-graph-validation/review-summary.md (depth 4+) — multi-model validation synthesis
|
|
29
|
+
- docs/validation/dependency-graph-validation/codex-review.json (depth 4+, if available) — raw Codex findings
|
|
30
|
+
- docs/validation/dependency-graph-validation/gemini-review.json (depth 4+, if available) — raw Gemini findings
|
|
24
31
|
|
|
25
32
|
## Quality Criteria
|
|
26
33
|
- Analysis is comprehensive (not superficial)
|
|
27
34
|
- Findings are actionable (specific file, section, and issue)
|
|
28
35
|
- Severity categorization (P0-P3)
|
|
36
|
+
- (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
|
|
37
|
+
|
|
38
|
+
## Finding Disposition
|
|
39
|
+
- **P0 (blocking)**: Must be resolved before proceeding to implementation. Create
|
|
40
|
+
fix tasks and re-run affected upstream steps.
|
|
41
|
+
- **P1 (critical)**: Should be resolved; proceeding requires explicit risk acceptance
|
|
42
|
+
documented in an ADR. Flag to project lead.
|
|
43
|
+
- **P2 (medium)**: Document in implementation plan as tech debt. May defer to
|
|
44
|
+
post-launch with tracking issue.
|
|
45
|
+
- **P3 (minor)**: Log for future improvement. No action required before implementation.
|
|
46
|
+
|
|
47
|
+
Findings are reported in the validation output file with severity, affected artifact,
|
|
48
|
+
and recommended resolution. P0/P1 findings block the implementation-plan step from
|
|
49
|
+
proceeding without acknowledgment.
|
|
29
50
|
|
|
30
51
|
## Methodology Scaling
|
|
31
|
-
- **deep**: Exhaustive analysis with all sub-checks.
|
|
52
|
+
- **deep**: Exhaustive analysis with all sub-checks. Multi-model validation
|
|
53
|
+
dispatched to Codex and Gemini if available, with graceful fallback to
|
|
54
|
+
Claude-only enhanced validation.
|
|
32
55
|
- **mvp**: High-level scan for blocking issues only.
|
|
33
|
-
- **custom:depth(1-5)**:
|
|
56
|
+
- **custom:depth(1-5)**: Depth 1-3: scale thoroughness with depth. Depth 4:
|
|
57
|
+
full analysis + one external model (if CLI available). Depth 5: full
|
|
58
|
+
analysis + multi-model with reconciliation.
|
|
34
59
|
|
|
35
60
|
## Mode Detection
|
|
36
|
-
Not applicable — validation always runs fresh against current artifacts.
|
|
61
|
+
Not applicable — validation always runs fresh against current artifacts. If
|
|
62
|
+
multi-model artifacts exist under docs/validation/dependency-graph-validation/,
|
|
63
|
+
they are regenerated each run.
|
|
@@ -2,9 +2,9 @@
|
|
|
2
2
|
name: implementability-dry-run
|
|
3
3
|
description: Dry-run specs as implementing agent, catching ambiguity
|
|
4
4
|
phase: "validation"
|
|
5
|
-
order:
|
|
6
|
-
dependencies: [review
|
|
7
|
-
outputs: [docs/validation/implementability-dry-run.md]
|
|
5
|
+
order: 1350
|
|
6
|
+
dependencies: [implementation-plan-review, review-security]
|
|
7
|
+
outputs: [docs/validation/implementability-dry-run.md, docs/validation/implementability-dry-run/review-summary.md, docs/validation/implementability-dry-run/codex-review.json, docs/validation/implementability-dry-run/gemini-review.json]
|
|
8
8
|
conditional: null
|
|
9
9
|
knowledge-base: [implementability-review]
|
|
10
10
|
---
|
|
@@ -15,22 +15,49 @@ AI agent would experience when picking up each implementation task: are the
|
|
|
15
15
|
inputs clear, are the acceptance criteria testable, are there ambiguities
|
|
16
16
|
that would force the agent to guess?
|
|
17
17
|
|
|
18
|
+
At depth 4+, dispatches to external AI models (Codex, Gemini) for
|
|
19
|
+
independent dry-runs — different models encounter different ambiguities
|
|
20
|
+
when simulating implementation.
|
|
21
|
+
|
|
18
22
|
## Inputs
|
|
19
|
-
- All phase output artifacts (docs/
|
|
23
|
+
- All phase output artifacts (docs/plan.md, docs/domain-models/, docs/adrs/,
|
|
20
24
|
docs/system-architecture.md, etc.)
|
|
21
25
|
|
|
22
26
|
## Expected Outputs
|
|
23
27
|
- docs/validation/implementability-dry-run.md — findings report
|
|
28
|
+
- docs/validation/implementability-dry-run/review-summary.md (depth 4+) — multi-model validation synthesis
|
|
29
|
+
- docs/validation/implementability-dry-run/codex-review.json (depth 4+, if available) — raw Codex findings
|
|
30
|
+
- docs/validation/implementability-dry-run/gemini-review.json (depth 4+, if available) — raw Gemini findings
|
|
24
31
|
|
|
25
32
|
## Quality Criteria
|
|
26
33
|
- Analysis is comprehensive (not superficial)
|
|
27
34
|
- Findings are actionable (specific file, section, and issue)
|
|
28
35
|
- Severity categorization (P0-P3)
|
|
36
|
+
- (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
|
|
37
|
+
|
|
38
|
+
## Finding Disposition
|
|
39
|
+
- **P0 (blocking)**: Must be resolved before proceeding to implementation. Create
|
|
40
|
+
fix tasks and re-run affected upstream steps.
|
|
41
|
+
- **P1 (critical)**: Should be resolved; proceeding requires explicit risk acceptance
|
|
42
|
+
documented in an ADR. Flag to project lead.
|
|
43
|
+
- **P2 (medium)**: Document in implementation plan as tech debt. May defer to
|
|
44
|
+
post-launch with tracking issue.
|
|
45
|
+
- **P3 (minor)**: Log for future improvement. No action required before implementation.
|
|
46
|
+
|
|
47
|
+
Findings are reported in the validation output file with severity, affected artifact,
|
|
48
|
+
and recommended resolution. P0/P1 findings block the implementation-plan step from
|
|
49
|
+
proceeding without acknowledgment.
|
|
29
50
|
|
|
30
51
|
## Methodology Scaling
|
|
31
|
-
- **deep**: Exhaustive analysis with all sub-checks.
|
|
52
|
+
- **deep**: Exhaustive analysis with all sub-checks. Multi-model validation
|
|
53
|
+
dispatched to Codex and Gemini if available, with graceful fallback to
|
|
54
|
+
Claude-only enhanced validation.
|
|
32
55
|
- **mvp**: High-level scan for blocking issues only.
|
|
33
|
-
- **custom:depth(1-5)**:
|
|
56
|
+
- **custom:depth(1-5)**: Depth 1-3: scale thoroughness with depth. Depth 4:
|
|
57
|
+
full analysis + one external model (if CLI available). Depth 5: full
|
|
58
|
+
analysis + multi-model with reconciliation.
|
|
34
59
|
|
|
35
60
|
## Mode Detection
|
|
36
|
-
Not applicable — validation always runs fresh against current artifacts.
|
|
61
|
+
Not applicable — validation always runs fresh against current artifacts. If
|
|
62
|
+
multi-model artifacts exist under docs/validation/implementability-dry-run/,
|
|
63
|
+
they are regenerated each run.
|
|
@@ -2,9 +2,9 @@
|
|
|
2
2
|
name: scope-creep-check
|
|
3
3
|
description: Verify specs stay aligned to PRD boundaries
|
|
4
4
|
phase: "validation"
|
|
5
|
-
order:
|
|
6
|
-
dependencies: [review
|
|
7
|
-
outputs: [docs/validation/scope-creep-check.md]
|
|
5
|
+
order: 1370
|
|
6
|
+
dependencies: [implementation-plan-review, review-security]
|
|
7
|
+
outputs: [docs/validation/scope-creep-check.md, docs/validation/scope-creep-check/review-summary.md, docs/validation/scope-creep-check/codex-review.json, docs/validation/scope-creep-check/gemini-review.json]
|
|
8
8
|
conditional: null
|
|
9
9
|
knowledge-base: [scope-management]
|
|
10
10
|
---
|
|
@@ -17,22 +17,49 @@ should not introduce features not in the PRD — UX-level enhancements are
|
|
|
17
17
|
allowed only via the innovation step with explicit user approval. Flag any
|
|
18
18
|
scope expansion for explicit approval.
|
|
19
19
|
|
|
20
|
+
At depth 4+, dispatches to external AI models (Codex, Gemini) for
|
|
21
|
+
independent scope analysis — different models interpret PRD boundaries
|
|
22
|
+
differently, surfacing subtle creep.
|
|
23
|
+
|
|
20
24
|
## Inputs
|
|
21
|
-
- All phase output artifacts (docs/
|
|
25
|
+
- All phase output artifacts (docs/plan.md, docs/domain-models/, docs/adrs/,
|
|
22
26
|
docs/system-architecture.md, etc.)
|
|
23
27
|
|
|
24
28
|
## Expected Outputs
|
|
25
29
|
- docs/validation/scope-creep-check.md — findings report
|
|
30
|
+
- docs/validation/scope-creep-check/review-summary.md (depth 4+) — multi-model validation synthesis
|
|
31
|
+
- docs/validation/scope-creep-check/codex-review.json (depth 4+, if available) — raw Codex findings
|
|
32
|
+
- docs/validation/scope-creep-check/gemini-review.json (depth 4+, if available) — raw Gemini findings
|
|
26
33
|
|
|
27
34
|
## Quality Criteria
|
|
28
35
|
- Analysis is comprehensive (not superficial)
|
|
29
36
|
- Findings are actionable (specific file, section, and issue)
|
|
30
37
|
- Severity categorization (P0-P3)
|
|
38
|
+
- (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
|
|
39
|
+
|
|
40
|
+
## Finding Disposition
|
|
41
|
+
- **P0 (blocking)**: Must be resolved before proceeding to implementation. Create
|
|
42
|
+
fix tasks and re-run affected upstream steps.
|
|
43
|
+
- **P1 (critical)**: Should be resolved; proceeding requires explicit risk acceptance
|
|
44
|
+
documented in an ADR. Flag to project lead.
|
|
45
|
+
- **P2 (medium)**: Document in implementation plan as tech debt. May defer to
|
|
46
|
+
post-launch with tracking issue.
|
|
47
|
+
- **P3 (minor)**: Log for future improvement. No action required before implementation.
|
|
48
|
+
|
|
49
|
+
Findings are reported in the validation output file with severity, affected artifact,
|
|
50
|
+
and recommended resolution. P0/P1 findings block the implementation-plan step from
|
|
51
|
+
proceeding without acknowledgment.
|
|
31
52
|
|
|
32
53
|
## Methodology Scaling
|
|
33
|
-
- **deep**: Exhaustive analysis with all sub-checks.
|
|
54
|
+
- **deep**: Exhaustive analysis with all sub-checks. Multi-model validation
|
|
55
|
+
dispatched to Codex and Gemini if available, with graceful fallback to
|
|
56
|
+
Claude-only enhanced validation.
|
|
34
57
|
- **mvp**: High-level scan for blocking issues only.
|
|
35
|
-
- **custom:depth(1-5)**:
|
|
58
|
+
- **custom:depth(1-5)**: Depth 1-3: scale thoroughness with depth. Depth 4:
|
|
59
|
+
full analysis + one external model (if CLI available). Depth 5: full
|
|
60
|
+
analysis + multi-model with reconciliation.
|
|
36
61
|
|
|
37
62
|
## Mode Detection
|
|
38
|
-
Not applicable — validation always runs fresh against current artifacts.
|
|
63
|
+
Not applicable — validation always runs fresh against current artifacts. If
|
|
64
|
+
multi-model artifacts exist under docs/validation/scope-creep-check/,
|
|
65
|
+
they are regenerated each run.
|
|
@@ -2,9 +2,9 @@
|
|
|
2
2
|
name: traceability-matrix
|
|
3
3
|
description: Build traceability from PRD requirements through architecture to implementation tasks
|
|
4
4
|
phase: "validation"
|
|
5
|
-
order:
|
|
6
|
-
dependencies: [review
|
|
7
|
-
outputs: [docs/validation/traceability-matrix.md]
|
|
5
|
+
order: 1320
|
|
6
|
+
dependencies: [implementation-plan-review, review-security]
|
|
7
|
+
outputs: [docs/validation/traceability-matrix.md, docs/validation/traceability-matrix/review-summary.md, docs/validation/traceability-matrix/codex-review.json, docs/validation/traceability-matrix/gemini-review.json]
|
|
8
8
|
conditional: null
|
|
9
9
|
knowledge-base: [traceability]
|
|
10
10
|
---
|
|
@@ -15,22 +15,49 @@ to implementation tasks. Verify the full chain: PRD → User Stories → Domain
|
|
|
15
15
|
Model → Architecture → Tasks, with no orphans in either direction. Every PRD
|
|
16
16
|
requirement must trace to at least one story, every story to at least one task.
|
|
17
17
|
|
|
18
|
+
At depth 4+, dispatches to external AI models (Codex, Gemini) for
|
|
19
|
+
independent traceability validation — different models catch different
|
|
20
|
+
coverage gaps.
|
|
21
|
+
|
|
18
22
|
## Inputs
|
|
19
|
-
- All phase output artifacts (docs/
|
|
23
|
+
- All phase output artifacts (docs/plan.md, docs/domain-models/, docs/adrs/,
|
|
20
24
|
docs/system-architecture.md, etc.)
|
|
21
25
|
|
|
22
26
|
## Expected Outputs
|
|
23
27
|
- docs/validation/traceability-matrix.md — findings report
|
|
28
|
+
- docs/validation/traceability-matrix/review-summary.md (depth 4+) — multi-model validation synthesis
|
|
29
|
+
- docs/validation/traceability-matrix/codex-review.json (depth 4+, if available) — raw Codex findings
|
|
30
|
+
- docs/validation/traceability-matrix/gemini-review.json (depth 4+, if available) — raw Gemini findings
|
|
24
31
|
|
|
25
32
|
## Quality Criteria
|
|
26
33
|
- Analysis is comprehensive (not superficial)
|
|
27
34
|
- Findings are actionable (specific file, section, and issue)
|
|
28
35
|
- Severity categorization (P0-P3)
|
|
36
|
+
- (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
|
|
37
|
+
|
|
38
|
+
## Finding Disposition
|
|
39
|
+
- **P0 (blocking)**: Must be resolved before proceeding to implementation. Create
|
|
40
|
+
fix tasks and re-run affected upstream steps.
|
|
41
|
+
- **P1 (critical)**: Should be resolved; proceeding requires explicit risk acceptance
|
|
42
|
+
documented in an ADR. Flag to project lead.
|
|
43
|
+
- **P2 (medium)**: Document in implementation plan as tech debt. May defer to
|
|
44
|
+
post-launch with tracking issue.
|
|
45
|
+
- **P3 (minor)**: Log for future improvement. No action required before implementation.
|
|
46
|
+
|
|
47
|
+
Findings are reported in the validation output file with severity, affected artifact,
|
|
48
|
+
and recommended resolution. P0/P1 findings block the implementation-plan step from
|
|
49
|
+
proceeding without acknowledgment.
|
|
29
50
|
|
|
30
51
|
## Methodology Scaling
|
|
31
|
-
- **deep**: Exhaustive analysis with all sub-checks.
|
|
52
|
+
- **deep**: Exhaustive analysis with all sub-checks. Multi-model validation
|
|
53
|
+
dispatched to Codex and Gemini if available, with graceful fallback to
|
|
54
|
+
Claude-only enhanced validation.
|
|
32
55
|
- **mvp**: High-level scan for blocking issues only.
|
|
33
|
-
- **custom:depth(1-5)**:
|
|
56
|
+
- **custom:depth(1-5)**: Depth 1-3: scale thoroughness with depth. Depth 4:
|
|
57
|
+
full analysis + one external model (if CLI available). Depth 5: full
|
|
58
|
+
analysis + multi-model with reconciliation.
|
|
34
59
|
|
|
35
60
|
## Mode Detection
|
|
36
|
-
Not applicable — validation always runs fresh against current artifacts.
|
|
61
|
+
Not applicable — validation always runs fresh against current artifacts. If
|
|
62
|
+
multi-model artifacts exist under docs/validation/traceability-matrix/,
|
|
63
|
+
they are regenerated each run.
|