@zigrivers/scaffold 2.1.1 → 2.28.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (100) hide show
  1. package/README.md +272 -59
  2. package/dist/project/frontmatter.d.ts.map +1 -1
  3. package/dist/project/frontmatter.js +4 -0
  4. package/dist/project/frontmatter.js.map +1 -1
  5. package/knowledge/core/adr-craft.md +53 -0
  6. package/knowledge/core/ai-memory-management.md +246 -0
  7. package/knowledge/core/api-design.md +4 -0
  8. package/knowledge/core/claude-md-patterns.md +254 -0
  9. package/knowledge/core/coding-conventions.md +246 -0
  10. package/knowledge/core/database-design.md +4 -0
  11. package/knowledge/core/design-system-tokens.md +465 -0
  12. package/knowledge/core/dev-environment.md +223 -0
  13. package/knowledge/core/domain-modeling.md +4 -0
  14. package/knowledge/core/eval-craft.md +1008 -0
  15. package/knowledge/core/multi-model-review-dispatch.md +250 -0
  16. package/knowledge/core/operations-runbook.md +37 -226
  17. package/knowledge/core/project-structure-patterns.md +231 -0
  18. package/knowledge/core/review-step-template.md +247 -0
  19. package/knowledge/core/{security-review.md → security-best-practices.md} +5 -1
  20. package/knowledge/core/task-decomposition.md +57 -34
  21. package/knowledge/core/task-tracking.md +225 -0
  22. package/knowledge/core/tech-stack-selection.md +214 -0
  23. package/knowledge/core/testing-strategy.md +63 -70
  24. package/knowledge/core/user-stories.md +69 -60
  25. package/knowledge/core/user-story-innovation.md +57 -0
  26. package/knowledge/core/ux-specification.md +5 -148
  27. package/knowledge/finalization/apply-fixes-and-freeze.md +165 -14
  28. package/knowledge/product/prd-craft.md +55 -34
  29. package/knowledge/review/review-adr.md +32 -0
  30. package/knowledge/review/{review-api-contracts.md → review-api-design.md} +34 -1
  31. package/knowledge/review/{review-database-schema.md → review-database-design.md} +27 -1
  32. package/knowledge/review/review-domain-modeling.md +33 -0
  33. package/knowledge/review/review-implementation-tasks.md +50 -0
  34. package/knowledge/review/review-operations.md +55 -0
  35. package/knowledge/review/review-prd.md +33 -0
  36. package/knowledge/review/review-security.md +53 -0
  37. package/knowledge/review/review-system-architecture.md +28 -0
  38. package/knowledge/review/review-testing-strategy.md +51 -0
  39. package/knowledge/review/review-user-stories.md +54 -0
  40. package/knowledge/review/{review-ux-spec.md → review-ux-specification.md} +37 -1
  41. package/methodology/custom-defaults.yml +32 -3
  42. package/methodology/deep.yml +32 -3
  43. package/methodology/mvp.yml +32 -3
  44. package/package.json +2 -1
  45. package/pipeline/architecture/review-architecture.md +18 -6
  46. package/pipeline/architecture/system-architecture.md +14 -2
  47. package/pipeline/consolidation/claude-md-optimization.md +73 -0
  48. package/pipeline/consolidation/workflow-audit.md +73 -0
  49. package/pipeline/decisions/adrs.md +14 -2
  50. package/pipeline/decisions/review-adrs.md +18 -5
  51. package/pipeline/environment/ai-memory-setup.md +70 -0
  52. package/pipeline/environment/automated-pr-review.md +70 -0
  53. package/pipeline/environment/design-system.md +73 -0
  54. package/pipeline/environment/dev-env-setup.md +65 -0
  55. package/pipeline/environment/git-workflow.md +71 -0
  56. package/pipeline/finalization/apply-fixes-and-freeze.md +1 -1
  57. package/pipeline/finalization/developer-onboarding-guide.md +1 -1
  58. package/pipeline/finalization/implementation-playbook.md +3 -3
  59. package/pipeline/foundation/beads.md +68 -0
  60. package/pipeline/foundation/coding-standards.md +68 -0
  61. package/pipeline/foundation/project-structure.md +69 -0
  62. package/pipeline/foundation/tdd.md +60 -0
  63. package/pipeline/foundation/tech-stack.md +74 -0
  64. package/pipeline/integration/add-e2e-testing.md +65 -0
  65. package/pipeline/modeling/domain-modeling.md +14 -2
  66. package/pipeline/modeling/review-domain-modeling.md +18 -5
  67. package/pipeline/parity/platform-parity-review.md +70 -0
  68. package/pipeline/planning/implementation-plan-review.md +56 -0
  69. package/pipeline/planning/{implementation-tasks.md → implementation-plan.md} +29 -9
  70. package/pipeline/pre/create-prd.md +13 -4
  71. package/pipeline/pre/innovate-prd.md +37 -8
  72. package/pipeline/pre/innovate-user-stories.md +38 -7
  73. package/pipeline/pre/review-prd.md +18 -6
  74. package/pipeline/pre/review-user-stories.md +23 -6
  75. package/pipeline/pre/user-stories.md +12 -2
  76. package/pipeline/quality/create-evals.md +102 -0
  77. package/pipeline/quality/operations.md +38 -13
  78. package/pipeline/quality/review-operations.md +17 -5
  79. package/pipeline/quality/review-security.md +17 -5
  80. package/pipeline/quality/review-testing.md +20 -8
  81. package/pipeline/quality/security.md +25 -3
  82. package/pipeline/quality/story-tests.md +73 -0
  83. package/pipeline/specification/api-contracts.md +17 -2
  84. package/pipeline/specification/database-schema.md +17 -2
  85. package/pipeline/specification/review-api.md +18 -6
  86. package/pipeline/specification/review-database.md +18 -6
  87. package/pipeline/specification/review-ux.md +19 -7
  88. package/pipeline/specification/ux-spec.md +29 -10
  89. package/pipeline/validation/critical-path-walkthrough.md +34 -7
  90. package/pipeline/validation/cross-phase-consistency.md +34 -7
  91. package/pipeline/validation/decision-completeness.md +34 -7
  92. package/pipeline/validation/dependency-graph-validation.md +34 -7
  93. package/pipeline/validation/implementability-dry-run.md +34 -7
  94. package/pipeline/validation/scope-creep-check.md +34 -7
  95. package/pipeline/validation/traceability-matrix.md +34 -7
  96. package/skills/multi-model-dispatch/SKILL.md +326 -0
  97. package/skills/scaffold-pipeline/SKILL.md +195 -0
  98. package/skills/scaffold-runner/SKILL.md +465 -0
  99. package/pipeline/planning/review-tasks.md +0 -38
  100. package/pipeline/quality/testing-strategy.md +0 -42
@@ -2,11 +2,11 @@
2
2
  name: review-ux
3
3
  description: Review UX specification for completeness and usability
4
4
  phase: "specification"
5
- order: 18
5
+ order: 860
6
6
  dependencies: [ux-spec]
7
- outputs: [docs/reviews/review-ux.md]
7
+ outputs: [docs/reviews/review-ux.md, docs/reviews/ux/review-summary.md, docs/reviews/ux/codex-review.json, docs/reviews/ux/gemini-review.json]
8
8
  conditional: "if-needed"
9
- knowledge-base: [review-methodology, review-ux-spec]
9
+ knowledge-base: [review-methodology, review-ux-specification]
10
10
  ---
11
11
 
12
12
  ## Purpose
@@ -14,14 +14,20 @@ Review UX specification targeting UX-specific failure modes: user journey gaps,
14
14
  accessibility issues, incomplete interaction states, design system inconsistencies,
15
15
  and missing error states.
16
16
 
17
+ At depth 4+, dispatches to external AI models (Codex, Gemini) for
18
+ independent review validation.
19
+
17
20
  ## Inputs
18
21
  - docs/ux-spec.md (required) — spec to review
19
- - docs/prd.md (required) — for journey coverage
22
+ - docs/plan.md (required) — for journey coverage
20
23
  - docs/api-contracts.md (optional) — for data shape alignment
21
24
 
22
25
  ## Expected Outputs
23
26
  - docs/reviews/review-ux.md — findings and resolution log
24
27
  - docs/ux-spec.md — updated with fixes
28
+ - docs/reviews/ux/review-summary.md (depth 4+) — multi-model review synthesis
29
+ - docs/reviews/ux/codex-review.json (depth 4+, if available) — raw Codex findings
30
+ - docs/reviews/ux/gemini-review.json (depth 4+, if available) — raw Gemini findings
25
31
 
26
32
  ## Quality Criteria
27
33
  - User journey coverage verified against PRD
@@ -29,10 +35,16 @@ and missing error states.
29
35
  - All interaction states covered
30
36
  - Design system consistency verified
31
37
  - Error states present for all failure-capable actions
38
+ - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
32
39
 
33
40
  ## Methodology Scaling
34
- - **deep**: Full multi-pass review. **mvp**: Journey coverage only.
35
- - **custom:depth(1-5)**: Scale passes with depth.
41
+ - **deep**: Full multi-pass review. Multi-model review dispatched to Codex and
42
+ Gemini if available, with graceful fallback to Claude-only enhanced review.
43
+ **mvp**: Journey coverage only.
44
+ - **custom:depth(1-5)**: Depth 1-3: scale passes with depth. Depth 4: full
45
+ review + one external model (if CLI available). Depth 5: full review +
46
+ multi-model with reconciliation.
36
47
 
37
48
  ## Mode Detection
38
- Re-review mode if previous review exists.
49
+ Re-review mode if previous review exists. If multi-model review artifacts exist
50
+ under docs/reviews/ux/, preserve prior findings still valid.
@@ -1,8 +1,8 @@
1
1
  ---
2
2
  name: ux-spec
3
- description: Specify UI/UX design including design system
3
+ description: Specify user flows, interaction states, component architecture, accessibility, and responsive behavior
4
4
  phase: "specification"
5
- order: 17
5
+ order: 850
6
6
  dependencies: [review-architecture]
7
7
  outputs: [docs/ux-spec.md]
8
8
  conditional: "if-needed"
@@ -10,25 +10,28 @@ knowledge-base: [ux-specification]
10
10
  ---
11
11
 
12
12
  ## Purpose
13
- Define the user experience specification: user flows, wireframes, component
14
- hierarchy, interaction patterns, and design system (tokens, components, patterns).
15
- This is the visual and interaction blueprint for the frontend.
13
+ Define the user experience specification: user flows, interaction state machines,
14
+ component architecture (hierarchy and data flow), accessibility requirements, and
15
+ responsive behavior. This is the interaction and behavior blueprint for the frontend.
16
+ Visual tokens and component appearance are defined in `docs/design-system.md` — this
17
+ step consumes those tokens, it does not redefine them.
16
18
 
17
19
  ## Inputs
18
- - docs/prd.md (required) — user requirements and personas
20
+ - docs/plan.md (required) — user requirements and personas
19
21
  - docs/system-architecture.md (required) — frontend architecture
20
22
  - docs/api-contracts.md (optional) — data shapes for UI components
21
23
  - docs/user-stories.md (required) — user journeys driving flow design
24
+ - docs/design-system.md (optional) — design tokens and component visual specs to reference
22
25
 
23
26
  ## Expected Outputs
24
27
  - docs/ux-spec.md — UX specification with flows, components, design system
25
28
 
26
29
  ## Quality Criteria
27
- - Every PRD user journey has a corresponding flow
30
+ - Every PRD user journey has a corresponding flow with all states documented
28
31
  - Component hierarchy covers all UI states (loading, error, empty, populated)
29
- - Design system defines tokens (colors, spacing, typography) and base components
32
+ - References design tokens from docs/design-system.md (does not redefine them)
30
33
  - Accessibility requirements documented (WCAG level, keyboard nav, screen readers)
31
- - Responsive breakpoints defined with behavior per breakpoint
34
+ - Responsive breakpoints defined with layout behavior per breakpoint
32
35
  - Error states documented for every user action that can fail
33
36
 
34
37
  ## Methodology Scaling
@@ -40,4 +43,20 @@ This is the visual and interaction blueprint for the frontend.
40
43
  system. Depth 4-5: full specification with accessibility.
41
44
 
42
45
  ## Mode Detection
43
- Update mode if spec exists.
46
+ Check for docs/ux-spec.md. If it exists, operate in update mode: read existing
47
+ flows and component hierarchy, diff against updated user stories and system
48
+ architecture. Preserve existing interaction patterns, state machines, and
49
+ component data flow definitions. Add new flows for new user stories or features.
50
+ Update component hierarchy if architecture changed frontend structure. Never
51
+ remove documented accessibility requirements.
52
+
53
+ ## Update Mode Specifics
54
+ - **Detect prior artifact**: docs/ux-spec.md exists
55
+ - **Preserve**: existing user flows, interaction state machines, component
56
+ hierarchy, accessibility requirements, responsive breakpoint definitions
57
+ - **Triggers for update**: user stories added or changed, architecture changed
58
+ frontend components, design system tokens updated, API contracts changed
59
+ data shapes available to UI
60
+ - **Conflict resolution**: if a user story was rewritten, update its flow
61
+ in-place rather than creating a duplicate; reconcile component hierarchy
62
+ changes with existing state machine definitions
@@ -2,9 +2,9 @@
2
2
  name: critical-path-walkthrough
3
3
  description: Walk critical user journeys end-to-end across all specs
4
4
  phase: "validation"
5
- order: 30
6
- dependencies: [review-tasks, review-security]
7
- outputs: [docs/validation/critical-path-walkthrough.md]
5
+ order: 1340
6
+ dependencies: [implementation-plan-review, review-security]
7
+ outputs: [docs/validation/critical-path-walkthrough.md, docs/validation/critical-path-walkthrough/review-summary.md, docs/validation/critical-path-walkthrough/codex-review.json, docs/validation/critical-path-walkthrough/gemini-review.json]
8
8
  conditional: null
9
9
  knowledge-base: [critical-path-analysis]
10
10
  ---
@@ -16,22 +16,49 @@ architecture components, database operations, and implementation tasks.
16
16
  Use story acceptance criteria as the definition of "correct behavior" when
17
17
  verifying completeness and consistency at every layer.
18
18
 
19
+ At depth 4+, dispatches to external AI models (Codex, Gemini) for
20
+ independent journey walkthroughs — different models catch different
21
+ spec gaps along the critical path.
22
+
19
23
  ## Inputs
20
- - All phase output artifacts (docs/prd.md, docs/domain-models/, docs/adrs/,
24
+ - All phase output artifacts (docs/plan.md, docs/domain-models/, docs/adrs/,
21
25
  docs/system-architecture.md, etc.)
22
26
 
23
27
  ## Expected Outputs
24
28
  - docs/validation/critical-path-walkthrough.md — findings report
29
+ - docs/validation/critical-path-walkthrough/review-summary.md (depth 4+) — multi-model validation synthesis
30
+ - docs/validation/critical-path-walkthrough/codex-review.json (depth 4+, if available) — raw Codex findings
31
+ - docs/validation/critical-path-walkthrough/gemini-review.json (depth 4+, if available) — raw Gemini findings
25
32
 
26
33
  ## Quality Criteria
27
34
  - Analysis is comprehensive (not superficial)
28
35
  - Findings are actionable (specific file, section, and issue)
29
36
  - Severity categorization (P0-P3)
37
+ - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
38
+
39
+ ## Finding Disposition
40
+ - **P0 (blocking)**: Must be resolved before proceeding to implementation. Create
41
+ fix tasks and re-run affected upstream steps.
42
+ - **P1 (critical)**: Should be resolved; proceeding requires explicit risk acceptance
43
+ documented in an ADR. Flag to project lead.
44
+ - **P2 (medium)**: Document in implementation plan as tech debt. May defer to
45
+ post-launch with tracking issue.
46
+ - **P3 (minor)**: Log for future improvement. No action required before implementation.
47
+
48
+ Findings are reported in the validation output file with severity, affected artifact,
49
+ and recommended resolution. P0/P1 findings block the implementation-plan step from
50
+ proceeding without acknowledgment.
30
51
 
31
52
  ## Methodology Scaling
32
- - **deep**: Exhaustive analysis with all sub-checks.
53
+ - **deep**: Exhaustive analysis with all sub-checks. Multi-model validation
54
+ dispatched to Codex and Gemini if available, with graceful fallback to
55
+ Claude-only enhanced validation.
33
56
  - **mvp**: High-level scan for blocking issues only.
34
- - **custom:depth(1-5)**: Scale thoroughness with depth.
57
+ - **custom:depth(1-5)**: Depth 1-3: scale thoroughness with depth. Depth 4:
58
+ full analysis + one external model (if CLI available). Depth 5: full
59
+ analysis + multi-model with reconciliation.
35
60
 
36
61
  ## Mode Detection
37
- Not applicable — validation always runs fresh against current artifacts.
62
+ Not applicable — validation always runs fresh against current artifacts. If
63
+ multi-model artifacts exist under docs/validation/critical-path-walkthrough/,
64
+ they are regenerated each run.
@@ -2,9 +2,9 @@
2
2
  name: cross-phase-consistency
3
3
  description: Audit naming, assumptions, data flows, interface contracts across all phases
4
4
  phase: "validation"
5
- order: 27
6
- dependencies: [review-tasks, review-security]
7
- outputs: [docs/validation/cross-phase-consistency.md]
5
+ order: 1310
6
+ dependencies: [implementation-plan-review, review-security]
7
+ outputs: [docs/validation/cross-phase-consistency.md, docs/validation/cross-phase-consistency/review-summary.md, docs/validation/cross-phase-consistency/codex-review.json, docs/validation/cross-phase-consistency/gemini-review.json]
8
8
  conditional: null
9
9
  knowledge-base: [cross-phase-consistency]
10
10
  ---
@@ -14,22 +14,49 @@ Audit naming, assumptions, data flows, interface contracts across all phases.
14
14
  Ensure consistent terminology, compatible assumptions, and aligned interfaces
15
15
  between every pair of phase artifacts.
16
16
 
17
+ At depth 4+, dispatches to external AI models (Codex, Gemini) for
18
+ independent consistency validation — different models catch different
19
+ drift patterns.
20
+
17
21
  ## Inputs
18
- - All phase output artifacts (docs/prd.md, docs/domain-models/, docs/adrs/,
22
+ - All phase output artifacts (docs/plan.md, docs/domain-models/, docs/adrs/,
19
23
  docs/system-architecture.md, etc.)
20
24
 
21
25
  ## Expected Outputs
22
26
  - docs/validation/cross-phase-consistency.md — findings report
27
+ - docs/validation/cross-phase-consistency/review-summary.md (depth 4+) — multi-model validation synthesis
28
+ - docs/validation/cross-phase-consistency/codex-review.json (depth 4+, if available) — raw Codex findings
29
+ - docs/validation/cross-phase-consistency/gemini-review.json (depth 4+, if available) — raw Gemini findings
23
30
 
24
31
  ## Quality Criteria
25
32
  - Analysis is comprehensive (not superficial)
26
33
  - Findings are actionable (specific file, section, and issue)
27
34
  - Severity categorization (P0-P3)
35
+ - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
36
+
37
+ ## Finding Disposition
38
+ - **P0 (blocking)**: Must be resolved before proceeding to implementation. Create
39
+ fix tasks and re-run affected upstream steps.
40
+ - **P1 (critical)**: Should be resolved; proceeding requires explicit risk acceptance
41
+ documented in an ADR. Flag to project lead.
42
+ - **P2 (medium)**: Document in implementation plan as tech debt. May defer to
43
+ post-launch with tracking issue.
44
+ - **P3 (minor)**: Log for future improvement. No action required before implementation.
45
+
46
+ Findings are reported in the validation output file with severity, affected artifact,
47
+ and recommended resolution. P0/P1 findings block the implementation-plan step from
48
+ proceeding without acknowledgment.
28
49
 
29
50
  ## Methodology Scaling
30
- - **deep**: Exhaustive analysis with all sub-checks.
51
+ - **deep**: Exhaustive analysis with all sub-checks. Multi-model validation
52
+ dispatched to Codex and Gemini if available, with graceful fallback to
53
+ Claude-only enhanced validation.
31
54
  - **mvp**: High-level scan for blocking issues only.
32
- - **custom:depth(1-5)**: Scale thoroughness with depth.
55
+ - **custom:depth(1-5)**: Depth 1-3: scale thoroughness with depth. Depth 4:
56
+ full analysis + one external model (if CLI available). Depth 5: full
57
+ analysis + multi-model with reconciliation.
33
58
 
34
59
  ## Mode Detection
35
- Not applicable — validation always runs fresh against current artifacts.
60
+ Not applicable — validation always runs fresh against current artifacts. If
61
+ multi-model artifacts exist under docs/validation/cross-phase-consistency/,
62
+ they are regenerated each run.
@@ -2,9 +2,9 @@
2
2
  name: decision-completeness
3
3
  description: Verify all decisions are recorded, justified, non-contradictory
4
4
  phase: "validation"
5
- order: 29
6
- dependencies: [review-tasks, review-security]
7
- outputs: [docs/validation/decision-completeness.md]
5
+ order: 1330
6
+ dependencies: [implementation-plan-review, review-security]
7
+ outputs: [docs/validation/decision-completeness.md, docs/validation/decision-completeness/review-summary.md, docs/validation/decision-completeness/codex-review.json, docs/validation/decision-completeness/gemini-review.json]
8
8
  conditional: null
9
9
  knowledge-base: [decision-completeness]
10
10
  ---
@@ -15,22 +15,49 @@ significant architectural and technology decision has a corresponding ADR,
15
15
  that no two ADRs contradict each other, and that all decisions have clear
16
16
  rationale.
17
17
 
18
+ At depth 4+, dispatches to external AI models (Codex, Gemini) for
19
+ independent decision audit — different models surface different implicit
20
+ decisions.
21
+
18
22
  ## Inputs
19
- - All phase output artifacts (docs/prd.md, docs/domain-models/, docs/adrs/,
23
+ - All phase output artifacts (docs/plan.md, docs/domain-models/, docs/adrs/,
20
24
  docs/system-architecture.md, etc.)
21
25
 
22
26
  ## Expected Outputs
23
27
  - docs/validation/decision-completeness.md — findings report
28
+ - docs/validation/decision-completeness/review-summary.md (depth 4+) — multi-model validation synthesis
29
+ - docs/validation/decision-completeness/codex-review.json (depth 4+, if available) — raw Codex findings
30
+ - docs/validation/decision-completeness/gemini-review.json (depth 4+, if available) — raw Gemini findings
24
31
 
25
32
  ## Quality Criteria
26
33
  - Analysis is comprehensive (not superficial)
27
34
  - Findings are actionable (specific file, section, and issue)
28
35
  - Severity categorization (P0-P3)
36
+ - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
37
+
38
+ ## Finding Disposition
39
+ - **P0 (blocking)**: Must be resolved before proceeding to implementation. Create
40
+ fix tasks and re-run affected upstream steps.
41
+ - **P1 (critical)**: Should be resolved; proceeding requires explicit risk acceptance
42
+ documented in an ADR. Flag to project lead.
43
+ - **P2 (medium)**: Document in implementation plan as tech debt. May defer to
44
+ post-launch with tracking issue.
45
+ - **P3 (minor)**: Log for future improvement. No action required before implementation.
46
+
47
+ Findings are reported in the validation output file with severity, affected artifact,
48
+ and recommended resolution. P0/P1 findings block the implementation-plan step from
49
+ proceeding without acknowledgment.
29
50
 
30
51
  ## Methodology Scaling
31
- - **deep**: Exhaustive analysis with all sub-checks.
52
+ - **deep**: Exhaustive analysis with all sub-checks. Multi-model validation
53
+ dispatched to Codex and Gemini if available, with graceful fallback to
54
+ Claude-only enhanced validation.
32
55
  - **mvp**: High-level scan for blocking issues only.
33
- - **custom:depth(1-5)**: Scale thoroughness with depth.
56
+ - **custom:depth(1-5)**: Depth 1-3: scale thoroughness with depth. Depth 4:
57
+ full analysis + one external model (if CLI available). Depth 5: full
58
+ analysis + multi-model with reconciliation.
34
59
 
35
60
  ## Mode Detection
36
- Not applicable — validation always runs fresh against current artifacts.
61
+ Not applicable — validation always runs fresh against current artifacts. If
62
+ multi-model artifacts exist under docs/validation/decision-completeness/,
63
+ they are regenerated each run.
@@ -2,9 +2,9 @@
2
2
  name: dependency-graph-validation
3
3
  description: Verify task dependency graphs are acyclic, complete, correctly ordered
4
4
  phase: "validation"
5
- order: 32
6
- dependencies: [review-tasks, review-security]
7
- outputs: [docs/validation/dependency-graph-validation.md]
5
+ order: 1360
6
+ dependencies: [implementation-plan-review, review-security]
7
+ outputs: [docs/validation/dependency-graph-validation.md, docs/validation/dependency-graph-validation/review-summary.md, docs/validation/dependency-graph-validation/codex-review.json, docs/validation/dependency-graph-validation/gemini-review.json]
8
8
  conditional: null
9
9
  knowledge-base: [dependency-validation]
10
10
  ---
@@ -15,22 +15,49 @@ Validate that the implementation task dependency graph forms a valid DAG,
15
15
  that all dependencies are satisfied before dependent tasks, and that no
16
16
  critical tasks are missing from the graph.
17
17
 
18
+ At depth 4+, dispatches to external AI models (Codex, Gemini) for
19
+ independent graph validation — different models catch different ordering
20
+ and completeness issues.
21
+
18
22
  ## Inputs
19
- - All phase output artifacts (docs/prd.md, docs/domain-models/, docs/adrs/,
23
+ - All phase output artifacts (docs/plan.md, docs/domain-models/, docs/adrs/,
20
24
  docs/system-architecture.md, etc.)
21
25
 
22
26
  ## Expected Outputs
23
27
  - docs/validation/dependency-graph-validation.md — findings report
28
+ - docs/validation/dependency-graph-validation/review-summary.md (depth 4+) — multi-model validation synthesis
29
+ - docs/validation/dependency-graph-validation/codex-review.json (depth 4+, if available) — raw Codex findings
30
+ - docs/validation/dependency-graph-validation/gemini-review.json (depth 4+, if available) — raw Gemini findings
24
31
 
25
32
  ## Quality Criteria
26
33
  - Analysis is comprehensive (not superficial)
27
34
  - Findings are actionable (specific file, section, and issue)
28
35
  - Severity categorization (P0-P3)
36
+ - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
37
+
38
+ ## Finding Disposition
39
+ - **P0 (blocking)**: Must be resolved before proceeding to implementation. Create
40
+ fix tasks and re-run affected upstream steps.
41
+ - **P1 (critical)**: Should be resolved; proceeding requires explicit risk acceptance
42
+ documented in an ADR. Flag to project lead.
43
+ - **P2 (medium)**: Document in implementation plan as tech debt. May defer to
44
+ post-launch with tracking issue.
45
+ - **P3 (minor)**: Log for future improvement. No action required before implementation.
46
+
47
+ Findings are reported in the validation output file with severity, affected artifact,
48
+ and recommended resolution. P0/P1 findings block the implementation-plan step from
49
+ proceeding without acknowledgment.
29
50
 
30
51
  ## Methodology Scaling
31
- - **deep**: Exhaustive analysis with all sub-checks.
52
+ - **deep**: Exhaustive analysis with all sub-checks. Multi-model validation
53
+ dispatched to Codex and Gemini if available, with graceful fallback to
54
+ Claude-only enhanced validation.
32
55
  - **mvp**: High-level scan for blocking issues only.
33
- - **custom:depth(1-5)**: Scale thoroughness with depth.
56
+ - **custom:depth(1-5)**: Depth 1-3: scale thoroughness with depth. Depth 4:
57
+ full analysis + one external model (if CLI available). Depth 5: full
58
+ analysis + multi-model with reconciliation.
34
59
 
35
60
  ## Mode Detection
36
- Not applicable — validation always runs fresh against current artifacts.
61
+ Not applicable — validation always runs fresh against current artifacts. If
62
+ multi-model artifacts exist under docs/validation/dependency-graph-validation/,
63
+ they are regenerated each run.
@@ -2,9 +2,9 @@
2
2
  name: implementability-dry-run
3
3
  description: Dry-run specs as implementing agent, catching ambiguity
4
4
  phase: "validation"
5
- order: 31
6
- dependencies: [review-tasks, review-security]
7
- outputs: [docs/validation/implementability-dry-run.md]
5
+ order: 1350
6
+ dependencies: [implementation-plan-review, review-security]
7
+ outputs: [docs/validation/implementability-dry-run.md, docs/validation/implementability-dry-run/review-summary.md, docs/validation/implementability-dry-run/codex-review.json, docs/validation/implementability-dry-run/gemini-review.json]
8
8
  conditional: null
9
9
  knowledge-base: [implementability-review]
10
10
  ---
@@ -15,22 +15,49 @@ AI agent would experience when picking up each implementation task: are the
15
15
  inputs clear, are the acceptance criteria testable, are there ambiguities
16
16
  that would force the agent to guess?
17
17
 
18
+ At depth 4+, dispatches to external AI models (Codex, Gemini) for
19
+ independent dry-runs — different models encounter different ambiguities
20
+ when simulating implementation.
21
+
18
22
  ## Inputs
19
- - All phase output artifacts (docs/prd.md, docs/domain-models/, docs/adrs/,
23
+ - All phase output artifacts (docs/plan.md, docs/domain-models/, docs/adrs/,
20
24
  docs/system-architecture.md, etc.)
21
25
 
22
26
  ## Expected Outputs
23
27
  - docs/validation/implementability-dry-run.md — findings report
28
+ - docs/validation/implementability-dry-run/review-summary.md (depth 4+) — multi-model validation synthesis
29
+ - docs/validation/implementability-dry-run/codex-review.json (depth 4+, if available) — raw Codex findings
30
+ - docs/validation/implementability-dry-run/gemini-review.json (depth 4+, if available) — raw Gemini findings
24
31
 
25
32
  ## Quality Criteria
26
33
  - Analysis is comprehensive (not superficial)
27
34
  - Findings are actionable (specific file, section, and issue)
28
35
  - Severity categorization (P0-P3)
36
+ - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
37
+
38
+ ## Finding Disposition
39
+ - **P0 (blocking)**: Must be resolved before proceeding to implementation. Create
40
+ fix tasks and re-run affected upstream steps.
41
+ - **P1 (critical)**: Should be resolved; proceeding requires explicit risk acceptance
42
+ documented in an ADR. Flag to project lead.
43
+ - **P2 (medium)**: Document in implementation plan as tech debt. May defer to
44
+ post-launch with tracking issue.
45
+ - **P3 (minor)**: Log for future improvement. No action required before implementation.
46
+
47
+ Findings are reported in the validation output file with severity, affected artifact,
48
+ and recommended resolution. P0/P1 findings block the implementation-plan step from
49
+ proceeding without acknowledgment.
29
50
 
30
51
  ## Methodology Scaling
31
- - **deep**: Exhaustive analysis with all sub-checks.
52
+ - **deep**: Exhaustive analysis with all sub-checks. Multi-model validation
53
+ dispatched to Codex and Gemini if available, with graceful fallback to
54
+ Claude-only enhanced validation.
32
55
  - **mvp**: High-level scan for blocking issues only.
33
- - **custom:depth(1-5)**: Scale thoroughness with depth.
56
+ - **custom:depth(1-5)**: Depth 1-3: scale thoroughness with depth. Depth 4:
57
+ full analysis + one external model (if CLI available). Depth 5: full
58
+ analysis + multi-model with reconciliation.
34
59
 
35
60
  ## Mode Detection
36
- Not applicable — validation always runs fresh against current artifacts.
61
+ Not applicable — validation always runs fresh against current artifacts. If
62
+ multi-model artifacts exist under docs/validation/implementability-dry-run/,
63
+ they are regenerated each run.
@@ -2,9 +2,9 @@
2
2
  name: scope-creep-check
3
3
  description: Verify specs stay aligned to PRD boundaries
4
4
  phase: "validation"
5
- order: 33
6
- dependencies: [review-tasks, review-security]
7
- outputs: [docs/validation/scope-creep-check.md]
5
+ order: 1370
6
+ dependencies: [implementation-plan-review, review-security]
7
+ outputs: [docs/validation/scope-creep-check.md, docs/validation/scope-creep-check/review-summary.md, docs/validation/scope-creep-check/codex-review.json, docs/validation/scope-creep-check/gemini-review.json]
8
8
  conditional: null
9
9
  knowledge-base: [scope-management]
10
10
  ---
@@ -17,22 +17,49 @@ should not introduce features not in the PRD — UX-level enhancements are
17
17
  allowed only via the innovation step with explicit user approval. Flag any
18
18
  scope expansion for explicit approval.
19
19
 
20
+ At depth 4+, dispatches to external AI models (Codex, Gemini) for
21
+ independent scope analysis — different models interpret PRD boundaries
22
+ differently, surfacing subtle creep.
23
+
20
24
  ## Inputs
21
- - All phase output artifacts (docs/prd.md, docs/domain-models/, docs/adrs/,
25
+ - All phase output artifacts (docs/plan.md, docs/domain-models/, docs/adrs/,
22
26
  docs/system-architecture.md, etc.)
23
27
 
24
28
  ## Expected Outputs
25
29
  - docs/validation/scope-creep-check.md — findings report
30
+ - docs/validation/scope-creep-check/review-summary.md (depth 4+) — multi-model validation synthesis
31
+ - docs/validation/scope-creep-check/codex-review.json (depth 4+, if available) — raw Codex findings
32
+ - docs/validation/scope-creep-check/gemini-review.json (depth 4+, if available) — raw Gemini findings
26
33
 
27
34
  ## Quality Criteria
28
35
  - Analysis is comprehensive (not superficial)
29
36
  - Findings are actionable (specific file, section, and issue)
30
37
  - Severity categorization (P0-P3)
38
+ - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
39
+
40
+ ## Finding Disposition
41
+ - **P0 (blocking)**: Must be resolved before proceeding to implementation. Create
42
+ fix tasks and re-run affected upstream steps.
43
+ - **P1 (critical)**: Should be resolved; proceeding requires explicit risk acceptance
44
+ documented in an ADR. Flag to project lead.
45
+ - **P2 (medium)**: Document in implementation plan as tech debt. May defer to
46
+ post-launch with tracking issue.
47
+ - **P3 (minor)**: Log for future improvement. No action required before implementation.
48
+
49
+ Findings are reported in the validation output file with severity, affected artifact,
50
+ and recommended resolution. P0/P1 findings block the implementation-plan step from
51
+ proceeding without acknowledgment.
31
52
 
32
53
  ## Methodology Scaling
33
- - **deep**: Exhaustive analysis with all sub-checks.
54
+ - **deep**: Exhaustive analysis with all sub-checks. Multi-model validation
55
+ dispatched to Codex and Gemini if available, with graceful fallback to
56
+ Claude-only enhanced validation.
34
57
  - **mvp**: High-level scan for blocking issues only.
35
- - **custom:depth(1-5)**: Scale thoroughness with depth.
58
+ - **custom:depth(1-5)**: Depth 1-3: scale thoroughness with depth. Depth 4:
59
+ full analysis + one external model (if CLI available). Depth 5: full
60
+ analysis + multi-model with reconciliation.
36
61
 
37
62
  ## Mode Detection
38
- Not applicable — validation always runs fresh against current artifacts.
63
+ Not applicable — validation always runs fresh against current artifacts. If
64
+ multi-model artifacts exist under docs/validation/scope-creep-check/,
65
+ they are regenerated each run.
@@ -2,9 +2,9 @@
2
2
  name: traceability-matrix
3
3
  description: Build traceability from PRD requirements through architecture to implementation tasks
4
4
  phase: "validation"
5
- order: 28
6
- dependencies: [review-tasks, review-security]
7
- outputs: [docs/validation/traceability-matrix.md]
5
+ order: 1320
6
+ dependencies: [implementation-plan-review, review-security]
7
+ outputs: [docs/validation/traceability-matrix.md, docs/validation/traceability-matrix/review-summary.md, docs/validation/traceability-matrix/codex-review.json, docs/validation/traceability-matrix/gemini-review.json]
8
8
  conditional: null
9
9
  knowledge-base: [traceability]
10
10
  ---
@@ -15,22 +15,49 @@ to implementation tasks. Verify the full chain: PRD → User Stories → Domain
15
15
  Model → Architecture → Tasks, with no orphans in either direction. Every PRD
16
16
  requirement must trace to at least one story, every story to at least one task.
17
17
 
18
+ At depth 4+, dispatches to external AI models (Codex, Gemini) for
19
+ independent traceability validation — different models catch different
20
+ coverage gaps.
21
+
18
22
  ## Inputs
19
- - All phase output artifacts (docs/prd.md, docs/domain-models/, docs/adrs/,
23
+ - All phase output artifacts (docs/plan.md, docs/domain-models/, docs/adrs/,
20
24
  docs/system-architecture.md, etc.)
21
25
 
22
26
  ## Expected Outputs
23
27
  - docs/validation/traceability-matrix.md — findings report
28
+ - docs/validation/traceability-matrix/review-summary.md (depth 4+) — multi-model validation synthesis
29
+ - docs/validation/traceability-matrix/codex-review.json (depth 4+, if available) — raw Codex findings
30
+ - docs/validation/traceability-matrix/gemini-review.json (depth 4+, if available) — raw Gemini findings
24
31
 
25
32
  ## Quality Criteria
26
33
  - Analysis is comprehensive (not superficial)
27
34
  - Findings are actionable (specific file, section, and issue)
28
35
  - Severity categorization (P0-P3)
36
+ - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
37
+
38
+ ## Finding Disposition
39
+ - **P0 (blocking)**: Must be resolved before proceeding to implementation. Create
40
+ fix tasks and re-run affected upstream steps.
41
+ - **P1 (critical)**: Should be resolved; proceeding requires explicit risk acceptance
42
+ documented in an ADR. Flag to project lead.
43
+ - **P2 (medium)**: Document in implementation plan as tech debt. May defer to
44
+ post-launch with tracking issue.
45
+ - **P3 (minor)**: Log for future improvement. No action required before implementation.
46
+
47
+ Findings are reported in the validation output file with severity, affected artifact,
48
+ and recommended resolution. P0/P1 findings block the implementation-plan step from
49
+ proceeding without acknowledgment.
29
50
 
30
51
  ## Methodology Scaling
31
- - **deep**: Exhaustive analysis with all sub-checks.
52
+ - **deep**: Exhaustive analysis with all sub-checks. Multi-model validation
53
+ dispatched to Codex and Gemini if available, with graceful fallback to
54
+ Claude-only enhanced validation.
32
55
  - **mvp**: High-level scan for blocking issues only.
33
- - **custom:depth(1-5)**: Scale thoroughness with depth.
56
+ - **custom:depth(1-5)**: Depth 1-3: scale thoroughness with depth. Depth 4:
57
+ full analysis + one external model (if CLI available). Depth 5: full
58
+ analysis + multi-model with reconciliation.
34
59
 
35
60
  ## Mode Detection
36
- Not applicable — validation always runs fresh against current artifacts.
61
+ Not applicable — validation always runs fresh against current artifacts. If
62
+ multi-model artifacts exist under docs/validation/traceability-matrix/,
63
+ they are regenerated each run.