@zigrivers/scaffold 2.1.2 → 2.28.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (97) hide show
  1. package/README.md +272 -59
  2. package/knowledge/core/adr-craft.md +53 -0
  3. package/knowledge/core/ai-memory-management.md +246 -0
  4. package/knowledge/core/api-design.md +4 -0
  5. package/knowledge/core/claude-md-patterns.md +254 -0
  6. package/knowledge/core/coding-conventions.md +246 -0
  7. package/knowledge/core/database-design.md +4 -0
  8. package/knowledge/core/design-system-tokens.md +465 -0
  9. package/knowledge/core/dev-environment.md +223 -0
  10. package/knowledge/core/domain-modeling.md +4 -0
  11. package/knowledge/core/eval-craft.md +1008 -0
  12. package/knowledge/core/multi-model-review-dispatch.md +250 -0
  13. package/knowledge/core/operations-runbook.md +37 -226
  14. package/knowledge/core/project-structure-patterns.md +231 -0
  15. package/knowledge/core/review-step-template.md +247 -0
  16. package/knowledge/core/{security-review.md → security-best-practices.md} +5 -1
  17. package/knowledge/core/task-decomposition.md +57 -34
  18. package/knowledge/core/task-tracking.md +225 -0
  19. package/knowledge/core/tech-stack-selection.md +214 -0
  20. package/knowledge/core/testing-strategy.md +63 -70
  21. package/knowledge/core/user-stories.md +69 -60
  22. package/knowledge/core/user-story-innovation.md +57 -0
  23. package/knowledge/core/ux-specification.md +5 -148
  24. package/knowledge/finalization/apply-fixes-and-freeze.md +165 -14
  25. package/knowledge/product/prd-craft.md +55 -34
  26. package/knowledge/review/review-adr.md +32 -0
  27. package/knowledge/review/{review-api-contracts.md → review-api-design.md} +34 -1
  28. package/knowledge/review/{review-database-schema.md → review-database-design.md} +27 -1
  29. package/knowledge/review/review-domain-modeling.md +33 -0
  30. package/knowledge/review/review-implementation-tasks.md +50 -0
  31. package/knowledge/review/review-operations.md +55 -0
  32. package/knowledge/review/review-prd.md +33 -0
  33. package/knowledge/review/review-security.md +53 -0
  34. package/knowledge/review/review-system-architecture.md +28 -0
  35. package/knowledge/review/review-testing-strategy.md +51 -0
  36. package/knowledge/review/review-user-stories.md +54 -0
  37. package/knowledge/review/{review-ux-spec.md → review-ux-specification.md} +37 -1
  38. package/methodology/custom-defaults.yml +32 -3
  39. package/methodology/deep.yml +32 -3
  40. package/methodology/mvp.yml +32 -3
  41. package/package.json +2 -1
  42. package/pipeline/architecture/review-architecture.md +18 -6
  43. package/pipeline/architecture/system-architecture.md +14 -2
  44. package/pipeline/consolidation/claude-md-optimization.md +73 -0
  45. package/pipeline/consolidation/workflow-audit.md +73 -0
  46. package/pipeline/decisions/adrs.md +14 -2
  47. package/pipeline/decisions/review-adrs.md +18 -5
  48. package/pipeline/environment/ai-memory-setup.md +70 -0
  49. package/pipeline/environment/automated-pr-review.md +70 -0
  50. package/pipeline/environment/design-system.md +73 -0
  51. package/pipeline/environment/dev-env-setup.md +65 -0
  52. package/pipeline/environment/git-workflow.md +71 -0
  53. package/pipeline/finalization/apply-fixes-and-freeze.md +1 -1
  54. package/pipeline/finalization/developer-onboarding-guide.md +1 -1
  55. package/pipeline/finalization/implementation-playbook.md +3 -3
  56. package/pipeline/foundation/beads.md +68 -0
  57. package/pipeline/foundation/coding-standards.md +68 -0
  58. package/pipeline/foundation/project-structure.md +69 -0
  59. package/pipeline/foundation/tdd.md +60 -0
  60. package/pipeline/foundation/tech-stack.md +74 -0
  61. package/pipeline/integration/add-e2e-testing.md +65 -0
  62. package/pipeline/modeling/domain-modeling.md +14 -2
  63. package/pipeline/modeling/review-domain-modeling.md +18 -5
  64. package/pipeline/parity/platform-parity-review.md +70 -0
  65. package/pipeline/planning/implementation-plan-review.md +56 -0
  66. package/pipeline/planning/{implementation-tasks.md → implementation-plan.md} +29 -9
  67. package/pipeline/pre/create-prd.md +13 -4
  68. package/pipeline/pre/innovate-prd.md +37 -8
  69. package/pipeline/pre/innovate-user-stories.md +38 -7
  70. package/pipeline/pre/review-prd.md +18 -6
  71. package/pipeline/pre/review-user-stories.md +23 -6
  72. package/pipeline/pre/user-stories.md +12 -2
  73. package/pipeline/quality/create-evals.md +102 -0
  74. package/pipeline/quality/operations.md +38 -13
  75. package/pipeline/quality/review-operations.md +17 -5
  76. package/pipeline/quality/review-security.md +17 -5
  77. package/pipeline/quality/review-testing.md +20 -8
  78. package/pipeline/quality/security.md +25 -3
  79. package/pipeline/quality/story-tests.md +73 -0
  80. package/pipeline/specification/api-contracts.md +17 -2
  81. package/pipeline/specification/database-schema.md +17 -2
  82. package/pipeline/specification/review-api.md +18 -6
  83. package/pipeline/specification/review-database.md +18 -6
  84. package/pipeline/specification/review-ux.md +19 -7
  85. package/pipeline/specification/ux-spec.md +29 -10
  86. package/pipeline/validation/critical-path-walkthrough.md +34 -7
  87. package/pipeline/validation/cross-phase-consistency.md +34 -7
  88. package/pipeline/validation/decision-completeness.md +34 -7
  89. package/pipeline/validation/dependency-graph-validation.md +34 -7
  90. package/pipeline/validation/implementability-dry-run.md +34 -7
  91. package/pipeline/validation/scope-creep-check.md +34 -7
  92. package/pipeline/validation/traceability-matrix.md +34 -7
  93. package/skills/multi-model-dispatch/SKILL.md +326 -0
  94. package/skills/scaffold-pipeline/SKILL.md +195 -0
  95. package/skills/scaffold-runner/SKILL.md +465 -0
  96. package/pipeline/planning/review-tasks.md +0 -38
  97. package/pipeline/quality/testing-strategy.md +0 -42
@@ -2,9 +2,10 @@
2
2
  name: domain-modeling
3
3
  description: Deep domain modeling across all identified project domains
4
4
  phase: "modeling"
5
- order: 7
5
+ order: 510
6
6
  dependencies: [review-user-stories]
7
7
  outputs: [docs/domain-models/]
8
+ reads: [coding-standards]
8
9
  conditional: null
9
10
  knowledge-base: [domain-modeling]
10
11
  ---
@@ -17,7 +18,7 @@ Use user stories and their acceptance criteria to discover entities, events,
17
18
  and aggregate boundaries. User actions reveal the domain model.
18
19
 
19
20
  ## Inputs
20
- - docs/prd.md (required) — requirements defining the problem space
21
+ - docs/plan.md (required) — requirements defining the problem space
21
22
  - docs/reviews/pre-review-prd.md (optional) — review findings for context
22
23
  - docs/prd-innovation.md (optional) — innovation findings and approved enhancements
23
24
  - docs/user-stories.md (required) — user stories with acceptance criteria for domain discovery
@@ -55,3 +56,14 @@ and aggregate boundaries. User actions reveal the domain model.
55
56
  If docs/domain-models/ exists, operate in update mode: read existing models,
56
57
  identify changes needed based on updated PRD or new understanding. Preserve
57
58
  existing decisions unless explicitly revisiting them.
59
+
60
+ ## Update Mode Specifics
61
+ - **Detect prior artifact**: docs/domain-models/ directory exists with model files
62
+ - **Preserve**: existing entity definitions, aggregate boundaries, domain events,
63
+ invariants, ubiquitous language terms, bounded context interfaces
64
+ - **Triggers for update**: PRD features added or changed, user stories revealed
65
+ new domain concepts, innovation suggestions accepted new capabilities,
66
+ implementation revealed modeling gaps
67
+ - **Conflict resolution**: if a new feature introduces an entity name that
68
+ conflicts with existing ubiquitous language, resolve by renaming the new
69
+ entity and documenting the distinction; never silently merge aggregates
@@ -2,9 +2,9 @@
2
2
  name: review-domain-modeling
3
3
  description: Review domain models for completeness, consistency, and downstream readiness
4
4
  phase: "modeling"
5
- order: 8
5
+ order: 520
6
6
  dependencies: [domain-modeling]
7
- outputs: [docs/reviews/review-domain-modeling.md]
7
+ outputs: [docs/reviews/review-domain-modeling.md, docs/reviews/domain-modeling/review-summary.md, docs/reviews/domain-modeling/codex-review.json, docs/reviews/domain-modeling/gemini-review.json]
8
8
  conditional: null
9
9
  knowledge-base: [review-methodology, review-domain-modeling]
10
10
  ---
@@ -14,13 +14,19 @@ Deep multi-pass review of the domain models, targeting the specific failure mode
14
14
  of domain modeling artifacts. Identify issues, create a fix plan, execute fixes,
15
15
  and re-validate.
16
16
 
17
+ At depth 4+, dispatches to external AI models (Codex, Gemini) for
18
+ independent review validation.
19
+
17
20
  ## Inputs
18
21
  - docs/domain-models/ (required) — domain models to review
19
- - docs/prd.md (required) — source requirements for coverage checking
22
+ - docs/plan.md (required) — source requirements for coverage checking
20
23
 
21
24
  ## Expected Outputs
22
25
  - docs/reviews/review-domain-modeling.md — review findings, fix plan, and resolution log
23
26
  - docs/domain-models/ — updated with fixes
27
+ - docs/reviews/domain-modeling/review-summary.md (depth 4+) — multi-model review synthesis
28
+ - docs/reviews/domain-modeling/codex-review.json (depth 4+, if available) — raw Codex findings
29
+ - docs/reviews/domain-modeling/gemini-review.json (depth 4+, if available) — raw Gemini findings
24
30
 
25
31
  ## Quality Criteria
26
32
  - All review passes executed with findings documented
@@ -28,14 +34,21 @@ and re-validate.
28
34
  - Fix plan created for P0 and P1 findings
29
35
  - Fixes applied and re-validated
30
36
  - Downstream readiness confirmed (decisions phase can proceed)
37
+ - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
31
38
 
32
39
  ## Methodology Scaling
33
40
  - **deep**: All review passes from the knowledge base. Full findings report
34
- with severity categorization. Fixes applied and re-validated.
41
+ with severity categorization. Fixes applied and re-validated. Multi-model
42
+ review dispatched to Codex and Gemini if available, with graceful fallback
43
+ to Claude-only enhanced review.
35
44
  - **mvp**: Quick consistency check. Focus on blocking issues only.
36
45
  - **custom:depth(1-5)**: Depth 1-2: blocking issues only. Depth 3: add coverage
37
- and consistency passes. Depth 4-5: full multi-pass review.
46
+ and consistency passes. Depth 4: full multi-pass review + one external model
47
+ (if CLI available). Depth 5: full multi-pass review + multi-model with
48
+ reconciliation.
38
49
 
39
50
  ## Mode Detection
40
51
  If docs/reviews/review-domain-modeling.md exists, this is a re-review. Read previous
41
52
  findings, check which were addressed, run review passes again on updated models.
53
+ If multi-model review artifacts exist under docs/reviews/domain-modeling/,
54
+ preserve prior findings still valid.
@@ -0,0 +1,70 @@
1
+ ---
2
+ name: platform-parity-review
3
+ description: Audit all documentation for platform-specific gaps across target platforms
4
+ phase: "parity"
5
+ order: 1010
6
+ dependencies: [review-architecture, review-database, review-api, review-ux]
7
+ outputs: [docs/reviews/platform-parity-review.md, docs/reviews/platform-parity/review-summary.md, docs/reviews/platform-parity/codex-review.json, docs/reviews/platform-parity/gemini-review.json]
8
+ reads: [user-stories, coding-standards, tech-stack, project-structure, tdd]
9
+ conditional: "if-needed"
10
+ knowledge-base: [cross-phase-consistency]
11
+ ---
12
+
13
+ ## Purpose
14
+ When the project targets multiple platforms (web, iOS, Android, desktop), audit
15
+ all documentation to ensure every target platform is thoroughly addressed.
16
+ Identify gaps where one platform was assumed but another was not considered,
17
+ verify feature parity across targets, and ensure platform-specific testing,
18
+ input patterns, and UX considerations are documented.
19
+
20
+ At depth 4+, dispatches to external AI models (Codex, Gemini) for
21
+ independent platform gap analysis.
22
+
23
+ ## Inputs
24
+ - docs/plan.md (required) — target platforms and version requirements
25
+ - docs/tech-stack.md (required) — cross-platform framework and build approach
26
+ - docs/user-stories.md (required) — stories to check for platform coverage
27
+ - docs/coding-standards.md (required) — platform-specific conventions
28
+ - docs/project-structure.md (optional) — platform-specific file organization
29
+ - docs/tdd-standards.md (optional) — platform-specific testing approach
30
+ - docs/design-system.md (optional) — responsive breakpoints and platform patterns
31
+ - docs/implementation-plan.md (optional) — tasks covering each platform
32
+ - CLAUDE.md (required) — platform-specific workflow notes
33
+
34
+ ## Expected Outputs
35
+ - docs/reviews/platform-parity-review.md — platform gap analysis report with
36
+ findings per document, feature parity matrix, and recommended fixes
37
+ - docs/reviews/platform-parity/review-summary.md (depth 4+) — multi-model review synthesis
38
+ - docs/reviews/platform-parity/codex-review.json (depth 4+, if available) — raw Codex findings
39
+ - docs/reviews/platform-parity/gemini-review.json (depth 4+, if available) — raw Gemini findings
40
+
41
+ ## Quality Criteria
42
+ - All target platforms identified from PRD and tech-stack.md
43
+ - Every user story checked for platform-specific acceptance criteria
44
+ - Feature parity matrix shows which features work on which platforms
45
+ - Input pattern differences documented (touch vs. mouse, keyboard shortcuts, gestures)
46
+ - Platform-specific testing documented (Playwright for web, Maestro for mobile)
47
+ - Navigation patterns appropriate per platform (sidebar vs. tab bar, etc.)
48
+ - Offline/connectivity handling addressed per platform (if applicable)
49
+ - Web version is treated as first-class (not afterthought) if PRD specifies it
50
+ - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
51
+
52
+ ## Methodology Scaling
53
+ - **deep**: Comprehensive platform audit across all documents, feature parity
54
+ matrix, input pattern analysis, navigation pattern review, offline handling,
55
+ accessibility per platform, and detailed fix recommendations. Multi-model
56
+ review dispatched to Codex and Gemini if available, with graceful fallback
57
+ to Claude-only enhanced review.
58
+ - **mvp**: Quick check of user stories and tech-stack for platform coverage.
59
+ Identify top 3 platform gaps. Skip detailed feature parity matrix.
60
+ - **custom:depth(1-5)**: Depth 1-2: user stories platform check. Depth 3: add
61
+ tech-stack and coding-standards. Depth 4: add feature parity matrix + one
62
+ external model (if CLI available). Depth 5: full suite across all documents
63
+ + multi-model with reconciliation.
64
+
65
+ ## Mode Detection
66
+ Update mode if docs/reviews/platform-parity-review.md exists. In update mode:
67
+ re-run audit against current documents, preserve prior findings still valid,
68
+ note which gaps have been addressed since last review. If multi-model review
69
+ artifacts exist under docs/reviews/platform-parity/, preserve prior findings
70
+ still valid.
@@ -0,0 +1,56 @@
1
+ ---
2
+ name: implementation-plan-review
3
+ description: Review implementation tasks for coverage, feasibility, and multi-model validation
4
+ phase: "planning"
5
+ order: 1220
6
+ dependencies: [implementation-plan]
7
+ outputs: [docs/reviews/review-tasks.md, docs/reviews/implementation-plan/task-coverage.json, docs/reviews/implementation-plan/review-summary.md]
8
+ conditional: null
9
+ knowledge-base: [review-methodology, review-implementation-tasks, task-decomposition]
10
+ ---
11
+
12
+ ## Purpose
13
+ Review implementation tasks targeting task-specific failure modes: architecture
14
+ coverage gaps, missing dependencies, tasks too large or too vague for agents,
15
+ critical path inaccuracy, and invalid parallelization assumptions. At depth 4+,
16
+ dispatch to independent AI models (Codex/Gemini CLIs) for multi-model validation
17
+ and produce a structured coverage matrix and review summary.
18
+
19
+ ## Inputs
20
+ - docs/implementation-plan.md (required) — tasks to review
21
+ - docs/system-architecture.md (required) — for coverage checking
22
+ - docs/domain-models/ (required) — for completeness
23
+ - docs/user-stories.md (required) — for AC coverage mapping
24
+ - docs/plan.md (required) — for traceability
25
+ - docs/project-structure.md (required) — for file contention analysis
26
+ - docs/tdd-standards.md (required) — for test requirement verification
27
+
28
+ ## Expected Outputs
29
+ - docs/reviews/review-tasks.md — findings and resolution log
30
+ - docs/implementation-plan.md — updated with fixes
31
+ - docs/reviews/implementation-plan/task-coverage.json — AC-to-task coverage matrix (depth 3+)
32
+ - docs/reviews/implementation-plan/review-summary.md — multi-model review summary (depth 4+)
33
+ - docs/reviews/implementation-plan/codex-review.json — raw Codex findings (depth 4+, if available)
34
+ - docs/reviews/implementation-plan/gemini-review.json — raw Gemini findings (depth 4+, if available)
35
+
36
+ ## Quality Criteria
37
+ - Architecture coverage verified (every component has tasks)
38
+ - Dependency graph is valid DAG
39
+ - No task is too large for a single agent session
40
+ - Critical path is accurate
41
+ - Parallelization assumptions are valid
42
+ - Every acceptance criterion maps to at least one task (100% AC coverage)
43
+ - Task descriptions are unambiguous enough for AI agents (agent-implementable)
44
+ - At depth 4+: independent model reviews completed and reconciled
45
+
46
+ ## Methodology Scaling
47
+ - **deep**: Full multi-pass review with multi-model validation. AC coverage
48
+ matrix. Independent Codex/Gemini dispatches. Detailed reconciliation report.
49
+ - **mvp**: Coverage check only. No external model dispatch.
50
+ - **custom:depth(1-5)**: Depth 1-2: coverage check. Depth 3: add dependency
51
+ analysis and AC coverage matrix. Depth 4: add one external model. Depth 5:
52
+ full multi-model with reconciliation.
53
+
54
+ ## Mode Detection
55
+ Re-review mode if previous review exists. If multi-model review artifacts exist
56
+ under docs/reviews/implementation-plan/, preserve prior findings still valid.
@@ -1,10 +1,11 @@
1
1
  ---
2
- name: implementation-tasks
2
+ name: implementation-plan
3
3
  description: Break architecture into implementable tasks with dependencies
4
4
  phase: "planning"
5
- order: 25
6
- dependencies: [testing-strategy, operations, security]
7
- outputs: [docs/implementation-tasks.md]
5
+ order: 1210
6
+ dependencies: [tdd, operations, security, review-architecture, create-evals]
7
+ outputs: [docs/implementation-plan.md]
8
+ reads: [create-prd]
8
9
  conditional: null
9
10
  knowledge-base: [task-decomposition]
10
11
  ---
@@ -19,9 +20,9 @@ The primary mapping is Story → Task(s), with PRD as the traceability root.
19
20
  - docs/system-architecture.md (required) — components to implement
20
21
  - docs/domain-models/ (required) — domain logic to implement
21
22
  - docs/adrs/ (required) — technology constraints
22
- - docs/prd.md (required) — features to trace tasks back to
23
+ - docs/plan.md (required) — features to trace tasks back to
23
24
  - docs/user-stories.md (required) — stories to derive tasks from
24
- - docs/testing-strategy.md (required) — testing requirements to incorporate into tasks
25
+ - docs/tdd-standards.md (required) — testing requirements to incorporate into tasks
25
26
  - docs/operations-runbook.md (optional) — ops requirements to incorporate into tasks
26
27
  - docs/security-review.md (optional) — security requirements to incorporate into tasks
27
28
  - docs/database-schema.md (optional) — data layer tasks
@@ -29,7 +30,7 @@ The primary mapping is Story → Task(s), with PRD as the traceability root.
29
30
  - docs/ux-spec.md (optional) — frontend tasks
30
31
 
31
32
  ## Expected Outputs
32
- - docs/implementation-tasks.md — task list with dependencies, sizing, and
33
+ - docs/implementation-plan.md — task list with dependencies, sizing, and
33
34
  assignment recommendations
34
35
 
35
36
  ## Quality Criteria
@@ -41,8 +42,10 @@ The primary mapping is Story → Task(s), with PRD as the traceability root.
41
42
  - Tasks incorporate security controls from the security review where applicable
42
43
  - Tasks incorporate operational requirements (monitoring, deployment) where applicable
43
44
  - Critical path is identified
44
- - Parallelization opportunities are marked
45
+ - Parallelization opportunities are marked with wave plan
45
46
  - Every user story maps to at least one task
47
+ - High-risk tasks are flagged with risk type and mitigation
48
+ - Wave summary produced with agent allocation recommendation
46
49
 
47
50
  ## Methodology Scaling
48
51
  - **deep**: Detailed task breakdown with story-to-task tracing. Dependency graph.
@@ -54,4 +57,21 @@ The primary mapping is Story → Task(s), with PRD as the traceability root.
54
57
  and sizing. Depth 4-5: full breakdown with parallelization.
55
58
 
56
59
  ## Mode Detection
57
- Update mode if tasks exist. Re-derive from updated architecture.
60
+ Check for docs/implementation-plan.md. If it exists, operate in update mode:
61
+ read existing task list and diff against current architecture, user stories,
62
+ and specification documents. Preserve completed task statuses and existing
63
+ dependency relationships. Add new tasks for new stories or architecture
64
+ components. Re-derive wave plan if dependencies changed. Never remove tasks
65
+ that are in-progress or completed.
66
+
67
+ ## Update Mode Specifics
68
+ - **Detect prior artifact**: docs/implementation-plan.md exists
69
+ - **Preserve**: completed and in-progress task statuses, existing task IDs,
70
+ dependency relationships for stable tasks, wave assignments for tasks
71
+ already started, agent allocation history
72
+ - **Triggers for update**: architecture changed (new components need tasks),
73
+ user stories added or changed, security review identified new requirements,
74
+ operations runbook added deployment tasks, specification docs changed
75
+ - **Conflict resolution**: if architecture restructured a component that has
76
+ in-progress tasks, flag for user review rather than silently reassigning;
77
+ re-derive critical path only for unstarted tasks
@@ -2,9 +2,9 @@
2
2
  name: create-prd
3
3
  description: Create a product requirements document from a project idea
4
4
  phase: "pre"
5
- order: 1
5
+ order: 110
6
6
  dependencies: []
7
- outputs: [docs/prd.md]
7
+ outputs: [docs/plan.md]
8
8
  conditional: null
9
9
  knowledge-base: [prd-craft]
10
10
  ---
@@ -19,7 +19,7 @@ This is the foundation document that all subsequent phases reference.
19
19
  - Existing project files (if brownfield — any README, docs, or code)
20
20
 
21
21
  ## Expected Outputs
22
- - docs/prd.md — Product requirements document
22
+ - docs/plan.md — Product requirements document
23
23
 
24
24
  ## Quality Criteria
25
25
  - Problem statement is specific and testable (not vague aspirations)
@@ -40,6 +40,15 @@ This is the foundation document that all subsequent phases reference.
40
40
  phased delivery.
41
41
 
42
42
  ## Mode Detection
43
- If docs/prd.md exists, operate in update mode: read existing content, identify
43
+ If docs/plan.md exists, operate in update mode: read existing content, identify
44
44
  what has changed or been learned since it was written, propose targeted updates.
45
45
  Preserve existing decisions unless explicitly revisiting them.
46
+
47
+ ## Update Mode Specifics
48
+ - **Detect prior artifact**: docs/plan.md exists
49
+ - **Preserve**: problem statement, existing feature definitions, success criteria,
50
+ user personas, and scope boundaries unless user explicitly requests changes
51
+ - **Triggers for update**: user provides new requirements, scope adjustment
52
+ requested, constraints changed (timeline, budget, team), new user research
53
+ - **Conflict resolution**: new features are appended to the feature list with
54
+ clear versioning; changed constraints are documented with rationale for change
@@ -2,9 +2,9 @@
2
2
  name: innovate-prd
3
3
  description: Discover feature-level innovation opportunities in the PRD
4
4
  phase: "pre"
5
- order: 3
5
+ order: 130
6
6
  dependencies: [review-prd]
7
- outputs: [docs/prd-innovation.md]
7
+ outputs: [docs/prd-innovation.md, docs/reviews/prd-innovation/review-summary.md, docs/reviews/prd-innovation/codex-review.json, docs/reviews/prd-innovation/gemini-review.json]
8
8
  conditional: "if-needed"
9
9
  knowledge-base: [prd-innovation, prd-craft]
10
10
  ---
@@ -15,14 +15,21 @@ new capabilities, competitive positioning, and defensive product gaps. It is
15
15
  NOT UX-level enhancement (that belongs in user story innovation) — it focuses
16
16
  on whether the right features are in the PRD at all.
17
17
 
18
+ At depth 4+, dispatches to external AI models (Codex, Gemini) for
19
+ independent innovation brainstorming — different models surface different
20
+ creative opportunities and competitive insights.
21
+
18
22
  ## Inputs
19
- - docs/prd.md (required) — PRD to analyze for innovation opportunities
23
+ - docs/plan.md (required) — PRD to analyze for innovation opportunities
20
24
  - docs/reviews/pre-review-prd.md (optional) — review findings for context
21
25
 
22
26
  ## Expected Outputs
23
27
  - docs/prd-innovation.md — innovation findings, suggestions with cost/impact
24
28
  assessment, and disposition (accepted/rejected/deferred)
25
- - docs/prd.md — updated with approved innovations
29
+ - docs/plan.md — updated with approved innovations
30
+ - docs/reviews/prd-innovation/review-summary.md (depth 4+) — multi-model innovation synthesis
31
+ - docs/reviews/prd-innovation/codex-review.json (depth 4+, if available) — raw Codex suggestions
32
+ - docs/reviews/prd-innovation/gemini-review.json (depth 4+, if available) — raw Gemini suggestions
26
33
 
27
34
  ## Quality Criteria
28
35
  - Enhancements are feature-level, not UX-level polish
@@ -31,17 +38,39 @@ on whether the right features are in the PRD at all.
31
38
  - Approved innovations are documented to the same standard as existing features
32
39
  - PRD scope boundaries are respected — no uncontrolled scope creep
33
40
  - User approval is obtained before modifying the PRD
41
+ - (depth 4+) Multi-model suggestions deduplicated and synthesized with unique ideas from each model highlighted
34
42
 
35
43
  ## Methodology Scaling
36
44
  - **deep**: Full innovation pass across all categories (competitive research,
37
45
  UX gaps, AI-native opportunities, defensive product thinking). Cost/impact
38
- matrix. Detailed integration of approved innovations into PRD.
46
+ matrix. Detailed integration of approved innovations into PRD. Multi-model
47
+ innovation dispatched to Codex and Gemini if available, with graceful
48
+ fallback to Claude-only enhanced brainstorming.
39
49
  - **mvp**: Not applicable — this step is conditional and skipped in MVP.
40
50
  - **custom:depth(1-5)**: Depth 1-2: not typically enabled. Depth 3: quick scan
41
- for obvious gaps and missing expected features. Depth 4-5: full innovation
42
- pass with evaluation framework.
51
+ for obvious gaps and missing expected features. Depth 4: full innovation
52
+ pass + one external model (if CLI available). Depth 5: full innovation pass
53
+ + multi-model with deduplication and synthesis.
54
+
55
+ ## Conditional Evaluation
56
+ Enable when: project has a competitive landscape section in plan.md, user explicitly
57
+ requests an innovation pass, or the PRD review (review-prd) identifies feature gaps
58
+ or missing capabilities. Skip when: PRD is minimal/exploratory, depth < 3, or user
59
+ explicitly declines innovation.
43
60
 
44
61
  ## Mode Detection
45
62
  If docs/prd-innovation.md exists, this is a re-innovation pass. Read previous
46
63
  suggestions and their disposition (accepted/rejected/deferred), focus on new
47
- opportunities from PRD changes since last run.
64
+ opportunities from PRD changes since last run. If multi-model artifacts exist
65
+ under docs/reviews/prd-innovation/, preserve prior suggestion dispositions.
66
+
67
+ ## Update Mode Specifics
68
+ - **Detect prior artifact**: docs/prd-innovation.md exists with suggestion
69
+ dispositions
70
+ - **Preserve**: accepted/rejected/deferred dispositions from prior runs,
71
+ cost/impact assessments already reviewed by user, multi-model review artifacts
72
+ - **Triggers for update**: PRD scope changed (new features added or removed),
73
+ user requests re-evaluation of deferred suggestions, new external model
74
+ available for additional perspectives
75
+ - **Conflict resolution**: if a previously rejected suggestion is now relevant
76
+ due to PRD changes, re-propose with updated rationale referencing the change
@@ -2,9 +2,9 @@
2
2
  name: innovate-user-stories
3
3
  description: Discover UX-level enhancements and innovation opportunities in user stories
4
4
  phase: "pre"
5
- order: 6
5
+ order: 160
6
6
  dependencies: [review-user-stories]
7
- outputs: [docs/user-stories-innovation.md]
7
+ outputs: [docs/user-stories-innovation.md, docs/reviews/user-stories-innovation/review-summary.md, docs/reviews/user-stories-innovation/codex-review.json, docs/reviews/user-stories-innovation/gemini-review.json]
8
8
  conditional: "if-needed"
9
9
  knowledge-base: [user-stories, user-story-innovation]
10
10
  ---
@@ -16,14 +16,21 @@ innovation — `innovate-prd`) — it focuses on making existing features better
16
16
  through smart defaults,
17
17
  progressive disclosure, accessibility improvements, and AI-native capabilities.
18
18
 
19
+ At depth 4+, dispatches to external AI models (Codex, Gemini) for
20
+ independent UX innovation brainstorming — different models surface different
21
+ enhancement opportunities.
22
+
19
23
  ## Inputs
20
24
  - docs/user-stories.md (required) — stories to enhance
21
- - docs/prd.md (required) — PRD boundaries (innovation must not exceed scope)
25
+ - docs/plan.md (required) — PRD boundaries (innovation must not exceed scope)
22
26
 
23
27
  ## Expected Outputs
24
28
  - docs/user-stories-innovation.md — innovation findings, suggestions with
25
29
  cost/impact assessment, and disposition (accepted/rejected/deferred)
26
30
  - docs/user-stories.md — updated with approved enhancements
31
+ - docs/reviews/user-stories-innovation/review-summary.md (depth 4+) — multi-model innovation synthesis
32
+ - docs/reviews/user-stories-innovation/codex-review.json (depth 4+, if available) — raw Codex suggestions
33
+ - docs/reviews/user-stories-innovation/gemini-review.json (depth 4+, if available) — raw Gemini suggestions
27
34
 
28
35
  ## Quality Criteria
29
36
  - Enhancements are UX-level, not new features
@@ -31,17 +38,41 @@ progressive disclosure, accessibility improvements, and AI-native capabilities.
31
38
  - Each suggestion has a clear user benefit
32
39
  - Approved enhancements are integrated into existing stories (not new stories)
33
40
  - PRD scope boundaries are respected — no scope creep
41
+ - (depth 4+) Multi-model suggestions deduplicated and synthesized with unique ideas from each model highlighted
34
42
 
35
43
  ## Methodology Scaling
36
44
  - **deep**: Full innovation pass across all three categories (high-value
37
45
  low-effort, differentiators, defensive gaps). Cost/impact matrix.
38
- Detailed integration of approved enhancements into stories.
46
+ Detailed integration of approved enhancements into stories. Multi-model
47
+ innovation dispatched to Codex and Gemini if available, with graceful
48
+ fallback to Claude-only enhanced brainstorming.
39
49
  - **mvp**: Not applicable — this step is conditional and skipped in MVP.
40
50
  - **custom:depth(1-5)**: Depth 1-2: not typically enabled. Depth 3: quick
41
- scan for obvious improvements. Depth 4-5: full innovation pass with
42
- evaluation framework.
51
+ scan for obvious improvements. Depth 4: full innovation pass + one external
52
+ model (if CLI available). Depth 5: full innovation pass + multi-model with
53
+ deduplication and synthesis.
54
+
55
+ ## Conditional Evaluation
56
+ Enable when: user stories review identifies UX gaps, project targets a consumer-facing
57
+ audience, or progressive disclosure patterns would benefit users. Skip when: stories
58
+ are backend-only with no user-facing UI, depth < 3, or user explicitly declines
59
+ innovation.
43
60
 
44
61
  ## Mode Detection
45
62
  If docs/user-stories-innovation.md exists, this is a re-innovation pass. Read
46
63
  previous suggestions and their disposition (accepted/rejected), focus on new
47
- opportunities from story changes since last run.
64
+ opportunities from story changes since last run. If multi-model artifacts
65
+ exist under docs/reviews/user-stories-innovation/, preserve prior suggestion
66
+ dispositions.
67
+
68
+ ## Update Mode Specifics
69
+ - **Detect prior artifact**: docs/user-stories-innovation.md exists with
70
+ suggestion dispositions
71
+ - **Preserve**: accepted/rejected dispositions from prior runs, cost/impact
72
+ assessments already reviewed, multi-model review artifacts
73
+ - **Triggers for update**: user stories changed (new stories added, existing
74
+ stories rewritten), PRD innovation accepted new features that need UX
75
+ enhancement analysis
76
+ - **Conflict resolution**: if a previously rejected UX enhancement is now
77
+ relevant due to story changes, re-propose with updated rationale; never
78
+ re-suggest rejected enhancements without a material change in context
@@ -2,9 +2,9 @@
2
2
  name: review-prd
3
3
  description: Multi-pass review of the PRD for completeness, clarity, and downstream readiness
4
4
  phase: "pre"
5
- order: 2
5
+ order: 120
6
6
  dependencies: [create-prd]
7
- outputs: [docs/reviews/pre-review-prd.md]
7
+ outputs: [docs/reviews/pre-review-prd.md, docs/reviews/prd/review-summary.md, docs/reviews/prd/codex-review.json, docs/reviews/prd/gemini-review.json]
8
8
  conditional: null
9
9
  knowledge-base: [review-methodology, review-prd, prd-craft, gap-analysis]
10
10
  ---
@@ -15,13 +15,19 @@ product requirements artifacts. Identify issues, create a fix plan, execute
15
15
  fixes, and re-validate. Ensures the PRD is complete, clear, consistent, and
16
16
  ready for User Stories to consume.
17
17
 
18
+ At depth 4+, dispatches to external AI models (Codex, Gemini) for
19
+ independent review validation.
20
+
18
21
  ## Inputs
19
- - docs/prd.md (required) — PRD to review
22
+ - docs/plan.md (required) — PRD to review
20
23
  - Project idea or brief (context from user, if available)
21
24
 
22
25
  ## Expected Outputs
23
26
  - docs/reviews/pre-review-prd.md — review findings, fix plan, and resolution log
24
- - docs/prd.md — updated with fixes
27
+ - docs/plan.md — updated with fixes
28
+ - docs/reviews/prd/review-summary.md (depth 4+) — multi-model review synthesis
29
+ - docs/reviews/prd/codex-review.json (depth 4+, if available) — raw Codex findings
30
+ - docs/reviews/prd/gemini-review.json (depth 4+, if available) — raw Gemini findings
25
31
 
26
32
  ## Quality Criteria
27
33
  - All review passes executed with findings documented
@@ -29,16 +35,22 @@ ready for User Stories to consume.
29
35
  - Fix plan created for P0 and P1 findings
30
36
  - Fixes applied and re-validated
31
37
  - Downstream readiness confirmed (User Stories can proceed)
38
+ - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
32
39
 
33
40
  ## Methodology Scaling
34
41
  - **deep**: All 8 review passes from the knowledge base. Full findings report
35
- with severity categorization. Fixes applied and re-validated.
42
+ with severity categorization. Fixes applied and re-validated. Multi-model
43
+ review dispatched to Codex and Gemini if available, with graceful fallback
44
+ to Claude-only enhanced review.
36
45
  - **mvp**: Passes 1-2 only (Problem Statement Rigor, Persona Coverage). Focus
37
46
  on blocking gaps — requirements too vague to write stories from.
38
47
  - **custom:depth(1-5)**: Depth 1-2: passes 1-2 only (Problem Statement Rigor,
39
48
  Persona Coverage). Depth 3: passes 1-4 (add Feature Scoping, Success
40
- Criteria). Depth 4-5: all 8 passes.
49
+ Criteria). Depth 4: all 8 passes + one external model review (if CLI
50
+ available). Depth 5: all 8 passes + multi-model review with reconciliation.
41
51
 
42
52
  ## Mode Detection
43
53
  If docs/reviews/pre-review-prd.md exists, this is a re-review. Read previous
44
54
  findings, check which were addressed, run review passes again on updated PRD.
55
+ If multi-model review artifacts exist under docs/reviews/prd/, preserve prior
56
+ findings still valid.
@@ -2,9 +2,9 @@
2
2
  name: review-user-stories
3
3
  description: Multi-pass review of user stories for PRD coverage, quality, and downstream readiness
4
4
  phase: "pre"
5
- order: 5
5
+ order: 150
6
6
  dependencies: [user-stories]
7
- outputs: [docs/reviews/pre-review-user-stories.md]
7
+ outputs: [docs/reviews/pre-review-user-stories.md, docs/reviews/user-stories/requirements-index.md, docs/reviews/user-stories/coverage.json, docs/reviews/user-stories/review-summary.md]
8
8
  conditional: null
9
9
  knowledge-base: [review-methodology, review-user-stories]
10
10
  ---
@@ -14,13 +14,23 @@ Deep multi-pass review of user stories, targeting failure modes specific to
14
14
  story artifacts. Identify coverage gaps, quality issues, and downstream
15
15
  readiness problems. Create a fix plan, execute fixes, and re-validate.
16
16
 
17
+ At higher depths, builds a formal requirements index with traceability matrix
18
+ and optionally dispatches to external AI models (Codex, Gemini) for
19
+ independent coverage validation.
20
+
17
21
  ## Inputs
18
22
  - docs/user-stories.md (required) — stories to review
19
- - docs/prd.md (required) — source requirements for coverage checking
23
+ - docs/plan.md (required) — source requirements for coverage checking
24
+ - docs/reviews/user-stories/ artifacts (optional) — prior review findings in update mode
20
25
 
21
26
  ## Expected Outputs
22
27
  - docs/reviews/pre-review-user-stories.md — review findings, fix plan, and resolution log
23
28
  - docs/user-stories.md — updated with fixes
29
+ - docs/reviews/user-stories/requirements-index.md (depth 4+) — atomic requirements
30
+ extracted from PRD with REQ-xxx IDs
31
+ - docs/reviews/user-stories/coverage.json (depth 4+) — requirement-to-story mapping
32
+ - docs/reviews/user-stories/review-summary.md (depth 5) — multi-model review
33
+ synthesis with coverage verification
24
34
 
25
35
  ## Quality Criteria
26
36
  - All review passes executed with findings documented
@@ -28,16 +38,23 @@ readiness problems. Create a fix plan, execute fixes, and re-validate.
28
38
  - Fix plan created for P0 and P1 findings
29
39
  - Fixes applied and re-validated
30
40
  - Downstream readiness confirmed (modeling phase can proceed)
41
+ - (depth 4+) Every atomic PRD requirement has a REQ-xxx ID in the requirements index
42
+ - (depth 4+) Coverage matrix maps every REQ to at least one US (100% coverage target)
43
+ - (depth 5) Multi-model findings synthesized with consensus/disagreement analysis
31
44
 
32
45
  ## Methodology Scaling
33
46
  - **deep**: All 6 review passes from the knowledge base. Full findings report
34
- with severity categorization. Fixes applied and re-validated.
47
+ with severity categorization. Fixes applied and re-validated. Requirements
48
+ index and coverage matrix built. Multi-model review dispatched to Codex and
49
+ Gemini if available, with graceful fallback to Claude-only enhanced review.
35
50
  - **mvp**: Pass 1 only (PRD coverage). Focus on blocking gaps — PRD features
36
51
  with no corresponding story.
37
52
  - **custom:depth(1-5)**: Depth 1: pass 1 only. Depth 2: passes 1-2.
38
- Depth 3: passes 1-4. Depth 4-5: all 6 passes.
53
+ Depth 3: passes 1-4. Depth 4: all 6 passes + requirements index + coverage
54
+ matrix. Depth 5: all of depth 4 + multi-model review (if CLIs available).
39
55
 
40
56
  ## Mode Detection
41
57
  If docs/reviews/pre-review-user-stories.md exists, this is a re-review. Read
42
58
  previous findings, check which were addressed, run review passes again on
43
- updated stories.
59
+ updated stories. If docs/reviews/user-stories/requirements-index.md exists,
60
+ preserve requirement IDs — never renumber REQ-xxx IDs.
@@ -2,7 +2,7 @@
2
2
  name: user-stories
3
3
  description: Translate PRD features into user stories with acceptance criteria
4
4
  phase: "pre"
5
- order: 4
5
+ order: 140
6
6
  dependencies: [review-prd]
7
7
  outputs: [docs/user-stories.md]
8
8
  conditional: null
@@ -16,7 +16,7 @@ that are testable and specific enough to drive domain modeling, UX design, and
16
16
  task decomposition downstream.
17
17
 
18
18
  ## Inputs
19
- - docs/prd.md (required) — features, personas, and requirements to translate
19
+ - docs/plan.md (required) — features, personas, and requirements to translate
20
20
  - docs/reviews/pre-review-prd.md (optional) — review findings for context
21
21
  - docs/prd-innovation.md (optional) — innovation findings and approved enhancements
22
22
 
@@ -46,3 +46,13 @@ task decomposition downstream.
46
46
  If docs/user-stories.md exists, operate in update mode: read existing stories,
47
47
  identify changes needed based on updated PRD, categorize as ADD/RESTRUCTURE/
48
48
  PRESERVE, get approval before modifying. Preserve existing story IDs.
49
+
50
+ ## Update Mode Specifics
51
+ - **Detect prior artifact**: docs/user-stories.md exists
52
+ - **Preserve**: existing story IDs, epic groupings, acceptance criteria that
53
+ haven't been invalidated, story-to-PRD-feature traceability
54
+ - **Triggers for update**: PRD features added or changed, innovation suggestions
55
+ accepted, user personas expanded, review findings require story adjustments
56
+ - **Conflict resolution**: never reuse a retired story ID; if a story's scope
57
+ changed, update its acceptance criteria in-place rather than creating a
58
+ duplicate story