npm - @zigrivers/scaffold - Versions diffs - 2.28.1 → 2.38.1 - Mend

@zigrivers/scaffold 2.28.1 → 2.38.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (375) hide show

package/README.md +309 -136
package/dist/cli/commands/build.d.ts.map +1 -1
package/dist/cli/commands/build.js +94 -14
package/dist/cli/commands/build.js.map +1 -1
package/dist/cli/commands/build.test.js +30 -5
package/dist/cli/commands/build.test.js.map +1 -1
package/dist/cli/commands/check.d.ts +12 -0
package/dist/cli/commands/check.d.ts.map +1 -0
package/dist/cli/commands/check.js +311 -0
package/dist/cli/commands/check.js.map +1 -0
package/dist/cli/commands/check.test.d.ts +2 -0
package/dist/cli/commands/check.test.d.ts.map +1 -0
package/dist/cli/commands/check.test.js +412 -0
package/dist/cli/commands/check.test.js.map +1 -0
package/dist/cli/commands/complete.d.ts +12 -0
package/dist/cli/commands/complete.d.ts.map +1 -0
package/dist/cli/commands/complete.js +103 -0
package/dist/cli/commands/complete.js.map +1 -0
package/dist/cli/commands/complete.test.d.ts +2 -0
package/dist/cli/commands/complete.test.d.ts.map +1 -0
package/dist/cli/commands/complete.test.js +133 -0
package/dist/cli/commands/complete.test.js.map +1 -0
package/dist/cli/commands/dashboard.d.ts.map +1 -1
package/dist/cli/commands/dashboard.js +12 -8
package/dist/cli/commands/dashboard.js.map +1 -1
package/dist/cli/commands/info.d.ts.map +1 -1
package/dist/cli/commands/info.js +4 -0
package/dist/cli/commands/info.js.map +1 -1
package/dist/cli/commands/knowledge.d.ts.map +1 -1
package/dist/cli/commands/knowledge.js +6 -2
package/dist/cli/commands/knowledge.js.map +1 -1
package/dist/cli/commands/knowledge.test.js +16 -11
package/dist/cli/commands/knowledge.test.js.map +1 -1
package/dist/cli/commands/next.d.ts.map +1 -1
package/dist/cli/commands/next.js +41 -13
package/dist/cli/commands/next.js.map +1 -1
package/dist/cli/commands/next.test.js +3 -0
package/dist/cli/commands/next.test.js.map +1 -1
package/dist/cli/commands/reset.d.ts +1 -0
package/dist/cli/commands/reset.d.ts.map +1 -1
package/dist/cli/commands/reset.js +179 -67
package/dist/cli/commands/reset.js.map +1 -1
package/dist/cli/commands/reset.test.js +360 -0
package/dist/cli/commands/reset.test.js.map +1 -1
package/dist/cli/commands/rework.d.ts +20 -0
package/dist/cli/commands/rework.d.ts.map +1 -0
package/dist/cli/commands/rework.js +332 -0
package/dist/cli/commands/rework.js.map +1 -0
package/dist/cli/commands/rework.test.d.ts +2 -0
package/dist/cli/commands/rework.test.d.ts.map +1 -0
package/dist/cli/commands/rework.test.js +297 -0
package/dist/cli/commands/rework.test.js.map +1 -0
package/dist/cli/commands/run.d.ts.map +1 -1
package/dist/cli/commands/run.js +59 -31
package/dist/cli/commands/run.js.map +1 -1
package/dist/cli/commands/run.test.js +288 -6
package/dist/cli/commands/run.test.js.map +1 -1
package/dist/cli/commands/skill.d.ts +12 -0
package/dist/cli/commands/skill.d.ts.map +1 -0
package/dist/cli/commands/skill.js +123 -0
package/dist/cli/commands/skill.js.map +1 -0
package/dist/cli/commands/skill.test.d.ts +2 -0
package/dist/cli/commands/skill.test.d.ts.map +1 -0
package/dist/cli/commands/skill.test.js +297 -0
package/dist/cli/commands/skill.test.js.map +1 -0
package/dist/cli/commands/skip.d.ts +1 -1
package/dist/cli/commands/skip.d.ts.map +1 -1
package/dist/cli/commands/skip.js +123 -57
package/dist/cli/commands/skip.js.map +1 -1
package/dist/cli/commands/skip.test.js +91 -0
package/dist/cli/commands/skip.test.js.map +1 -1
package/dist/cli/commands/status.d.ts +1 -0
package/dist/cli/commands/status.d.ts.map +1 -1
package/dist/cli/commands/status.js +57 -10
package/dist/cli/commands/status.js.map +1 -1
package/dist/cli/commands/status.test.js +81 -0
package/dist/cli/commands/status.test.js.map +1 -1
package/dist/cli/commands/update.test.js +252 -0
package/dist/cli/commands/update.test.js.map +1 -1
package/dist/cli/commands/version.test.js +171 -1
package/dist/cli/commands/version.test.js.map +1 -1
package/dist/cli/index.d.ts.map +1 -1
package/dist/cli/index.js +8 -0
package/dist/cli/index.js.map +1 -1
package/dist/core/adapters/adapter.d.ts +14 -0
package/dist/core/adapters/adapter.d.ts.map +1 -1
package/dist/core/adapters/adapter.js.map +1 -1
package/dist/core/adapters/adapter.test.js +10 -0
package/dist/core/adapters/adapter.test.js.map +1 -1
package/dist/core/adapters/claude-code.d.ts.map +1 -1
package/dist/core/adapters/claude-code.js +47 -10
package/dist/core/adapters/claude-code.js.map +1 -1
package/dist/core/adapters/claude-code.test.js +41 -20
package/dist/core/adapters/claude-code.test.js.map +1 -1
package/dist/core/adapters/codex.d.ts.map +1 -1
package/dist/core/adapters/codex.js +5 -1
package/dist/core/adapters/codex.js.map +1 -1
package/dist/core/adapters/codex.test.js +5 -0
package/dist/core/adapters/codex.test.js.map +1 -1
package/dist/core/adapters/universal.d.ts.map +1 -1
package/dist/core/adapters/universal.js +0 -1
package/dist/core/adapters/universal.js.map +1 -1
package/dist/core/adapters/universal.test.js +5 -0
package/dist/core/adapters/universal.test.js.map +1 -1
package/dist/core/assembly/context-gatherer.d.ts.map +1 -1
package/dist/core/assembly/context-gatherer.js +5 -2
package/dist/core/assembly/context-gatherer.js.map +1 -1
package/dist/core/assembly/engine.d.ts.map +1 -1
package/dist/core/assembly/engine.js +10 -2
package/dist/core/assembly/engine.js.map +1 -1
package/dist/core/assembly/engine.test.js +19 -0
package/dist/core/assembly/engine.test.js.map +1 -1
package/dist/core/assembly/knowledge-loader.d.ts +25 -0
package/dist/core/assembly/knowledge-loader.d.ts.map +1 -1
package/dist/core/assembly/knowledge-loader.js +75 -2
package/dist/core/assembly/knowledge-loader.js.map +1 -1
package/dist/core/assembly/knowledge-loader.test.js +388 -1
package/dist/core/assembly/knowledge-loader.test.js.map +1 -1
package/dist/core/assembly/meta-prompt-loader.d.ts +6 -0
package/dist/core/assembly/meta-prompt-loader.d.ts.map +1 -1
package/dist/core/assembly/meta-prompt-loader.js +41 -25
package/dist/core/assembly/meta-prompt-loader.js.map +1 -1
package/dist/core/assembly/preset-loader.d.ts +10 -0
package/dist/core/assembly/preset-loader.d.ts.map +1 -1
package/dist/core/assembly/preset-loader.js +26 -1
package/dist/core/assembly/preset-loader.js.map +1 -1
package/dist/core/assembly/preset-loader.test.js +65 -1
package/dist/core/assembly/preset-loader.test.js.map +1 -1
package/dist/core/assembly/update-mode.d.ts.map +1 -1
package/dist/core/assembly/update-mode.js +10 -4
package/dist/core/assembly/update-mode.js.map +1 -1
package/dist/core/assembly/update-mode.test.js +47 -0
package/dist/core/assembly/update-mode.test.js.map +1 -1
package/dist/core/dependency/dependency.d.ts.map +1 -1
package/dist/core/dependency/dependency.js +3 -2
package/dist/core/dependency/dependency.js.map +1 -1
package/dist/core/dependency/dependency.test.js +2 -0
package/dist/core/dependency/dependency.test.js.map +1 -1
package/dist/core/dependency/eligibility.js +3 -3
package/dist/core/dependency/eligibility.js.map +1 -1
package/dist/core/dependency/eligibility.test.js +2 -0
package/dist/core/dependency/eligibility.test.js.map +1 -1
package/dist/core/dependency/graph.d.ts.map +1 -1
package/dist/core/dependency/graph.js +4 -0
package/dist/core/dependency/graph.js.map +1 -1
package/dist/core/dependency/graph.test.d.ts +2 -0
package/dist/core/dependency/graph.test.d.ts.map +1 -0
package/dist/core/dependency/graph.test.js +262 -0
package/dist/core/dependency/graph.test.js.map +1 -0
package/dist/core/rework/phase-selector.d.ts +24 -0
package/dist/core/rework/phase-selector.d.ts.map +1 -0
package/dist/core/rework/phase-selector.js +98 -0
package/dist/core/rework/phase-selector.js.map +1 -0
package/dist/core/rework/phase-selector.test.d.ts +2 -0
package/dist/core/rework/phase-selector.test.d.ts.map +1 -0
package/dist/core/rework/phase-selector.test.js +138 -0
package/dist/core/rework/phase-selector.test.js.map +1 -0
package/dist/dashboard/generator.d.ts +48 -17
package/dist/dashboard/generator.d.ts.map +1 -1
package/dist/dashboard/generator.js +75 -5
package/dist/dashboard/generator.js.map +1 -1
package/dist/dashboard/generator.test.js +213 -5
package/dist/dashboard/generator.test.js.map +1 -1
package/dist/dashboard/template.d.ts +1 -1
package/dist/dashboard/template.d.ts.map +1 -1
package/dist/dashboard/template.js +755 -114
package/dist/dashboard/template.js.map +1 -1
package/dist/e2e/knowledge.test.js +4 -3
package/dist/e2e/knowledge.test.js.map +1 -1
package/dist/e2e/pipeline.test.js +2 -0
package/dist/e2e/pipeline.test.js.map +1 -1
package/dist/e2e/rework.test.d.ts +6 -0
package/dist/e2e/rework.test.d.ts.map +1 -0
package/dist/e2e/rework.test.js +226 -0
package/dist/e2e/rework.test.js.map +1 -0
package/dist/index.js +0 -0
package/dist/project/adopt.test.js +2 -0
package/dist/project/adopt.test.js.map +1 -1
package/dist/project/claude-md.js +2 -2
package/dist/project/claude-md.js.map +1 -1
package/dist/project/claude-md.test.js +4 -4
package/dist/project/claude-md.test.js.map +1 -1
package/dist/project/detector.d.ts.map +1 -1
package/dist/project/detector.js +4 -1
package/dist/project/detector.js.map +1 -1
package/dist/project/frontmatter.d.ts.map +1 -1
package/dist/project/frontmatter.js +54 -15
package/dist/project/frontmatter.js.map +1 -1
package/dist/project/frontmatter.test.js +2 -2
package/dist/project/frontmatter.test.js.map +1 -1
package/dist/state/rework-manager.d.ts +16 -0
package/dist/state/rework-manager.d.ts.map +1 -0
package/dist/state/rework-manager.js +126 -0
package/dist/state/rework-manager.js.map +1 -0
package/dist/state/rework-manager.test.d.ts +2 -0
package/dist/state/rework-manager.test.d.ts.map +1 -0
package/dist/state/rework-manager.test.js +191 -0
package/dist/state/rework-manager.test.js.map +1 -0
package/dist/state/state-manager.d.ts +13 -0
package/dist/state/state-manager.d.ts.map +1 -1
package/dist/state/state-manager.js +39 -2
package/dist/state/state-manager.js.map +1 -1
package/dist/state/state-manager.test.js +74 -1
package/dist/state/state-manager.test.js.map +1 -1
package/dist/state/state-migration.d.ts +23 -0
package/dist/state/state-migration.d.ts.map +1 -0
package/dist/state/state-migration.js +144 -0
package/dist/state/state-migration.js.map +1 -0
package/dist/state/state-migration.test.d.ts +2 -0
package/dist/state/state-migration.test.d.ts.map +1 -0
package/dist/state/state-migration.test.js +451 -0
package/dist/state/state-migration.test.js.map +1 -0
package/dist/types/assembly.d.ts +2 -0
package/dist/types/assembly.d.ts.map +1 -1
package/dist/types/dependency.d.ts +2 -2
package/dist/types/dependency.d.ts.map +1 -1
package/dist/types/frontmatter.d.ts +100 -7
package/dist/types/frontmatter.d.ts.map +1 -1
package/dist/types/frontmatter.js +89 -1
package/dist/types/frontmatter.js.map +1 -1
package/dist/types/index.d.ts +1 -0
package/dist/types/index.d.ts.map +1 -1
package/dist/types/index.js +1 -0
package/dist/types/index.js.map +1 -1
package/dist/types/lock.d.ts +1 -1
package/dist/types/lock.d.ts.map +1 -1
package/dist/types/rework.d.ts +36 -0
package/dist/types/rework.d.ts.map +1 -0
package/dist/types/rework.js +2 -0
package/dist/types/rework.js.map +1 -0
package/dist/utils/errors.d.ts +1 -0
package/dist/utils/errors.d.ts.map +1 -1
package/dist/utils/errors.js +8 -0
package/dist/utils/errors.js.map +1 -1
package/dist/utils/fs.d.ts +6 -0
package/dist/utils/fs.d.ts.map +1 -1
package/dist/utils/fs.js +13 -0
package/dist/utils/fs.js.map +1 -1
package/dist/validation/config-validator.test.d.ts +2 -0
package/dist/validation/config-validator.test.d.ts.map +1 -0
package/dist/validation/config-validator.test.js +210 -0
package/dist/validation/config-validator.test.js.map +1 -0
package/dist/validation/dependency-validator.test.d.ts +2 -0
package/dist/validation/dependency-validator.test.d.ts.map +1 -0
package/dist/validation/dependency-validator.test.js +215 -0
package/dist/validation/dependency-validator.test.js.map +1 -0
package/dist/validation/frontmatter-validator.test.d.ts +2 -0
package/dist/validation/frontmatter-validator.test.d.ts.map +1 -0
package/dist/validation/frontmatter-validator.test.js +371 -0
package/dist/validation/frontmatter-validator.test.js.map +1 -0
package/dist/validation/state-validator.test.d.ts +2 -0
package/dist/validation/state-validator.test.d.ts.map +1 -0
package/dist/validation/state-validator.test.js +325 -0
package/dist/validation/state-validator.test.js.map +1 -0
package/dist/wizard/suggestion.test.d.ts +2 -0
package/dist/wizard/suggestion.test.d.ts.map +1 -0
package/dist/wizard/suggestion.test.js +115 -0
package/dist/wizard/suggestion.test.js.map +1 -0
package/dist/wizard/wizard.d.ts.map +1 -1
package/dist/wizard/wizard.js +34 -1
package/dist/wizard/wizard.js.map +1 -1
package/knowledge/core/adr-craft.md +4 -0
package/knowledge/core/api-design.md +4 -0
package/knowledge/core/automated-review-tooling.md +203 -0
package/knowledge/core/coding-conventions.md +1 -1
package/knowledge/core/database-design.md +4 -0
package/knowledge/core/design-system-tokens.md +4 -0
package/knowledge/core/domain-modeling.md +4 -0
package/knowledge/core/git-workflow-patterns.md +200 -0
package/knowledge/core/operations-runbook.md +5 -1
package/knowledge/core/security-best-practices.md +4 -0
package/knowledge/core/system-architecture.md +5 -1
package/knowledge/core/task-decomposition.md +118 -3
package/knowledge/core/user-story-innovation.md +13 -0
package/knowledge/core/ux-specification.md +13 -0
package/knowledge/execution/enhancement-workflow.md +201 -0
package/knowledge/execution/task-claiming-strategy.md +130 -0
package/knowledge/execution/tdd-execution-loop.md +172 -0
package/knowledge/execution/worktree-management.md +205 -0
package/knowledge/finalization/apply-fixes-and-freeze.md +12 -0
package/knowledge/finalization/developer-onboarding.md +4 -0
package/knowledge/finalization/implementation-playbook.md +83 -5
package/knowledge/product/gap-analysis.md +5 -1
package/knowledge/product/prd-innovation.md +12 -0
package/knowledge/product/vision-craft.md +213 -0
package/knowledge/review/review-adr.md +12 -0
package/knowledge/review/review-api-design.md +13 -0
package/knowledge/review/review-database-design.md +13 -0
package/knowledge/review/review-domain-modeling.md +5 -1
package/knowledge/review/review-implementation-tasks.md +58 -1
package/knowledge/review/review-methodology.md +11 -0
package/knowledge/review/review-operations.md +12 -0
package/knowledge/review/review-prd.md +13 -0
package/knowledge/review/review-security.md +12 -0
package/knowledge/review/review-system-architecture.md +4 -2
package/knowledge/review/review-testing-strategy.md +11 -0
package/knowledge/review/review-user-stories.md +11 -0
package/knowledge/review/review-ux-specification.md +13 -1
package/knowledge/review/review-vision.md +255 -0
package/knowledge/tools/release-management.md +222 -0
package/knowledge/tools/session-analysis.md +215 -0
package/knowledge/tools/version-strategy.md +200 -0
package/knowledge/validation/critical-path-analysis.md +1 -1
package/knowledge/validation/cross-phase-consistency.md +12 -0
package/knowledge/validation/decision-completeness.md +13 -1
package/knowledge/validation/dependency-validation.md +12 -0
package/knowledge/validation/scope-management.md +12 -0
package/knowledge/validation/traceability.md +12 -0
package/methodology/README.md +37 -0
package/methodology/custom-defaults.yml +12 -1
package/methodology/deep.yml +11 -0
package/methodology/mvp.yml +11 -0
package/package.json +3 -3
package/pipeline/architecture/review-architecture.md +18 -7
package/pipeline/architecture/system-architecture.md +11 -8
package/pipeline/build/multi-agent-resume.md +245 -0
package/pipeline/build/multi-agent-start.md +236 -0
package/pipeline/build/new-enhancement.md +456 -0
package/pipeline/build/quick-task.md +381 -0
package/pipeline/build/single-agent-resume.md +210 -0
package/pipeline/build/single-agent-start.md +207 -0
package/pipeline/consolidation/claude-md-optimization.md +11 -8
package/pipeline/consolidation/workflow-audit.md +15 -11
package/pipeline/decisions/adrs.md +7 -5
package/pipeline/decisions/review-adrs.md +14 -6
package/pipeline/environment/ai-memory-setup.md +18 -12
package/pipeline/environment/automated-pr-review.md +10 -4
package/pipeline/environment/design-system.md +9 -7
package/pipeline/environment/dev-env-setup.md +8 -5
package/pipeline/environment/git-workflow.md +3 -1
package/pipeline/finalization/apply-fixes-and-freeze.md +16 -5
package/pipeline/finalization/developer-onboarding-guide.md +22 -8
package/pipeline/finalization/implementation-playbook.md +40 -11
package/pipeline/foundation/beads.md +10 -7
package/pipeline/foundation/coding-standards.md +6 -3
package/pipeline/foundation/project-structure.md +5 -1
package/pipeline/foundation/tdd.md +10 -6
package/pipeline/foundation/tech-stack.md +9 -9
package/pipeline/integration/add-e2e-testing.md +21 -6
package/pipeline/modeling/domain-modeling.md +10 -7
package/pipeline/modeling/review-domain-modeling.md +17 -6
package/pipeline/parity/platform-parity-review.md +31 -11
package/pipeline/planning/implementation-plan-review.md +21 -10
package/pipeline/planning/implementation-plan.md +52 -19
package/pipeline/pre/create-prd.md +22 -7
package/pipeline/pre/innovate-prd.md +10 -8
package/pipeline/pre/innovate-user-stories.md +9 -7
package/pipeline/pre/review-prd.md +11 -2
package/pipeline/pre/review-user-stories.md +12 -3
package/pipeline/pre/user-stories.md +12 -7
package/pipeline/quality/create-evals.md +10 -6
package/pipeline/quality/operations.md +16 -12
package/pipeline/quality/review-operations.md +19 -10
package/pipeline/quality/review-security.md +21 -11
package/pipeline/quality/review-testing.md +23 -12
package/pipeline/quality/security.md +17 -13
package/pipeline/quality/story-tests.md +6 -4
package/pipeline/specification/api-contracts.md +11 -6
package/pipeline/specification/database-schema.md +12 -6
package/pipeline/specification/review-api.md +18 -9
package/pipeline/specification/review-database.md +18 -9
package/pipeline/specification/review-ux.md +20 -10
package/pipeline/specification/ux-spec.md +8 -5
package/pipeline/validation/critical-path-walkthrough.md +14 -7
package/pipeline/validation/cross-phase-consistency.md +14 -7
package/pipeline/validation/decision-completeness.md +14 -7
package/pipeline/validation/dependency-graph-validation.md +15 -7
package/pipeline/validation/implementability-dry-run.md +15 -7
package/pipeline/validation/scope-creep-check.md +15 -7
package/pipeline/validation/traceability-matrix.md +20 -7
package/pipeline/vision/create-vision.md +267 -0
package/pipeline/vision/innovate-vision.md +157 -0
package/pipeline/vision/review-vision.md +149 -0
package/skills/scaffold-pipeline/SKILL.md +33 -18
package/skills/scaffold-runner/SKILL.md +172 -18

package/pipeline/planning/implementation-plan.md CHANGED Viewed

@@ -1,11 +1,12 @@
 ---
 name: implementation-plan
 description: Break architecture into implementable tasks with dependencies
+summary: "Breaks your user stories and architecture into concrete tasks — each scoped to ~150 lines of code and 3 files max, with clear acceptance criteria, no ambiguous decisions, and explicit dependencies."
 phase: "planning"
 order: 1210
 dependencies: [tdd, operations, security, review-architecture, create-evals]
 outputs: [docs/implementation-plan.md]
-reads: [create-prd]
+reads: [create-prd, story-tests, database-schema, api-contracts, ux-spec]
 conditional: null
 knowledge-base: [task-decomposition]
 ---
@@ -17,9 +18,9 @@ have clear inputs/outputs, and be small enough for a single agent session.
 The primary mapping is Story → Task(s), with PRD as the traceability root.
 ## Inputs
-- docs/system-architecture.md (required) — components to implement
-- docs/domain-models/ (required) — domain logic to implement
-- docs/adrs/ (required) — technology constraints
+- docs/system-architecture.md (optional — not available in MVP) — components to implement
+- docs/domain-models/ (optional — not available in MVP) — domain logic to implement
+- docs/adrs/ (optional — not available in MVP) — technology constraints
 - docs/plan.md (required) — features to trace tasks back to
 - docs/user-stories.md (required) — stories to derive tasks from
 - docs/tdd-standards.md (required) — testing requirements to incorporate into tasks
@@ -28,31 +29,41 @@ The primary mapping is Story → Task(s), with PRD as the traceability root.
 - docs/database-schema.md (optional) — data layer tasks
 - docs/api-contracts.md (optional) — API implementation tasks
 - docs/ux-spec.md (optional) — frontend tasks
+- tests/acceptance/ (optional) — test skeletons to reference in task descriptions
+- docs/story-tests-map.md (optional) — AC-to-test mapping for task coverage verification
 ## Expected Outputs
 - docs/implementation-plan.md — task list with dependencies, sizing, and
   assignment recommendations
 ## Quality Criteria
-- Every architecture component has implementation tasks
-- Task dependencies form a valid DAG (no cycles)
-- Each task is scoped for a single agent session (not too large, not too small)
-- Tasks include acceptance criteria (how to know it's done)
-- Tasks incorporate testing requirements from the testing strategy
-- Tasks incorporate security controls from the security review where applicable
-- Tasks incorporate operational requirements (monitoring, deployment) where applicable
-- Critical path is identified
-- Parallelization opportunities are marked with wave plan
-- Every user story maps to at least one task
-- High-risk tasks are flagged with risk type and mitigation
-- Wave summary produced with agent allocation recommendation
+- (mvp) Every architecture component has implementation tasks
+- (mvp) Task dependencies form a valid DAG (no cycles)
+- (mvp) Each task produces ~150 lines of net-new application code (excluding tests and generated files)
+- (mvp) Tasks include acceptance criteria (how to know it's done)
+- (mvp) Tasks incorporate testing requirements from the testing strategy
+- (deep) Tasks reference corresponding test skeletons from tests/acceptance/ where applicable
+- (deep) Tasks incorporate security controls from the security review where applicable
+- (deep) Tasks incorporate operational requirements (monitoring, deployment) where applicable
+- (deep) Critical path is identified
+- (deep) Parallelization opportunities are marked with wave plan
+- (mvp) Every user story maps to at least one task
+- (deep) High-risk tasks are flagged with risk type and mitigation
+- (deep) Wave summary produced with agent allocation recommendation
+- (mvp) No task modifies more than 3 application files (test files excluded; exceptions require justification)
+- (mvp) No task contains unresolved design decisions (agents implement, they don't architect)
+- (mvp) Every code-producing task includes co-located test requirements
+- (deep) Critical path identified with estimated total duration
 ## Methodology Scaling
 - **deep**: Detailed task breakdown with story-to-task tracing. Dependency graph.
   Sizing estimates. Parallelization plan. Agent context requirements per task.
   Phased delivery milestones.
-- **mvp**: Ordered task list with brief descriptions. Key dependencies noted.
-  Enough to start working sequentially.
+- **mvp**: Ordered task list derived from PRD features and user stories only
+  (architecture, domain models, and ADRs are not available at this depth).
+  Each task has a brief description, rough size estimate, and key dependency.
+  Enough to start working sequentially. Skip architecture decomposition —
+  work directly from user story acceptance criteria.
 - **custom:depth(1-5)**: Depth 1-2: ordered list. Depth 3: add dependencies
   and sizing. Depth 4-5: full breakdown with parallelization.
@@ -68,10 +79,32 @@ that are in-progress or completed.
 - **Detect prior artifact**: docs/implementation-plan.md exists
 - **Preserve**: completed and in-progress task statuses, existing task IDs,
   dependency relationships for stable tasks, wave assignments for tasks
-  already started, agent allocation history
+  already started, agent allocation history, architecture decisions,
+  component boundaries
 - **Triggers for update**: architecture changed (new components need tasks),
   user stories added or changed, security review identified new requirements,
   operations runbook added deployment tasks, specification docs changed
 - **Conflict resolution**: if architecture restructured a component that has
   in-progress tasks, flag for user review rather than silently reassigning;
   re-derive critical path only for unstarted tasks
+## Task Size Constraints
+Before finalizing the implementation plan, scan every task against the five agent
+executability rules from the task-decomposition knowledge base:
+1. **Three-File Rule** — Count application files each task modifies (exclude test files).
+   Any task touching 4+ files must be split by layer or concern.
+2. **150-Line Budget** — Estimate net-new application code lines per task. Any task
+   likely to produce 200+ lines must be split by feature slice or entity.
+3. **Single-Concern Rule** — Check each task description for "and" connecting unrelated
+   work. Split if the task spans multiple architectural layers or feature domains.
+4. **Decision-Free Execution** — Verify all design decisions are resolved in the task
+   description. No "choose", "determine", "decide", or "evaluate options" language.
+   Resolve decisions inline before presenting the plan.
+5. **Test Co-location** — Confirm every code-producing task includes its test
+   requirements. No "write tests later" aggregation tasks.
+Tasks that fail any rule should be split inline. If a task genuinely can't be split
+further, annotate with `<!-- agent-size-exception: reason -->`. The implementation
+plan review will flag unjustified exceptions.

package/pipeline/pre/create-prd.md CHANGED Viewed

@@ -1,18 +1,22 @@
 ---
 name: create-prd
 description: Create a product requirements document from a project idea
+summary: "Translates your vision (or idea, if no vision exists) into a product requirements document with problem statement, user personas, prioritized feature list, constraints, non-functional requirements, and measurable success criteria."
 phase: "pre"
 order: 110
 dependencies: []
 outputs: [docs/plan.md]
 conditional: null
 knowledge-base: [prd-craft]
+reads: [create-vision]
 ---
 ## Purpose
 Transform a project idea into a structured product requirements document that
 defines the problem, target users, features, constraints, and success criteria.
 This is the foundation document that all subsequent phases reference.
+The PRD drives user stories, architecture decisions, and implementation planning
+throughout the entire pipeline.
 ## Inputs
 - Project idea (provided by user verbally or in a brief)
@@ -22,12 +26,13 @@ This is the foundation document that all subsequent phases reference.
 - docs/plan.md — Product requirements document
 ## Quality Criteria
-- Problem statement is specific and testable (not vague aspirations)
-- Target users are identified with their needs
-- Features are scoped with clear boundaries (what's in, what's out)
-- Success criteria are measurable
-- Constraints (technical, timeline, budget, team) are documented
-- Non-functional requirements are explicit (performance, security, accessibility)
+- (mvp) Problem statement names a specific user group, a specific pain point, and a falsifiable hypothesis about the solution
+- (mvp) Target users are identified with their needs
+- (mvp) Features are scoped with clear boundaries (what's in, what's out)
+- (mvp) Success criteria are measurable
+- (mvp) Each non-functional requirement has a measurable target or threshold (e.g., 'page load < 2s', 'WCAG AA')
+- (mvp) No two sections contain contradictory statements about the same concept
+- (deep) Constraints (technical, timeline, budget, team) are documented
 ## Methodology Scaling
 - **deep**: Comprehensive PRD. Competitive analysis, detailed user personas,
@@ -47,8 +52,18 @@ Preserve existing decisions unless explicitly revisiting them.
 ## Update Mode Specifics
 - **Detect prior artifact**: docs/plan.md exists
 - **Preserve**: problem statement, existing feature definitions, success criteria,
-  user personas, and scope boundaries unless user explicitly requests changes
+  user personas, scope boundaries, and enhancement markers (`<!-- enhancement: ... -->`)
+  unless user explicitly requests changes
 - **Triggers for update**: user provides new requirements, scope adjustment
   requested, constraints changed (timeline, budget, team), new user research
 - **Conflict resolution**: new features are appended to the feature list with
   clear versioning; changed constraints are documented with rationale for change
+### Understand the Vision
+**If `docs/vision.md` exists**: Read it completely. This is your strategic foundation — the vision document has already established the problem space, target audience, value proposition, competitive landscape, and guiding principles. Skip the vision discovery questions below and use the vision document as the North Star for this PRD. Reference it throughout, ensuring every requirement aligns with the stated vision and guiding principles. Focus your discovery questions on translating the vision into concrete product requirements rather than re-exploring strategic direction.
+**If `docs/vision.md` does NOT exist**:
+- What problem does this solve and for whom? Push me to be specific about the target user.
+- What does success look like? How will we know this is working?
+- What's the single most important thing this app must do well?

package/pipeline/pre/innovate-prd.md CHANGED Viewed

@@ -1,12 +1,13 @@
 ---
 name: innovate-prd
 description: Discover feature-level innovation opportunities in the PRD
+summary: "Analyzes the PRD for feature-level gaps — competitive blind spots, UX enhancements, AI-native possibilities — and proposes additions for your approval."
 phase: "pre"
 order: 130
 dependencies: [review-prd]
-outputs: [docs/prd-innovation.md, docs/reviews/prd-innovation/review-summary.md, docs/reviews/prd-innovation/codex-review.json, docs/reviews/prd-innovation/gemini-review.json]
+outputs: [docs/prd-innovation.md, docs/plan.md, docs/reviews/prd-innovation/review-summary.md, docs/reviews/prd-innovation/codex-review.json, docs/reviews/prd-innovation/gemini-review.json]
 conditional: "if-needed"
-knowledge-base: [prd-innovation, prd-craft]
+knowledge-base: [prd-innovation, prd-craft, multi-model-review-dispatch]
 ---
 ## Purpose
@@ -32,12 +33,13 @@ creative opportunities and competitive insights.
 - docs/reviews/prd-innovation/gemini-review.json (depth 4+, if available) — raw Gemini suggestions
 ## Quality Criteria
-- Enhancements are feature-level, not UX-level polish
-- Each suggestion has a cost estimate (trivial/moderate/significant)
-- Each suggestion has a clear user benefit and impact assessment
-- Approved innovations are documented to the same standard as existing features
-- PRD scope boundaries are respected — no uncontrolled scope creep
+- (mvp) Enhancements are feature-level, not UX-level polish
+- (mvp) Each suggestion has a cost estimate (trivial/moderate/significant)
+- (mvp) Each suggestion has a clear user benefit and impact assessment
+- (mvp) Each approved innovation includes: problem it solves, target users, scope boundaries, and success criteria
+- (mvp) PRD scope boundaries are respected — no uncontrolled scope creep
 - User approval is obtained before modifying the PRD
+- User approval for each accepted innovation documented as a question-response pair with timestamp (e.g., "Q: Accept feature X? A: Yes — 2025-01-15T14:30Z")
 - (depth 4+) Multi-model suggestions deduplicated and synthesized with unique ideas from each model highlighted
 ## Methodology Scaling
@@ -47,7 +49,7 @@ creative opportunities and competitive insights.
   innovation dispatched to Codex and Gemini if available, with graceful
   fallback to Claude-only enhanced brainstorming.
 - **mvp**: Not applicable — this step is conditional and skipped in MVP.
-- **custom:depth(1-5)**: Depth 1-2: not typically enabled. Depth 3: quick scan
+- **custom:depth(1-5)**: Depth 1-2: skip (not enough context for meaningful innovation at this depth). Depth 3: quick scan
   for obvious gaps and missing expected features. Depth 4: full innovation
   pass + one external model (if CLI available). Depth 5: full innovation pass
   + multi-model with deduplication and synthesis.

package/pipeline/pre/innovate-user-stories.md CHANGED Viewed

@@ -1,12 +1,13 @@
 ---
 name: innovate-user-stories
 description: Discover UX-level enhancements and innovation opportunities in user stories
+summary: "Identifies UX enhancement opportunities — progressive disclosure, smart defaults, accessibility improvements — and integrates approved changes into existing stories."
 phase: "pre"
 order: 160
 dependencies: [review-user-stories]
 outputs: [docs/user-stories-innovation.md, docs/reviews/user-stories-innovation/review-summary.md, docs/reviews/user-stories-innovation/codex-review.json, docs/reviews/user-stories-innovation/gemini-review.json]
 conditional: "if-needed"
-knowledge-base: [user-stories, user-story-innovation]
+knowledge-base: [user-stories, user-story-innovation, multi-model-review-dispatch]
 ---
 ## Purpose
@@ -33,11 +34,12 @@ enhancement opportunities.
 - docs/reviews/user-stories-innovation/gemini-review.json (depth 4+, if available) — raw Gemini suggestions
 ## Quality Criteria
-- Enhancements are UX-level, not new features
-- Each suggestion has a cost estimate (trivial/moderate/significant)
-- Each suggestion has a clear user benefit
-- Approved enhancements are integrated into existing stories (not new stories)
-- PRD scope boundaries are respected — no scope creep
+- (mvp) Enhancements are UX-level, not new features
+- (mvp) Each suggestion has a cost estimate (trivial/moderate/significant)
+- (mvp) Each suggestion has a clear user benefit
+- (mvp) Approved enhancements are integrated into existing stories (not new stories)
+- (mvp) PRD scope boundaries are respected — no scope creep
+- User approval for each accepted innovation documented as a question-response pair with timestamp (e.g., "Q: Accept enhancement X? A: Yes — 2025-01-15T14:30Z")
 - (depth 4+) Multi-model suggestions deduplicated and synthesized with unique ideas from each model highlighted
 ## Methodology Scaling
@@ -47,7 +49,7 @@ enhancement opportunities.
   innovation dispatched to Codex and Gemini if available, with graceful
   fallback to Claude-only enhanced brainstorming.
 - **mvp**: Not applicable — this step is conditional and skipped in MVP.
-- **custom:depth(1-5)**: Depth 1-2: not typically enabled. Depth 3: quick
+- **custom:depth(1-5)**: Depth 1-2: skip (not enough context for meaningful innovation at this depth). Depth 3: quick
   scan for obvious improvements. Depth 4: full innovation pass + one external
   model (if CLI available). Depth 5: full innovation pass + multi-model with
   deduplication and synthesis.

package/pipeline/pre/review-prd.md CHANGED Viewed

@@ -1,12 +1,13 @@
 ---
 name: review-prd
 description: Multi-pass review of the PRD for completeness, clarity, and downstream readiness
+summary: "Reviews the PRD across eight passes — problem rigor, persona coverage, feature scoping, success criteria, internal consistency, constraints, non-functional requirements — and fixes blocking issues."
 phase: "pre"
 order: 120
 dependencies: [create-prd]
 outputs: [docs/reviews/pre-review-prd.md, docs/reviews/prd/review-summary.md, docs/reviews/prd/codex-review.json, docs/reviews/prd/gemini-review.json]
 conditional: null
-knowledge-base: [review-methodology, review-prd, prd-craft, gap-analysis]
+knowledge-base: [review-methodology, review-prd, prd-craft, gap-analysis, multi-model-review-dispatch, review-step-template]
 ---
 ## Purpose
@@ -30,11 +31,12 @@ independent review validation.
 - docs/reviews/prd/gemini-review.json (depth 4+, if available) — raw Gemini findings
 ## Quality Criteria
+- (mvp) Passes 1-2 executed with findings documented
 - All review passes executed with findings documented
 - Every finding categorized by severity (P0-P3)
 - Fix plan created for P0 and P1 findings
 - Fixes applied and re-validated
-- Downstream readiness confirmed (User Stories can proceed)
+- (mvp) Downstream readiness confirmed (User Stories can proceed)
 - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
 ## Methodology Scaling
@@ -54,3 +56,10 @@ If docs/reviews/pre-review-prd.md exists, this is a re-review. Read previous
 findings, check which were addressed, run review passes again on updated PRD.
 If multi-model review artifacts exist under docs/reviews/prd/, preserve prior
 findings still valid.
+## Update Mode Specifics
+- **Detect**: `docs/reviews/review-prd.md` exists with tracking comment
+- **Preserve**: Prior findings still valid, resolution decisions, multi-model review artifacts
+- **Triggers**: Upstream artifact changed since last review (compare tracking comment dates)
+- **Conflict resolution**: Previously resolved findings reappearing = regression; flag and re-evaluate

package/pipeline/pre/review-user-stories.md CHANGED Viewed

@@ -1,12 +1,13 @@
 ---
 name: review-user-stories
 description: Multi-pass review of user stories for PRD coverage, quality, and downstream readiness
+summary: "Verifies every PRD feature maps to at least one story, checks that acceptance criteria are specific enough to test, validates story independence, and builds a requirements traceability index at higher depths."
 phase: "pre"
 order: 150
 dependencies: [user-stories]
 outputs: [docs/reviews/pre-review-user-stories.md, docs/reviews/user-stories/requirements-index.md, docs/reviews/user-stories/coverage.json, docs/reviews/user-stories/review-summary.md]
 conditional: null
-knowledge-base: [review-methodology, review-user-stories]
+knowledge-base: [review-methodology, review-user-stories, multi-model-review-dispatch, review-step-template]
 ---
 ## Purpose
@@ -33,14 +34,15 @@ independent coverage validation.
   synthesis with coverage verification
 ## Quality Criteria
+- (mvp) Pass 1 (PRD coverage) executed with findings documented
 - All review passes executed with findings documented
 - Every finding categorized by severity (P0-P3)
 - Fix plan created for P0 and P1 findings
 - Fixes applied and re-validated
-- Downstream readiness confirmed (modeling phase can proceed)
+- (mvp) Every story has at least one testable acceptance criterion, and every PRD feature maps to at least one story
 - (depth 4+) Every atomic PRD requirement has a REQ-xxx ID in the requirements index
 - (depth 4+) Coverage matrix maps every REQ to at least one US (100% coverage target)
-- (depth 5) Multi-model findings synthesized with consensus/disagreement analysis
+- (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
 ## Methodology Scaling
 - **deep**: All 6 review passes from the knowledge base. Full findings report
@@ -58,3 +60,10 @@ If docs/reviews/pre-review-user-stories.md exists, this is a re-review. Read
 previous findings, check which were addressed, run review passes again on
 updated stories. If docs/reviews/user-stories/requirements-index.md exists,
 preserve requirement IDs — never renumber REQ-xxx IDs.
+## Update Mode Specifics
+- **Detect**: `docs/reviews/pre-review-user-stories.md` exists with tracking comment
+- **Preserve**: Prior findings still valid, REQ-xxx IDs, resolution decisions, multi-model review artifacts
+- **Triggers**: Upstream artifact changed since last review (compare tracking comment dates)
+- **Conflict resolution**: Previously resolved findings reappearing = regression; flag and re-evaluate

package/pipeline/pre/user-stories.md CHANGED Viewed

@@ -1,10 +1,12 @@
 ---
 name: user-stories
 description: Translate PRD features into user stories with acceptance criteria
+summary: "Breaks every PRD feature into user stories organized by epic, each with testable acceptance criteria in Given/When/Then format."
 phase: "pre"
 order: 140
 dependencies: [review-prd]
 outputs: [docs/user-stories.md]
+reads: [innovate-prd]
 conditional: null
 knowledge-base: [user-stories]
 ---
@@ -25,12 +27,13 @@ task decomposition downstream.
   criteria scaled to the configured depth level
 ## Quality Criteria
-- Every PRD feature maps to at least one user story
-- Stories follow INVEST criteria (Independent, Negotiable, Valuable, Estimable, Small, Testable)
-- Acceptance criteria are testable — unambiguous pass/fail
-- No story too large to implement in 1-3 focused agent sessions
-- Every PRD persona is represented in at least one story
-- Stories describe user behavior, not implementation details
+- (mvp) Every PRD feature maps to at least one user story
+- (deep) Stories follow INVEST criteria (Independent, Negotiable, Valuable, Estimable, Small, Testable)
+- (mvp) Acceptance criteria are testable — unambiguous pass/fail
+- (deep) No story has more than 7 acceptance criteria
+- (mvp) Every PRD persona is represented in at least one story
+- (mvp) Stories describe user behavior, not implementation details
+- (mvp) Each story is independent — reordering stories does not break acceptance criteria
 ## Methodology Scaling
 - **deep**: Full story template with IDs, persona journey maps, cross-story
@@ -50,7 +53,9 @@ PRESERVE, get approval before modifying. Preserve existing story IDs.
 ## Update Mode Specifics
 - **Detect prior artifact**: docs/user-stories.md exists
 - **Preserve**: existing story IDs, epic groupings, acceptance criteria that
-  haven't been invalidated, story-to-PRD-feature traceability
+  haven't been invalidated, story-to-PRD-feature traceability, enhancement
+  markers (`<!-- enhancement: ... -->`), priority decisions, story ID format
+  (US-xxx)
 - **Triggers for update**: PRD features added or changed, innovation suggestions
   accepted, user personas expanded, review findings require story adjustments
 - **Conflict resolution**: never reuse a retired story ID; if a story's scope

package/pipeline/quality/create-evals.md CHANGED Viewed

@@ -1,11 +1,12 @@
 ---
 name: create-evals
 description: Generate project-specific eval checks from standards documentation
+summary: "Generates automated checks that verify your code matches your documented standards — file placement, naming conventions, feature-to-test coverage, API contract alignment — using your project's own test framework."
 phase: "quality"
 order: 920
 dependencies: [tdd, story-tests]
 outputs: [tests/evals/, docs/eval-standards.md]
-reads: [story-tests]
+reads: [security, dev-env-setup, api-contracts, database-schema, ux-spec]
 conditional: null
 knowledge-base: [eval-craft, testing-strategy]
 ---
@@ -70,6 +71,8 @@ Supporting:
 - (deep) Adherence, security, and error-handling evals include exclusion mechanisms
 - (deep) docs/eval-standards.md explicitly documents what evals do NOT check
 - (deep) Full eval suite runs in under 30 seconds
+- (mvp) `make eval` (or equivalent) runs and all generated evals pass
+- (deep) Eval false-positive assessment: each eval category documents at least one scenario where valid code might incorrectly fail, with exclusion mechanism
 ## Methodology Scaling
 - **deep**: All 13 eval categories (conditional on doc existence). Stack-specific
@@ -83,11 +86,12 @@ Supporting:
   - Depth 5: All 13 categories (Security, API, Database, Accessibility, Performance)
 ## Mode Detection
-Update mode if tests/evals/ directory exists. In update mode: regenerate
-consistency, structure, cross-doc, and conditional category evals. Preserve
-adherence, security, and error-handling eval exclusions. Regenerate coverage
-evals only if plan.md or user-stories.md changed. Add/remove conditional
-categories based on whether their source doc exists.
+Update mode if tests/evals/ directory or docs/eval-standards.md exists. In
+update mode: regenerate consistency, structure, cross-doc, and conditional
+category evals. Preserve adherence, security, and error-handling eval
+exclusions. Regenerate coverage evals only if plan.md or user-stories.md
+changed. Add/remove conditional categories based on whether their source doc
+exists.
 ## Update Mode Specifics
 - **Detect prior artifact**: tests/evals/ directory exists with eval test files

package/pipeline/quality/operations.md CHANGED Viewed

@@ -1,10 +1,12 @@
 ---
 name: operations
 description: Define deployment pipeline, deployment strategy, monitoring, alerting, and incident response
+summary: "Designs your deployment pipeline (build, test, deploy, verify, rollback), defines monitoring metrics with alert thresholds, and writes incident response procedures with rollback instructions."
 phase: "quality"
 order: 930
 dependencies: [review-testing]
 outputs: [docs/operations-runbook.md]
+reads: [system-architecture, adrs, dev-env-setup, git-workflow]
 conditional: null
 knowledge-base: [operations-runbook]
 ---
@@ -26,19 +28,21 @@ development setup rather than redefining it.
 - docs/operations-runbook.md — production operations and deployment runbook
 ## Quality Criteria
-- Deployment pipeline extends existing CI (build, deploy, post-deploy stages)
-- Deployment pipeline has explicit stages (build → test → deploy → verify → rollback-ready)
-- Does not redefine base CI stages (lint, test) from git-workflow
-- Deployment strategy chosen with rollback procedure
-- Rollback procedure tested with specific trigger conditions (e.g., error rate > X%, health check failure)
-- Runbook structured by operational scenario (deployment, rollback, incident, scaling)
-- Monitoring covers key metrics (latency, error rate, saturation)
-- Each monitoring metric has an explicit threshold with rationale
-- Health check endpoints defined with expected response codes and latency bounds
-- Log aggregation strategy specifies retention period and searchable fields
-- Alerting thresholds are justified, not arbitrary
+- (mvp) Deployment pipeline extends existing CI (build, deploy, post-deploy stages)
+- (mvp) Deployment pipeline has explicit stages (build → test → deploy → verify → rollback-ready)
+- (mvp) Does not redefine base CI stages (lint, test) from git-workflow
+- (mvp) Deployment strategy chosen with rollback procedure
+- (deep) Rollback procedure tested with specific trigger conditions (e.g., error rate > X%, health check failure)
+- (deep) Runbook structured by operational scenario (deployment, rollback, incident, scaling)
+- (mvp) Monitoring covers key metrics (latency, error rate, saturation)
+- (deep) Each monitoring metric has an explicit threshold with rationale
+- (deep) Health check endpoints defined with expected response codes and latency bounds
+- (deep) Log aggregation strategy specifies retention period and searchable fields
+- (deep) Each alert threshold documents: the metric, threshold value, business impact if crossed, and mitigation action
 - References docs/dev-setup.md for local dev — does not redefine it
-- Incident response process defined
+- (deep) Incident response process defined
+- (deep) Recovery Time Objective (RTO) and Recovery Point Objective (RPO) documented for each critical service
+- (deep) Secret rotation procedure documented and tested
 ## Methodology Scaling
 - **deep**: Full runbook. Deployment topology diagrams. Monitoring dashboard

package/pipeline/quality/review-operations.md CHANGED Viewed

@@ -1,12 +1,13 @@
 ---
 name: review-operations
 description: Review operations runbook for completeness and safety
+summary: "Verifies the full deployment lifecycle is documented, monitoring covers latency/errors/saturation, alert thresholds have rationale, and common failure scenarios have runbook entries."
 phase: "quality"
 order: 940
 dependencies: [operations]
 outputs: [docs/reviews/review-operations.md, docs/reviews/operations/review-summary.md, docs/reviews/operations/codex-review.json, docs/reviews/operations/gemini-review.json]
 conditional: null
-knowledge-base: [review-methodology, review-operations]
+knowledge-base: [review-methodology, review-operations, multi-model-review-dispatch, review-step-template]
 ---
 ## Purpose
@@ -29,21 +30,29 @@ independent review validation.
 - docs/reviews/operations/gemini-review.json (depth 4+, if available) — raw Gemini findings
 ## Quality Criteria
-- Deployment lifecycle fully documented (deploy, verify, rollback)
-- Monitoring covers all critical metrics
-- Alert thresholds have rationale
-- Common failure scenarios have runbook entries
-- Dev environment parity assessed
+- (mvp) Deployment lifecycle fully documented (deploy, verify, rollback)
+- (mvp) Monitoring verified against minimum set: latency, error rate, and saturation
+- (deep) Alert thresholds have rationale
+- (deep) Common failure scenarios have runbook entries
+- (deep) Dev/staging/production environment differences documented in operations runbook
+- Every finding categorized P0-P3 with specific runbook section, metric, and issue
+- Fix plan documented for all P0/P1 findings; fixes applied to operations-runbook.md and re-validated
+- Downstream readiness confirmed — no unresolved P0 or P1 findings remain before security step proceeds
 - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
 ## Methodology Scaling
 - **deep**: Full multi-pass review. Multi-model review dispatched to Codex and
   Gemini if available, with graceful fallback to Claude-only enhanced review.
-  **mvp**: Deployment coverage only.
-- **custom:depth(1-5)**: Depth 1-3: scale passes with depth. Depth 4: full
-  review + one external model (if CLI available). Depth 5: full review +
-  multi-model with reconciliation.
+- **mvp**: Deployment coverage only.
+- **custom:depth(1-5)**: Depth 1: monitoring and logging pass only. Depth 2: add deployment and rollback pass. Depth 3: add incident response and scaling passes. Depth 4: add external model review. Depth 5: multi-model review with reconciliation.
 ## Mode Detection
 Re-review mode if previous review exists. If multi-model review artifacts exist
 under docs/reviews/operations/, preserve prior findings still valid.
+## Update Mode Specifics
+- **Detect**: `docs/reviews/review-operations.md` exists with tracking comment
+- **Preserve**: Prior findings still valid, resolution decisions, multi-model review artifacts
+- **Triggers**: Upstream artifact changed since last review (compare tracking comment dates)
+- **Conflict resolution**: Previously resolved findings reappearing = regression; flag and re-evaluate

package/pipeline/quality/review-security.md CHANGED Viewed

@@ -1,12 +1,14 @@
 ---
 name: review-security
 description: Review security review for coverage and correctness
+summary: "Verifies OWASP coverage is complete, auth boundaries match API contracts, every secret is accounted for, and the threat model covers all trust boundaries. Highest priority for multi-model review."
 phase: "quality"
 order: 960
 dependencies: [security]
 outputs: [docs/reviews/review-security.md, docs/reviews/security/review-summary.md, docs/reviews/security/codex-review.json, docs/reviews/security/gemini-review.json]
 conditional: null
-knowledge-base: [review-methodology, review-security]
+reads: [api-contracts]
+knowledge-base: [review-methodology, review-security, multi-model-review-dispatch, review-step-template]
 ---
 ## Purpose
@@ -31,22 +33,30 @@ independent review validation.
 - docs/reviews/security/gemini-review.json (depth 4+, if available) — raw Gemini findings
 ## Quality Criteria
-- OWASP coverage verified for this project
-- Auth boundaries match API contract auth requirements
-- Secrets management is complete (no gaps)
-- Dependency audit scope covers all dependencies
-- Threat model covers all trust boundaries
-- Data classification is complete
+- (mvp) OWASP coverage verified for this project
+- (deep) Auth boundaries match API contract auth requirements
+- (deep) Secrets management covers: all environment variables, API keys, database credentials, and third-party tokens
+- (deep) Dependency audit scope covers all dependencies
+- (deep) Threat model covers all trust boundaries
+- (deep) Data classification covers every entity in the domain model
+- Every finding categorized P0-P3 with specific control, boundary, and issue
+- Fix plan documented for all P0/P1 findings; fixes applied to security-review.md and re-validated
+- Downstream readiness confirmed — no unresolved P0 or P1 findings remain before planning phase proceeds
 - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
 ## Methodology Scaling
 - **deep**: Full multi-pass review. Multi-model review dispatched to Codex and
   Gemini if available, with graceful fallback to Claude-only enhanced review.
-  **mvp**: OWASP coverage check only.
-- **custom:depth(1-5)**: Depth 1-3: scale passes with depth. Depth 4: full
-  review + one external model (if CLI available). Depth 5: full review +
-  multi-model with reconciliation.
+- **mvp**: OWASP coverage check only.
+- **custom:depth(1-5)**: Depth 1: OWASP top 10 and secrets management pass only. Depth 2: add auth boundary and input validation passes. Depth 3: add dependency audit and data protection passes. Depth 4: add external model security review. Depth 5: multi-model security review with reconciliation.
 ## Mode Detection
 Re-review mode if previous review exists. If multi-model review artifacts exist
 under docs/reviews/security/, preserve prior findings still valid.
+## Update Mode Specifics
+- **Detect**: `docs/reviews/review-security.md` exists with tracking comment
+- **Preserve**: Prior findings still valid, resolution decisions, multi-model review artifacts
+- **Triggers**: Upstream artifact changed since last review (compare tracking comment dates)
+- **Conflict resolution**: Previously resolved findings reappearing = regression; flag and re-evaluate