@zigrivers/scaffold 2.28.1 → 2.38.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (375) hide show
  1. package/README.md +309 -136
  2. package/dist/cli/commands/build.d.ts.map +1 -1
  3. package/dist/cli/commands/build.js +94 -14
  4. package/dist/cli/commands/build.js.map +1 -1
  5. package/dist/cli/commands/build.test.js +30 -5
  6. package/dist/cli/commands/build.test.js.map +1 -1
  7. package/dist/cli/commands/check.d.ts +12 -0
  8. package/dist/cli/commands/check.d.ts.map +1 -0
  9. package/dist/cli/commands/check.js +311 -0
  10. package/dist/cli/commands/check.js.map +1 -0
  11. package/dist/cli/commands/check.test.d.ts +2 -0
  12. package/dist/cli/commands/check.test.d.ts.map +1 -0
  13. package/dist/cli/commands/check.test.js +412 -0
  14. package/dist/cli/commands/check.test.js.map +1 -0
  15. package/dist/cli/commands/complete.d.ts +12 -0
  16. package/dist/cli/commands/complete.d.ts.map +1 -0
  17. package/dist/cli/commands/complete.js +103 -0
  18. package/dist/cli/commands/complete.js.map +1 -0
  19. package/dist/cli/commands/complete.test.d.ts +2 -0
  20. package/dist/cli/commands/complete.test.d.ts.map +1 -0
  21. package/dist/cli/commands/complete.test.js +133 -0
  22. package/dist/cli/commands/complete.test.js.map +1 -0
  23. package/dist/cli/commands/dashboard.d.ts.map +1 -1
  24. package/dist/cli/commands/dashboard.js +12 -8
  25. package/dist/cli/commands/dashboard.js.map +1 -1
  26. package/dist/cli/commands/info.d.ts.map +1 -1
  27. package/dist/cli/commands/info.js +4 -0
  28. package/dist/cli/commands/info.js.map +1 -1
  29. package/dist/cli/commands/knowledge.d.ts.map +1 -1
  30. package/dist/cli/commands/knowledge.js +6 -2
  31. package/dist/cli/commands/knowledge.js.map +1 -1
  32. package/dist/cli/commands/knowledge.test.js +16 -11
  33. package/dist/cli/commands/knowledge.test.js.map +1 -1
  34. package/dist/cli/commands/next.d.ts.map +1 -1
  35. package/dist/cli/commands/next.js +41 -13
  36. package/dist/cli/commands/next.js.map +1 -1
  37. package/dist/cli/commands/next.test.js +3 -0
  38. package/dist/cli/commands/next.test.js.map +1 -1
  39. package/dist/cli/commands/reset.d.ts +1 -0
  40. package/dist/cli/commands/reset.d.ts.map +1 -1
  41. package/dist/cli/commands/reset.js +179 -67
  42. package/dist/cli/commands/reset.js.map +1 -1
  43. package/dist/cli/commands/reset.test.js +360 -0
  44. package/dist/cli/commands/reset.test.js.map +1 -1
  45. package/dist/cli/commands/rework.d.ts +20 -0
  46. package/dist/cli/commands/rework.d.ts.map +1 -0
  47. package/dist/cli/commands/rework.js +332 -0
  48. package/dist/cli/commands/rework.js.map +1 -0
  49. package/dist/cli/commands/rework.test.d.ts +2 -0
  50. package/dist/cli/commands/rework.test.d.ts.map +1 -0
  51. package/dist/cli/commands/rework.test.js +297 -0
  52. package/dist/cli/commands/rework.test.js.map +1 -0
  53. package/dist/cli/commands/run.d.ts.map +1 -1
  54. package/dist/cli/commands/run.js +59 -31
  55. package/dist/cli/commands/run.js.map +1 -1
  56. package/dist/cli/commands/run.test.js +288 -6
  57. package/dist/cli/commands/run.test.js.map +1 -1
  58. package/dist/cli/commands/skill.d.ts +12 -0
  59. package/dist/cli/commands/skill.d.ts.map +1 -0
  60. package/dist/cli/commands/skill.js +123 -0
  61. package/dist/cli/commands/skill.js.map +1 -0
  62. package/dist/cli/commands/skill.test.d.ts +2 -0
  63. package/dist/cli/commands/skill.test.d.ts.map +1 -0
  64. package/dist/cli/commands/skill.test.js +297 -0
  65. package/dist/cli/commands/skill.test.js.map +1 -0
  66. package/dist/cli/commands/skip.d.ts +1 -1
  67. package/dist/cli/commands/skip.d.ts.map +1 -1
  68. package/dist/cli/commands/skip.js +123 -57
  69. package/dist/cli/commands/skip.js.map +1 -1
  70. package/dist/cli/commands/skip.test.js +91 -0
  71. package/dist/cli/commands/skip.test.js.map +1 -1
  72. package/dist/cli/commands/status.d.ts +1 -0
  73. package/dist/cli/commands/status.d.ts.map +1 -1
  74. package/dist/cli/commands/status.js +57 -10
  75. package/dist/cli/commands/status.js.map +1 -1
  76. package/dist/cli/commands/status.test.js +81 -0
  77. package/dist/cli/commands/status.test.js.map +1 -1
  78. package/dist/cli/commands/update.test.js +252 -0
  79. package/dist/cli/commands/update.test.js.map +1 -1
  80. package/dist/cli/commands/version.test.js +171 -1
  81. package/dist/cli/commands/version.test.js.map +1 -1
  82. package/dist/cli/index.d.ts.map +1 -1
  83. package/dist/cli/index.js +8 -0
  84. package/dist/cli/index.js.map +1 -1
  85. package/dist/core/adapters/adapter.d.ts +14 -0
  86. package/dist/core/adapters/adapter.d.ts.map +1 -1
  87. package/dist/core/adapters/adapter.js.map +1 -1
  88. package/dist/core/adapters/adapter.test.js +10 -0
  89. package/dist/core/adapters/adapter.test.js.map +1 -1
  90. package/dist/core/adapters/claude-code.d.ts.map +1 -1
  91. package/dist/core/adapters/claude-code.js +47 -10
  92. package/dist/core/adapters/claude-code.js.map +1 -1
  93. package/dist/core/adapters/claude-code.test.js +41 -20
  94. package/dist/core/adapters/claude-code.test.js.map +1 -1
  95. package/dist/core/adapters/codex.d.ts.map +1 -1
  96. package/dist/core/adapters/codex.js +5 -1
  97. package/dist/core/adapters/codex.js.map +1 -1
  98. package/dist/core/adapters/codex.test.js +5 -0
  99. package/dist/core/adapters/codex.test.js.map +1 -1
  100. package/dist/core/adapters/universal.d.ts.map +1 -1
  101. package/dist/core/adapters/universal.js +0 -1
  102. package/dist/core/adapters/universal.js.map +1 -1
  103. package/dist/core/adapters/universal.test.js +5 -0
  104. package/dist/core/adapters/universal.test.js.map +1 -1
  105. package/dist/core/assembly/context-gatherer.d.ts.map +1 -1
  106. package/dist/core/assembly/context-gatherer.js +5 -2
  107. package/dist/core/assembly/context-gatherer.js.map +1 -1
  108. package/dist/core/assembly/engine.d.ts.map +1 -1
  109. package/dist/core/assembly/engine.js +10 -2
  110. package/dist/core/assembly/engine.js.map +1 -1
  111. package/dist/core/assembly/engine.test.js +19 -0
  112. package/dist/core/assembly/engine.test.js.map +1 -1
  113. package/dist/core/assembly/knowledge-loader.d.ts +25 -0
  114. package/dist/core/assembly/knowledge-loader.d.ts.map +1 -1
  115. package/dist/core/assembly/knowledge-loader.js +75 -2
  116. package/dist/core/assembly/knowledge-loader.js.map +1 -1
  117. package/dist/core/assembly/knowledge-loader.test.js +388 -1
  118. package/dist/core/assembly/knowledge-loader.test.js.map +1 -1
  119. package/dist/core/assembly/meta-prompt-loader.d.ts +6 -0
  120. package/dist/core/assembly/meta-prompt-loader.d.ts.map +1 -1
  121. package/dist/core/assembly/meta-prompt-loader.js +41 -25
  122. package/dist/core/assembly/meta-prompt-loader.js.map +1 -1
  123. package/dist/core/assembly/preset-loader.d.ts +10 -0
  124. package/dist/core/assembly/preset-loader.d.ts.map +1 -1
  125. package/dist/core/assembly/preset-loader.js +26 -1
  126. package/dist/core/assembly/preset-loader.js.map +1 -1
  127. package/dist/core/assembly/preset-loader.test.js +65 -1
  128. package/dist/core/assembly/preset-loader.test.js.map +1 -1
  129. package/dist/core/assembly/update-mode.d.ts.map +1 -1
  130. package/dist/core/assembly/update-mode.js +10 -4
  131. package/dist/core/assembly/update-mode.js.map +1 -1
  132. package/dist/core/assembly/update-mode.test.js +47 -0
  133. package/dist/core/assembly/update-mode.test.js.map +1 -1
  134. package/dist/core/dependency/dependency.d.ts.map +1 -1
  135. package/dist/core/dependency/dependency.js +3 -2
  136. package/dist/core/dependency/dependency.js.map +1 -1
  137. package/dist/core/dependency/dependency.test.js +2 -0
  138. package/dist/core/dependency/dependency.test.js.map +1 -1
  139. package/dist/core/dependency/eligibility.js +3 -3
  140. package/dist/core/dependency/eligibility.js.map +1 -1
  141. package/dist/core/dependency/eligibility.test.js +2 -0
  142. package/dist/core/dependency/eligibility.test.js.map +1 -1
  143. package/dist/core/dependency/graph.d.ts.map +1 -1
  144. package/dist/core/dependency/graph.js +4 -0
  145. package/dist/core/dependency/graph.js.map +1 -1
  146. package/dist/core/dependency/graph.test.d.ts +2 -0
  147. package/dist/core/dependency/graph.test.d.ts.map +1 -0
  148. package/dist/core/dependency/graph.test.js +262 -0
  149. package/dist/core/dependency/graph.test.js.map +1 -0
  150. package/dist/core/rework/phase-selector.d.ts +24 -0
  151. package/dist/core/rework/phase-selector.d.ts.map +1 -0
  152. package/dist/core/rework/phase-selector.js +98 -0
  153. package/dist/core/rework/phase-selector.js.map +1 -0
  154. package/dist/core/rework/phase-selector.test.d.ts +2 -0
  155. package/dist/core/rework/phase-selector.test.d.ts.map +1 -0
  156. package/dist/core/rework/phase-selector.test.js +138 -0
  157. package/dist/core/rework/phase-selector.test.js.map +1 -0
  158. package/dist/dashboard/generator.d.ts +48 -17
  159. package/dist/dashboard/generator.d.ts.map +1 -1
  160. package/dist/dashboard/generator.js +75 -5
  161. package/dist/dashboard/generator.js.map +1 -1
  162. package/dist/dashboard/generator.test.js +213 -5
  163. package/dist/dashboard/generator.test.js.map +1 -1
  164. package/dist/dashboard/template.d.ts +1 -1
  165. package/dist/dashboard/template.d.ts.map +1 -1
  166. package/dist/dashboard/template.js +755 -114
  167. package/dist/dashboard/template.js.map +1 -1
  168. package/dist/e2e/knowledge.test.js +4 -3
  169. package/dist/e2e/knowledge.test.js.map +1 -1
  170. package/dist/e2e/pipeline.test.js +2 -0
  171. package/dist/e2e/pipeline.test.js.map +1 -1
  172. package/dist/e2e/rework.test.d.ts +6 -0
  173. package/dist/e2e/rework.test.d.ts.map +1 -0
  174. package/dist/e2e/rework.test.js +226 -0
  175. package/dist/e2e/rework.test.js.map +1 -0
  176. package/dist/index.js +0 -0
  177. package/dist/project/adopt.test.js +2 -0
  178. package/dist/project/adopt.test.js.map +1 -1
  179. package/dist/project/claude-md.js +2 -2
  180. package/dist/project/claude-md.js.map +1 -1
  181. package/dist/project/claude-md.test.js +4 -4
  182. package/dist/project/claude-md.test.js.map +1 -1
  183. package/dist/project/detector.d.ts.map +1 -1
  184. package/dist/project/detector.js +4 -1
  185. package/dist/project/detector.js.map +1 -1
  186. package/dist/project/frontmatter.d.ts.map +1 -1
  187. package/dist/project/frontmatter.js +54 -15
  188. package/dist/project/frontmatter.js.map +1 -1
  189. package/dist/project/frontmatter.test.js +2 -2
  190. package/dist/project/frontmatter.test.js.map +1 -1
  191. package/dist/state/rework-manager.d.ts +16 -0
  192. package/dist/state/rework-manager.d.ts.map +1 -0
  193. package/dist/state/rework-manager.js +126 -0
  194. package/dist/state/rework-manager.js.map +1 -0
  195. package/dist/state/rework-manager.test.d.ts +2 -0
  196. package/dist/state/rework-manager.test.d.ts.map +1 -0
  197. package/dist/state/rework-manager.test.js +191 -0
  198. package/dist/state/rework-manager.test.js.map +1 -0
  199. package/dist/state/state-manager.d.ts +13 -0
  200. package/dist/state/state-manager.d.ts.map +1 -1
  201. package/dist/state/state-manager.js +39 -2
  202. package/dist/state/state-manager.js.map +1 -1
  203. package/dist/state/state-manager.test.js +74 -1
  204. package/dist/state/state-manager.test.js.map +1 -1
  205. package/dist/state/state-migration.d.ts +23 -0
  206. package/dist/state/state-migration.d.ts.map +1 -0
  207. package/dist/state/state-migration.js +144 -0
  208. package/dist/state/state-migration.js.map +1 -0
  209. package/dist/state/state-migration.test.d.ts +2 -0
  210. package/dist/state/state-migration.test.d.ts.map +1 -0
  211. package/dist/state/state-migration.test.js +451 -0
  212. package/dist/state/state-migration.test.js.map +1 -0
  213. package/dist/types/assembly.d.ts +2 -0
  214. package/dist/types/assembly.d.ts.map +1 -1
  215. package/dist/types/dependency.d.ts +2 -2
  216. package/dist/types/dependency.d.ts.map +1 -1
  217. package/dist/types/frontmatter.d.ts +100 -7
  218. package/dist/types/frontmatter.d.ts.map +1 -1
  219. package/dist/types/frontmatter.js +89 -1
  220. package/dist/types/frontmatter.js.map +1 -1
  221. package/dist/types/index.d.ts +1 -0
  222. package/dist/types/index.d.ts.map +1 -1
  223. package/dist/types/index.js +1 -0
  224. package/dist/types/index.js.map +1 -1
  225. package/dist/types/lock.d.ts +1 -1
  226. package/dist/types/lock.d.ts.map +1 -1
  227. package/dist/types/rework.d.ts +36 -0
  228. package/dist/types/rework.d.ts.map +1 -0
  229. package/dist/types/rework.js +2 -0
  230. package/dist/types/rework.js.map +1 -0
  231. package/dist/utils/errors.d.ts +1 -0
  232. package/dist/utils/errors.d.ts.map +1 -1
  233. package/dist/utils/errors.js +8 -0
  234. package/dist/utils/errors.js.map +1 -1
  235. package/dist/utils/fs.d.ts +6 -0
  236. package/dist/utils/fs.d.ts.map +1 -1
  237. package/dist/utils/fs.js +13 -0
  238. package/dist/utils/fs.js.map +1 -1
  239. package/dist/validation/config-validator.test.d.ts +2 -0
  240. package/dist/validation/config-validator.test.d.ts.map +1 -0
  241. package/dist/validation/config-validator.test.js +210 -0
  242. package/dist/validation/config-validator.test.js.map +1 -0
  243. package/dist/validation/dependency-validator.test.d.ts +2 -0
  244. package/dist/validation/dependency-validator.test.d.ts.map +1 -0
  245. package/dist/validation/dependency-validator.test.js +215 -0
  246. package/dist/validation/dependency-validator.test.js.map +1 -0
  247. package/dist/validation/frontmatter-validator.test.d.ts +2 -0
  248. package/dist/validation/frontmatter-validator.test.d.ts.map +1 -0
  249. package/dist/validation/frontmatter-validator.test.js +371 -0
  250. package/dist/validation/frontmatter-validator.test.js.map +1 -0
  251. package/dist/validation/state-validator.test.d.ts +2 -0
  252. package/dist/validation/state-validator.test.d.ts.map +1 -0
  253. package/dist/validation/state-validator.test.js +325 -0
  254. package/dist/validation/state-validator.test.js.map +1 -0
  255. package/dist/wizard/suggestion.test.d.ts +2 -0
  256. package/dist/wizard/suggestion.test.d.ts.map +1 -0
  257. package/dist/wizard/suggestion.test.js +115 -0
  258. package/dist/wizard/suggestion.test.js.map +1 -0
  259. package/dist/wizard/wizard.d.ts.map +1 -1
  260. package/dist/wizard/wizard.js +34 -1
  261. package/dist/wizard/wizard.js.map +1 -1
  262. package/knowledge/core/adr-craft.md +4 -0
  263. package/knowledge/core/api-design.md +4 -0
  264. package/knowledge/core/automated-review-tooling.md +203 -0
  265. package/knowledge/core/coding-conventions.md +1 -1
  266. package/knowledge/core/database-design.md +4 -0
  267. package/knowledge/core/design-system-tokens.md +4 -0
  268. package/knowledge/core/domain-modeling.md +4 -0
  269. package/knowledge/core/git-workflow-patterns.md +200 -0
  270. package/knowledge/core/operations-runbook.md +5 -1
  271. package/knowledge/core/security-best-practices.md +4 -0
  272. package/knowledge/core/system-architecture.md +5 -1
  273. package/knowledge/core/task-decomposition.md +118 -3
  274. package/knowledge/core/user-story-innovation.md +13 -0
  275. package/knowledge/core/ux-specification.md +13 -0
  276. package/knowledge/execution/enhancement-workflow.md +201 -0
  277. package/knowledge/execution/task-claiming-strategy.md +130 -0
  278. package/knowledge/execution/tdd-execution-loop.md +172 -0
  279. package/knowledge/execution/worktree-management.md +205 -0
  280. package/knowledge/finalization/apply-fixes-and-freeze.md +12 -0
  281. package/knowledge/finalization/developer-onboarding.md +4 -0
  282. package/knowledge/finalization/implementation-playbook.md +83 -5
  283. package/knowledge/product/gap-analysis.md +5 -1
  284. package/knowledge/product/prd-innovation.md +12 -0
  285. package/knowledge/product/vision-craft.md +213 -0
  286. package/knowledge/review/review-adr.md +12 -0
  287. package/knowledge/review/review-api-design.md +13 -0
  288. package/knowledge/review/review-database-design.md +13 -0
  289. package/knowledge/review/review-domain-modeling.md +5 -1
  290. package/knowledge/review/review-implementation-tasks.md +58 -1
  291. package/knowledge/review/review-methodology.md +11 -0
  292. package/knowledge/review/review-operations.md +12 -0
  293. package/knowledge/review/review-prd.md +13 -0
  294. package/knowledge/review/review-security.md +12 -0
  295. package/knowledge/review/review-system-architecture.md +4 -2
  296. package/knowledge/review/review-testing-strategy.md +11 -0
  297. package/knowledge/review/review-user-stories.md +11 -0
  298. package/knowledge/review/review-ux-specification.md +13 -1
  299. package/knowledge/review/review-vision.md +255 -0
  300. package/knowledge/tools/release-management.md +222 -0
  301. package/knowledge/tools/session-analysis.md +215 -0
  302. package/knowledge/tools/version-strategy.md +200 -0
  303. package/knowledge/validation/critical-path-analysis.md +1 -1
  304. package/knowledge/validation/cross-phase-consistency.md +12 -0
  305. package/knowledge/validation/decision-completeness.md +13 -1
  306. package/knowledge/validation/dependency-validation.md +12 -0
  307. package/knowledge/validation/scope-management.md +12 -0
  308. package/knowledge/validation/traceability.md +12 -0
  309. package/methodology/README.md +37 -0
  310. package/methodology/custom-defaults.yml +12 -1
  311. package/methodology/deep.yml +11 -0
  312. package/methodology/mvp.yml +11 -0
  313. package/package.json +3 -3
  314. package/pipeline/architecture/review-architecture.md +18 -7
  315. package/pipeline/architecture/system-architecture.md +11 -8
  316. package/pipeline/build/multi-agent-resume.md +245 -0
  317. package/pipeline/build/multi-agent-start.md +236 -0
  318. package/pipeline/build/new-enhancement.md +456 -0
  319. package/pipeline/build/quick-task.md +381 -0
  320. package/pipeline/build/single-agent-resume.md +210 -0
  321. package/pipeline/build/single-agent-start.md +207 -0
  322. package/pipeline/consolidation/claude-md-optimization.md +11 -8
  323. package/pipeline/consolidation/workflow-audit.md +15 -11
  324. package/pipeline/decisions/adrs.md +7 -5
  325. package/pipeline/decisions/review-adrs.md +14 -6
  326. package/pipeline/environment/ai-memory-setup.md +18 -12
  327. package/pipeline/environment/automated-pr-review.md +10 -4
  328. package/pipeline/environment/design-system.md +9 -7
  329. package/pipeline/environment/dev-env-setup.md +8 -5
  330. package/pipeline/environment/git-workflow.md +3 -1
  331. package/pipeline/finalization/apply-fixes-and-freeze.md +16 -5
  332. package/pipeline/finalization/developer-onboarding-guide.md +22 -8
  333. package/pipeline/finalization/implementation-playbook.md +40 -11
  334. package/pipeline/foundation/beads.md +10 -7
  335. package/pipeline/foundation/coding-standards.md +6 -3
  336. package/pipeline/foundation/project-structure.md +5 -1
  337. package/pipeline/foundation/tdd.md +10 -6
  338. package/pipeline/foundation/tech-stack.md +9 -9
  339. package/pipeline/integration/add-e2e-testing.md +21 -6
  340. package/pipeline/modeling/domain-modeling.md +10 -7
  341. package/pipeline/modeling/review-domain-modeling.md +17 -6
  342. package/pipeline/parity/platform-parity-review.md +31 -11
  343. package/pipeline/planning/implementation-plan-review.md +21 -10
  344. package/pipeline/planning/implementation-plan.md +52 -19
  345. package/pipeline/pre/create-prd.md +22 -7
  346. package/pipeline/pre/innovate-prd.md +10 -8
  347. package/pipeline/pre/innovate-user-stories.md +9 -7
  348. package/pipeline/pre/review-prd.md +11 -2
  349. package/pipeline/pre/review-user-stories.md +12 -3
  350. package/pipeline/pre/user-stories.md +12 -7
  351. package/pipeline/quality/create-evals.md +10 -6
  352. package/pipeline/quality/operations.md +16 -12
  353. package/pipeline/quality/review-operations.md +19 -10
  354. package/pipeline/quality/review-security.md +21 -11
  355. package/pipeline/quality/review-testing.md +23 -12
  356. package/pipeline/quality/security.md +17 -13
  357. package/pipeline/quality/story-tests.md +6 -4
  358. package/pipeline/specification/api-contracts.md +11 -6
  359. package/pipeline/specification/database-schema.md +12 -6
  360. package/pipeline/specification/review-api.md +18 -9
  361. package/pipeline/specification/review-database.md +18 -9
  362. package/pipeline/specification/review-ux.md +20 -10
  363. package/pipeline/specification/ux-spec.md +8 -5
  364. package/pipeline/validation/critical-path-walkthrough.md +14 -7
  365. package/pipeline/validation/cross-phase-consistency.md +14 -7
  366. package/pipeline/validation/decision-completeness.md +14 -7
  367. package/pipeline/validation/dependency-graph-validation.md +15 -7
  368. package/pipeline/validation/implementability-dry-run.md +15 -7
  369. package/pipeline/validation/scope-creep-check.md +15 -7
  370. package/pipeline/validation/traceability-matrix.md +20 -7
  371. package/pipeline/vision/create-vision.md +267 -0
  372. package/pipeline/vision/innovate-vision.md +157 -0
  373. package/pipeline/vision/review-vision.md +149 -0
  374. package/skills/scaffold-pipeline/SKILL.md +33 -18
  375. package/skills/scaffold-runner/SKILL.md +172 -18
@@ -1,11 +1,12 @@
1
1
  ---
2
2
  name: implementation-plan
3
3
  description: Break architecture into implementable tasks with dependencies
4
+ summary: "Breaks your user stories and architecture into concrete tasks — each scoped to ~150 lines of code and 3 files max, with clear acceptance criteria, no ambiguous decisions, and explicit dependencies."
4
5
  phase: "planning"
5
6
  order: 1210
6
7
  dependencies: [tdd, operations, security, review-architecture, create-evals]
7
8
  outputs: [docs/implementation-plan.md]
8
- reads: [create-prd]
9
+ reads: [create-prd, story-tests, database-schema, api-contracts, ux-spec]
9
10
  conditional: null
10
11
  knowledge-base: [task-decomposition]
11
12
  ---
@@ -17,9 +18,9 @@ have clear inputs/outputs, and be small enough for a single agent session.
17
18
  The primary mapping is Story → Task(s), with PRD as the traceability root.
18
19
 
19
20
  ## Inputs
20
- - docs/system-architecture.md (required) — components to implement
21
- - docs/domain-models/ (required) — domain logic to implement
22
- - docs/adrs/ (required) — technology constraints
21
+ - docs/system-architecture.md (optional — not available in MVP) — components to implement
22
+ - docs/domain-models/ (optional — not available in MVP) — domain logic to implement
23
+ - docs/adrs/ (optional — not available in MVP) — technology constraints
23
24
  - docs/plan.md (required) — features to trace tasks back to
24
25
  - docs/user-stories.md (required) — stories to derive tasks from
25
26
  - docs/tdd-standards.md (required) — testing requirements to incorporate into tasks
@@ -28,31 +29,41 @@ The primary mapping is Story → Task(s), with PRD as the traceability root.
28
29
  - docs/database-schema.md (optional) — data layer tasks
29
30
  - docs/api-contracts.md (optional) — API implementation tasks
30
31
  - docs/ux-spec.md (optional) — frontend tasks
32
+ - tests/acceptance/ (optional) — test skeletons to reference in task descriptions
33
+ - docs/story-tests-map.md (optional) — AC-to-test mapping for task coverage verification
31
34
 
32
35
  ## Expected Outputs
33
36
  - docs/implementation-plan.md — task list with dependencies, sizing, and
34
37
  assignment recommendations
35
38
 
36
39
  ## Quality Criteria
37
- - Every architecture component has implementation tasks
38
- - Task dependencies form a valid DAG (no cycles)
39
- - Each task is scoped for a single agent session (not too large, not too small)
40
- - Tasks include acceptance criteria (how to know it's done)
41
- - Tasks incorporate testing requirements from the testing strategy
42
- - Tasks incorporate security controls from the security review where applicable
43
- - Tasks incorporate operational requirements (monitoring, deployment) where applicable
44
- - Critical path is identified
45
- - Parallelization opportunities are marked with wave plan
46
- - Every user story maps to at least one task
47
- - High-risk tasks are flagged with risk type and mitigation
48
- - Wave summary produced with agent allocation recommendation
40
+ - (mvp) Every architecture component has implementation tasks
41
+ - (mvp) Task dependencies form a valid DAG (no cycles)
42
+ - (mvp) Each task produces ~150 lines of net-new application code (excluding tests and generated files)
43
+ - (mvp) Tasks include acceptance criteria (how to know it's done)
44
+ - (mvp) Tasks incorporate testing requirements from the testing strategy
45
+ - (deep) Tasks reference corresponding test skeletons from tests/acceptance/ where applicable
46
+ - (deep) Tasks incorporate security controls from the security review where applicable
47
+ - (deep) Tasks incorporate operational requirements (monitoring, deployment) where applicable
48
+ - (deep) Critical path is identified
49
+ - (deep) Parallelization opportunities are marked with wave plan
50
+ - (mvp) Every user story maps to at least one task
51
+ - (deep) High-risk tasks are flagged with risk type and mitigation
52
+ - (deep) Wave summary produced with agent allocation recommendation
53
+ - (mvp) No task modifies more than 3 application files (test files excluded; exceptions require justification)
54
+ - (mvp) No task contains unresolved design decisions (agents implement, they don't architect)
55
+ - (mvp) Every code-producing task includes co-located test requirements
56
+ - (deep) Critical path identified with estimated total duration
49
57
 
50
58
  ## Methodology Scaling
51
59
  - **deep**: Detailed task breakdown with story-to-task tracing. Dependency graph.
52
60
  Sizing estimates. Parallelization plan. Agent context requirements per task.
53
61
  Phased delivery milestones.
54
- - **mvp**: Ordered task list with brief descriptions. Key dependencies noted.
55
- Enough to start working sequentially.
62
+ - **mvp**: Ordered task list derived from PRD features and user stories only
63
+ (architecture, domain models, and ADRs are not available at this depth).
64
+ Each task has a brief description, rough size estimate, and key dependency.
65
+ Enough to start working sequentially. Skip architecture decomposition —
66
+ work directly from user story acceptance criteria.
56
67
  - **custom:depth(1-5)**: Depth 1-2: ordered list. Depth 3: add dependencies
57
68
  and sizing. Depth 4-5: full breakdown with parallelization.
58
69
 
@@ -68,10 +79,32 @@ that are in-progress or completed.
68
79
  - **Detect prior artifact**: docs/implementation-plan.md exists
69
80
  - **Preserve**: completed and in-progress task statuses, existing task IDs,
70
81
  dependency relationships for stable tasks, wave assignments for tasks
71
- already started, agent allocation history
82
+ already started, agent allocation history, architecture decisions,
83
+ component boundaries
72
84
  - **Triggers for update**: architecture changed (new components need tasks),
73
85
  user stories added or changed, security review identified new requirements,
74
86
  operations runbook added deployment tasks, specification docs changed
75
87
  - **Conflict resolution**: if architecture restructured a component that has
76
88
  in-progress tasks, flag for user review rather than silently reassigning;
77
89
  re-derive critical path only for unstarted tasks
90
+
91
+ ## Task Size Constraints
92
+
93
+ Before finalizing the implementation plan, scan every task against the five agent
94
+ executability rules from the task-decomposition knowledge base:
95
+
96
+ 1. **Three-File Rule** — Count application files each task modifies (exclude test files).
97
+ Any task touching 4+ files must be split by layer or concern.
98
+ 2. **150-Line Budget** — Estimate net-new application code lines per task. Any task
99
+ likely to produce 200+ lines must be split by feature slice or entity.
100
+ 3. **Single-Concern Rule** — Check each task description for "and" connecting unrelated
101
+ work. Split if the task spans multiple architectural layers or feature domains.
102
+ 4. **Decision-Free Execution** — Verify all design decisions are resolved in the task
103
+ description. No "choose", "determine", "decide", or "evaluate options" language.
104
+ Resolve decisions inline before presenting the plan.
105
+ 5. **Test Co-location** — Confirm every code-producing task includes its test
106
+ requirements. No "write tests later" aggregation tasks.
107
+
108
+ Tasks that fail any rule should be split inline. If a task genuinely can't be split
109
+ further, annotate with `<!-- agent-size-exception: reason -->`. The implementation
110
+ plan review will flag unjustified exceptions.
@@ -1,18 +1,22 @@
1
1
  ---
2
2
  name: create-prd
3
3
  description: Create a product requirements document from a project idea
4
+ summary: "Translates your vision (or idea, if no vision exists) into a product requirements document with problem statement, user personas, prioritized feature list, constraints, non-functional requirements, and measurable success criteria."
4
5
  phase: "pre"
5
6
  order: 110
6
7
  dependencies: []
7
8
  outputs: [docs/plan.md]
8
9
  conditional: null
9
10
  knowledge-base: [prd-craft]
11
+ reads: [create-vision]
10
12
  ---
11
13
 
12
14
  ## Purpose
13
15
  Transform a project idea into a structured product requirements document that
14
16
  defines the problem, target users, features, constraints, and success criteria.
15
17
  This is the foundation document that all subsequent phases reference.
18
+ The PRD drives user stories, architecture decisions, and implementation planning
19
+ throughout the entire pipeline.
16
20
 
17
21
  ## Inputs
18
22
  - Project idea (provided by user verbally or in a brief)
@@ -22,12 +26,13 @@ This is the foundation document that all subsequent phases reference.
22
26
  - docs/plan.md — Product requirements document
23
27
 
24
28
  ## Quality Criteria
25
- - Problem statement is specific and testable (not vague aspirations)
26
- - Target users are identified with their needs
27
- - Features are scoped with clear boundaries (what's in, what's out)
28
- - Success criteria are measurable
29
- - Constraints (technical, timeline, budget, team) are documented
30
- - Non-functional requirements are explicit (performance, security, accessibility)
29
+ - (mvp) Problem statement names a specific user group, a specific pain point, and a falsifiable hypothesis about the solution
30
+ - (mvp) Target users are identified with their needs
31
+ - (mvp) Features are scoped with clear boundaries (what's in, what's out)
32
+ - (mvp) Success criteria are measurable
33
+ - (mvp) Each non-functional requirement has a measurable target or threshold (e.g., 'page load < 2s', 'WCAG AA')
34
+ - (mvp) No two sections contain contradictory statements about the same concept
35
+ - (deep) Constraints (technical, timeline, budget, team) are documented
31
36
 
32
37
  ## Methodology Scaling
33
38
  - **deep**: Comprehensive PRD. Competitive analysis, detailed user personas,
@@ -47,8 +52,18 @@ Preserve existing decisions unless explicitly revisiting them.
47
52
  ## Update Mode Specifics
48
53
  - **Detect prior artifact**: docs/plan.md exists
49
54
  - **Preserve**: problem statement, existing feature definitions, success criteria,
50
- user personas, and scope boundaries unless user explicitly requests changes
55
+ user personas, scope boundaries, and enhancement markers (`<!-- enhancement: ... -->`)
56
+ unless user explicitly requests changes
51
57
  - **Triggers for update**: user provides new requirements, scope adjustment
52
58
  requested, constraints changed (timeline, budget, team), new user research
53
59
  - **Conflict resolution**: new features are appended to the feature list with
54
60
  clear versioning; changed constraints are documented with rationale for change
61
+
62
+ ### Understand the Vision
63
+
64
+ **If `docs/vision.md` exists**: Read it completely. This is your strategic foundation — the vision document has already established the problem space, target audience, value proposition, competitive landscape, and guiding principles. Skip the vision discovery questions below and use the vision document as the North Star for this PRD. Reference it throughout, ensuring every requirement aligns with the stated vision and guiding principles. Focus your discovery questions on translating the vision into concrete product requirements rather than re-exploring strategic direction.
65
+
66
+ **If `docs/vision.md` does NOT exist**:
67
+ - What problem does this solve and for whom? Push me to be specific about the target user.
68
+ - What does success look like? How will we know this is working?
69
+ - What's the single most important thing this app must do well?
@@ -1,12 +1,13 @@
1
1
  ---
2
2
  name: innovate-prd
3
3
  description: Discover feature-level innovation opportunities in the PRD
4
+ summary: "Analyzes the PRD for feature-level gaps — competitive blind spots, UX enhancements, AI-native possibilities — and proposes additions for your approval."
4
5
  phase: "pre"
5
6
  order: 130
6
7
  dependencies: [review-prd]
7
- outputs: [docs/prd-innovation.md, docs/reviews/prd-innovation/review-summary.md, docs/reviews/prd-innovation/codex-review.json, docs/reviews/prd-innovation/gemini-review.json]
8
+ outputs: [docs/prd-innovation.md, docs/plan.md, docs/reviews/prd-innovation/review-summary.md, docs/reviews/prd-innovation/codex-review.json, docs/reviews/prd-innovation/gemini-review.json]
8
9
  conditional: "if-needed"
9
- knowledge-base: [prd-innovation, prd-craft]
10
+ knowledge-base: [prd-innovation, prd-craft, multi-model-review-dispatch]
10
11
  ---
11
12
 
12
13
  ## Purpose
@@ -32,12 +33,13 @@ creative opportunities and competitive insights.
32
33
  - docs/reviews/prd-innovation/gemini-review.json (depth 4+, if available) — raw Gemini suggestions
33
34
 
34
35
  ## Quality Criteria
35
- - Enhancements are feature-level, not UX-level polish
36
- - Each suggestion has a cost estimate (trivial/moderate/significant)
37
- - Each suggestion has a clear user benefit and impact assessment
38
- - Approved innovations are documented to the same standard as existing features
39
- - PRD scope boundaries are respected — no uncontrolled scope creep
36
+ - (mvp) Enhancements are feature-level, not UX-level polish
37
+ - (mvp) Each suggestion has a cost estimate (trivial/moderate/significant)
38
+ - (mvp) Each suggestion has a clear user benefit and impact assessment
39
+ - (mvp) Each approved innovation includes: problem it solves, target users, scope boundaries, and success criteria
40
+ - (mvp) PRD scope boundaries are respected — no uncontrolled scope creep
40
41
  - User approval is obtained before modifying the PRD
42
+ - User approval for each accepted innovation documented as a question-response pair with timestamp (e.g., "Q: Accept feature X? A: Yes — 2025-01-15T14:30Z")
41
43
  - (depth 4+) Multi-model suggestions deduplicated and synthesized with unique ideas from each model highlighted
42
44
 
43
45
  ## Methodology Scaling
@@ -47,7 +49,7 @@ creative opportunities and competitive insights.
47
49
  innovation dispatched to Codex and Gemini if available, with graceful
48
50
  fallback to Claude-only enhanced brainstorming.
49
51
  - **mvp**: Not applicable — this step is conditional and skipped in MVP.
50
- - **custom:depth(1-5)**: Depth 1-2: not typically enabled. Depth 3: quick scan
52
+ - **custom:depth(1-5)**: Depth 1-2: skip (not enough context for meaningful innovation at this depth). Depth 3: quick scan
51
53
  for obvious gaps and missing expected features. Depth 4: full innovation
52
54
  pass + one external model (if CLI available). Depth 5: full innovation pass
53
55
  + multi-model with deduplication and synthesis.
@@ -1,12 +1,13 @@
1
1
  ---
2
2
  name: innovate-user-stories
3
3
  description: Discover UX-level enhancements and innovation opportunities in user stories
4
+ summary: "Identifies UX enhancement opportunities — progressive disclosure, smart defaults, accessibility improvements — and integrates approved changes into existing stories."
4
5
  phase: "pre"
5
6
  order: 160
6
7
  dependencies: [review-user-stories]
7
8
  outputs: [docs/user-stories-innovation.md, docs/reviews/user-stories-innovation/review-summary.md, docs/reviews/user-stories-innovation/codex-review.json, docs/reviews/user-stories-innovation/gemini-review.json]
8
9
  conditional: "if-needed"
9
- knowledge-base: [user-stories, user-story-innovation]
10
+ knowledge-base: [user-stories, user-story-innovation, multi-model-review-dispatch]
10
11
  ---
11
12
 
12
13
  ## Purpose
@@ -33,11 +34,12 @@ enhancement opportunities.
33
34
  - docs/reviews/user-stories-innovation/gemini-review.json (depth 4+, if available) — raw Gemini suggestions
34
35
 
35
36
  ## Quality Criteria
36
- - Enhancements are UX-level, not new features
37
- - Each suggestion has a cost estimate (trivial/moderate/significant)
38
- - Each suggestion has a clear user benefit
39
- - Approved enhancements are integrated into existing stories (not new stories)
40
- - PRD scope boundaries are respected — no scope creep
37
+ - (mvp) Enhancements are UX-level, not new features
38
+ - (mvp) Each suggestion has a cost estimate (trivial/moderate/significant)
39
+ - (mvp) Each suggestion has a clear user benefit
40
+ - (mvp) Approved enhancements are integrated into existing stories (not new stories)
41
+ - (mvp) PRD scope boundaries are respected — no scope creep
42
+ - User approval for each accepted innovation documented as a question-response pair with timestamp (e.g., "Q: Accept enhancement X? A: Yes — 2025-01-15T14:30Z")
41
43
  - (depth 4+) Multi-model suggestions deduplicated and synthesized with unique ideas from each model highlighted
42
44
 
43
45
  ## Methodology Scaling
@@ -47,7 +49,7 @@ enhancement opportunities.
47
49
  innovation dispatched to Codex and Gemini if available, with graceful
48
50
  fallback to Claude-only enhanced brainstorming.
49
51
  - **mvp**: Not applicable — this step is conditional and skipped in MVP.
50
- - **custom:depth(1-5)**: Depth 1-2: not typically enabled. Depth 3: quick
52
+ - **custom:depth(1-5)**: Depth 1-2: skip (not enough context for meaningful innovation at this depth). Depth 3: quick
51
53
  scan for obvious improvements. Depth 4: full innovation pass + one external
52
54
  model (if CLI available). Depth 5: full innovation pass + multi-model with
53
55
  deduplication and synthesis.
@@ -1,12 +1,13 @@
1
1
  ---
2
2
  name: review-prd
3
3
  description: Multi-pass review of the PRD for completeness, clarity, and downstream readiness
4
+ summary: "Reviews the PRD across eight passes — problem rigor, persona coverage, feature scoping, success criteria, internal consistency, constraints, non-functional requirements — and fixes blocking issues."
4
5
  phase: "pre"
5
6
  order: 120
6
7
  dependencies: [create-prd]
7
8
  outputs: [docs/reviews/pre-review-prd.md, docs/reviews/prd/review-summary.md, docs/reviews/prd/codex-review.json, docs/reviews/prd/gemini-review.json]
8
9
  conditional: null
9
- knowledge-base: [review-methodology, review-prd, prd-craft, gap-analysis]
10
+ knowledge-base: [review-methodology, review-prd, prd-craft, gap-analysis, multi-model-review-dispatch, review-step-template]
10
11
  ---
11
12
 
12
13
  ## Purpose
@@ -30,11 +31,12 @@ independent review validation.
30
31
  - docs/reviews/prd/gemini-review.json (depth 4+, if available) — raw Gemini findings
31
32
 
32
33
  ## Quality Criteria
34
+ - (mvp) Passes 1-2 executed with findings documented
33
35
  - All review passes executed with findings documented
34
36
  - Every finding categorized by severity (P0-P3)
35
37
  - Fix plan created for P0 and P1 findings
36
38
  - Fixes applied and re-validated
37
- - Downstream readiness confirmed (User Stories can proceed)
39
+ - (mvp) Downstream readiness confirmed (User Stories can proceed)
38
40
  - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
39
41
 
40
42
  ## Methodology Scaling
@@ -54,3 +56,10 @@ If docs/reviews/pre-review-prd.md exists, this is a re-review. Read previous
54
56
  findings, check which were addressed, run review passes again on updated PRD.
55
57
  If multi-model review artifacts exist under docs/reviews/prd/, preserve prior
56
58
  findings still valid.
59
+
60
+ ## Update Mode Specifics
61
+
62
+ - **Detect**: `docs/reviews/review-prd.md` exists with tracking comment
63
+ - **Preserve**: Prior findings still valid, resolution decisions, multi-model review artifacts
64
+ - **Triggers**: Upstream artifact changed since last review (compare tracking comment dates)
65
+ - **Conflict resolution**: Previously resolved findings reappearing = regression; flag and re-evaluate
@@ -1,12 +1,13 @@
1
1
  ---
2
2
  name: review-user-stories
3
3
  description: Multi-pass review of user stories for PRD coverage, quality, and downstream readiness
4
+ summary: "Verifies every PRD feature maps to at least one story, checks that acceptance criteria are specific enough to test, validates story independence, and builds a requirements traceability index at higher depths."
4
5
  phase: "pre"
5
6
  order: 150
6
7
  dependencies: [user-stories]
7
8
  outputs: [docs/reviews/pre-review-user-stories.md, docs/reviews/user-stories/requirements-index.md, docs/reviews/user-stories/coverage.json, docs/reviews/user-stories/review-summary.md]
8
9
  conditional: null
9
- knowledge-base: [review-methodology, review-user-stories]
10
+ knowledge-base: [review-methodology, review-user-stories, multi-model-review-dispatch, review-step-template]
10
11
  ---
11
12
 
12
13
  ## Purpose
@@ -33,14 +34,15 @@ independent coverage validation.
33
34
  synthesis with coverage verification
34
35
 
35
36
  ## Quality Criteria
37
+ - (mvp) Pass 1 (PRD coverage) executed with findings documented
36
38
  - All review passes executed with findings documented
37
39
  - Every finding categorized by severity (P0-P3)
38
40
  - Fix plan created for P0 and P1 findings
39
41
  - Fixes applied and re-validated
40
- - Downstream readiness confirmed (modeling phase can proceed)
42
+ - (mvp) Every story has at least one testable acceptance criterion, and every PRD feature maps to at least one story
41
43
  - (depth 4+) Every atomic PRD requirement has a REQ-xxx ID in the requirements index
42
44
  - (depth 4+) Coverage matrix maps every REQ to at least one US (100% coverage target)
43
- - (depth 5) Multi-model findings synthesized with consensus/disagreement analysis
45
+ - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
44
46
 
45
47
  ## Methodology Scaling
46
48
  - **deep**: All 6 review passes from the knowledge base. Full findings report
@@ -58,3 +60,10 @@ If docs/reviews/pre-review-user-stories.md exists, this is a re-review. Read
58
60
  previous findings, check which were addressed, run review passes again on
59
61
  updated stories. If docs/reviews/user-stories/requirements-index.md exists,
60
62
  preserve requirement IDs — never renumber REQ-xxx IDs.
63
+
64
+ ## Update Mode Specifics
65
+
66
+ - **Detect**: `docs/reviews/pre-review-user-stories.md` exists with tracking comment
67
+ - **Preserve**: Prior findings still valid, REQ-xxx IDs, resolution decisions, multi-model review artifacts
68
+ - **Triggers**: Upstream artifact changed since last review (compare tracking comment dates)
69
+ - **Conflict resolution**: Previously resolved findings reappearing = regression; flag and re-evaluate
@@ -1,10 +1,12 @@
1
1
  ---
2
2
  name: user-stories
3
3
  description: Translate PRD features into user stories with acceptance criteria
4
+ summary: "Breaks every PRD feature into user stories organized by epic, each with testable acceptance criteria in Given/When/Then format."
4
5
  phase: "pre"
5
6
  order: 140
6
7
  dependencies: [review-prd]
7
8
  outputs: [docs/user-stories.md]
9
+ reads: [innovate-prd]
8
10
  conditional: null
9
11
  knowledge-base: [user-stories]
10
12
  ---
@@ -25,12 +27,13 @@ task decomposition downstream.
25
27
  criteria scaled to the configured depth level
26
28
 
27
29
  ## Quality Criteria
28
- - Every PRD feature maps to at least one user story
29
- - Stories follow INVEST criteria (Independent, Negotiable, Valuable, Estimable, Small, Testable)
30
- - Acceptance criteria are testable — unambiguous pass/fail
31
- - No story too large to implement in 1-3 focused agent sessions
32
- - Every PRD persona is represented in at least one story
33
- - Stories describe user behavior, not implementation details
30
+ - (mvp) Every PRD feature maps to at least one user story
31
+ - (deep) Stories follow INVEST criteria (Independent, Negotiable, Valuable, Estimable, Small, Testable)
32
+ - (mvp) Acceptance criteria are testable — unambiguous pass/fail
33
+ - (deep) No story has more than 7 acceptance criteria
34
+ - (mvp) Every PRD persona is represented in at least one story
35
+ - (mvp) Stories describe user behavior, not implementation details
36
+ - (mvp) Each story is independent — reordering stories does not break acceptance criteria
34
37
 
35
38
  ## Methodology Scaling
36
39
  - **deep**: Full story template with IDs, persona journey maps, cross-story
@@ -50,7 +53,9 @@ PRESERVE, get approval before modifying. Preserve existing story IDs.
50
53
  ## Update Mode Specifics
51
54
  - **Detect prior artifact**: docs/user-stories.md exists
52
55
  - **Preserve**: existing story IDs, epic groupings, acceptance criteria that
53
- haven't been invalidated, story-to-PRD-feature traceability
56
+ haven't been invalidated, story-to-PRD-feature traceability, enhancement
57
+ markers (`<!-- enhancement: ... -->`), priority decisions, story ID format
58
+ (US-xxx)
54
59
  - **Triggers for update**: PRD features added or changed, innovation suggestions
55
60
  accepted, user personas expanded, review findings require story adjustments
56
61
  - **Conflict resolution**: never reuse a retired story ID; if a story's scope
@@ -1,11 +1,12 @@
1
1
  ---
2
2
  name: create-evals
3
3
  description: Generate project-specific eval checks from standards documentation
4
+ summary: "Generates automated checks that verify your code matches your documented standards — file placement, naming conventions, feature-to-test coverage, API contract alignment — using your project's own test framework."
4
5
  phase: "quality"
5
6
  order: 920
6
7
  dependencies: [tdd, story-tests]
7
8
  outputs: [tests/evals/, docs/eval-standards.md]
8
- reads: [story-tests]
9
+ reads: [security, dev-env-setup, api-contracts, database-schema, ux-spec]
9
10
  conditional: null
10
11
  knowledge-base: [eval-craft, testing-strategy]
11
12
  ---
@@ -70,6 +71,8 @@ Supporting:
70
71
  - (deep) Adherence, security, and error-handling evals include exclusion mechanisms
71
72
  - (deep) docs/eval-standards.md explicitly documents what evals do NOT check
72
73
  - (deep) Full eval suite runs in under 30 seconds
74
+ - (mvp) `make eval` (or equivalent) runs and all generated evals pass
75
+ - (deep) Eval false-positive assessment: each eval category documents at least one scenario where valid code might incorrectly fail, with exclusion mechanism
73
76
 
74
77
  ## Methodology Scaling
75
78
  - **deep**: All 13 eval categories (conditional on doc existence). Stack-specific
@@ -83,11 +86,12 @@ Supporting:
83
86
  - Depth 5: All 13 categories (Security, API, Database, Accessibility, Performance)
84
87
 
85
88
  ## Mode Detection
86
- Update mode if tests/evals/ directory exists. In update mode: regenerate
87
- consistency, structure, cross-doc, and conditional category evals. Preserve
88
- adherence, security, and error-handling eval exclusions. Regenerate coverage
89
- evals only if plan.md or user-stories.md changed. Add/remove conditional
90
- categories based on whether their source doc exists.
89
+ Update mode if tests/evals/ directory or docs/eval-standards.md exists. In
90
+ update mode: regenerate consistency, structure, cross-doc, and conditional
91
+ category evals. Preserve adherence, security, and error-handling eval
92
+ exclusions. Regenerate coverage evals only if plan.md or user-stories.md
93
+ changed. Add/remove conditional categories based on whether their source doc
94
+ exists.
91
95
 
92
96
  ## Update Mode Specifics
93
97
  - **Detect prior artifact**: tests/evals/ directory exists with eval test files
@@ -1,10 +1,12 @@
1
1
  ---
2
2
  name: operations
3
3
  description: Define deployment pipeline, deployment strategy, monitoring, alerting, and incident response
4
+ summary: "Designs your deployment pipeline (build, test, deploy, verify, rollback), defines monitoring metrics with alert thresholds, and writes incident response procedures with rollback instructions."
4
5
  phase: "quality"
5
6
  order: 930
6
7
  dependencies: [review-testing]
7
8
  outputs: [docs/operations-runbook.md]
9
+ reads: [system-architecture, adrs, dev-env-setup, git-workflow]
8
10
  conditional: null
9
11
  knowledge-base: [operations-runbook]
10
12
  ---
@@ -26,19 +28,21 @@ development setup rather than redefining it.
26
28
  - docs/operations-runbook.md — production operations and deployment runbook
27
29
 
28
30
  ## Quality Criteria
29
- - Deployment pipeline extends existing CI (build, deploy, post-deploy stages)
30
- - Deployment pipeline has explicit stages (build → test → deploy → verify → rollback-ready)
31
- - Does not redefine base CI stages (lint, test) from git-workflow
32
- - Deployment strategy chosen with rollback procedure
33
- - Rollback procedure tested with specific trigger conditions (e.g., error rate > X%, health check failure)
34
- - Runbook structured by operational scenario (deployment, rollback, incident, scaling)
35
- - Monitoring covers key metrics (latency, error rate, saturation)
36
- - Each monitoring metric has an explicit threshold with rationale
37
- - Health check endpoints defined with expected response codes and latency bounds
38
- - Log aggregation strategy specifies retention period and searchable fields
39
- - Alerting thresholds are justified, not arbitrary
31
+ - (mvp) Deployment pipeline extends existing CI (build, deploy, post-deploy stages)
32
+ - (mvp) Deployment pipeline has explicit stages (build → test → deploy → verify → rollback-ready)
33
+ - (mvp) Does not redefine base CI stages (lint, test) from git-workflow
34
+ - (mvp) Deployment strategy chosen with rollback procedure
35
+ - (deep) Rollback procedure tested with specific trigger conditions (e.g., error rate > X%, health check failure)
36
+ - (deep) Runbook structured by operational scenario (deployment, rollback, incident, scaling)
37
+ - (mvp) Monitoring covers key metrics (latency, error rate, saturation)
38
+ - (deep) Each monitoring metric has an explicit threshold with rationale
39
+ - (deep) Health check endpoints defined with expected response codes and latency bounds
40
+ - (deep) Log aggregation strategy specifies retention period and searchable fields
41
+ - (deep) Each alert threshold documents: the metric, threshold value, business impact if crossed, and mitigation action
40
42
  - References docs/dev-setup.md for local dev — does not redefine it
41
- - Incident response process defined
43
+ - (deep) Incident response process defined
44
+ - (deep) Recovery Time Objective (RTO) and Recovery Point Objective (RPO) documented for each critical service
45
+ - (deep) Secret rotation procedure documented and tested
42
46
 
43
47
  ## Methodology Scaling
44
48
  - **deep**: Full runbook. Deployment topology diagrams. Monitoring dashboard
@@ -1,12 +1,13 @@
1
1
  ---
2
2
  name: review-operations
3
3
  description: Review operations runbook for completeness and safety
4
+ summary: "Verifies the full deployment lifecycle is documented, monitoring covers latency/errors/saturation, alert thresholds have rationale, and common failure scenarios have runbook entries."
4
5
  phase: "quality"
5
6
  order: 940
6
7
  dependencies: [operations]
7
8
  outputs: [docs/reviews/review-operations.md, docs/reviews/operations/review-summary.md, docs/reviews/operations/codex-review.json, docs/reviews/operations/gemini-review.json]
8
9
  conditional: null
9
- knowledge-base: [review-methodology, review-operations]
10
+ knowledge-base: [review-methodology, review-operations, multi-model-review-dispatch, review-step-template]
10
11
  ---
11
12
 
12
13
  ## Purpose
@@ -29,21 +30,29 @@ independent review validation.
29
30
  - docs/reviews/operations/gemini-review.json (depth 4+, if available) — raw Gemini findings
30
31
 
31
32
  ## Quality Criteria
32
- - Deployment lifecycle fully documented (deploy, verify, rollback)
33
- - Monitoring covers all critical metrics
34
- - Alert thresholds have rationale
35
- - Common failure scenarios have runbook entries
36
- - Dev environment parity assessed
33
+ - (mvp) Deployment lifecycle fully documented (deploy, verify, rollback)
34
+ - (mvp) Monitoring verified against minimum set: latency, error rate, and saturation
35
+ - (deep) Alert thresholds have rationale
36
+ - (deep) Common failure scenarios have runbook entries
37
+ - (deep) Dev/staging/production environment differences documented in operations runbook
38
+ - Every finding categorized P0-P3 with specific runbook section, metric, and issue
39
+ - Fix plan documented for all P0/P1 findings; fixes applied to operations-runbook.md and re-validated
40
+ - Downstream readiness confirmed — no unresolved P0 or P1 findings remain before security step proceeds
37
41
  - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
38
42
 
39
43
  ## Methodology Scaling
40
44
  - **deep**: Full multi-pass review. Multi-model review dispatched to Codex and
41
45
  Gemini if available, with graceful fallback to Claude-only enhanced review.
42
- **mvp**: Deployment coverage only.
43
- - **custom:depth(1-5)**: Depth 1-3: scale passes with depth. Depth 4: full
44
- review + one external model (if CLI available). Depth 5: full review +
45
- multi-model with reconciliation.
46
+ - **mvp**: Deployment coverage only.
47
+ - **custom:depth(1-5)**: Depth 1: monitoring and logging pass only. Depth 2: add deployment and rollback pass. Depth 3: add incident response and scaling passes. Depth 4: add external model review. Depth 5: multi-model review with reconciliation.
46
48
 
47
49
  ## Mode Detection
48
50
  Re-review mode if previous review exists. If multi-model review artifacts exist
49
51
  under docs/reviews/operations/, preserve prior findings still valid.
52
+
53
+ ## Update Mode Specifics
54
+
55
+ - **Detect**: `docs/reviews/review-operations.md` exists with tracking comment
56
+ - **Preserve**: Prior findings still valid, resolution decisions, multi-model review artifacts
57
+ - **Triggers**: Upstream artifact changed since last review (compare tracking comment dates)
58
+ - **Conflict resolution**: Previously resolved findings reappearing = regression; flag and re-evaluate
@@ -1,12 +1,14 @@
1
1
  ---
2
2
  name: review-security
3
3
  description: Review security review for coverage and correctness
4
+ summary: "Verifies OWASP coverage is complete, auth boundaries match API contracts, every secret is accounted for, and the threat model covers all trust boundaries. Highest priority for multi-model review."
4
5
  phase: "quality"
5
6
  order: 960
6
7
  dependencies: [security]
7
8
  outputs: [docs/reviews/review-security.md, docs/reviews/security/review-summary.md, docs/reviews/security/codex-review.json, docs/reviews/security/gemini-review.json]
8
9
  conditional: null
9
- knowledge-base: [review-methodology, review-security]
10
+ reads: [api-contracts]
11
+ knowledge-base: [review-methodology, review-security, multi-model-review-dispatch, review-step-template]
10
12
  ---
11
13
 
12
14
  ## Purpose
@@ -31,22 +33,30 @@ independent review validation.
31
33
  - docs/reviews/security/gemini-review.json (depth 4+, if available) — raw Gemini findings
32
34
 
33
35
  ## Quality Criteria
34
- - OWASP coverage verified for this project
35
- - Auth boundaries match API contract auth requirements
36
- - Secrets management is complete (no gaps)
37
- - Dependency audit scope covers all dependencies
38
- - Threat model covers all trust boundaries
39
- - Data classification is complete
36
+ - (mvp) OWASP coverage verified for this project
37
+ - (deep) Auth boundaries match API contract auth requirements
38
+ - (deep) Secrets management covers: all environment variables, API keys, database credentials, and third-party tokens
39
+ - (deep) Dependency audit scope covers all dependencies
40
+ - (deep) Threat model covers all trust boundaries
41
+ - (deep) Data classification covers every entity in the domain model
42
+ - Every finding categorized P0-P3 with specific control, boundary, and issue
43
+ - Fix plan documented for all P0/P1 findings; fixes applied to security-review.md and re-validated
44
+ - Downstream readiness confirmed — no unresolved P0 or P1 findings remain before planning phase proceeds
40
45
  - (depth 4+) Multi-model findings synthesized with consensus/disagreement analysis
41
46
 
42
47
  ## Methodology Scaling
43
48
  - **deep**: Full multi-pass review. Multi-model review dispatched to Codex and
44
49
  Gemini if available, with graceful fallback to Claude-only enhanced review.
45
- **mvp**: OWASP coverage check only.
46
- - **custom:depth(1-5)**: Depth 1-3: scale passes with depth. Depth 4: full
47
- review + one external model (if CLI available). Depth 5: full review +
48
- multi-model with reconciliation.
50
+ - **mvp**: OWASP coverage check only.
51
+ - **custom:depth(1-5)**: Depth 1: OWASP top 10 and secrets management pass only. Depth 2: add auth boundary and input validation passes. Depth 3: add dependency audit and data protection passes. Depth 4: add external model security review. Depth 5: multi-model security review with reconciliation.
49
52
 
50
53
  ## Mode Detection
51
54
  Re-review mode if previous review exists. If multi-model review artifacts exist
52
55
  under docs/reviews/security/, preserve prior findings still valid.
56
+
57
+ ## Update Mode Specifics
58
+
59
+ - **Detect**: `docs/reviews/review-security.md` exists with tracking comment
60
+ - **Preserve**: Prior findings still valid, resolution decisions, multi-model review artifacts
61
+ - **Triggers**: Upstream artifact changed since last review (compare tracking comment dates)
62
+ - **Conflict resolution**: Previously resolved findings reappearing = regression; flag and re-evaluate