@opengsd/gsd-core 1.2.0-rc.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (503) hide show
  1. package/LICENSE +21 -0
  2. package/README.ja-JP.md +870 -0
  3. package/README.ko-KR.md +861 -0
  4. package/README.md +301 -0
  5. package/README.pt-BR.md +492 -0
  6. package/README.zh-CN.md +842 -0
  7. package/agents/gsd-advisor-researcher.md +127 -0
  8. package/agents/gsd-ai-researcher.md +133 -0
  9. package/agents/gsd-assumptions-analyzer.md +105 -0
  10. package/agents/gsd-code-fixer.md +668 -0
  11. package/agents/gsd-code-reviewer.md +387 -0
  12. package/agents/gsd-codebase-mapper.md +853 -0
  13. package/agents/gsd-debug-session-manager.md +314 -0
  14. package/agents/gsd-debugger.md +1452 -0
  15. package/agents/gsd-doc-classifier.md +168 -0
  16. package/agents/gsd-doc-synthesizer.md +204 -0
  17. package/agents/gsd-doc-verifier.md +217 -0
  18. package/agents/gsd-doc-writer.md +615 -0
  19. package/agents/gsd-domain-researcher.md +153 -0
  20. package/agents/gsd-eval-auditor.md +191 -0
  21. package/agents/gsd-eval-planner.md +154 -0
  22. package/agents/gsd-executor.md +772 -0
  23. package/agents/gsd-framework-selector.md +160 -0
  24. package/agents/gsd-integration-checker.md +470 -0
  25. package/agents/gsd-intel-updater.md +342 -0
  26. package/agents/gsd-nyquist-auditor.md +203 -0
  27. package/agents/gsd-pattern-mapper.md +335 -0
  28. package/agents/gsd-phase-researcher.md +928 -0
  29. package/agents/gsd-plan-checker.md +978 -0
  30. package/agents/gsd-planner.md +1218 -0
  31. package/agents/gsd-project-researcher.md +677 -0
  32. package/agents/gsd-research-synthesizer.md +255 -0
  33. package/agents/gsd-roadmapper.md +688 -0
  34. package/agents/gsd-security-auditor.md +155 -0
  35. package/agents/gsd-ui-auditor.md +495 -0
  36. package/agents/gsd-ui-checker.md +309 -0
  37. package/agents/gsd-ui-researcher.md +380 -0
  38. package/agents/gsd-user-profiler.md +171 -0
  39. package/agents/gsd-verifier.md +917 -0
  40. package/bin/install.js +10936 -0
  41. package/bin/lib/ui-safety-gate.cjs +107 -0
  42. package/commands/gsd/add-tests.md +42 -0
  43. package/commands/gsd/ai-integration-phase.md +37 -0
  44. package/commands/gsd/audit-fix.md +34 -0
  45. package/commands/gsd/audit-milestone.md +37 -0
  46. package/commands/gsd/audit-uat.md +24 -0
  47. package/commands/gsd/autonomous.md +46 -0
  48. package/commands/gsd/capture.md +62 -0
  49. package/commands/gsd/cleanup.md +24 -0
  50. package/commands/gsd/code-review.md +59 -0
  51. package/commands/gsd/complete-milestone.md +143 -0
  52. package/commands/gsd/config.md +56 -0
  53. package/commands/gsd/debug.md +52 -0
  54. package/commands/gsd/discuss-phase.md +76 -0
  55. package/commands/gsd/docs-update.md +49 -0
  56. package/commands/gsd/eval-review.md +33 -0
  57. package/commands/gsd/execute-phase.md +64 -0
  58. package/commands/gsd/explore.md +27 -0
  59. package/commands/gsd/extract-learnings.md +23 -0
  60. package/commands/gsd/fast.md +31 -0
  61. package/commands/gsd/forensics.md +57 -0
  62. package/commands/gsd/graphify.md +199 -0
  63. package/commands/gsd/health.md +31 -0
  64. package/commands/gsd/help.md +28 -0
  65. package/commands/gsd/import.md +41 -0
  66. package/commands/gsd/inbox.md +39 -0
  67. package/commands/gsd/ingest-docs.md +42 -0
  68. package/commands/gsd/manager.md +45 -0
  69. package/commands/gsd/map-codebase.md +83 -0
  70. package/commands/gsd/milestone-summary.md +51 -0
  71. package/commands/gsd/mvp-phase.md +45 -0
  72. package/commands/gsd/new-milestone.md +45 -0
  73. package/commands/gsd/new-project.md +47 -0
  74. package/commands/gsd/ns-context.md +23 -0
  75. package/commands/gsd/ns-ideate.md +24 -0
  76. package/commands/gsd/ns-manage.md +29 -0
  77. package/commands/gsd/ns-project.md +22 -0
  78. package/commands/gsd/ns-review.md +26 -0
  79. package/commands/gsd/ns-workflow.md +28 -0
  80. package/commands/gsd/pause-work.md +43 -0
  81. package/commands/gsd/phase.md +56 -0
  82. package/commands/gsd/plan-phase.md +62 -0
  83. package/commands/gsd/plan-review-convergence.md +59 -0
  84. package/commands/gsd/pr-branch.md +26 -0
  85. package/commands/gsd/profile-user.md +46 -0
  86. package/commands/gsd/progress.md +47 -0
  87. package/commands/gsd/quick.md +174 -0
  88. package/commands/gsd/resume-work.md +30 -0
  89. package/commands/gsd/review-backlog.md +63 -0
  90. package/commands/gsd/review.md +41 -0
  91. package/commands/gsd/secure-phase.md +36 -0
  92. package/commands/gsd/settings.md +29 -0
  93. package/commands/gsd/ship.md +24 -0
  94. package/commands/gsd/sketch.md +60 -0
  95. package/commands/gsd/spec-phase.md +63 -0
  96. package/commands/gsd/spike.md +57 -0
  97. package/commands/gsd/stats.md +19 -0
  98. package/commands/gsd/surface.md +155 -0
  99. package/commands/gsd/thread.md +24 -0
  100. package/commands/gsd/ui-phase.md +35 -0
  101. package/commands/gsd/ui-review.md +33 -0
  102. package/commands/gsd/ultraplan-phase.md +34 -0
  103. package/commands/gsd/undo.md +35 -0
  104. package/commands/gsd/update.md +48 -0
  105. package/commands/gsd/validate-phase.md +36 -0
  106. package/commands/gsd/verify-work.md +39 -0
  107. package/commands/gsd/workspace.md +52 -0
  108. package/commands/gsd/workstreams.md +70 -0
  109. package/get-shit-done/bin/check-latest-version.cjs +106 -0
  110. package/get-shit-done/bin/gsd-tools.cjs +1676 -0
  111. package/get-shit-done/bin/lib/active-workstream-store.cjs +302 -0
  112. package/get-shit-done/bin/lib/adr-parser.cjs +394 -0
  113. package/get-shit-done/bin/lib/agent-command-router.cjs +65 -0
  114. package/get-shit-done/bin/lib/artifacts.cjs +53 -0
  115. package/get-shit-done/bin/lib/audit.cjs +755 -0
  116. package/get-shit-done/bin/lib/check-command-router.cjs +333 -0
  117. package/get-shit-done/bin/lib/cjs-command-router-adapter.cjs +118 -0
  118. package/get-shit-done/bin/lib/clock.cjs +96 -0
  119. package/get-shit-done/bin/lib/clusters.cjs +135 -0
  120. package/get-shit-done/bin/lib/code-review-flags.cjs +74 -0
  121. package/get-shit-done/bin/lib/command-aliases.cjs +815 -0
  122. package/get-shit-done/bin/lib/command-arg-projection.cjs +62 -0
  123. package/get-shit-done/bin/lib/command-routing-hub.cjs +388 -0
  124. package/get-shit-done/bin/lib/commands.cjs +1188 -0
  125. package/get-shit-done/bin/lib/config-schema.cjs +31 -0
  126. package/get-shit-done/bin/lib/config.cjs +728 -0
  127. package/get-shit-done/bin/lib/configuration.cjs +248 -0
  128. package/get-shit-done/bin/lib/context-utilization.cjs +47 -0
  129. package/get-shit-done/bin/lib/core.cjs +2121 -0
  130. package/get-shit-done/bin/lib/decisions.cjs +116 -0
  131. package/get-shit-done/bin/lib/docs.cjs +270 -0
  132. package/get-shit-done/bin/lib/drift.cjs +388 -0
  133. package/get-shit-done/bin/lib/fallow-runner.cjs +109 -0
  134. package/get-shit-done/bin/lib/frontmatter.cjs +389 -0
  135. package/get-shit-done/bin/lib/gap-checker.cjs +205 -0
  136. package/get-shit-done/bin/lib/graphify.cjs +592 -0
  137. package/get-shit-done/bin/lib/gsd2-import.cjs +514 -0
  138. package/get-shit-done/bin/lib/init-command-router.cjs +58 -0
  139. package/get-shit-done/bin/lib/init.cjs +2112 -0
  140. package/get-shit-done/bin/lib/install-profiles.cjs +603 -0
  141. package/get-shit-done/bin/lib/installer-migration-authoring.cjs +117 -0
  142. package/get-shit-done/bin/lib/installer-migration-report.cjs +354 -0
  143. package/get-shit-done/bin/lib/installer-migrations/000-first-time-baseline.cjs +220 -0
  144. package/get-shit-done/bin/lib/installer-migrations/001-legacy-orphan-files.cjs +41 -0
  145. package/get-shit-done/bin/lib/installer-migrations/002-codex-legacy-hooks-json.cjs +80 -0
  146. package/get-shit-done/bin/lib/installer-migrations.cjs +778 -0
  147. package/get-shit-done/bin/lib/intel.cjs +708 -0
  148. package/get-shit-done/bin/lib/learnings.cjs +421 -0
  149. package/get-shit-done/bin/lib/milestone.cjs +314 -0
  150. package/get-shit-done/bin/lib/model-catalog.cjs +212 -0
  151. package/get-shit-done/bin/lib/model-profiles.cjs +31 -0
  152. package/get-shit-done/bin/lib/observability/event.cjs +82 -0
  153. package/get-shit-done/bin/lib/observability/logger.cjs +174 -0
  154. package/get-shit-done/bin/lib/observability/redaction.cjs +50 -0
  155. package/get-shit-done/bin/lib/package-identity.cjs +31 -0
  156. package/get-shit-done/bin/lib/phase-command-router.cjs +191 -0
  157. package/get-shit-done/bin/lib/phase-lifecycle.cjs +80 -0
  158. package/get-shit-done/bin/lib/phase.cjs +1607 -0
  159. package/get-shit-done/bin/lib/phases-command-router.cjs +39 -0
  160. package/get-shit-done/bin/lib/plan-scan.cjs +97 -0
  161. package/get-shit-done/bin/lib/planning-workspace.cjs +238 -0
  162. package/get-shit-done/bin/lib/profile-output.cjs +1141 -0
  163. package/get-shit-done/bin/lib/profile-pipeline.cjs +539 -0
  164. package/get-shit-done/bin/lib/project-root.cjs +112 -0
  165. package/get-shit-done/bin/lib/prompt-budget.cjs +399 -0
  166. package/get-shit-done/bin/lib/review-reviewer-selection.cjs +125 -0
  167. package/get-shit-done/bin/lib/roadmap-command-router.cjs +28 -0
  168. package/get-shit-done/bin/lib/roadmap.cjs +650 -0
  169. package/get-shit-done/bin/lib/runtime-artifact-layout.cjs +301 -0
  170. package/get-shit-done/bin/lib/runtime-homes.cjs +222 -0
  171. package/get-shit-done/bin/lib/runtime-name-policy.cjs +83 -0
  172. package/get-shit-done/bin/lib/runtime-slash.cjs +112 -0
  173. package/get-shit-done/bin/lib/schema-detect.cjs +165 -0
  174. package/get-shit-done/bin/lib/secrets.cjs +32 -0
  175. package/get-shit-done/bin/lib/security.cjs +600 -0
  176. package/get-shit-done/bin/lib/semver-compare.cjs +35 -0
  177. package/get-shit-done/bin/lib/shell-command-projection.cjs +500 -0
  178. package/get-shit-done/bin/lib/state-command-router.cjs +252 -0
  179. package/get-shit-done/bin/lib/state-document.cjs +263 -0
  180. package/get-shit-done/bin/lib/state.cjs +2038 -0
  181. package/get-shit-done/bin/lib/surface.cjs +470 -0
  182. package/get-shit-done/bin/lib/task-command-router.cjs +81 -0
  183. package/get-shit-done/bin/lib/template.cjs +228 -0
  184. package/get-shit-done/bin/lib/uat.cjs +289 -0
  185. package/get-shit-done/bin/lib/update-context.cjs +209 -0
  186. package/get-shit-done/bin/lib/validate-command-router.cjs +83 -0
  187. package/get-shit-done/bin/lib/validate.cjs +92 -0
  188. package/get-shit-done/bin/lib/verify-command-router.cjs +40 -0
  189. package/get-shit-done/bin/lib/verify.cjs +1511 -0
  190. package/get-shit-done/bin/lib/workstream-inventory-builder.cjs +74 -0
  191. package/get-shit-done/bin/lib/workstream-inventory.cjs +146 -0
  192. package/get-shit-done/bin/lib/workstream-name-policy.cjs +94 -0
  193. package/get-shit-done/bin/lib/workstream.cjs +389 -0
  194. package/get-shit-done/bin/lib/worktree-safety.cjs +985 -0
  195. package/get-shit-done/bin/shared/config-defaults.manifest.json +97 -0
  196. package/get-shit-done/bin/shared/config-schema.manifest.json +175 -0
  197. package/get-shit-done/bin/shared/model-catalog.json +122 -0
  198. package/get-shit-done/bin/shared/runtime-aliases.manifest.json +75 -0
  199. package/get-shit-done/bin/verify-reapply-patches.cjs +352 -0
  200. package/get-shit-done/contexts/dev.md +21 -0
  201. package/get-shit-done/contexts/research.md +22 -0
  202. package/get-shit-done/contexts/review.md +23 -0
  203. package/get-shit-done/references/agent-contracts.md +79 -0
  204. package/get-shit-done/references/ai-evals.md +156 -0
  205. package/get-shit-done/references/ai-frameworks.md +186 -0
  206. package/get-shit-done/references/artifact-types.md +131 -0
  207. package/get-shit-done/references/autonomous-smart-discuss.md +277 -0
  208. package/get-shit-done/references/checkpoints.md +814 -0
  209. package/get-shit-done/references/common-bug-patterns.md +114 -0
  210. package/get-shit-done/references/context-budget.md +85 -0
  211. package/get-shit-done/references/continuation-format.md +253 -0
  212. package/get-shit-done/references/debugger-philosophy.md +76 -0
  213. package/get-shit-done/references/decimal-phase-calculation.md +64 -0
  214. package/get-shit-done/references/doc-conflict-engine.md +91 -0
  215. package/get-shit-done/references/domain-probes.md +125 -0
  216. package/get-shit-done/references/execute-mvp-tdd.md +81 -0
  217. package/get-shit-done/references/executor-examples.md +110 -0
  218. package/get-shit-done/references/few-shot-examples/plan-checker.md +73 -0
  219. package/get-shit-done/references/few-shot-examples/verifier.md +109 -0
  220. package/get-shit-done/references/gate-prompts.md +100 -0
  221. package/get-shit-done/references/gates.md +70 -0
  222. package/get-shit-done/references/git-integration.md +298 -0
  223. package/get-shit-done/references/git-planning-commit.md +40 -0
  224. package/get-shit-done/references/ios-scaffold.md +123 -0
  225. package/get-shit-done/references/mandatory-initial-read.md +2 -0
  226. package/get-shit-done/references/model-profile-resolution.md +38 -0
  227. package/get-shit-done/references/model-profiles.md +245 -0
  228. package/get-shit-done/references/mvp-concepts.md +49 -0
  229. package/get-shit-done/references/phase-argument-parsing.md +61 -0
  230. package/get-shit-done/references/planner-antipatterns.md +89 -0
  231. package/get-shit-done/references/planner-chunked.md +49 -0
  232. package/get-shit-done/references/planner-gap-closure.md +62 -0
  233. package/get-shit-done/references/planner-graphify-auto-update.md +67 -0
  234. package/get-shit-done/references/planner-human-verify-mode.md +57 -0
  235. package/get-shit-done/references/planner-interface-context.md +62 -0
  236. package/get-shit-done/references/planner-mvp-mode.md +53 -0
  237. package/get-shit-done/references/planner-reviews.md +39 -0
  238. package/get-shit-done/references/planner-revision.md +87 -0
  239. package/get-shit-done/references/planner-source-audit.md +73 -0
  240. package/get-shit-done/references/planning-config.md +471 -0
  241. package/get-shit-done/references/project-skills-discovery.md +19 -0
  242. package/get-shit-done/references/questioning.md +162 -0
  243. package/get-shit-done/references/revision-loop.md +97 -0
  244. package/get-shit-done/references/scout-codebase.md +51 -0
  245. package/get-shit-done/references/skeleton-template.md +48 -0
  246. package/get-shit-done/references/sketch-interactivity.md +41 -0
  247. package/get-shit-done/references/sketch-theme-system.md +94 -0
  248. package/get-shit-done/references/sketch-tooling.md +45 -0
  249. package/get-shit-done/references/sketch-variant-patterns.md +81 -0
  250. package/get-shit-done/references/spidr-splitting.md +69 -0
  251. package/get-shit-done/references/tdd.md +330 -0
  252. package/get-shit-done/references/thinking-models-debug.md +44 -0
  253. package/get-shit-done/references/thinking-models-execution.md +50 -0
  254. package/get-shit-done/references/thinking-models-planning.md +62 -0
  255. package/get-shit-done/references/thinking-models-research.md +50 -0
  256. package/get-shit-done/references/thinking-models-verification.md +55 -0
  257. package/get-shit-done/references/thinking-partner.md +96 -0
  258. package/get-shit-done/references/ui-brand.md +160 -0
  259. package/get-shit-done/references/universal-anti-patterns.md +63 -0
  260. package/get-shit-done/references/user-profiling.md +681 -0
  261. package/get-shit-done/references/user-story-template.md +58 -0
  262. package/get-shit-done/references/verification-overrides.md +227 -0
  263. package/get-shit-done/references/verification-patterns.md +612 -0
  264. package/get-shit-done/references/verify-mvp-mode.md +85 -0
  265. package/get-shit-done/references/workstream-flag.md +111 -0
  266. package/get-shit-done/references/worktree-path-safety.md +89 -0
  267. package/get-shit-done/templates/AI-SPEC.md +246 -0
  268. package/get-shit-done/templates/DEBUG.md +169 -0
  269. package/get-shit-done/templates/README.md +77 -0
  270. package/get-shit-done/templates/SECURITY.md +61 -0
  271. package/get-shit-done/templates/UAT.md +265 -0
  272. package/get-shit-done/templates/UI-SPEC.md +100 -0
  273. package/get-shit-done/templates/VALIDATION.md +76 -0
  274. package/get-shit-done/templates/claude-md.md +145 -0
  275. package/get-shit-done/templates/codebase/architecture.md +255 -0
  276. package/get-shit-done/templates/codebase/concerns.md +310 -0
  277. package/get-shit-done/templates/codebase/conventions.md +307 -0
  278. package/get-shit-done/templates/codebase/integrations.md +280 -0
  279. package/get-shit-done/templates/codebase/stack.md +186 -0
  280. package/get-shit-done/templates/codebase/structure.md +285 -0
  281. package/get-shit-done/templates/codebase/testing.md +480 -0
  282. package/get-shit-done/templates/config.json +62 -0
  283. package/get-shit-done/templates/context.md +352 -0
  284. package/get-shit-done/templates/continue-here.md +78 -0
  285. package/get-shit-done/templates/copilot-instructions.md +7 -0
  286. package/get-shit-done/templates/debug-subagent-prompt.md +91 -0
  287. package/get-shit-done/templates/dev-preferences.md +21 -0
  288. package/get-shit-done/templates/discovery.md +146 -0
  289. package/get-shit-done/templates/discussion-log.md +63 -0
  290. package/get-shit-done/templates/milestone-archive.md +123 -0
  291. package/get-shit-done/templates/milestone.md +115 -0
  292. package/get-shit-done/templates/phase-prompt.md +610 -0
  293. package/get-shit-done/templates/planner-subagent-prompt.md +117 -0
  294. package/get-shit-done/templates/project.md +186 -0
  295. package/get-shit-done/templates/requirements.md +231 -0
  296. package/get-shit-done/templates/research-project/ARCHITECTURE.md +204 -0
  297. package/get-shit-done/templates/research-project/FEATURES.md +147 -0
  298. package/get-shit-done/templates/research-project/PITFALLS.md +200 -0
  299. package/get-shit-done/templates/research-project/STACK.md +120 -0
  300. package/get-shit-done/templates/research-project/SUMMARY.md +170 -0
  301. package/get-shit-done/templates/research.md +592 -0
  302. package/get-shit-done/templates/retrospective.md +54 -0
  303. package/get-shit-done/templates/roadmap.md +202 -0
  304. package/get-shit-done/templates/spec.md +307 -0
  305. package/get-shit-done/templates/state.md +195 -0
  306. package/get-shit-done/templates/summary-complex.md +59 -0
  307. package/get-shit-done/templates/summary-minimal.md +41 -0
  308. package/get-shit-done/templates/summary-standard.md +48 -0
  309. package/get-shit-done/templates/summary.md +248 -0
  310. package/get-shit-done/templates/user-profile.md +146 -0
  311. package/get-shit-done/templates/user-setup.md +311 -0
  312. package/get-shit-done/templates/verification-report.md +322 -0
  313. package/get-shit-done/workflows/_runtime-launcher.snippet.sh +1 -0
  314. package/get-shit-done/workflows/add-backlog.md +91 -0
  315. package/get-shit-done/workflows/add-phase.md +113 -0
  316. package/get-shit-done/workflows/add-tests.md +355 -0
  317. package/get-shit-done/workflows/add-todo.md +161 -0
  318. package/get-shit-done/workflows/ai-integration-phase.md +295 -0
  319. package/get-shit-done/workflows/analyze-dependencies.md +96 -0
  320. package/get-shit-done/workflows/audit-fix.md +178 -0
  321. package/get-shit-done/workflows/audit-milestone.md +358 -0
  322. package/get-shit-done/workflows/audit-uat.md +110 -0
  323. package/get-shit-done/workflows/autonomous.md +795 -0
  324. package/get-shit-done/workflows/check-todos.md +180 -0
  325. package/get-shit-done/workflows/cleanup.md +155 -0
  326. package/get-shit-done/workflows/code-review-fix.md +502 -0
  327. package/get-shit-done/workflows/code-review.md +656 -0
  328. package/get-shit-done/workflows/complete-milestone.md +855 -0
  329. package/get-shit-done/workflows/debug.md +232 -0
  330. package/get-shit-done/workflows/diagnose-issues.md +241 -0
  331. package/get-shit-done/workflows/discovery-phase.md +291 -0
  332. package/get-shit-done/workflows/discuss-phase/modes/advisor.md +176 -0
  333. package/get-shit-done/workflows/discuss-phase/modes/all.md +28 -0
  334. package/get-shit-done/workflows/discuss-phase/modes/analyze.md +44 -0
  335. package/get-shit-done/workflows/discuss-phase/modes/auto.md +57 -0
  336. package/get-shit-done/workflows/discuss-phase/modes/batch.md +52 -0
  337. package/get-shit-done/workflows/discuss-phase/modes/chain.md +98 -0
  338. package/get-shit-done/workflows/discuss-phase/modes/default.md +141 -0
  339. package/get-shit-done/workflows/discuss-phase/modes/power.md +44 -0
  340. package/get-shit-done/workflows/discuss-phase/modes/text.md +55 -0
  341. package/get-shit-done/workflows/discuss-phase/templates/checkpoint.json +18 -0
  342. package/get-shit-done/workflows/discuss-phase/templates/context.md +136 -0
  343. package/get-shit-done/workflows/discuss-phase/templates/discussion-log.md +50 -0
  344. package/get-shit-done/workflows/discuss-phase-assumptions.md +675 -0
  345. package/get-shit-done/workflows/discuss-phase-power.md +291 -0
  346. package/get-shit-done/workflows/discuss-phase.md +499 -0
  347. package/get-shit-done/workflows/do.md +111 -0
  348. package/get-shit-done/workflows/docs-update.md +1162 -0
  349. package/get-shit-done/workflows/edit-phase.md +295 -0
  350. package/get-shit-done/workflows/eval-review.md +156 -0
  351. package/get-shit-done/workflows/execute-phase/steps/codebase-drift-gate.md +82 -0
  352. package/get-shit-done/workflows/execute-phase/steps/per-plan-worktree-gate.md +94 -0
  353. package/get-shit-done/workflows/execute-phase/steps/post-merge-gate.md +117 -0
  354. package/get-shit-done/workflows/execute-phase.md +1709 -0
  355. package/get-shit-done/workflows/execute-plan.md +526 -0
  356. package/get-shit-done/workflows/explore.md +144 -0
  357. package/get-shit-done/workflows/extract-learnings.md +243 -0
  358. package/get-shit-done/workflows/fast.md +124 -0
  359. package/get-shit-done/workflows/forensics.md +279 -0
  360. package/get-shit-done/workflows/graduation.md +196 -0
  361. package/get-shit-done/workflows/health.md +224 -0
  362. package/get-shit-done/workflows/help/modes/brief.md +22 -0
  363. package/get-shit-done/workflows/help/modes/default.md +50 -0
  364. package/get-shit-done/workflows/help/modes/full.md +784 -0
  365. package/get-shit-done/workflows/help/modes/topic.md +74 -0
  366. package/get-shit-done/workflows/help.md +24 -0
  367. package/get-shit-done/workflows/import.md +254 -0
  368. package/get-shit-done/workflows/inbox.md +387 -0
  369. package/get-shit-done/workflows/ingest-docs.md +339 -0
  370. package/get-shit-done/workflows/insert-phase.md +152 -0
  371. package/get-shit-done/workflows/list-phase-assumptions.md +178 -0
  372. package/get-shit-done/workflows/list-workspaces.md +57 -0
  373. package/get-shit-done/workflows/manager.md +393 -0
  374. package/get-shit-done/workflows/map-codebase.md +444 -0
  375. package/get-shit-done/workflows/milestone-summary.md +224 -0
  376. package/get-shit-done/workflows/mvp-phase.md +222 -0
  377. package/get-shit-done/workflows/new-milestone.md +635 -0
  378. package/get-shit-done/workflows/new-project.md +1555 -0
  379. package/get-shit-done/workflows/new-workspace.md +240 -0
  380. package/get-shit-done/workflows/next.md +299 -0
  381. package/get-shit-done/workflows/node-repair.md +92 -0
  382. package/get-shit-done/workflows/note.md +158 -0
  383. package/get-shit-done/workflows/pause-work.md +244 -0
  384. package/get-shit-done/workflows/plan-milestone-gaps.md +281 -0
  385. package/get-shit-done/workflows/plan-phase.md +1809 -0
  386. package/get-shit-done/workflows/plan-review-convergence.md +346 -0
  387. package/get-shit-done/workflows/plant-seed.md +230 -0
  388. package/get-shit-done/workflows/pr-branch.md +157 -0
  389. package/get-shit-done/workflows/profile-user.md +453 -0
  390. package/get-shit-done/workflows/progress.md +699 -0
  391. package/get-shit-done/workflows/quick.md +1039 -0
  392. package/get-shit-done/workflows/reapply-patches.md +426 -0
  393. package/get-shit-done/workflows/remove-phase.md +156 -0
  394. package/get-shit-done/workflows/remove-workspace.md +108 -0
  395. package/get-shit-done/workflows/resume-project.md +332 -0
  396. package/get-shit-done/workflows/review.md +623 -0
  397. package/get-shit-done/workflows/scan.md +105 -0
  398. package/get-shit-done/workflows/secure-phase.md +180 -0
  399. package/get-shit-done/workflows/session-report.md +146 -0
  400. package/get-shit-done/workflows/settings-advanced.md +620 -0
  401. package/get-shit-done/workflows/settings-integrations.md +312 -0
  402. package/get-shit-done/workflows/settings.md +552 -0
  403. package/get-shit-done/workflows/ship.md +356 -0
  404. package/get-shit-done/workflows/sketch-wrap-up.md +286 -0
  405. package/get-shit-done/workflows/sketch.md +361 -0
  406. package/get-shit-done/workflows/spec-phase.md +262 -0
  407. package/get-shit-done/workflows/spike-wrap-up.md +307 -0
  408. package/get-shit-done/workflows/spike.md +453 -0
  409. package/get-shit-done/workflows/stats.md +80 -0
  410. package/get-shit-done/workflows/sync-skills.md +182 -0
  411. package/get-shit-done/workflows/thread.md +222 -0
  412. package/get-shit-done/workflows/transition.md +694 -0
  413. package/get-shit-done/workflows/ui-phase.md +328 -0
  414. package/get-shit-done/workflows/ui-review.md +193 -0
  415. package/get-shit-done/workflows/ultraplan-phase.md +199 -0
  416. package/get-shit-done/workflows/undo.md +314 -0
  417. package/get-shit-done/workflows/update.md +443 -0
  418. package/get-shit-done/workflows/validate-phase.md +179 -0
  419. package/get-shit-done/workflows/verify-phase.md +544 -0
  420. package/get-shit-done/workflows/verify-work.md +781 -0
  421. package/hooks/dist/gsd-check-update-worker.js +95 -0
  422. package/hooks/dist/gsd-check-update.js +64 -0
  423. package/hooks/dist/gsd-context-monitor.js +195 -0
  424. package/hooks/dist/gsd-graphify-update.sh +158 -0
  425. package/hooks/dist/gsd-phase-boundary.sh +47 -0
  426. package/hooks/dist/gsd-prompt-guard.js +97 -0
  427. package/hooks/dist/gsd-read-guard.js +101 -0
  428. package/hooks/dist/gsd-read-injection-scanner.js +203 -0
  429. package/hooks/dist/gsd-session-state.sh +59 -0
  430. package/hooks/dist/gsd-statusline.js +548 -0
  431. package/hooks/dist/gsd-update-banner.js +134 -0
  432. package/hooks/dist/gsd-validate-commit.sh +57 -0
  433. package/hooks/dist/gsd-workflow-guard.js +166 -0
  434. package/hooks/dist/lib/git-cmd.js +150 -0
  435. package/hooks/dist/lib/gsd-graphify-rebuild.sh +65 -0
  436. package/hooks/gsd-check-update-worker.js +95 -0
  437. package/hooks/gsd-check-update.js +64 -0
  438. package/hooks/gsd-context-monitor.js +195 -0
  439. package/hooks/gsd-graphify-update.sh +158 -0
  440. package/hooks/gsd-phase-boundary.sh +47 -0
  441. package/hooks/gsd-prompt-guard.js +97 -0
  442. package/hooks/gsd-read-guard.js +101 -0
  443. package/hooks/gsd-read-injection-scanner.js +203 -0
  444. package/hooks/gsd-session-state.sh +59 -0
  445. package/hooks/gsd-statusline.js +548 -0
  446. package/hooks/gsd-update-banner.js +134 -0
  447. package/hooks/gsd-validate-commit.sh +57 -0
  448. package/hooks/gsd-workflow-guard.js +166 -0
  449. package/hooks/lib/git-cmd.js +150 -0
  450. package/hooks/lib/gsd-graphify-rebuild.sh +65 -0
  451. package/hooks/managed-hooks-registry.cjs +34 -0
  452. package/package.json +102 -0
  453. package/scripts/affected-tests-lib.cjs +541 -0
  454. package/scripts/audit-workflow-script-paths.cjs +73 -0
  455. package/scripts/base64-scan.sh +339 -0
  456. package/scripts/build-hooks.js +236 -0
  457. package/scripts/changeset/README.md +129 -0
  458. package/scripts/changeset/cli.cjs +392 -0
  459. package/scripts/changeset/github-release-notes.cjs +199 -0
  460. package/scripts/changeset/lint.cjs +110 -0
  461. package/scripts/changeset/new.cjs +137 -0
  462. package/scripts/changeset/parse.cjs +114 -0
  463. package/scripts/changeset/render.cjs +34 -0
  464. package/scripts/changeset/serialize.cjs +130 -0
  465. package/scripts/check-alias-drift.cjs +108 -0
  466. package/scripts/check-env.cjs +302 -0
  467. package/scripts/check-npm-integrity.cjs +209 -0
  468. package/scripts/ci-guard-runner.cjs +16 -0
  469. package/scripts/ci-prepare-test-scope.cjs +46 -0
  470. package/scripts/ci-rebase-check.cjs +85 -0
  471. package/scripts/ci-test-scope.cjs +302 -0
  472. package/scripts/command-contract-helpers.cjs +64 -0
  473. package/scripts/diff-touches-shipped-paths.cjs +147 -0
  474. package/scripts/fix-slash-commands.cjs +147 -0
  475. package/scripts/gen-inventory-manifest.cjs +109 -0
  476. package/scripts/generate-package-identity.cjs +104 -0
  477. package/scripts/lint-command-contract.cjs +108 -0
  478. package/scripts/lint-descriptions.cjs +83 -0
  479. package/scripts/lint-docs-required.cjs +222 -0
  480. package/scripts/lint-no-source-grep-extras.cjs +81 -0
  481. package/scripts/lint-no-source-grep.cjs +174 -0
  482. package/scripts/lint-package-identity-drift.cjs +141 -0
  483. package/scripts/lint-pr-check-project-dir.cjs +98 -0
  484. package/scripts/lint-shared-module-handsync.cjs +388 -0
  485. package/scripts/lint-shell-command-projection-drift.cjs +57 -0
  486. package/scripts/lint-skill-deps.cjs +180 -0
  487. package/scripts/lint-test-file-count.allowlist.json +36 -0
  488. package/scripts/lint-test-file-count.cjs +190 -0
  489. package/scripts/pr-template-policy.cjs +268 -0
  490. package/scripts/prompt-injection-scan.sh +203 -0
  491. package/scripts/release-tarball-smoke.cjs +627 -0
  492. package/scripts/run-affected-tests.cjs +6 -0
  493. package/scripts/run-cross-platform-tests.cjs +63 -0
  494. package/scripts/run-tests.cjs +282 -0
  495. package/scripts/secret-scan-lint.sh +231 -0
  496. package/scripts/secret-scan.sh +358 -0
  497. package/scripts/setup-branch-protection.sh +236 -0
  498. package/scripts/shared-module-handsync-allowlist.json +183 -0
  499. package/scripts/strip-prose-atrefs.cjs +106 -0
  500. package/scripts/sync-rulesets.sh +34 -0
  501. package/scripts/sync-runtime-launcher.cjs +402 -0
  502. package/scripts/test-failure-reasons.cjs +34 -0
  503. package/scripts/workflow-policy.cjs +450 -0
@@ -0,0 +1,544 @@
1
+ <purpose>
2
+ Verify phase goal achievement through goal-backward analysis. Check that the codebase delivers what the phase promised, not just that tasks completed.
3
+
4
+ Executed by a verification subagent spawned from execute-phase.md.
5
+ </purpose>
6
+
7
+ <core_principle>
8
+ **Task completion ≠ Goal achievement**
9
+
10
+ A task "create chat component" can be marked complete when the component is a placeholder. The task was done — but the goal "working chat interface" was not achieved.
11
+
12
+ Goal-backward verification:
13
+ 1. What must be TRUE for the goal to be achieved?
14
+ 2. What must EXIST for those truths to hold?
15
+ 3. What must be WIRED for those artifacts to function?
16
+ 4. What must TESTS PROVE for those truths to be evidenced?
17
+
18
+ Then verify each level against the actual codebase.
19
+ </core_principle>
20
+
21
+ <required_reading>
22
+ @~/.claude/get-shit-done/references/verification-patterns.md
23
+ @~/.claude/get-shit-done/templates/verification-report.md
24
+ </required_reading>
25
+
26
+ <process>
27
+
28
+ <step name="load_context" priority="first">
29
+ Load phase operation context:
30
+
31
+ ```bash
32
+ _GSD_SHIM_NAME="gsd-tools.cjs"; _GSD_RUNTIME_ROOT="${RUNTIME_DIR:-$(git rev-parse --show-toplevel 2>/dev/null || pwd)}"; GSD_TOOLS="${_GSD_RUNTIME_ROOT}/get-shit-done/bin/${_GSD_SHIM_NAME}"; if [ -f "$GSD_TOOLS" ]; then gsd_run() { node "$GSD_TOOLS" "$@"; }; elif [ -f "${_GSD_RUNTIME_ROOT}/.claude/get-shit-done/bin/${_GSD_SHIM_NAME}" ]; then GSD_TOOLS="${_GSD_RUNTIME_ROOT}/.claude/get-shit-done/bin/${_GSD_SHIM_NAME}"; gsd_run() { node "$GSD_TOOLS" "$@"; }; elif command -v gsd-tools >/dev/null 2>&1; then GSD_TOOLS="$(command -v gsd-tools)"; gsd_run() { "$GSD_TOOLS" "$@"; }; elif [ -f "$HOME/.claude/get-shit-done/bin/${_GSD_SHIM_NAME}" ]; then GSD_TOOLS="$HOME/.claude/get-shit-done/bin/${_GSD_SHIM_NAME}"; gsd_run() { node "$GSD_TOOLS" "$@"; }; else echo "ERROR: gsd-tools.cjs not found at $GSD_TOOLS and gsd-tools is not on PATH. Run: npx -y @opengsd/gsd-core@latest --claude --local" >&2; exit 1; fi
33
+ INIT=$(gsd_run query init.phase-op "${PHASE_ARG}")
34
+ if [[ "$INIT" == @file:* ]]; then INIT=$(cat "${INIT#@file:}"); fi
35
+ ```
36
+
37
+ Extract from init JSON: `phase_dir`, `phase_number`, `phase_name`, `has_plans`, `plan_count`.
38
+
39
+ Then load phase details and list plans/summaries:
40
+ ```bash
41
+ gsd_run query roadmap.get-phase "${phase_number}"
42
+ grep -E "^| ${phase_number}" .planning/REQUIREMENTS.md 2>/dev/null || true
43
+ ls "$phase_dir"/*-SUMMARY.md "$phase_dir"/*-PLAN.md 2>/dev/null || true
44
+ ```
45
+
46
+ Load full milestone phases for deferred-item filtering (Step 9b):
47
+ ```bash
48
+ gsd_run query roadmap.analyze
49
+ ```
50
+
51
+ Extract **phase goal** from ROADMAP.md (the outcome to verify, not tasks), **requirements** from REQUIREMENTS.md if it exists, and **all milestone phases** from roadmap analyze (for cross-referencing gaps against later phases).
52
+ </step>
53
+
54
+ <step name="establish_must_haves">
55
+ **Option A: Must-haves in PLAN frontmatter**
56
+
57
+ Use `gsd-tools.cjs query` verify handlers (or legacy gsd-tools) to extract must_haves from each PLAN:
58
+
59
+ ```bash
60
+ for plan in "$PHASE_DIR"/*-PLAN.md; do
61
+ MUST_HAVES=$(gsd_run query frontmatter.get "$plan" --field must_haves)
62
+ echo "=== $plan ===" && echo "$MUST_HAVES"
63
+ done
64
+ ```
65
+
66
+ Returns JSON: `{ truths: [...], artifacts: [...], key_links: [...] }`
67
+
68
+ Aggregate all must_haves across plans for phase-level verification.
69
+
70
+ **Option B: Use Success Criteria from ROADMAP.md**
71
+
72
+ If no must_haves in frontmatter (MUST_HAVES returns error or empty), check for Success Criteria:
73
+
74
+ ```bash
75
+ PHASE_DATA=$(gsd_run query roadmap.get-phase "${phase_number}" --raw)
76
+ ```
77
+
78
+ Parse the `success_criteria` array from the JSON output. If non-empty:
79
+ 1. Use each Success Criterion directly as a **truth** (they are already written as observable, testable behaviors)
80
+ 2. Derive **artifacts** (concrete file paths for each truth)
81
+ 3. Derive **key links** (critical wiring where stubs hide)
82
+ 4. Document the must-haves before proceeding
83
+
84
+ Success Criteria from ROADMAP.md are the contract — they override PLAN-level must_haves when both exist.
85
+
86
+ **Option C: Derive from phase goal (fallback)**
87
+
88
+ If no must_haves in frontmatter AND no Success Criteria in ROADMAP:
89
+ 1. State the goal from ROADMAP.md
90
+ 2. Derive **truths** (3-7 observable behaviors, each testable)
91
+ 3. Derive **artifacts** (concrete file paths for each truth)
92
+ 4. Derive **key links** (critical wiring where stubs hide)
93
+ 5. Document derived must-haves before proceeding
94
+ </step>
95
+
96
+ <step name="verify_truths">
97
+ For each observable truth, determine if the codebase enables it.
98
+
99
+ **Status:** ✓ VERIFIED (all supporting artifacts pass) | ✗ FAILED (artifact missing/stub/unwired) | ? UNCERTAIN (needs human)
100
+
101
+ For each truth: identify supporting artifacts → check artifact status → check wiring → determine truth status.
102
+
103
+ **Example:** Truth "User can see existing messages" depends on Chat.tsx (renders), /api/chat GET (provides), Message model (schema). If Chat.tsx is a stub or API returns hardcoded [] → FAILED. If all exist, are substantive, and connected → VERIFIED.
104
+ </step>
105
+
106
+ <step name="verify_artifacts">
107
+ Use `gsd-tools.cjs query verify.artifacts` (or legacy gsd-tools) for artifact verification against must_haves in each PLAN:
108
+
109
+ ```bash
110
+ for plan in "$PHASE_DIR"/*-PLAN.md; do
111
+ ARTIFACT_RESULT=$(gsd_run query verify.artifacts "$plan")
112
+ echo "=== $plan ===" && echo "$ARTIFACT_RESULT"
113
+ done
114
+ ```
115
+
116
+ Parse JSON result: `{ all_passed, passed, total, artifacts: [{path, exists, issues, passed}] }`
117
+
118
+ **Artifact status from result:**
119
+ - `exists=false` → MISSING
120
+ - `issues` not empty → STUB (check issues for "Only N lines" or "Missing pattern")
121
+ - `passed=true` → VERIFIED (Levels 1-2 pass)
122
+
123
+ **Level 3 — Wired (manual check for artifacts that pass Levels 1-2):**
124
+ ```bash
125
+ grep -r "import.*$artifact_name" src/ --include="*.ts" --include="*.tsx" # IMPORTED
126
+ grep -r "$artifact_name" src/ --include="*.ts" --include="*.tsx" | grep -v "import" # USED
127
+ ```
128
+ WIRED = imported AND used. ORPHANED = exists but not imported/used.
129
+
130
+ | Exists | Substantive | Wired | Status |
131
+ |--------|-------------|-------|--------|
132
+ | ✓ | ✓ | ✓ | ✓ VERIFIED |
133
+ | ✓ | ✓ | ✗ | ⚠️ ORPHANED |
134
+ | ✓ | ✗ | - | ✗ STUB |
135
+ | ✗ | - | - | ✗ MISSING |
136
+
137
+ **Export-level spot check (WARNING severity):**
138
+
139
+ For artifacts that pass Level 3, spot-check individual exports:
140
+ - Extract key exported symbols (functions, constants, classes — skip types/interfaces)
141
+ - For each, grep for usage outside the defining file
142
+ - Flag exports with zero external call sites as "exported but unused"
143
+
144
+ This catches dead stores like `setPlan()` that exist in a wired file but are
145
+ never actually called. Report as WARNING — may indicate incomplete cross-plan
146
+ wiring or leftover code from plan revisions.
147
+ </step>
148
+
149
+ <step name="verify_wiring">
150
+ Use `gsd-tools.cjs query verify.key-links` (or legacy gsd-tools) for key link verification against must_haves in each PLAN:
151
+
152
+ ```bash
153
+ for plan in "$PHASE_DIR"/*-PLAN.md; do
154
+ LINKS_RESULT=$(gsd_run query verify.key-links "$plan")
155
+ echo "=== $plan ===" && echo "$LINKS_RESULT"
156
+ done
157
+ ```
158
+
159
+ Parse JSON result: `{ all_verified, verified, total, links: [{from, to, via, verified, detail}] }`
160
+
161
+ **Link status from result:**
162
+ - `verified=true` → WIRED
163
+ - `verified=false` with "not found" → NOT_WIRED
164
+ - `verified=false` with "Pattern not found" → PARTIAL
165
+
166
+ **Fallback patterns (if key_links not in must_haves):**
167
+
168
+ | Pattern | Check | Status |
169
+ |---------|-------|--------|
170
+ | Component → API | fetch/axios call to API path, response used (await/.then/setState) | WIRED / PARTIAL (call but unused response) / NOT_WIRED |
171
+ | API → Database | Prisma/DB query on model, result returned via res.json() | WIRED / PARTIAL (query but not returned) / NOT_WIRED |
172
+ | Form → Handler | onSubmit with real implementation (fetch/axios/mutate/dispatch), not console.log/empty | WIRED / STUB (log-only/empty) / NOT_WIRED |
173
+ | State → Render | useState variable appears in JSX (`{stateVar}` or `{stateVar.property}`) | WIRED / NOT_WIRED |
174
+
175
+ Record status and evidence for each key link.
176
+ </step>
177
+
178
+ <step name="verify_requirements">
179
+ If REQUIREMENTS.md exists:
180
+ ```bash
181
+ grep -E "Phase ${PHASE_NUM}" .planning/REQUIREMENTS.md 2>/dev/null || true
182
+ ```
183
+
184
+ For each requirement: parse description → identify supporting truths/artifacts → status: ✓ SATISFIED / ✗ BLOCKED / ? NEEDS HUMAN.
185
+ </step>
186
+
187
+ <step name="verify_decisions">
188
+ **Decision coverage validation gate (issue #2492).**
189
+
190
+ After requirements coverage, also check that each trackable CONTEXT.md
191
+ `<decisions>` entry shows up somewhere in the shipped artifacts (plans,
192
+ SUMMARY.md, files modified by the phase, or recent commit subjects on the
193
+ phase branch).
194
+
195
+ This gate is **non-blocking / warning only** by deliberate asymmetry with
196
+ the plan-phase translation gate. The plan-phase gate already blocked at
197
+ translation time, so by the time verification runs every decision has
198
+ either been translated or explicitly deferred. This gate's job is to
199
+ surface decisions that *were* translated but vanished during execution —
200
+ that's a soft signal because "honors a decision" is a fuzzy substring
201
+ heuristic, and we don't want a paraphrase miss to fail an otherwise good
202
+ phase.
203
+
204
+ **Skip if** `workflow.context_coverage_gate` is explicitly set to `false`
205
+ (absent key = enabled). Also skip cleanly when CONTEXT.md is missing or has
206
+ no `<decisions>` block.
207
+
208
+ ```bash
209
+ GATE_CFG=$(gsd_run query config-get workflow.context_coverage_gate 2>/dev/null || echo "true")
210
+ if [ "$GATE_CFG" != "false" ]; then
211
+ # Discover the phase CONTEXT.md via glob expansion rather than `ls | head`
212
+ # (review F17 / ShellCheck SC2012). Globs preserve filenames containing
213
+ # spaces and avoid an extra subprocess.
214
+ CONTEXT_PATH=""
215
+ for f in "${PHASE_DIR}"/*-CONTEXT.md; do
216
+ [ -e "$f" ] && CONTEXT_PATH="$f" && break
217
+ done
218
+ DECISION_RESULT=$(gsd_run query check.decision-coverage-verify "${PHASE_DIR}" "${CONTEXT_PATH}")
219
+ fi
220
+ ```
221
+
222
+ The handler returns JSON `{ skipped, blocking: false, total, honored,
223
+ not_honored: [...], message }`.
224
+
225
+ **Reporting:** Append the handler's `message` (a `### Decision Coverage`
226
+ section) to VERIFICATION.md regardless of outcome — even when all
227
+ decisions are honored, recording the count helps reviewers spot drift over
228
+ time. Set `decision_coverage` in the verification result to
229
+ `{honored, total, not_honored: [...]}` so downstream tooling can read it.
230
+
231
+ **Status impact:** none. The decision gate does NOT influence the
232
+ `gaps_found` / `human_needed` / `passed` decision tree in
233
+ `determine_status`. Its findings are warnings the user reviews and may act
234
+ on by re-opening the phase or by acknowledging the decision was abandoned
235
+ intentionally.
236
+ </step>
237
+
238
+ <step name="behavioral_verification">
239
+ **Run the project's test suite and CLI commands to verify behavior, not just structure.**
240
+
241
+ Static checks (grep, file existence, wiring) catch structural gaps but miss runtime
242
+ failures. This step runs actual tests and project commands to verify the phase goal
243
+ is behaviorally achieved.
244
+
245
+ This follows Anthropic's harness engineering principle: separating generation from
246
+ evaluation, with the evaluator interacting with the running system rather than
247
+ inspecting static artifacts.
248
+
249
+ **Step 1: Run test suite**
250
+
251
+ ```bash
252
+ # Resolve test command: project config > Makefile > language sniff
253
+ TEST_CMD=$(gsd_run query config-get workflow.test_command --default "" 2>/dev/null || true)
254
+ if [ -z "$TEST_CMD" ]; then
255
+ if [ -f "Makefile" ] && grep -q "^test:" Makefile; then
256
+ TEST_CMD="make test"
257
+ elif [ -f "Justfile" ] || [ -f "justfile" ]; then
258
+ TEST_CMD="just test"
259
+ elif [ -f "package.json" ]; then
260
+ TEST_CMD="npm test"
261
+ elif [ -f "Cargo.toml" ]; then
262
+ TEST_CMD="cargo test"
263
+ elif [ -f "go.mod" ]; then
264
+ TEST_CMD="go test ./..."
265
+ elif [ -f "pyproject.toml" ] || [ -f "requirements.txt" ]; then
266
+ TEST_CMD="python -m pytest -q --tb=short 2>&1 || uv run python -m pytest -q --tb=short"
267
+ else
268
+ TEST_CMD="false"
269
+ echo "⚠ No test runner detected — skipping test suite"
270
+ fi
271
+ fi
272
+ # Detect test runner and run all tests (timeout: 5 minutes)
273
+ TEST_EXIT=0
274
+ timeout 300 bash -c "$TEST_CMD" 2>&1
275
+ TEST_EXIT=$?
276
+ if [ "${TEST_EXIT}" -eq 0 ]; then
277
+ echo "✓ Test suite passed"
278
+ elif [ "${TEST_EXIT}" -eq 124 ]; then
279
+ echo "⚠ Test suite timed out after 5 minutes"
280
+ else
281
+ echo "✗ Test suite failed (exit code ${TEST_EXIT})"
282
+ fi
283
+ ```
284
+
285
+ Record: total tests, passed, failed, coverage (if available).
286
+
287
+ **If any tests fail:** Mark as `behavioral_failures` — these are BLOCKER severity
288
+ regardless of whether static checks passed. A phase cannot be verified if tests fail.
289
+
290
+ **Step 2: Run project CLI/commands from success criteria (if testable)**
291
+
292
+ For each success criterion that describes a user command (e.g., "User can run
293
+ `mixtiq validate`", "User can run `npm start`"):
294
+
295
+ 1. Check if the command exists and required inputs are available:
296
+ - Look for example files in `templates/`, `fixtures/`, `test/`, `examples/`, or `testdata/`
297
+ - Check if the CLI binary/script exists on PATH or in the project
298
+ 2. **If no suitable inputs or fixtures exist:** Mark as `? NEEDS HUMAN` with reason
299
+ "No test fixtures available — requires manual verification" and move on.
300
+ Do NOT invent example inputs.
301
+ 3. If inputs are available: run the command and verify it exits successfully.
302
+
303
+ ```bash
304
+ # Only run if both command and input exist
305
+ if command -v {project_cli} &>/dev/null && [ -f "{example_input}" ]; then
306
+ {project_cli} {example_input} 2>&1
307
+ fi
308
+ ```
309
+
310
+ Record: command, exit code, output summary, pass/fail (or SKIPPED if no fixtures).
311
+
312
+ **Step 3: Report**
313
+
314
+ ```
315
+ ## Behavioral Verification
316
+
317
+ | Check | Result | Detail |
318
+ |-------|--------|--------|
319
+ | Test suite | {N} passed, {M} failed | {first failure if any} |
320
+ | {CLI command 1} | ✓ / ✗ | {output summary} |
321
+ | {CLI command 2} | ✓ / ✗ | {output summary} |
322
+ ```
323
+
324
+ **If all behavioral checks pass:** Continue to scan_antipatterns.
325
+ **If any fail:** Add to verification gaps with BLOCKER severity.
326
+ </step>
327
+
328
+ <step name="scan_antipatterns">
329
+ Extract files modified in this phase from SUMMARY.md, scan each:
330
+
331
+ | Pattern | Search | Severity |
332
+ |---------|--------|----------|
333
+ | TBD/FIXME/XXX without same-line `issue #123`, `PR #123`, `#123`, or `DEF-*` reference | `grep -n -e TBD -e FIXME -e XXX` | 🛑 Blocker |
334
+ | TODO/HACK | `grep -n -e TODO -e HACK` | ⚠️ Warning |
335
+ | Placeholder content | `grep -n -iE "placeholder\|coming soon\|will be here"` | 🛑 Blocker |
336
+ | Empty returns | `grep -n -E "return null\|return \{\}\|return \[\]\|=> \{\}"` | ⚠️ Warning |
337
+ | Log-only functions | Functions containing only console.log | ⚠️ Warning |
338
+
339
+ Categorize: 🛑 Blocker (prevents goal) | ⚠️ Warning (incomplete) | ℹ️ Info (notable).
340
+ </step>
341
+
342
+ <step name="audit_test_quality">
343
+ **Verify that tests PROVE what they claim to prove.**
344
+
345
+ This step catches test-level deceptions that pass all prior checks: files exist, are substantive, are wired, and tests pass — but the tests don't actually validate the requirement.
346
+
347
+ **1. Identify requirement-linked test files**
348
+
349
+ From PLAN and SUMMARY files, map each requirement to the test files that are supposed to prove it.
350
+
351
+ **2. Disabled test scan**
352
+
353
+ For ALL test files linked to requirements, search for disabled/skipped patterns:
354
+
355
+ ```bash
356
+ grep -rn -E "it\.skip|describe\.skip|test\.skip|xit\(|xdescribe\(|xtest\(|@pytest\.mark\.skip|@unittest\.skip|#\[ignore\]|\.pending|it\.todo|test\.todo" "$TEST_FILE"
357
+ ```
358
+
359
+ **Rule:** A disabled test linked to a requirement = requirement NOT tested.
360
+ - 🛑 BLOCKER if the disabled test is the only test proving that requirement
361
+ - ⚠️ WARNING if other active tests also cover the requirement
362
+
363
+ **3. Circular test detection**
364
+
365
+ Search for scripts/utilities that generate expected values by running the system under test:
366
+
367
+ ```bash
368
+ grep -rn -E "writeFileSync|writeFile|fs\.write|open\(.*w\)" "$TEST_DIRS"
369
+ ```
370
+
371
+ For each match, check if it also imports the system/service/module being tested. If a script both imports the system-under-test AND writes expected output values → CIRCULAR.
372
+
373
+ **Circular test indicators:**
374
+ - Script imports a service AND writes to fixture files
375
+ - Expected values have comments like "computed from engine", "captured from baseline"
376
+ - Script filename contains "capture", "baseline", "generate", "snapshot" in test context
377
+ - Expected values were added in the same commit as the test assertions
378
+
379
+ **Rule:** A test comparing system output against values generated by the same system is circular. It proves consistency, not correctness.
380
+
381
+ **4. Expected value provenance** (for comparison/parity/migration requirements)
382
+
383
+ When a requirement demands comparison with an external source ("identical to X", "matches Y", "same output as Z"):
384
+
385
+ - Is the external source actually invoked or referenced in the test pipeline?
386
+ - Do fixture files contain data sourced from the external system?
387
+ - Or do all expected values come from the new system itself or from mathematical formulas?
388
+
389
+ **Provenance classification:**
390
+ - VALID: Expected value from external/legacy system output, manual capture, or independent oracle
391
+ - PARTIAL: Expected value from mathematical derivation (proves formula, not system match)
392
+ - CIRCULAR: Expected value from the system being tested
393
+ - UNKNOWN: No provenance information — treat as SUSPECT
394
+
395
+ **5. Assertion strength**
396
+
397
+ For each test linked to a requirement, classify the strongest assertion:
398
+
399
+ | Level | Examples | Proves |
400
+ |-------|---------|--------|
401
+ | Existence | `toBeDefined()`, `!= null` | Something returned |
402
+ | Type | `typeof x === 'number'` | Correct shape |
403
+ | Status | `code === 200` | No error |
404
+ | Value | `toEqual(expected)`, `toBeCloseTo(x)` | Specific value |
405
+ | Behavioral | Multi-step workflow assertions | End-to-end correctness |
406
+
407
+ If a requirement demands value-level or behavioral-level proof and the test only has existence/type/status assertions → INSUFFICIENT.
408
+
409
+ **6. Coverage quantity**
410
+
411
+ If a requirement specifies a quantity of test cases (e.g., "30 calculations"), check if the actual number of active (non-skipped) test cases meets the requirement.
412
+
413
+ **Reporting — add to VERIFICATION.md:**
414
+
415
+ ```markdown
416
+ ### Test Quality Audit
417
+
418
+ | Test File | Linked Req | Active | Skipped | Circular | Assertion Level | Verdict |
419
+ |-----------|-----------|--------|---------|----------|----------------|---------|
420
+
421
+ **Disabled tests on requirements:** {N} → {BLOCKER if any req has ONLY disabled tests}
422
+ **Circular patterns detected:** {N} → {BLOCKER if any}
423
+ **Insufficient assertions:** {N} → {WARNING}
424
+ ```
425
+
426
+ **Impact on status:** Any BLOCKER from test quality audit ��� overall status = `gaps_found`, regardless of other checks passing.
427
+ </step>
428
+
429
+ <step name="identify_human_verification">
430
+ **First: determine if this is an infrastructure/foundation phase.**
431
+
432
+ Infrastructure and foundation phases — code foundations, database schema, internal APIs, data models, build tooling, CI/CD, internal service integrations — have no user-facing elements by definition. For these phases:
433
+
434
+ - Do NOT invent artificial manual steps (e.g., "manually run git commits", "manually invoke methods", "manually check database state").
435
+ - Mark human verification as **N/A** with rationale: "Infrastructure/foundation phase — no user-facing elements to test manually."
436
+ - Set `human_verification: []` and do **not** produce a `human_needed` status solely due to lack of user-facing features.
437
+ - Only add human verification items if the phase goal or success criteria explicitly describe something a user would interact with (UI, CLI command output visible to end users, external service UX).
438
+
439
+ **How to determine if a phase is infrastructure/foundation:**
440
+ - Phase goal or name contains: "foundation", "infrastructure", "schema", "database", "internal API", "data model", "scaffolding", "pipeline", "tooling", "CI", "migrations", "service layer", "backend", "core library"
441
+ - Phase success criteria describe only technical artifacts (files exist, tests pass, schema is valid) with no user interaction required
442
+ - There is no UI, CLI output visible to end users, or real-time behavior to observe
443
+
444
+ **If the phase IS infrastructure/foundation:** auto-pass UAT — skip the human verification items list entirely. Log:
445
+
446
+ ```markdown
447
+ ## Human Verification
448
+
449
+ N/A — Infrastructure/foundation phase with no user-facing elements.
450
+ All acceptance criteria are verifiable programmatically.
451
+ ```
452
+
453
+ **If the phase IS user-facing:** Only flag items that genuinely require a human. Do not invent steps.
454
+
455
+ **Always needs human (user-facing phases only):** Visual appearance, user flow completion, real-time behavior (WebSocket/SSE), external service integration, performance feel, error message clarity.
456
+
457
+ **Needs human if uncertain (user-facing phases only):** Complex wiring grep can't trace, dynamic state-dependent behavior, edge cases.
458
+
459
+ Format each as: Test Name → What to do → Expected result → Why can't verify programmatically.
460
+ </step>
461
+
462
+ <step name="determine_status">
463
+ Classify status using this decision tree IN ORDER (most restrictive first):
464
+
465
+ 1. IF any truth FAILED, artifact MISSING/STUB, key link NOT_WIRED, blocker found, **or test quality audit found blockers (disabled requirement tests, circular tests)**:
466
+ → **gaps_found**
467
+
468
+ 2. IF the previous step produced ANY human verification items:
469
+ → **human_needed** (even if all truths VERIFIED and score is N/N)
470
+
471
+ 3. IF all checks pass AND no human verification items:
472
+ → **passed**
473
+
474
+ **passed is ONLY valid when no human verification items exist.**
475
+
476
+ **Score:** `verified_truths / total_truths`
477
+ </step>
478
+
479
+ <step name="filter_deferred_items">
480
+ Before reporting gaps, cross-reference each gap against later phases in the milestone using the full roadmap data loaded in load_context (from `roadmap analyze`).
481
+
482
+ For each potential gap identified in determine_status:
483
+ 1. Check if the gap's failed truth or missing item is covered by a later phase's goal or success criteria
484
+ 2. **Match criteria:** The gap's concern appears in a later phase's goal text, success criteria text, or the later phase's name clearly suggests it covers this area
485
+ 3. If a clear match is found → move the gap to a `deferred` list with the matching phase reference and evidence text
486
+ 4. If no match in any later phase → keep as a real `gap`
487
+
488
+ **Important:** Be conservative. Only defer a gap when there is clear, specific evidence in a later phase. Vague or tangential matches should NOT cause deferral — when in doubt, keep it as a real gap.
489
+
490
+ **Deferred items do NOT affect the status determination.** Recalculate after filtering:
491
+ - If gaps list is now empty and no human items exist → `passed`
492
+ - If gaps list is now empty but human items exist → `human_needed`
493
+ - If gaps list still has items → `gaps_found`
494
+
495
+ Include deferred items in VERIFICATION.md frontmatter (`deferred:` section) and body (Deferred Items table) for transparency. If no deferred items exist, omit these sections.
496
+ </step>
497
+
498
+ <step name="generate_fix_plans">
499
+ If gaps_found:
500
+
501
+ 1. **Cluster related gaps:** API stub + component unwired → "Wire frontend to backend". Multiple missing → "Complete core implementation". Wiring only → "Connect existing components".
502
+
503
+ 2. **Generate plan per cluster:** Objective, 2-3 tasks (files/action/verify each), re-verify step. Keep focused: single concern per plan.
504
+
505
+ 3. **Order by dependency:** Fix missing → fix stubs → fix wiring → **fix test evidence** → verify.
506
+ </step>
507
+
508
+ <step name="create_report">
509
+ ```bash
510
+ REPORT_PATH="$PHASE_DIR/${PHASE_NUM}-VERIFICATION.md"
511
+ ```
512
+
513
+ Fill template sections: frontmatter (phase/timestamp/status/score), goal achievement, artifact table, wiring table, requirements coverage, anti-patterns, human verification, gaps summary, fix plans (if gaps_found), metadata.
514
+
515
+ See ~/.claude/get-shit-done/templates/verification-report.md for complete template.
516
+ </step>
517
+
518
+ <step name="return_to_orchestrator">
519
+ Return status (`passed` | `gaps_found` | `human_needed`), score (N/M must-haves), report path.
520
+
521
+ If gaps_found: list gaps + recommended fix plan names.
522
+ If human_needed: list items requiring human testing.
523
+
524
+ Orchestrator routes: `passed` → update_roadmap | `gaps_found` → create/execute fixes, re-verify | `human_needed` → present to user.
525
+ </step>
526
+
527
+ </process>
528
+
529
+ <success_criteria>
530
+ - [ ] Must-haves established (from frontmatter or derived)
531
+ - [ ] All truths verified with status and evidence
532
+ - [ ] All artifacts checked at all three levels
533
+ - [ ] All key links verified
534
+ - [ ] Requirements coverage assessed (if applicable)
535
+ - [ ] CONTEXT.md decisions checked against shipped artifacts (#2492 — non-blocking)
536
+ - [ ] Anti-patterns scanned and categorized
537
+ - [ ] Test quality audited (disabled tests, circular patterns, assertion strength, provenance)
538
+ - [ ] Human verification items identified
539
+ - [ ] Overall status determined
540
+ - [ ] Deferred items filtered against later milestone phases (if gaps found)
541
+ - [ ] Fix plans generated (if gaps_found after filtering)
542
+ - [ ] VERIFICATION.md created with complete report
543
+ - [ ] Results returned to orchestrator
544
+ </success_criteria>