@williambeto/ai-workflow 1.19.1 → 2.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (397) hide show
  1. package/CHANGELOG.md +56 -838
  2. package/PUBLISH_MANIFEST.json +34 -0
  3. package/README.md +78 -148
  4. package/{packages/ai-workflow/bin → bin}/ai-workflow.js +0 -0
  5. package/dist-assets/AGENTS.md +27 -0
  6. package/dist-assets/agents/astra.md +63 -0
  7. package/dist-assets/agents/atlas.md +169 -0
  8. package/dist-assets/agents/nexus.md +42 -0
  9. package/dist-assets/agents/orion.md +44 -0
  10. package/dist-assets/agents/phoenix.md +42 -0
  11. package/dist-assets/agents/sage.md +54 -0
  12. package/dist-assets/commands/README.md +14 -0
  13. package/dist-assets/commands/atlas.md +12 -0
  14. package/dist-assets/commands/audit.md +10 -0
  15. package/dist-assets/commands/deploy.md +12 -0
  16. package/dist-assets/commands/discover.md +10 -0
  17. package/dist-assets/commands/implement.md +28 -0
  18. package/dist-assets/commands/optimize-tokens.md +10 -0
  19. package/dist-assets/commands/plan.md +10 -0
  20. package/dist-assets/commands/release.md +12 -0
  21. package/dist-assets/commands/run.md +26 -0
  22. package/dist-assets/commands/spec-create.md +10 -0
  23. package/dist-assets/commands/spec-implement.md +10 -0
  24. package/dist-assets/commands/spec-review.md +10 -0
  25. package/dist-assets/commands/update-memory.md +10 -0
  26. package/dist-assets/commands/validate.md +12 -0
  27. package/dist-assets/docs/INDEX.md +21 -0
  28. package/dist-assets/docs/QUICKSTART.md +23 -0
  29. package/dist-assets/docs/adr/ADR-0000.md +19 -0
  30. package/dist-assets/docs/adr/ADR-0001.md +45 -0
  31. package/dist-assets/docs/adr/ADR-0002.md +62 -0
  32. package/dist-assets/docs/adr/ADR-0003.md +60 -0
  33. package/dist-assets/docs/adr/ADR-0004.md +71 -0
  34. package/dist-assets/docs/adr/ADR-0005.md +22 -0
  35. package/dist-assets/docs/adr/ADR-0006.md +82 -0
  36. package/dist-assets/docs/adr/ADR-0007.md +78 -0
  37. package/dist-assets/docs/api-engine-reference.md +7 -0
  38. package/{docs → dist-assets/docs}/architecture-policy.md +1 -1
  39. package/dist-assets/docs/cli-reference.md +27 -0
  40. package/dist-assets/docs/compatibility/provider-usage.md +38 -0
  41. package/dist-assets/docs/compatibility/runtime-matrix.md +30 -0
  42. package/dist-assets/docs/consumer-onboarding.md +17 -0
  43. package/dist-assets/docs/contributing-guide.md +11 -0
  44. package/{docs → dist-assets/docs}/design-patterns-policy.md +2 -2
  45. package/dist-assets/docs/full-documentation.md +113 -0
  46. package/{docs → dist-assets/docs}/npm-consumer-quickstart.md +18 -46
  47. package/dist-assets/docs/opencode-readme.md +8 -0
  48. package/dist-assets/docs/policies/01-BRANCH_GATE.md +63 -0
  49. package/dist-assets/docs/policies/02-SDD_METHODOLOGY.md +95 -0
  50. package/dist-assets/docs/policies/03-QUALITY_GATE.md +22 -0
  51. package/dist-assets/docs/policies/05-AGENT_CONTRACT.md +7 -0
  52. package/dist-assets/docs/policies/06-FINAL_EVIDENCE_CONTRACT.md +31 -0
  53. package/dist-assets/docs/policies/07-RELEASE_GATE.md +47 -0
  54. package/dist-assets/docs/policies/08-PRODUCT_TRUTHFULNESS_AND_PROJECT_DOCS.md +18 -0
  55. package/dist-assets/docs/policies/09-SPEC_VISIBILITY_AND_PUBLICATION.md +28 -0
  56. package/dist-assets/docs/policies/10-BEHAVIORAL_CONTRACT_HARDENING.md +9 -0
  57. package/dist-assets/docs/policies/11-EXECUTABLE_DELEGATION_AND_TRUTHFULNESS.md +7 -0
  58. package/dist-assets/docs/policies/ORCHESTRATION_PROTOCOL.md +15 -0
  59. package/dist-assets/docs/policies/PROCEDURE_DELIVERY_ARTIFACTS.md +21 -0
  60. package/dist-assets/docs/policies/PROCEDURE_DOCUMENTATION_CHECKLIST.md +24 -0
  61. package/dist-assets/docs/policies/PROCEDURE_UI_CHECKLIST.md +54 -0
  62. package/dist-assets/docs/profiles/README.md +19 -0
  63. package/dist-assets/docs/profiles/backend-api.md +5 -0
  64. package/dist-assets/docs/profiles/documentation.md +3 -0
  65. package/dist-assets/docs/profiles/frontend-product.md +19 -0
  66. package/dist-assets/docs/profiles/frontend-utility.md +19 -0
  67. package/dist-assets/docs/profiles/refactor.md +3 -0
  68. package/dist-assets/docs/profiles/security-review.md +3 -0
  69. package/dist-assets/docs/references/frontend-quality/landing-page-quality-checklist.md +11 -0
  70. package/dist-assets/docs/references/frontend-quality/product-copy-truthfulness.md +7 -0
  71. package/dist-assets/docs/references/frontend-quality/quality-failure-examples.md +20 -0
  72. package/dist-assets/docs/references/frontend-quality/visual-composition-patterns.md +10 -0
  73. package/dist-assets/docs/specs/runtime-operational-contract.md +39 -0
  74. package/dist-assets/docs/troubleshooting-guide.md +21 -0
  75. package/dist-assets/docs/visual-validation-guide.md +76 -0
  76. package/dist-assets/examples/README.md +10 -0
  77. package/dist-assets/examples/autopilot-cycle/00-CONTEXT.md +3 -0
  78. package/dist-assets/examples/autopilot-cycle/01-requirement.md +16 -0
  79. package/dist-assets/examples/autopilot-cycle/02-gate-a-check.md +23 -0
  80. package/dist-assets/examples/autopilot-cycle/03-orion-planning.md +20 -0
  81. package/dist-assets/examples/autopilot-cycle/04-astra-implementation.md +17 -0
  82. package/dist-assets/examples/autopilot-cycle/05-sage-validation.md +15 -0
  83. package/dist-assets/examples/autopilot-cycle/06-phoenix-healing.md +12 -0
  84. package/dist-assets/examples/autopilot-cycle/07-orchestration-report.md +18 -0
  85. package/dist-assets/examples/backend-api/00-CONTEXT.md +12 -0
  86. package/dist-assets/examples/backend-api/01-requirement.md +19 -0
  87. package/dist-assets/examples/backend-api/02-functional-spec.md +20 -0
  88. package/dist-assets/examples/backend-api/03-technical-plan.md +15 -0
  89. package/dist-assets/examples/backend-api/04-pr-breakdown.md +10 -0
  90. package/dist-assets/examples/backend-api/05-execution-handoff.md +13 -0
  91. package/dist-assets/examples/backend-api/06-validation-report.md +11 -0
  92. package/dist-assets/examples/backend-api/07-orchestration-report.md +7 -0
  93. package/dist-assets/examples/blocked-scenarios/00-CONTEXT.md +9 -0
  94. package/dist-assets/examples/blocked-scenarios/01-branch-gate-block.md +12 -0
  95. package/dist-assets/examples/blocked-scenarios/02-quality-gate-block.md +13 -0
  96. package/dist-assets/examples/blocked-scenarios/03-scope-creep-block.md +13 -0
  97. package/dist-assets/examples/blocked-scenarios/04-unblock-resolution.md +9 -0
  98. package/dist-assets/examples/blocked-scenarios/05-orchestration-decision-log.md +11 -0
  99. package/dist-assets/examples/bugfix-critical/00-CONTEXT.md +12 -0
  100. package/dist-assets/examples/bugfix-critical/01-bug-report.md +11 -0
  101. package/dist-assets/examples/bugfix-critical/02-diagnosis-hypothesis.md +11 -0
  102. package/dist-assets/examples/bugfix-critical/03-technical-plan.md +12 -0
  103. package/dist-assets/examples/bugfix-critical/04-implementation-handoff.md +8 -0
  104. package/dist-assets/examples/bugfix-critical/05-validation-report.md +10 -0
  105. package/dist-assets/examples/bugfix-critical/06-orchestration-report.md +7 -0
  106. package/dist-assets/examples/cli-package/00-CONTEXT.md +9 -0
  107. package/dist-assets/examples/cli-package/01-requirement.md +14 -0
  108. package/dist-assets/examples/cli-package/02-technical-spec.md +16 -0
  109. package/dist-assets/examples/cli-package/03-technical-plan.md +12 -0
  110. package/dist-assets/examples/cli-package/04-pr-breakdown.md +9 -0
  111. package/dist-assets/examples/cli-package/05-release-report.md +15 -0
  112. package/dist-assets/examples/docs-only-repo/01-requirement.md +31 -0
  113. package/dist-assets/examples/docs-only-repo/02-functional-spec.md +25 -0
  114. package/dist-assets/examples/docs-only-repo/03-technical-plan.md +21 -0
  115. package/dist-assets/examples/docs-only-repo/04-pr-breakdown.md +13 -0
  116. package/dist-assets/examples/docs-only-repo/05-execution-handoff.md +17 -0
  117. package/dist-assets/examples/docs-only-repo/06-validation-report.md +16 -0
  118. package/dist-assets/examples/docs-only-repo/README.md +26 -0
  119. package/dist-assets/examples/full-stack-checkout/00-CONTEXT.md +9 -0
  120. package/dist-assets/examples/full-stack-checkout/01-requirement.md +12 -0
  121. package/dist-assets/examples/full-stack-checkout/02-functional-spec.md +15 -0
  122. package/dist-assets/examples/full-stack-checkout/03-technical-plan.md +15 -0
  123. package/dist-assets/examples/full-stack-checkout/04-pr-breakdown.md +8 -0
  124. package/dist-assets/examples/full-stack-checkout/05-execution-handoff.md +14 -0
  125. package/dist-assets/examples/full-stack-checkout/06-validation-report.md +12 -0
  126. package/dist-assets/examples/healing-cycle/00-CONTEXT.md +15 -0
  127. package/dist-assets/examples/healing-cycle/01-broken-implementation.md +10 -0
  128. package/dist-assets/examples/healing-cycle/02-sage-fails.md +14 -0
  129. package/dist-assets/examples/healing-cycle/03-phoenix-diagnosis.md +17 -0
  130. package/dist-assets/examples/healing-cycle/04-phoenix-fix.md +18 -0
  131. package/dist-assets/examples/healing-cycle/05-sage-revalidation.md +12 -0
  132. package/dist-assets/examples/healing-cycle/06-orchestration-log.md +14 -0
  133. package/dist-assets/examples/infra-deploy/00-CONTEXT.md +9 -0
  134. package/dist-assets/examples/infra-deploy/01-operational-goal.md +12 -0
  135. package/dist-assets/examples/infra-deploy/02-architecture-specs.md +15 -0
  136. package/dist-assets/examples/infra-deploy/03-implementation-plan.md +14 -0
  137. package/dist-assets/examples/infra-deploy/04-step-breakdown.md +9 -0
  138. package/dist-assets/examples/infra-deploy/05-execution-handoff.md +13 -0
  139. package/dist-assets/examples/infra-deploy/06-operational-report.md +11 -0
  140. package/dist-assets/examples/multi-pr-release/00-CONTEXT.md +9 -0
  141. package/dist-assets/examples/multi-pr-release/01-requirement.md +13 -0
  142. package/dist-assets/examples/multi-pr-release/02-strategic-plan.md +13 -0
  143. package/dist-assets/examples/multi-pr-release/03-pr-breakdown.md +14 -0
  144. package/dist-assets/examples/multi-pr-release/04-release-plan.md +12 -0
  145. package/dist-assets/examples/multi-pr-release/05-orchestration-report.md +7 -0
  146. package/dist-assets/examples/nuxt-dashboard/01-requirement.md +81 -0
  147. package/dist-assets/examples/nuxt-dashboard/02-functional-spec.md +88 -0
  148. package/dist-assets/examples/nuxt-dashboard/03-technical-plan.md +76 -0
  149. package/dist-assets/examples/nuxt-dashboard/04-pr-breakdown.md +219 -0
  150. package/dist-assets/examples/nuxt-dashboard/05-execution-handoff.md +88 -0
  151. package/dist-assets/examples/nuxt-dashboard/06-validation-report.md +56 -0
  152. package/dist-assets/examples/nuxt-dashboard/07-orchestration-report.md +79 -0
  153. package/dist-assets/examples/nuxt-dashboard/README.md +52 -0
  154. package/dist-assets/examples/react-dashboard/01-requirement.md +84 -0
  155. package/dist-assets/examples/react-dashboard/02-functional-spec.md +88 -0
  156. package/dist-assets/examples/react-dashboard/03-technical-plan.md +76 -0
  157. package/dist-assets/examples/react-dashboard/04-pr-breakdown.md +218 -0
  158. package/dist-assets/examples/react-dashboard/05-execution-handoff.md +13 -0
  159. package/dist-assets/examples/react-dashboard/06-validation-report.md +12 -0
  160. package/dist-assets/examples/react-dashboard/07-orchestration-report.md +7 -0
  161. package/dist-assets/examples/react-dashboard/README.md +70 -0
  162. package/dist-assets/examples/refactoring-service/00-CONTEXT.md +9 -0
  163. package/dist-assets/examples/refactoring-service/01-debt-report.md +12 -0
  164. package/dist-assets/examples/refactoring-service/02-behavior-spec.md +11 -0
  165. package/dist-assets/examples/refactoring-service/03-technical-plan.md +13 -0
  166. package/dist-assets/examples/refactoring-service/04-pr-breakdown.md +9 -0
  167. package/dist-assets/examples/refactoring-service/05-execution-handoff.md +14 -0
  168. package/dist-assets/examples/refactoring-service/06-stability-report.md +12 -0
  169. package/dist-assets/examples/sdd-cycle/00-CONTEXT.md +12 -0
  170. package/dist-assets/examples/sdd-cycle/01-raw-request.md +13 -0
  171. package/dist-assets/examples/sdd-cycle/02-spec-creation.md +18 -0
  172. package/dist-assets/examples/sdd-cycle/03-spec-review.md +12 -0
  173. package/dist-assets/examples/sdd-cycle/04-technical-plan.md +16 -0
  174. package/dist-assets/examples/sdd-cycle/05-pr-breakdown.md +9 -0
  175. package/dist-assets/examples/sdd-cycle/06-spec-implement.md +13 -0
  176. package/dist-assets/examples/sdd-cycle/07-validation-against-spec.md +13 -0
  177. package/dist-assets/examples/wordpress-theme/01-requirement.md +29 -0
  178. package/dist-assets/examples/wordpress-theme/02-functional-spec.md +22 -0
  179. package/dist-assets/examples/wordpress-theme/03-technical-plan.md +22 -0
  180. package/dist-assets/examples/wordpress-theme/04-pr-breakdown.md +14 -0
  181. package/dist-assets/examples/wordpress-theme/05-execution-handoff.md +17 -0
  182. package/dist-assets/examples/wordpress-theme/06-validation-report.md +16 -0
  183. package/dist-assets/examples/wordpress-theme/README.md +32 -0
  184. package/{harness → dist-assets/harness}/handoffs/HANDOFF.template.md +2 -2
  185. package/{harness → dist-assets/harness}/workflows/agent-evaluation-checklist.md +5 -5
  186. package/{harness → dist-assets/harness}/workflows/implement-review-validate.md +24 -0
  187. package/{harness → dist-assets/harness}/workflows/multi-agent-handoff.md +4 -4
  188. package/{harness → dist-assets/harness}/workflows/planner-executor-workflow.md +5 -5
  189. package/{harness → dist-assets/harness}/workflows/requirement-to-pr.md +1 -1
  190. package/dist-assets/runbooks/agent-delegation-workflow.md +50 -0
  191. package/dist-assets/runbooks/apply-starter-to-real-project.md +45 -0
  192. package/dist-assets/runbooks/commands-cheatsheet.md +44 -0
  193. package/dist-assets/runbooks/how-to-use-skills.md +44 -0
  194. package/dist-assets/runbooks/private-spec-publication-safety.md +35 -0
  195. package/{runbooks → dist-assets/runbooks}/spec-driven-development.md +3 -6
  196. package/dist-assets/runbooks/tutorial-walkthroughs.md +23 -0
  197. package/dist-assets/runbooks/use-linear-for-operational-planning.md +45 -0
  198. package/dist-assets/runbooks/use-napkin-project-memory.md +33 -0
  199. package/dist-assets/skills/architecture/SKILL.md +166 -0
  200. package/dist-assets/skills/backend-development/SKILL.md +166 -0
  201. package/dist-assets/skills/deployment/SKILL.md +166 -0
  202. package/dist-assets/skills/design-principles/SKILL.md +166 -0
  203. package/dist-assets/skills/documentation/SKILL.md +171 -0
  204. package/dist-assets/skills/frontend-development/SKILL.md +225 -0
  205. package/dist-assets/skills/full-stack-development/SKILL.md +166 -0
  206. package/dist-assets/skills/optimize-tokens/SKILL.md +166 -0
  207. package/dist-assets/skills/pr-workflow/SKILL.md +166 -0
  208. package/dist-assets/skills/product-discovery/SKILL.md +166 -0
  209. package/dist-assets/skills/product-planning/SKILL.md +166 -0
  210. package/dist-assets/skills/project-memory/SKILL.md +166 -0
  211. package/dist-assets/skills/prompt-engineer/SKILL.md +166 -0
  212. package/dist-assets/skills/qa-workflow/SKILL.md +186 -0
  213. package/dist-assets/skills/refactoring/SKILL.md +166 -0
  214. package/dist-assets/skills/release-workflow/SKILL.md +166 -0
  215. package/dist-assets/skills/spec-driven-development/SKILL.md +166 -0
  216. package/dist-assets/skills/technical-leadership/SKILL.md +166 -0
  217. package/dist-assets/skills/ui-ux-design/SKILL.md +202 -0
  218. package/dist-assets/templates/.geminiignore.template +8 -0
  219. package/dist-assets/templates/CLAUDE.md.template +20 -0
  220. package/dist-assets/templates/CODEX.md.template +20 -0
  221. package/dist-assets/templates/GEMINI.md.template +20 -0
  222. package/dist-assets/templates/HANDOFF.template.md +45 -0
  223. package/dist-assets/templates/SPEC.template.md +38 -0
  224. package/dist-assets/templates/change-proposal.template.md +14 -0
  225. package/dist-assets/templates/owner-evidence/astra-implementation.json +10 -0
  226. package/dist-assets/templates/owner-evidence/phoenix-remediation.json +8 -0
  227. package/dist-assets/templates/owner-evidence/sage-revalidation.json +8 -0
  228. package/dist-assets/templates/owner-evidence/sage-validation.json +8 -0
  229. package/dist-assets/templates/specs/deep.md +48 -0
  230. package/dist-assets/templates/specs/standard.md +38 -0
  231. package/dist-assets/templates/specs/tiny.md +19 -0
  232. package/package.json +43 -47
  233. package/src/adapters/index.js +3 -0
  234. package/src/adapters/platforms/claude.js +126 -0
  235. package/src/adapters/platforms/codex.js +100 -0
  236. package/src/adapters/platforms/gemini.js +232 -0
  237. package/src/cli.js +114 -0
  238. package/src/commands/collect-evidence.js +61 -0
  239. package/src/commands/doctor.js +186 -0
  240. package/src/commands/execute.js +172 -0
  241. package/{packages/ai-workflow/src → src}/commands/init.js +119 -20
  242. package/src/commands/run.js +112 -0
  243. package/src/core/completion-contract.js +35 -0
  244. package/src/core/execution-planner.js +59 -0
  245. package/src/core/gates/branch-gate.js +146 -0
  246. package/src/core/handoff/handoff-engine.js +104 -0
  247. package/src/core/healing/cli-remediation-executor.js +151 -0
  248. package/src/core/healing/healer-engine.js +179 -0
  249. package/src/core/identity.js +43 -0
  250. package/{packages/ai-workflow/src → src}/core/install-plan.js +3 -3
  251. package/src/core/opencode-merge.js +149 -0
  252. package/{packages/ai-workflow/src → src}/core/package-assets.js +29 -10
  253. package/src/core/request-classifier.js +58 -0
  254. package/src/core/runtime/opencode-adapter.js +94 -0
  255. package/src/core/sdd/validator.js +67 -0
  256. package/src/core/statuses.js +29 -0
  257. package/src/core/symlink-layout.js +93 -0
  258. package/src/core/templates.js +221 -0
  259. package/src/core/validation/canonical-finalization.js +43 -0
  260. package/src/core/validation/evidence-collector.js +109 -0
  261. package/src/core/validation/quality-guard.js +243 -0
  262. package/src/core/workflow-profiles.js +107 -0
  263. package/src/core/workflow-state-machine.js +46 -0
  264. package/.agents/napkin.md +0 -89
  265. package/.agents/skills/backend-implementer/SKILL.md +0 -490
  266. package/.agents/skills/build-and-validate/SKILL.md +0 -442
  267. package/.agents/skills/deploy-engineer/SKILL.md +0 -541
  268. package/.agents/skills/docs-writer/SKILL.md +0 -430
  269. package/.agents/skills/frontend-implementer/SKILL.md +0 -488
  270. package/.agents/skills/interface-design/SKILL.md +0 -428
  271. package/.agents/skills/interface-design/references/critique.md +0 -67
  272. package/.agents/skills/interface-design/references/example.md +0 -86
  273. package/.agents/skills/interface-design/references/principles.md +0 -235
  274. package/.agents/skills/interface-design/references/validation.md +0 -48
  275. package/.agents/skills/minimal-context/SKILL.md +0 -177
  276. package/.agents/skills/napkin/SKILL.md +0 -84
  277. package/.agents/skills/opencode-agent-design/SKILL.md +0 -77
  278. package/.agents/skills/playwright-cli/SKILL.md +0 -62
  279. package/.agents/skills/pr-orchestrator/SKILL.md +0 -366
  280. package/.agents/skills/product-manager/SKILL.md +0 -519
  281. package/.agents/skills/seo-audit/SKILL.md +0 -176
  282. package/.agents/skills/stack-variant-creator/SKILL.md +0 -265
  283. package/.agents/skills/tech-lead/SKILL.md +0 -453
  284. package/.agents/skills/tester/SKILL.md +0 -399
  285. package/.agents/skills/token-economy/SKILL.md +0 -137
  286. package/.agents/skills/vue-nuxt/SKILL.md +0 -102
  287. package/.agents/skills/wordpress-engineer/SKILL.md +0 -75
  288. package/.codex/prompts/README.md +0 -44
  289. package/.codex/prompts/autopilot.md +0 -50
  290. package/.codex/prompts/deploy.md +0 -33
  291. package/.codex/prompts/execute-selected-pr.md +0 -35
  292. package/.codex/prompts/fix-issue.md +0 -34
  293. package/.codex/prompts/minimal-context-mode.md +0 -55
  294. package/.codex/prompts/orchestrate-next.md +0 -33
  295. package/.codex/prompts/plan-from-requirement.md +0 -37
  296. package/.codex/prompts/review-implementation.md +0 -33
  297. package/.codex/prompts/roadmap-audit.md +0 -22
  298. package/.codex/prompts/specs/create-spec-from-requirement.md +0 -26
  299. package/.codex/prompts/specs/review-spec.md +0 -29
  300. package/.codex/prompts/specs/spec-to-pr-breakdown.md +0 -23
  301. package/.codex/prompts/specs/spec-to-technical-plan.md +0 -28
  302. package/.codex/prompts/start-project.md +0 -29
  303. package/.codex/prompts/token-economy-mode.md +0 -48
  304. package/.codex/prompts/validate-work.md +0 -28
  305. package/checklists/change-spec-readiness-checklist.md +0 -34
  306. package/docs/full-documentation.md +0 -661
  307. package/docs/setup-codex-opencode.md +0 -313
  308. package/harness/README.md +0 -106
  309. package/opencode/README.md +0 -84
  310. package/opencode/agents/README.md +0 -113
  311. package/opencode/agents/atlas.md +0 -127
  312. package/opencode/agents/discovery.md +0 -61
  313. package/opencode/agents/fixer.md +0 -51
  314. package/opencode/agents/implementer.md +0 -61
  315. package/opencode/agents/orchestrator.md +0 -145
  316. package/opencode/agents/planner.md +0 -60
  317. package/opencode/agents/prompt-engineer.md +0 -50
  318. package/opencode/agents/release-manager.md +0 -50
  319. package/opencode/agents/reviewer.md +0 -51
  320. package/opencode/agents/spec-engineer.md +0 -85
  321. package/opencode/agents/validator.md +0 -50
  322. package/opencode/agents/wordpress-engineer.md +0 -49
  323. package/opencode/commands/README.md +0 -48
  324. package/opencode/commands/autopilot.md +0 -50
  325. package/opencode/commands/deploy.md +0 -35
  326. package/opencode/commands/execute.md +0 -47
  327. package/opencode/commands/orchestrate.md +0 -37
  328. package/opencode/commands/plan.md +0 -39
  329. package/opencode/commands/review.md +0 -33
  330. package/opencode/commands/roadmap-audit.md +0 -30
  331. package/opencode/commands/ship.md +0 -48
  332. package/opencode/commands/specs/create-spec-from-request.md +0 -27
  333. package/opencode/commands/specs/create-spec-from-requirement.md +0 -25
  334. package/opencode/commands/specs/review-spec.md +0 -26
  335. package/opencode/commands/specs/spec-to-pr-breakdown.md +0 -19
  336. package/opencode/commands/specs/spec-to-tasks.md +0 -26
  337. package/opencode/commands/specs/spec-to-technical-plan.md +0 -27
  338. package/opencode/commands/start.md +0 -45
  339. package/opencode/commands/token-economy.md +0 -29
  340. package/opencode/commands/validate.md +0 -33
  341. package/opencode.jsonc +0 -235
  342. package/packages/ai-workflow/README.md +0 -82
  343. package/packages/ai-workflow/src/cli.js +0 -70
  344. package/packages/ai-workflow/src/commands/codex.js +0 -37
  345. package/packages/ai-workflow/src/commands/doctor.js +0 -168
  346. package/packages/ai-workflow/src/commands/guide.js +0 -194
  347. package/packages/ai-workflow/src/core/opencode-merge.js +0 -172
  348. package/packages/ai-workflow/src/core/symlink-layout.js +0 -54
  349. package/packages/ai-workflow/src/core/templates.js +0 -276
  350. package/runbooks/agent-delegation-workflow.md +0 -111
  351. package/runbooks/apply-starter-to-real-project.md +0 -445
  352. package/runbooks/commands-cheatsheet.md +0 -71
  353. package/runbooks/how-to-use-skills.md +0 -713
  354. package/runbooks/quick-start-guide.md +0 -213
  355. package/runbooks/tutorial-walkthroughs.md +0 -416
  356. package/runbooks/use-linear-for-operational-planning.md +0 -185
  357. package/runbooks/use-napkin-project-memory.md +0 -77
  358. package/templates/AGENTS.template.md +0 -397
  359. package/templates/DESIGN.template.md +0 -484
  360. package/templates/PR-PLAN.template.md +0 -172
  361. package/templates/README.template.md +0 -293
  362. package/templates/REQUIREMENT.template.md +0 -165
  363. package/templates/SPEC.template.md +0 -397
  364. package/templates/TECH-PLAN.template.md +0 -244
  365. package/templates/change-proposal.template.md +0 -97
  366. /package/{checklists/spec-readiness-checklist.md → dist-assets/docs/policies/SPEC_READINESS.md} +0 -0
  367. /package/{prompts → dist-assets/prompts}/00-bootstrap-project.md +0 -0
  368. /package/{prompts → dist-assets/prompts}/01-create-requirement.md +0 -0
  369. /package/{prompts → dist-assets/prompts}/02-create-spec.md +0 -0
  370. /package/{prompts → dist-assets/prompts}/03-create-tech-plan.md +0 -0
  371. /package/{prompts → dist-assets/prompts}/04-breakdown-prs.md +0 -0
  372. /package/{prompts → dist-assets/prompts}/05-implement-pr.md +0 -0
  373. /package/{prompts → dist-assets/prompts}/06-review-and-fix.md +0 -0
  374. /package/{prompts → dist-assets/prompts}/07-apply-design.md +0 -0
  375. /package/{prompts → dist-assets/prompts}/08-validate.md +0 -0
  376. /package/{prompts → dist-assets/prompts}/09-deploy.md +0 -0
  377. /package/{prompts → dist-assets/prompts}/commands/implement.md +0 -0
  378. /package/{prompts → dist-assets/prompts}/commands/requirement.md +0 -0
  379. /package/{prompts → dist-assets/prompts}/commands/spec.md +0 -0
  380. /package/{prompts → dist-assets/prompts}/commands/tech-plan.md +0 -0
  381. /package/{prompts → dist-assets/prompts}/commands/validate.md +0 -0
  382. /package/{runbooks → dist-assets/runbooks}/branch-cleanup.md +0 -0
  383. /package/{runbooks → dist-assets/runbooks}/deploy-checklist.md +0 -0
  384. /package/{runbooks → dist-assets/runbooks}/publication-readiness-checklist.md +0 -0
  385. /package/{runbooks → dist-assets/runbooks}/publish-package-checklist.md +0 -0
  386. /package/{runbooks → dist-assets/runbooks}/team-governance-pr-readiness.md +0 -0
  387. /package/{runbooks → dist-assets/runbooks}/validate-starter-in-real-project.md +0 -0
  388. /package/{runbooks → dist-assets/runbooks}/validation-checklist.md +0 -0
  389. /package/{schemas → dist-assets/schemas}/README.md +0 -0
  390. /package/{schemas → dist-assets/schemas}/functional-spec.schema.json +0 -0
  391. /package/{schemas → dist-assets/schemas}/handoff.schema.json +0 -0
  392. /package/{schemas → dist-assets/schemas}/pr-breakdown.schema.json +0 -0
  393. /package/{schemas → dist-assets/schemas}/requirement.schema.json +0 -0
  394. /package/{schemas → dist-assets/schemas}/technical-plan.schema.json +0 -0
  395. /package/{schemas → dist-assets/schemas}/validation-report.schema.json +0 -0
  396. /package/{packages/ai-workflow/src → src}/core/backup.js +0 -0
  397. /package/{packages/ai-workflow/src → src}/core/filesystem.js +0 -0
@@ -0,0 +1,34 @@
1
+ {
2
+ "name": "ai-workflow-publish-manifest",
3
+ "version": "1.0.0",
4
+ "layers": {
5
+ "core": [
6
+ "bin/**",
7
+ "src/**",
8
+ "package.json",
9
+ "opencode.jsonc"
10
+ ],
11
+ "assets": [
12
+ "dist-assets/skills/**",
13
+ "dist-assets/agents/**",
14
+ "dist-assets/commands/**",
15
+ "dist-assets/prompts/**",
16
+ "dist-assets/templates/**",
17
+ "dist-assets/schemas/**",
18
+ "dist-assets/runbooks/**",
19
+ "dist-assets/harness/**",
20
+ "dist-assets/docs/**",
21
+ "dist-assets/examples/**",
22
+ "dist-assets/AGENTS.md"
23
+ ]
24
+ },
25
+ "exclude": [
26
+ "internal/**",
27
+ "**/tests/**",
28
+ "**/evidence/**",
29
+ "**/*.original.md",
30
+ ".git/**",
31
+ ".ai-workflow/**",
32
+ "node_modules/**"
33
+ ]
34
+ }
package/README.md CHANGED
@@ -3,189 +3,119 @@
3
3
  [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)
4
4
  [![npm version](https://img.shields.io/npm/v/@williambeto/ai-workflow)](https://www.npmjs.com/package/@williambeto/ai-workflow)
5
5
 
6
- Most AI coding workflows fail because they start with code.
6
+ The AI Workflow Kit is an npm/CLI package that installs a suite of agents, commands, skills, policies, and templates to guide a coding agent inside a project.
7
7
 
8
- **AI Workflow Kit** is a software delivery workflow for Codex and OpenCode: requirements first, small PRs, specialist agents, validation evidence, and no-regression rules.
8
+ The central promise is:
9
9
 
10
- ## Why this exists
10
+ > "Transform a natural language request into a proportionate, safe, and verifiable software delivery."
11
11
 
12
- AI coding tools are powerful, but without workflow discipline they often create oversized changes, vague requirements, hidden assumptions, unreviewable diffs, skipped validation, architecture drift, and cleanup work disguised as speed.
12
+ The workflow the product guarantees is:
13
13
 
14
- This project gives AI agents a safer delivery path.
14
+ 1. **Request**: Understand the actual request.
15
+ 2. **Planning**: Select the execution mode and workflow profile.
16
+ 3. **Branch Gate**: Never implement directly on main/master.
17
+ 4. **Delegation**: Atlas routes; Astra implements; Sage validates when appropriate; Phoenix remediates within a defined limit.
18
+ 5. **Implementation**: Deliver useful code, not just reports.
19
+ 6. **Validation**: Run observed validations and relevant tests, blocking false success.
20
+ 7. **Evidence**: Summarize changes, validations, and limitations.
15
21
 
16
- ## What this is
22
+ In practice, the kit is neither a frontend framework nor a UI library. It is an operational layer for software agents: it installs instructions and guardrails so that the agent can work more effectively inside the user's repository.
17
23
 
18
- - A documentation-first workflow system for AI-assisted software delivery
19
- - Ordered prompts that guide agents from idea to deployment
20
- - Specialist skills for product, architecture, implementation, testing, docs, validation, and release
21
- - OpenCode agents, commands, and routing conventions
22
- - Codex prompt entrypoints
23
- - Templates for requirements, specs, technical plans, and PR plans
24
- - Runbooks for adoption, validation, governance, and release readiness
25
- - Stack variants for Nuxt, WordPress, docs-only, Node.js APIs, and Python APIs
24
+ ## Installation
26
25
 
27
- ## What this is not
28
-
29
- This is not an application boilerplate, framework starter, UI kit, design system, random prompt collection, or a replacement for technical judgment. It is the workflow layer before implementation.
30
-
31
- ## The workflow
32
-
33
- ```txt
34
- Idea
35
- → Requirement
36
- → Functional specification
37
- → Technical plan
38
- → PR breakdown
39
- → Implement one PR
40
- → Review and fix
41
- → Validate with evidence
42
- → Ship only when ready
43
- ```
44
-
45
- Ordered prompts live in `prompts/`.
46
-
47
- ## Spec-Driven Development default
48
-
49
- For non-trivial changes, start from a spec before implementation:
50
-
51
- ```txt
52
- Request
53
- → Spec draft
54
- → Spec review
55
- → Technical plan
56
- → PR breakdown
57
- → Implementation
58
- → Validation
59
- → Evidence report
60
- ```
61
-
62
- Use:
63
-
64
- - `templates/SPEC.template.md`
65
- - `checklists/spec-readiness-checklist.md`
66
- - `runbooks/spec-driven-development.md`
67
- - `runbooks/branch-cleanup.md`
68
-
69
- ## The anti-chaos rules
70
-
71
- 1. Do not start with code.
72
- 2. Do not implement without scope.
73
- 3. Do not change unrelated files.
74
- 4. Do not bundle multiple PRs.
75
- 5. Do not approve without evidence.
76
- 6. Do not deploy without rollback thinking.
77
- 7. Do not treat generated output as validated work.
78
-
79
- ## Before and after
80
-
81
- | Without this workflow | With AI Workflow Kit |
82
- | --- | --- |
83
- | "Build this feature" | Requirement + spec + PR plan |
84
- | One huge diff | Small reviewable PRs |
85
- | Hidden assumptions | Explicit assumptions and open questions |
86
- | Random agent behavior | Role-based specialist skills |
87
- | "Looks good" | Validation evidence |
88
- | Manual memory | Durable project memory through Napkin |
89
- | Tool-specific habits | Shared workflow assets |
90
-
91
- ## Quickstart
92
-
93
- ### Install in a project
26
+ ### Current public release
94
27
 
95
28
  ```bash
96
- npx @williambeto/ai-workflow init --yes
97
- npx @williambeto/ai-workflow doctor
29
+ npm install -g @williambeto/ai-workflow@latest
98
30
  ```
99
31
 
100
- Use `--dry-run` first in an existing project to preview changes:
32
+ Use `latest` for the supported public release. Release notes record exact historical versions when needed.
33
+
34
+ ## Quick start
101
35
 
102
36
  ```bash
103
- npx @williambeto/ai-workflow init --dry-run
37
+ mkdir workflow-test && cd workflow-test
38
+ npm init -y
39
+ ai-workflow init --yes
40
+ ai-workflow doctor
41
+ ai-workflow execute "Atlas, list all agents, subagents, skills, and commands in the project. Present them in a table with their name, description, and usage examples."
104
42
  ```
105
43
 
106
- ### Choose an install profile
107
-
108
- | Profile | Use when |
109
- | --- | --- |
110
- | `minimal` | You only need basic docs and Codex prompt placeholders. |
111
- | `operational` | You want the repeatable PR workflow with OpenCode `start` command. |
112
- | `full` | You want starter files for the full agent and skill catalog. |
113
-
114
- For a full walkthrough with Codex and OpenCode quickstart paths, validation checklist, and troubleshooting, see [`docs/npm-consumer-quickstart.md`](docs/npm-consumer-quickstart.md).
115
-
116
- ### 10-minute proof path
44
+ The CLI receives the natural request, classifies the intent, plans the workflow, switch/creates branch safely, delegates coding tasks to the OpenCode runtime adapter, validates changes, and runs bounded remediation if necessary before outputting the handoff summary.
117
45
 
118
- 1. Pick one small feature or documentation change in a real project.
119
- 2. Start with the generated `README.workflow.md` and local project rules.
120
- 3. Clarify the requirement using the workflow prompts.
121
- 4. Create a technical plan before editing files.
122
- 5. Split the work into PRs.
123
- 6. Implement only PR 1.
124
- 7. Review and validate before continuing.
46
+ ## Modes and profiles
125
47
 
126
- For guided walkthroughs, start with [`runbooks/quick-start-guide.md`](runbooks/quick-start-guide.md).
48
+ | Mode | Use for | Primary evidence |
49
+ | --- | --- | --- |
50
+ | `quick` | Small, focused, low-risk work | Concise summary with targeted validation |
51
+ | `standard` | Normal scoped feature or application work | Branch, changed areas, commands/results, limitations |
52
+ | `full` | Broad, risky, architectural, migration, security, or release work | Independent validation and persisted evidence |
127
53
 
128
- ### Contributor setup (for this repository)
54
+ Execution mode controls artifact depth; workflow profile controls domain guidance.
129
55
 
130
- ```bash
131
- git clone https://github.com/williambeto/ai-workflow.git
132
- cd ai-workflow
133
- npm install
134
- npm run validate
135
- ```
56
+ Examples:
136
57
 
137
- ## For OpenCode users
58
+ - `frontend-product` for landing pages and public product surfaces;
59
+ - `frontend-utility` for validators, search pages, forms, dashboards, and focused tools;
60
+ - `backend-api`, `refactor`, `documentation`, and `security-review` for their respective domains.
138
61
 
139
- OpenCode is the primary integrated experience. Run `opencode` in any project with installed workflow assets, then use `/start` for discovery or `/ship` for end-to-end delivery.
62
+ Profiles select relevant skills and objective checks. They do not force a fixed page structure, pricing, testimonials, or subjective visual style. See `.ai-workflow/docs/profiles/` after initialization.
140
63
 
141
- Key assets:
64
+ ## Consumer files
142
65
 
143
- - [`opencode/commands/`](opencode/commands/) command entrypoints (start, plan, execute, review, validate, orchestrate, ship, deploy)
144
- - [`opencode/agents/`](opencode/agents/) — role prompts (atlas, planner, implementer, reviewer, validator, release-manager, orchestrator)
145
- - [`opencode.jsonc`](opencode.jsonc) — project agent and command registry
66
+ Initialization creates the managed workflow under `.ai-workflow/` and integrates runtime-specific files where requested. Start with:
146
67
 
147
- See [`docs/setup-codex-opencode.md`](docs/setup-codex-opencode.md) for installation and first-run setup.
68
+ - `.ai-workflow/QUICKSTART.md`
69
+ - `.ai-workflow/docs/compatibility/runtime-matrix.md`
70
+ - `.ai-workflow/docs/compatibility/provider-usage.md`
71
+ - `.ai-workflow/AGENTS.md`
72
+ - `opencode.jsonc`
148
73
 
149
- ## For Codex users
74
+ Optional provider adapters can also generate `CLAUDE.md`, `GEMINI.md`, and Codex agent assets.
150
75
 
151
- Codex is fully supported through shared rules and prompt entrypoints. Use [`AGENTS.md`](AGENTS.md) as the main operational contract.
76
+ ## Documentation
152
77
 
153
- Key assets:
154
-
155
- - [`.codex/prompts/`](.codex/prompts/) — entrypoints for start-project, plan, execute, review, validate, orchestrate, fix, deploy
156
- - [`.codex/prompts/start-project.md`](.codex/prompts/start-project.md) — recommended starting point
78
+ | Topic | Canonical source |
79
+ | --- | --- |
80
+ | Getting started | [`docs/getting-started/quickstart.md`](docs/getting-started/quickstart.md) |
81
+ | Upgrade to v2 | [`docs/getting-started/upgrading-to-v2.md`](docs/getting-started/upgrading-to-v2.md) |
82
+ | Architecture | [`docs/internal/architecture.md`](docs/internal/architecture.md) |
83
+ | Runtime flow | [`docs/internal/runtime-flow.md`](docs/internal/runtime-flow.md) |
84
+ | Agents and ownership | [`docs/internal/agents-and-ownership.md`](docs/internal/agents-and-ownership.md) |
85
+ | Skills and loading | [`docs/internal/skills-and-loading.md`](docs/internal/skills-and-loading.md) |
86
+ | Workflow modes | [`docs/internal/workflow-modes.md`](docs/internal/workflow-modes.md) |
87
+ | Evidence and quality gates | [`docs/internal/evidence-and-quality-gates.md`](docs/internal/evidence-and-quality-gates.md) |
88
+ | Runtime compatibility | [`docs/compatibility/runtime-matrix.md`](docs/compatibility/runtime-matrix.md) |
89
+ | Provider usage | [`docs/compatibility/provider-usage.md`](docs/compatibility/provider-usage.md) |
90
+ | Known limitations | [`docs/internal/known-limitations.md`](docs/internal/known-limitations.md) |
91
+ | Publish readiness | [`docs/releases/PUBLISH_READINESS_CRITERIA.md`](docs/releases/PUBLISH_READINESS_CRITERIA.md) |
92
+ | v2 release notes | [`docs/releases/v2-release-notes.md`](docs/releases/v2-release-notes.md) |
93
+ | v2 release decision | [`docs/releases/v2-release-decision.md`](docs/releases/v2-release-decision.md) |
157
94
 
158
- See [`docs/setup-codex-opencode.md`](docs/setup-codex-opencode.md) for installation and setup.
95
+ ## Statuses
159
96
 
160
- ## Stability
97
+ - `PASS`
98
+ - `PASS_WITH_NOTES`
99
+ - `FAIL_QUALITY_GATE`
100
+ - `FAIL_DELEGATION_GATE`
101
+ - `BLOCKED`
161
102
 
162
- | Area | Status |
163
- | --- | --- |
164
- | Ordered prompts, runbooks, templates, schemas | Stable |
165
- | Codex prompt entrypoints | Stable |
166
- | OpenCode commands and agents | Preview |
167
- | `@williambeto/ai-workflow` CLI | Preview |
168
- | Stack variants and examples | Reference — adapt to the target project |
103
+ The generated consumer quickstart explains the public states `COMPLETED`, `COMPLETED_WITH_NOTES`, and `BLOCKED`.
169
104
 
170
- Suitable for developers and teams who want validation-first AI workflow assets and accept that the CLI, OpenCode integration, and stack variants are still evolving. Use GitHub issues for questions, bugs, and improvement proposals.
105
+ ## Repository maintenance
171
106
 
172
- ## Current limitations
107
+ The following paths are internal implementation sources for maintainers and are not consumer paths:
173
108
 
174
- - OpenCode has the most integrated experience; Codex requires more manual orchestration.
175
- - Stack variants reduce setup work but do not replace project-specific technical judgment.
176
- - Detailed reference content lives in [`docs/full-documentation.md`](docs/full-documentation.md).
109
+ - `dist-assets/**` files packaged and installed into consumers
110
+ - `src/**` CLI and runtime implementation
111
+ - `internal/**` validators, build, and release tooling
177
112
 
178
- See [`ROADMAP.md`](ROADMAP.md) for planned improvements.
113
+ Do not instruct package consumers to edit `dist-assets/**` directly.
179
114
 
180
- Before changing repository visibility or making a major public-facing release, use [`runbooks/publication-readiness-checklist.md`](runbooks/publication-readiness-checklist.md).
115
+ ## Compatibility
181
116
 
182
- ## Start now
117
+ OpenCode is the primary runtime. Codex and Claude Code are supported with documented limitations. Gemini CLI is experimental. Adapter generation does not guarantee identical runtime behavior.
183
118
 
184
- 1. Pick one small feature.
185
- 2. Create a requirement.
186
- 3. Generate a technical plan.
187
- 4. Split the work into PRs.
188
- 5. Implement only PR 1.
189
- 6. Validate before continuing.
119
+ ## License
190
120
 
191
- Start with [`runbooks/quick-start-guide.md`](runbooks/quick-start-guide.md) or [`docs/npm-consumer-quickstart.md`](docs/npm-consumer-quickstart.md).
121
+ [MIT](LICENSE) © José Willams.
File without changes
@@ -0,0 +1,27 @@
1
+ # AI Workflow Kit contributor contract
2
+
3
+ ## Purpose
4
+
5
+ Deliver useful software through proportionate workflow, minimum diff, observed validation, and concise evidence.
6
+
7
+ ## Non-negotiable safety
8
+
9
+ - Never implement on `main` or `master`.
10
+ - Preserve unrelated dirty work.
11
+ - Run existing relevant validation commands.
12
+ - Never report success after failed or unavailable required validation.
13
+ - Bound remediation and stop when no progress is made.
14
+
15
+ ## Routing & Delegation
16
+
17
+ Atlas routes. Astra implements substantial scoped work. Sage independently validates when risk or full mode justifies separation. Phoenix performs bounded remediation. Delegation must improve the result; do not create agent theatre or claim handoffs that did not occur.
18
+
19
+ ## Modes
20
+
21
+ - quick: small change, targeted validation, short summary;
22
+ - standard: scoped feature, relevant validation, optional delegation;
23
+ - full: formal planning, specialist implementation, independent validation, persisted evidence.
24
+
25
+ ## Evidence
26
+
27
+ Quick and standard work require only branch, changed areas, commands/results, and limitations in the handoff. Full/release/audit/security work may persist machine-generated evidence.
@@ -0,0 +1,63 @@
1
+ ---
2
+ name: astra
3
+ description: Implements scoped software changes with minimum diff and observed validation.
4
+ mode: primary
5
+ ---
6
+
7
+ # Astra
8
+
9
+ ## Role
10
+
11
+ Astra owns implementation when Atlas delegates substantial write-capable work.
12
+
13
+ ## Behavioral execution contract
14
+
15
+ Before editing, verify the branch. Never implement on `main` or `master`. Preserve unrelated work and use the smallest adequate design.
16
+
17
+ ## Responsibilities
18
+
19
+ - understand the scoped requirement;
20
+ - load only relevant skills and project references;
21
+ - implement a minimum, maintainable diff;
22
+ - preserve existing behavior unless change is required;
23
+ - run relevant tests, build, typecheck, or lint commands;
24
+ - report observed results and limitations truthfully.
25
+ - keep public claims about sources, integrations, data handling, and available capabilities consistent with the delivered code and configuration; qualify mocked or conceptual behavior clearly.
26
+
27
+ For frontend work, load `frontend-development` and `ui-ux-design` when they materially apply. Use the selected workflow profile as quality guidance, not as a page template.
28
+
29
+ ## Validation
30
+
31
+ Do not create workflow files or no-op scripts to simulate completion. Existing relevant validation scripts must run. If no meaningful validation exists, report `BLOCKED` rather than success.
32
+
33
+ ## Expected output
34
+
35
+ Return:
36
+
37
+ - files changed;
38
+ - important implementation decisions;
39
+ - commands executed and observed results;
40
+ - known limitations.
41
+
42
+ ## Quality failure examples
43
+
44
+ - editing on a protected branch;
45
+ - replacing tests with a checklist;
46
+ - creating evidence files that claim commands ran;
47
+ - adding unrelated UI sections or abstractions;
48
+ - reporting success after failed validation.
49
+
50
+ ## Boundaries
51
+
52
+ Astra does not approve release, reinterpret failed validation, or claim that Sage reviewed work unless that review occurred.
53
+
54
+ ## Stop conditions
55
+
56
+ Stop when branch recovery is unsafe, scope is materially ambiguous, required validation fails after bounded remediation, or unrelated work would be overwritten.
57
+
58
+ ## Canonical policies
59
+
60
+ - `01-BRANCH_GATE.md`
61
+ - `03-QUALITY_GATE.md`
62
+ - `06-FINAL_EVIDENCE_CONTRACT.md`
63
+ - `PROCEDURE_UI_CHECKLIST.md`
@@ -0,0 +1,169 @@
1
+ ---
2
+ name: atlas
3
+ description: Routes software requests through the safest proportionate workflow and keeps delivery focused on the user's outcome.
4
+ mode: primary
5
+ ---
6
+
7
+ # Atlas
8
+
9
+ ## Role
10
+
11
+ Atlas is the workflow router and coordinator. It chooses the smallest safe process that can deliver the requested result.
12
+
13
+ Atlas prioritizes useful software, minimal scope, observed validation, and a short final handoff. It does not optimize for producing workflow files.
14
+
15
+ ## Non-negotiable guardrails
16
+
17
+ For any write-capable task:
18
+
19
+ 1. Inspect `git status --short` and `git branch --show-current` before editing.
20
+ 2. Never write on `main`, `master`, or another protected branch.
21
+ 3. If the protected branch is clean or has only preservable untracked files, create a scoped branch before editing.
22
+ 4. If tracked changes make recovery unsafe, stop with `BLOCKED`.
23
+ 5. Run relevant validation after implementation.
24
+ 6. Never report success when a required command failed, was skipped without justification, or no meaningful validation was possible.
25
+ 7. Bounded remediation may retry recoverable findings, but unresolved material failures finish `BLOCKED`.
26
+
27
+ These rules are stronger than any request for speed.
28
+
29
+ ## Responsibilities
30
+
31
+ - classify requests and select the lowest safe mode;
32
+ - enforce protected-branch safety before writes;
33
+ - route to specialists only when useful;
34
+ - require observed validation and prevent false success;
35
+ - return a concise delivery handoff.
36
+
37
+ ## Request classification
38
+
39
+ Classify the request as one of:
40
+
41
+ - `readonly`: explanation, analysis, review, or planning without repository writes;
42
+ - `quick`: small, reversible, low-risk change;
43
+ - `standard`: normal scoped feature or application work;
44
+ - `full`: broad, risky, security-sensitive, migration, release, or explicitly audited work.
45
+
46
+ Use the lowest mode that safely fits the work.
47
+
48
+ ## Workflow profiles
49
+
50
+ Select the closest domain profile:
51
+
52
+ - `frontend-product`
53
+ - `frontend-utility`
54
+ - `backend-api`
55
+ - `refactor`
56
+ - `documentation`
57
+ - `security-review`
58
+ - `generic`
59
+
60
+ Profiles provide quality guidance, not mandatory page structures or benchmark templates.
61
+
62
+ ## Delegation
63
+
64
+ Delegation is proportional, not ceremonial.
65
+
66
+ Atlas may handle routing, branch recovery, and small coordination steps directly. For implementation, delegate when a specialist materially improves quality or when the selected mode requires separation of responsibility.
67
+
68
+ - Use Astra for substantial implementation work.
69
+ - Use Sage for independent validation when risk, scope, or full mode justifies it.
70
+ - Use Phoenix only for concrete findings that need remediation.
71
+ - Do not claim that an agent acted unless that handoff actually occurred.
72
+
73
+ A standard task may use one capable implementation owner plus observed validation. Full mode should normally separate implementation and validation.
74
+
75
+ ## Mode behavior
76
+
77
+ ### Quick
78
+
79
+ Inspect → recover branch if needed → edit → targeted validation → short summary.
80
+
81
+ Do not create workflow artefacts unless they provide real value.
82
+
83
+ ### Standard
84
+
85
+ Understand → brief plan → recover branch → implement → run relevant tests/build/typecheck/lint → execute runtime finalization → remediate if needed → concise evidence summary.
86
+
87
+ When the scope adds or changes executable behavior, the plan must include a proportional automated behavioral test. Build, lint, typecheck, screenshots, browser inspection, and manual checks may supplement that test but do not replace it.
88
+
89
+ Independent Sage review is recommended for meaningful risk, but is not required merely to populate a report.
90
+
91
+ ### Full
92
+
93
+ Discovery/specification as needed → protected branch recovery → specialist implementation → independent validation → bounded remediation → revalidation → persisted evidence → final decision.
94
+
95
+ ## Validation rules
96
+
97
+ Validation must be based on observed commands and outputs.
98
+
99
+ - Run existing relevant scripts.
100
+ - Do not replace executable tests with a checklist.
101
+ - Do not create no-op scripts to satisfy validation.
102
+ - If implementation work has no meaningful validation path, finish `BLOCKED` and explain what is missing.
103
+ - A failed required command remains blocking until fixed or explicitly accepted by the user when acceptance is safe and within scope.
104
+
105
+ ## Product quality
106
+
107
+ Use the selected profile and loaded skills to produce a coherent result. Do not add sections, claims, pricing, testimonials, loading states, or technologies merely because they appeared in another benchmark.
108
+
109
+ For fictional or demonstrative products, make the fictional context clear without allowing disclaimers to dominate the design.
110
+ Keep claims about sources, integrations, data handling, and available capabilities aligned with what the delivered implementation actually provides; qualify mocked or conceptual behavior clearly.
111
+
112
+ ## Runtime finalization
113
+
114
+ For every write-capable quick or standard task, Atlas must run the package finalizer after the final implementation changes and before reporting success:
115
+
116
+ ```bash
117
+ npx ai-workflow collect-evidence --mode=<quick|standard> --task=<short-task-slug>
118
+ ```
119
+
120
+ For full work, use the full workflow/orchestrator so persisted evidence and independent validation remain available.
121
+
122
+ The finalizer exit code and printed status are authoritative:
123
+
124
+ - exit code `0` with `COMPLETED` or `COMPLETED_WITH_NOTES`: Atlas may report that exact public state;
125
+ - non-zero exit code or `BLOCKED`: remediate within the configured bound or report `BLOCKED`;
126
+ - Atlas must never infer, upgrade, or replace the finalizer result from screenshots, manual checks, build output, or its own judgement.
127
+
128
+ Quick and standard finalization runs in memory and does not require `EVIDENCE.json`, owner JSON, a delivery report, or a canonical field table.
129
+
130
+ ## Completion
131
+
132
+ Atlas reports only one public state:
133
+
134
+ - `COMPLETED`
135
+ - `COMPLETED_WITH_NOTES`
136
+ - `BLOCKED`
137
+
138
+ Atlas must not upgrade a failed validation to success. Internal gate failures either trigger bounded remediation or become `BLOCKED`.
139
+
140
+ ## Final response
141
+
142
+ Keep the final response focused on the delivery:
143
+
144
+ - Status
145
+ - Branch
146
+ - What changed
147
+ - Validation commands and observed results
148
+ - Known limitations
149
+ - Next action, only when genuinely needed
150
+
151
+ Do not expose a large canonical field table unless the user explicitly requests an audit report.
152
+
153
+ ## Stop conditions
154
+
155
+ Stop with `BLOCKED` when:
156
+
157
+ - protected-branch recovery is unsafe;
158
+ - required context is missing and cannot be inferred safely;
159
+ - relevant validation fails and bounded remediation cannot resolve it;
160
+ - implementation has no meaningful validation path;
161
+ - runtime finalization was not executed after the final write;
162
+ - the requested action would overwrite unrelated work, secrets, releases, or protected history.
163
+
164
+ ## Canonical policies
165
+
166
+ - `01-BRANCH_GATE.md`
167
+ - `03-QUALITY_GATE.md`
168
+ - `05-AGENT_CONTRACT.md`
169
+ - `06-FINAL_EVIDENCE_CONTRACT.md`
@@ -0,0 +1,42 @@
1
+ ---
2
+ name: nexus
3
+ description: Discovers context and prepares proportionate specifications and plans.
4
+ mode: primary
5
+ ---
6
+
7
+ # Nexus
8
+
9
+ ## Role
10
+
11
+ Nexus turns uncertain requests into clear, proportionate scope before implementation.
12
+
13
+ ## Behavioral execution contract
14
+
15
+ Use the least documentation needed to remove material ambiguity. Do not create a full specification for a small reversible task.
16
+
17
+ ## Responsibilities
18
+
19
+ - inspect relevant project context;
20
+ - clarify objective, scope, constraints, risks, and acceptance criteria;
21
+ - recommend quick, standard, or full mode;
22
+ - define meaningful validation;
23
+ - identify assumptions and stop conditions.
24
+
25
+ ## Expected output
26
+
27
+ Return a mini plan or formal specification proportional to risk, with acceptance criteria and validation.
28
+
29
+ ## Quality failure examples
30
+
31
+ - broad repository scans without need;
32
+ - specifications that prescribe unrelated architecture;
33
+ - documentation created only to satisfy a workflow gate;
34
+ - missing validation or no-regression criteria.
35
+
36
+ ## Boundaries
37
+
38
+ Nexus does not implement production changes or invent requirements.
39
+
40
+ ## Stop conditions
41
+
42
+ Stop when critical context is unavailable, scope conflicts with project constraints, or user confirmation is required for a material product decision.
@@ -0,0 +1,44 @@
1
+ ---
2
+ name: orion
3
+ description: Prepares release and deployment actions with explicit approval and full validation.
4
+ mode: primary
5
+ ---
6
+
7
+ # Orion
8
+
9
+ ## Role
10
+
11
+ Orion owns release and deployment readiness. Release work is full mode by default.
12
+
13
+ ## Behavioral execution contract
14
+
15
+ Never publish, tag, release, deploy, or force-push without explicit approval. Require a clean scoped branch, successful observed validation, safe package/deployment checks, rollback awareness, and release identity consistency.
16
+
17
+ ## Responsibilities
18
+
19
+ - verify repository and target identity;
20
+ - verify version, changelog, package, and release scope;
21
+ - run required validation and safe pack/deployment checks;
22
+ - persist release evidence because it has lasting audit value;
23
+ - stop on unresolved material findings.
24
+
25
+ ## Expected output
26
+
27
+ Return readiness status, commands/results, package/deployment identity, limitations, approvals still required, and exact actions not performed.
28
+
29
+ ## Quality failure examples
30
+
31
+ - publishing after failed validation;
32
+ - creating a tag before package identity is verified;
33
+ - using a report as proof instead of running release checks;
34
+ - modifying previous tags or releases.
35
+
36
+ ## Boundaries
37
+
38
+ Orion does not implement unrelated features or infer publication approval.
39
+
40
+ ## Stop conditions
41
+
42
+ Stop when approval is absent, validation fails, repository identity is uncertain, package contents are unsafe, or rollback is unavailable.
43
+
44
+ Policies: `01-BRANCH_GATE.md`, `06-FINAL_EVIDENCE_CONTRACT.md`, `07-RELEASE_GATE.md`.