@nathapp/nax 0.28.0 → 0.29.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (376) hide show
  1. package/CHANGELOG.md +13 -2
  2. package/dist/nax.js +72691 -0
  3. package/package.json +12 -4
  4. package/src/cli/config.ts +3 -1
  5. package/src/config/defaults.ts +1 -0
  6. package/src/config/schemas.ts +1 -0
  7. package/src/config/types.ts +1 -0
  8. package/src/context/builder.ts +10 -1
  9. package/src/prompts/sections/role-task.ts +4 -2
  10. package/src/review/runner.ts +6 -1
  11. package/src/version.ts +2 -1
  12. package/.claude/rules/01-project-conventions.md +0 -34
  13. package/.claude/rules/02-test-architecture.md +0 -39
  14. package/.claude/rules/03-test-writing.md +0 -58
  15. package/.claude/rules/04-forbidden-patterns.md +0 -29
  16. package/.claude/settings.json +0 -15
  17. package/.githooks/pre-commit +0 -16
  18. package/.gitlab-ci.yml +0 -103
  19. package/.mcp.json +0 -8
  20. package/BRIEF.md +0 -140
  21. package/CLAUDE.md +0 -143
  22. package/US-007-IMPLEMENTATION.md +0 -139
  23. package/biome.json +0 -14
  24. package/bun.lock +0 -163
  25. package/bunfig.toml +0 -12
  26. package/docker-compose.test.yml +0 -15
  27. package/docs/20260216-fix-plan-context-review.md +0 -56
  28. package/docs/20260216-relentless-vs-ngent-comparison.md +0 -208
  29. package/docs/20260216-v02-plan.md +0 -136
  30. package/docs/20260216-v02-review.md +0 -685
  31. package/docs/20260217-dogfood-findings.md +0 -56
  32. package/docs/20260217-p2-plus-plan.md +0 -117
  33. package/docs/20260217-partial-fixes-plan.md +0 -62
  34. package/docs/20260217-plan-analyze-spec.md +0 -117
  35. package/docs/20260217-post-impl-review.md +0 -1137
  36. package/docs/20260217-quick-wins-plan.md +0 -66
  37. package/docs/20260217-split-runner-plan.md +0 -75
  38. package/docs/20260217-v03-impl-plan.md +0 -80
  39. package/docs/20260217-v03-post-impl-review.md +0 -589
  40. package/docs/20260217-v04-impl-plan.md +0 -86
  41. package/docs/20260217-v05-post-impl-review.md +0 -850
  42. package/docs/20260217-v06-post-impl-review.md +0 -817
  43. package/docs/20260218-adr003-port-plan.md +0 -151
  44. package/docs/20260218-review-adr003-verification.md +0 -175
  45. package/docs/20260219-fix-plan-bug16-19.md +0 -79
  46. package/docs/20260219-fix-plan-bug20-22.md +0 -114
  47. package/docs/20260219-plan-llm-routing.md +0 -116
  48. package/docs/20260219-review-bug20-22-fixes.md +0 -135
  49. package/docs/20260219-routing-baseline-keyword.md +0 -63
  50. package/docs/20260220-plan-structured-logging-p1.md +0 -80
  51. package/docs/20260220-plan-structured-logging-p2.md +0 -37
  52. package/docs/20260220-review-llm-routing.md +0 -180
  53. package/docs/20260220-review-post-fix-llm-routing.md +0 -70
  54. package/docs/20260221-fix-plan-relevantfiles-split.md +0 -101
  55. package/docs/20260221-fix-plan-routing-mode.md +0 -125
  56. package/docs/20260221-review-v0.9-implementation.md +0 -379
  57. package/docs/20260222-fix-plan-v091-routing-isolation.md +0 -197
  58. package/docs/20260223-fix-plan-prompt-audit.md +0 -62
  59. package/docs/20260224-nax-roadmap-phases.md +0 -189
  60. package/docs/20260225-phase2-llm-service-layer.md +0 -401
  61. package/docs/20260225-review-v0.10.1.md +0 -187
  62. package/docs/20260303-v010-implementation-plan.md +0 -165
  63. package/docs/20260304-review-nax.md +0 -492
  64. package/docs/CLAUDE.md.bak +0 -191
  65. package/docs/ROADMAP.md +0 -390
  66. package/docs/SPEC-rectification.md +0 -0
  67. package/docs/SPEC.md +0 -324
  68. package/docs/US-001-plugin-loading-verification.md +0 -152
  69. package/docs/adr/ADR-005-implementation-plan.md +0 -655
  70. package/docs/adr/ADR-005-pipeline-re-architecture.md +0 -464
  71. package/docs/architecture-analysis.md +0 -1076
  72. package/docs/bugs/BUG-21-escalation-null-attempts.md +0 -48
  73. package/docs/bugs-from-dogfood-run-c.md +0 -243
  74. package/docs/code-review-20260228.md +0 -612
  75. package/docs/code-review-v0.15.0.md +0 -629
  76. package/docs/hook-lifecycle-test-plan.md +0 -149
  77. package/docs/releases/v0.11.0-and-earlier.md +0 -20
  78. package/docs/releases/v0.12.0.md +0 -15
  79. package/docs/releases/v0.13.0.md +0 -14
  80. package/docs/releases/v0.14.0.md +0 -20
  81. package/docs/releases/v0.14.1.md +0 -36
  82. package/docs/releases/v0.14.2.md +0 -51
  83. package/docs/releases/v0.14.3.md +0 -174
  84. package/docs/releases/v0.14.4.md +0 -94
  85. package/docs/releases/v0.15.0.md +0 -502
  86. package/docs/releases/v0.15.1.md +0 -170
  87. package/docs/releases/v0.15.3.md +0 -193
  88. package/docs/specs/bug-039-orphan-processes.md +0 -131
  89. package/docs/specs/bug-040-review-rectification.md +0 -82
  90. package/docs/specs/bug-041-cross-story-test-isolation.md +0 -88
  91. package/docs/specs/bug-042-verifier-failure-capture.md +0 -117
  92. package/docs/specs/bun-pty-migration.md +0 -171
  93. package/docs/specs/central-run-registry.md +0 -116
  94. package/docs/specs/feat-010-smart-runner-git-history.md +0 -96
  95. package/docs/specs/feat-011-file-context-strategy.md +0 -73
  96. package/docs/specs/feat-012-tdd-writer-tier.md +0 -79
  97. package/docs/specs/feat-013-test-after-review.md +0 -89
  98. package/docs/specs/feat-014-heartbeat-observability.md +0 -127
  99. package/docs/specs/status-file-consolidation.md +0 -93
  100. package/docs/specs/status-file-v0.10.1.md +0 -812
  101. package/docs/specs/trigger-completion.md +0 -145
  102. package/docs/specs/verification-architecture-v2.md +0 -343
  103. package/docs/tdd/strategies.md +0 -97
  104. package/docs/v0.10-global-config.md +0 -206
  105. package/docs/v0.10-plugin-system.md +0 -415
  106. package/docs/v0.10-prompt-optimizer.md +0 -234
  107. package/docs/v0.3-spec.md +0 -244
  108. package/docs/v0.4-spec.md +0 -140
  109. package/docs/v0.5-spec.md +0 -237
  110. package/docs/v0.6-spec.md +0 -371
  111. package/docs/v0.7-spec.md +0 -177
  112. package/docs/v0.8-llm-routing.md +0 -206
  113. package/docs/v0.8-structured-logging.md +0 -132
  114. package/docs/v0.9.3-prompt-audit.md +0 -112
  115. package/examples/plugins/console-reporter/index.test.ts +0 -207
  116. package/examples/plugins/console-reporter/index.ts +0 -110
  117. package/memory/topic/feat-010-baseref.md +0 -28
  118. package/memory/topic/feat-013-test-after-deprecation.md +0 -22
  119. package/nax/config.json +0 -154
  120. package/nax/features/bug-039-medium/prd.json +0 -45
  121. package/nax/features/bugfix-v0171/prd.json +0 -52
  122. package/nax/features/central-run-registry/prd.json +0 -105
  123. package/nax/features/config-management/prd.json +0 -108
  124. package/nax/features/config-management/progress.txt +0 -5
  125. package/nax/features/diagnose/acceptance.test.ts +0 -414
  126. package/nax/features/diagnose/prd.json +0 -41
  127. package/nax/features/nax-compliance/prd.json +0 -52
  128. package/nax/features/nax-compliance/progress.txt +0 -1
  129. package/nax/features/orchestration-fixes/prd.json +0 -89
  130. package/nax/features/orchestration-fixes/progress.txt +0 -1
  131. package/nax/features/plugin-integration/US-007-VERIFICATION.md +0 -259
  132. package/nax/features/plugin-integration/prd.json +0 -208
  133. package/nax/features/plugin-integration/progress.txt +0 -5
  134. package/nax/features/post-rearch-bugfix/prd.json +0 -137
  135. package/nax/features/precheck/prd.json +0 -205
  136. package/nax/features/precheck/progress.txt +0 -15
  137. package/nax/features/prompt-builder/prd.json +0 -152
  138. package/nax/features/prompt-builder/progress.txt +0 -3
  139. package/nax/features/review-quality/prd.json +0 -55
  140. package/nax/features/routing-persistence/prd.json +0 -104
  141. package/nax/features/routing-persistence/progress.txt +0 -1
  142. package/nax/features/smart-test-runner/plan.md +0 -7
  143. package/nax/features/smart-test-runner/prd.json +0 -203
  144. package/nax/features/smart-test-runner/progress.txt +0 -13
  145. package/nax/features/smart-test-runner/spec.md +0 -7
  146. package/nax/features/smart-test-runner/tasks.md +0 -8
  147. package/nax/features/status-file-consolidation/prd.json +0 -106
  148. package/nax/features/structured-logging/prd.json +0 -199
  149. package/nax/features/trigger-completion/prd.json +0 -150
  150. package/nax/features/trigger-completion/progress.txt +0 -7
  151. package/nax/features/unlock/prd.json +0 -36
  152. package/nax/features/v0.18.3-execution-reliability/prd.json +0 -80
  153. package/nax/features/v0.18.3-execution-reliability/progress.txt +0 -3
  154. package/nax/features/v0.19.0-hardening/plan.md +0 -7
  155. package/nax/features/v0.19.0-hardening/prd.json +0 -84
  156. package/nax/features/v0.19.0-hardening/progress.txt +0 -7
  157. package/nax/features/v0.19.0-hardening/spec.md +0 -18
  158. package/nax/features/v0.19.0-hardening/tasks.md +0 -8
  159. package/nax/features/verify-v2/prd.json +0 -79
  160. package/nax/features/verify-v2/progress.txt +0 -3
  161. package/nax/status.json +0 -36
  162. package/test/COVERAGE-GAPS.md +0 -333
  163. package/test/e2e/cm-003-default-view.test.ts +0 -195
  164. package/test/e2e/plan-analyze-run.test.ts +0 -902
  165. package/test/helpers/helpers.test.ts +0 -295
  166. package/test/helpers/timeout.ts +0 -42
  167. package/test/integration/US-002-TEST-SUMMARY.md +0 -107
  168. package/test/integration/US-003-TEST-SUMMARY.md +0 -149
  169. package/test/integration/US-004-TEST-SUMMARY.md +0 -106
  170. package/test/integration/US-005-TEST-SUMMARY.md +0 -138
  171. package/test/integration/US-007-TEST-SUMMARY.md +0 -100
  172. package/test/integration/cli/agent-validation.test.ts +0 -439
  173. package/test/integration/cli/cli-config-default-edge-cases.test.ts +0 -223
  174. package/test/integration/cli/cli-config-default-view.test.ts +0 -230
  175. package/test/integration/cli/cli-config-diff.test.ts +0 -461
  176. package/test/integration/cli/cli-config-prompts-explain.test.ts +0 -74
  177. package/test/integration/cli/cli-config.test.ts +0 -737
  178. package/test/integration/cli/cli-diagnose.test.ts +0 -595
  179. package/test/integration/cli/cli-logs.test.ts +0 -346
  180. package/test/integration/cli/cli-plugins.test.ts +0 -679
  181. package/test/integration/cli/cli-precheck.test.ts +0 -372
  182. package/test/integration/cli/cli-run-headless.test.ts +0 -174
  183. package/test/integration/cli/cli.test.ts +0 -76
  184. package/test/integration/cli/precheck-integration.test.ts +0 -476
  185. package/test/integration/cli/precheck-orchestrator.test.ts +0 -247
  186. package/test/integration/cli/precheck.test.ts +0 -806
  187. package/test/integration/config/config-loader.test.ts +0 -266
  188. package/test/integration/config/config.test.ts +0 -444
  189. package/test/integration/config/merger.test.ts +0 -466
  190. package/test/integration/config/paths.test.ts +0 -52
  191. package/test/integration/config/security-loader.test.ts +0 -83
  192. package/test/integration/context/context-integration.test.ts +0 -703
  193. package/test/integration/context/context-path-security.test.ts +0 -173
  194. package/test/integration/context/context-provider-injection.test.ts +0 -507
  195. package/test/integration/context/context-verification-integration.test.ts +0 -296
  196. package/test/integration/context/s5-greenfield-fallback.test.ts +0 -298
  197. package/test/integration/execution/execution-isolation.test.ts +0 -143
  198. package/test/integration/execution/execution.test.ts +0 -634
  199. package/test/integration/execution/feature-status-write.test.ts +0 -302
  200. package/test/integration/execution/parallel.test.ts +0 -251
  201. package/test/integration/execution/prd-pause.test.ts +0 -205
  202. package/test/integration/execution/prd-resolvers.test.ts +0 -186
  203. package/test/integration/execution/progress.test.ts +0 -34
  204. package/test/integration/execution/runner-batching.test.ts +0 -682
  205. package/test/integration/execution/runner-config-plugins.test.ts +0 -462
  206. package/test/integration/execution/runner-escalation.test.ts +0 -561
  207. package/test/integration/execution/runner-fixes.test.ts +0 -400
  208. package/test/integration/execution/runner-plugin-integration.test.ts +0 -544
  209. package/test/integration/execution/runner-queue-and-attempts.test.ts +0 -476
  210. package/test/integration/execution/status-file-integration.test.ts +0 -289
  211. package/test/integration/execution/status-file.test.ts +0 -380
  212. package/test/integration/execution/status-writer.test.ts +0 -447
  213. package/test/integration/execution/story-id-in-events.test.ts +0 -274
  214. package/test/integration/interaction/interaction-chain-pipeline.test.ts +0 -476
  215. package/test/integration/pipeline/hooks.test.ts +0 -363
  216. package/test/integration/pipeline/pipeline-acceptance.test.ts +0 -303
  217. package/test/integration/pipeline/pipeline-events.test.ts +0 -476
  218. package/test/integration/pipeline/pipeline.test.ts +0 -660
  219. package/test/integration/pipeline/reporter-lifecycle.test.ts +0 -862
  220. package/test/integration/pipeline/verify-stage.test.ts +0 -286
  221. package/test/integration/plan/analyze-integration.test.ts +0 -262
  222. package/test/integration/plan/analyze-scanner.test.ts +0 -132
  223. package/test/integration/plan/logger.test.ts +0 -461
  224. package/test/integration/plan/plan.test.ts +0 -157
  225. package/test/integration/plugins/config-integration.test.ts +0 -173
  226. package/test/integration/plugins/config-resolution.test.ts +0 -523
  227. package/test/integration/plugins/loader.test.ts +0 -644
  228. package/test/integration/plugins/plugins-registry.test.ts +0 -747
  229. package/test/integration/plugins/validator.test.ts +0 -564
  230. package/test/integration/prompts/pb-004-migration.test.ts +0 -523
  231. package/test/integration/review/review-config-commands.test.ts +0 -320
  232. package/test/integration/review/review-config-schema.test.ts +0 -117
  233. package/test/integration/review/review-plugin-integration.test.ts +0 -729
  234. package/test/integration/review/review.test.ts +0 -150
  235. package/test/integration/routing/plugin-routing-advanced.test.ts +0 -461
  236. package/test/integration/routing/plugin-routing-core.test.ts +0 -527
  237. package/test/integration/routing/routing-stage-bug-021.test.ts +0 -275
  238. package/test/integration/routing/routing-stage-greenfield.test.ts +0 -287
  239. package/test/integration/tdd/tdd-cleanup.test.ts +0 -246
  240. package/test/integration/tdd/tdd-orchestrator-core.test.ts +0 -565
  241. package/test/integration/tdd/tdd-orchestrator-failureCategory.test.ts +0 -355
  242. package/test/integration/tdd/tdd-orchestrator-fallback.test.ts +0 -311
  243. package/test/integration/tdd/tdd-orchestrator-lite.test.ts +0 -289
  244. package/test/integration/tdd/tdd-orchestrator-prompts.test.ts +0 -260
  245. package/test/integration/tdd/tdd-orchestrator-verdict.test.ts +0 -536
  246. package/test/integration/tmp/headless-test/test.jsonl +0 -30
  247. package/test/integration/verification/test-scanner.test.ts +0 -403
  248. package/test/integration/verification/verification-asset-check.test.ts +0 -143
  249. package/test/integration/worktree/manager.test.ts +0 -218
  250. package/test/integration/worktree/worktree-merge.test.ts +0 -341
  251. package/test/manual/logging-formatter-demo.ts +0 -158
  252. package/test/ui/tui-agent-panel.test.tsx +0 -99
  253. package/test/ui/tui-pty-integration.test.tsx +0 -146
  254. package/test/unit/acceptance.test.ts +0 -187
  255. package/test/unit/agent-stderr-capture.test.ts +0 -147
  256. package/test/unit/agents/claude.test.ts +0 -107
  257. package/test/unit/analyze-classifier.test.ts +0 -216
  258. package/test/unit/analyze.test.ts +0 -224
  259. package/test/unit/auto-detect.test.ts +0 -250
  260. package/test/unit/cli-status-project-level.test.ts +0 -283
  261. package/test/unit/cli-status.test.ts +0 -418
  262. package/test/unit/commands/common.test.ts +0 -321
  263. package/test/unit/commands/logs.test.ts +0 -458
  264. package/test/unit/commands/runs.test.ts +0 -303
  265. package/test/unit/commands/unlock.test.ts +0 -320
  266. package/test/unit/config/defaults.test.ts +0 -70
  267. package/test/unit/config/quality-commands-schema.test.ts +0 -72
  268. package/test/unit/config/regression-gate-schema.test.ts +0 -160
  269. package/test/unit/config/smart-runner-flag.test.ts +0 -250
  270. package/test/unit/constitution-generators.test.ts +0 -161
  271. package/test/unit/constitution.test.ts +0 -210
  272. package/test/unit/context/context-autodetect.test.ts +0 -297
  273. package/test/unit/context/context-build.test.ts +0 -575
  274. package/test/unit/context/context-coverage.test.ts +0 -236
  275. package/test/unit/context/context-error.test.ts +0 -93
  276. package/test/unit/context/context-estimate-tokens.test.ts +0 -201
  277. package/test/unit/context/context-format.test.ts +0 -302
  278. package/test/unit/context/context-isolation.test.ts +0 -267
  279. package/test/unit/context/context-sort.test.ts +0 -93
  280. package/test/unit/context/context-story.test.ts +0 -108
  281. package/test/unit/context/prior-failures.test.ts +0 -463
  282. package/test/unit/context.test.ts +0 -1726
  283. package/test/unit/cost.test.ts +0 -231
  284. package/test/unit/crash-recovery.test.ts +0 -309
  285. package/test/unit/escalation.test.ts +0 -127
  286. package/test/unit/execution/lifecycle/run-completion.test.ts +0 -240
  287. package/test/unit/execution/lifecycle/run-regression.test.ts +0 -420
  288. package/test/unit/execution/pid-registry.test.ts +0 -241
  289. package/test/unit/execution/sequential-executor.test.ts +0 -235
  290. package/test/unit/execution/sfc-004-dead-code-cleanup.test.ts +0 -89
  291. package/test/unit/execution/structured-failure.test.ts +0 -415
  292. package/test/unit/execution-logging-stderr.test.ts +0 -157
  293. package/test/unit/execution-stage.test.ts +0 -123
  294. package/test/unit/fix-generator.test.ts +0 -276
  295. package/test/unit/formatters.test.ts +0 -468
  296. package/test/unit/greenfield.test.ts +0 -180
  297. package/test/unit/hooks/shell-security.test.ts +0 -40
  298. package/test/unit/interaction/auto-plugin.test.ts +0 -162
  299. package/test/unit/interaction/human-review-trigger.test.ts +0 -165
  300. package/test/unit/interaction-network-failures.test.ts +0 -390
  301. package/test/unit/interaction-plugins.test.ts +0 -472
  302. package/test/unit/logging/formatter.test.ts +0 -456
  303. package/test/unit/merge.test.ts +0 -269
  304. package/test/unit/metrics/aggregator.test.ts +0 -164
  305. package/test/unit/metrics/tracker.test.ts +0 -186
  306. package/test/unit/metrics.test.ts +0 -276
  307. package/test/unit/optimizer/noop.optimizer.test.ts +0 -125
  308. package/test/unit/optimizer/rule-based.optimizer.test.ts +0 -358
  309. package/test/unit/pipeline/event-bus.test.ts +0 -105
  310. package/test/unit/pipeline/routing-partial-override.test.ts +0 -121
  311. package/test/unit/pipeline/runner-retry.test.ts +0 -89
  312. package/test/unit/pipeline/stages/autofix.test.ts +0 -97
  313. package/test/unit/pipeline/stages/completion-review-gate.test.ts +0 -218
  314. package/test/unit/pipeline/stages/execution-ambiguity.test.ts +0 -311
  315. package/test/unit/pipeline/stages/execution-merge-conflict.test.ts +0 -218
  316. package/test/unit/pipeline/stages/rectify.test.ts +0 -101
  317. package/test/unit/pipeline/stages/regression-stage.test.ts +0 -69
  318. package/test/unit/pipeline/stages/review.test.ts +0 -201
  319. package/test/unit/pipeline/stages/routing-idempotence.test.ts +0 -139
  320. package/test/unit/pipeline/stages/routing-initial-complexity.test.ts +0 -321
  321. package/test/unit/pipeline/stages/routing-persistence.test.ts +0 -380
  322. package/test/unit/pipeline/stages/verify.test.ts +0 -267
  323. package/test/unit/pipeline/subscribers/events-writer.test.ts +0 -227
  324. package/test/unit/pipeline/subscribers/hooks.test.ts +0 -84
  325. package/test/unit/pipeline/subscribers/interaction.test.ts +0 -313
  326. package/test/unit/pipeline/subscribers/registry.test.ts +0 -149
  327. package/test/unit/pipeline/subscribers/reporters.test.ts +0 -90
  328. package/test/unit/pipeline/verify-smart-runner.test.ts +0 -345
  329. package/test/unit/prd-auto-default.test.ts +0 -291
  330. package/test/unit/prd-failure-category.test.ts +0 -177
  331. package/test/unit/prd-get-next-story.test.ts +0 -215
  332. package/test/unit/precheck/checks-warnings.test.ts +0 -114
  333. package/test/unit/precheck-checks.test.ts +0 -841
  334. package/test/unit/precheck-story-size-gate.test.ts +0 -288
  335. package/test/unit/precheck-types.test.ts +0 -143
  336. package/test/unit/prompts/builder.test.ts +0 -258
  337. package/test/unit/prompts/loader.test.ts +0 -355
  338. package/test/unit/prompts/sections/conventions.test.ts +0 -30
  339. package/test/unit/prompts/sections/isolation.test.ts +0 -35
  340. package/test/unit/prompts/sections/role-task.test.ts +0 -40
  341. package/test/unit/prompts/sections/sections.test.ts +0 -238
  342. package/test/unit/prompts/sections/story.test.ts +0 -45
  343. package/test/unit/prompts/sections/verdict.test.ts +0 -58
  344. package/test/unit/prompts.test.ts +0 -476
  345. package/test/unit/queue.test.ts +0 -237
  346. package/test/unit/rectification.test.ts +0 -285
  347. package/test/unit/registry.test.ts +0 -288
  348. package/test/unit/review/runner.test.ts +0 -117
  349. package/test/unit/routing/content-hash.test.ts +0 -99
  350. package/test/unit/routing/routing-stability.test.ts +0 -208
  351. package/test/unit/routing/strategies/llm.test.ts +0 -306
  352. package/test/unit/routing-advanced.test.ts +0 -313
  353. package/test/unit/routing-core.test.ts +0 -341
  354. package/test/unit/routing-strategies.test.ts +0 -440
  355. package/test/unit/storyid-events.test.ts +0 -213
  356. package/test/unit/tdd-verdict.test.ts +0 -492
  357. package/test/unit/test-output-parser.test.ts +0 -377
  358. package/test/unit/ui/tui-controls.test.ts +0 -335
  359. package/test/unit/ui/tui-cost-and-pty.test.ts +0 -190
  360. package/test/unit/ui/tui-layout.test.ts +0 -379
  361. package/test/unit/ui/tui-stories.test.ts +0 -333
  362. package/test/unit/unit-isolation.test.ts +0 -135
  363. package/test/unit/utils/git.test.ts +0 -50
  364. package/test/unit/utils/path-security.test.ts +0 -47
  365. package/test/unit/utils-helpers.test.ts +0 -318
  366. package/test/unit/verdict.test.ts +0 -325
  367. package/test/unit/verification/orchestrator-types.test.ts +0 -54
  368. package/test/unit/verification/orchestrator.test.ts +0 -66
  369. package/test/unit/verification/smart-runner-config.test.ts +0 -163
  370. package/test/unit/verification/smart-runner-discovery.test.ts +0 -354
  371. package/test/unit/verification/smart-runner.test.ts +0 -262
  372. package/test/unit/verification/strategies/acceptance.test.ts +0 -33
  373. package/test/unit/verification/strategies/regression.test.ts +0 -87
  374. package/test/unit/verification/strategies/scoped.test.ts +0 -100
  375. package/test/unit/worktree-manager.test.ts +0 -159
  376. package/tsconfig.json +0 -27
@@ -1,197 +0,0 @@
1
- # Fix Plan: v0.9.1 — Routing Respect + TDD Isolation Rework
2
-
3
- **Date:** 2026-02-22
4
- **Branch:** fix/v0.9.1-routing-isolation
5
- **Base:** Revert commits 211a884 and 4fa39a4, then apply clean fixes
6
-
7
- ## Context
8
-
9
- Two commits (211a884, 4fa39a4) attempted to fix 4 issues but introduced problems:
10
- 1. `determineTestStrategy()` still overrides LLM complexity via keyword scan
11
- 2. Story count prompt hint isn't enforced
12
- 3. `analyzeConfig` metadata is fine but incomplete (missing naxVersion)
13
- 4. Isolation check now always passes (toothless)
14
-
15
- This plan implements clean fixes for all 4 issues.
16
-
17
- ---
18
-
19
- ## Phase 1: Revert and Create Branch
20
-
21
- 1. `git revert --no-commit 4fa39a4 211a884` (revert both commits)
22
- 2. `git checkout -b fix/v0.9.1-routing-isolation`
23
- 3. Commit: `revert: undo 211a884 and 4fa39a4 for clean reimplementation`
24
-
25
- ---
26
-
27
- ## Phase 2: Fix Routing — LLM testStrategy in Decomposition
28
-
29
- **Problem:** `determineTestStrategy()` re-scans keywords after LLM already classified complexity, overriding LLM decisions for simple tasks.
30
-
31
- **Fix:** When `strategy=llm`, have the LLM output `testStrategy` directly in its decomposition response. `determineTestStrategy()` is only used for keyword-mode fallback.
32
-
33
- ### Changes:
34
-
35
- **File: `src/agents/claude.ts`** (decomposition prompt)
36
- - Add to the decomposition prompt schema: each story must include `testStrategy: "three-session-tdd" | "test-after"`
37
- - Add decision rules to prompt:
38
- ```
39
- testStrategy rules:
40
- - "three-session-tdd": ONLY for complex/expert tasks that are security-critical (auth, encryption, tokens) or define public API contracts
41
- - "test-after": for all other tasks including simple/medium complexity
42
- - A task being "simple" complexity should almost never be three-session-tdd
43
- ```
44
- - Add `testStrategy` to the expected JSON response schema alongside existing `complexity` field
45
-
46
- **File: `src/cli/analyze.ts`**
47
- - When building UserStory from LLM decomposition result:
48
- - Use `ds.testStrategy` directly (from LLM response) instead of calling `determineTestStrategy()`
49
- - Fallback to `determineTestStrategy()` only if LLM didn't return a testStrategy
50
- - When using keyword classification (non-LLM path): keep calling `determineTestStrategy()` as-is
51
- - Add `routing.strategy: "llm" | "keyword"` and `routing.llmModel` to the story routing object
52
-
53
- **File: `src/prd/types.ts`**
54
- - Add to `StoryRouting`:
55
- ```ts
56
- strategy?: "keyword" | "llm";
57
- llmModel?: string;
58
- ```
59
-
60
- ### Tests:
61
- - Update existing analyze tests to verify LLM-classified stories use LLM's testStrategy
62
- - Add test: simple story with "auth" in tags gets `test-after` from LLM (not overridden to three-session-tdd)
63
-
64
- ---
65
-
66
- ## Phase 3: Enforce Max Stories + Add analyzeConfig
67
-
68
- **Problem:** Prompt hint for max stories isn't enforced. analyzeConfig missing naxVersion.
69
-
70
- ### Changes:
71
-
72
- **File: `src/agents/claude.ts`** (decomposition prompt)
73
- - Add grouping guidelines (from 211a884 — this part was good):
74
- ```
75
- Grouping Guidelines:
76
- - Combine small related tasks into single stories
77
- - Maximum stories: {maxStories} (from config). If you generate more, merge related ones.
78
- - Aim for coherent units of value
79
- ```
80
- - Pass `maxStories` from config into the prompt template
81
-
82
- **File: `src/cli/analyze.ts`**
83
- - After LLM returns stories, if count > `config.execution.maxStoriesPerFeature`:
84
- - Log a warning: `"LLM returned {n} stories, exceeding max {max}. Consider re-running with stricter grouping."`
85
- - Do NOT auto-truncate (could lose important work). Just warn.
86
- - Add `analyzeConfig` to PRD output:
87
- ```ts
88
- analyzeConfig: {
89
- naxVersion: pkg.version, // read from package.json
90
- model: config.analyze.model,
91
- llmEnhanced: config.analyze.llmEnhanced,
92
- maxStoriesPerFeature: config.execution.maxStoriesPerFeature,
93
- routingStrategy: config.analyze.llmEnhanced ? "llm" : "keyword",
94
- }
95
- ```
96
-
97
- **File: `src/prd/types.ts`**
98
- - Add to PRD interface:
99
- ```ts
100
- analyzeConfig?: {
101
- naxVersion: string;
102
- model: string;
103
- llmEnhanced: boolean;
104
- maxStoriesPerFeature: number;
105
- routingStrategy: "keyword" | "llm";
106
- };
107
- ```
108
-
109
- ### Tests:
110
- - Test that analyzeConfig is populated with correct values
111
- - Test warning logged when stories exceed max
112
-
113
- ---
114
-
115
- ## Phase 4: TDD Isolation — Detector + Verifier Judgment
116
-
117
- **Problem:** Isolation check always passes after 211a884. Should detect and report, let verifier judge.
118
-
119
- ### Changes:
120
-
121
- **File: `src/tdd/types.ts`**
122
- - Update `IsolationCheck`:
123
- ```ts
124
- interface IsolationCheck {
125
- /** Whether strict isolation was maintained (no test files touched) */
126
- strictPass: boolean;
127
- /** Test files modified by implementer */
128
- modifiedTestFiles: string[];
129
- /** Verdict: clean (no changes), needs-review (verifier must judge) */
130
- verdict: "clean" | "needs-review";
131
- description: string;
132
- }
133
- ```
134
-
135
- **File: `src/tdd/isolation.ts`**
136
- - `verifyImplementerIsolation()` returns honest results:
137
- - If no test files modified: `{ strictPass: true, modifiedTestFiles: [], verdict: "clean" }`
138
- - If test files modified: `{ strictPass: false, modifiedTestFiles: [...], verdict: "needs-review" }`
139
- - Do NOT return `passed: true` when files were modified
140
-
141
- **File: `src/tdd/orchestrator.ts`**
142
- - After Session 2 isolation check:
143
- - If `verdict === "clean"`: proceed normally
144
- - If `verdict === "needs-review"`: inject modified file info into verifier (Session 3) prompt
145
- - Update verifier prompt:
146
- ```
147
- ⚠️ ISOLATION REVIEW REQUIRED
148
- The implementer modified these test files: {modifiedTestFiles}
149
-
150
- You MUST review the changes to these files and determine:
151
- - LEGITIMATE: Fixing genuinely incorrect test expectations, adding missing imports,
152
- adjusting test fixtures to match correct implementation behavior
153
- - VIOLATION: Removing test cases, weakening assertions, deleting acceptance criteria
154
- checks, adding tests to inflate pass rate
155
-
156
- Include your verdict in the output:
157
- - isolationVerdict: "approved" | "rejected"
158
- - isolationReason: "<explanation>"
159
-
160
- If REJECTED: fail the story.
161
- ```
162
- - Parse verifier output for isolation verdict
163
- - Log the verdict (approved/rejected + reason) to structured JSONL
164
-
165
- **File: `src/tdd/orchestrator.ts`** (runTddSession result handling)
166
- - When isolation.verdict === "needs-review" and verifier says "rejected":
167
- - Mark story as failed with reason "TDD isolation violation confirmed by verifier"
168
- - When isolation.verdict === "needs-review" and verifier says "approved":
169
- - Mark story as passed with warning logged
170
-
171
- ### Tests:
172
- - Test isolation detection: modified test files → verdict "needs-review"
173
- - Test clean isolation: no test files → verdict "clean"
174
- - Test orchestrator injects isolation context into verifier prompt when needs-review
175
- - Test story fails when verifier rejects isolation
176
-
177
- ---
178
-
179
- ## Phase 5: Version Bump + Cleanup
180
-
181
- 1. Bump version to `0.9.1` in `package.json`
182
- 2. Run full test suite: `bun test`
183
- 3. Commit: `fix(v0.9.1): routing respects LLM complexity, isolation reworked to detector+verifier`
184
- 4. Do NOT push.
185
-
186
- ---
187
-
188
- ## Test Strategy
189
- - Mode: test-after
190
- - Reason: Internal refactor with existing test coverage. Tests updated alongside implementation per phase.
191
-
192
- ## Commits
193
- - Phase 1: `revert: undo 211a884 and 4fa39a4 for clean reimplementation`
194
- - Phase 2: `fix(routing): LLM decomposition outputs testStrategy directly`
195
- - Phase 3: `fix(analyze): enforce max stories warning, add analyzeConfig with naxVersion`
196
- - Phase 4: `fix(tdd): isolation becomes detector, verifier makes judgment`
197
- - Phase 5: `chore: bump to v0.9.1`
@@ -1,62 +0,0 @@
1
- # Fix Plan: nax prompts CLI + Scoped Test Coverage
2
- **Date:** 2026-02-23
3
- **Branch:** master (direct, v0.9.3)
4
-
5
- ## Phase 1: `nax prompts` CLI Command (US-001)
6
-
7
- ### Fix 1: Add CLI command handler
8
- **File:** `src/cli/prompts.ts` (new)
9
- **Change:** New CLI command that:
10
- - Accepts `-f <feature>` (required), `--out <dir>` (optional, default stdout), `--story <id>` (optional filter)
11
- - Loads PRD from feature dir
12
- - Loads config
13
- - For each story (or filtered story):
14
- - Runs routing (classify complexity)
15
- - Runs context building (buildContext + formatContextAsMarkdown)
16
- - Loads constitution (if configured)
17
- - Assembles prompt via buildSingleSessionPrompt / buildBatchPrompt
18
- - For three-session-tdd stories: also builds test-writer/implementer/verifier prompts
19
- - Outputs to stdout or writes files with YAML frontmatter
20
-
21
- ### Fix 2: Register CLI command
22
- **File:** `src/cli/index.ts`
23
- **Change:** Add `prompts` subcommand to the CLI parser. Wire to handler.
24
-
25
- ### Fix 3: Add tests
26
- **File:** `test/prompts-cli.test.ts` (new)
27
- **Change:** Test that:
28
- - `nax prompts` loads PRD and produces prompt files
29
- - Frontmatter includes storyId, testStrategy, contextTokens
30
- - `--story` flag filters to single story
31
- - Three-session-tdd stories produce separate session prompts
32
- - Output dir is created if it doesn't exist
33
-
34
- ## Phase 2: Scoped Test Coverage Scanner (US-003)
35
-
36
- ### Fix 4: Add story scoping to test scanner
37
- **File:** `src/context/test-scanner.ts`
38
- **Change:**
39
- - Accept optional `scopeFiles?: string[]` parameter
40
- - When scopeFiles provided, derive test file patterns (e.g., `src/health.service.ts` → `**/health.service.{spec,test}.ts`)
41
- - Filter scan results to only matching test files
42
- - Fall back to full scan when scopeFiles is empty/undefined
43
-
44
- ### Fix 5: Wire scoping in context builder
45
- **File:** `src/context/builder.ts`
46
- **Change:** Pass `currentStory.contextFiles` to generateTestCoverageSummary as scopeFiles.
47
-
48
- ### Fix 6: Add config option
49
- **File:** `src/config/schema.ts`
50
- **Change:** Add `context.testCoverage.scopeToStory` boolean (default: true) to config schema.
51
-
52
- ### Fix 7: Add tests for scoped scanning
53
- **File:** `test/context.test.ts` or `test/test-scanner.test.ts`
54
- **Change:** Test that test coverage scan respects scopeFiles filter.
55
-
56
- ## Test Strategy
57
- - Mode: test-after
58
- - Run: `bun test` after each phase
59
-
60
- ## Commits
61
- - Phase 1: `feat: add nax prompts CLI command for prompt inspection`
62
- - Phase 2: `feat: scope test coverage scanner to story-relevant files`
@@ -1,189 +0,0 @@
1
- # nax Roadmap — TDD-Lite, LLM Service Layer, Parallelism
2
-
3
- *Date: 2026-02-24*
4
- *Status: Proposed*
5
-
6
- ---
7
-
8
- ## Context
9
-
10
- nax v0.10.0 has a solid TDD pipeline for TypeScript libraries, but struggles with:
11
- - Non-TS/polyglot projects (UI, shell scripts, integration-heavy)
12
- - No parallelism (sequential story execution)
13
- - Memory-heavy (long-running agent sessions, OOMs on 4GB VPS)
14
- - Single agent backend (claude CLI only, no OpenClaw sub-agents)
15
-
16
- dev-orchestrator (OpenClaw skill) solves execution well — worktrees, parallel coders, phase-by-phase memory — but lacks nax's TDD pipeline, structured logging, PRD workflow, and CLI.
17
-
18
- ## nax vs dev-orchestrator — Honest Comparison
19
-
20
- | Capability | nax v0.10.0 | dev-orchestrator |
21
- |:-----------|:-----------|:-----------------|
22
- | **TDD pipeline** | ✅ Three-session (strict isolation) | ❌ None |
23
- | **Verification** | ✅ Isolated verifier | ❌ Code review only |
24
- | **Test quality gates** | ✅ Coverage, typecheck, lint | ❌ Up to the coder |
25
- | **Planning/PRD** | ✅ `nax plan` → `analyze` → structured stories | ❌ Simple task decomposition |
26
- | **Parallelism** | ❌ Sequential (batch = same session) | ✅ Git worktrees, true parallel |
27
- | **Memory** | ❌ Peaks 3-4GB+, OOMs on VPS | ✅ Phase-by-phase ~1-2GB, exits between phases |
28
- | **Agent backends** | ❌ claude CLI only | ✅ OpenClaw sessions_spawn + claude CLI |
29
- | **Structured logging** | ✅ JSONL, `nax runs list/show` | ❌ None |
30
- | **Hooks/plugins** | ✅ Global hooks, plugin system | ❌ None |
31
- | **Escalation tiers** | ✅ Automatic model escalation | ❌ Manual |
32
- | **Reproducibility** | ✅ Same PRD = same run | ❌ Depends on orchestrator prompt |
33
- | **Polyglot support** | ❌ TDD isolation breaks for UI/bash | ✅ Handles anything |
34
- | **Setup overhead** | ❌ PRD → analyze → config → run | ✅ Zero — just spawn with task |
35
- | **CLI** | ✅ Full CLI (`nax plan/run/accept/stories`) | ❌ OpenClaw skill only |
36
-
37
- ### Key Insight
38
-
39
- nax's TDD pipeline is its strongest differentiator. dev-orchestrator's execution model (worktrees + phase-by-phase agents) is proven and lighter. The gap is **agent spawning** — nax can't spawn parallel managed agents (#3 LLM Service Layer).
40
-
41
- ## Decision
42
-
43
- **Fix nax** in phases. Port dev-orchestrator's execution strengths into nax rather than rebuilding nax's TDD/PRD pipeline elsewhere.
44
-
45
- ---
46
-
47
- ## Phase 1 — TDD-Lite + Fallback (Quick Win)
48
-
49
- **Goal:** Solve GitLab #20, support non-TS projects without abandoning TDD.
50
-
51
- ### Three TDD Tiers
52
-
53
- | Strategy | Test Writer | Implementer | Verifier | Use Case |
54
- |:---------|:-----------|:------------|:---------|:---------|
55
- | `three-session-tdd` (strict) | Isolated — no source access | Isolated — no test access | Isolated ✅ | TS libraries, APIs |
56
- | `three-session-tdd-lite` | Can read source, write tests | Free to modify anything | Isolated ✅ | UI, polyglot, integration |
57
- | `test-after` | N/A | Writes code + tests together | N/A | Simple tasks |
58
-
59
- ### Fallback Logic
60
-
61
- - If test-writer produces **0 test files** in strict mode → auto-downgrade to `tdd-lite` and retry
62
- - No wasted iteration, no story pause
63
-
64
- ### Config
65
-
66
- ```json
67
- {
68
- "tdd": {
69
- "strategy": "auto" | "strict" | "lite" | "off",
70
- "enabled": true
71
- }
72
- }
73
- ```
74
-
75
- - `auto` (default): LLM router classifies testability, picks strict or lite
76
- - `strict`: Always three-session-tdd
77
- - `lite`: Always three-session-tdd-lite
78
- - `off`: test-after for everything
79
-
80
- ### Scope
81
-
82
- - Modify `src/tdd/` prompts for lite mode (relax isolation rules for test-writer)
83
- - Add fallback logic in `src/execution/runner.ts`
84
- - Add `strategy` to routing decision
85
- - Update config schema
86
- - No architecture changes needed
87
-
88
- ---
89
-
90
- ## Phase 2 — LLM Service Layer (GitLab #3)
91
-
92
- **Goal:** Abstract agent spawning so nax can use multiple backends and run agents in parallel.
93
-
94
- ### Agent Interface
95
-
96
- ```typescript
97
- interface Agent {
98
- name: string;
99
- spawn(options: AgentSpawnOptions): Promise<AgentSession>;
100
- isInstalled(): Promise<boolean>;
101
- }
102
-
103
- interface AgentSession {
104
- id: string;
105
- status: 'running' | 'completed' | 'failed';
106
- workdir: string;
107
- wait(): Promise<AgentResult>;
108
- kill(): Promise<void>;
109
- steer?(message: string): Promise<void>; // optional
110
- }
111
-
112
- interface AgentSpawnOptions {
113
- prompt: string;
114
- workdir: string;
115
- model?: string;
116
- timeout?: number;
117
- env?: Record<string, string>;
118
- }
119
- ```
120
-
121
- ### Backends
122
-
123
- | Backend | How | Parallelism | Where |
124
- |:--------|:----|:-----------|:------|
125
- | `ClaudeCliAgent` | `claude -p` (existing) | ❌ Sequential | VPS, Mac01 |
126
- | `OpenClawAgent` | `sessions_spawn` | ✅ Managed sub-agents | OpenClaw environments |
127
- | `ApiAgent` | Direct Anthropic/Google API | ✅ Concurrent requests | Anywhere |
128
-
129
- ### Key Design Decisions
130
-
131
- - Agent selection via config: `autoMode.defaultAgent: "claude-cli" | "openclaw" | "api"`
132
- - Each backend implements the same interface — runner doesn't care
133
- - `ApiAgent` is the lightest (no CLI overhead) but needs prompt engineering for tool use
134
-
135
- ---
136
-
137
- ## Phase 3 — Worktree Parallelism
138
-
139
- **Goal:** Run N stories concurrently using git worktrees + LLM Service Layer agents.
140
-
141
- ### Flow
142
-
143
- ```
144
- nax run -f feature --parallel 3
145
-
146
- ├── Worktree: .nax-wt/story-001/ → Agent 1 (tdd pipeline)
147
- ├── Worktree: .nax-wt/story-002/ → Agent 2 (tdd pipeline)
148
- └── Worktree: .nax-wt/story-003/ → Agent 3 (tdd pipeline)
149
-
150
- ├── Each agent exits after its story (phase-by-phase memory)
151
- ├── Verifier runs per-worktree (isolated)
152
- └── Merge back to main branch on pass
153
- ```
154
-
155
- ### Benefits
156
-
157
- - True parallelism (stolen from dev-orchestrator's proven model)
158
- - Phase-by-phase execution = low memory (solves VPS OOM)
159
- - Each worktree is isolated — no git conflicts during execution
160
- - Merge conflicts detected at merge time, not runtime
161
-
162
- ### Dependencies
163
-
164
- - Phase 2 (LLM Service Layer) — need agent spawning abstraction
165
- - Worktree management utilities (create, merge, cleanup)
166
- - Dependency-aware scheduling (respect story dependencies in PRD)
167
-
168
- ---
169
-
170
- ## Dependency Chain
171
-
172
- ```
173
- Phase 1: tdd-lite + fallback ← standalone, no blockers
174
-
175
- Phase 2: LLM Service Layer (#3) ← abstracts agent spawning
176
-
177
- Phase 3: Worktree parallelism ← needs Phase 2
178
-
179
- Memory optimization ← comes free with Phase 3
180
- ```
181
-
182
- ---
183
-
184
- ## Open Questions
185
-
186
- 1. Should `ApiAgent` support tool use (file read/write/exec) or is it prompt-only?
187
- 2. For OpenClaw backend — do we use `sessions_spawn` (managed) or `exec` with claude CLI?
188
- 3. Worktree merge strategy — rebase or merge commit?
189
- 4. Should nax accept a `--backend` flag or always use config?