@nathapp/nax 0.28.0 → 0.30.0

Files changed (385)
  1. package/CHANGELOG.md +23 -2
  2. package/bin/nax.ts +2 -3
  3. package/dist/nax.js +72753 -0
  4. package/package.json +11 -3
  5. package/src/cli/analyze.ts +2 -7
  6. package/src/cli/config.ts +3 -1
  7. package/src/config/defaults.ts +1 -0
  8. package/src/config/schemas.ts +1 -0
  9. package/src/config/types.ts +1 -0
  10. package/src/context/builder.ts +10 -1
  11. package/src/execution/lifecycle/headless-formatter.ts +2 -4
  12. package/src/prompts/builder.ts +12 -69
  13. package/src/prompts/sections/isolation.ts +38 -8
  14. package/src/prompts/sections/role-task.ts +79 -17
  15. package/src/review/runner.ts +6 -1
  16. package/src/version.ts +2 -1
  17. package/.claude/rules/01-project-conventions.md +0 -34
  18. package/.claude/rules/02-test-architecture.md +0 -39
  19. package/.claude/rules/03-test-writing.md +0 -58
  20. package/.claude/rules/04-forbidden-patterns.md +0 -29
  21. package/.claude/settings.json +0 -15
  22. package/.githooks/pre-commit +0 -16
  23. package/.gitlab-ci.yml +0 -103
  24. package/.mcp.json +0 -8
  25. package/BRIEF.md +0 -140
  26. package/CLAUDE.md +0 -143
  27. package/US-007-IMPLEMENTATION.md +0 -139
  28. package/biome.json +0 -14
  29. package/bun.lock +0 -163
  30. package/bunfig.toml +0 -12
  31. package/docker-compose.test.yml +0 -15
  32. package/docs/20260216-fix-plan-context-review.md +0 -56
  33. package/docs/20260216-relentless-vs-ngent-comparison.md +0 -208
  34. package/docs/20260216-v02-plan.md +0 -136
  35. package/docs/20260216-v02-review.md +0 -685
  36. package/docs/20260217-dogfood-findings.md +0 -56
  37. package/docs/20260217-p2-plus-plan.md +0 -117
  38. package/docs/20260217-partial-fixes-plan.md +0 -62
  39. package/docs/20260217-plan-analyze-spec.md +0 -117
  40. package/docs/20260217-post-impl-review.md +0 -1137
  41. package/docs/20260217-quick-wins-plan.md +0 -66
  42. package/docs/20260217-split-runner-plan.md +0 -75
  43. package/docs/20260217-v03-impl-plan.md +0 -80
  44. package/docs/20260217-v03-post-impl-review.md +0 -589
  45. package/docs/20260217-v04-impl-plan.md +0 -86
  46. package/docs/20260217-v05-post-impl-review.md +0 -850
  47. package/docs/20260217-v06-post-impl-review.md +0 -817
  48. package/docs/20260218-adr003-port-plan.md +0 -151
  49. package/docs/20260218-review-adr003-verification.md +0 -175
  50. package/docs/20260219-fix-plan-bug16-19.md +0 -79
  51. package/docs/20260219-fix-plan-bug20-22.md +0 -114
  52. package/docs/20260219-plan-llm-routing.md +0 -116
  53. package/docs/20260219-review-bug20-22-fixes.md +0 -135
  54. package/docs/20260219-routing-baseline-keyword.md +0 -63
  55. package/docs/20260220-plan-structured-logging-p1.md +0 -80
  56. package/docs/20260220-plan-structured-logging-p2.md +0 -37
  57. package/docs/20260220-review-llm-routing.md +0 -180
  58. package/docs/20260220-review-post-fix-llm-routing.md +0 -70
  59. package/docs/20260221-fix-plan-relevantfiles-split.md +0 -101
  60. package/docs/20260221-fix-plan-routing-mode.md +0 -125
  61. package/docs/20260221-review-v0.9-implementation.md +0 -379
  62. package/docs/20260222-fix-plan-v091-routing-isolation.md +0 -197
  63. package/docs/20260223-fix-plan-prompt-audit.md +0 -62
  64. package/docs/20260224-nax-roadmap-phases.md +0 -189
  65. package/docs/20260225-phase2-llm-service-layer.md +0 -401
  66. package/docs/20260225-review-v0.10.1.md +0 -187
  67. package/docs/20260303-v010-implementation-plan.md +0 -165
  68. package/docs/20260304-review-nax.md +0 -492
  69. package/docs/CLAUDE.md.bak +0 -191
  70. package/docs/ROADMAP.md +0 -390
  71. package/docs/SPEC-rectification.md +0 -0
  72. package/docs/SPEC.md +0 -324
  73. package/docs/US-001-plugin-loading-verification.md +0 -152
  74. package/docs/adr/ADR-005-implementation-plan.md +0 -655
  75. package/docs/adr/ADR-005-pipeline-re-architecture.md +0 -464
  76. package/docs/architecture-analysis.md +0 -1076
  77. package/docs/bugs/BUG-21-escalation-null-attempts.md +0 -48
  78. package/docs/bugs-from-dogfood-run-c.md +0 -243
  79. package/docs/code-review-20260228.md +0 -612
  80. package/docs/code-review-v0.15.0.md +0 -629
  81. package/docs/hook-lifecycle-test-plan.md +0 -149
  82. package/docs/releases/v0.11.0-and-earlier.md +0 -20
  83. package/docs/releases/v0.12.0.md +0 -15
  84. package/docs/releases/v0.13.0.md +0 -14
  85. package/docs/releases/v0.14.0.md +0 -20
  86. package/docs/releases/v0.14.1.md +0 -36
  87. package/docs/releases/v0.14.2.md +0 -51
  88. package/docs/releases/v0.14.3.md +0 -174
  89. package/docs/releases/v0.14.4.md +0 -94
  90. package/docs/releases/v0.15.0.md +0 -502
  91. package/docs/releases/v0.15.1.md +0 -170
  92. package/docs/releases/v0.15.3.md +0 -193
  93. package/docs/specs/bug-039-orphan-processes.md +0 -131
  94. package/docs/specs/bug-040-review-rectification.md +0 -82
  95. package/docs/specs/bug-041-cross-story-test-isolation.md +0 -88
  96. package/docs/specs/bug-042-verifier-failure-capture.md +0 -117
  97. package/docs/specs/bun-pty-migration.md +0 -171
  98. package/docs/specs/central-run-registry.md +0 -116
  99. package/docs/specs/feat-010-smart-runner-git-history.md +0 -96
  100. package/docs/specs/feat-011-file-context-strategy.md +0 -73
  101. package/docs/specs/feat-012-tdd-writer-tier.md +0 -79
  102. package/docs/specs/feat-013-test-after-review.md +0 -89
  103. package/docs/specs/feat-014-heartbeat-observability.md +0 -127
  104. package/docs/specs/status-file-consolidation.md +0 -93
  105. package/docs/specs/status-file-v0.10.1.md +0 -812
  106. package/docs/specs/trigger-completion.md +0 -145
  107. package/docs/specs/verification-architecture-v2.md +0 -343
  108. package/docs/tdd/strategies.md +0 -97
  109. package/docs/v0.10-global-config.md +0 -206
  110. package/docs/v0.10-plugin-system.md +0 -415
  111. package/docs/v0.10-prompt-optimizer.md +0 -234
  112. package/docs/v0.3-spec.md +0 -244
  113. package/docs/v0.4-spec.md +0 -140
  114. package/docs/v0.5-spec.md +0 -237
  115. package/docs/v0.6-spec.md +0 -371
  116. package/docs/v0.7-spec.md +0 -177
  117. package/docs/v0.8-llm-routing.md +0 -206
  118. package/docs/v0.8-structured-logging.md +0 -132
  119. package/docs/v0.9.3-prompt-audit.md +0 -112
  120. package/examples/plugins/console-reporter/index.test.ts +0 -207
  121. package/examples/plugins/console-reporter/index.ts +0 -110
  122. package/memory/topic/feat-010-baseref.md +0 -28
  123. package/memory/topic/feat-013-test-after-deprecation.md +0 -22
  124. package/nax/config.json +0 -154
  125. package/nax/features/bug-039-medium/prd.json +0 -45
  126. package/nax/features/bugfix-v0171/prd.json +0 -52
  127. package/nax/features/central-run-registry/prd.json +0 -105
  128. package/nax/features/config-management/prd.json +0 -108
  129. package/nax/features/config-management/progress.txt +0 -5
  130. package/nax/features/diagnose/acceptance.test.ts +0 -414
  131. package/nax/features/diagnose/prd.json +0 -41
  132. package/nax/features/nax-compliance/prd.json +0 -52
  133. package/nax/features/nax-compliance/progress.txt +0 -1
  134. package/nax/features/orchestration-fixes/prd.json +0 -89
  135. package/nax/features/orchestration-fixes/progress.txt +0 -1
  136. package/nax/features/plugin-integration/US-007-VERIFICATION.md +0 -259
  137. package/nax/features/plugin-integration/prd.json +0 -208
  138. package/nax/features/plugin-integration/progress.txt +0 -5
  139. package/nax/features/post-rearch-bugfix/prd.json +0 -137
  140. package/nax/features/precheck/prd.json +0 -205
  141. package/nax/features/precheck/progress.txt +0 -15
  142. package/nax/features/prompt-builder/prd.json +0 -152
  143. package/nax/features/prompt-builder/progress.txt +0 -3
  144. package/nax/features/review-quality/prd.json +0 -55
  145. package/nax/features/routing-persistence/prd.json +0 -104
  146. package/nax/features/routing-persistence/progress.txt +0 -1
  147. package/nax/features/smart-test-runner/plan.md +0 -7
  148. package/nax/features/smart-test-runner/prd.json +0 -203
  149. package/nax/features/smart-test-runner/progress.txt +0 -13
  150. package/nax/features/smart-test-runner/spec.md +0 -7
  151. package/nax/features/smart-test-runner/tasks.md +0 -8
  152. package/nax/features/status-file-consolidation/prd.json +0 -106
  153. package/nax/features/structured-logging/prd.json +0 -199
  154. package/nax/features/trigger-completion/prd.json +0 -150
  155. package/nax/features/trigger-completion/progress.txt +0 -7
  156. package/nax/features/unlock/prd.json +0 -36
  157. package/nax/features/v0.18.3-execution-reliability/prd.json +0 -80
  158. package/nax/features/v0.18.3-execution-reliability/progress.txt +0 -3
  159. package/nax/features/v0.19.0-hardening/plan.md +0 -7
  160. package/nax/features/v0.19.0-hardening/prd.json +0 -84
  161. package/nax/features/v0.19.0-hardening/progress.txt +0 -7
  162. package/nax/features/v0.19.0-hardening/spec.md +0 -18
  163. package/nax/features/v0.19.0-hardening/tasks.md +0 -8
  164. package/nax/features/verify-v2/prd.json +0 -79
  165. package/nax/features/verify-v2/progress.txt +0 -3
  166. package/nax/status.json +0 -36
  167. package/src/prompts/templates/implementer.ts +0 -6
  168. package/src/prompts/templates/single-session.ts +0 -6
  169. package/src/prompts/templates/test-writer.ts +0 -6
  170. package/src/prompts/templates/verifier.ts +0 -6
  171. package/test/COVERAGE-GAPS.md +0 -333
  172. package/test/e2e/cm-003-default-view.test.ts +0 -195
  173. package/test/e2e/plan-analyze-run.test.ts +0 -902
  174. package/test/helpers/helpers.test.ts +0 -295
  175. package/test/helpers/timeout.ts +0 -42
  176. package/test/integration/US-002-TEST-SUMMARY.md +0 -107
  177. package/test/integration/US-003-TEST-SUMMARY.md +0 -149
  178. package/test/integration/US-004-TEST-SUMMARY.md +0 -106
  179. package/test/integration/US-005-TEST-SUMMARY.md +0 -138
  180. package/test/integration/US-007-TEST-SUMMARY.md +0 -100
  181. package/test/integration/cli/agent-validation.test.ts +0 -439
  182. package/test/integration/cli/cli-config-default-edge-cases.test.ts +0 -223
  183. package/test/integration/cli/cli-config-default-view.test.ts +0 -230
  184. package/test/integration/cli/cli-config-diff.test.ts +0 -461
  185. package/test/integration/cli/cli-config-prompts-explain.test.ts +0 -74
  186. package/test/integration/cli/cli-config.test.ts +0 -737
  187. package/test/integration/cli/cli-diagnose.test.ts +0 -595
  188. package/test/integration/cli/cli-logs.test.ts +0 -346
  189. package/test/integration/cli/cli-plugins.test.ts +0 -679
  190. package/test/integration/cli/cli-precheck.test.ts +0 -372
  191. package/test/integration/cli/cli-run-headless.test.ts +0 -174
  192. package/test/integration/cli/cli.test.ts +0 -76
  193. package/test/integration/cli/precheck-integration.test.ts +0 -476
  194. package/test/integration/cli/precheck-orchestrator.test.ts +0 -247
  195. package/test/integration/cli/precheck.test.ts +0 -806
  196. package/test/integration/config/config-loader.test.ts +0 -266
  197. package/test/integration/config/config.test.ts +0 -444
  198. package/test/integration/config/merger.test.ts +0 -466
  199. package/test/integration/config/paths.test.ts +0 -52
  200. package/test/integration/config/security-loader.test.ts +0 -83
  201. package/test/integration/context/context-integration.test.ts +0 -703
  202. package/test/integration/context/context-path-security.test.ts +0 -173
  203. package/test/integration/context/context-provider-injection.test.ts +0 -507
  204. package/test/integration/context/context-verification-integration.test.ts +0 -296
  205. package/test/integration/context/s5-greenfield-fallback.test.ts +0 -298
  206. package/test/integration/execution/execution-isolation.test.ts +0 -143
  207. package/test/integration/execution/execution.test.ts +0 -634
  208. package/test/integration/execution/feature-status-write.test.ts +0 -302
  209. package/test/integration/execution/parallel.test.ts +0 -251
  210. package/test/integration/execution/prd-pause.test.ts +0 -205
  211. package/test/integration/execution/prd-resolvers.test.ts +0 -186
  212. package/test/integration/execution/progress.test.ts +0 -34
  213. package/test/integration/execution/runner-batching.test.ts +0 -682
  214. package/test/integration/execution/runner-config-plugins.test.ts +0 -462
  215. package/test/integration/execution/runner-escalation.test.ts +0 -561
  216. package/test/integration/execution/runner-fixes.test.ts +0 -400
  217. package/test/integration/execution/runner-plugin-integration.test.ts +0 -544
  218. package/test/integration/execution/runner-queue-and-attempts.test.ts +0 -476
  219. package/test/integration/execution/status-file-integration.test.ts +0 -289
  220. package/test/integration/execution/status-file.test.ts +0 -380
  221. package/test/integration/execution/status-writer.test.ts +0 -447
  222. package/test/integration/execution/story-id-in-events.test.ts +0 -274
  223. package/test/integration/interaction/interaction-chain-pipeline.test.ts +0 -476
  224. package/test/integration/pipeline/hooks.test.ts +0 -363
  225. package/test/integration/pipeline/pipeline-acceptance.test.ts +0 -303
  226. package/test/integration/pipeline/pipeline-events.test.ts +0 -476
  227. package/test/integration/pipeline/pipeline.test.ts +0 -660
  228. package/test/integration/pipeline/reporter-lifecycle.test.ts +0 -862
  229. package/test/integration/pipeline/verify-stage.test.ts +0 -286
  230. package/test/integration/plan/analyze-integration.test.ts +0 -262
  231. package/test/integration/plan/analyze-scanner.test.ts +0 -132
  232. package/test/integration/plan/logger.test.ts +0 -461
  233. package/test/integration/plan/plan.test.ts +0 -157
  234. package/test/integration/plugins/config-integration.test.ts +0 -173
  235. package/test/integration/plugins/config-resolution.test.ts +0 -523
  236. package/test/integration/plugins/loader.test.ts +0 -644
  237. package/test/integration/plugins/plugins-registry.test.ts +0 -747
  238. package/test/integration/plugins/validator.test.ts +0 -564
  239. package/test/integration/prompts/pb-004-migration.test.ts +0 -523
  240. package/test/integration/review/review-config-commands.test.ts +0 -320
  241. package/test/integration/review/review-config-schema.test.ts +0 -117
  242. package/test/integration/review/review-plugin-integration.test.ts +0 -729
  243. package/test/integration/review/review.test.ts +0 -150
  244. package/test/integration/routing/plugin-routing-advanced.test.ts +0 -461
  245. package/test/integration/routing/plugin-routing-core.test.ts +0 -527
  246. package/test/integration/routing/routing-stage-bug-021.test.ts +0 -275
  247. package/test/integration/routing/routing-stage-greenfield.test.ts +0 -287
  248. package/test/integration/tdd/tdd-cleanup.test.ts +0 -246
  249. package/test/integration/tdd/tdd-orchestrator-core.test.ts +0 -565
  250. package/test/integration/tdd/tdd-orchestrator-failureCategory.test.ts +0 -355
  251. package/test/integration/tdd/tdd-orchestrator-fallback.test.ts +0 -311
  252. package/test/integration/tdd/tdd-orchestrator-lite.test.ts +0 -289
  253. package/test/integration/tdd/tdd-orchestrator-prompts.test.ts +0 -260
  254. package/test/integration/tdd/tdd-orchestrator-verdict.test.ts +0 -536
  255. package/test/integration/tmp/headless-test/test.jsonl +0 -30
  256. package/test/integration/verification/test-scanner.test.ts +0 -403
  257. package/test/integration/verification/verification-asset-check.test.ts +0 -143
  258. package/test/integration/worktree/manager.test.ts +0 -218
  259. package/test/integration/worktree/worktree-merge.test.ts +0 -341
  260. package/test/manual/logging-formatter-demo.ts +0 -158
  261. package/test/ui/tui-agent-panel.test.tsx +0 -99
  262. package/test/ui/tui-pty-integration.test.tsx +0 -146
  263. package/test/unit/acceptance.test.ts +0 -187
  264. package/test/unit/agent-stderr-capture.test.ts +0 -147
  265. package/test/unit/agents/claude.test.ts +0 -107
  266. package/test/unit/analyze-classifier.test.ts +0 -216
  267. package/test/unit/analyze.test.ts +0 -224
  268. package/test/unit/auto-detect.test.ts +0 -250
  269. package/test/unit/cli-status-project-level.test.ts +0 -283
  270. package/test/unit/cli-status.test.ts +0 -418
  271. package/test/unit/commands/common.test.ts +0 -321
  272. package/test/unit/commands/logs.test.ts +0 -458
  273. package/test/unit/commands/runs.test.ts +0 -303
  274. package/test/unit/commands/unlock.test.ts +0 -320
  275. package/test/unit/config/defaults.test.ts +0 -70
  276. package/test/unit/config/quality-commands-schema.test.ts +0 -72
  277. package/test/unit/config/regression-gate-schema.test.ts +0 -160
  278. package/test/unit/config/smart-runner-flag.test.ts +0 -250
  279. package/test/unit/constitution-generators.test.ts +0 -161
  280. package/test/unit/constitution.test.ts +0 -210
  281. package/test/unit/context/context-autodetect.test.ts +0 -297
  282. package/test/unit/context/context-build.test.ts +0 -575
  283. package/test/unit/context/context-coverage.test.ts +0 -236
  284. package/test/unit/context/context-error.test.ts +0 -93
  285. package/test/unit/context/context-estimate-tokens.test.ts +0 -201
  286. package/test/unit/context/context-format.test.ts +0 -302
  287. package/test/unit/context/context-isolation.test.ts +0 -267
  288. package/test/unit/context/context-sort.test.ts +0 -93
  289. package/test/unit/context/context-story.test.ts +0 -108
  290. package/test/unit/context/prior-failures.test.ts +0 -463
  291. package/test/unit/context.test.ts +0 -1726
  292. package/test/unit/cost.test.ts +0 -231
  293. package/test/unit/crash-recovery.test.ts +0 -309
  294. package/test/unit/escalation.test.ts +0 -127
  295. package/test/unit/execution/lifecycle/run-completion.test.ts +0 -240
  296. package/test/unit/execution/lifecycle/run-regression.test.ts +0 -420
  297. package/test/unit/execution/pid-registry.test.ts +0 -241
  298. package/test/unit/execution/sequential-executor.test.ts +0 -235
  299. package/test/unit/execution/sfc-004-dead-code-cleanup.test.ts +0 -89
  300. package/test/unit/execution/structured-failure.test.ts +0 -415
  301. package/test/unit/execution-logging-stderr.test.ts +0 -157
  302. package/test/unit/execution-stage.test.ts +0 -123
  303. package/test/unit/fix-generator.test.ts +0 -276
  304. package/test/unit/formatters.test.ts +0 -468
  305. package/test/unit/greenfield.test.ts +0 -180
  306. package/test/unit/hooks/shell-security.test.ts +0 -40
  307. package/test/unit/interaction/auto-plugin.test.ts +0 -162
  308. package/test/unit/interaction/human-review-trigger.test.ts +0 -165
  309. package/test/unit/interaction-network-failures.test.ts +0 -390
  310. package/test/unit/interaction-plugins.test.ts +0 -472
  311. package/test/unit/logging/formatter.test.ts +0 -456
  312. package/test/unit/merge.test.ts +0 -269
  313. package/test/unit/metrics/aggregator.test.ts +0 -164
  314. package/test/unit/metrics/tracker.test.ts +0 -186
  315. package/test/unit/metrics.test.ts +0 -276
  316. package/test/unit/optimizer/noop.optimizer.test.ts +0 -125
  317. package/test/unit/optimizer/rule-based.optimizer.test.ts +0 -358
  318. package/test/unit/pipeline/event-bus.test.ts +0 -105
  319. package/test/unit/pipeline/routing-partial-override.test.ts +0 -121
  320. package/test/unit/pipeline/runner-retry.test.ts +0 -89
  321. package/test/unit/pipeline/stages/autofix.test.ts +0 -97
  322. package/test/unit/pipeline/stages/completion-review-gate.test.ts +0 -218
  323. package/test/unit/pipeline/stages/execution-ambiguity.test.ts +0 -311
  324. package/test/unit/pipeline/stages/execution-merge-conflict.test.ts +0 -218
  325. package/test/unit/pipeline/stages/rectify.test.ts +0 -101
  326. package/test/unit/pipeline/stages/regression-stage.test.ts +0 -69
  327. package/test/unit/pipeline/stages/review.test.ts +0 -201
  328. package/test/unit/pipeline/stages/routing-idempotence.test.ts +0 -139
  329. package/test/unit/pipeline/stages/routing-initial-complexity.test.ts +0 -321
  330. package/test/unit/pipeline/stages/routing-persistence.test.ts +0 -380
  331. package/test/unit/pipeline/stages/verify.test.ts +0 -267
  332. package/test/unit/pipeline/subscribers/events-writer.test.ts +0 -227
  333. package/test/unit/pipeline/subscribers/hooks.test.ts +0 -84
  334. package/test/unit/pipeline/subscribers/interaction.test.ts +0 -313
  335. package/test/unit/pipeline/subscribers/registry.test.ts +0 -149
  336. package/test/unit/pipeline/subscribers/reporters.test.ts +0 -90
  337. package/test/unit/pipeline/verify-smart-runner.test.ts +0 -345
  338. package/test/unit/prd-auto-default.test.ts +0 -291
  339. package/test/unit/prd-failure-category.test.ts +0 -177
  340. package/test/unit/prd-get-next-story.test.ts +0 -215
  341. package/test/unit/precheck/checks-warnings.test.ts +0 -114
  342. package/test/unit/precheck-checks.test.ts +0 -841
  343. package/test/unit/precheck-story-size-gate.test.ts +0 -288
  344. package/test/unit/precheck-types.test.ts +0 -143
  345. package/test/unit/prompts/builder.test.ts +0 -258
  346. package/test/unit/prompts/loader.test.ts +0 -355
  347. package/test/unit/prompts/sections/conventions.test.ts +0 -30
  348. package/test/unit/prompts/sections/isolation.test.ts +0 -35
  349. package/test/unit/prompts/sections/role-task.test.ts +0 -40
  350. package/test/unit/prompts/sections/sections.test.ts +0 -238
  351. package/test/unit/prompts/sections/story.test.ts +0 -45
  352. package/test/unit/prompts/sections/verdict.test.ts +0 -58
  353. package/test/unit/prompts.test.ts +0 -476
  354. package/test/unit/queue.test.ts +0 -237
  355. package/test/unit/rectification.test.ts +0 -285
  356. package/test/unit/registry.test.ts +0 -288
  357. package/test/unit/review/runner.test.ts +0 -117
  358. package/test/unit/routing/content-hash.test.ts +0 -99
  359. package/test/unit/routing/routing-stability.test.ts +0 -208
  360. package/test/unit/routing/strategies/llm.test.ts +0 -306
  361. package/test/unit/routing-advanced.test.ts +0 -313
  362. package/test/unit/routing-core.test.ts +0 -341
  363. package/test/unit/routing-strategies.test.ts +0 -440
  364. package/test/unit/storyid-events.test.ts +0 -213
  365. package/test/unit/tdd-verdict.test.ts +0 -492
  366. package/test/unit/test-output-parser.test.ts +0 -377
  367. package/test/unit/ui/tui-controls.test.ts +0 -335
  368. package/test/unit/ui/tui-cost-and-pty.test.ts +0 -190
  369. package/test/unit/ui/tui-layout.test.ts +0 -379
  370. package/test/unit/ui/tui-stories.test.ts +0 -333
  371. package/test/unit/unit-isolation.test.ts +0 -135
  372. package/test/unit/utils/git.test.ts +0 -50
  373. package/test/unit/utils/path-security.test.ts +0 -47
  374. package/test/unit/utils-helpers.test.ts +0 -318
  375. package/test/unit/verdict.test.ts +0 -325
  376. package/test/unit/verification/orchestrator-types.test.ts +0 -54
  377. package/test/unit/verification/orchestrator.test.ts +0 -66
  378. package/test/unit/verification/smart-runner-config.test.ts +0 -163
  379. package/test/unit/verification/smart-runner-discovery.test.ts +0 -354
  380. package/test/unit/verification/smart-runner.test.ts +0 -262
  381. package/test/unit/verification/strategies/acceptance.test.ts +0 -33
  382. package/test/unit/verification/strategies/regression.test.ts +0 -87
  383. package/test/unit/verification/strategies/scoped.test.ts +0 -100
  384. package/test/unit/worktree-manager.test.ts +0 -159
  385. package/tsconfig.json +0 -27
@@ -1,197 +0,0 @@
- # Fix Plan: v0.9.1 — Routing Respect + TDD Isolation Rework
-
- **Date:** 2026-02-22
- **Branch:** fix/v0.9.1-routing-isolation
- **Base:** Revert commits 211a884 and 4fa39a4, then apply clean fixes
-
- ## Context
-
- Two commits (211a884, 4fa39a4) attempted to fix 4 issues but introduced problems:
- 1. `determineTestStrategy()` still overrides LLM complexity via keyword scan
- 2. Story count prompt hint isn't enforced
- 3. `analyzeConfig` metadata is fine but incomplete (missing naxVersion)
- 4. Isolation check now always passes (toothless)
-
- This plan implements clean fixes for all 4 issues.
-
- ---
-
- ## Phase 1: Revert and Create Branch
-
- 1. `git revert --no-commit 4fa39a4 211a884` (revert both commits)
- 2. `git checkout -b fix/v0.9.1-routing-isolation`
- 3. Commit: `revert: undo 211a884 and 4fa39a4 for clean reimplementation`
-
- ---
-
- ## Phase 2: Fix Routing — LLM testStrategy in Decomposition
-
- **Problem:** `determineTestStrategy()` re-scans keywords after LLM already classified complexity, overriding LLM decisions for simple tasks.
-
- **Fix:** When `strategy=llm`, have the LLM output `testStrategy` directly in its decomposition response. `determineTestStrategy()` is only used for keyword-mode fallback.
-
- ### Changes:
-
- **File: `src/agents/claude.ts`** (decomposition prompt)
- - Add to the decomposition prompt schema: each story must include `testStrategy: "three-session-tdd" | "test-after"`
- - Add decision rules to prompt:
- ```
- testStrategy rules:
- - "three-session-tdd": ONLY for complex/expert tasks that are security-critical (auth, encryption, tokens) or define public API contracts
- - "test-after": for all other tasks including simple/medium complexity
- - A task being "simple" complexity should almost never be three-session-tdd
- ```
- - Add `testStrategy` to the expected JSON response schema alongside existing `complexity` field
-
- **File: `src/cli/analyze.ts`**
- - When building UserStory from LLM decomposition result:
- - Use `ds.testStrategy` directly (from LLM response) instead of calling `determineTestStrategy()`
- - Fallback to `determineTestStrategy()` only if LLM didn't return a testStrategy
- - When using keyword classification (non-LLM path): keep calling `determineTestStrategy()` as-is
- - Add `routing.strategy: "llm" | "keyword"` and `routing.llmModel` to the story routing object
-
- **File: `src/prd/types.ts`**
- - Add to `StoryRouting`:
- ```ts
- strategy?: "keyword" | "llm";
- llmModel?: string;
- ```
-
- ### Tests:
- - Update existing analyze tests to verify LLM-classified stories use LLM's testStrategy
- - Add test: simple story with "auth" in tags gets `test-after` from LLM (not overridden to three-session-tdd)
-
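The Phase 2 rule in the removed plan (trust the LLM's `testStrategy`, and fall back to the keyword scan only when the LLM omitted it or when running in keyword mode) can be sketched roughly as follows. This is a minimal illustration, not the package's code: `DecomposedStory`, `resolveRouting`, and the body of `determineTestStrategy` are hypothetical stand-ins, since the real implementations in `src/cli/analyze.ts` are not part of this diff.

```typescript
// Hypothetical types; the real ones live in src/prd/types.ts and are not shown in this diff.
type TestStrategy = "three-session-tdd" | "test-after";

interface DecomposedStory {
  title: string;
  complexity: "simple" | "medium" | "complex" | "expert";
  tags: string[];
  testStrategy?: TestStrategy; // present only when the LLM returned one
}

interface StoryRouting {
  testStrategy: TestStrategy;
  strategy: "keyword" | "llm";
  llmModel?: string;
}

// Stand-in for the keyword scan (assumption: the real function is more elaborate).
function determineTestStrategy(story: DecomposedStory): TestStrategy {
  const securityTags = ["auth", "encryption", "tokens"];
  return story.tags.some((tag) => securityTags.includes(tag))
    ? "three-session-tdd"
    : "test-after";
}

function resolveRouting(
  story: DecomposedStory,
  llmEnhanced: boolean,
  llmModel?: string,
): StoryRouting {
  if (llmEnhanced) {
    // Use the LLM's answer; the keyword scan runs only if the field is missing.
    const testStrategy = story.testStrategy ?? determineTestStrategy(story);
    return { testStrategy, strategy: "llm", llmModel };
  }
  // Keyword path: behavior unchanged.
  return { testStrategy: determineTestStrategy(story), strategy: "keyword" };
}
```

With this shape, a simple story tagged `auth` keeps the LLM's `test-after` choice instead of being promoted to `three-session-tdd` by the keyword scan, which is the case the plan's new test is meant to cover.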
- ---
-
- ## Phase 3: Enforce Max Stories + Add analyzeConfig
-
- **Problem:** Prompt hint for max stories isn't enforced. analyzeConfig missing naxVersion.
-
- ### Changes:
-
- **File: `src/agents/claude.ts`** (decomposition prompt)
- - Add grouping guidelines (from 211a884 — this part was good):
- ```
- Grouping Guidelines:
- - Combine small related tasks into single stories
- - Maximum stories: {maxStories} (from config). If you generate more, merge related ones.
- - Aim for coherent units of value
- ```
- - Pass `maxStories` from config into the prompt template
-
- **File: `src/cli/analyze.ts`**
- - After LLM returns stories, if count > `config.execution.maxStoriesPerFeature`:
- - Log a warning: `"LLM returned {n} stories, exceeding max {max}. Consider re-running with stricter grouping."`
- - Do NOT auto-truncate (could lose important work). Just warn.
- - Add `analyzeConfig` to PRD output:
- ```ts
- analyzeConfig: {
- naxVersion: pkg.version, // read from package.json
- model: config.analyze.model,
- llmEnhanced: config.analyze.llmEnhanced,
- maxStoriesPerFeature: config.execution.maxStoriesPerFeature,
- routingStrategy: config.analyze.llmEnhanced ? "llm" : "keyword",
- }
- ```
-
- **File: `src/prd/types.ts`**
- - Add to PRD interface:
- ```ts
- analyzeConfig?: {
- naxVersion: string;
- model: string;
- llmEnhanced: boolean;
- maxStoriesPerFeature: number;
- routingStrategy: "keyword" | "llm";
- };
- ```
-
- ### Tests:
- - Test that analyzeConfig is populated with correct values
- - Test warning logged when stories exceed max
-
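The two Phase 3 behaviors described in the removed plan (warn, but never truncate, when the story count exceeds the configured maximum, and record an `analyzeConfig` block in the PRD) might look something like the sketch below. `NaxConfigSlice`, `buildAnalyzeConfig`, and `checkStoryCount` are hypothetical names chosen for illustration; only the field names and the warning text come from the plan.

```typescript
interface AnalyzeConfigMeta {
  naxVersion: string;
  model: string;
  llmEnhanced: boolean;
  maxStoriesPerFeature: number;
  routingStrategy: "keyword" | "llm";
}

// Hypothetical slice of the config object; only the fields the plan mentions.
interface NaxConfigSlice {
  analyze: { model: string; llmEnhanced: boolean };
  execution: { maxStoriesPerFeature: number };
}

function buildAnalyzeConfig(config: NaxConfigSlice, naxVersion: string): AnalyzeConfigMeta {
  return {
    naxVersion, // the plan reads this from package.json
    model: config.analyze.model,
    llmEnhanced: config.analyze.llmEnhanced,
    maxStoriesPerFeature: config.execution.maxStoriesPerFeature,
    routingStrategy: config.analyze.llmEnhanced ? "llm" : "keyword",
  };
}

// Returns true when the count is within the limit. Stories are never dropped here:
// the plan explicitly says to warn only, because truncation could lose work.
function checkStoryCount(
  count: number,
  max: number,
  warn: (message: string) => void = console.warn,
): boolean {
  if (count <= max) return true;
  warn(
    `LLM returned ${count} stories, exceeding max ${max}. Consider re-running with stricter grouping.`,
  );
  return false;
}
```

Passing the `warn` callback in as a parameter is a design choice made for this sketch so that the "warning logged" test from the plan can capture the message without touching a global logger.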
- ---
-
- ## Phase 4: TDD Isolation — Detector + Verifier Judgment
-
- **Problem:** Isolation check always passes after 211a884. Should detect and report, let verifier judge.
-
- ### Changes:
-
- **File: `src/tdd/types.ts`**
- - Update `IsolationCheck`:
- ```ts
- interface IsolationCheck {
- /** Whether strict isolation was maintained (no test files touched) */
- strictPass: boolean;
- /** Test files modified by implementer */
- modifiedTestFiles: string[];
- /** Verdict: clean (no changes), needs-review (verifier must judge) */
- verdict: "clean" | "needs-review";
- description: string;
- }
- ```
-
- **File: `src/tdd/isolation.ts`**
- - `verifyImplementerIsolation()` returns honest results:
- - If no test files modified: `{ strictPass: true, modifiedTestFiles: [], verdict: "clean" }`
- - If test files modified: `{ strictPass: false, modifiedTestFiles: [...], verdict: "needs-review" }`
- - Do NOT return `passed: true` when files were modified
-
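The "honest detector" described above can be illustrated with a small function that takes the list of files changed during the implementer session and reports which of them are test files. This is a sketch under stated assumptions: `checkImplementerIsolation` is a hypothetical stand-in for `verifyImplementerIsolation()`, it receives the changed-file list directly rather than computing it from git, and `TEST_FILE_PATTERN` is an assumed naming convention rather than the package's actual rule.

```typescript
interface IsolationCheck {
  /** Whether strict isolation was maintained (no test files touched) */
  strictPass: boolean;
  /** Test files modified by implementer */
  modifiedTestFiles: string[];
  /** Verdict: clean (no changes), needs-review (verifier must judge) */
  verdict: "clean" | "needs-review";
  description: string;
}

// Assumed convention: files under test/, tests/ or __tests__/, or named *.test.* / *.spec.*.
const TEST_FILE_PATTERN = /(^|\/)(test|tests|__tests__)\/|\.(test|spec)\.[cm]?[jt]sx?$/;

function checkImplementerIsolation(changedFiles: string[]): IsolationCheck {
  const modifiedTestFiles = changedFiles.filter((file) => TEST_FILE_PATTERN.test(file));

  if (modifiedTestFiles.length === 0) {
    return {
      strictPass: true,
      modifiedTestFiles: [],
      verdict: "clean",
      description: "Implementer did not modify any test files",
    };
  }

  // Report the violation as-is; the verifier session decides whether it is legitimate.
  return {
    strictPass: false,
    modifiedTestFiles,
    verdict: "needs-review",
    description: `Implementer modified ${modifiedTestFiles.length} test file(s)`,
  };
}
```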
- **File: `src/tdd/orchestrator.ts`**
- - After Session 2 isolation check:
- - If `verdict === "clean"`: proceed normally
- - If `verdict === "needs-review"`: inject modified file info into verifier (Session 3) prompt
- - Update verifier prompt:
- ```
- ⚠️ ISOLATION REVIEW REQUIRED
- The implementer modified these test files: {modifiedTestFiles}
-
- You MUST review the changes to these files and determine:
- - LEGITIMATE: Fixing genuinely incorrect test expectations, adding missing imports,
- adjusting test fixtures to match correct implementation behavior
- - VIOLATION: Removing test cases, weakening assertions, deleting acceptance criteria
- checks, adding tests to inflate pass rate
-
- Include your verdict in the output:
- - isolationVerdict: "approved" | "rejected"
- - isolationReason: "<explanation>"
-
- If REJECTED: fail the story.
- ```
- - Parse verifier output for isolation verdict
- - Log the verdict (approved/rejected + reason) to structured JSONL
-
- **File: `src/tdd/orchestrator.ts`** (runTddSession result handling)
- - When isolation.verdict === "needs-review" and verifier says "rejected":
- - Mark story as failed with reason "TDD isolation violation confirmed by verifier"
- - When isolation.verdict === "needs-review" and verifier says "approved":
- - Mark story as passed with warning logged
-
- ### Tests:
- - Test isolation detection: modified test files → verdict "needs-review"
- - Test clean isolation: no test files → verdict "clean"
- - Test orchestrator injects isolation context into verifier prompt when needs-review
- - Test story fails when verifier rejects isolation
-
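The orchestrator side of Phase 4 (parse the verifier's isolation verdict, then pass or fail the story) could be sketched as below. The plan does not specify the exact output format the verifier uses, so the regular expressions here assume the `isolationVerdict` and `isolationReason` keys appear either as plain `key: "value"` lines or inside JSON. `parseIsolationVerdict` and `resolveStoryOutcome` are hypothetical names, and treating a missing verdict as a failure is a conservative assumption of this sketch, not something the plan states.

```typescript
type IsolationVerdict = "approved" | "rejected";

interface VerifierIsolationResult {
  isolationVerdict: IsolationVerdict;
  isolationReason: string;
}

// Accepts both `isolationVerdict: "approved"` and `"isolationVerdict": "approved"`.
function parseIsolationVerdict(output: string): VerifierIsolationResult | null {
  const verdict = output.match(/isolationVerdict"?\s*:\s*"(approved|rejected)"/);
  if (!verdict) return null;
  const reason = output.match(/isolationReason"?\s*:\s*"([^"]*)"/);
  return {
    isolationVerdict: verdict[1] as IsolationVerdict,
    isolationReason: reason ? reason[1] : "",
  };
}

interface StoryOutcome {
  passed: boolean;
  reason?: string;
  warning?: string;
}

function resolveStoryOutcome(
  needsReview: boolean,
  result: VerifierIsolationResult | null,
): StoryOutcome {
  if (!needsReview) return { passed: true };
  if (result === null) {
    // Assumption: no verdict from the verifier is treated as a failure.
    return { passed: false, reason: "Verifier did not return an isolation verdict" };
  }
  if (result.isolationVerdict === "rejected") {
    return { passed: false, reason: "TDD isolation violation confirmed by verifier" };
  }
  return {
    passed: true,
    warning: `Test file changes approved by verifier: ${result.isolationReason}`,
  };
}
```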
- ---
-
- ## Phase 5: Version Bump + Cleanup
-
- 1. Bump version to `0.9.1` in `package.json`
- 2. Run full test suite: `bun test`
- 3. Commit: `fix(v0.9.1): routing respects LLM complexity, isolation reworked to detector+verifier`
- 4. Do NOT push.
-
- ---
-
- ## Test Strategy
- - Mode: test-after
- - Reason: Internal refactor with existing test coverage. Tests updated alongside implementation per phase.
-
- ## Commits
- - Phase 1: `revert: undo 211a884 and 4fa39a4 for clean reimplementation`
- - Phase 2: `fix(routing): LLM decomposition outputs testStrategy directly`
- - Phase 3: `fix(analyze): enforce max stories warning, add analyzeConfig with naxVersion`
- - Phase 4: `fix(tdd): isolation becomes detector, verifier makes judgment`
- - Phase 5: `chore: bump to v0.9.1`
@@ -1,62 +0,0 @@
1
- # Fix Plan: nax prompts CLI + Scoped Test Coverage
2
- **Date:** 2026-02-23
3
- **Branch:** master (direct, v0.9.3)
4
-
5
- ## Phase 1: `nax prompts` CLI Command (US-001)
6
-
7
- ### Fix 1: Add CLI command handler
8
- **File:** `src/cli/prompts.ts` (new)
9
- **Change:** New CLI command that:
10
- - Accepts `-f <feature>` (required), `--out <dir>` (optional, default stdout), `--story <id>` (optional filter)
11
- - Loads PRD from feature dir
12
- - Loads config
13
- - For each story (or filtered story):
14
- - Runs routing (classify complexity)
15
- - Runs context building (buildContext + formatContextAsMarkdown)
16
- - Loads constitution (if configured)
17
- - Assembles prompt via buildSingleSessionPrompt / buildBatchPrompt
18
- - For three-session-tdd stories: also builds test-writer/implementer/verifier prompts
19
- - Outputs to stdout or writes files with YAML frontmatter
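As a rough sketch of the file output step, the frontmatter could be rendered like this. The field names `storyId`, `testStrategy`, and `contextTokens` are taken from the test list in Fix 3; `PromptMeta` and `withFrontmatter` are hypothetical names, and the exact frontmatter layout is an assumption.

```typescript
// Hypothetical shape; the three field names come from the Fix 3 test list.
interface PromptMeta {
  storyId: string;
  testStrategy: string;
  contextTokens: number;
}

function withFrontmatter(meta: PromptMeta, prompt: string): string {
  // JSON.stringify yields double-quoted strings, which are also valid YAML scalars.
  return [
    "---",
    `storyId: ${JSON.stringify(meta.storyId)}`,
    `testStrategy: ${JSON.stringify(meta.testStrategy)}`,
    `contextTokens: ${meta.contextTokens}`,
    "---",
    "",
    prompt,
  ].join("\n");
}
```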
20
-
21
- ### Fix 2: Register CLI command
22
- **File:** `src/cli/index.ts`
23
- **Change:** Add `prompts` subcommand to the CLI parser. Wire to handler.
24
-
25
- ### Fix 3: Add tests
26
- **File:** `test/prompts-cli.test.ts` (new)
27
- **Change:** Test that:
28
- - `nax prompts` loads PRD and produces prompt files
29
- - Frontmatter includes storyId, testStrategy, contextTokens
30
- - `--story` flag filters to single story
31
- - Three-session-tdd stories produce separate session prompts
32
- - Output dir is created if it doesn't exist
33
-
34
- ## Phase 2: Scoped Test Coverage Scanner (US-003)
35
-
36
- ### Fix 4: Add story scoping to test scanner
37
- **File:** `src/context/test-scanner.ts`
38
- **Change:**
39
- - Accept optional `scopeFiles?: string[]` parameter
40
- - When scopeFiles provided, derive test file patterns (e.g., `src/health.service.ts` → `**/health.service.{spec,test}.ts`)
41
- - Filter scan results to only matching test files
42
- - Fall back to full scan when scopeFiles is empty/undefined
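A minimal sketch of the scoping rule, assuming plain string matching on file names rather than a glob library. `deriveTestPattern` and `filterTestFiles` are hypothetical helper names, not the scanner's actual API.

```typescript
// Split "dir/name.ext" into its stem and extension (hypothetical helper).
function splitName(file: string): { stem: string; ext: string } {
  const name = file.split("/").pop() ?? file;
  const dot = name.lastIndexOf(".");
  return dot > 0
    ? { stem: name.slice(0, dot), ext: name.slice(dot + 1) }
    : { stem: name, ext: "ts" }; // assumption: default to .ts when no extension
}

// src/health.service.ts -> **/health.service.{spec,test}.ts
function deriveTestPattern(sourceFile: string): string {
  const { stem, ext } = splitName(sourceFile);
  return `**/${stem}.{spec,test}.${ext}`;
}

function filterTestFiles(testFiles: string[], scopeFiles?: string[]): string[] {
  // Fall back to the full scan when no scope is given.
  if (!scopeFiles || scopeFiles.length === 0) return testFiles;
  const wanted = new Set<string>();
  for (const src of scopeFiles) {
    const { stem, ext } = splitName(src);
    wanted.add(`${stem}.spec.${ext}`);
    wanted.add(`${stem}.test.${ext}`);
  }
  return testFiles.filter((t) => wanted.has(t.split("/").pop() ?? t));
}
```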
43
-
44
- ### Fix 5: Wire scoping in context builder
45
- **File:** `src/context/builder.ts`
46
- **Change:** Pass `currentStory.contextFiles` to generateTestCoverageSummary as scopeFiles.
47
-
48
- ### Fix 6: Add config option
49
- **File:** `src/config/schema.ts`
50
- **Change:** Add `context.testCoverage.scopeToStory` boolean (default: true) to config schema.
51
-
52
- ### Fix 7: Add tests for scoped scanning
53
- **File:** `test/context.test.ts` or `test/test-scanner.test.ts`
54
- **Change:** Test that test coverage scan respects scopeFiles filter.
55
-
56
- ## Test Strategy
57
- - Mode: test-after
58
- - Run: `bun test` after each phase
59
-
60
- ## Commits
61
- - Phase 1: `feat: add nax prompts CLI command for prompt inspection`
62
- - Phase 2: `feat: scope test coverage scanner to story-relevant files`
@@ -1,189 +0,0 @@
1
- # nax Roadmap — TDD-Lite, LLM Service Layer, Parallelism
2
-
3
- *Date: 2026-02-24*
4
- *Status: Proposed*
5
-
6
- ---
7
-
8
- ## Context
9
-
10
- nax v0.10.0 has a solid TDD pipeline for TypeScript libraries, but struggles with:
11
- - Non-TS/polyglot projects (UI, shell scripts, integration-heavy)
12
- - No parallelism (sequential story execution)
13
- - Memory-heavy (long-running agent sessions, OOMs on 4GB VPS)
14
- - Single agent backend (claude CLI only, no OpenClaw sub-agents)
15
-
16
- dev-orchestrator (OpenClaw skill) solves execution well — worktrees, parallel coders, phase-by-phase memory — but lacks nax's TDD pipeline, structured logging, PRD workflow, and CLI.
17
-
18
- ## nax vs dev-orchestrator — Honest Comparison
19
-
20
- | Capability | nax v0.10.0 | dev-orchestrator |
21
- |:-----------|:-----------|:-----------------|
22
- | **TDD pipeline** | ✅ Three-session (strict isolation) | ❌ None |
23
- | **Verification** | ✅ Isolated verifier | ❌ Code review only |
24
- | **Test quality gates** | ✅ Coverage, typecheck, lint | ❌ Up to the coder |
25
- | **Planning/PRD** | ✅ `nax plan` → `analyze` → structured stories | ❌ Simple task decomposition |
26
- | **Parallelism** | ❌ Sequential (batch = same session) | ✅ Git worktrees, true parallel |
27
- | **Memory** | ❌ Peaks 3-4GB+, OOMs on VPS | ✅ Phase-by-phase ~1-2GB, exits between phases |
28
- | **Agent backends** | ❌ claude CLI only | ✅ OpenClaw sessions_spawn + claude CLI |
29
- | **Structured logging** | ✅ JSONL, `nax runs list/show` | ❌ None |
30
- | **Hooks/plugins** | ✅ Global hooks, plugin system | ❌ None |
31
- | **Escalation tiers** | ✅ Automatic model escalation | ❌ Manual |
32
- | **Reproducibility** | ✅ Same PRD = same run | ❌ Depends on orchestrator prompt |
33
- | **Polyglot support** | ❌ TDD isolation breaks for UI/bash | ✅ Handles anything |
34
- | **Setup overhead** | ❌ PRD → analyze → config → run | ✅ Zero — just spawn with task |
35
- | **CLI** | ✅ Full CLI (`nax plan/run/accept/stories`) | ❌ OpenClaw skill only |
36
-
37
- ### Key Insight
38
-
39
- nax's TDD pipeline is its strongest differentiator. dev-orchestrator's execution model (worktrees + phase-by-phase agents) is proven and lighter. The gap is **agent spawning**: nax can't spawn parallel managed agents, which is what the LLM Service Layer (GitLab #3) is meant to address.
40
-
41
- ## Decision
42
-
43
- **Fix nax** in phases. Port dev-orchestrator's execution strengths into nax rather than rebuilding nax's TDD/PRD pipeline elsewhere.
44
-
45
- ---
46
-
47
- ## Phase 1 — TDD-Lite + Fallback (Quick Win)
48
-
49
- **Goal:** Solve GitLab #20, support non-TS projects without abandoning TDD.
50
-
51
- ### Three TDD Tiers
52
-
53
- | Strategy | Test Writer | Implementer | Verifier | Use Case |
54
- |:---------|:-----------|:------------|:---------|:---------|
55
- | `three-session-tdd` (strict) | Isolated — no source access | Isolated — no test access | Isolated ✅ | TS libraries, APIs |
56
- | `three-session-tdd-lite` | Can read source, write tests | Free to modify anything | Isolated ✅ | UI, polyglot, integration |
57
- | `test-after` | N/A | Writes code + tests together | N/A | Simple tasks |
58
-
59
- ### Fallback Logic
60
-
61
- - If test-writer produces **0 test files** in strict mode → auto-downgrade to `three-session-tdd-lite` and retry
62
- - No wasted iteration, no story pause
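The fallback rule can be written as a single transition function. This is a sketch under assumptions: `strategyAfterTestWriter` is a hypothetical name, and how the runner counts test files is not specified here.

```typescript
type TestStrategy = "three-session-tdd" | "three-session-tdd-lite" | "test-after";

// Decide which strategy to use after the test-writer session finishes.
function strategyAfterTestWriter(
  current: TestStrategy,
  testFilesWritten: number,
): TestStrategy {
  // Strict mode with no test files: downgrade to lite and retry.
  if (current === "three-session-tdd" && testFilesWritten === 0) {
    return "three-session-tdd-lite";
  }
  // Otherwise keep the current strategy.
  return current;
}
```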
63
-
64
- ### Config
65
-
66
- ```json
67
- {
68
- "tdd": {
69
- "strategy": "auto",
70
- "enabled": true
71
- }
72
- }
73
- ```
74
-
75
- - `auto` (default): LLM router classifies testability, picks strict or lite
76
- - `strict`: Always three-session-tdd
77
- - `lite`: Always three-session-tdd-lite
78
- - `off`: test-after for everything
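One way to read the four values is as a mapping from the config setting to a concrete test strategy. The sketch below assumes the router's pick is passed in as `routerChoice`; that parameter and the name `resolveTestStrategy` are hypothetical, and the `enabled` flag is left out because its semantics are not described here.

```typescript
type TddSetting = "auto" | "strict" | "lite" | "off";
type ResolvedStrategy = "three-session-tdd" | "three-session-tdd-lite" | "test-after";

function resolveTestStrategy(
  setting: TddSetting,
  routerChoice: "strict" | "lite", // assumption: router output for `auto`
): ResolvedStrategy {
  switch (setting) {
    case "strict":
      return "three-session-tdd";
    case "lite":
      return "three-session-tdd-lite";
    case "off":
      return "test-after";
    case "auto":
      // The LLM router classifies testability and picks strict or lite.
      return routerChoice === "strict" ? "three-session-tdd" : "three-session-tdd-lite";
  }
}
```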
79
-
80
- ### Scope
81
-
82
- - Modify `src/tdd/` prompts for lite mode (relax isolation rules for test-writer)
83
- - Add fallback logic in `src/execution/runner.ts`
84
- - Add `strategy` to routing decision
85
- - Update config schema
86
- - No architecture changes needed
87
-
88
- ---
89
-
90
- ## Phase 2 — LLM Service Layer (GitLab #3)
91
-
92
- **Goal:** Abstract agent spawning so nax can use multiple backends and run agents in parallel.
93
-
94
- ### Agent Interface
95
-
96
- ```typescript
97
- interface Agent {
98
- name: string;
99
- spawn(options: AgentSpawnOptions): Promise<AgentSession>;
100
- isInstalled(): Promise<boolean>;
101
- }
102
-
103
- interface AgentSession {
104
- id: string;
105
- status: 'running' | 'completed' | 'failed';
106
- workdir: string;
107
- wait(): Promise<AgentResult>;
108
- kill(): Promise<void>;
109
- steer?(message: string): Promise<void>; // optional
110
- }
111
-
112
- interface AgentSpawnOptions {
113
- prompt: string;
114
- workdir: string;
115
- model?: string;
116
- timeout?: number;
117
- env?: Record<string, string>;
118
- }
119
- ```
120
-
121
- ### Backends
122
-
123
- | Backend | How | Parallelism | Where |
124
- |:--------|:----|:-----------|:------|
125
- | `ClaudeCliAgent` | `claude -p` (existing) | ❌ Sequential | VPS, Mac01 |
126
- | `OpenClawAgent` | `sessions_spawn` | ✅ Managed sub-agents | OpenClaw environments |
127
- | `ApiAgent` | Direct Anthropic/Google API | ✅ Concurrent requests | Anywhere |
128
-
129
- ### Key Design Decisions
130
-
131
- - Agent selection via config: `autoMode.defaultAgent: "claude-cli" | "openclaw" | "api"`
132
- - Each backend implements the same interface — runner doesn't care
133
- - `ApiAgent` is the lightest (no CLI overhead) but needs prompt engineering for tool use
134
-
135
- ---
136
-
137
- ## Phase 3 — Worktree Parallelism
138
-
139
- **Goal:** Run N stories concurrently using git worktrees + LLM Service Layer agents.
140
-
141
- ### Flow
142
-
143
- ```
144
- nax run -f feature --parallel 3
145
-
146
- ├── Worktree: .nax-wt/story-001/ → Agent 1 (tdd pipeline)
147
- ├── Worktree: .nax-wt/story-002/ → Agent 2 (tdd pipeline)
148
- └── Worktree: .nax-wt/story-003/ → Agent 3 (tdd pipeline)
149
-
150
- ├── Each agent exits after its story (phase-by-phase memory)
151
- ├── Verifier runs per-worktree (isolated)
152
- └── Merge back to main branch on pass
153
- ```
154
-
155
- ### Benefits
156
-
157
- - True parallelism (borrowed from dev-orchestrator's proven model)
158
- - Phase-by-phase execution = low memory (solves VPS OOM)
159
- - Each worktree is isolated — no git conflicts during execution
160
- - Merge conflicts detected at merge time, not runtime
161
-
162
- ### Dependencies
163
-
164
- - Phase 2 (LLM Service Layer) — need agent spawning abstraction
165
- - Worktree management utilities (create, merge, cleanup)
166
- - Dependency-aware scheduling (respect story dependencies in PRD)
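Dependency-aware scheduling under a `--parallel N` cap can be sketched as picking the next batch of ready stories. The `dependencies` field name and `pickReadyStories` are assumptions; the PRD's real story schema may differ.

```typescript
// Hypothetical story shape: `dependencies` lists ids that must finish first.
interface SchedulableStory {
  id: string;
  dependencies: string[];
}

function pickReadyStories(
  stories: SchedulableStory[],
  completed: Set<string>,
  running: Set<string>,
  parallel: number,
): SchedulableStory[] {
  // Free slots left under the parallelism cap.
  const slots = Math.max(0, parallel - running.size);
  return stories
    .filter((s) => !completed.has(s.id) && !running.has(s.id))
    .filter((s) => s.dependencies.every((dep) => completed.has(dep)))
    .slice(0, slots);
}
```

The caller would invoke this each time a worktree finishes, moving the finished id from `running` to `completed` before asking for the next batch.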
167
-
168
- ---
169
-
170
- ## Dependency Chain
171
-
172
- ```
173
- Phase 1: tdd-lite + fallback ← standalone, no blockers
174
-
175
- Phase 2: LLM Service Layer (#3) ← abstracts agent spawning
176
-
177
- Phase 3: Worktree parallelism ← needs Phase 2
178
-
179
- Memory optimization ← comes free with Phase 3
180
- ```
181
-
182
- ---
183
-
184
- ## Open Questions
185
-
186
- 1. Should `ApiAgent` support tool use (file read/write/exec) or is it prompt-only?
187
- 2. For OpenClaw backend — do we use `sessions_spawn` (managed) or `exec` with claude CLI?
188
- 3. Worktree merge strategy — rebase or merge commit?
189
- 4. Should nax accept a `--backend` flag or always use config?