@nathapp/nax 0.27.1 → 0.29.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (383) hide show
  1. package/CHANGELOG.md +13 -2
  2. package/dist/nax.js +72691 -0
  3. package/package.json +12 -4
  4. package/src/cli/config.ts +42 -1
  5. package/src/cli/prompts.ts +18 -6
  6. package/src/config/defaults.ts +2 -0
  7. package/src/config/schemas.ts +11 -0
  8. package/src/config/types.ts +8 -0
  9. package/src/context/builder.ts +10 -1
  10. package/src/pipeline/stages/execution.ts +5 -0
  11. package/src/pipeline/stages/prompt.ts +13 -4
  12. package/src/precheck/checks-warnings.ts +37 -0
  13. package/src/precheck/checks.ts +1 -0
  14. package/src/precheck/index.ts +14 -7
  15. package/src/prompts/builder.ts +178 -0
  16. package/src/prompts/index.ts +2 -0
  17. package/src/prompts/loader.ts +43 -0
  18. package/src/prompts/sections/conventions.ts +15 -0
  19. package/src/prompts/sections/index.ts +11 -0
  20. package/src/prompts/sections/isolation.ts +24 -0
  21. package/src/prompts/sections/role-task.ts +34 -0
  22. package/src/prompts/sections/story.ts +13 -0
  23. package/src/prompts/sections/verdict.ts +70 -0
  24. package/src/prompts/templates/implementer.ts +6 -0
  25. package/src/prompts/templates/single-session.ts +6 -0
  26. package/src/prompts/templates/test-writer.ts +6 -0
  27. package/src/prompts/templates/verifier.ts +6 -0
  28. package/src/prompts/types.ts +21 -0
  29. package/src/review/runner.ts +6 -1
  30. package/src/tdd/session-runner.ts +12 -12
  31. package/src/version.ts +2 -1
  32. package/.claude/rules/01-project-conventions.md +0 -34
  33. package/.claude/rules/02-test-architecture.md +0 -39
  34. package/.claude/rules/03-test-writing.md +0 -58
  35. package/.claude/rules/04-forbidden-patterns.md +0 -29
  36. package/.claude/settings.json +0 -15
  37. package/.githooks/pre-commit +0 -16
  38. package/.gitlab-ci.yml +0 -103
  39. package/.mcp.json +0 -8
  40. package/BRIEF.md +0 -140
  41. package/CLAUDE.md +0 -143
  42. package/US-007-IMPLEMENTATION.md +0 -139
  43. package/biome.json +0 -14
  44. package/bun.lock +0 -163
  45. package/bunfig.toml +0 -12
  46. package/docker-compose.test.yml +0 -15
  47. package/docs/20260216-fix-plan-context-review.md +0 -56
  48. package/docs/20260216-relentless-vs-ngent-comparison.md +0 -208
  49. package/docs/20260216-v02-plan.md +0 -136
  50. package/docs/20260216-v02-review.md +0 -685
  51. package/docs/20260217-dogfood-findings.md +0 -56
  52. package/docs/20260217-p2-plus-plan.md +0 -117
  53. package/docs/20260217-partial-fixes-plan.md +0 -62
  54. package/docs/20260217-plan-analyze-spec.md +0 -117
  55. package/docs/20260217-post-impl-review.md +0 -1137
  56. package/docs/20260217-quick-wins-plan.md +0 -66
  57. package/docs/20260217-split-runner-plan.md +0 -75
  58. package/docs/20260217-v03-impl-plan.md +0 -80
  59. package/docs/20260217-v03-post-impl-review.md +0 -589
  60. package/docs/20260217-v04-impl-plan.md +0 -86
  61. package/docs/20260217-v05-post-impl-review.md +0 -850
  62. package/docs/20260217-v06-post-impl-review.md +0 -817
  63. package/docs/20260218-adr003-port-plan.md +0 -151
  64. package/docs/20260218-review-adr003-verification.md +0 -175
  65. package/docs/20260219-fix-plan-bug16-19.md +0 -79
  66. package/docs/20260219-fix-plan-bug20-22.md +0 -114
  67. package/docs/20260219-plan-llm-routing.md +0 -116
  68. package/docs/20260219-review-bug20-22-fixes.md +0 -135
  69. package/docs/20260219-routing-baseline-keyword.md +0 -63
  70. package/docs/20260220-plan-structured-logging-p1.md +0 -80
  71. package/docs/20260220-plan-structured-logging-p2.md +0 -37
  72. package/docs/20260220-review-llm-routing.md +0 -180
  73. package/docs/20260220-review-post-fix-llm-routing.md +0 -70
  74. package/docs/20260221-fix-plan-relevantfiles-split.md +0 -101
  75. package/docs/20260221-fix-plan-routing-mode.md +0 -125
  76. package/docs/20260221-review-v0.9-implementation.md +0 -379
  77. package/docs/20260222-fix-plan-v091-routing-isolation.md +0 -197
  78. package/docs/20260223-fix-plan-prompt-audit.md +0 -62
  79. package/docs/20260224-nax-roadmap-phases.md +0 -189
  80. package/docs/20260225-phase2-llm-service-layer.md +0 -401
  81. package/docs/20260225-review-v0.10.1.md +0 -187
  82. package/docs/20260303-v010-implementation-plan.md +0 -165
  83. package/docs/20260304-review-nax.md +0 -492
  84. package/docs/CLAUDE.md.bak +0 -191
  85. package/docs/ROADMAP.md +0 -364
  86. package/docs/SPEC-rectification.md +0 -0
  87. package/docs/SPEC.md +0 -324
  88. package/docs/US-001-plugin-loading-verification.md +0 -152
  89. package/docs/adr/ADR-005-implementation-plan.md +0 -655
  90. package/docs/adr/ADR-005-pipeline-re-architecture.md +0 -464
  91. package/docs/architecture-analysis.md +0 -1076
  92. package/docs/bugs/BUG-21-escalation-null-attempts.md +0 -48
  93. package/docs/bugs-from-dogfood-run-c.md +0 -243
  94. package/docs/code-review-20260228.md +0 -612
  95. package/docs/code-review-v0.15.0.md +0 -629
  96. package/docs/hook-lifecycle-test-plan.md +0 -149
  97. package/docs/releases/v0.11.0-and-earlier.md +0 -20
  98. package/docs/releases/v0.12.0.md +0 -15
  99. package/docs/releases/v0.13.0.md +0 -14
  100. package/docs/releases/v0.14.0.md +0 -20
  101. package/docs/releases/v0.14.1.md +0 -36
  102. package/docs/releases/v0.14.2.md +0 -51
  103. package/docs/releases/v0.14.3.md +0 -174
  104. package/docs/releases/v0.14.4.md +0 -94
  105. package/docs/releases/v0.15.0.md +0 -502
  106. package/docs/releases/v0.15.1.md +0 -170
  107. package/docs/releases/v0.15.3.md +0 -193
  108. package/docs/specs/bug-039-orphan-processes.md +0 -131
  109. package/docs/specs/bug-040-review-rectification.md +0 -82
  110. package/docs/specs/bug-041-cross-story-test-isolation.md +0 -88
  111. package/docs/specs/bug-042-verifier-failure-capture.md +0 -117
  112. package/docs/specs/bun-pty-migration.md +0 -171
  113. package/docs/specs/central-run-registry.md +0 -116
  114. package/docs/specs/feat-010-smart-runner-git-history.md +0 -96
  115. package/docs/specs/feat-011-file-context-strategy.md +0 -73
  116. package/docs/specs/feat-012-tdd-writer-tier.md +0 -79
  117. package/docs/specs/feat-013-test-after-review.md +0 -89
  118. package/docs/specs/feat-014-heartbeat-observability.md +0 -127
  119. package/docs/specs/status-file-consolidation.md +0 -93
  120. package/docs/specs/status-file-v0.10.1.md +0 -812
  121. package/docs/specs/trigger-completion.md +0 -145
  122. package/docs/specs/verification-architecture-v2.md +0 -343
  123. package/docs/tdd/strategies.md +0 -97
  124. package/docs/v0.10-global-config.md +0 -206
  125. package/docs/v0.10-plugin-system.md +0 -415
  126. package/docs/v0.10-prompt-optimizer.md +0 -234
  127. package/docs/v0.3-spec.md +0 -244
  128. package/docs/v0.4-spec.md +0 -140
  129. package/docs/v0.5-spec.md +0 -237
  130. package/docs/v0.6-spec.md +0 -371
  131. package/docs/v0.7-spec.md +0 -177
  132. package/docs/v0.8-llm-routing.md +0 -206
  133. package/docs/v0.8-structured-logging.md +0 -132
  134. package/docs/v0.9.3-prompt-audit.md +0 -112
  135. package/examples/plugins/console-reporter/index.test.ts +0 -207
  136. package/examples/plugins/console-reporter/index.ts +0 -110
  137. package/memory/topic/feat-010-baseref.md +0 -28
  138. package/memory/topic/feat-013-test-after-deprecation.md +0 -22
  139. package/nax/config.json +0 -154
  140. package/nax/features/bug-039-medium/prd.json +0 -45
  141. package/nax/features/bugfix-v0171/prd.json +0 -52
  142. package/nax/features/central-run-registry/prd.json +0 -105
  143. package/nax/features/config-management/prd.json +0 -108
  144. package/nax/features/config-management/progress.txt +0 -5
  145. package/nax/features/diagnose/acceptance.test.ts +0 -414
  146. package/nax/features/diagnose/prd.json +0 -41
  147. package/nax/features/nax-compliance/prd.json +0 -52
  148. package/nax/features/nax-compliance/progress.txt +0 -1
  149. package/nax/features/orchestration-fixes/prd.json +0 -89
  150. package/nax/features/orchestration-fixes/progress.txt +0 -1
  151. package/nax/features/plugin-integration/US-007-VERIFICATION.md +0 -259
  152. package/nax/features/plugin-integration/prd.json +0 -208
  153. package/nax/features/plugin-integration/progress.txt +0 -5
  154. package/nax/features/post-rearch-bugfix/prd.json +0 -137
  155. package/nax/features/precheck/prd.json +0 -205
  156. package/nax/features/precheck/progress.txt +0 -15
  157. package/nax/features/review-quality/prd.json +0 -55
  158. package/nax/features/routing-persistence/prd.json +0 -104
  159. package/nax/features/routing-persistence/progress.txt +0 -1
  160. package/nax/features/smart-test-runner/plan.md +0 -7
  161. package/nax/features/smart-test-runner/prd.json +0 -203
  162. package/nax/features/smart-test-runner/progress.txt +0 -13
  163. package/nax/features/smart-test-runner/spec.md +0 -7
  164. package/nax/features/smart-test-runner/tasks.md +0 -8
  165. package/nax/features/status-file-consolidation/prd.json +0 -106
  166. package/nax/features/structured-logging/prd.json +0 -199
  167. package/nax/features/trigger-completion/prd.json +0 -150
  168. package/nax/features/trigger-completion/progress.txt +0 -7
  169. package/nax/features/unlock/prd.json +0 -36
  170. package/nax/features/v0.18.3-execution-reliability/prd.json +0 -80
  171. package/nax/features/v0.18.3-execution-reliability/progress.txt +0 -3
  172. package/nax/features/v0.19.0-hardening/plan.md +0 -7
  173. package/nax/features/v0.19.0-hardening/prd.json +0 -84
  174. package/nax/features/v0.19.0-hardening/progress.txt +0 -7
  175. package/nax/features/v0.19.0-hardening/spec.md +0 -18
  176. package/nax/features/v0.19.0-hardening/tasks.md +0 -8
  177. package/nax/features/verify-v2/prd.json +0 -79
  178. package/nax/features/verify-v2/progress.txt +0 -3
  179. package/nax/status.json +0 -36
  180. package/test/COVERAGE-GAPS.md +0 -333
  181. package/test/e2e/cm-003-default-view.test.ts +0 -195
  182. package/test/e2e/plan-analyze-run.test.ts +0 -902
  183. package/test/helpers/helpers.test.ts +0 -295
  184. package/test/helpers/timeout.ts +0 -42
  185. package/test/integration/US-002-TEST-SUMMARY.md +0 -107
  186. package/test/integration/US-003-TEST-SUMMARY.md +0 -149
  187. package/test/integration/US-004-TEST-SUMMARY.md +0 -106
  188. package/test/integration/US-005-TEST-SUMMARY.md +0 -138
  189. package/test/integration/US-007-TEST-SUMMARY.md +0 -100
  190. package/test/integration/cli/agent-validation.test.ts +0 -439
  191. package/test/integration/cli/cli-config-default-edge-cases.test.ts +0 -223
  192. package/test/integration/cli/cli-config-default-view.test.ts +0 -230
  193. package/test/integration/cli/cli-config-diff.test.ts +0 -461
  194. package/test/integration/cli/cli-config.test.ts +0 -737
  195. package/test/integration/cli/cli-diagnose.test.ts +0 -595
  196. package/test/integration/cli/cli-logs.test.ts +0 -346
  197. package/test/integration/cli/cli-plugins.test.ts +0 -679
  198. package/test/integration/cli/cli-precheck.test.ts +0 -372
  199. package/test/integration/cli/cli-run-headless.test.ts +0 -174
  200. package/test/integration/cli/cli.test.ts +0 -76
  201. package/test/integration/cli/precheck-integration.test.ts +0 -476
  202. package/test/integration/cli/precheck-orchestrator.test.ts +0 -247
  203. package/test/integration/cli/precheck.test.ts +0 -806
  204. package/test/integration/config/config-loader.test.ts +0 -266
  205. package/test/integration/config/config.test.ts +0 -444
  206. package/test/integration/config/merger.test.ts +0 -466
  207. package/test/integration/config/paths.test.ts +0 -52
  208. package/test/integration/config/security-loader.test.ts +0 -83
  209. package/test/integration/context/context-integration.test.ts +0 -703
  210. package/test/integration/context/context-path-security.test.ts +0 -173
  211. package/test/integration/context/context-provider-injection.test.ts +0 -507
  212. package/test/integration/context/context-verification-integration.test.ts +0 -296
  213. package/test/integration/context/s5-greenfield-fallback.test.ts +0 -298
  214. package/test/integration/execution/execution-isolation.test.ts +0 -143
  215. package/test/integration/execution/execution.test.ts +0 -634
  216. package/test/integration/execution/feature-status-write.test.ts +0 -302
  217. package/test/integration/execution/parallel.test.ts +0 -251
  218. package/test/integration/execution/prd-pause.test.ts +0 -205
  219. package/test/integration/execution/prd-resolvers.test.ts +0 -186
  220. package/test/integration/execution/progress.test.ts +0 -34
  221. package/test/integration/execution/runner-batching.test.ts +0 -682
  222. package/test/integration/execution/runner-config-plugins.test.ts +0 -462
  223. package/test/integration/execution/runner-escalation.test.ts +0 -561
  224. package/test/integration/execution/runner-fixes.test.ts +0 -400
  225. package/test/integration/execution/runner-plugin-integration.test.ts +0 -544
  226. package/test/integration/execution/runner-queue-and-attempts.test.ts +0 -476
  227. package/test/integration/execution/status-file-integration.test.ts +0 -289
  228. package/test/integration/execution/status-file.test.ts +0 -380
  229. package/test/integration/execution/status-writer.test.ts +0 -447
  230. package/test/integration/execution/story-id-in-events.test.ts +0 -274
  231. package/test/integration/interaction/interaction-chain-pipeline.test.ts +0 -476
  232. package/test/integration/pipeline/hooks.test.ts +0 -363
  233. package/test/integration/pipeline/pipeline-acceptance.test.ts +0 -303
  234. package/test/integration/pipeline/pipeline-events.test.ts +0 -476
  235. package/test/integration/pipeline/pipeline.test.ts +0 -660
  236. package/test/integration/pipeline/reporter-lifecycle.test.ts +0 -862
  237. package/test/integration/pipeline/verify-stage.test.ts +0 -286
  238. package/test/integration/plan/analyze-integration.test.ts +0 -262
  239. package/test/integration/plan/analyze-scanner.test.ts +0 -132
  240. package/test/integration/plan/logger.test.ts +0 -461
  241. package/test/integration/plan/plan.test.ts +0 -157
  242. package/test/integration/plugins/config-integration.test.ts +0 -173
  243. package/test/integration/plugins/config-resolution.test.ts +0 -523
  244. package/test/integration/plugins/loader.test.ts +0 -644
  245. package/test/integration/plugins/plugins-registry.test.ts +0 -747
  246. package/test/integration/plugins/validator.test.ts +0 -564
  247. package/test/integration/review/review-config-commands.test.ts +0 -320
  248. package/test/integration/review/review-config-schema.test.ts +0 -117
  249. package/test/integration/review/review-plugin-integration.test.ts +0 -729
  250. package/test/integration/review/review.test.ts +0 -150
  251. package/test/integration/routing/plugin-routing-advanced.test.ts +0 -461
  252. package/test/integration/routing/plugin-routing-core.test.ts +0 -527
  253. package/test/integration/routing/routing-stage-bug-021.test.ts +0 -275
  254. package/test/integration/routing/routing-stage-greenfield.test.ts +0 -287
  255. package/test/integration/tdd/tdd-cleanup.test.ts +0 -246
  256. package/test/integration/tdd/tdd-orchestrator-core.test.ts +0 -565
  257. package/test/integration/tdd/tdd-orchestrator-failureCategory.test.ts +0 -355
  258. package/test/integration/tdd/tdd-orchestrator-fallback.test.ts +0 -311
  259. package/test/integration/tdd/tdd-orchestrator-lite.test.ts +0 -289
  260. package/test/integration/tdd/tdd-orchestrator-prompts.test.ts +0 -260
  261. package/test/integration/tdd/tdd-orchestrator-verdict.test.ts +0 -536
  262. package/test/integration/tmp/headless-test/test.jsonl +0 -30
  263. package/test/integration/verification/test-scanner.test.ts +0 -403
  264. package/test/integration/verification/verification-asset-check.test.ts +0 -143
  265. package/test/integration/worktree/manager.test.ts +0 -218
  266. package/test/integration/worktree/worktree-merge.test.ts +0 -341
  267. package/test/manual/logging-formatter-demo.ts +0 -158
  268. package/test/ui/tui-agent-panel.test.tsx +0 -99
  269. package/test/ui/tui-pty-integration.test.tsx +0 -146
  270. package/test/unit/acceptance.test.ts +0 -187
  271. package/test/unit/agent-stderr-capture.test.ts +0 -147
  272. package/test/unit/agents/claude.test.ts +0 -107
  273. package/test/unit/analyze-classifier.test.ts +0 -216
  274. package/test/unit/analyze.test.ts +0 -224
  275. package/test/unit/auto-detect.test.ts +0 -250
  276. package/test/unit/cli-status-project-level.test.ts +0 -283
  277. package/test/unit/cli-status.test.ts +0 -418
  278. package/test/unit/commands/common.test.ts +0 -321
  279. package/test/unit/commands/logs.test.ts +0 -458
  280. package/test/unit/commands/runs.test.ts +0 -303
  281. package/test/unit/commands/unlock.test.ts +0 -320
  282. package/test/unit/config/defaults.test.ts +0 -70
  283. package/test/unit/config/quality-commands-schema.test.ts +0 -72
  284. package/test/unit/config/regression-gate-schema.test.ts +0 -160
  285. package/test/unit/config/smart-runner-flag.test.ts +0 -250
  286. package/test/unit/constitution-generators.test.ts +0 -161
  287. package/test/unit/constitution.test.ts +0 -210
  288. package/test/unit/context/context-autodetect.test.ts +0 -297
  289. package/test/unit/context/context-build.test.ts +0 -575
  290. package/test/unit/context/context-coverage.test.ts +0 -236
  291. package/test/unit/context/context-error.test.ts +0 -93
  292. package/test/unit/context/context-estimate-tokens.test.ts +0 -201
  293. package/test/unit/context/context-format.test.ts +0 -302
  294. package/test/unit/context/context-isolation.test.ts +0 -267
  295. package/test/unit/context/context-sort.test.ts +0 -93
  296. package/test/unit/context/context-story.test.ts +0 -108
  297. package/test/unit/context/prior-failures.test.ts +0 -463
  298. package/test/unit/context.test.ts +0 -1726
  299. package/test/unit/cost.test.ts +0 -231
  300. package/test/unit/crash-recovery.test.ts +0 -309
  301. package/test/unit/escalation.test.ts +0 -127
  302. package/test/unit/execution/lifecycle/run-completion.test.ts +0 -240
  303. package/test/unit/execution/lifecycle/run-regression.test.ts +0 -420
  304. package/test/unit/execution/pid-registry.test.ts +0 -241
  305. package/test/unit/execution/sequential-executor.test.ts +0 -235
  306. package/test/unit/execution/sfc-004-dead-code-cleanup.test.ts +0 -89
  307. package/test/unit/execution/structured-failure.test.ts +0 -415
  308. package/test/unit/execution-logging-stderr.test.ts +0 -157
  309. package/test/unit/execution-stage.test.ts +0 -123
  310. package/test/unit/fix-generator.test.ts +0 -276
  311. package/test/unit/formatters.test.ts +0 -468
  312. package/test/unit/greenfield.test.ts +0 -180
  313. package/test/unit/hooks/shell-security.test.ts +0 -40
  314. package/test/unit/interaction/auto-plugin.test.ts +0 -162
  315. package/test/unit/interaction/human-review-trigger.test.ts +0 -165
  316. package/test/unit/interaction-network-failures.test.ts +0 -390
  317. package/test/unit/interaction-plugins.test.ts +0 -472
  318. package/test/unit/logging/formatter.test.ts +0 -456
  319. package/test/unit/merge.test.ts +0 -269
  320. package/test/unit/metrics/aggregator.test.ts +0 -164
  321. package/test/unit/metrics/tracker.test.ts +0 -186
  322. package/test/unit/metrics.test.ts +0 -276
  323. package/test/unit/optimizer/noop.optimizer.test.ts +0 -125
  324. package/test/unit/optimizer/rule-based.optimizer.test.ts +0 -358
  325. package/test/unit/pipeline/event-bus.test.ts +0 -105
  326. package/test/unit/pipeline/routing-partial-override.test.ts +0 -121
  327. package/test/unit/pipeline/runner-retry.test.ts +0 -89
  328. package/test/unit/pipeline/stages/autofix.test.ts +0 -97
  329. package/test/unit/pipeline/stages/completion-review-gate.test.ts +0 -218
  330. package/test/unit/pipeline/stages/execution-ambiguity.test.ts +0 -311
  331. package/test/unit/pipeline/stages/execution-merge-conflict.test.ts +0 -218
  332. package/test/unit/pipeline/stages/rectify.test.ts +0 -101
  333. package/test/unit/pipeline/stages/regression-stage.test.ts +0 -69
  334. package/test/unit/pipeline/stages/review.test.ts +0 -201
  335. package/test/unit/pipeline/stages/routing-idempotence.test.ts +0 -139
  336. package/test/unit/pipeline/stages/routing-initial-complexity.test.ts +0 -321
  337. package/test/unit/pipeline/stages/routing-persistence.test.ts +0 -380
  338. package/test/unit/pipeline/stages/verify.test.ts +0 -267
  339. package/test/unit/pipeline/subscribers/events-writer.test.ts +0 -227
  340. package/test/unit/pipeline/subscribers/hooks.test.ts +0 -84
  341. package/test/unit/pipeline/subscribers/interaction.test.ts +0 -313
  342. package/test/unit/pipeline/subscribers/registry.test.ts +0 -149
  343. package/test/unit/pipeline/subscribers/reporters.test.ts +0 -90
  344. package/test/unit/pipeline/verify-smart-runner.test.ts +0 -345
  345. package/test/unit/prd-auto-default.test.ts +0 -291
  346. package/test/unit/prd-failure-category.test.ts +0 -177
  347. package/test/unit/prd-get-next-story.test.ts +0 -215
  348. package/test/unit/precheck-checks.test.ts +0 -841
  349. package/test/unit/precheck-story-size-gate.test.ts +0 -288
  350. package/test/unit/precheck-types.test.ts +0 -143
  351. package/test/unit/prompts.test.ts +0 -476
  352. package/test/unit/queue.test.ts +0 -237
  353. package/test/unit/rectification.test.ts +0 -285
  354. package/test/unit/registry.test.ts +0 -288
  355. package/test/unit/review/runner.test.ts +0 -117
  356. package/test/unit/routing/content-hash.test.ts +0 -99
  357. package/test/unit/routing/routing-stability.test.ts +0 -208
  358. package/test/unit/routing/strategies/llm.test.ts +0 -306
  359. package/test/unit/routing-advanced.test.ts +0 -313
  360. package/test/unit/routing-core.test.ts +0 -341
  361. package/test/unit/routing-strategies.test.ts +0 -440
  362. package/test/unit/storyid-events.test.ts +0 -213
  363. package/test/unit/tdd-verdict.test.ts +0 -492
  364. package/test/unit/test-output-parser.test.ts +0 -377
  365. package/test/unit/ui/tui-controls.test.ts +0 -335
  366. package/test/unit/ui/tui-cost-and-pty.test.ts +0 -190
  367. package/test/unit/ui/tui-layout.test.ts +0 -379
  368. package/test/unit/ui/tui-stories.test.ts +0 -333
  369. package/test/unit/unit-isolation.test.ts +0 -135
  370. package/test/unit/utils/git.test.ts +0 -50
  371. package/test/unit/utils/path-security.test.ts +0 -47
  372. package/test/unit/utils-helpers.test.ts +0 -318
  373. package/test/unit/verdict.test.ts +0 -325
  374. package/test/unit/verification/orchestrator-types.test.ts +0 -54
  375. package/test/unit/verification/orchestrator.test.ts +0 -66
  376. package/test/unit/verification/smart-runner-config.test.ts +0 -163
  377. package/test/unit/verification/smart-runner-discovery.test.ts +0 -354
  378. package/test/unit/verification/smart-runner.test.ts +0 -262
  379. package/test/unit/verification/strategies/acceptance.test.ts +0 -33
  380. package/test/unit/verification/strategies/regression.test.ts +0 -87
  381. package/test/unit/verification/strategies/scoped.test.ts +0 -100
  382. package/test/unit/worktree-manager.test.ts +0 -159
  383. package/tsconfig.json +0 -27
@@ -1,48 +0,0 @@
1
- # BUG-21: Escalation fails when PRD story has null/missing attempts field
2
-
3
- **Severity:** High
4
- **Component:** src/execution/runner.ts (escalation logic)
5
- **Found:** 2026-02-23
6
- **Status:** Open
7
-
8
- ## Summary
9
-
10
- When a story attempts field is null or missing in the PRD, escalation breaks. The agent returns finalAction: escalate but the story is immediately marked as failed instead of being re-queued at a higher model tier.
11
-
12
- ## Reproduction
13
-
14
- 1. Create a PRD with stories that do NOT include an attempts field (or set it to null)
15
- 2. Run nax run -f feature --headless
16
- 3. When a story fails and returns finalAction: escalate, observe:
17
- - Log shows Story failed - max attempts reached immediately
18
- - No actual escalation to next tier occurs
19
- - PRD shows attempts: null after the run
20
-
21
- ## Root Cause
22
-
23
- The runner increments story.attempts via:
24
- attempts: s.attempts + 1
25
- But if s.attempts is null or undefined, this produces NaN, breaking subsequent comparisons.
26
-
27
- Pre-iteration tier check (line ~338):
28
- if (tierCfg && story.attempts >= tierCfg.attempts)
29
- null >= 5 evaluates to false in JS, so this check is silently skipped.
30
-
31
- Post-execution canEscalate check (line ~704):
32
- const canEscalate = storiesToEscalate.every((s) => s.attempts < maxAttempts);
33
- null < 10 is true in JS, so canEscalate is true. But then attempts: s.attempts + 1 yields null + 1 = 1.
34
- The story still gets marked failed, suggesting the PRD save/reload cycle loses the updated value or the iteration loop exits before re-processing.
35
-
36
- ## Suggested Fix
37
-
38
- 1. Initialize attempts to 0 when loading PRD stories with null/undefined attempts
39
- 2. Defensive coercion: attempts: (s.attempts ?? 0) + 1
40
- 3. Add PRD validation on load to ensure all stories have attempts: number (default 0)
41
-
42
- ## Observed Log
43
-
44
- [21:45:33] [execution] Agent session failed { rateLimited: false, storyId: US-002 }
45
- [21:45:33] [agent.complete] { storyId: US-002, success: false, finalAction: escalate, estimatedCost: 0.75 }
46
- [21:45:33] [execution] Story failed - max attempts reached { storyId: US-002 }
47
-
48
- No escalation log line between finalAction: escalate and Story failed, confirming escalation path was skipped.
@@ -1,243 +0,0 @@
1
- # Bugs Found: Dogfood Run C (2026-02-19, plan→analyze→run pipeline)
2
-
3
- ## BUG-16: maxIterations is global, not per-story (CODE)
4
-
5
- **Severity:** High — causes infinite loops on stuck stories
6
-
7
- **Evidence:** Config had `maxIterations: 5` but nax ran **20 iterations**.
8
- The main loop at `runner.ts:140` checks `iterations < config.execution.maxIterations`,
9
- but the config value was overridden. Investigation shows the dogfood config had `maxIterations: 5`
10
- but per the summary the run did 20 iterations.
11
-
12
- **Root Cause:** `maxIterations` is a **global** cap across ALL stories, not per-story.
13
- But the real issue is that the per-story attempt counter (`story.attempts`) doesn't cap the
14
- story — only the escalation logic reads it. If escalation doesn't trigger (see BUG-17),
15
- the story retries indefinitely until the global iteration limit.
16
-
17
- **Expected Behavior:** Each story should respect the tier budget:
18
- - Per-story max attempts = sum of `tierOrder` attempts (default: 5+3+2=10)
19
- - After exhausting all tiers, mark story as FAILED and move to next story
20
- - `maxIterations` should be an override safety cap, not the primary limit
21
-
22
- **Fix Location:** `src/execution/runner.ts` — add per-story attempt check before retrying
23
-
24
- ---
25
-
26
- ## BUG-17: ASSET_CHECK_FAILED doesn't trigger escalation (CODE)
27
-
28
- **Severity:** High — story loops at same tier forever
29
-
30
- **Evidence:** US-004 failed ASSET_CHECK 16 times, always at `balanced` tier.
31
- Never escalated to `powerful` despite `countsTowardEscalation: true`.
32
-
33
- **Root Cause:** The escalation logic lives in the `case "escalate"` handler
34
- (`runner.ts:367`), but ASSET_CHECK failures flow through `post-verify.ts`
35
- which only increments `story.attempts` and reverts to `pending`. It never
36
- returns an `"escalate"` action to the runner — it just reverts the story.
37
-
38
- The escalation check happens in runner.ts case "escalate" but the pipeline
39
- never returns "escalate" for verification failures. The verify stage returns
40
- "continue" (tests passed), then post-verify reverts on ASSET_CHECK but the
41
- result is already "continue".
42
-
43
- **Flow:**
44
- ```
45
- 1. Pipeline runs → verify stage → tests pass → "continue"
46
- 2. completion stage → marks story as passed
47
- 3. post-verify → ASSET_CHECK fails → reverts to pending, increments attempts
48
- 4. Runner sees "continue" from pipeline, never hits "escalate" case
49
- 5. Next iteration picks up story at SAME tier (no escalation)
50
- ```
51
-
52
- **Expected Behavior:** When `story.attempts` exceeds the current tier's budget,
53
- the runner should check tier escalation BEFORE starting the next iteration,
54
- not only in the `"escalate"` case handler.
55
-
56
- **Fix Location:**
57
- - `src/execution/runner.ts` — add tier check at start of iteration (before agent spawn)
58
- - OR `src/execution/post-verify.ts` — escalate the story's `routing.modelTier` when attempts exceed tier budget
59
-
60
- ---
61
-
62
- ## BUG-18: ASSET_CHECK error not fed back to agent prompt (CODE)
63
-
64
- **Severity:** Medium — agent repeats same mistake endlessly
65
-
66
- **Evidence:** All 17 retries of US-004 show the exact same warnings:
67
- ```
68
- ⚠️ Relevant file not found: src/finder.ts (story: US-004)
69
- ⚠️ Relevant file not found: test/finder.test.ts (story: US-004)
70
- ```
71
- The agent kept writing to `src/discovery.ts` instead of `src/finder.ts`.
72
- The ASSET_CHECK error is stored in `story.priorErrors` (post-verify.ts line 102),
73
- but the "Prior Errors" section in the prompt only showed the initial ASSET_CHECK
74
- message, not a clear instruction like "You MUST create src/finder.ts".
75
-
76
- **Expected Behavior:** The ASSET_CHECK error should be prominent in the prompt,
77
- ideally as a mandatory instruction: "REQUIRED: Create these files: src/finder.ts, test/finder.test.ts"
78
-
79
- **Fix Location:** `src/pipeline/stages/prompt.ts` — format ASSET_CHECK errors as mandatory file creation instructions
80
-
81
- ---
82
-
83
- ## BUG-19: Simple complexity routes to balanced tier, not fast (CODE)
84
-
85
- **Severity:** Medium — wastes budget on wrong tier
86
-
87
- **Evidence:** US-001 (simple) and US-004 (simple) both show:
88
- ```
89
- Complexity: simple | Model: balanced | TDD: test-after
90
- Routing: test-after: simple task (medium)
91
- ```
92
- Should start at `fast` (Haiku) per `complexityRouting.simple: "fast"`.
93
-
94
- **Root Cause:** The routing display shows `(medium)` suggesting the actual
95
- routed tier is `medium`/`balanced`, not the expected `fast`. Likely the
96
- routing stage is using test strategy routing instead of complexity routing,
97
- or there's a fallback that overrides the tier.
98
-
99
- **Fix Location:** `src/pipeline/stages/routing.ts` or equivalent — check why
100
- simple stories get routed to balanced instead of fast.
101
-
102
- ---
103
-
104
- ## Test Coverage Gaps
105
-
106
- ### Existing (35 tests in runner.test.ts)
107
- - ✅ Batch prompt building (3 tests)
108
- - ✅ Batch grouping (8 tests)
109
- - ✅ Batch precompute (5 tests)
110
- - ✅ Batch failure escalation (3 tests)
111
- - ✅ Queue commands (6 tests)
112
- - ✅ Escalation chain (7 tests)
113
- - ✅ Hook security/loading/env (19 tests in hooks.test.ts)
114
-
115
- ### Missing (needed to prevent BUG-16–19)
116
- - ❌ **Per-story iteration capping** — story should fail after tier budget exhausted
117
- - ❌ **ASSET_CHECK → escalation trigger** — post-verify failure should escalate tier
118
- - ❌ **ASSET_CHECK error in prompt** — verify mandatory files appear in next prompt
119
- - ❌ **Complexity → tier routing accuracy** — simple=fast, medium=balanced, complex=powerful
120
- - ❌ **Post-verify revert + re-queue** — story reverted correctly after ASSET_CHECK
121
- - ❌ **End-to-end: story passes on retry after escalation** — integration test
122
- - ❌ **End-to-end: story fails permanently after all tiers exhausted** — integration test
123
- - ❌ **Verification unit tests** — no `test/verification.test.ts` exists
124
- - ❌ **Post-verify unit tests** — no `test/post-verify.test.ts` exists
125
-
126
- ---
127
-
128
- *Filed 2026-02-19 from dogfood run C (plan→analyze→run pipeline test)*
129
-
130
- ## BUG-21: No model name validation before run (CONFIG)
131
-
132
- **Severity:** Medium — causes silent failures, wasted retries
133
-
134
- **Evidence:** Dogfood Run D — `claude-opus-4` not recognized by Claude Code CLI.
135
- Agent exited with error message but exit code 0 on some attempts, exit code 1 on others.
136
- TDD test-writer session ran 3 times producing nothing. Wasted ~$0.13 and 3 minutes.
137
-
138
- **Root Cause:** No validation of model names in config against the agent's accepted models.
139
- `claude-opus-4` is not a valid Claude Code model name (`claude-opus-4-5` or `opus` alias is).
140
-
141
- **Expected Behavior:** Before starting a run, validate that all configured model names
142
- are accepted by the target agent. Fail fast with a clear error message.
143
-
144
- **Future Design:** When supporting multiple code agents (Claude, Cursor, Copilot, etc.),
145
- each agent adapter should expose a `validateModel(name: string)` method or provide
146
- a model registry. Worst case: maintain a `models.json` per provider.
147
-
148
- **Workaround:** Use CLI aliases (`haiku`, `sonnet`, `opus`) which always resolve to latest.
149
-
150
- **Fix Location:** `src/config/validate.ts` — add model validation step.
151
- Agent adapter interface: add optional `getSupportedModels()` or `validateModel()`.
152
-
153
- **Priority:** Low — workaround available (use aliases)
154
-
155
- ---
156
-
157
- ## BUG-21: Claude Code child processes orphaned after TDD session failure
158
-
159
- **Found:** Run D, US-007 TDD test-writer failure (2026-02-19 20:11)
160
- **Severity:** Medium (resource leak, CPU waste)
161
- **Component:** `src/tdd/orchestrator.ts` / `src/agents/claude-adapter.ts`
162
-
163
- ### Symptoms
164
- - `bun test` (PID 76312) running at 99.9% CPU for 2+ hours after Run D completed
165
- - Process orphaned (PPID=1), original parent (PGID leader 76309) dead
166
- - Sibling `tail -5` (PID 76313) also orphaned, plus a zombie child (PID 76555)
167
- - Pipeline: `bun test 2>&1 | tail -5` — spawned by Claude Code internally during TDD test-writer session
168
-
169
- ### Root Cause
170
- When Claude Code exits with code 1 (TDD session failure), it does NOT clean up shell commands it spawned internally. nax kills the Claude Code process via the agent adapter, but Claude Code's child processes (`bun test | tail -5`) are in a different process group (PGID 76309 vs Claude Code's own PID).
171
-
172
- nax's `executeWithTimeout()` in `verification.ts` properly kills process groups for commands IT spawns, but TDD session child processes are spawned by Claude Code, not by nax.
173
-
174
- ### Process Tree at Failure
175
- ```
176
- launchd (1)
177
- ├── bun test (76312) ← orphaned, 99.9% CPU, PGID 76309
178
- ├── tail -5 (76313) ← orphaned, sleeping, PGID 76309
179
- └── <defunct> (76555) ← zombie child of 76312
180
- ```
181
- Original PGID leader (76309) is dead — likely the shell Claude Code spawned.
182
-
183
- ### Fix Options
184
- 1. **nax-side (recommended):** After agent adapter returns failure, run `pkill -P <agent_pid>` recursively or `kill -- -<pgid>` to clean up the entire process tree. Add a `cleanupProcessTree(pid)` utility.
185
- 2. **nax-side (belt+suspenders):** Track all child PIDs before/after TDD session via `pgrep -P`, kill any new orphans.
186
- 3. **Upstream (Claude Code):** File issue — Claude Code should clean up child processes on abnormal exit.
187
-
188
- ### Affected Code
189
- - `src/tdd/orchestrator.ts` — `runTddSession()` calls agent adapter but doesn't clean up process tree on failure
190
- - `src/agents/claude-adapter.ts` — `runSession()` kills Claude Code process but not its children
191
-
192
- ### Workaround
193
- Manually kill orphaned processes: `kill -9 -76309` (kill entire PGID)
194
-
195
- ---
196
-
197
- ## BUG-22: TDD orchestrator treats verifier fix-and-commit as failure
198
-
199
- **Found:** Run D2, US-009 (2026-02-19 22:23)
200
- **Severity:** Medium (false positive pause, wastes human review time)
201
- **Component:** `src/tdd/orchestrator.ts`
202
-
203
- ### Symptoms
204
- - US-009 verifier session fixed flaky watcher tests (sleep timing) and added README.md
205
- - All 355 tests pass, 98.7% coverage, clean commit `9f9b048`
206
- - nax paused with "Verifier session identified issues" requiring human review
207
- - No actual issues — the work is complete and correct
208
-
209
- ### Root Cause
210
- `runThreeSessionTdd()` line 387:
211
- ```typescript
212
- const allSuccessful = sessions.every((s) => s.success);
213
- ```
214
-
215
- `session.success` is derived from the Claude Code agent's **exit code**, not the final test state. The verifier likely:
216
- 1. Ran `bun test` → some tests failed (flaky watcher timing)
217
- 2. Fixed the tests (increased sleep timers)
218
- 3. Ran `bun test` again → 355 pass
219
- 4. Committed the fix
220
- 5. But Claude Code exited with code 1 (possibly from the initial failed test run, or from an internal error during the long session)
221
-
222
- The orchestrator checks `sessions.every(s => s.success)` which uses exit code, not actual test outcomes. A verifier that **finds and fixes issues is doing its job** — that's a success, not a failure.
223
-
224
- ### Fix Options
225
- 1. **Post-TDD verification (recommended):** After all 3 sessions complete, run `bun test` independently. If tests pass → mark success regardless of individual session exit codes.
226
- 2. **Verifier exit code tolerance:** If verifier session has commits AND tests pass (checked via isolation), treat as success even with non-zero exit.
227
- 3. **Two-phase verifier:** Split verifier into "check" (run tests, report) and "fix" (apply fixes). Only flag if "fix" also fails.
228
-
229
- ### Evidence
230
- ```
231
- git log -1: "fix: verify and adjust Comprehensive integration tests and documentation"
232
- - 355 tests pass, 0 fail
233
- - 98.70% function coverage, 95.52% line coverage
234
- - Files changed: README.md (+261), test/integration.test.ts (+7/-7)
235
-
236
- nax output: "⏸ Human review needed: Verifier session identified issues"
237
- ```
238
-
239
- ### Impact
240
- - False pause blocks automated pipeline completion
241
- - Human must manually verify and resume — defeats automation purpose
242
- - Cost: $4.95 spent on US-009, then paused on a success
243
- - Combined with misrouting (US-009 shouldn't have been TDD), this story cost ~$5 for ~$0.15 of actual work