@nathapp/nax 0.28.0 → 0.30.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (385) hide show
  1. package/CHANGELOG.md +23 -2
  2. package/bin/nax.ts +2 -3
  3. package/dist/nax.js +72753 -0
  4. package/package.json +11 -3
  5. package/src/cli/analyze.ts +2 -7
  6. package/src/cli/config.ts +3 -1
  7. package/src/config/defaults.ts +1 -0
  8. package/src/config/schemas.ts +1 -0
  9. package/src/config/types.ts +1 -0
  10. package/src/context/builder.ts +10 -1
  11. package/src/execution/lifecycle/headless-formatter.ts +2 -4
  12. package/src/prompts/builder.ts +12 -69
  13. package/src/prompts/sections/isolation.ts +38 -8
  14. package/src/prompts/sections/role-task.ts +79 -17
  15. package/src/review/runner.ts +6 -1
  16. package/src/version.ts +2 -1
  17. package/.claude/rules/01-project-conventions.md +0 -34
  18. package/.claude/rules/02-test-architecture.md +0 -39
  19. package/.claude/rules/03-test-writing.md +0 -58
  20. package/.claude/rules/04-forbidden-patterns.md +0 -29
  21. package/.claude/settings.json +0 -15
  22. package/.githooks/pre-commit +0 -16
  23. package/.gitlab-ci.yml +0 -103
  24. package/.mcp.json +0 -8
  25. package/BRIEF.md +0 -140
  26. package/CLAUDE.md +0 -143
  27. package/US-007-IMPLEMENTATION.md +0 -139
  28. package/biome.json +0 -14
  29. package/bun.lock +0 -163
  30. package/bunfig.toml +0 -12
  31. package/docker-compose.test.yml +0 -15
  32. package/docs/20260216-fix-plan-context-review.md +0 -56
  33. package/docs/20260216-relentless-vs-ngent-comparison.md +0 -208
  34. package/docs/20260216-v02-plan.md +0 -136
  35. package/docs/20260216-v02-review.md +0 -685
  36. package/docs/20260217-dogfood-findings.md +0 -56
  37. package/docs/20260217-p2-plus-plan.md +0 -117
  38. package/docs/20260217-partial-fixes-plan.md +0 -62
  39. package/docs/20260217-plan-analyze-spec.md +0 -117
  40. package/docs/20260217-post-impl-review.md +0 -1137
  41. package/docs/20260217-quick-wins-plan.md +0 -66
  42. package/docs/20260217-split-runner-plan.md +0 -75
  43. package/docs/20260217-v03-impl-plan.md +0 -80
  44. package/docs/20260217-v03-post-impl-review.md +0 -589
  45. package/docs/20260217-v04-impl-plan.md +0 -86
  46. package/docs/20260217-v05-post-impl-review.md +0 -850
  47. package/docs/20260217-v06-post-impl-review.md +0 -817
  48. package/docs/20260218-adr003-port-plan.md +0 -151
  49. package/docs/20260218-review-adr003-verification.md +0 -175
  50. package/docs/20260219-fix-plan-bug16-19.md +0 -79
  51. package/docs/20260219-fix-plan-bug20-22.md +0 -114
  52. package/docs/20260219-plan-llm-routing.md +0 -116
  53. package/docs/20260219-review-bug20-22-fixes.md +0 -135
  54. package/docs/20260219-routing-baseline-keyword.md +0 -63
  55. package/docs/20260220-plan-structured-logging-p1.md +0 -80
  56. package/docs/20260220-plan-structured-logging-p2.md +0 -37
  57. package/docs/20260220-review-llm-routing.md +0 -180
  58. package/docs/20260220-review-post-fix-llm-routing.md +0 -70
  59. package/docs/20260221-fix-plan-relevantfiles-split.md +0 -101
  60. package/docs/20260221-fix-plan-routing-mode.md +0 -125
  61. package/docs/20260221-review-v0.9-implementation.md +0 -379
  62. package/docs/20260222-fix-plan-v091-routing-isolation.md +0 -197
  63. package/docs/20260223-fix-plan-prompt-audit.md +0 -62
  64. package/docs/20260224-nax-roadmap-phases.md +0 -189
  65. package/docs/20260225-phase2-llm-service-layer.md +0 -401
  66. package/docs/20260225-review-v0.10.1.md +0 -187
  67. package/docs/20260303-v010-implementation-plan.md +0 -165
  68. package/docs/20260304-review-nax.md +0 -492
  69. package/docs/CLAUDE.md.bak +0 -191
  70. package/docs/ROADMAP.md +0 -390
  71. package/docs/SPEC-rectification.md +0 -0
  72. package/docs/SPEC.md +0 -324
  73. package/docs/US-001-plugin-loading-verification.md +0 -152
  74. package/docs/adr/ADR-005-implementation-plan.md +0 -655
  75. package/docs/adr/ADR-005-pipeline-re-architecture.md +0 -464
  76. package/docs/architecture-analysis.md +0 -1076
  77. package/docs/bugs/BUG-21-escalation-null-attempts.md +0 -48
  78. package/docs/bugs-from-dogfood-run-c.md +0 -243
  79. package/docs/code-review-20260228.md +0 -612
  80. package/docs/code-review-v0.15.0.md +0 -629
  81. package/docs/hook-lifecycle-test-plan.md +0 -149
  82. package/docs/releases/v0.11.0-and-earlier.md +0 -20
  83. package/docs/releases/v0.12.0.md +0 -15
  84. package/docs/releases/v0.13.0.md +0 -14
  85. package/docs/releases/v0.14.0.md +0 -20
  86. package/docs/releases/v0.14.1.md +0 -36
  87. package/docs/releases/v0.14.2.md +0 -51
  88. package/docs/releases/v0.14.3.md +0 -174
  89. package/docs/releases/v0.14.4.md +0 -94
  90. package/docs/releases/v0.15.0.md +0 -502
  91. package/docs/releases/v0.15.1.md +0 -170
  92. package/docs/releases/v0.15.3.md +0 -193
  93. package/docs/specs/bug-039-orphan-processes.md +0 -131
  94. package/docs/specs/bug-040-review-rectification.md +0 -82
  95. package/docs/specs/bug-041-cross-story-test-isolation.md +0 -88
  96. package/docs/specs/bug-042-verifier-failure-capture.md +0 -117
  97. package/docs/specs/bun-pty-migration.md +0 -171
  98. package/docs/specs/central-run-registry.md +0 -116
  99. package/docs/specs/feat-010-smart-runner-git-history.md +0 -96
  100. package/docs/specs/feat-011-file-context-strategy.md +0 -73
  101. package/docs/specs/feat-012-tdd-writer-tier.md +0 -79
  102. package/docs/specs/feat-013-test-after-review.md +0 -89
  103. package/docs/specs/feat-014-heartbeat-observability.md +0 -127
  104. package/docs/specs/status-file-consolidation.md +0 -93
  105. package/docs/specs/status-file-v0.10.1.md +0 -812
  106. package/docs/specs/trigger-completion.md +0 -145
  107. package/docs/specs/verification-architecture-v2.md +0 -343
  108. package/docs/tdd/strategies.md +0 -97
  109. package/docs/v0.10-global-config.md +0 -206
  110. package/docs/v0.10-plugin-system.md +0 -415
  111. package/docs/v0.10-prompt-optimizer.md +0 -234
  112. package/docs/v0.3-spec.md +0 -244
  113. package/docs/v0.4-spec.md +0 -140
  114. package/docs/v0.5-spec.md +0 -237
  115. package/docs/v0.6-spec.md +0 -371
  116. package/docs/v0.7-spec.md +0 -177
  117. package/docs/v0.8-llm-routing.md +0 -206
  118. package/docs/v0.8-structured-logging.md +0 -132
  119. package/docs/v0.9.3-prompt-audit.md +0 -112
  120. package/examples/plugins/console-reporter/index.test.ts +0 -207
  121. package/examples/plugins/console-reporter/index.ts +0 -110
  122. package/memory/topic/feat-010-baseref.md +0 -28
  123. package/memory/topic/feat-013-test-after-deprecation.md +0 -22
  124. package/nax/config.json +0 -154
  125. package/nax/features/bug-039-medium/prd.json +0 -45
  126. package/nax/features/bugfix-v0171/prd.json +0 -52
  127. package/nax/features/central-run-registry/prd.json +0 -105
  128. package/nax/features/config-management/prd.json +0 -108
  129. package/nax/features/config-management/progress.txt +0 -5
  130. package/nax/features/diagnose/acceptance.test.ts +0 -414
  131. package/nax/features/diagnose/prd.json +0 -41
  132. package/nax/features/nax-compliance/prd.json +0 -52
  133. package/nax/features/nax-compliance/progress.txt +0 -1
  134. package/nax/features/orchestration-fixes/prd.json +0 -89
  135. package/nax/features/orchestration-fixes/progress.txt +0 -1
  136. package/nax/features/plugin-integration/US-007-VERIFICATION.md +0 -259
  137. package/nax/features/plugin-integration/prd.json +0 -208
  138. package/nax/features/plugin-integration/progress.txt +0 -5
  139. package/nax/features/post-rearch-bugfix/prd.json +0 -137
  140. package/nax/features/precheck/prd.json +0 -205
  141. package/nax/features/precheck/progress.txt +0 -15
  142. package/nax/features/prompt-builder/prd.json +0 -152
  143. package/nax/features/prompt-builder/progress.txt +0 -3
  144. package/nax/features/review-quality/prd.json +0 -55
  145. package/nax/features/routing-persistence/prd.json +0 -104
  146. package/nax/features/routing-persistence/progress.txt +0 -1
  147. package/nax/features/smart-test-runner/plan.md +0 -7
  148. package/nax/features/smart-test-runner/prd.json +0 -203
  149. package/nax/features/smart-test-runner/progress.txt +0 -13
  150. package/nax/features/smart-test-runner/spec.md +0 -7
  151. package/nax/features/smart-test-runner/tasks.md +0 -8
  152. package/nax/features/status-file-consolidation/prd.json +0 -106
  153. package/nax/features/structured-logging/prd.json +0 -199
  154. package/nax/features/trigger-completion/prd.json +0 -150
  155. package/nax/features/trigger-completion/progress.txt +0 -7
  156. package/nax/features/unlock/prd.json +0 -36
  157. package/nax/features/v0.18.3-execution-reliability/prd.json +0 -80
  158. package/nax/features/v0.18.3-execution-reliability/progress.txt +0 -3
  159. package/nax/features/v0.19.0-hardening/plan.md +0 -7
  160. package/nax/features/v0.19.0-hardening/prd.json +0 -84
  161. package/nax/features/v0.19.0-hardening/progress.txt +0 -7
  162. package/nax/features/v0.19.0-hardening/spec.md +0 -18
  163. package/nax/features/v0.19.0-hardening/tasks.md +0 -8
  164. package/nax/features/verify-v2/prd.json +0 -79
  165. package/nax/features/verify-v2/progress.txt +0 -3
  166. package/nax/status.json +0 -36
  167. package/src/prompts/templates/implementer.ts +0 -6
  168. package/src/prompts/templates/single-session.ts +0 -6
  169. package/src/prompts/templates/test-writer.ts +0 -6
  170. package/src/prompts/templates/verifier.ts +0 -6
  171. package/test/COVERAGE-GAPS.md +0 -333
  172. package/test/e2e/cm-003-default-view.test.ts +0 -195
  173. package/test/e2e/plan-analyze-run.test.ts +0 -902
  174. package/test/helpers/helpers.test.ts +0 -295
  175. package/test/helpers/timeout.ts +0 -42
  176. package/test/integration/US-002-TEST-SUMMARY.md +0 -107
  177. package/test/integration/US-003-TEST-SUMMARY.md +0 -149
  178. package/test/integration/US-004-TEST-SUMMARY.md +0 -106
  179. package/test/integration/US-005-TEST-SUMMARY.md +0 -138
  180. package/test/integration/US-007-TEST-SUMMARY.md +0 -100
  181. package/test/integration/cli/agent-validation.test.ts +0 -439
  182. package/test/integration/cli/cli-config-default-edge-cases.test.ts +0 -223
  183. package/test/integration/cli/cli-config-default-view.test.ts +0 -230
  184. package/test/integration/cli/cli-config-diff.test.ts +0 -461
  185. package/test/integration/cli/cli-config-prompts-explain.test.ts +0 -74
  186. package/test/integration/cli/cli-config.test.ts +0 -737
  187. package/test/integration/cli/cli-diagnose.test.ts +0 -595
  188. package/test/integration/cli/cli-logs.test.ts +0 -346
  189. package/test/integration/cli/cli-plugins.test.ts +0 -679
  190. package/test/integration/cli/cli-precheck.test.ts +0 -372
  191. package/test/integration/cli/cli-run-headless.test.ts +0 -174
  192. package/test/integration/cli/cli.test.ts +0 -76
  193. package/test/integration/cli/precheck-integration.test.ts +0 -476
  194. package/test/integration/cli/precheck-orchestrator.test.ts +0 -247
  195. package/test/integration/cli/precheck.test.ts +0 -806
  196. package/test/integration/config/config-loader.test.ts +0 -266
  197. package/test/integration/config/config.test.ts +0 -444
  198. package/test/integration/config/merger.test.ts +0 -466
  199. package/test/integration/config/paths.test.ts +0 -52
  200. package/test/integration/config/security-loader.test.ts +0 -83
  201. package/test/integration/context/context-integration.test.ts +0 -703
  202. package/test/integration/context/context-path-security.test.ts +0 -173
  203. package/test/integration/context/context-provider-injection.test.ts +0 -507
  204. package/test/integration/context/context-verification-integration.test.ts +0 -296
  205. package/test/integration/context/s5-greenfield-fallback.test.ts +0 -298
  206. package/test/integration/execution/execution-isolation.test.ts +0 -143
  207. package/test/integration/execution/execution.test.ts +0 -634
  208. package/test/integration/execution/feature-status-write.test.ts +0 -302
  209. package/test/integration/execution/parallel.test.ts +0 -251
  210. package/test/integration/execution/prd-pause.test.ts +0 -205
  211. package/test/integration/execution/prd-resolvers.test.ts +0 -186
  212. package/test/integration/execution/progress.test.ts +0 -34
  213. package/test/integration/execution/runner-batching.test.ts +0 -682
  214. package/test/integration/execution/runner-config-plugins.test.ts +0 -462
  215. package/test/integration/execution/runner-escalation.test.ts +0 -561
  216. package/test/integration/execution/runner-fixes.test.ts +0 -400
  217. package/test/integration/execution/runner-plugin-integration.test.ts +0 -544
  218. package/test/integration/execution/runner-queue-and-attempts.test.ts +0 -476
  219. package/test/integration/execution/status-file-integration.test.ts +0 -289
  220. package/test/integration/execution/status-file.test.ts +0 -380
  221. package/test/integration/execution/status-writer.test.ts +0 -447
  222. package/test/integration/execution/story-id-in-events.test.ts +0 -274
  223. package/test/integration/interaction/interaction-chain-pipeline.test.ts +0 -476
  224. package/test/integration/pipeline/hooks.test.ts +0 -363
  225. package/test/integration/pipeline/pipeline-acceptance.test.ts +0 -303
  226. package/test/integration/pipeline/pipeline-events.test.ts +0 -476
  227. package/test/integration/pipeline/pipeline.test.ts +0 -660
  228. package/test/integration/pipeline/reporter-lifecycle.test.ts +0 -862
  229. package/test/integration/pipeline/verify-stage.test.ts +0 -286
  230. package/test/integration/plan/analyze-integration.test.ts +0 -262
  231. package/test/integration/plan/analyze-scanner.test.ts +0 -132
  232. package/test/integration/plan/logger.test.ts +0 -461
  233. package/test/integration/plan/plan.test.ts +0 -157
  234. package/test/integration/plugins/config-integration.test.ts +0 -173
  235. package/test/integration/plugins/config-resolution.test.ts +0 -523
  236. package/test/integration/plugins/loader.test.ts +0 -644
  237. package/test/integration/plugins/plugins-registry.test.ts +0 -747
  238. package/test/integration/plugins/validator.test.ts +0 -564
  239. package/test/integration/prompts/pb-004-migration.test.ts +0 -523
  240. package/test/integration/review/review-config-commands.test.ts +0 -320
  241. package/test/integration/review/review-config-schema.test.ts +0 -117
  242. package/test/integration/review/review-plugin-integration.test.ts +0 -729
  243. package/test/integration/review/review.test.ts +0 -150
  244. package/test/integration/routing/plugin-routing-advanced.test.ts +0 -461
  245. package/test/integration/routing/plugin-routing-core.test.ts +0 -527
  246. package/test/integration/routing/routing-stage-bug-021.test.ts +0 -275
  247. package/test/integration/routing/routing-stage-greenfield.test.ts +0 -287
  248. package/test/integration/tdd/tdd-cleanup.test.ts +0 -246
  249. package/test/integration/tdd/tdd-orchestrator-core.test.ts +0 -565
  250. package/test/integration/tdd/tdd-orchestrator-failureCategory.test.ts +0 -355
  251. package/test/integration/tdd/tdd-orchestrator-fallback.test.ts +0 -311
  252. package/test/integration/tdd/tdd-orchestrator-lite.test.ts +0 -289
  253. package/test/integration/tdd/tdd-orchestrator-prompts.test.ts +0 -260
  254. package/test/integration/tdd/tdd-orchestrator-verdict.test.ts +0 -536
  255. package/test/integration/tmp/headless-test/test.jsonl +0 -30
  256. package/test/integration/verification/test-scanner.test.ts +0 -403
  257. package/test/integration/verification/verification-asset-check.test.ts +0 -143
  258. package/test/integration/worktree/manager.test.ts +0 -218
  259. package/test/integration/worktree/worktree-merge.test.ts +0 -341
  260. package/test/manual/logging-formatter-demo.ts +0 -158
  261. package/test/ui/tui-agent-panel.test.tsx +0 -99
  262. package/test/ui/tui-pty-integration.test.tsx +0 -146
  263. package/test/unit/acceptance.test.ts +0 -187
  264. package/test/unit/agent-stderr-capture.test.ts +0 -147
  265. package/test/unit/agents/claude.test.ts +0 -107
  266. package/test/unit/analyze-classifier.test.ts +0 -216
  267. package/test/unit/analyze.test.ts +0 -224
  268. package/test/unit/auto-detect.test.ts +0 -250
  269. package/test/unit/cli-status-project-level.test.ts +0 -283
  270. package/test/unit/cli-status.test.ts +0 -418
  271. package/test/unit/commands/common.test.ts +0 -321
  272. package/test/unit/commands/logs.test.ts +0 -458
  273. package/test/unit/commands/runs.test.ts +0 -303
  274. package/test/unit/commands/unlock.test.ts +0 -320
  275. package/test/unit/config/defaults.test.ts +0 -70
  276. package/test/unit/config/quality-commands-schema.test.ts +0 -72
  277. package/test/unit/config/regression-gate-schema.test.ts +0 -160
  278. package/test/unit/config/smart-runner-flag.test.ts +0 -250
  279. package/test/unit/constitution-generators.test.ts +0 -161
  280. package/test/unit/constitution.test.ts +0 -210
  281. package/test/unit/context/context-autodetect.test.ts +0 -297
  282. package/test/unit/context/context-build.test.ts +0 -575
  283. package/test/unit/context/context-coverage.test.ts +0 -236
  284. package/test/unit/context/context-error.test.ts +0 -93
  285. package/test/unit/context/context-estimate-tokens.test.ts +0 -201
  286. package/test/unit/context/context-format.test.ts +0 -302
  287. package/test/unit/context/context-isolation.test.ts +0 -267
  288. package/test/unit/context/context-sort.test.ts +0 -93
  289. package/test/unit/context/context-story.test.ts +0 -108
  290. package/test/unit/context/prior-failures.test.ts +0 -463
  291. package/test/unit/context.test.ts +0 -1726
  292. package/test/unit/cost.test.ts +0 -231
  293. package/test/unit/crash-recovery.test.ts +0 -309
  294. package/test/unit/escalation.test.ts +0 -127
  295. package/test/unit/execution/lifecycle/run-completion.test.ts +0 -240
  296. package/test/unit/execution/lifecycle/run-regression.test.ts +0 -420
  297. package/test/unit/execution/pid-registry.test.ts +0 -241
  298. package/test/unit/execution/sequential-executor.test.ts +0 -235
  299. package/test/unit/execution/sfc-004-dead-code-cleanup.test.ts +0 -89
  300. package/test/unit/execution/structured-failure.test.ts +0 -415
  301. package/test/unit/execution-logging-stderr.test.ts +0 -157
  302. package/test/unit/execution-stage.test.ts +0 -123
  303. package/test/unit/fix-generator.test.ts +0 -276
  304. package/test/unit/formatters.test.ts +0 -468
  305. package/test/unit/greenfield.test.ts +0 -180
  306. package/test/unit/hooks/shell-security.test.ts +0 -40
  307. package/test/unit/interaction/auto-plugin.test.ts +0 -162
  308. package/test/unit/interaction/human-review-trigger.test.ts +0 -165
  309. package/test/unit/interaction-network-failures.test.ts +0 -390
  310. package/test/unit/interaction-plugins.test.ts +0 -472
  311. package/test/unit/logging/formatter.test.ts +0 -456
  312. package/test/unit/merge.test.ts +0 -269
  313. package/test/unit/metrics/aggregator.test.ts +0 -164
  314. package/test/unit/metrics/tracker.test.ts +0 -186
  315. package/test/unit/metrics.test.ts +0 -276
  316. package/test/unit/optimizer/noop.optimizer.test.ts +0 -125
  317. package/test/unit/optimizer/rule-based.optimizer.test.ts +0 -358
  318. package/test/unit/pipeline/event-bus.test.ts +0 -105
  319. package/test/unit/pipeline/routing-partial-override.test.ts +0 -121
  320. package/test/unit/pipeline/runner-retry.test.ts +0 -89
  321. package/test/unit/pipeline/stages/autofix.test.ts +0 -97
  322. package/test/unit/pipeline/stages/completion-review-gate.test.ts +0 -218
  323. package/test/unit/pipeline/stages/execution-ambiguity.test.ts +0 -311
  324. package/test/unit/pipeline/stages/execution-merge-conflict.test.ts +0 -218
  325. package/test/unit/pipeline/stages/rectify.test.ts +0 -101
  326. package/test/unit/pipeline/stages/regression-stage.test.ts +0 -69
  327. package/test/unit/pipeline/stages/review.test.ts +0 -201
  328. package/test/unit/pipeline/stages/routing-idempotence.test.ts +0 -139
  329. package/test/unit/pipeline/stages/routing-initial-complexity.test.ts +0 -321
  330. package/test/unit/pipeline/stages/routing-persistence.test.ts +0 -380
  331. package/test/unit/pipeline/stages/verify.test.ts +0 -267
  332. package/test/unit/pipeline/subscribers/events-writer.test.ts +0 -227
  333. package/test/unit/pipeline/subscribers/hooks.test.ts +0 -84
  334. package/test/unit/pipeline/subscribers/interaction.test.ts +0 -313
  335. package/test/unit/pipeline/subscribers/registry.test.ts +0 -149
  336. package/test/unit/pipeline/subscribers/reporters.test.ts +0 -90
  337. package/test/unit/pipeline/verify-smart-runner.test.ts +0 -345
  338. package/test/unit/prd-auto-default.test.ts +0 -291
  339. package/test/unit/prd-failure-category.test.ts +0 -177
  340. package/test/unit/prd-get-next-story.test.ts +0 -215
  341. package/test/unit/precheck/checks-warnings.test.ts +0 -114
  342. package/test/unit/precheck-checks.test.ts +0 -841
  343. package/test/unit/precheck-story-size-gate.test.ts +0 -288
  344. package/test/unit/precheck-types.test.ts +0 -143
  345. package/test/unit/prompts/builder.test.ts +0 -258
  346. package/test/unit/prompts/loader.test.ts +0 -355
  347. package/test/unit/prompts/sections/conventions.test.ts +0 -30
  348. package/test/unit/prompts/sections/isolation.test.ts +0 -35
  349. package/test/unit/prompts/sections/role-task.test.ts +0 -40
  350. package/test/unit/prompts/sections/sections.test.ts +0 -238
  351. package/test/unit/prompts/sections/story.test.ts +0 -45
  352. package/test/unit/prompts/sections/verdict.test.ts +0 -58
  353. package/test/unit/prompts.test.ts +0 -476
  354. package/test/unit/queue.test.ts +0 -237
  355. package/test/unit/rectification.test.ts +0 -285
  356. package/test/unit/registry.test.ts +0 -288
  357. package/test/unit/review/runner.test.ts +0 -117
  358. package/test/unit/routing/content-hash.test.ts +0 -99
  359. package/test/unit/routing/routing-stability.test.ts +0 -208
  360. package/test/unit/routing/strategies/llm.test.ts +0 -306
  361. package/test/unit/routing-advanced.test.ts +0 -313
  362. package/test/unit/routing-core.test.ts +0 -341
  363. package/test/unit/routing-strategies.test.ts +0 -440
  364. package/test/unit/storyid-events.test.ts +0 -213
  365. package/test/unit/tdd-verdict.test.ts +0 -492
  366. package/test/unit/test-output-parser.test.ts +0 -377
  367. package/test/unit/ui/tui-controls.test.ts +0 -335
  368. package/test/unit/ui/tui-cost-and-pty.test.ts +0 -190
  369. package/test/unit/ui/tui-layout.test.ts +0 -379
  370. package/test/unit/ui/tui-stories.test.ts +0 -333
  371. package/test/unit/unit-isolation.test.ts +0 -135
  372. package/test/unit/utils/git.test.ts +0 -50
  373. package/test/unit/utils/path-security.test.ts +0 -47
  374. package/test/unit/utils-helpers.test.ts +0 -318
  375. package/test/unit/verdict.test.ts +0 -325
  376. package/test/unit/verification/orchestrator-types.test.ts +0 -54
  377. package/test/unit/verification/orchestrator.test.ts +0 -66
  378. package/test/unit/verification/smart-runner-config.test.ts +0 -163
  379. package/test/unit/verification/smart-runner-discovery.test.ts +0 -354
  380. package/test/unit/verification/smart-runner.test.ts +0 -262
  381. package/test/unit/verification/strategies/acceptance.test.ts +0 -33
  382. package/test/unit/verification/strategies/regression.test.ts +0 -87
  383. package/test/unit/verification/strategies/scoped.test.ts +0 -100
  384. package/test/unit/worktree-manager.test.ts +0 -159
  385. package/tsconfig.json +0 -27
@@ -1,401 +0,0 @@
1
- # Phase 2: LLM Service Layer — Merged Architecture Design
2
-
3
- *Date: 2026-02-25*
4
- *Status: Proposed (pending decision)*
5
- *Supersedes: Original issue #3 design + 2026-02-25 architecture analysis*
6
-
7
- ---
8
-
9
- ## Problem
10
-
11
- nax v0.10.0 has two coupling issues:
12
-
13
- 1. **All LLM calls go through Claude Code CLI** — routing, review, acceptance stages spawn `claude -p` just for text reasoning. Wasteful.
14
- 2. **All coding goes through CLI subprocess** — ~350MB RAM each, blocks parallelism.
15
-
16
- ## Solution: Unified LLM Service Layer + Lightweight Agent Loop
17
-
18
- Two execution paths, one provider abstraction:
19
-
20
- ```
21
- LlmProvider (interface — normalized across providers)
22
- ├── AnthropicProvider (Messages API)
23
- ├── GoogleProvider (GenerateContent API)
24
- └── OpenAiCompatProvider (Chat Completions — covers OpenAI, Moonshot, DeepSeek, OpenRouter, Groq, etc.)
25
-
26
- Used by:
27
- ├── LLM Mode (text in → text out) — routing, analyze, review, acceptance
28
- │ └── llm/client.ts → callLlm(prompt, tier, config)
29
-
30
- └── Agent Mode (text + tools) — coding, TDD
31
- ├── DirectApiAdapter — LlmProvider + tool loop (~5MB per session)
32
- └── ClaudeCodeAdapter — CLI subprocess (~350MB, for TDD/interactive)
33
- ```
34
-
35
- ## Architecture
36
-
37
- ```
38
- src/
39
- ├── llm/ # LLM Service Layer (shared by both modes)
40
- │ ├── types.ts # LlmProvider interface, Message, ToolCall types
41
- │ ├── client.ts # callLlm() with fallback chain logic
42
- │ ├── registry.ts # Create provider from config
43
- │ └── providers/
44
- │ ├── anthropic.ts # Anthropic Messages API
45
- │ ├── openai-compat.ts # OpenAI-compatible (configurable baseUrl)
46
- │ └── google.ts # Google Gemini API
47
-
48
- ├── llm/tools/ # Minimal tool set for Direct API coding
49
- │ ├── types.ts # ToolDefinition, ToolResult
50
- │ ├── read-file.ts # Read file contents
51
- │ ├── write-file.ts # Write/create file
52
- │ ├── list-files.ts # List directory
53
- │ ├── search-files.ts # Grep/ripgrep
54
- │ └── run-command.ts # Shell exec (tests, git)
55
-
56
- ├── llm/agent-loop.ts # Tool use cycle: prompt → chat() → execute tools → loop
57
-
58
- ├── agents/ # Agent adapters (implement AgentAdapter interface)
59
- │ ├── types.ts # AgentAdapter, AgentResult (unchanged)
60
- │ ├── claude.ts # ClaudeCodeAdapter (current — subprocess)
61
- │ ├── direct-api.ts # DirectApiAdapter (new — wraps llm/ + tools)
62
- │ ├── registry.ts # Resolve backend config → adapter instance
63
- │ └── cost.ts # Cost estimation (unchanged for CLI, exact for API)
64
-
65
- ├── pipeline/stages/ # Each stage declares its execution mode
66
- │ ├── routing.ts # LLM Mode → llm/client.ts
67
- │ ├── analyze.ts # LLM Mode → llm/client.ts
68
- │ ├── coding.ts # Agent Mode → agents/registry.ts
69
- │ ├── tdd.ts # Agent Mode → agents/registry.ts
70
- │ ├── review.ts # LLM Mode → llm/client.ts
71
- │ └── acceptance.ts # LLM Mode → llm/client.ts
72
-
73
- └── config/schema.ts # Extended with providers, routing, pipeline overrides
74
- ```
75
-
76
- ## LlmProvider Interface
77
-
78
- ```typescript
79
- interface LlmProvider {
80
- readonly name: string;
81
-
82
- chat(options: {
83
- model: string;
84
- messages: Message[];
85
- tools?: ToolDefinition[]; // Optional — omit for LLM Mode (reasoning only)
86
- maxTokens?: number;
87
- temperature?: number;
88
- timeoutMs?: number;
89
- }): Promise<LlmResponse>;
90
- }
91
-
92
- interface LlmResponse {
93
- content: string;
94
- toolCalls: ToolCall[]; // Normalized regardless of provider format
95
- stopReason: "end_turn" | "tool_use" | "max_tokens";
96
- usage: { inputTokens: number; outputTokens: number };
97
- }
98
- ```
99
-
100
- Single interface serves both modes:
101
- - **LLM Mode** (routing, review): `chat()` without `tools` → text response
102
- - **Agent Mode** (coding): `chat()` with `tools` → tool calls → agent loop iterates
103
-
104
- ## Provider Implementations
105
-
106
- | Implementation | Covers | API Format | Tool Use Format |
107
- |:---------------|:-------|:-----------|:---------------|
108
- | `AnthropicProvider` | Anthropic (Claude) | Messages API | `tool_use` content blocks |
109
- | `OpenAiCompatProvider` | OpenAI, Moonshot, DeepSeek, OpenRouter, Groq, Together | Chat Completions | `tool_calls` in message |
110
- | `GoogleProvider` | Google Gemini | GenerateContent | `functionCall` in parts |
111
-
112
- `OpenAiCompatProvider` takes `baseUrl` + `apiKey` — any OpenAI-compatible provider works with zero code.
113
-
114
- ## Tier-Based Fallback Chains
115
-
116
- Each tier is an ordered list of providers. On 429/error, try next in chain:
117
-
118
- ```
119
- Stage needs "balanced" tier
120
- → Try anthropic/sonnet
121
- → Rate limited (429)? → Try openai/gpt-5
122
- → Also limited? → Try next in list
123
- → All exhausted? → Stage fails with clear error
124
- ```
125
-
126
- Both LLM Mode and Agent Mode use the same fallback logic via `llm/client.ts`:
127
-
128
- ```
129
- config.models["balanced"] → [anthropic/sonnet, openai/gpt-5]
130
-
131
- ┌─────────────────┴──────────────────┐
132
- │ │
133
- LLM Mode stages Agent Mode stages
134
- (routing, review) (coding, TDD)
135
- │ │
136
- llm/client.ts DirectApiAdapter
137
- tries providers tries providers
138
- in order in order (with tools)
139
- ```
140
-
141
- Single `ModelDef` (not array) is treated as array of one — backward compatible, no fallback.
142
-
143
- ## Backend Routing
144
-
145
- Three-level resolution:
146
-
147
- ### 1. Per-Stage Pipeline Override (most specific)
148
-
149
- ```json
150
- {
151
- "pipeline": {
152
- "routing": {
153
- "primary": { "provider": "google", "model": "gemini-flash", "via": "api" },
154
- "fallback": [
155
- { "provider": "anthropic", "model": "haiku", "via": "api" },
156
- { "via": "keyword" }
157
- ]
158
- },
159
- "implementation": {
160
- "primary": { "provider": "anthropic", "model": "sonnet", "via": "api" },
161
- "fallback": [
162
- { "via": "claude-cli" }
163
- ]
164
- }
165
- }
166
- }
167
- ```
168
-
169
- The `via` field determines execution path:
170
- - `"api"` → Direct API (LLM Mode or DirectApiAdapter depending on stage)
171
- - `"claude-cli"` → Claude Code CLI subprocess
172
- - `"keyword"` → built-in keyword strategy (routing only)
173
-
174
- ### 2. Strategy Override (tdd/interactive → force backend)
175
-
176
- ```json
177
- {
178
- "agents": {
179
- "overrides": {
180
- "tdd": "claude-cli",
181
- "interactive": "claude-cli"
182
- }
183
- }
184
- }
185
- ```
186
-
187
- ### 3. Tier Routing (default)
188
-
189
- ```json
190
- {
191
- "agents": {
192
- "routing": {
193
- "fast": { "provider": "gemini", "model": "gemini-2.5-flash" },
194
- "balanced": { "provider": "anthropic", "model": "claude-sonnet-4-5" },
195
- "powerful": { "provider": "anthropic", "model": "claude-opus-4" }
196
- }
197
- }
198
- }
199
- ```
200
-
201
- ### Resolution Logic
202
-
203
- ```typescript
204
- function resolveBackend(
205
- tier: ModelTier,
206
- stage: string,
207
- context: { tdd: boolean; interactive: boolean }
208
- ): BackendConfig {
209
- const config = loadConfig();
210
-
211
- // 1. Per-stage pipeline override
212
- if (config.pipeline?.[stage]?.primary) return config.pipeline[stage];
213
-
214
- // 2. Strategy override
215
- if (context.tdd && config.agents?.overrides?.tdd) return config.agents.overrides.tdd;
216
- if (context.interactive && config.agents?.overrides?.interactive) return config.agents.overrides.interactive;
217
-
218
- // 3. Tier routing
219
- return config.agents?.routing?.[tier] ?? "claude-cli";
220
- }
221
- ```
222
-
223
- ## Full Config Example
224
-
225
- ```json
226
- {
227
- "agents": {
228
- "providers": {
229
- "anthropic": {
230
- "type": "anthropic",
231
- "apiKey": "${ANTHROPIC_API_KEY}"
232
- },
233
- "openai": {
234
- "type": "openai-compat",
235
- "baseUrl": "https://api.openai.com/v1",
236
- "apiKey": "${OPENAI_API_KEY}"
237
- },
238
- "gemini": {
239
- "type": "google",
240
- "apiKey": "${GOOGLE_API_KEY}"
241
- },
242
- "moonshot": {
243
- "type": "openai-compat",
244
- "baseUrl": "https://api.moonshot.cn/v1",
245
- "apiKey": "${MOONSHOT_API_KEY}"
246
- },
247
- "deepseek": {
248
- "type": "openai-compat",
249
- "baseUrl": "https://api.deepseek.com/v1",
250
- "apiKey": "${DEEPSEEK_API_KEY}"
251
- }
252
- },
253
- "routing": {
254
- "fast": { "provider": "gemini", "model": "gemini-2.5-flash" },
255
- "balanced": [
256
- { "provider": "anthropic", "model": "claude-sonnet-4-5" },
257
- { "provider": "openai", "model": "gpt-5" }
258
- ],
259
- "powerful": { "provider": "anthropic", "model": "claude-opus-4" }
260
- },
261
- "overrides": {
262
- "tdd": "claude-cli",
263
- "interactive": "claude-cli"
264
- }
265
- },
266
- "pipeline": {
267
- "routing": {
268
- "primary": { "provider": "gemini", "model": "gemini-flash", "via": "api" },
269
- "fallback": [{ "via": "keyword" }]
270
- }
271
- }
272
- }
273
- ```
274
-
275
- ## Minimal Tool Set (for DirectApiAdapter)
276
-
277
- | Tool | What | Lines |
278
- |:-----|:-----|:------|
279
- | `read_file` | Read file contents (with line range) | ~15 |
280
- | `write_file` | Write/create file (with mkdir -p) | ~15 |
281
- | `list_files` | List directory (recursive option) | ~15 |
282
- | `search_files` | Grep/ripgrep pattern search | ~20 |
283
- | `run_command` | Shell exec with timeout + cwd | ~30 |
284
-
285
- ~95 lines total. Each tool is sandboxed to the project workdir.
286
-
287
- ## Agent Loop
288
-
289
- ```typescript
290
- async function agentLoop(
291
- provider: LlmProvider,
292
- model: string,
293
- prompt: string,
294
- workdir: string,
295
- maxIterations: number = 50,
296
- ): Promise<AgentResult> {
297
- const tools = getToolDefinitions();
298
- let messages: Message[] = [{ role: "user", content: prompt }];
299
- let totalCost = { input: 0, output: 0 };
300
-
301
- for (let i = 0; i < maxIterations; i++) {
302
- const response = await provider.chat({ model, messages, tools });
303
- totalCost.input += response.usage.inputTokens;
304
- totalCost.output += response.usage.outputTokens;
305
-
306
- if (response.stopReason === "end_turn") {
307
- return { success: true, output: response.content, cost: totalCost };
308
- }
309
-
310
- // Execute tool calls
311
- const toolResults = await Promise.all(
312
- response.toolCalls.map(tc => executeTool(tc, workdir))
313
- );
314
-
315
- messages.push({ role: "assistant", content: response.content, toolCalls: response.toolCalls });
316
- messages.push({ role: "tool", results: toolResults });
317
- }
318
-
319
- return { success: false, output: "Max iterations reached", cost: totalCost };
320
- }
321
- ```
322
-
323
- ~150 lines with error handling, logging, and token budget checks.
324
-
325
- ## Comparison: CLI vs Direct API
326
-
327
- | Factor | Claude Code CLI | Direct API |
328
- |:-------|:---------------|:-----------|
329
- | RAM per session | ~350MB | ~5MB |
330
- | Parallel stories | OOMs at 3 | 10+ concurrent |
331
- | Cost tracking | Estimated from duration | Exact token counts from API |
332
- | Provider flexibility | Anthropic only | Any provider with tool_use |
333
- | Tool access | ~50 tools (overkill) | 5 tools (minimal, sandboxed) |
334
- | CLAUDE.md support | ✅ Auto-loaded | ❌ Must inject into prompt |
335
- | TDD isolation | ✅ PTY-based session isolation | ⚠️ Possible but needs validation |
336
- | Interactive/TUI | ✅ PTY handle | ❌ Not supported |
337
- | Dependencies | `claude` binary installed | Just HTTP (fetch) |
338
-
339
- ## Backward Compatibility
340
-
341
- - No `agents` section in config → everything uses `claude-cli` (current behavior)
342
- - No `pipeline` section → stages inherit from tier routing
343
- - Single ModelDef (not array) → treated as array of one, no fallback
344
- - Zero breaking changes
345
-
346
- ## Component Breakdown
347
-
348
- | Component | Est. Lines | What |
349
- |:----------|:-----------|:-----|
350
- | `llm/types.ts` | ~60 | LlmProvider, Message, ToolCall, LlmResponse |
351
- | `llm/providers/anthropic.ts` | ~80 | Messages API + tool_use normalization |
352
- | `llm/providers/openai-compat.ts` | ~80 | Chat Completions + configurable baseUrl |
353
- | `llm/providers/google.ts` | ~100 | GenerateContent + functionCall normalization |
354
- | `llm/registry.ts` | ~40 | Provider factory from config |
355
- | `llm/client.ts` | ~80 | callLlm() with fallback chain + retry |
356
- | `llm/tools/*.ts` (5 tools) | ~95 | read, write, list, search, exec |
357
- | `llm/agent-loop.ts` | ~150 | Tool use cycle with iteration limit |
358
- | `agents/direct-api.ts` | ~80 | DirectApiAdapter wrapping llm/ layer |
359
- | `agents/registry.ts` (update) | ~30 | Resolve backend config → adapter |
360
- | `config/schema.ts` (update) | ~100 | providers, routing, overrides, pipeline |
361
- | **Total** | **~895** | |
362
-
363
- ## Implementation Phases
364
-
365
- | Phase | Scope | Effort | Enables |
366
- |:------|:------|:-------|:--------|
367
- | P1 | LlmProvider interface + AnthropicProvider + callLlm() | Small | LLM Mode for routing/review |
368
- | P2 | OpenAiCompatProvider + GoogleProvider | Small | Multi-provider support |
369
- | P3 | Fallback chain logic in client.ts | Medium | Rate limit resilience |
370
- | P4 | Tool definitions + agent loop + DirectApiAdapter | Medium | API-based coding |
371
- | P5 | Per-stage pipeline config | Medium | Fine-grained stage control |
372
- | P6 | Wire LLM Mode into routing, review, acceptance stages | Medium | Remove CLI dependency for reasoning |
373
-
374
- P1-P2 can ship independently as a quick win (LLM Mode only). P4 is the big unlock for Phase 3 parallelism.
375
-
376
- ## Auth/Key Management
377
-
378
- Provider keys flow from config with env var expansion:
379
-
380
- ```json
381
- {
382
- "providers": {
383
- "anthropic": { "type": "anthropic", "apiKey": "${ANTHROPIC_API_KEY}" }
384
- }
385
- }
386
- ```
387
-
388
- Each provider reads `apiKey` from its config entry. Fallback to `process.env` for backward compat.
389
- Per-model env overrides via `ModelDef.env` still work (existing behavior).
390
-
391
- ## Enables Phase 3 (Parallelism)
392
-
393
- With DirectApiAdapter (~5MB each), Phase 3 becomes feasible:
394
- - N stories execute concurrently via parallel HTTP calls
395
- - Each story gets its own git worktree (from dev-orchestrator pattern)
396
- - No OOM risk — 10 concurrent stories ≈ 50MB total vs 3.5GB with CLI
397
- - Exact cost tracking per story from API token counts
398
-
399
- ---
400
-
401
- *Decision pending. This doc captures the merged architecture for future implementation.*
@@ -1,187 +0,0 @@
1
- # Deep Code Review: @nathapp/nax v0.10.1
2
-
3
- **Date:** 2026-02-25
4
- **Reviewer:** Subrina (AI)
5
- **Scope:** Status File, Failure Categories, Verifier Verdicts (31 files changed, ~5,200 lines added)
6
- **Commit Range:** v0.10.0..v0.10.1
7
-
8
- ---
9
-
10
- ## Overall Grade: A- (87/100)
11
-
12
- | Category | Score | Notes |
13
- |:---|:---:|:---|
14
- | Security | 17/20 | Solid input validation on verdict; minor path traversal concern |
15
- | Reliability | 18/20 | Atomic writes, graceful fallbacks, comprehensive error handling |
16
- | API Design | 18/20 | Clean types, good separation of concerns, extensible verdict schema |
17
- | Code Quality | 17/20 | Well-structured, good test coverage; runner.ts exceeds 400-line guideline |
18
- | Best Practices | 17/20 | Good patterns; minor DRY and cleanup opportunities |
19
-
20
- ---
21
-
22
- ## Findings
23
-
24
- ### CRITICAL
25
-
26
- *None found.*
27
-
28
- ### HIGH
29
-
30
- #### BUG-1: runner.ts exceeds 400-line guideline at 1,310 lines (HIGH)
31
-
32
- **File:** `src/execution/runner.ts`
33
- **Risk:** Cognitive complexity, harder to test individual paths, merge conflicts.
34
- **Snippet:** `wc -l src/execution/runner.ts → 1310`
35
- **Fix:** Extract the status-file write orchestration, the story pipeline dispatch, and the escalation logic into separate modules (as was done with `post-verify.ts`). The `writeStatus` closure and its state vars could become a `StatusFileWriter` class.
36
-
37
- #### SEC-1: Status file path not validated for path traversal (HIGH)
38
-
39
- **File:** `src/execution/status-file.ts:170`
40
- ```typescript
41
- export async function writeStatusFile(filePath: string, status: NaxStatusFile): Promise<void> {
42
- const tmpPath = `${filePath}.tmp`;
43
- await Bun.write(tmpPath, JSON.stringify(status, null, 2));
44
- await rename(tmpPath, filePath);
45
- }
46
- ```
47
- **Risk:** If `statusFile` option is user-controlled (e.g., via CLI arg), arbitrary file overwrite is possible. Currently mitigated by the fact that the path is derived from `--status-file` CLI arg which is operator-controlled, but no validation exists.
48
- **Fix:** Add `path.resolve()` + verify the path is within the workdir or a known safe directory.
49
-
50
- ### MEDIUM
51
-
52
- #### ENH-1: `getSafeLogger()` duplicated across files (MEDIUM)
53
-
54
- **Files:** `src/execution/runner.ts:71`, `src/execution/post-verify.ts:80`
55
- ```typescript
56
- function getSafeLogger() {
57
- try { return getLogger(); }
58
- catch { return null; }
59
- }
60
- ```
61
- **Risk:** DRY violation. If logger initialization changes, both copies need updating.
62
- **Fix:** Export `getSafeLogger()` from `src/logger.ts` or a shared utility.
63
-
64
- #### TYPE-1: `captureGitRef()` duplicated with different signatures (MEDIUM)
65
-
66
- **Files:** `src/execution/post-verify.ts:17` (returns `string | undefined`), `src/tdd/orchestrator.ts:30` (returns `string`, throws on failure)
67
- **Risk:** Inconsistent error handling for the same operation. The orchestrator version will throw if git isn't available.
68
- **Fix:** Consolidate into a single shared function in a git utility module. Prefer the `string | undefined` signature for resilience.
69
-
70
- #### BUG-2: `writeStatus` swallows errors silently during critical state transitions (MEDIUM)
71
-
72
- **File:** `src/execution/runner.ts:188-193`
73
- ```typescript
74
- catch (err) {
75
- safeLogger?.warn("status-file", "Failed to write status file (non-fatal)", {
76
- path: statusFile,
77
- error: (err as Error).message,
78
- });
79
- }
80
- ```
81
- **Risk:** If the status file write fails repeatedly (e.g., disk full), the only signal is a warn log that may be missed. External tooling polling the file would see stale data.
82
- **Fix:** Consider a counter; after N consecutive failures, emit a more prominent error or set a flag on the run state.
83
-
84
- #### ENH-2: Verdict validation could use a schema library (MEDIUM)
85
-
86
- **File:** `src/tdd/verdict.ts:85-120` — `isValidVerdict()` is 35 lines of manual validation.
87
- **Risk:** Verbose and error-prone as the schema evolves. New fields require manual validation additions.
88
- **Fix:** Consider using `zod` or `valibot` for declarative schema validation. However, the current zero-dependency approach is acceptable for a CLI tool — marking as enhancement, not a bug.
89
-
90
- #### MEM-1: Atomic write leaves orphan `.tmp` file on crash between write and rename (MEDIUM)
91
-
92
- **File:** `src/execution/status-file.ts:172-175`
93
- **Risk:** If the process crashes after `Bun.write()` but before `rename()`, a `.tmp` file persists. Not a memory leak but can cause confusion.
94
- **Fix:** Add cleanup of stale `.tmp` files at runner startup, or use `try/finally` to attempt cleanup.
95
-
96
- ### LOW
97
-
98
- #### STYLE-1: Inconsistent `as FailureCategory` casts in orchestrator (LOW)
99
-
100
- **File:** `src/tdd/orchestrator.ts:293, 312`
101
- ```typescript
102
- failureCategory: "session-failure" as FailureCategory,
103
- failureCategory: "isolation-violation" as FailureCategory,
104
- ```
105
- **Risk:** The type is already `FailureCategory`, so the cast is redundant and adds noise.
106
- **Fix:** Remove the `as FailureCategory` casts — TypeScript already infers the string literal correctly.
107
-
108
- #### STYLE-2: Test file duplication — `tdd-verdict.test.ts` and `verdict.test.ts` overlap significantly (LOW)
109
-
110
- **Files:** `test/verdict.test.ts` (339 lines), `test/tdd-verdict.test.ts` (290 lines)
111
- **Risk:** Both test `readVerdict`, `cleanupVerdict`, and `categorizeVerdict` with very similar test cases. Maintenance burden doubles.
112
- **Fix:** Consolidate into a single test file. If both were generated by different subtasks, merge the more thorough assertions from each.
113
-
114
- #### ENH-3: `countProgress` iterates stories 4 times (LOW)
115
-
116
- **File:** `src/execution/status-file.ts:101-106`
117
- ```typescript
118
- const passed = stories.filter((s) => s.status === "passed").length;
119
- const failed = stories.filter((s) => s.status === "failed").length;
120
- const paused = stories.filter((s) => s.status === "paused").length;
121
- const blocked = stories.filter((s) => s.status === "blocked").length;
122
- ```
123
- **Risk:** Negligible performance impact (PRDs have <50 stories), but could be a single loop.
124
- **Fix:** Single `reduce()` pass. Low priority — readability is fine as-is.
125
-
126
- #### PERF-1: `buildStatusSnapshot` calls `Date.now()` once but creates `new Date()` separately (LOW)
127
-
128
- **File:** `src/execution/status-file.ts:154`
129
- ```typescript
130
- const now = Date.now();
131
- // ...
132
- updatedAt: new Date(now).toISOString(),
133
- ```
134
- **Risk:** None — this is actually correct and efficient. No finding here, just noting the pattern is clean.
135
-
136
- #### ENH-4: `resolveMaxAttemptsOutcome` could benefit from exhaustive matching (LOW)
137
-
138
- **File:** `src/execution/runner.ts:59-64`
139
- ```typescript
140
- export function resolveMaxAttemptsOutcome(failureCategory?: FailureCategory): "pause" | "fail" {
141
- if (failureCategory === "isolation-violation" || failureCategory === "verifier-rejected") {
142
- return "pause";
143
- }
144
- return "fail";
145
- }
146
- ```
147
- **Risk:** If new `FailureCategory` values are added, this function silently defaults to "fail".
148
- **Fix:** Add a `satisfies never` exhaustive check or use a switch statement.
149
-
150
- ---
151
-
152
- ## What Was Done Well
153
-
154
- 1. **Atomic writes** for the status file — write-to-tmp-then-rename prevents partial reads.
155
- 2. **Verdict validation** is thorough — `isValidVerdict()` checks every required field, returns null on failure (never throws).
156
- 3. **Clean separation** — verdict reading, categorization, and cleanup are separate functions with single responsibilities.
157
- 4. **Fallback paths** — when no verdict file exists, the orchestrator gracefully falls back to independent test verification.
158
- 5. **Test coverage** is excellent — ~6 test files covering all three features with edge cases, error paths, and priority ordering.
159
- 6. **Type design** — `FailureCategory` as a union type, `NaxStatusFile` with version field for forward compat, `ThreeSessionTddResult.verdict` using `null | undefined` distinction.
160
- 7. **Documentation** — JSDoc on all public APIs with clear parameter descriptions.
161
- 8. **`markStoryFailed` backward compatibility** — the `failureCategory` parameter is optional; existing callers don't break.
162
-
163
- ---
164
-
165
- ## Priority Fix Order
166
-
167
- | Priority | ID | Severity | Effort | Description |
168
- |:---:|:---|:---:|:---:|:---|
169
- | 1 | BUG-1 | HIGH | L | Extract runner.ts into smaller modules |
170
- | 2 | SEC-1 | HIGH | S | Validate status file path |
171
- | 3 | ENH-1 | MEDIUM | S | Deduplicate `getSafeLogger()` |
172
- | 4 | TYPE-1 | MEDIUM | S | Consolidate `captureGitRef()` |
173
- | 5 | STYLE-2 | LOW | M | Merge duplicate verdict test files |
174
- | 6 | ENH-4 | LOW | S | Exhaustive match in `resolveMaxAttemptsOutcome` |
175
- | 7 | STYLE-1 | LOW | S | Remove redundant `as FailureCategory` casts |
176
-
177
- *Effort: S = <30min, M = 1-2h, L = 2-4h*
178
-
179
- ---
180
-
181
- ## Summary
182
-
183
- v0.10.1 is a **solid implementation** of three well-scoped features. The code demonstrates good defensive programming (never-throw readers, atomic writes, graceful fallbacks) and strong type design. Test coverage is comprehensive with both happy-path and error-path cases.
184
-
185
- The main concern is **runner.ts growing to 1,310 lines** — the status-file integration added more state and write points to an already large file. The next refactoring pass should extract the status-file writer and the story pipeline dispatch into separate modules.
186
-
187
- No critical security or reliability issues found. The codebase is production-ready.