@nathapp/nax 0.28.0 → 0.29.0

Files changed (376)
  1. package/CHANGELOG.md +13 -2
  2. package/dist/nax.js +72691 -0
  3. package/package.json +12 -4
  4. package/src/cli/config.ts +3 -1
  5. package/src/config/defaults.ts +1 -0
  6. package/src/config/schemas.ts +1 -0
  7. package/src/config/types.ts +1 -0
  8. package/src/context/builder.ts +10 -1
  9. package/src/prompts/sections/role-task.ts +4 -2
  10. package/src/review/runner.ts +6 -1
  11. package/src/version.ts +2 -1
  12. package/.claude/rules/01-project-conventions.md +0 -34
  13. package/.claude/rules/02-test-architecture.md +0 -39
  14. package/.claude/rules/03-test-writing.md +0 -58
  15. package/.claude/rules/04-forbidden-patterns.md +0 -29
  16. package/.claude/settings.json +0 -15
  17. package/.githooks/pre-commit +0 -16
  18. package/.gitlab-ci.yml +0 -103
  19. package/.mcp.json +0 -8
  20. package/BRIEF.md +0 -140
  21. package/CLAUDE.md +0 -143
  22. package/US-007-IMPLEMENTATION.md +0 -139
  23. package/biome.json +0 -14
  24. package/bun.lock +0 -163
  25. package/bunfig.toml +0 -12
  26. package/docker-compose.test.yml +0 -15
  27. package/docs/20260216-fix-plan-context-review.md +0 -56
  28. package/docs/20260216-relentless-vs-ngent-comparison.md +0 -208
  29. package/docs/20260216-v02-plan.md +0 -136
  30. package/docs/20260216-v02-review.md +0 -685
  31. package/docs/20260217-dogfood-findings.md +0 -56
  32. package/docs/20260217-p2-plus-plan.md +0 -117
  33. package/docs/20260217-partial-fixes-plan.md +0 -62
  34. package/docs/20260217-plan-analyze-spec.md +0 -117
  35. package/docs/20260217-post-impl-review.md +0 -1137
  36. package/docs/20260217-quick-wins-plan.md +0 -66
  37. package/docs/20260217-split-runner-plan.md +0 -75
  38. package/docs/20260217-v03-impl-plan.md +0 -80
  39. package/docs/20260217-v03-post-impl-review.md +0 -589
  40. package/docs/20260217-v04-impl-plan.md +0 -86
  41. package/docs/20260217-v05-post-impl-review.md +0 -850
  42. package/docs/20260217-v06-post-impl-review.md +0 -817
  43. package/docs/20260218-adr003-port-plan.md +0 -151
  44. package/docs/20260218-review-adr003-verification.md +0 -175
  45. package/docs/20260219-fix-plan-bug16-19.md +0 -79
  46. package/docs/20260219-fix-plan-bug20-22.md +0 -114
  47. package/docs/20260219-plan-llm-routing.md +0 -116
  48. package/docs/20260219-review-bug20-22-fixes.md +0 -135
  49. package/docs/20260219-routing-baseline-keyword.md +0 -63
  50. package/docs/20260220-plan-structured-logging-p1.md +0 -80
  51. package/docs/20260220-plan-structured-logging-p2.md +0 -37
  52. package/docs/20260220-review-llm-routing.md +0 -180
  53. package/docs/20260220-review-post-fix-llm-routing.md +0 -70
  54. package/docs/20260221-fix-plan-relevantfiles-split.md +0 -101
  55. package/docs/20260221-fix-plan-routing-mode.md +0 -125
  56. package/docs/20260221-review-v0.9-implementation.md +0 -379
  57. package/docs/20260222-fix-plan-v091-routing-isolation.md +0 -197
  58. package/docs/20260223-fix-plan-prompt-audit.md +0 -62
  59. package/docs/20260224-nax-roadmap-phases.md +0 -189
  60. package/docs/20260225-phase2-llm-service-layer.md +0 -401
  61. package/docs/20260225-review-v0.10.1.md +0 -187
  62. package/docs/20260303-v010-implementation-plan.md +0 -165
  63. package/docs/20260304-review-nax.md +0 -492
  64. package/docs/CLAUDE.md.bak +0 -191
  65. package/docs/ROADMAP.md +0 -390
  66. package/docs/SPEC-rectification.md +0 -0
  67. package/docs/SPEC.md +0 -324
  68. package/docs/US-001-plugin-loading-verification.md +0 -152
  69. package/docs/adr/ADR-005-implementation-plan.md +0 -655
  70. package/docs/adr/ADR-005-pipeline-re-architecture.md +0 -464
  71. package/docs/architecture-analysis.md +0 -1076
  72. package/docs/bugs/BUG-21-escalation-null-attempts.md +0 -48
  73. package/docs/bugs-from-dogfood-run-c.md +0 -243
  74. package/docs/code-review-20260228.md +0 -612
  75. package/docs/code-review-v0.15.0.md +0 -629
  76. package/docs/hook-lifecycle-test-plan.md +0 -149
  77. package/docs/releases/v0.11.0-and-earlier.md +0 -20
  78. package/docs/releases/v0.12.0.md +0 -15
  79. package/docs/releases/v0.13.0.md +0 -14
  80. package/docs/releases/v0.14.0.md +0 -20
  81. package/docs/releases/v0.14.1.md +0 -36
  82. package/docs/releases/v0.14.2.md +0 -51
  83. package/docs/releases/v0.14.3.md +0 -174
  84. package/docs/releases/v0.14.4.md +0 -94
  85. package/docs/releases/v0.15.0.md +0 -502
  86. package/docs/releases/v0.15.1.md +0 -170
  87. package/docs/releases/v0.15.3.md +0 -193
  88. package/docs/specs/bug-039-orphan-processes.md +0 -131
  89. package/docs/specs/bug-040-review-rectification.md +0 -82
  90. package/docs/specs/bug-041-cross-story-test-isolation.md +0 -88
  91. package/docs/specs/bug-042-verifier-failure-capture.md +0 -117
  92. package/docs/specs/bun-pty-migration.md +0 -171
  93. package/docs/specs/central-run-registry.md +0 -116
  94. package/docs/specs/feat-010-smart-runner-git-history.md +0 -96
  95. package/docs/specs/feat-011-file-context-strategy.md +0 -73
  96. package/docs/specs/feat-012-tdd-writer-tier.md +0 -79
  97. package/docs/specs/feat-013-test-after-review.md +0 -89
  98. package/docs/specs/feat-014-heartbeat-observability.md +0 -127
  99. package/docs/specs/status-file-consolidation.md +0 -93
  100. package/docs/specs/status-file-v0.10.1.md +0 -812
  101. package/docs/specs/trigger-completion.md +0 -145
  102. package/docs/specs/verification-architecture-v2.md +0 -343
  103. package/docs/tdd/strategies.md +0 -97
  104. package/docs/v0.10-global-config.md +0 -206
  105. package/docs/v0.10-plugin-system.md +0 -415
  106. package/docs/v0.10-prompt-optimizer.md +0 -234
  107. package/docs/v0.3-spec.md +0 -244
  108. package/docs/v0.4-spec.md +0 -140
  109. package/docs/v0.5-spec.md +0 -237
  110. package/docs/v0.6-spec.md +0 -371
  111. package/docs/v0.7-spec.md +0 -177
  112. package/docs/v0.8-llm-routing.md +0 -206
  113. package/docs/v0.8-structured-logging.md +0 -132
  114. package/docs/v0.9.3-prompt-audit.md +0 -112
  115. package/examples/plugins/console-reporter/index.test.ts +0 -207
  116. package/examples/plugins/console-reporter/index.ts +0 -110
  117. package/memory/topic/feat-010-baseref.md +0 -28
  118. package/memory/topic/feat-013-test-after-deprecation.md +0 -22
  119. package/nax/config.json +0 -154
  120. package/nax/features/bug-039-medium/prd.json +0 -45
  121. package/nax/features/bugfix-v0171/prd.json +0 -52
  122. package/nax/features/central-run-registry/prd.json +0 -105
  123. package/nax/features/config-management/prd.json +0 -108
  124. package/nax/features/config-management/progress.txt +0 -5
  125. package/nax/features/diagnose/acceptance.test.ts +0 -414
  126. package/nax/features/diagnose/prd.json +0 -41
  127. package/nax/features/nax-compliance/prd.json +0 -52
  128. package/nax/features/nax-compliance/progress.txt +0 -1
  129. package/nax/features/orchestration-fixes/prd.json +0 -89
  130. package/nax/features/orchestration-fixes/progress.txt +0 -1
  131. package/nax/features/plugin-integration/US-007-VERIFICATION.md +0 -259
  132. package/nax/features/plugin-integration/prd.json +0 -208
  133. package/nax/features/plugin-integration/progress.txt +0 -5
  134. package/nax/features/post-rearch-bugfix/prd.json +0 -137
  135. package/nax/features/precheck/prd.json +0 -205
  136. package/nax/features/precheck/progress.txt +0 -15
  137. package/nax/features/prompt-builder/prd.json +0 -152
  138. package/nax/features/prompt-builder/progress.txt +0 -3
  139. package/nax/features/review-quality/prd.json +0 -55
  140. package/nax/features/routing-persistence/prd.json +0 -104
  141. package/nax/features/routing-persistence/progress.txt +0 -1
  142. package/nax/features/smart-test-runner/plan.md +0 -7
  143. package/nax/features/smart-test-runner/prd.json +0 -203
  144. package/nax/features/smart-test-runner/progress.txt +0 -13
  145. package/nax/features/smart-test-runner/spec.md +0 -7
  146. package/nax/features/smart-test-runner/tasks.md +0 -8
  147. package/nax/features/status-file-consolidation/prd.json +0 -106
  148. package/nax/features/structured-logging/prd.json +0 -199
  149. package/nax/features/trigger-completion/prd.json +0 -150
  150. package/nax/features/trigger-completion/progress.txt +0 -7
  151. package/nax/features/unlock/prd.json +0 -36
  152. package/nax/features/v0.18.3-execution-reliability/prd.json +0 -80
  153. package/nax/features/v0.18.3-execution-reliability/progress.txt +0 -3
  154. package/nax/features/v0.19.0-hardening/plan.md +0 -7
  155. package/nax/features/v0.19.0-hardening/prd.json +0 -84
  156. package/nax/features/v0.19.0-hardening/progress.txt +0 -7
  157. package/nax/features/v0.19.0-hardening/spec.md +0 -18
  158. package/nax/features/v0.19.0-hardening/tasks.md +0 -8
  159. package/nax/features/verify-v2/prd.json +0 -79
  160. package/nax/features/verify-v2/progress.txt +0 -3
  161. package/nax/status.json +0 -36
  162. package/test/COVERAGE-GAPS.md +0 -333
  163. package/test/e2e/cm-003-default-view.test.ts +0 -195
  164. package/test/e2e/plan-analyze-run.test.ts +0 -902
  165. package/test/helpers/helpers.test.ts +0 -295
  166. package/test/helpers/timeout.ts +0 -42
  167. package/test/integration/US-002-TEST-SUMMARY.md +0 -107
  168. package/test/integration/US-003-TEST-SUMMARY.md +0 -149
  169. package/test/integration/US-004-TEST-SUMMARY.md +0 -106
  170. package/test/integration/US-005-TEST-SUMMARY.md +0 -138
  171. package/test/integration/US-007-TEST-SUMMARY.md +0 -100
  172. package/test/integration/cli/agent-validation.test.ts +0 -439
  173. package/test/integration/cli/cli-config-default-edge-cases.test.ts +0 -223
  174. package/test/integration/cli/cli-config-default-view.test.ts +0 -230
  175. package/test/integration/cli/cli-config-diff.test.ts +0 -461
  176. package/test/integration/cli/cli-config-prompts-explain.test.ts +0 -74
  177. package/test/integration/cli/cli-config.test.ts +0 -737
  178. package/test/integration/cli/cli-diagnose.test.ts +0 -595
  179. package/test/integration/cli/cli-logs.test.ts +0 -346
  180. package/test/integration/cli/cli-plugins.test.ts +0 -679
  181. package/test/integration/cli/cli-precheck.test.ts +0 -372
  182. package/test/integration/cli/cli-run-headless.test.ts +0 -174
  183. package/test/integration/cli/cli.test.ts +0 -76
  184. package/test/integration/cli/precheck-integration.test.ts +0 -476
  185. package/test/integration/cli/precheck-orchestrator.test.ts +0 -247
  186. package/test/integration/cli/precheck.test.ts +0 -806
  187. package/test/integration/config/config-loader.test.ts +0 -266
  188. package/test/integration/config/config.test.ts +0 -444
  189. package/test/integration/config/merger.test.ts +0 -466
  190. package/test/integration/config/paths.test.ts +0 -52
  191. package/test/integration/config/security-loader.test.ts +0 -83
  192. package/test/integration/context/context-integration.test.ts +0 -703
  193. package/test/integration/context/context-path-security.test.ts +0 -173
  194. package/test/integration/context/context-provider-injection.test.ts +0 -507
  195. package/test/integration/context/context-verification-integration.test.ts +0 -296
  196. package/test/integration/context/s5-greenfield-fallback.test.ts +0 -298
  197. package/test/integration/execution/execution-isolation.test.ts +0 -143
  198. package/test/integration/execution/execution.test.ts +0 -634
  199. package/test/integration/execution/feature-status-write.test.ts +0 -302
  200. package/test/integration/execution/parallel.test.ts +0 -251
  201. package/test/integration/execution/prd-pause.test.ts +0 -205
  202. package/test/integration/execution/prd-resolvers.test.ts +0 -186
  203. package/test/integration/execution/progress.test.ts +0 -34
  204. package/test/integration/execution/runner-batching.test.ts +0 -682
  205. package/test/integration/execution/runner-config-plugins.test.ts +0 -462
  206. package/test/integration/execution/runner-escalation.test.ts +0 -561
  207. package/test/integration/execution/runner-fixes.test.ts +0 -400
  208. package/test/integration/execution/runner-plugin-integration.test.ts +0 -544
  209. package/test/integration/execution/runner-queue-and-attempts.test.ts +0 -476
  210. package/test/integration/execution/status-file-integration.test.ts +0 -289
  211. package/test/integration/execution/status-file.test.ts +0 -380
  212. package/test/integration/execution/status-writer.test.ts +0 -447
  213. package/test/integration/execution/story-id-in-events.test.ts +0 -274
  214. package/test/integration/interaction/interaction-chain-pipeline.test.ts +0 -476
  215. package/test/integration/pipeline/hooks.test.ts +0 -363
  216. package/test/integration/pipeline/pipeline-acceptance.test.ts +0 -303
  217. package/test/integration/pipeline/pipeline-events.test.ts +0 -476
  218. package/test/integration/pipeline/pipeline.test.ts +0 -660
  219. package/test/integration/pipeline/reporter-lifecycle.test.ts +0 -862
  220. package/test/integration/pipeline/verify-stage.test.ts +0 -286
  221. package/test/integration/plan/analyze-integration.test.ts +0 -262
  222. package/test/integration/plan/analyze-scanner.test.ts +0 -132
  223. package/test/integration/plan/logger.test.ts +0 -461
  224. package/test/integration/plan/plan.test.ts +0 -157
  225. package/test/integration/plugins/config-integration.test.ts +0 -173
  226. package/test/integration/plugins/config-resolution.test.ts +0 -523
  227. package/test/integration/plugins/loader.test.ts +0 -644
  228. package/test/integration/plugins/plugins-registry.test.ts +0 -747
  229. package/test/integration/plugins/validator.test.ts +0 -564
  230. package/test/integration/prompts/pb-004-migration.test.ts +0 -523
  231. package/test/integration/review/review-config-commands.test.ts +0 -320
  232. package/test/integration/review/review-config-schema.test.ts +0 -117
  233. package/test/integration/review/review-plugin-integration.test.ts +0 -729
  234. package/test/integration/review/review.test.ts +0 -150
  235. package/test/integration/routing/plugin-routing-advanced.test.ts +0 -461
  236. package/test/integration/routing/plugin-routing-core.test.ts +0 -527
  237. package/test/integration/routing/routing-stage-bug-021.test.ts +0 -275
  238. package/test/integration/routing/routing-stage-greenfield.test.ts +0 -287
  239. package/test/integration/tdd/tdd-cleanup.test.ts +0 -246
  240. package/test/integration/tdd/tdd-orchestrator-core.test.ts +0 -565
  241. package/test/integration/tdd/tdd-orchestrator-failureCategory.test.ts +0 -355
  242. package/test/integration/tdd/tdd-orchestrator-fallback.test.ts +0 -311
  243. package/test/integration/tdd/tdd-orchestrator-lite.test.ts +0 -289
  244. package/test/integration/tdd/tdd-orchestrator-prompts.test.ts +0 -260
  245. package/test/integration/tdd/tdd-orchestrator-verdict.test.ts +0 -536
  246. package/test/integration/tmp/headless-test/test.jsonl +0 -30
  247. package/test/integration/verification/test-scanner.test.ts +0 -403
  248. package/test/integration/verification/verification-asset-check.test.ts +0 -143
  249. package/test/integration/worktree/manager.test.ts +0 -218
  250. package/test/integration/worktree/worktree-merge.test.ts +0 -341
  251. package/test/manual/logging-formatter-demo.ts +0 -158
  252. package/test/ui/tui-agent-panel.test.tsx +0 -99
  253. package/test/ui/tui-pty-integration.test.tsx +0 -146
  254. package/test/unit/acceptance.test.ts +0 -187
  255. package/test/unit/agent-stderr-capture.test.ts +0 -147
  256. package/test/unit/agents/claude.test.ts +0 -107
  257. package/test/unit/analyze-classifier.test.ts +0 -216
  258. package/test/unit/analyze.test.ts +0 -224
  259. package/test/unit/auto-detect.test.ts +0 -250
  260. package/test/unit/cli-status-project-level.test.ts +0 -283
  261. package/test/unit/cli-status.test.ts +0 -418
  262. package/test/unit/commands/common.test.ts +0 -321
  263. package/test/unit/commands/logs.test.ts +0 -458
  264. package/test/unit/commands/runs.test.ts +0 -303
  265. package/test/unit/commands/unlock.test.ts +0 -320
  266. package/test/unit/config/defaults.test.ts +0 -70
  267. package/test/unit/config/quality-commands-schema.test.ts +0 -72
  268. package/test/unit/config/regression-gate-schema.test.ts +0 -160
  269. package/test/unit/config/smart-runner-flag.test.ts +0 -250
  270. package/test/unit/constitution-generators.test.ts +0 -161
  271. package/test/unit/constitution.test.ts +0 -210
  272. package/test/unit/context/context-autodetect.test.ts +0 -297
  273. package/test/unit/context/context-build.test.ts +0 -575
  274. package/test/unit/context/context-coverage.test.ts +0 -236
  275. package/test/unit/context/context-error.test.ts +0 -93
  276. package/test/unit/context/context-estimate-tokens.test.ts +0 -201
  277. package/test/unit/context/context-format.test.ts +0 -302
  278. package/test/unit/context/context-isolation.test.ts +0 -267
  279. package/test/unit/context/context-sort.test.ts +0 -93
  280. package/test/unit/context/context-story.test.ts +0 -108
  281. package/test/unit/context/prior-failures.test.ts +0 -463
  282. package/test/unit/context.test.ts +0 -1726
  283. package/test/unit/cost.test.ts +0 -231
  284. package/test/unit/crash-recovery.test.ts +0 -309
  285. package/test/unit/escalation.test.ts +0 -127
  286. package/test/unit/execution/lifecycle/run-completion.test.ts +0 -240
  287. package/test/unit/execution/lifecycle/run-regression.test.ts +0 -420
  288. package/test/unit/execution/pid-registry.test.ts +0 -241
  289. package/test/unit/execution/sequential-executor.test.ts +0 -235
  290. package/test/unit/execution/sfc-004-dead-code-cleanup.test.ts +0 -89
  291. package/test/unit/execution/structured-failure.test.ts +0 -415
  292. package/test/unit/execution-logging-stderr.test.ts +0 -157
  293. package/test/unit/execution-stage.test.ts +0 -123
  294. package/test/unit/fix-generator.test.ts +0 -276
  295. package/test/unit/formatters.test.ts +0 -468
  296. package/test/unit/greenfield.test.ts +0 -180
  297. package/test/unit/hooks/shell-security.test.ts +0 -40
  298. package/test/unit/interaction/auto-plugin.test.ts +0 -162
  299. package/test/unit/interaction/human-review-trigger.test.ts +0 -165
  300. package/test/unit/interaction-network-failures.test.ts +0 -390
  301. package/test/unit/interaction-plugins.test.ts +0 -472
  302. package/test/unit/logging/formatter.test.ts +0 -456
  303. package/test/unit/merge.test.ts +0 -269
  304. package/test/unit/metrics/aggregator.test.ts +0 -164
  305. package/test/unit/metrics/tracker.test.ts +0 -186
  306. package/test/unit/metrics.test.ts +0 -276
  307. package/test/unit/optimizer/noop.optimizer.test.ts +0 -125
  308. package/test/unit/optimizer/rule-based.optimizer.test.ts +0 -358
  309. package/test/unit/pipeline/event-bus.test.ts +0 -105
  310. package/test/unit/pipeline/routing-partial-override.test.ts +0 -121
  311. package/test/unit/pipeline/runner-retry.test.ts +0 -89
  312. package/test/unit/pipeline/stages/autofix.test.ts +0 -97
  313. package/test/unit/pipeline/stages/completion-review-gate.test.ts +0 -218
  314. package/test/unit/pipeline/stages/execution-ambiguity.test.ts +0 -311
  315. package/test/unit/pipeline/stages/execution-merge-conflict.test.ts +0 -218
  316. package/test/unit/pipeline/stages/rectify.test.ts +0 -101
  317. package/test/unit/pipeline/stages/regression-stage.test.ts +0 -69
  318. package/test/unit/pipeline/stages/review.test.ts +0 -201
  319. package/test/unit/pipeline/stages/routing-idempotence.test.ts +0 -139
  320. package/test/unit/pipeline/stages/routing-initial-complexity.test.ts +0 -321
  321. package/test/unit/pipeline/stages/routing-persistence.test.ts +0 -380
  322. package/test/unit/pipeline/stages/verify.test.ts +0 -267
  323. package/test/unit/pipeline/subscribers/events-writer.test.ts +0 -227
  324. package/test/unit/pipeline/subscribers/hooks.test.ts +0 -84
  325. package/test/unit/pipeline/subscribers/interaction.test.ts +0 -313
  326. package/test/unit/pipeline/subscribers/registry.test.ts +0 -149
  327. package/test/unit/pipeline/subscribers/reporters.test.ts +0 -90
  328. package/test/unit/pipeline/verify-smart-runner.test.ts +0 -345
  329. package/test/unit/prd-auto-default.test.ts +0 -291
  330. package/test/unit/prd-failure-category.test.ts +0 -177
  331. package/test/unit/prd-get-next-story.test.ts +0 -215
  332. package/test/unit/precheck/checks-warnings.test.ts +0 -114
  333. package/test/unit/precheck-checks.test.ts +0 -841
  334. package/test/unit/precheck-story-size-gate.test.ts +0 -288
  335. package/test/unit/precheck-types.test.ts +0 -143
  336. package/test/unit/prompts/builder.test.ts +0 -258
  337. package/test/unit/prompts/loader.test.ts +0 -355
  338. package/test/unit/prompts/sections/conventions.test.ts +0 -30
  339. package/test/unit/prompts/sections/isolation.test.ts +0 -35
  340. package/test/unit/prompts/sections/role-task.test.ts +0 -40
  341. package/test/unit/prompts/sections/sections.test.ts +0 -238
  342. package/test/unit/prompts/sections/story.test.ts +0 -45
  343. package/test/unit/prompts/sections/verdict.test.ts +0 -58
  344. package/test/unit/prompts.test.ts +0 -476
  345. package/test/unit/queue.test.ts +0 -237
  346. package/test/unit/rectification.test.ts +0 -285
  347. package/test/unit/registry.test.ts +0 -288
  348. package/test/unit/review/runner.test.ts +0 -117
  349. package/test/unit/routing/content-hash.test.ts +0 -99
  350. package/test/unit/routing/routing-stability.test.ts +0 -208
  351. package/test/unit/routing/strategies/llm.test.ts +0 -306
  352. package/test/unit/routing-advanced.test.ts +0 -313
  353. package/test/unit/routing-core.test.ts +0 -341
  354. package/test/unit/routing-strategies.test.ts +0 -440
  355. package/test/unit/storyid-events.test.ts +0 -213
  356. package/test/unit/tdd-verdict.test.ts +0 -492
  357. package/test/unit/test-output-parser.test.ts +0 -377
  358. package/test/unit/ui/tui-controls.test.ts +0 -335
  359. package/test/unit/ui/tui-cost-and-pty.test.ts +0 -190
  360. package/test/unit/ui/tui-layout.test.ts +0 -379
  361. package/test/unit/ui/tui-stories.test.ts +0 -333
  362. package/test/unit/unit-isolation.test.ts +0 -135
  363. package/test/unit/utils/git.test.ts +0 -50
  364. package/test/unit/utils/path-security.test.ts +0 -47
  365. package/test/unit/utils-helpers.test.ts +0 -318
  366. package/test/unit/verdict.test.ts +0 -325
  367. package/test/unit/verification/orchestrator-types.test.ts +0 -54
  368. package/test/unit/verification/orchestrator.test.ts +0 -66
  369. package/test/unit/verification/smart-runner-config.test.ts +0 -163
  370. package/test/unit/verification/smart-runner-discovery.test.ts +0 -354
  371. package/test/unit/verification/smart-runner.test.ts +0 -262
  372. package/test/unit/verification/strategies/acceptance.test.ts +0 -33
  373. package/test/unit/verification/strategies/regression.test.ts +0 -87
  374. package/test/unit/verification/strategies/scoped.test.ts +0 -100
  375. package/test/unit/worktree-manager.test.ts +0 -159
  376. package/tsconfig.json +0 -27
package/docs/20260225-phase2-llm-service-layer.md
@@ -1,401 +0,0 @@
- # Phase 2: LLM Service Layer — Merged Architecture Design
-
- *Date: 2026-02-25*
- *Status: Proposed (pending decision)*
- *Supersedes: Original issue #3 design + 2026-02-25 architecture analysis*
-
- ---
-
- ## Problem
-
- nax v0.10.0 has two coupling issues:
-
- 1. **All LLM calls go through Claude Code CLI** — routing, review, acceptance stages spawn `claude -p` just for text reasoning. Wasteful.
- 2. **All coding goes through CLI subprocess** — ~350MB RAM each, blocks parallelism.
-
- ## Solution: Unified LLM Service Layer + Lightweight Agent Loop
-
- Two execution paths, one provider abstraction:
-
- ```
- LlmProvider (interface — normalized across providers)
-   ├── AnthropicProvider (Messages API)
-   ├── GoogleProvider (GenerateContent API)
-   └── OpenAiCompatProvider (Chat Completions — covers OpenAI, Moonshot, DeepSeek, OpenRouter, Groq, etc.)
-
- Used by:
-   ├── LLM Mode (text in → text out) — routing, analyze, review, acceptance
-   │   └── llm/client.ts → callLlm(prompt, tier, config)
-
-   └── Agent Mode (text + tools) — coding, TDD
-       ├── DirectApiAdapter — LlmProvider + tool loop (~5MB per session)
-       └── ClaudeCodeAdapter — CLI subprocess (~350MB, for TDD/interactive)
- ```
-
- ## Architecture
-
- ```
- src/
- ├── llm/  # LLM Service Layer (shared by both modes)
- │   ├── types.ts  # LlmProvider interface, Message, ToolCall types
- │   ├── client.ts  # callLlm() with fallback chain logic
- │   ├── registry.ts  # Create provider from config
- │   └── providers/
- │       ├── anthropic.ts  # Anthropic Messages API
- │       ├── openai-compat.ts  # OpenAI-compatible (configurable baseUrl)
- │       └── google.ts  # Google Gemini API
-
- ├── llm/tools/  # Minimal tool set for Direct API coding
- │   ├── types.ts  # ToolDefinition, ToolResult
- │   ├── read-file.ts  # Read file contents
- │   ├── write-file.ts  # Write/create file
- │   ├── list-files.ts  # List directory
- │   ├── search-files.ts  # Grep/ripgrep
- │   └── run-command.ts  # Shell exec (tests, git)
-
- ├── llm/agent-loop.ts  # Tool use cycle: prompt → chat() → execute tools → loop
-
- ├── agents/  # Agent adapters (implement AgentAdapter interface)
- │   ├── types.ts  # AgentAdapter, AgentResult (unchanged)
- │   ├── claude.ts  # ClaudeCodeAdapter (current — subprocess)
- │   ├── direct-api.ts  # DirectApiAdapter (new — wraps llm/ + tools)
- │   ├── registry.ts  # Resolve backend config → adapter instance
- │   └── cost.ts  # Cost estimation (unchanged for CLI, exact for API)
-
- ├── pipeline/stages/  # Each stage declares its execution mode
- │   ├── routing.ts  # LLM Mode → llm/client.ts
- │   ├── analyze.ts  # LLM Mode → llm/client.ts
- │   ├── coding.ts  # Agent Mode → agents/registry.ts
- │   ├── tdd.ts  # Agent Mode → agents/registry.ts
- │   ├── review.ts  # LLM Mode → llm/client.ts
- │   └── acceptance.ts  # LLM Mode → llm/client.ts
-
- └── config/schema.ts  # Extended with providers, routing, pipeline overrides
- ```
-
- ## LlmProvider Interface
-
- ```typescript
- interface LlmProvider {
-   readonly name: string;
-
-   chat(options: {
-     model: string;
-     messages: Message[];
-     tools?: ToolDefinition[]; // Optional — omit for LLM Mode (reasoning only)
-     maxTokens?: number;
-     temperature?: number;
-     timeoutMs?: number;
-   }): Promise<LlmResponse>;
- }
-
- interface LlmResponse {
-   content: string;
-   toolCalls: ToolCall[]; // Normalized regardless of provider format
-   stopReason: "end_turn" | "tool_use" | "max_tokens";
-   usage: { inputTokens: number; outputTokens: number };
- }
- ```
-
- Single interface serves both modes:
- - **LLM Mode** (routing, review): `chat()` without `tools` → text response
- - **Agent Mode** (coding): `chat()` with `tools` → tool calls → agent loop iterates
-
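The two modes described above can be illustrated with a stub provider. This is a minimal sketch, not the nax implementation: the field layouts of `Message`, `ToolDefinition`, and `ToolCall` are assumptions (the design names these types without defining them), and `stubProvider` and `askText` are hypothetical names introduced only for this example.

```typescript
// Stand-ins for types the design names but does not spell out (assumed shapes).
interface Message { role: "user" | "assistant"; content: string }
interface ToolDefinition { name: string; description: string }
interface ToolCall { id: string; name: string; arguments: Record<string, unknown> }

interface LlmResponse {
  content: string;
  toolCalls: ToolCall[];
  stopReason: "end_turn" | "tool_use" | "max_tokens";
  usage: { inputTokens: number; outputTokens: number };
}

interface LlmProvider {
  readonly name: string;
  chat(options: { model: string; messages: Message[]; tools?: ToolDefinition[] }): Promise<LlmResponse>;
}

// Hypothetical stub: requests a tool when tools are offered, otherwise replies with text.
const stubProvider: LlmProvider = {
  name: "stub",
  async chat({ messages, tools }) {
    const usage = { inputTokens: 1, outputTokens: 1 };
    const firstTool = tools?.[0];
    if (firstTool) {
      // Agent Mode: the caller's loop would execute this call and send the result back.
      return {
        content: "",
        toolCalls: [{ id: "call_0", name: firstTool.name, arguments: {} }],
        stopReason: "tool_use",
        usage,
      };
    }
    // LLM Mode: plain text in, plain text out.
    const last = messages[messages.length - 1];
    return { content: `echo: ${last ? last.content : ""}`, toolCalls: [], stopReason: "end_turn", usage };
  },
};

// LLM Mode helper: omits `tools`, so only the text content is of interest.
async function askText(provider: LlmProvider, model: string, prompt: string): Promise<string> {
  const response = await provider.chat({ model, messages: [{ role: "user", content: prompt }] });
  return response.content;
}
```

Passing the same provider a non-empty `tools` array yields a `tool_use` stop reason instead, which is the signal an agent loop would iterate on.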
- ## Provider Implementations
-
- | Implementation | Covers | API Format | Tool Use Format |
- |:---------------|:-------|:-----------|:---------------|
- | `AnthropicProvider` | Anthropic (Claude) | Messages API | `tool_use` content blocks |
- | `OpenAiCompatProvider` | OpenAI, Moonshot, DeepSeek, OpenRouter, Groq, Together | Chat Completions | `tool_calls` in message |
- | `GoogleProvider` | Google Gemini | GenerateContent | `functionCall` in parts |
-
- `OpenAiCompatProvider` takes `baseUrl` + `apiKey` — any OpenAI-compatible provider works with zero code.
-
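The three tool-use formats in the table could be normalized along these lines. This is a sketch under stated assumptions: the `ToolCall` fields and the `fromAnthropic`, `fromOpenAi`, and `fromGemini` helper names are hypothetical, and the provider response types are trimmed to the fields read here.

```typescript
// Hypothetical normalized shape; the design names ToolCall but does not define its fields.
interface ToolCall {
  id: string;
  name: string;
  arguments: Record<string, unknown>;
}

// Trimmed response shapes covering only the fields used below.
type AnthropicBlock =
  | { type: "text"; text: string }
  | { type: "tool_use"; id: string; name: string; input: Record<string, unknown> };

interface OpenAiToolCall {
  id: string;
  type: "function";
  function: { name: string; arguments: string }; // arguments arrive as a JSON-encoded string
}

interface GeminiPart {
  text?: string;
  functionCall?: { name: string; args: Record<string, unknown> };
}

// Anthropic: tool requests are `tool_use` blocks mixed in with text blocks.
function fromAnthropic(blocks: AnthropicBlock[]): ToolCall[] {
  const calls: ToolCall[] = [];
  for (const block of blocks) {
    if (block.type === "tool_use") {
      calls.push({ id: block.id, name: block.name, arguments: block.input });
    }
  }
  return calls;
}

// OpenAI-compatible: `tool_calls` on the message, with stringified arguments.
function fromOpenAi(toolCalls: OpenAiToolCall[] = []): ToolCall[] {
  return toolCalls.map((tc) => ({
    id: tc.id,
    name: tc.function.name,
    arguments: JSON.parse(tc.function.arguments) as Record<string, unknown>,
  }));
}

// Gemini: `functionCall` entries inside content parts.
function fromGemini(parts: GeminiPart[]): ToolCall[] {
  const calls: ToolCall[] = [];
  parts.forEach((part, index) => {
    if (part.functionCall) {
      // This sketch assumes no call id is supplied, so one is synthesized from the part index.
      calls.push({ id: `call_${index}`, name: part.functionCall.name, arguments: part.functionCall.args });
    }
  });
  return calls;
}
```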
- ## Tier-Based Fallback Chains
-
- Each tier is an ordered list of providers. On 429/error, try next in chain:
-
- ```
- Stage needs "balanced" tier
-   → Try anthropic/sonnet
-   → Rate limited (429)? → Try openai/gpt-5
-   → Also limited? → Try next in list
-   → All exhausted? → Stage fails with clear error
- ```
-
- Both LLM Mode and Agent Mode use the same fallback logic via `llm/client.ts`:
-
- ```
- config.models["balanced"] → [anthropic/sonnet, openai/gpt-5]
-
-         ┌─────────────────┴──────────────────┐
-         │                                    │
-  LLM Mode stages                     Agent Mode stages
-  (routing, review)                   (coding, TDD)
-         │                                    │
-  llm/client.ts                       DirectApiAdapter
-  tries providers                     tries providers
-  in order                            in order (with tools)
- ```
-
- Single `ModelDef` (not array) is treated as array of one — backward compatible, no fallback.
-
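The chain behaviour described above might look like the following. It is a minimal sketch, assuming a `ModelDef` with `provider` and `model` fields; `callWithFallback`, `toChain`, and the injected `CallFn` are hypothetical names, and every error (not only HTTP 429) moves on to the next entry.

```typescript
// Hypothetical shapes for illustration; not taken from the nax codebase.
interface ModelDef {
  provider: string;
  model: string;
}

// The actual provider call is injected so the chain logic stays provider-agnostic.
type CallFn = (def: ModelDef, prompt: string) => Promise<string>;

// A single ModelDef is treated as a chain of one, so no fallback takes place.
function toChain(entry: ModelDef | ModelDef[]): ModelDef[] {
  return Array.isArray(entry) ? entry : [entry];
}

async function callWithFallback(
  entry: ModelDef | ModelDef[],
  prompt: string,
  call: CallFn,
): Promise<string> {
  const failures: string[] = [];
  for (const def of toChain(entry)) {
    try {
      // First provider that answers wins.
      return await call(def, prompt);
    } catch (err) {
      // Record why this provider failed, then try the next one in order.
      const reason = err instanceof Error ? err.message : String(err);
      failures.push(`${def.provider}/${def.model}: ${reason}`);
    }
  }
  // Chain exhausted: fail with an error that lists every attempt.
  throw new Error(`All providers exhausted:\n${failures.join("\n")}`);
}
```

Collecting the per-provider failure reasons is one way to satisfy the "stage fails with clear error" step; a real client would probably also distinguish retryable errors from configuration mistakes.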
- ## Backend Routing
-
- Three-level resolution:
-
- ### 1. Per-Stage Pipeline Override (most specific)
-
- ```json
- {
-   "pipeline": {
-     "routing": {
-       "primary": { "provider": "google", "model": "gemini-flash", "via": "api" },
-       "fallback": [
-         { "provider": "anthropic", "model": "haiku", "via": "api" },
-         { "via": "keyword" }
-       ]
-     },
-     "implementation": {
-       "primary": { "provider": "anthropic", "model": "sonnet", "via": "api" },
-       "fallback": [
-         { "via": "claude-cli" }
-       ]
-     }
-   }
- }
- ```
-
- The `via` field determines execution path:
- - `"api"` → Direct API (LLM Mode or DirectApiAdapter depending on stage)
- - `"claude-cli"` → Claude Code CLI subprocess
- - `"keyword"` → built-in keyword strategy (routing only)
-
- ### 2. Strategy Override (tdd/interactive → force backend)
-
- ```json
- {
-   "agents": {
-     "overrides": {
-       "tdd": "claude-cli",
-       "interactive": "claude-cli"
-     }
-   }
- }
- ```
-
- ### 3. Tier Routing (default)
-
- ```json
- {
-   "agents": {
-     "routing": {
-       "fast": { "provider": "gemini", "model": "gemini-2.5-flash" },
-       "balanced": { "provider": "anthropic", "model": "claude-sonnet-4-5" },
-       "powerful": { "provider": "anthropic", "model": "claude-opus-4" }
-     }
-   }
- }
- ```
-
- ### Resolution Logic
-
- ```typescript
- function resolveBackend(
-   tier: ModelTier,
-   stage: string,
-   context: { tdd: boolean; interactive: boolean }
- ): BackendConfig {
-   const config = loadConfig();
-
-   // 1. Per-stage pipeline override
-   if (config.pipeline?.[stage]?.primary) return config.pipeline[stage];
-
-   // 2. Strategy override
-   if (context.tdd && config.agents?.overrides?.tdd) return config.agents.overrides.tdd;
-   if (context.interactive && config.agents?.overrides?.interactive) return config.agents.overrides.interactive;
-
-   // 3. Tier routing
-   return config.agents?.routing?.[tier] ?? "claude-cli";
- }
- ```
-
- ## Full Config Example
-
- ```json
- {
-   "agents": {
-     "providers": {
-       "anthropic": {
-         "type": "anthropic",
-         "apiKey": "${ANTHROPIC_API_KEY}"
-       },
-       "openai": {
-         "type": "openai-compat",
-         "baseUrl": "https://api.openai.com/v1",
-         "apiKey": "${OPENAI_API_KEY}"
-       },
-       "gemini": {
-         "type": "google",
-         "apiKey": "${GOOGLE_API_KEY}"
-       },
-       "moonshot": {
-         "type": "openai-compat",
-         "baseUrl": "https://api.moonshot.cn/v1",
-         "apiKey": "${MOONSHOT_API_KEY}"
-       },
-       "deepseek": {
-         "type": "openai-compat",
-         "baseUrl": "https://api.deepseek.com/v1",
-         "apiKey": "${DEEPSEEK_API_KEY}"
-       }
-     },
-     "routing": {
-       "fast": { "provider": "gemini", "model": "gemini-2.5-flash" },
-       "balanced": [
-         { "provider": "anthropic", "model": "claude-sonnet-4-5" },
-         { "provider": "openai", "model": "gpt-5" }
-       ],
-       "powerful": { "provider": "anthropic", "model": "claude-opus-4" }
-     },
-     "overrides": {
-       "tdd": "claude-cli",
-       "interactive": "claude-cli"
-     }
-   },
-   "pipeline": {
-     "routing": {
-       "primary": { "provider": "gemini", "model": "gemini-flash", "via": "api" },
-       "fallback": [{ "via": "keyword" }]
-     }
-   }
- }
- ```
-
275
- ## Minimal Tool Set (for DirectApiAdapter)
276
-
277
- | Tool | What | Lines |
278
- |:-----|:-----|:------|
279
- | `read_file` | Read file contents (with line range) | ~15 |
280
- | `write_file` | Write/create file (with mkdir -p) | ~15 |
281
- | `list_files` | List directory (recursive option) | ~15 |
282
- | `search_files` | Grep/ripgrep pattern search | ~20 |
283
- | `run_command` | Shell exec with timeout + cwd | ~30 |
284
-
285
- ~95 lines total. Each tool is sandboxed to the project workdir.
286
-
287
- ## Agent Loop
288
-
289
- ```typescript
290
- async function agentLoop(
291
- provider: LlmProvider,
292
- model: string,
293
- prompt: string,
294
- workdir: string,
295
- maxIterations: number = 50,
296
- ): Promise<AgentResult> {
297
- const tools = getToolDefinitions();
298
- const messages: Message[] = [{ role: "user", content: prompt }];
299
- const totalCost = { input: 0, output: 0 };
300
-
301
- for (let i = 0; i < maxIterations; i++) {
302
- const response = await provider.chat({ model, messages, tools });
303
- totalCost.input += response.usage.inputTokens;
304
- totalCost.output += response.usage.outputTokens;
305
-
306
- if (response.stopReason === "end_turn") {
307
- return { success: true, output: response.content, cost: totalCost };
308
- }
309
-
310
- // Execute tool calls
311
- const toolResults = await Promise.all(
312
- response.toolCalls.map(tc => executeTool(tc, workdir))
313
- );
314
-
315
- messages.push({ role: "assistant", content: response.content, toolCalls: response.toolCalls });
316
- messages.push({ role: "tool", results: toolResults });
317
- }
318
-
319
- return { success: false, output: "Max iterations reached", cost: totalCost };
320
- }
321
- ```
322
-
323
- The full loop is estimated at ~150 lines once error handling, logging, and token budget checks are added.
324
-
325
- ## Comparison: CLI vs Direct API
326
-
327
- | Factor | Claude Code CLI | Direct API |
328
- |:-------|:---------------|:-----------|
329
- | RAM per session | ~350MB | ~5MB |
330
- | Parallel stories | OOMs at 3 | 10+ concurrent |
331
- | Cost tracking | Estimated from duration | Exact token counts from API |
332
- | Provider flexibility | Anthropic only | Any provider with tool_use |
333
- | Tool access | ~50 tools (overkill) | 5 tools (minimal, sandboxed) |
334
- | CLAUDE.md support | ✅ Auto-loaded | ❌ Must inject into prompt |
335
- | TDD isolation | ✅ PTY-based session isolation | ⚠️ Possible but needs validation |
336
- | Interactive/TUI | ✅ PTY handle | ❌ Not supported |
337
- | Dependencies | `claude` binary installed | Just HTTP (fetch) |
338
-
339
- ## Backward Compatibility
340
-
341
- - No `agents` section in config → everything uses `claude-cli` (current behavior)
342
- - No `pipeline` section → stages inherit from tier routing
343
- - Single ModelDef (not array) → treated as array of one, no fallback
344
- - Zero breaking changes
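
The single-value rule above can be sketched as a small normalizer. This is a minimal illustration under assumptions, not the project's code: the `ModelDef` shape is inferred from the config examples in this doc, and `toFallbackChain` is a hypothetical helper name.

```typescript
// Hypothetical shape, inferred from the routing examples in this doc.
interface ModelDef {
  provider: string;
  model: string;
}

// A routing entry may be a single ModelDef or an ordered fallback list.
type RoutingEntry = ModelDef | ModelDef[];

// Normalize to an array so callers always iterate a chain.
// A single ModelDef becomes a chain of one, which means no fallback.
function toFallbackChain(entry: RoutingEntry): ModelDef[] {
  return Array.isArray(entry) ? entry : [entry];
}

const single = toFallbackChain({ provider: "gemini", model: "gemini-2.5-flash" });
const chain = toFallbackChain([
  { provider: "anthropic", model: "claude-sonnet-4-5" },
  { provider: "openai", model: "gpt-5" },
]);
console.log(single.length, chain.length); // 1 2
```

Normalizing at the config boundary would keep the fallback logic free of "is this an array?" checks.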
345
-
346
- ## Component Breakdown
347
-
348
- | Component | Est. Lines | What |
349
- |:----------|:-----------|:-----|
350
- | `llm/types.ts` | ~60 | LlmProvider, Message, ToolCall, LlmResponse |
351
- | `llm/providers/anthropic.ts` | ~80 | Messages API + tool_use normalization |
352
- | `llm/providers/openai-compat.ts` | ~80 | Chat Completions + configurable baseUrl |
353
- | `llm/providers/google.ts` | ~100 | GenerateContent + functionCall normalization |
354
- | `llm/registry.ts` | ~40 | Provider factory from config |
355
- | `llm/client.ts` | ~80 | callLlm() with fallback chain + retry |
356
- | `llm/tools/*.ts` (5 tools) | ~95 | read, write, list, search, exec |
357
- | `llm/agent-loop.ts` | ~150 | Tool use cycle with iteration limit |
358
- | `agents/direct-api.ts` | ~80 | DirectApiAdapter wrapping llm/ layer |
359
- | `agents/registry.ts` (update) | ~30 | Resolve backend config → adapter |
360
- | `config/schema.ts` (update) | ~100 | providers, routing, overrides, pipeline |
361
- | **Total** | **~895** | |
362
-
363
- ## Implementation Phases
364
-
365
- | Phase | Scope | Effort | Enables |
366
- |:------|:------|:-------|:--------|
367
- | P1 | LlmProvider interface + AnthropicProvider + callLlm() | Small | LLM Mode for routing/review |
368
- | P2 | OpenAiCompatProvider + GoogleProvider | Small | Multi-provider support |
369
- | P3 | Fallback chain logic in client.ts | Medium | Rate limit resilience |
370
- | P4 | Tool definitions + agent loop + DirectApiAdapter | Medium | API-based coding |
371
- | P5 | Per-stage pipeline config | Medium | Fine-grained stage control |
372
- | P6 | Wire LLM Mode into routing, review, acceptance stages | Medium | Remove CLI dependency for reasoning |
373
-
374
- P1-P2 can ship independently as a quick win (LLM Mode only). P4 is the big unlock for Phase 3 parallelism.
375
-
376
- ## Auth/Key Management
377
-
378
- Provider keys flow from config with env var expansion:
379
-
380
- ```json
381
- {
382
- "providers": {
383
- "anthropic": { "type": "anthropic", "apiKey": "${ANTHROPIC_API_KEY}" }
384
- }
385
- }
386
- ```
387
-
388
- Each provider reads `apiKey` from its config entry and falls back to `process.env` for backward compatibility.
389
- Per-model env overrides via `ModelDef.env` still work (existing behavior).
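
A minimal sketch of the `${VAR}` expansion described above. The helper names `expandEnv` and `resolveApiKey` are hypothetical, and throwing on a missing variable is an assumption. The environment is passed in as a plain map, so in real use `process.env` would be supplied by the caller.

```typescript
// Map of environment variables; in practice the caller would pass process.env.
type Env = Record<string, string | undefined>;

// Hypothetical helper: replaces each ${NAME} placeholder with env[NAME].
// Assumption: an unset variable is an error rather than an empty string.
function expandEnv(value: string, env: Env): string {
  return value.replace(/\$\{([A-Za-z_][A-Za-z0-9_]*)\}/g, (_match, name: string) => {
    const resolved = env[name];
    if (resolved === undefined) {
      throw new Error(`Missing environment variable: ${name}`);
    }
    return resolved;
  });
}

// Hypothetical helper: prefer the config entry, else fall back to a direct env lookup.
function resolveApiKey(
  configured: string | undefined,
  fallbackVar: string,
  env: Env,
): string | undefined {
  return configured !== undefined ? expandEnv(configured, env) : env[fallbackVar];
}

const demoEnv: Env = { ANTHROPIC_API_KEY: "sk-test" };
console.log(resolveApiKey("${ANTHROPIC_API_KEY}", "ANTHROPIC_API_KEY", demoEnv)); // sk-test
console.log(resolveApiKey(undefined, "ANTHROPIC_API_KEY", demoEnv)); // sk-test
```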
390
-
391
- ## Enables Phase 3 (Parallelism)
392
-
393
- With DirectApiAdapter (~5MB each), Phase 3 becomes feasible:
394
- - N stories execute concurrently via parallel HTTP calls
395
- - Each story gets its own git worktree (from dev-orchestrator pattern)
396
- - No OOM risk — 10 concurrent stories ≈ 50MB total vs 3.5GB with CLI
397
- - Exact cost tracking per story from API token counts
398
-
399
- ---
400
-
401
- *Decision pending. This doc captures the merged architecture for future implementation.*
@@ -1,187 +0,0 @@
1
- # Deep Code Review: @nathapp/nax v0.10.1
2
-
3
- **Date:** 2026-02-25
4
- **Reviewer:** Subrina (AI)
5
- **Scope:** Status File, Failure Categories, Verifier Verdicts (31 files changed, ~5,200 lines added)
6
- **Commit Range:** v0.10.0..v0.10.1
7
-
8
- ---
9
-
10
- ## Overall Grade: A- (87/100)
11
-
12
- | Category | Score | Notes |
13
- |:---|:---:|:---|
14
- | Security | 17/20 | Solid input validation on verdict; minor path traversal concern |
15
- | Reliability | 18/20 | Atomic writes, graceful fallbacks, comprehensive error handling |
16
- | API Design | 18/20 | Clean types, good separation of concerns, extensible verdict schema |
17
- | Code Quality | 17/20 | Well-structured, good test coverage; runner.ts exceeds 400-line guideline |
18
- | Best Practices | 17/20 | Good patterns; minor DRY and cleanup opportunities |
19
-
20
- ---
21
-
22
- ## Findings
23
-
24
- ### CRITICAL
25
-
26
- *None found.*
27
-
28
- ### HIGH
29
-
30
- #### BUG-1: runner.ts exceeds 400-line guideline at 1,310 lines (HIGH)
31
-
32
- **File:** `src/execution/runner.ts`
33
- **Risk:** Cognitive complexity, harder to test individual paths, merge conflicts.
34
- **Snippet:** `wc -l src/execution/runner.ts → 1310`
35
- **Fix:** Extract the status-file write orchestration, the story pipeline dispatch, and the escalation logic into separate modules (as was done with `post-verify.ts`). The `writeStatus` closure and its state vars could become a `StatusFileWriter` class.
36
-
37
- #### SEC-1: Status file path not validated for path traversal (HIGH)
38
-
39
- **File:** `src/execution/status-file.ts:170`
40
- ```typescript
41
- export async function writeStatusFile(filePath: string, status: NaxStatusFile): Promise<void> {
42
- const tmpPath = `${filePath}.tmp`;
43
- await Bun.write(tmpPath, JSON.stringify(status, null, 2));
44
- await rename(tmpPath, filePath);
45
- }
46
- ```
47
- **Risk:** The path is written without validation, so anyone who controls the `statusFile` option can overwrite an arbitrary file. The exposure is currently limited because the path comes from the `--status-file` CLI argument, which is operator-controlled.
48
- **Fix:** Add `path.resolve()` + verify the path is within the workdir or a known safe directory.
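
One way to express the suggested check, as a sketch only. `resolveInside` is a hypothetical helper, not an existing nax function. It uses Node's `path.resolve`, `path.relative`, and `path.isAbsolute`, and it does not follow symlinks, so a stricter version might also call `fs.realpath`.

```typescript
import * as path from "node:path";

// Hypothetical helper: returns the resolved path, or throws if it escapes baseDir.
function resolveInside(baseDir: string, candidate: string): string {
  const base = path.resolve(baseDir);
  const resolved = path.resolve(base, candidate);
  const rel = path.relative(base, resolved);
  // A relative path beginning with ".." points outside base. An absolute
  // result can occur on Windows when the two paths are on different drives.
  if (rel === ".." || rel.startsWith(".." + path.sep) || path.isAbsolute(rel)) {
    throw new Error(`Status file path escapes ${base}: ${candidate}`);
  }
  return resolved;
}

console.log(resolveInside("/tmp/project", "nax-status.json"));
```

Calling this once where `--status-file` is parsed would let `writeStatusFile` stay unchanged.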
49
-
50
- ### MEDIUM
51
-
52
- #### ENH-1: `getSafeLogger()` duplicated across files (MEDIUM)
53
-
54
- **Files:** `src/execution/runner.ts:71`, `src/execution/post-verify.ts:80`
55
- ```typescript
56
- function getSafeLogger() {
57
- try { return getLogger(); }
58
- catch { return null; }
59
- }
60
- ```
61
- **Risk:** DRY violation. If logger initialization changes, both copies need updating.
62
- **Fix:** Export `getSafeLogger()` from `src/logger.ts` or a shared utility.
63
-
64
- #### TYPE-1: `captureGitRef()` duplicated with different signatures (MEDIUM)
65
-
66
- **Files:** `src/execution/post-verify.ts:17` (returns `string | undefined`), `src/tdd/orchestrator.ts:30` (returns `string`, throws on failure)
67
- **Risk:** Inconsistent error handling for the same operation. The orchestrator version will throw if git isn't available.
68
- **Fix:** Consolidate into a single shared function in a git utility module. Prefer the `string | undefined` signature for resilience.
69
-
70
- #### BUG-2: `writeStatus` swallows errors silently during critical state transitions (MEDIUM)
71
-
72
- **File:** `src/execution/runner.ts:188-193`
73
- ```typescript
74
- catch (err) {
75
- safeLogger?.warn("status-file", "Failed to write status file (non-fatal)", {
76
- path: statusFile,
77
- error: (err as Error).message,
78
- });
79
- }
80
- ```
81
- **Risk:** If the status file write fails repeatedly (e.g., disk full), the only signal is a warn log that may be missed. External tooling polling the file would see stale data.
82
- **Fix:** Consider a counter; after N consecutive failures, emit a more prominent error or set a flag on the run state.
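
The counter idea could look roughly like this. `WriteFailureTracker` and the threshold value are hypothetical. The sketch only tracks the streak; what to do once it trips (log at error level, set a run-state flag) is left to the caller.

```typescript
// Hypothetical tracker for consecutive status-file write failures.
class WriteFailureTracker {
  private consecutive = 0;
  private readonly threshold: number;

  constructor(threshold: number) {
    this.threshold = threshold;
  }

  // Call after a successful write; resets the streak.
  recordSuccess(): void {
    this.consecutive = 0;
  }

  // Call after a failed write; returns true once the streak reaches the threshold.
  recordFailure(): boolean {
    this.consecutive += 1;
    return this.consecutive >= this.threshold;
  }
}

const tracker = new WriteFailureTracker(3);
tracker.recordFailure();
tracker.recordFailure();
console.log(tracker.recordFailure()); // true
```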
83
-
84
- #### ENH-2: Verdict validation could use a schema library (MEDIUM)
85
-
86
- **File:** `src/tdd/verdict.ts:85-120` — `isValidVerdict()` is 35 lines of manual validation.
87
- **Risk:** Verbose and error-prone as the schema evolves. New fields require manual validation additions.
88
- **Fix:** Consider using `zod` or `valibot` for declarative schema validation. However, the current zero-dependency approach is acceptable for a CLI tool — marking as enhancement, not a bug.
89
-
90
- #### MEM-1: Atomic write leaves orphan `.tmp` file on crash between write and rename (MEDIUM)
91
-
92
- **File:** `src/execution/status-file.ts:172-175`
93
- **Risk:** If the process crashes after `Bun.write()` but before `rename()`, a `.tmp` file persists. Not a memory leak but can cause confusion.
94
- **Fix:** Add cleanup of stale `.tmp` files at runner startup, or use `try/finally` to attempt cleanup.
95
-
96
- ### LOW
97
-
98
- #### STYLE-1: Inconsistent `as FailureCategory` casts in orchestrator (LOW)
99
-
100
- **File:** `src/tdd/orchestrator.ts:293, 312`
101
- ```typescript
102
- failureCategory: "session-failure" as FailureCategory,
103
- failureCategory: "isolation-violation" as FailureCategory,
104
- ```
105
- **Risk:** The type is already `FailureCategory`, so the cast is redundant and adds noise.
106
- **Fix:** Remove the `as FailureCategory` casts; the string literal is already checked against the property's declared `FailureCategory` type without them.
107
-
108
- #### STYLE-2: Test file duplication — `tdd-verdict.test.ts` and `verdict.test.ts` overlap significantly (LOW)
109
-
110
- **Files:** `test/verdict.test.ts` (339 lines), `test/tdd-verdict.test.ts` (290 lines)
111
- **Risk:** Both test `readVerdict`, `cleanupVerdict`, and `categorizeVerdict` with very similar test cases. Maintenance burden doubles.
112
- **Fix:** Consolidate into a single test file. If both were generated by different subtasks, merge the more thorough assertions from each.
113
-
114
- #### ENH-3: `countProgress` iterates stories 4 times (LOW)
115
-
116
- **File:** `src/execution/status-file.ts:101-106`
117
- ```typescript
118
- const passed = stories.filter((s) => s.status === "passed").length;
119
- const failed = stories.filter((s) => s.status === "failed").length;
120
- const paused = stories.filter((s) => s.status === "paused").length;
121
- const blocked = stories.filter((s) => s.status === "blocked").length;
122
- ```
123
- **Risk:** Negligible performance impact (PRDs have <50 stories), but could be a single loop.
124
- **Fix:** Single `reduce()` pass. Low priority — readability is fine as-is.
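
For reference, a single-pass version might look like the sketch below. The `Story` and `ProgressCounts` shapes are assumptions made for illustration, and `"pending"` stands in for any status that is not counted. The real types in `status-file.ts` may differ.

```typescript
// Assumed status union; "pending" represents any uncounted status.
type StoryStatus = "pending" | "passed" | "failed" | "paused" | "blocked";

interface Story {
  status: StoryStatus;
}

interface ProgressCounts {
  passed: number;
  failed: number;
  paused: number;
  blocked: number;
}

// Counts all four statuses in one pass over the stories.
function countProgress(stories: Story[]): ProgressCounts {
  return stories.reduce<ProgressCounts>(
    (counts, story) => {
      if (story.status !== "pending") counts[story.status] += 1;
      return counts;
    },
    { passed: 0, failed: 0, paused: 0, blocked: 0 },
  );
}

console.log(countProgress([{ status: "passed" }, { status: "failed" }, { status: "passed" }]));
```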
125
-
126
- #### PERF-1: `buildStatusSnapshot` calls `Date.now()` once and reuses it for `new Date()` (LOW, no action needed)
127
-
128
- **File:** `src/execution/status-file.ts:154`
129
- ```typescript
130
- const now = Date.now();
131
- // ...
132
- updatedAt: new Date(now).toISOString(),
133
- ```
134
- **Risk:** None — this is actually correct and efficient. No finding here, just noting the pattern is clean.
135
-
136
- #### ENH-4: `resolveMaxAttemptsOutcome` could benefit from exhaustive matching (LOW)
137
-
138
- **File:** `src/execution/runner.ts:59-64`
139
- ```typescript
140
- export function resolveMaxAttemptsOutcome(failureCategory?: FailureCategory): "pause" | "fail" {
141
- if (failureCategory === "isolation-violation" || failureCategory === "verifier-rejected") {
142
- return "pause";
143
- }
144
- return "fail";
145
- }
146
- ```
147
- **Risk:** If new `FailureCategory` values are added, this function silently defaults to "fail".
148
- **Fix:** Add a `satisfies never` exhaustive check or use a switch statement.
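
A sketch of the switch-based variant. The `FailureCategory` union here lists only the three members quoted in this review, so the real type likely has more. It uses an assignment to `never`, which serves the same purpose as the `satisfies never` form suggested above.

```typescript
// Assumed union: only the members that appear in this review.
type FailureCategory = "isolation-violation" | "verifier-rejected" | "session-failure";

function resolveMaxAttemptsOutcome(failureCategory?: FailureCategory): "pause" | "fail" {
  switch (failureCategory) {
    case "isolation-violation":
    case "verifier-rejected":
      return "pause";
    case "session-failure":
    case undefined:
      return "fail";
    default: {
      // Compile-time guard: a new FailureCategory member without its own
      // case makes this assignment a type error.
      const unhandled: never = failureCategory;
      throw new Error(`Unhandled failure category: ${String(unhandled)}`);
    }
  }
}

console.log(resolveMaxAttemptsOutcome("verifier-rejected")); // pause
console.log(resolveMaxAttemptsOutcome()); // fail
```

The trade-off is that every new category forces an explicit decision here instead of silently inheriting "fail".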
149
-
150
- ---
151
-
152
- ## What Was Done Well
153
-
154
- 1. **Atomic writes** for the status file — write-to-tmp-then-rename prevents partial reads.
155
- 2. **Verdict validation** is thorough — `isValidVerdict()` checks every required field, returns null on failure (never throws).
156
- 3. **Clean separation** — verdict reading, categorization, and cleanup are separate functions with single responsibilities.
157
- 4. **Fallback paths** — when no verdict file exists, the orchestrator gracefully falls back to independent test verification.
158
- 5. **Test coverage** is excellent — ~6 test files covering all three features with edge cases, error paths, and priority ordering.
159
- 6. **Type design** — `FailureCategory` as a union type, `NaxStatusFile` with version field for forward compat, `ThreeSessionTddResult.verdict` using `null | undefined` distinction.
160
- 7. **Documentation** — JSDoc on all public APIs with clear parameter descriptions.
161
- 8. **`markStoryFailed` backward compatibility** — the `failureCategory` parameter is optional; existing callers don't break.
162
-
163
- ---
164
-
165
- ## Priority Fix Order
166
-
167
- | Priority | ID | Severity | Effort | Description |
168
- |:---:|:---|:---:|:---:|:---|
169
- | 1 | BUG-1 | HIGH | L | Extract runner.ts into smaller modules |
170
- | 2 | SEC-1 | HIGH | S | Validate status file path |
171
- | 3 | ENH-1 | MEDIUM | S | Deduplicate `getSafeLogger()` |
172
- | 4 | TYPE-1 | MEDIUM | S | Consolidate `captureGitRef()` |
173
- | 5 | STYLE-2 | LOW | M | Merge duplicate verdict test files |
174
- | 6 | ENH-4 | LOW | S | Exhaustive match in `resolveMaxAttemptsOutcome` |
175
- | 7 | STYLE-1 | LOW | S | Remove redundant `as FailureCategory` casts |
176
-
177
- *Effort: S = <30min, M = 1-2h, L = 2-4h*
178
-
179
- ---
180
-
181
- ## Summary
182
-
183
- v0.10.1 is a **solid implementation** of three well-scoped features. The code demonstrates good defensive programming (never-throw readers, atomic writes, graceful fallbacks) and strong type design. Test coverage is comprehensive with both happy-path and error-path cases.
184
-
185
- The main concern is **runner.ts growing to 1,310 lines** — the status-file integration added more state and write points to an already large file. The next refactoring pass should extract the status-file writer and the story pipeline dispatch into separate modules.
186
-
187
- No critical security or reliability issues found. The codebase is production-ready.