@nathapp/nax 0.27.1 → 0.29.0

Files changed (383)
  1. package/CHANGELOG.md +13 -2
  2. package/dist/nax.js +72691 -0
  3. package/package.json +12 -4
  4. package/src/cli/config.ts +42 -1
  5. package/src/cli/prompts.ts +18 -6
  6. package/src/config/defaults.ts +2 -0
  7. package/src/config/schemas.ts +11 -0
  8. package/src/config/types.ts +8 -0
  9. package/src/context/builder.ts +10 -1
  10. package/src/pipeline/stages/execution.ts +5 -0
  11. package/src/pipeline/stages/prompt.ts +13 -4
  12. package/src/precheck/checks-warnings.ts +37 -0
  13. package/src/precheck/checks.ts +1 -0
  14. package/src/precheck/index.ts +14 -7
  15. package/src/prompts/builder.ts +178 -0
  16. package/src/prompts/index.ts +2 -0
  17. package/src/prompts/loader.ts +43 -0
  18. package/src/prompts/sections/conventions.ts +15 -0
  19. package/src/prompts/sections/index.ts +11 -0
  20. package/src/prompts/sections/isolation.ts +24 -0
  21. package/src/prompts/sections/role-task.ts +34 -0
  22. package/src/prompts/sections/story.ts +13 -0
  23. package/src/prompts/sections/verdict.ts +70 -0
  24. package/src/prompts/templates/implementer.ts +6 -0
  25. package/src/prompts/templates/single-session.ts +6 -0
  26. package/src/prompts/templates/test-writer.ts +6 -0
  27. package/src/prompts/templates/verifier.ts +6 -0
  28. package/src/prompts/types.ts +21 -0
  29. package/src/review/runner.ts +6 -1
  30. package/src/tdd/session-runner.ts +12 -12
  31. package/src/version.ts +2 -1
  32. package/.claude/rules/01-project-conventions.md +0 -34
  33. package/.claude/rules/02-test-architecture.md +0 -39
  34. package/.claude/rules/03-test-writing.md +0 -58
  35. package/.claude/rules/04-forbidden-patterns.md +0 -29
  36. package/.claude/settings.json +0 -15
  37. package/.githooks/pre-commit +0 -16
  38. package/.gitlab-ci.yml +0 -103
  39. package/.mcp.json +0 -8
  40. package/BRIEF.md +0 -140
  41. package/CLAUDE.md +0 -143
  42. package/US-007-IMPLEMENTATION.md +0 -139
  43. package/biome.json +0 -14
  44. package/bun.lock +0 -163
  45. package/bunfig.toml +0 -12
  46. package/docker-compose.test.yml +0 -15
  47. package/docs/20260216-fix-plan-context-review.md +0 -56
  48. package/docs/20260216-relentless-vs-ngent-comparison.md +0 -208
  49. package/docs/20260216-v02-plan.md +0 -136
  50. package/docs/20260216-v02-review.md +0 -685
  51. package/docs/20260217-dogfood-findings.md +0 -56
  52. package/docs/20260217-p2-plus-plan.md +0 -117
  53. package/docs/20260217-partial-fixes-plan.md +0 -62
  54. package/docs/20260217-plan-analyze-spec.md +0 -117
  55. package/docs/20260217-post-impl-review.md +0 -1137
  56. package/docs/20260217-quick-wins-plan.md +0 -66
  57. package/docs/20260217-split-runner-plan.md +0 -75
  58. package/docs/20260217-v03-impl-plan.md +0 -80
  59. package/docs/20260217-v03-post-impl-review.md +0 -589
  60. package/docs/20260217-v04-impl-plan.md +0 -86
  61. package/docs/20260217-v05-post-impl-review.md +0 -850
  62. package/docs/20260217-v06-post-impl-review.md +0 -817
  63. package/docs/20260218-adr003-port-plan.md +0 -151
  64. package/docs/20260218-review-adr003-verification.md +0 -175
  65. package/docs/20260219-fix-plan-bug16-19.md +0 -79
  66. package/docs/20260219-fix-plan-bug20-22.md +0 -114
  67. package/docs/20260219-plan-llm-routing.md +0 -116
  68. package/docs/20260219-review-bug20-22-fixes.md +0 -135
  69. package/docs/20260219-routing-baseline-keyword.md +0 -63
  70. package/docs/20260220-plan-structured-logging-p1.md +0 -80
  71. package/docs/20260220-plan-structured-logging-p2.md +0 -37
  72. package/docs/20260220-review-llm-routing.md +0 -180
  73. package/docs/20260220-review-post-fix-llm-routing.md +0 -70
  74. package/docs/20260221-fix-plan-relevantfiles-split.md +0 -101
  75. package/docs/20260221-fix-plan-routing-mode.md +0 -125
  76. package/docs/20260221-review-v0.9-implementation.md +0 -379
  77. package/docs/20260222-fix-plan-v091-routing-isolation.md +0 -197
  78. package/docs/20260223-fix-plan-prompt-audit.md +0 -62
  79. package/docs/20260224-nax-roadmap-phases.md +0 -189
  80. package/docs/20260225-phase2-llm-service-layer.md +0 -401
  81. package/docs/20260225-review-v0.10.1.md +0 -187
  82. package/docs/20260303-v010-implementation-plan.md +0 -165
  83. package/docs/20260304-review-nax.md +0 -492
  84. package/docs/CLAUDE.md.bak +0 -191
  85. package/docs/ROADMAP.md +0 -364
  86. package/docs/SPEC-rectification.md +0 -0
  87. package/docs/SPEC.md +0 -324
  88. package/docs/US-001-plugin-loading-verification.md +0 -152
  89. package/docs/adr/ADR-005-implementation-plan.md +0 -655
  90. package/docs/adr/ADR-005-pipeline-re-architecture.md +0 -464
  91. package/docs/architecture-analysis.md +0 -1076
  92. package/docs/bugs/BUG-21-escalation-null-attempts.md +0 -48
  93. package/docs/bugs-from-dogfood-run-c.md +0 -243
  94. package/docs/code-review-20260228.md +0 -612
  95. package/docs/code-review-v0.15.0.md +0 -629
  96. package/docs/hook-lifecycle-test-plan.md +0 -149
  97. package/docs/releases/v0.11.0-and-earlier.md +0 -20
  98. package/docs/releases/v0.12.0.md +0 -15
  99. package/docs/releases/v0.13.0.md +0 -14
  100. package/docs/releases/v0.14.0.md +0 -20
  101. package/docs/releases/v0.14.1.md +0 -36
  102. package/docs/releases/v0.14.2.md +0 -51
  103. package/docs/releases/v0.14.3.md +0 -174
  104. package/docs/releases/v0.14.4.md +0 -94
  105. package/docs/releases/v0.15.0.md +0 -502
  106. package/docs/releases/v0.15.1.md +0 -170
  107. package/docs/releases/v0.15.3.md +0 -193
  108. package/docs/specs/bug-039-orphan-processes.md +0 -131
  109. package/docs/specs/bug-040-review-rectification.md +0 -82
  110. package/docs/specs/bug-041-cross-story-test-isolation.md +0 -88
  111. package/docs/specs/bug-042-verifier-failure-capture.md +0 -117
  112. package/docs/specs/bun-pty-migration.md +0 -171
  113. package/docs/specs/central-run-registry.md +0 -116
  114. package/docs/specs/feat-010-smart-runner-git-history.md +0 -96
  115. package/docs/specs/feat-011-file-context-strategy.md +0 -73
  116. package/docs/specs/feat-012-tdd-writer-tier.md +0 -79
  117. package/docs/specs/feat-013-test-after-review.md +0 -89
  118. package/docs/specs/feat-014-heartbeat-observability.md +0 -127
  119. package/docs/specs/status-file-consolidation.md +0 -93
  120. package/docs/specs/status-file-v0.10.1.md +0 -812
  121. package/docs/specs/trigger-completion.md +0 -145
  122. package/docs/specs/verification-architecture-v2.md +0 -343
  123. package/docs/tdd/strategies.md +0 -97
  124. package/docs/v0.10-global-config.md +0 -206
  125. package/docs/v0.10-plugin-system.md +0 -415
  126. package/docs/v0.10-prompt-optimizer.md +0 -234
  127. package/docs/v0.3-spec.md +0 -244
  128. package/docs/v0.4-spec.md +0 -140
  129. package/docs/v0.5-spec.md +0 -237
  130. package/docs/v0.6-spec.md +0 -371
  131. package/docs/v0.7-spec.md +0 -177
  132. package/docs/v0.8-llm-routing.md +0 -206
  133. package/docs/v0.8-structured-logging.md +0 -132
  134. package/docs/v0.9.3-prompt-audit.md +0 -112
  135. package/examples/plugins/console-reporter/index.test.ts +0 -207
  136. package/examples/plugins/console-reporter/index.ts +0 -110
  137. package/memory/topic/feat-010-baseref.md +0 -28
  138. package/memory/topic/feat-013-test-after-deprecation.md +0 -22
  139. package/nax/config.json +0 -154
  140. package/nax/features/bug-039-medium/prd.json +0 -45
  141. package/nax/features/bugfix-v0171/prd.json +0 -52
  142. package/nax/features/central-run-registry/prd.json +0 -105
  143. package/nax/features/config-management/prd.json +0 -108
  144. package/nax/features/config-management/progress.txt +0 -5
  145. package/nax/features/diagnose/acceptance.test.ts +0 -414
  146. package/nax/features/diagnose/prd.json +0 -41
  147. package/nax/features/nax-compliance/prd.json +0 -52
  148. package/nax/features/nax-compliance/progress.txt +0 -1
  149. package/nax/features/orchestration-fixes/prd.json +0 -89
  150. package/nax/features/orchestration-fixes/progress.txt +0 -1
  151. package/nax/features/plugin-integration/US-007-VERIFICATION.md +0 -259
  152. package/nax/features/plugin-integration/prd.json +0 -208
  153. package/nax/features/plugin-integration/progress.txt +0 -5
  154. package/nax/features/post-rearch-bugfix/prd.json +0 -137
  155. package/nax/features/precheck/prd.json +0 -205
  156. package/nax/features/precheck/progress.txt +0 -15
  157. package/nax/features/review-quality/prd.json +0 -55
  158. package/nax/features/routing-persistence/prd.json +0 -104
  159. package/nax/features/routing-persistence/progress.txt +0 -1
  160. package/nax/features/smart-test-runner/plan.md +0 -7
  161. package/nax/features/smart-test-runner/prd.json +0 -203
  162. package/nax/features/smart-test-runner/progress.txt +0 -13
  163. package/nax/features/smart-test-runner/spec.md +0 -7
  164. package/nax/features/smart-test-runner/tasks.md +0 -8
  165. package/nax/features/status-file-consolidation/prd.json +0 -106
  166. package/nax/features/structured-logging/prd.json +0 -199
  167. package/nax/features/trigger-completion/prd.json +0 -150
  168. package/nax/features/trigger-completion/progress.txt +0 -7
  169. package/nax/features/unlock/prd.json +0 -36
  170. package/nax/features/v0.18.3-execution-reliability/prd.json +0 -80
  171. package/nax/features/v0.18.3-execution-reliability/progress.txt +0 -3
  172. package/nax/features/v0.19.0-hardening/plan.md +0 -7
  173. package/nax/features/v0.19.0-hardening/prd.json +0 -84
  174. package/nax/features/v0.19.0-hardening/progress.txt +0 -7
  175. package/nax/features/v0.19.0-hardening/spec.md +0 -18
  176. package/nax/features/v0.19.0-hardening/tasks.md +0 -8
  177. package/nax/features/verify-v2/prd.json +0 -79
  178. package/nax/features/verify-v2/progress.txt +0 -3
  179. package/nax/status.json +0 -36
  180. package/test/COVERAGE-GAPS.md +0 -333
  181. package/test/e2e/cm-003-default-view.test.ts +0 -195
  182. package/test/e2e/plan-analyze-run.test.ts +0 -902
  183. package/test/helpers/helpers.test.ts +0 -295
  184. package/test/helpers/timeout.ts +0 -42
  185. package/test/integration/US-002-TEST-SUMMARY.md +0 -107
  186. package/test/integration/US-003-TEST-SUMMARY.md +0 -149
  187. package/test/integration/US-004-TEST-SUMMARY.md +0 -106
  188. package/test/integration/US-005-TEST-SUMMARY.md +0 -138
  189. package/test/integration/US-007-TEST-SUMMARY.md +0 -100
  190. package/test/integration/cli/agent-validation.test.ts +0 -439
  191. package/test/integration/cli/cli-config-default-edge-cases.test.ts +0 -223
  192. package/test/integration/cli/cli-config-default-view.test.ts +0 -230
  193. package/test/integration/cli/cli-config-diff.test.ts +0 -461
  194. package/test/integration/cli/cli-config.test.ts +0 -737
  195. package/test/integration/cli/cli-diagnose.test.ts +0 -595
  196. package/test/integration/cli/cli-logs.test.ts +0 -346
  197. package/test/integration/cli/cli-plugins.test.ts +0 -679
  198. package/test/integration/cli/cli-precheck.test.ts +0 -372
  199. package/test/integration/cli/cli-run-headless.test.ts +0 -174
  200. package/test/integration/cli/cli.test.ts +0 -76
  201. package/test/integration/cli/precheck-integration.test.ts +0 -476
  202. package/test/integration/cli/precheck-orchestrator.test.ts +0 -247
  203. package/test/integration/cli/precheck.test.ts +0 -806
  204. package/test/integration/config/config-loader.test.ts +0 -266
  205. package/test/integration/config/config.test.ts +0 -444
  206. package/test/integration/config/merger.test.ts +0 -466
  207. package/test/integration/config/paths.test.ts +0 -52
  208. package/test/integration/config/security-loader.test.ts +0 -83
  209. package/test/integration/context/context-integration.test.ts +0 -703
  210. package/test/integration/context/context-path-security.test.ts +0 -173
  211. package/test/integration/context/context-provider-injection.test.ts +0 -507
  212. package/test/integration/context/context-verification-integration.test.ts +0 -296
  213. package/test/integration/context/s5-greenfield-fallback.test.ts +0 -298
  214. package/test/integration/execution/execution-isolation.test.ts +0 -143
  215. package/test/integration/execution/execution.test.ts +0 -634
  216. package/test/integration/execution/feature-status-write.test.ts +0 -302
  217. package/test/integration/execution/parallel.test.ts +0 -251
  218. package/test/integration/execution/prd-pause.test.ts +0 -205
  219. package/test/integration/execution/prd-resolvers.test.ts +0 -186
  220. package/test/integration/execution/progress.test.ts +0 -34
  221. package/test/integration/execution/runner-batching.test.ts +0 -682
  222. package/test/integration/execution/runner-config-plugins.test.ts +0 -462
  223. package/test/integration/execution/runner-escalation.test.ts +0 -561
  224. package/test/integration/execution/runner-fixes.test.ts +0 -400
  225. package/test/integration/execution/runner-plugin-integration.test.ts +0 -544
  226. package/test/integration/execution/runner-queue-and-attempts.test.ts +0 -476
  227. package/test/integration/execution/status-file-integration.test.ts +0 -289
  228. package/test/integration/execution/status-file.test.ts +0 -380
  229. package/test/integration/execution/status-writer.test.ts +0 -447
  230. package/test/integration/execution/story-id-in-events.test.ts +0 -274
  231. package/test/integration/interaction/interaction-chain-pipeline.test.ts +0 -476
  232. package/test/integration/pipeline/hooks.test.ts +0 -363
  233. package/test/integration/pipeline/pipeline-acceptance.test.ts +0 -303
  234. package/test/integration/pipeline/pipeline-events.test.ts +0 -476
  235. package/test/integration/pipeline/pipeline.test.ts +0 -660
  236. package/test/integration/pipeline/reporter-lifecycle.test.ts +0 -862
  237. package/test/integration/pipeline/verify-stage.test.ts +0 -286
  238. package/test/integration/plan/analyze-integration.test.ts +0 -262
  239. package/test/integration/plan/analyze-scanner.test.ts +0 -132
  240. package/test/integration/plan/logger.test.ts +0 -461
  241. package/test/integration/plan/plan.test.ts +0 -157
  242. package/test/integration/plugins/config-integration.test.ts +0 -173
  243. package/test/integration/plugins/config-resolution.test.ts +0 -523
  244. package/test/integration/plugins/loader.test.ts +0 -644
  245. package/test/integration/plugins/plugins-registry.test.ts +0 -747
  246. package/test/integration/plugins/validator.test.ts +0 -564
  247. package/test/integration/review/review-config-commands.test.ts +0 -320
  248. package/test/integration/review/review-config-schema.test.ts +0 -117
  249. package/test/integration/review/review-plugin-integration.test.ts +0 -729
  250. package/test/integration/review/review.test.ts +0 -150
  251. package/test/integration/routing/plugin-routing-advanced.test.ts +0 -461
  252. package/test/integration/routing/plugin-routing-core.test.ts +0 -527
  253. package/test/integration/routing/routing-stage-bug-021.test.ts +0 -275
  254. package/test/integration/routing/routing-stage-greenfield.test.ts +0 -287
  255. package/test/integration/tdd/tdd-cleanup.test.ts +0 -246
  256. package/test/integration/tdd/tdd-orchestrator-core.test.ts +0 -565
  257. package/test/integration/tdd/tdd-orchestrator-failureCategory.test.ts +0 -355
  258. package/test/integration/tdd/tdd-orchestrator-fallback.test.ts +0 -311
  259. package/test/integration/tdd/tdd-orchestrator-lite.test.ts +0 -289
  260. package/test/integration/tdd/tdd-orchestrator-prompts.test.ts +0 -260
  261. package/test/integration/tdd/tdd-orchestrator-verdict.test.ts +0 -536
  262. package/test/integration/tmp/headless-test/test.jsonl +0 -30
  263. package/test/integration/verification/test-scanner.test.ts +0 -403
  264. package/test/integration/verification/verification-asset-check.test.ts +0 -143
  265. package/test/integration/worktree/manager.test.ts +0 -218
  266. package/test/integration/worktree/worktree-merge.test.ts +0 -341
  267. package/test/manual/logging-formatter-demo.ts +0 -158
  268. package/test/ui/tui-agent-panel.test.tsx +0 -99
  269. package/test/ui/tui-pty-integration.test.tsx +0 -146
  270. package/test/unit/acceptance.test.ts +0 -187
  271. package/test/unit/agent-stderr-capture.test.ts +0 -147
  272. package/test/unit/agents/claude.test.ts +0 -107
  273. package/test/unit/analyze-classifier.test.ts +0 -216
  274. package/test/unit/analyze.test.ts +0 -224
  275. package/test/unit/auto-detect.test.ts +0 -250
  276. package/test/unit/cli-status-project-level.test.ts +0 -283
  277. package/test/unit/cli-status.test.ts +0 -418
  278. package/test/unit/commands/common.test.ts +0 -321
  279. package/test/unit/commands/logs.test.ts +0 -458
  280. package/test/unit/commands/runs.test.ts +0 -303
  281. package/test/unit/commands/unlock.test.ts +0 -320
  282. package/test/unit/config/defaults.test.ts +0 -70
  283. package/test/unit/config/quality-commands-schema.test.ts +0 -72
  284. package/test/unit/config/regression-gate-schema.test.ts +0 -160
  285. package/test/unit/config/smart-runner-flag.test.ts +0 -250
  286. package/test/unit/constitution-generators.test.ts +0 -161
  287. package/test/unit/constitution.test.ts +0 -210
  288. package/test/unit/context/context-autodetect.test.ts +0 -297
  289. package/test/unit/context/context-build.test.ts +0 -575
  290. package/test/unit/context/context-coverage.test.ts +0 -236
  291. package/test/unit/context/context-error.test.ts +0 -93
  292. package/test/unit/context/context-estimate-tokens.test.ts +0 -201
  293. package/test/unit/context/context-format.test.ts +0 -302
  294. package/test/unit/context/context-isolation.test.ts +0 -267
  295. package/test/unit/context/context-sort.test.ts +0 -93
  296. package/test/unit/context/context-story.test.ts +0 -108
  297. package/test/unit/context/prior-failures.test.ts +0 -463
  298. package/test/unit/context.test.ts +0 -1726
  299. package/test/unit/cost.test.ts +0 -231
  300. package/test/unit/crash-recovery.test.ts +0 -309
  301. package/test/unit/escalation.test.ts +0 -127
  302. package/test/unit/execution/lifecycle/run-completion.test.ts +0 -240
  303. package/test/unit/execution/lifecycle/run-regression.test.ts +0 -420
  304. package/test/unit/execution/pid-registry.test.ts +0 -241
  305. package/test/unit/execution/sequential-executor.test.ts +0 -235
  306. package/test/unit/execution/sfc-004-dead-code-cleanup.test.ts +0 -89
  307. package/test/unit/execution/structured-failure.test.ts +0 -415
  308. package/test/unit/execution-logging-stderr.test.ts +0 -157
  309. package/test/unit/execution-stage.test.ts +0 -123
  310. package/test/unit/fix-generator.test.ts +0 -276
  311. package/test/unit/formatters.test.ts +0 -468
  312. package/test/unit/greenfield.test.ts +0 -180
  313. package/test/unit/hooks/shell-security.test.ts +0 -40
  314. package/test/unit/interaction/auto-plugin.test.ts +0 -162
  315. package/test/unit/interaction/human-review-trigger.test.ts +0 -165
  316. package/test/unit/interaction-network-failures.test.ts +0 -390
  317. package/test/unit/interaction-plugins.test.ts +0 -472
  318. package/test/unit/logging/formatter.test.ts +0 -456
  319. package/test/unit/merge.test.ts +0 -269
  320. package/test/unit/metrics/aggregator.test.ts +0 -164
  321. package/test/unit/metrics/tracker.test.ts +0 -186
  322. package/test/unit/metrics.test.ts +0 -276
  323. package/test/unit/optimizer/noop.optimizer.test.ts +0 -125
  324. package/test/unit/optimizer/rule-based.optimizer.test.ts +0 -358
  325. package/test/unit/pipeline/event-bus.test.ts +0 -105
  326. package/test/unit/pipeline/routing-partial-override.test.ts +0 -121
  327. package/test/unit/pipeline/runner-retry.test.ts +0 -89
  328. package/test/unit/pipeline/stages/autofix.test.ts +0 -97
  329. package/test/unit/pipeline/stages/completion-review-gate.test.ts +0 -218
  330. package/test/unit/pipeline/stages/execution-ambiguity.test.ts +0 -311
  331. package/test/unit/pipeline/stages/execution-merge-conflict.test.ts +0 -218
  332. package/test/unit/pipeline/stages/rectify.test.ts +0 -101
  333. package/test/unit/pipeline/stages/regression-stage.test.ts +0 -69
  334. package/test/unit/pipeline/stages/review.test.ts +0 -201
  335. package/test/unit/pipeline/stages/routing-idempotence.test.ts +0 -139
  336. package/test/unit/pipeline/stages/routing-initial-complexity.test.ts +0 -321
  337. package/test/unit/pipeline/stages/routing-persistence.test.ts +0 -380
  338. package/test/unit/pipeline/stages/verify.test.ts +0 -267
  339. package/test/unit/pipeline/subscribers/events-writer.test.ts +0 -227
  340. package/test/unit/pipeline/subscribers/hooks.test.ts +0 -84
  341. package/test/unit/pipeline/subscribers/interaction.test.ts +0 -313
  342. package/test/unit/pipeline/subscribers/registry.test.ts +0 -149
  343. package/test/unit/pipeline/subscribers/reporters.test.ts +0 -90
  344. package/test/unit/pipeline/verify-smart-runner.test.ts +0 -345
  345. package/test/unit/prd-auto-default.test.ts +0 -291
  346. package/test/unit/prd-failure-category.test.ts +0 -177
  347. package/test/unit/prd-get-next-story.test.ts +0 -215
  348. package/test/unit/precheck-checks.test.ts +0 -841
  349. package/test/unit/precheck-story-size-gate.test.ts +0 -288
  350. package/test/unit/precheck-types.test.ts +0 -143
  351. package/test/unit/prompts.test.ts +0 -476
  352. package/test/unit/queue.test.ts +0 -237
  353. package/test/unit/rectification.test.ts +0 -285
  354. package/test/unit/registry.test.ts +0 -288
  355. package/test/unit/review/runner.test.ts +0 -117
  356. package/test/unit/routing/content-hash.test.ts +0 -99
  357. package/test/unit/routing/routing-stability.test.ts +0 -208
  358. package/test/unit/routing/strategies/llm.test.ts +0 -306
  359. package/test/unit/routing-advanced.test.ts +0 -313
  360. package/test/unit/routing-core.test.ts +0 -341
  361. package/test/unit/routing-strategies.test.ts +0 -440
  362. package/test/unit/storyid-events.test.ts +0 -213
  363. package/test/unit/tdd-verdict.test.ts +0 -492
  364. package/test/unit/test-output-parser.test.ts +0 -377
  365. package/test/unit/ui/tui-controls.test.ts +0 -335
  366. package/test/unit/ui/tui-cost-and-pty.test.ts +0 -190
  367. package/test/unit/ui/tui-layout.test.ts +0 -379
  368. package/test/unit/ui/tui-stories.test.ts +0 -333
  369. package/test/unit/unit-isolation.test.ts +0 -135
  370. package/test/unit/utils/git.test.ts +0 -50
  371. package/test/unit/utils/path-security.test.ts +0 -47
  372. package/test/unit/utils-helpers.test.ts +0 -318
  373. package/test/unit/verdict.test.ts +0 -325
  374. package/test/unit/verification/orchestrator-types.test.ts +0 -54
  375. package/test/unit/verification/orchestrator.test.ts +0 -66
  376. package/test/unit/verification/smart-runner-config.test.ts +0 -163
  377. package/test/unit/verification/smart-runner-discovery.test.ts +0 -354
  378. package/test/unit/verification/smart-runner.test.ts +0 -262
  379. package/test/unit/verification/strategies/acceptance.test.ts +0 -33
  380. package/test/unit/verification/strategies/regression.test.ts +0 -87
  381. package/test/unit/verification/strategies/scoped.test.ts +0 -100
  382. package/test/unit/worktree-manager.test.ts +0 -159
  383. package/tsconfig.json +0 -27
@@ -1,56 +0,0 @@
- # Fix Plan: Context Builder Review Findings
- **Date:** 2026-02-16
- **Branch:** main (local commits only)
-
- ## Phase 1: DRY + Token Estimation + Dead Config
-
- ### Fix 1: Extract context building helper in runner.ts
- **File:** `src/execution/runner.ts`
- **Impact:** Code duplication between TDD and single-session paths
- **Change:** Extract the duplicated 8-line context building block (build + log) into a helper function like `maybeGetContext(story, config, useContext)` that returns `string | undefined`. Call it from both TDD and single-session branches.
-
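A minimal sketch of the helper Fix 1 describes, under stated assumptions: the `UserStory` and `NaxConfig` shapes, the `maxContextChars` field, the stand-in `buildStoryContext` body, and the optional `log` parameter are all hypothetical, since the plan only names `maybeGetContext(story, config, useContext)` and its `string | undefined` return type.

```typescript
// Hypothetical stand-ins for the real types; only the fields used here.
interface UserStory {
  id: string;
  title: string;
}
interface NaxConfig {
  maxContextChars: number; // assumed field, not from the plan
}

// Hypothetical builder; the real one is not shown in the plan.
function buildStoryContext(story: UserStory, config: NaxConfig): string {
  return `# ${story.id}: ${story.title}`.slice(0, config.maxContextChars);
}

// Build and log the context in one place so the TDD and
// single-session branches can share the same call.
function maybeGetContext(
  story: UserStory,
  config: NaxConfig,
  useContext: boolean,
  log: (msg: string) => void = console.log, // assumed parameter for testability
): string | undefined {
  if (!useContext) return undefined;
  const context = buildStoryContext(story, config);
  if (context.length === 0) return undefined;
  log(`context built for ${story.id} (${context.length} chars)`);
  return context;
}
```

Both branches would then call `maybeGetContext(story, config, useContext)` and pass the result through unchanged, so an absent context stays `undefined` rather than an empty string.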
- ### Fix 2: Token estimation ratio
- **File:** `src/context/builder.ts`
- **Impact:** Budget underestimates for code (1:4 is too generous; code averages closer to 1:3)
- **Change:** Change `estimateTokens()` from `Math.ceil(text.length / 4)` to `Math.ceil(text.length / 3)`. Update any test assertions that check specific token counts.
-
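To make the effect of Fix 2 concrete, here is a small sketch of both ratios side by side. The `charsPerToken` parameter is an assumption added for the comparison; the function in the plan takes only `text`.

```typescript
// Estimate tokens from character count.
// charsPerToken is a hypothetical parameter: 4 is the old ratio, 3 the proposed one.
function estimateTokens(text: string, charsPerToken: number = 3): number {
  return Math.ceil(text.length / charsPerToken);
}

const source = "x".repeat(1200); // stand-in for a 1,200-character source file
const oldEstimate = estimateTokens(source, 4); // 300
const newEstimate = estimateTokens(source, 3); // 400
```

For the same text the new ratio reports a third more tokens, so any assertion pinned to the old counts has to be updated, as the plan notes.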
- ### Fix 3: Remove dead config paths
- **File:** `src/context/types.ts`, `src/context/builder.ts`, `src/execution/runner.ts`
- **Impact:** `includeConfig` and `includeDependencies` in `ContextBuilderConfig` are set but never read in `buildContext()`
- **Change:** Remove `includeConfig` and `includeDependencies` from `ContextBuilderConfig`. Remove them from the config object in `buildStoryContext()`. If dependencies should be used, wire them — otherwise remove the dead path. The `dependency` type in `ContextElement` can stay for future use.
-
- ## Phase 2: Wire UserStory fields for real context
-
- ### Fix 4: Add optional context fields to UserStory
- **File:** `src/prd/types.ts`
- **Change:** Add to `UserStory` interface:
- ```ts
- /** Relevant source files for context injection */
- relevantFiles?: string[];
- /** Prior error messages from failed attempts */
- priorErrors?: string[];
- /** Custom context strings */
- customContext?: string[];
- ```
-
- ### Fix 5: Wire UserStory fields into buildStoryContext
- **File:** `src/execution/runner.ts`
- **Change:** Replace the hardcoded empty arrays in `buildStoryContext()` with actual values from `story`:
- ```ts
- relevantFiles: story.relevantFiles || [],
- priorErrors: story.priorErrors,
- customContext: story.customContext,
- ```
-
- ### Fix 6: Populate priorErrors on retry
- **File:** `src/execution/runner.ts`
- **Change:** When a story fails and gets retried (escalation), capture the error/failure reason and push it to `story.priorErrors` so the next attempt gets context about what went wrong.
-
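Fix 6 could look roughly like the sketch below. The `AttemptResult` shape and the `recordFailure` name are hypothetical; only the `priorErrors` field on `UserStory` comes from the plan.

```typescript
// Subset of the UserStory interface from Fix 4.
interface UserStory {
  id: string;
  priorErrors?: string[];
}

// Hypothetical result shape; the runner's real result type is not shown.
interface AttemptResult {
  success: boolean;
  error?: string;
}

// Append the failure reason so the next (escalated) attempt sees it in context.
function recordFailure(story: UserStory, result: AttemptResult): void {
  if (result.success) return;
  const reason = result.error ?? "unknown failure";
  story.priorErrors = [...(story.priorErrors ?? []), reason];
}
```

Because `priorErrors` is optional, the helper creates the array on first failure and leaves successful attempts untouched.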
- ## Test Strategy
- - Mode: test-after
- - Run: `bun test` after each phase
- - Update existing assertions in `test/context.test.ts` and `test/context-integration.test.ts` for token ratio change
-
- ## Commits
- - Phase 1: `refactor: DRY context helper, fix token estimation, remove dead config`
- - Phase 2: `feat: wire UserStory context fields into context builder`
@@ -1,208 +0,0 @@
- # Relentless vs ngent — Architecture Comparison & Port Recommendations
-
- **Date:** 2026-02-16
- **Relentless version:** v0.8.0 (~18,872 LOC src, 72 source files)
- **ngent version:** v0.1 (~2,799 LOC src, 33 source files)
-
- ## Executive Summary
-
- Relentless is a mature, feature-rich orchestrator with 6 agent adapters, a sophisticated routing system, and a polished TUI. ngent is a focused proof-of-concept with stronger TDD enforcement and simpler architecture. Rather than trying to match Relentless feature-for-feature, ngent should **selectively port the highest-impact features** while preserving its architectural advantages.
-
- **Bottom line:** Port 5 features from Relentless. Skip 4. Keep 3 ngent strengths.
-
- ---
-
- ## Feature-by-Feature Comparison
-
- ### 1. Agent Adapters
-
- | | Relentless | ngent |
- |:---|:---|:---|
- | **Agents** | 6 (Claude, Codex, Droid, OpenCode, Amp, Gemini) | 1 (Claude Code) |
- | **LOC** | ~1,800 across 8 files | ~180 in 1 file |
- | **Rate limit detection** | Yes (per-agent patterns) | No |
- | **Cross-agent fallback** | Yes (cascade system) | No |
-
- **Verdict:** ngent's single-agent focus is correct for v0.2. Multi-agent only matters when rate-limited or cost-optimizing with free models. **Port rate limit detection only** — cross-agent fallback is premature.
-
- ### 2. Routing & Classification
-
- | | Relentless | ngent |
- |:---|:---|:---|
- | **Model matrix** | 4x4 (free/cheap/good/genius x simple/medium/complex/expert) | 2-tier (haiku/sonnet) |
- | **Classifier** | Hybrid (regex heuristic + LLM fallback at <0.8 confidence) | Tag-based + complexity field |
- | **Confidence scoring** | Yes (0.0-1.0) | No |
- | **Cost estimation** | Per-model token estimates | Per-tier fixed estimates |
-
- **Verdict:** The 4-mode matrix is overkill for Claude-only. But **expanding to 3 tiers** (haiku/sonnet/opus) with the hybrid classifier approach would improve routing accuracy. The confidence-based LLM fallback is clever — worth porting.
-
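The hybrid approach described in the verdict can be sketched as follows. Everything here is illustrative: the keyword rules, their confidence values, the `sonnet` default, and the `askLlm` callback are assumptions, and neither project's actual classifier is shown in the comparison. Only the three tiers, the 0.0-1.0 confidence scale, and the 0.8 fallback threshold come from the text.

```typescript
type Tier = "haiku" | "sonnet" | "opus";

interface Classification {
  tier: Tier;
  confidence: number; // 0.0 to 1.0
  source: "heuristic" | "llm";
}

// Hypothetical keyword rules; the real patterns are not given.
const RULES: Array<{ pattern: RegExp; tier: Tier; confidence: number }> = [
  { pattern: /\b(security|migration|architecture)\b/i, tier: "opus", confidence: 0.9 },
  { pattern: /\b(typo|rename|docs?)\b/i, tier: "haiku", confidence: 0.9 },
];

// Cheap first pass: first matching rule wins.
function classifyByHeuristic(text: string): Classification {
  for (const rule of RULES) {
    if (rule.pattern.test(text)) {
      return { tier: rule.tier, confidence: rule.confidence, source: "heuristic" };
    }
  }
  // No rule matched: middle tier with low confidence, which triggers the fallback.
  return { tier: "sonnet", confidence: 0.5, source: "heuristic" };
}

// Consult the LLM only when the heuristic is below the threshold.
async function classify(
  text: string,
  askLlm: (text: string) => Promise<Tier>, // hypothetical LLM call
  threshold: number = 0.8,
): Promise<Classification> {
  const guess = classifyByHeuristic(text);
  if (guess.confidence >= threshold) return guess;
  // Treating the LLM answer as confidence 1 is a simplification of this sketch.
  return { tier: await askLlm(text), confidence: 1, source: "llm" };
}
```

The design point is that the LLM call, which costs tokens, only happens for stories the regex pass cannot place confidently.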
- ### 3. Cascade & Escalation
-
- | | Relentless | ngent |
- |:---|:---|:---|
- | **Escalation path** | Configurable chain (e.g., haiku -> sonnet -> opus) | Simple tier bump |
- | **Per-attempt cost tracking** | Yes (Zod schema, actual cost per step) | Partial (estimates only) |
- | **Blocked detection** | Yes (marks stories as blocked with reason) | No |
- | **Max attempts** | Configurable | Hardcoded (3) |
-
- **Verdict:** ngent already has basic escalation. **Port per-attempt cost tracking and blocked detection.** Configurable max attempts is a quick win.
-
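A rough sketch of how a configurable chain, per-attempt cost records, and blocked detection might fit together. All type and function names are hypothetical, and `attempt` is synchronous here only to keep the example short; a real agent call would be asynchronous.

```typescript
type Tier = "haiku" | "sonnet" | "opus";

interface EscalationConfig {
  chain: Tier[]; // e.g. haiku -> sonnet -> opus
  maxAttempts: number; // configurable instead of a hardcoded 3
}

interface AttemptRecord {
  tier: Tier;
  costUsd: number;
  success: boolean;
}

type Outcome =
  | { status: "passed"; attempts: AttemptRecord[] }
  | { status: "blocked"; reason: string; attempts: AttemptRecord[] };

function runWithEscalation(
  config: EscalationConfig,
  attempt: (tier: Tier) => { success: boolean; costUsd: number },
): Outcome {
  if (config.chain.length === 0) throw new Error("escalation chain is empty");
  const attempts: AttemptRecord[] = [];
  for (let i = 0; i < config.maxAttempts; i++) {
    // Once the chain is exhausted, keep retrying on its last tier.
    const tier = config.chain[Math.min(i, config.chain.length - 1)];
    const result = attempt(tier);
    attempts.push({ tier, costUsd: result.costUsd, success: result.success });
    if (result.success) return { status: "passed", attempts };
  }
  // Out of attempts: mark the story blocked, with a reason, instead of failing silently.
  return {
    status: "blocked",
    reason: `failed after ${config.maxAttempts} attempts`,
    attempts,
  };
}
```

Keeping every `AttemptRecord` means the total spend per story is just the sum of `costUsd`, including the failed cheaper attempts.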
50
- ### 4. Context Optimization
51
-
52
- | | Relentless | ngent |
53
- |:---|:---|:---|
54
- | **Approach** | Extract current story + dependencies from tasks.md | Token-budgeted element builder |
55
- | **LOC** | 417 (single file, focused) | 276 (builder.ts) + integration |
56
- | **Story extraction** | Regex-based section parsing from tasks.md | Generic element types (file/config/error/custom) |
57
- | **Checklist filtering** | Tag-based ([US-XXX], [Constitution], [Edge Case]) | Not implemented |
58
- | **Progress summary** | Auto-generated ("5/18 stories complete") | Not implemented |
59
- | **Token savings claim** | ~84% | Not measured yet |
60
-
61
- **Verdict:** Relentless's context builder is more practical — it works with the actual tasks.md format. ngent's is more generic but currently inert (relevantFiles empty until PRD populates them). **Port the story extraction approach** — parse the PRD/tasks to inject only the current story + dependencies into each session prompt. This is the #1 cost saver.
62
-
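Story-scoped extraction could be approximated as below. The tasks.md layout is an assumption: this sketch expects each story to start at a heading of the form `### US-001: Title`, which the comparison does not confirm, and the dependency IDs are passed in rather than parsed.

```typescript
// Split a tasks file into sections keyed by story ID.
// Assumed layout: each story begins at a "### US-<number>" heading.
function splitStories(tasksMd: string): Map<string, string> {
  const sections = new Map<string, string>();
  // Zero-width split: cut before every story heading, keeping the heading.
  const parts = tasksMd.split(/^(?=### US-\d+)/m);
  for (const part of parts) {
    const match = /^### (US-\d+)/.exec(part);
    if (match) sections.set(match[1], part.trim());
  }
  return sections;
}

// Return only the current story followed by its dependencies.
function extractStoryContext(
  tasksMd: string,
  storyId: string,
  dependencyIds: string[],
): string {
  const sections = splitStories(tasksMd);
  return [storyId, ...dependencyIds]
    .map((id) => sections.get(id))
    .filter((section): section is string => section !== undefined)
    .join("\n\n");
}
```

Unrelated stories never reach the prompt, which is where the claimed token savings would come from; the exact percentage depends on the file and is not something this sketch measures.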
63
- ### 5. Queue & Mid-Run Control
64
-
65
- | | Relentless | ngent |
66
- |:---|:---|:---|
67
- | **Queue file** | Custom format with commands | `.queue.txt` with cursor marker |
68
- | **Commands** | PAUSE, ABORT, SKIP, RETRY | None (read-only) |
69
- | **File locking** | Yes (lock.ts) | No |
70
- | **Hot reload** | Yes (watches file changes) | Yes (reads between stories) |
71
-
72
- **Verdict:** PAUSE/ABORT/SKIP are essential for production use. **Port queue commands.** File locking is nice-to-have.
73
-
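One possible shape for the three recommended commands, read between stories. The command syntax is an assumption (one command per line, `SKIP` followed by a story ID); the comparison names the commands but not the file format, and the precedence order in `nextAction` is a choice made for this sketch.

```typescript
type QueueCommand =
  | { kind: "PAUSE" }
  | { kind: "ABORT" }
  | { kind: "SKIP"; storyId: string };

// Assumed syntax: one command per line, e.g. "PAUSE", "ABORT", "SKIP US-003".
// Unrecognized lines are ignored so the file can hold other content.
function parseQueueCommands(fileText: string): QueueCommand[] {
  const commands: QueueCommand[] = [];
  for (const raw of fileText.split(/\r?\n/)) {
    const line = raw.trim();
    if (line === "PAUSE") {
      commands.push({ kind: "PAUSE" });
    } else if (line === "ABORT") {
      commands.push({ kind: "ABORT" });
    } else {
      const skip = /^SKIP\s+(\S+)$/.exec(line);
      if (skip) commands.push({ kind: "SKIP", storyId: skip[1] });
    }
  }
  return commands;
}

// Decide what to do before starting the next story.
// ABORT wins over PAUSE, which wins over SKIP.
function nextAction(
  commands: QueueCommand[],
  nextStoryId: string,
): "run" | "skip" | "pause" | "abort" {
  if (commands.some((c) => c.kind === "ABORT")) return "abort";
  if (commands.some((c) => c.kind === "PAUSE")) return "pause";
  if (commands.some((c) => c.kind === "SKIP" && c.storyId === nextStoryId)) return "skip";
  return "run";
}
```

Checking the commands only between stories matches the existing read-between-stories behaviour and avoids interrupting a session mid-run.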
74
- ### 6. Review System
75
-
76
- | | Relentless | ngent |
77
- |:---|:---|:---|
78
- | **Post-run review** | 6 micro-tasks (typecheck, lint, test, security, quality, docs) | None |
79
- | **Review prompts** | Templated per task type | N/A |
80
- | **Review runner** | Dedicated (337 LOC) | N/A |
81
-
82
- **Verdict:** The 6-phase review is valuable for production but heavy for POC stage. **Port a simplified 3-phase version** (typecheck, test, lint) for v0.3. Security/quality/docs can wait.
83
-
84
- ### 7. TUI
85
-
86
- | | Relentless | ngent |
87
- |:---|:---|:---|
88
- | **Framework** | Ink (React for CLI) with 3-column layout | Console.log |
89
- | **LOC** | ~2,000+ (App.tsx, components/, layouts/, hooks/) | 0 |
90
- | **Live cost tracking** | Yes | No |
91
- | **Story progress** | Visual grid | Text output |
92
-
93
- **Verdict:** Ink TUI is polished but massive investment. **Skip for v0.2.** ngent's planned `node-pty` supervised mode serves the same need with less code. Consider a minimal progress display (single-line status updates) instead.
94
-
95
- ### 8. Configuration
96
-
97
- | | Relentless | ngent |
- |:---|:---|:---|
- | **Schema** | Zod with full validation | Zod with validation |
- | **Constitution** | Versioned constitution system | Not implemented |
- | **Init wizard** | Interactive `init` command | Not implemented |
102
-
103
- **Verdict:** Constitution system is interesting but not critical. **Skip for v0.2.**
104
-
105
- ---
106
-
107
- ## ngent Advantages to Preserve
108
-
109
- ### 1. Three-Session TDD (Architectural Enforcement)
110
- Relentless uses prompt-based TDD ("write tests first, then implement"). ngent uses **session isolation** — separate Claude Code sessions for test writer, implementer, and verifier with git-diff verification. This is architecturally stronger and prevents the "AI writes tests that match its planned implementation" problem.
111
-
112
- **Keep and strengthen.** This is ngent's #1 differentiator.
113
-
114
- ### 2. Actual Cost Tracking
115
- ngent tracks real API costs from Claude Code's output. Relentless estimates costs from token counts. Measured costs are more trustworthy than estimates.
116
-
117
- **Keep.** Extend with per-attempt tracking (from Relentless).
118
-
119
- ### 3. Simplicity (2.8K vs 18.9K LOC)
120
- ngent is 6.7x smaller. Easier to understand, modify, and debug. Every feature port should be evaluated against complexity cost.
121
-
122
- **Keep.** Resist feature bloat. Each port must earn its LOC.
123
-
124
- ---
125
-
126
- ## Port Recommendations (Prioritized)
127
-
128
- ### P0 — Do in v0.2 (High Impact, Moderate Effort)
129
-
130
- | # | Feature | Source | Est. LOC | Impact |
- |:---|:---|:---|:---|:---|
- | 1 | **Story-scoped context extraction** | `execution/context-builder.ts` | ~200 | #1 cost saver (~84% token reduction per session) |
- | 2 | **Queue commands (PAUSE/ABORT/SKIP)** | `queue/processor.ts`, `queue/parser.ts` | ~150 | Essential for production mid-run control |
- | 3 | **3-tier model routing** (haiku/sonnet/opus) | `routing/router.ts` | ~80 | Better cost optimization, already partially exists |
135
-
136
- ### P1 — Do in v0.3 (Medium Impact)
137
-
138
- | # | Feature | Source | Est. LOC | Impact |
- |:---|:---|:---|:---|:---|
- | 4 | **Per-attempt cost tracking** | `routing/cascade.ts` | ~100 | Better cost visibility, debugging |
- | 5 | **Hybrid classifier** (regex + LLM fallback) | `routing/classifier.ts` | ~150 | More accurate complexity routing |
- | 6 | **Simplified review phase** (typecheck + test + lint) | `review/runner.ts`, `review/tasks/` | ~200 | Quality gate before marking story complete |
- | 7 | **Blocked story detection** | `routing/cascade.ts` | ~50 | Avoid infinite retry loops |
144
-
145
- ### P2 — Consider for v0.4+ (Lower Priority)
146
-
147
- | # | Feature | Source | Reason to Defer |
- |:---|:---|:---|:---|
- | 8 | Multi-agent adapters | `agents/*.ts` | Only useful when rate-limited or using free models |
- | 9 | Ink TUI | `tui/` | ~2K LOC investment, node-pty serves similar need |
- | 10 | Constitution system | `config/` | Nice-to-have, not a blocker |
- | 11 | Init wizard | `init/` | One-time convenience, low ROI |
153
-
154
- ### Skip Entirely
155
-
156
- | Feature | Reason |
- |:---|:---|
- | Cross-agent fallback | Claude-only for foreseeable future |
- | File locking for queue | Single-instance execution = no contention |
- | 4x4 mode-model matrix | Overkill for 1 agent; 3-tier sufficient |
161
-
162
- ---
163
-
164
- ## v0.2 Implementation Plan (from P0 items)
165
-
166
- ### Phase 1: Story-Scoped Context (~200 LOC)
167
- Replace the generic context builder with Relentless-style story extraction:
168
- - Parse PRD to get current story + dependency stories
169
- - Inject only those sections into session prompt
170
- - Add progress summary ("Story 5/12, 4 passed")
171
- - Wire into both single-session and TDD paths
172
-
173
- ### Phase 2: Queue Commands (~150 LOC)
174
- Extend `.queue.txt` parser:
175
- - `PAUSE` — pause after current story completes
176
- - `ABORT` — stop immediately, mark remaining as skipped
177
- - `SKIP US-XXX` — skip a specific story
178
- - Check queue between stories (existing hook point)
179
-
180
- ### Phase 3: 3-Tier Model Routing (~80 LOC)
181
- Extend existing config:
182
- - Add `opus` tier for expert-complexity stories
183
- - Update router to support 3 tiers
184
- - Keep backward compat with existing 2-tier configs
185
-
186
- **Estimated total: ~430 LOC added to ngent (15% growth)**
187
-
188
- ---
189
-
190
- ## Metrics Comparison (12-Story Benchmark)
191
-
192
- | Metric | Relentless (estimated*) | ngent v0.1 | ngent v0.2 (projected) |
- |:---|:---|:---|:---|
- | Session overhead | ~2-3 min (context optimization) | ~9 min (full context per session) | ~3-4 min (story-scoped) |
- | Cost (12 stories) | ~$0.40-0.60* | $1.17 | ~$0.50-0.70 |
- | TDD approach | Prompt-based TDD | Session-isolated TDD | Session-isolated TDD |
- | Mid-run control | PAUSE/ABORT/SKIP/RETRY | None | PAUSE/ABORT/SKIP |
- | Review phase | 6 micro-tasks | None | None (P1 for v0.3) |
199
-
200
- *Relentless estimates based on context optimization claims and architecture analysis; no direct benchmark run.
201
-
202
- ---
203
-
204
- ## Conclusion
205
-
206
- ngent v0.2 should focus on the **three P0 features** (context extraction, queue commands, 3-tier routing) which together address the two biggest pain points from benchmarking: **session cost** and **lack of mid-run control**. Combined with the existing TDD enforcement advantage, this would make ngent competitive with Relentless for our use case while staying at ~3.2K LOC (vs 18.9K).
207
-
208
- The key insight: **don't try to be Relentless.** Port the ideas, not the code. ngent's value is in being opinionated and simple — a tool that does one thing well (orchestrate Claude Code with TDD) rather than a platform that supports everything.
@@ -1,136 +0,0 @@
1
- # ngent v0.2 Implementation Plan
2
- **Date:** 2026-02-16
3
- **Branch:** master (local commits only)
4
-
5
- ## Overview
6
- 4 features, 4 phases. ~530 LOC total. Each phase is self-contained.
7
-
8
- ## Phase 1: Story-Scoped Context Extraction (~200 LOC)
9
-
10
- Replace the generic token-budgeted context builder with PRD-aware story extraction.
11
-
12
- ### Changes:
13
- **`src/context/builder.ts`** — Rewrite `buildContext()`:
14
- - Accept the full PRD + current story ID
15
- - Extract only: current story description + AC + dependency stories from PRD
16
- - Add progress summary line: "Progress: 5/12 stories complete (4 passed, 1 failed)"
17
- - Keep token budget enforcement but feed it story-scoped content instead of file reads
18
- - Remove `readFileSafe` and file-based context (not used in practice)
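The rewrite described in the bullets above could look roughly like the following. This is a minimal sketch, not the actual `buildContext()`: the `PRD` and `Story` shapes, the `dependencies` field, and the helper names `progressSummary`, `renderStory`, and `buildStoryContext` are all assumptions made for illustration, and token budget enforcement is left out.

```typescript
// Hypothetical shapes; the real PRD and story types in ngent may differ.
type StoryStatus = "pending" | "passed" | "failed";

interface Story {
  id: string;
  title: string;
  description: string;
  acceptanceCriteria: string[];
  dependencies: string[]; // IDs of stories this one depends on (assumed field)
  status: StoryStatus;
}

interface PRD {
  stories: Story[];
}

// Builds the one-line progress summary in the format quoted in the plan.
function progressSummary(prd: PRD): string {
  const passed = prd.stories.filter((s) => s.status === "passed").length;
  const failed = prd.stories.filter((s) => s.status === "failed").length;
  const total = prd.stories.length;
  return `Progress: ${passed + failed}/${total} stories complete (${passed} passed, ${failed} failed)`;
}

// Renders one story (description plus acceptance criteria) as a prompt section.
function renderStory(heading: string, story: Story): string {
  const criteria = story.acceptanceCriteria.map((ac) => `- ${ac}`).join("\n");
  return `## ${heading}: ${story.id} ${story.title}\n${story.description}\n\nAcceptance criteria:\n${criteria}`;
}

// Returns only the progress line, the current story, and its direct dependencies.
function buildStoryContext(prd: PRD, currentStoryId: string): string {
  const current = prd.stories.find((s) => s.id === currentStoryId);
  if (!current) {
    throw new Error(`Story not found in PRD: ${currentStoryId}`);
  }
  const dependencies = prd.stories.filter((s) => current.dependencies.includes(s.id));
  return [
    progressSummary(prd),
    renderStory("Current story", current),
    ...dependencies.map((dep) => renderStory("Dependency", dep)),
  ].join("\n\n");
}
```

Unrelated stories never reach the prompt, which is where the token saving is expected to come from. A budget check could then trim the dependency sections first if the result is still too large.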
19
-
20
- **`src/context/types.ts`** — Simplify:
21
- - `StoryContext` should accept `PRD` + `currentStoryId` instead of `relevantFiles[]`
22
- - Keep `BuiltContext` and `ContextBudget` as-is
23
- - Remove `ContextBuilderConfig` (over-engineered for what we need)
24
-
25
- **`src/execution/runner.ts`** — Update `maybeGetContext()`:
26
- - Pass the loaded PRD + current story to the new builder
27
- - Remove the `ContextBuilderConfig` construction
28
-
29
- **`test/context.test.ts`** + **`test/context-integration.test.ts`** — Update tests:
30
- - Test story extraction from a sample PRD
31
- - Test dependency story inclusion
32
- - Test progress summary generation
33
- - Remove file-read tests
34
-
35
- **Run:** `bun test`
36
- **Commit:** `feat(context): story-scoped extraction from PRD`
37
-
38
- ## Phase 2: Queue Commands — PAUSE/ABORT/SKIP (~150 LOC)
39
-
40
- Add mid-run control commands to `.queue.txt`.
41
-
42
- ### Changes:
43
- **`src/queue/types.ts`** — Add:
44
- ```ts
- // A queue command is either a bare keyword or a SKIP that targets one story.
- export type QueueCommand = "PAUSE" | "ABORT" | { type: "SKIP"; storyId: string };
- export interface QueueFileResult {
-   commands: QueueCommand[];
-   guidance: string[]; // Non-command lines (existing behavior)
- }
- ```
51
-
52
- **`src/queue/manager.ts`** — Add `parseQueueFile(content: string): QueueFileResult`:
53
- - Parse lines: `PAUSE`, `ABORT`, `SKIP US-XXX` are commands
54
- - Everything else after `--- PENDING ---` is guidance (existing behavior)
55
- - Commands are case-insensitive
56
- - Return both commands and guidance
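The parsing rules above can be sketched as follows. This is an illustrative version only: it assumes commands are recognized on any line of the file, that non-command lines before the `--- PENDING ---` marker are ignored, and that story IDs are kept exactly as typed. None of those details are settled by the plan.

```typescript
// Local copies of the proposed types so the sketch is self-contained.
type QueueCommand = "PAUSE" | "ABORT" | { type: "SKIP"; storyId: string };

interface QueueFileResult {
  commands: QueueCommand[];
  guidance: string[];
}

const PENDING_MARKER = "--- PENDING ---";

function parseQueueFile(content: string): QueueFileResult {
  const commands: QueueCommand[] = [];
  const guidance: string[] = [];
  let afterMarker = false;

  for (const rawLine of content.split(/\r?\n/)) {
    const line = rawLine.trim();
    if (line === "") continue;

    if (line === PENDING_MARKER) {
      afterMarker = true;
      continue;
    }

    // Command keywords are matched case-insensitively.
    const upper = line.toUpperCase();
    if (upper === "PAUSE") {
      commands.push("PAUSE");
      continue;
    }
    if (upper === "ABORT") {
      commands.push("ABORT");
      continue;
    }

    // SKIP takes exactly one argument: the story ID.
    const skipMatch = /^SKIP\s+(\S+)$/i.exec(line);
    const storyId = skipMatch?.[1];
    if (storyId) {
      commands.push({ type: "SKIP", storyId });
      continue;
    }

    // Anything else after the marker is free-form guidance.
    if (afterMarker) guidance.push(line);
  }

  return { commands, guidance };
}
```

A line such as `skip US-003` yields `{ type: "SKIP", storyId: "US-003" }`, while a bare `SKIP` with no story ID falls through and is treated as guidance when it appears after the marker.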
57
-
58
- **`src/execution/runner.ts`** — Check queue between stories:
59
- - After each story completes, read `.queue.txt` and parse
60
- - `PAUSE` → log "⏸️ Paused by user", break loop, return partial results
61
- - `ABORT` → mark remaining stories as skipped, break loop
62
- - `SKIP US-XXX` → set that story's status to "skipped", continue
63
- - Clear processed commands from `.queue.txt` after handling
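One way the runner could act on parsed commands between stories is sketched below. The `StoryState` shape, the `LoopAction` return value, and the function name `applyQueueCommands` are hypothetical. The sketch also assumes that `ABORT` takes precedence over anything queued after it and that only pending stories can be skipped.

```typescript
type QueueCommand = "PAUSE" | "ABORT" | { type: "SKIP"; storyId: string };
type StoryStatus = "pending" | "passed" | "failed" | "skipped";

interface StoryState {
  id: string;
  status: StoryStatus;
}

// What the main loop should do after the commands have been applied.
type LoopAction = "continue" | "pause" | "abort";

function applyQueueCommands(commands: QueueCommand[], stories: StoryState[]): LoopAction {
  let action: LoopAction = "continue";

  for (const command of commands) {
    if (command === "ABORT") {
      // Mark every story that has not run yet as skipped, then stop.
      for (const story of stories) {
        if (story.status === "pending") story.status = "skipped";
      }
      return "abort";
    }

    if (command === "PAUSE") {
      // Keep processing later SKIP commands, but tell the loop to stop afterwards.
      action = "pause";
      continue;
    }

    // Remaining case: SKIP for one specific story.
    const storyId = command.storyId;
    const target = stories.find((story) => story.id === storyId);
    if (target && target.status === "pending") target.status = "skipped";
  }

  return action;
}
```

Returning an action instead of breaking inside the helper keeps the loop control in the runner, so the helper can be unit-tested without spawning any agent sessions.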
64
-
65
- **`test/queue.test.ts`** — Add tests:
66
- - Parse PAUSE, ABORT, SKIP commands
67
- - Mixed commands + guidance text
68
- - Case insensitivity
69
- - Integration: runner respects PAUSE/ABORT/SKIP
70
-
71
- **Run:** `bun test`
72
- **Commit:** `feat(queue): add PAUSE/ABORT/SKIP commands`
73
-
74
- ## Phase 3: 3-Tier Model Routing (~80 LOC)
75
-
76
- The config already has 3 tiers (fast/balanced/powerful). The router maps complexity→tier correctly. What's missing: the escalation path should support all 3 tiers explicitly, and the `escalateModelTier()` function in runner.ts needs to handle the full chain.
77
-
78
- ### Changes:
79
- **`src/execution/runner.ts`** — Fix `escalateModelTier()`:
80
- - Current: simple tier bump that may not cover all tiers
81
- - New: explicit chain `fast → balanced → powerful → null` (null = max reached)
82
- - Log the escalation with tier names
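The explicit chain could be expressed as a lookup table, as in the sketch below. The tier names come from the plan, but the function signature and the log wording are assumptions. The existing `escalateModelTier()` in `runner.ts` may take different arguments.

```typescript
type ModelTier = "fast" | "balanced" | "powerful";

// Each tier maps to the next one up; null means the top tier has been reached.
const ESCALATION_CHAIN: Record<ModelTier, ModelTier | null> = {
  fast: "balanced",
  balanced: "powerful",
  powerful: null,
};

function escalateModelTier(current: ModelTier): ModelTier | null {
  const next = ESCALATION_CHAIN[current];
  if (next === null) {
    console.log(`Escalation: ${current} is the highest tier, nothing left to try`);
  } else {
    console.log(`Escalation: ${current} -> ${next}`);
  }
  return next;
}
```

Because the table is typed as `Record<ModelTier, ...>`, adding a fourth tier to `ModelTier` without extending the chain becomes a compile-time error rather than a silent gap in escalation.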
83
-
84
- **`src/config/schema.ts`** — Add validation:
85
- - Ensure `complexityRouting` values are valid `ModelTier` keys
86
- - Ensure escalation `maxAttempts` >= 1
87
-
88
- **`test/routing.test.ts`** — Add tests:
89
- - Verify all 4 complexities route to correct tiers
90
- - Verify escalation chain: fast→balanced→powerful→null
91
- - Verify config validation catches invalid tiers
92
-
93
- **Run:** `bun test`
94
- **Commit:** `feat(routing): explicit 3-tier escalation chain`
95
-
96
- ## Phase 4: Story Batching (~100 LOC)
97
-
98
- Group consecutive simple-complexity stories into a single agent session to reduce startup overhead.
99
-
100
- ### Changes:
101
- **`src/execution/runner.ts`** — Add batching logic:
102
- - Before the main loop, scan pending stories
103
- - Group consecutive simple stories into batches (max 4 per batch)
104
- - For batched stories: build a combined prompt with all stories
105
- - Execute batch in a single session, parse results per-story
106
- - Non-simple stories execute individually (existing behavior)
107
- - Add `--no-batch` CLI flag to disable
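The grouping step described above might be implemented along these lines. This is a sketch under assumptions: the `UserStory` shape is reduced to the two fields the grouping needs, the complexity names other than `simple` are guesses, and `groupIntoBatches` is a hypothetical helper name.

```typescript
// Only "simple" is named in the plan; the other complexity labels are assumed.
type Complexity = "simple" | "medium" | "complex" | "expert";

interface UserStory {
  id: string;
  complexity: Complexity;
}

// Groups consecutive simple stories (up to maxBatchSize) and leaves every
// non-simple story in a batch of its own, preserving the original order.
function groupIntoBatches(stories: UserStory[], maxBatchSize = 4): UserStory[][] {
  const batches: UserStory[][] = [];
  let current: UserStory[] = [];

  const flush = (): void => {
    if (current.length > 0) {
      batches.push(current);
      current = [];
    }
  };

  for (const story of stories) {
    if (story.complexity !== "simple") {
      flush(); // a non-simple story ends the running batch
      batches.push([story]);
      continue;
    }
    current.push(story);
    if (current.length === maxBatchSize) flush();
  }

  flush();
  return batches;
}
```

With `--no-batch`, the runner could call the same helper with `maxBatchSize` set to 1, which degrades to one story per session without a separate code path.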
108
-
109
- **`bin/ngent.ts`** — Add `--no-batch` flag
110
-
111
- **`src/execution/runner.ts`** — Add `buildBatchPrompt(stories: UserStory[]): string`:
112
- - Combine multiple stories into one prompt
113
- - Number each story clearly: "## Story 1: ...", "## Story 2: ..."
114
- - Instruct agent to commit each story separately
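A combined prompt following the three bullets above could be assembled as shown here. The `UserStory` fields and the exact instruction wording are assumptions; only the numbered `## Story N:` headings and the commit-per-story instruction come from the plan.

```typescript
// Hypothetical story shape; the real UserStory type may carry more fields.
interface UserStory {
  id: string;
  title: string;
  description: string;
  acceptanceCriteria: string[];
}

function buildBatchPrompt(stories: UserStory[]): string {
  const instructions =
    "Implement the stories below in order. Commit each story separately, " +
    "and include the story ID in each commit message.";

  // One clearly numbered section per story.
  const sections = stories.map((story, index) => {
    const criteria = story.acceptanceCriteria.map((ac) => `- ${ac}`).join("\n");
    return [
      `## Story ${index + 1}: ${story.title} (${story.id})`,
      story.description,
      `Acceptance criteria:\n${criteria}`,
    ].join("\n\n");
  });

  return [instructions, ...sections].join("\n\n");
}
```

Asking for the story ID in each commit message is an added assumption here. It would give the runner a simple way to map commits back to stories when parsing per-story results.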
115
-
116
- **`test/runner.test.ts`** — Add tests:
117
- - Batch grouping logic (consecutive simple stories)
118
- - Mixed complexity stops batching
119
- - Max batch size enforcement
120
- - Batch prompt generation
121
-
122
- **Run:** `bun test`
123
- **Commit:** `feat(execution): story batching for simple stories`
124
-
125
- ## Test Strategy
126
- - Mode: hybrid
127
- - TDD targets: Phase 2 (queue commands — new public API, write tests first)
128
- - Test-after targets: Phase 1, 3, 4 (extending existing tested modules)
129
-
130
- ## Summary
131
- | Phase | Feature | Est. LOC | Strategy |
- |:---|:---|:---|:---|
- | 1 | Story-scoped context | ~200 | test-after |
- | 2 | Queue commands | ~150 | test-first |
- | 3 | 3-tier escalation | ~80 | test-after |
- | 4 | Story batching | ~100 | test-after |