npm - @nathapp/nax - Versions diffs - 0.28.0 → 0.29.0 - Mend

@nathapp/nax 0.28.0 → 0.29.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (376) hide show

package/CHANGELOG.md +13 -2
package/dist/nax.js +72691 -0
package/package.json +12 -4
package/src/cli/config.ts +3 -1
package/src/config/defaults.ts +1 -0
package/src/config/schemas.ts +1 -0
package/src/config/types.ts +1 -0
package/src/context/builder.ts +10 -1
package/src/prompts/sections/role-task.ts +4 -2
package/src/review/runner.ts +6 -1
package/src/version.ts +2 -1
package/.claude/rules/01-project-conventions.md +0 -34
package/.claude/rules/02-test-architecture.md +0 -39
package/.claude/rules/03-test-writing.md +0 -58
package/.claude/rules/04-forbidden-patterns.md +0 -29
package/.claude/settings.json +0 -15
package/.githooks/pre-commit +0 -16
package/.gitlab-ci.yml +0 -103
package/.mcp.json +0 -8
package/BRIEF.md +0 -140
package/CLAUDE.md +0 -143
package/US-007-IMPLEMENTATION.md +0 -139
package/biome.json +0 -14
package/bun.lock +0 -163
package/bunfig.toml +0 -12
package/docker-compose.test.yml +0 -15
package/docs/20260216-fix-plan-context-review.md +0 -56
package/docs/20260216-relentless-vs-ngent-comparison.md +0 -208
package/docs/20260216-v02-plan.md +0 -136
package/docs/20260216-v02-review.md +0 -685
package/docs/20260217-dogfood-findings.md +0 -56
package/docs/20260217-p2-plus-plan.md +0 -117
package/docs/20260217-partial-fixes-plan.md +0 -62
package/docs/20260217-plan-analyze-spec.md +0 -117
package/docs/20260217-post-impl-review.md +0 -1137
package/docs/20260217-quick-wins-plan.md +0 -66
package/docs/20260217-split-runner-plan.md +0 -75
package/docs/20260217-v03-impl-plan.md +0 -80
package/docs/20260217-v03-post-impl-review.md +0 -589
package/docs/20260217-v04-impl-plan.md +0 -86
package/docs/20260217-v05-post-impl-review.md +0 -850
package/docs/20260217-v06-post-impl-review.md +0 -817
package/docs/20260218-adr003-port-plan.md +0 -151
package/docs/20260218-review-adr003-verification.md +0 -175
package/docs/20260219-fix-plan-bug16-19.md +0 -79
package/docs/20260219-fix-plan-bug20-22.md +0 -114
package/docs/20260219-plan-llm-routing.md +0 -116
package/docs/20260219-review-bug20-22-fixes.md +0 -135
package/docs/20260219-routing-baseline-keyword.md +0 -63
package/docs/20260220-plan-structured-logging-p1.md +0 -80
package/docs/20260220-plan-structured-logging-p2.md +0 -37
package/docs/20260220-review-llm-routing.md +0 -180
package/docs/20260220-review-post-fix-llm-routing.md +0 -70
package/docs/20260221-fix-plan-relevantfiles-split.md +0 -101
package/docs/20260221-fix-plan-routing-mode.md +0 -125
package/docs/20260221-review-v0.9-implementation.md +0 -379
package/docs/20260222-fix-plan-v091-routing-isolation.md +0 -197
package/docs/20260223-fix-plan-prompt-audit.md +0 -62
package/docs/20260224-nax-roadmap-phases.md +0 -189
package/docs/20260225-phase2-llm-service-layer.md +0 -401
package/docs/20260225-review-v0.10.1.md +0 -187
package/docs/20260303-v010-implementation-plan.md +0 -165
package/docs/20260304-review-nax.md +0 -492
package/docs/CLAUDE.md.bak +0 -191
package/docs/ROADMAP.md +0 -390
package/docs/SPEC-rectification.md +0 -0
package/docs/SPEC.md +0 -324
package/docs/US-001-plugin-loading-verification.md +0 -152
package/docs/adr/ADR-005-implementation-plan.md +0 -655
package/docs/adr/ADR-005-pipeline-re-architecture.md +0 -464
package/docs/architecture-analysis.md +0 -1076
package/docs/bugs/BUG-21-escalation-null-attempts.md +0 -48
package/docs/bugs-from-dogfood-run-c.md +0 -243
package/docs/code-review-20260228.md +0 -612
package/docs/code-review-v0.15.0.md +0 -629
package/docs/hook-lifecycle-test-plan.md +0 -149
package/docs/releases/v0.11.0-and-earlier.md +0 -20
package/docs/releases/v0.12.0.md +0 -15
package/docs/releases/v0.13.0.md +0 -14
package/docs/releases/v0.14.0.md +0 -20
package/docs/releases/v0.14.1.md +0 -36
package/docs/releases/v0.14.2.md +0 -51
package/docs/releases/v0.14.3.md +0 -174
package/docs/releases/v0.14.4.md +0 -94
package/docs/releases/v0.15.0.md +0 -502
package/docs/releases/v0.15.1.md +0 -170
package/docs/releases/v0.15.3.md +0 -193
package/docs/specs/bug-039-orphan-processes.md +0 -131
package/docs/specs/bug-040-review-rectification.md +0 -82
package/docs/specs/bug-041-cross-story-test-isolation.md +0 -88
package/docs/specs/bug-042-verifier-failure-capture.md +0 -117
package/docs/specs/bun-pty-migration.md +0 -171
package/docs/specs/central-run-registry.md +0 -116
package/docs/specs/feat-010-smart-runner-git-history.md +0 -96
package/docs/specs/feat-011-file-context-strategy.md +0 -73
package/docs/specs/feat-012-tdd-writer-tier.md +0 -79
package/docs/specs/feat-013-test-after-review.md +0 -89
package/docs/specs/feat-014-heartbeat-observability.md +0 -127
package/docs/specs/status-file-consolidation.md +0 -93
package/docs/specs/status-file-v0.10.1.md +0 -812
package/docs/specs/trigger-completion.md +0 -145
package/docs/specs/verification-architecture-v2.md +0 -343
package/docs/tdd/strategies.md +0 -97
package/docs/v0.10-global-config.md +0 -206
package/docs/v0.10-plugin-system.md +0 -415
package/docs/v0.10-prompt-optimizer.md +0 -234
package/docs/v0.3-spec.md +0 -244
package/docs/v0.4-spec.md +0 -140
package/docs/v0.5-spec.md +0 -237
package/docs/v0.6-spec.md +0 -371
package/docs/v0.7-spec.md +0 -177
package/docs/v0.8-llm-routing.md +0 -206
package/docs/v0.8-structured-logging.md +0 -132
package/docs/v0.9.3-prompt-audit.md +0 -112
package/examples/plugins/console-reporter/index.test.ts +0 -207
package/examples/plugins/console-reporter/index.ts +0 -110
package/memory/topic/feat-010-baseref.md +0 -28
package/memory/topic/feat-013-test-after-deprecation.md +0 -22
package/nax/config.json +0 -154
package/nax/features/bug-039-medium/prd.json +0 -45
package/nax/features/bugfix-v0171/prd.json +0 -52
package/nax/features/central-run-registry/prd.json +0 -105
package/nax/features/config-management/prd.json +0 -108
package/nax/features/config-management/progress.txt +0 -5
package/nax/features/diagnose/acceptance.test.ts +0 -414
package/nax/features/diagnose/prd.json +0 -41
package/nax/features/nax-compliance/prd.json +0 -52
package/nax/features/nax-compliance/progress.txt +0 -1
package/nax/features/orchestration-fixes/prd.json +0 -89
package/nax/features/orchestration-fixes/progress.txt +0 -1
package/nax/features/plugin-integration/US-007-VERIFICATION.md +0 -259
package/nax/features/plugin-integration/prd.json +0 -208
package/nax/features/plugin-integration/progress.txt +0 -5
package/nax/features/post-rearch-bugfix/prd.json +0 -137
package/nax/features/precheck/prd.json +0 -205
package/nax/features/precheck/progress.txt +0 -15
package/nax/features/prompt-builder/prd.json +0 -152
package/nax/features/prompt-builder/progress.txt +0 -3
package/nax/features/review-quality/prd.json +0 -55
package/nax/features/routing-persistence/prd.json +0 -104
package/nax/features/routing-persistence/progress.txt +0 -1
package/nax/features/smart-test-runner/plan.md +0 -7
package/nax/features/smart-test-runner/prd.json +0 -203
package/nax/features/smart-test-runner/progress.txt +0 -13
package/nax/features/smart-test-runner/spec.md +0 -7
package/nax/features/smart-test-runner/tasks.md +0 -8
package/nax/features/status-file-consolidation/prd.json +0 -106
package/nax/features/structured-logging/prd.json +0 -199
package/nax/features/trigger-completion/prd.json +0 -150
package/nax/features/trigger-completion/progress.txt +0 -7
package/nax/features/unlock/prd.json +0 -36
package/nax/features/v0.18.3-execution-reliability/prd.json +0 -80
package/nax/features/v0.18.3-execution-reliability/progress.txt +0 -3
package/nax/features/v0.19.0-hardening/plan.md +0 -7
package/nax/features/v0.19.0-hardening/prd.json +0 -84
package/nax/features/v0.19.0-hardening/progress.txt +0 -7
package/nax/features/v0.19.0-hardening/spec.md +0 -18
package/nax/features/v0.19.0-hardening/tasks.md +0 -8
package/nax/features/verify-v2/prd.json +0 -79
package/nax/features/verify-v2/progress.txt +0 -3
package/nax/status.json +0 -36
package/test/COVERAGE-GAPS.md +0 -333
package/test/e2e/cm-003-default-view.test.ts +0 -195
package/test/e2e/plan-analyze-run.test.ts +0 -902
package/test/helpers/helpers.test.ts +0 -295
package/test/helpers/timeout.ts +0 -42
package/test/integration/US-002-TEST-SUMMARY.md +0 -107
package/test/integration/US-003-TEST-SUMMARY.md +0 -149
package/test/integration/US-004-TEST-SUMMARY.md +0 -106
package/test/integration/US-005-TEST-SUMMARY.md +0 -138
package/test/integration/US-007-TEST-SUMMARY.md +0 -100
package/test/integration/cli/agent-validation.test.ts +0 -439
package/test/integration/cli/cli-config-default-edge-cases.test.ts +0 -223
package/test/integration/cli/cli-config-default-view.test.ts +0 -230
package/test/integration/cli/cli-config-diff.test.ts +0 -461
package/test/integration/cli/cli-config-prompts-explain.test.ts +0 -74
package/test/integration/cli/cli-config.test.ts +0 -737
package/test/integration/cli/cli-diagnose.test.ts +0 -595
package/test/integration/cli/cli-logs.test.ts +0 -346
package/test/integration/cli/cli-plugins.test.ts +0 -679
package/test/integration/cli/cli-precheck.test.ts +0 -372
package/test/integration/cli/cli-run-headless.test.ts +0 -174
package/test/integration/cli/cli.test.ts +0 -76
package/test/integration/cli/precheck-integration.test.ts +0 -476
package/test/integration/cli/precheck-orchestrator.test.ts +0 -247
package/test/integration/cli/precheck.test.ts +0 -806
package/test/integration/config/config-loader.test.ts +0 -266
package/test/integration/config/config.test.ts +0 -444
package/test/integration/config/merger.test.ts +0 -466
package/test/integration/config/paths.test.ts +0 -52
package/test/integration/config/security-loader.test.ts +0 -83
package/test/integration/context/context-integration.test.ts +0 -703
package/test/integration/context/context-path-security.test.ts +0 -173
package/test/integration/context/context-provider-injection.test.ts +0 -507
package/test/integration/context/context-verification-integration.test.ts +0 -296
package/test/integration/context/s5-greenfield-fallback.test.ts +0 -298
package/test/integration/execution/execution-isolation.test.ts +0 -143
package/test/integration/execution/execution.test.ts +0 -634
package/test/integration/execution/feature-status-write.test.ts +0 -302
package/test/integration/execution/parallel.test.ts +0 -251
package/test/integration/execution/prd-pause.test.ts +0 -205
package/test/integration/execution/prd-resolvers.test.ts +0 -186
package/test/integration/execution/progress.test.ts +0 -34
package/test/integration/execution/runner-batching.test.ts +0 -682
package/test/integration/execution/runner-config-plugins.test.ts +0 -462
package/test/integration/execution/runner-escalation.test.ts +0 -561
package/test/integration/execution/runner-fixes.test.ts +0 -400
package/test/integration/execution/runner-plugin-integration.test.ts +0 -544
package/test/integration/execution/runner-queue-and-attempts.test.ts +0 -476
package/test/integration/execution/status-file-integration.test.ts +0 -289
package/test/integration/execution/status-file.test.ts +0 -380
package/test/integration/execution/status-writer.test.ts +0 -447
package/test/integration/execution/story-id-in-events.test.ts +0 -274
package/test/integration/interaction/interaction-chain-pipeline.test.ts +0 -476
package/test/integration/pipeline/hooks.test.ts +0 -363
package/test/integration/pipeline/pipeline-acceptance.test.ts +0 -303
package/test/integration/pipeline/pipeline-events.test.ts +0 -476
package/test/integration/pipeline/pipeline.test.ts +0 -660
package/test/integration/pipeline/reporter-lifecycle.test.ts +0 -862
package/test/integration/pipeline/verify-stage.test.ts +0 -286
package/test/integration/plan/analyze-integration.test.ts +0 -262
package/test/integration/plan/analyze-scanner.test.ts +0 -132
package/test/integration/plan/logger.test.ts +0 -461
package/test/integration/plan/plan.test.ts +0 -157
package/test/integration/plugins/config-integration.test.ts +0 -173
package/test/integration/plugins/config-resolution.test.ts +0 -523
package/test/integration/plugins/loader.test.ts +0 -644
package/test/integration/plugins/plugins-registry.test.ts +0 -747
package/test/integration/plugins/validator.test.ts +0 -564
package/test/integration/prompts/pb-004-migration.test.ts +0 -523
package/test/integration/review/review-config-commands.test.ts +0 -320
package/test/integration/review/review-config-schema.test.ts +0 -117
package/test/integration/review/review-plugin-integration.test.ts +0 -729
package/test/integration/review/review.test.ts +0 -150
package/test/integration/routing/plugin-routing-advanced.test.ts +0 -461
package/test/integration/routing/plugin-routing-core.test.ts +0 -527
package/test/integration/routing/routing-stage-bug-021.test.ts +0 -275
package/test/integration/routing/routing-stage-greenfield.test.ts +0 -287
package/test/integration/tdd/tdd-cleanup.test.ts +0 -246
package/test/integration/tdd/tdd-orchestrator-core.test.ts +0 -565
package/test/integration/tdd/tdd-orchestrator-failureCategory.test.ts +0 -355
package/test/integration/tdd/tdd-orchestrator-fallback.test.ts +0 -311
package/test/integration/tdd/tdd-orchestrator-lite.test.ts +0 -289
package/test/integration/tdd/tdd-orchestrator-prompts.test.ts +0 -260
package/test/integration/tdd/tdd-orchestrator-verdict.test.ts +0 -536
package/test/integration/tmp/headless-test/test.jsonl +0 -30
package/test/integration/verification/test-scanner.test.ts +0 -403
package/test/integration/verification/verification-asset-check.test.ts +0 -143
package/test/integration/worktree/manager.test.ts +0 -218
package/test/integration/worktree/worktree-merge.test.ts +0 -341
package/test/manual/logging-formatter-demo.ts +0 -158
package/test/ui/tui-agent-panel.test.tsx +0 -99
package/test/ui/tui-pty-integration.test.tsx +0 -146
package/test/unit/acceptance.test.ts +0 -187
package/test/unit/agent-stderr-capture.test.ts +0 -147
package/test/unit/agents/claude.test.ts +0 -107
package/test/unit/analyze-classifier.test.ts +0 -216
package/test/unit/analyze.test.ts +0 -224
package/test/unit/auto-detect.test.ts +0 -250
package/test/unit/cli-status-project-level.test.ts +0 -283
package/test/unit/cli-status.test.ts +0 -418
package/test/unit/commands/common.test.ts +0 -321
package/test/unit/commands/logs.test.ts +0 -458
package/test/unit/commands/runs.test.ts +0 -303
package/test/unit/commands/unlock.test.ts +0 -320
package/test/unit/config/defaults.test.ts +0 -70
package/test/unit/config/quality-commands-schema.test.ts +0 -72
package/test/unit/config/regression-gate-schema.test.ts +0 -160
package/test/unit/config/smart-runner-flag.test.ts +0 -250
package/test/unit/constitution-generators.test.ts +0 -161
package/test/unit/constitution.test.ts +0 -210
package/test/unit/context/context-autodetect.test.ts +0 -297
package/test/unit/context/context-build.test.ts +0 -575
package/test/unit/context/context-coverage.test.ts +0 -236
package/test/unit/context/context-error.test.ts +0 -93
package/test/unit/context/context-estimate-tokens.test.ts +0 -201
package/test/unit/context/context-format.test.ts +0 -302
package/test/unit/context/context-isolation.test.ts +0 -267
package/test/unit/context/context-sort.test.ts +0 -93
package/test/unit/context/context-story.test.ts +0 -108
package/test/unit/context/prior-failures.test.ts +0 -463
package/test/unit/context.test.ts +0 -1726
package/test/unit/cost.test.ts +0 -231
package/test/unit/crash-recovery.test.ts +0 -309
package/test/unit/escalation.test.ts +0 -127
package/test/unit/execution/lifecycle/run-completion.test.ts +0 -240
package/test/unit/execution/lifecycle/run-regression.test.ts +0 -420
package/test/unit/execution/pid-registry.test.ts +0 -241
package/test/unit/execution/sequential-executor.test.ts +0 -235
package/test/unit/execution/sfc-004-dead-code-cleanup.test.ts +0 -89
package/test/unit/execution/structured-failure.test.ts +0 -415
package/test/unit/execution-logging-stderr.test.ts +0 -157
package/test/unit/execution-stage.test.ts +0 -123
package/test/unit/fix-generator.test.ts +0 -276
package/test/unit/formatters.test.ts +0 -468
package/test/unit/greenfield.test.ts +0 -180
package/test/unit/hooks/shell-security.test.ts +0 -40
package/test/unit/interaction/auto-plugin.test.ts +0 -162
package/test/unit/interaction/human-review-trigger.test.ts +0 -165
package/test/unit/interaction-network-failures.test.ts +0 -390
package/test/unit/interaction-plugins.test.ts +0 -472
package/test/unit/logging/formatter.test.ts +0 -456
package/test/unit/merge.test.ts +0 -269
package/test/unit/metrics/aggregator.test.ts +0 -164
package/test/unit/metrics/tracker.test.ts +0 -186
package/test/unit/metrics.test.ts +0 -276
package/test/unit/optimizer/noop.optimizer.test.ts +0 -125
package/test/unit/optimizer/rule-based.optimizer.test.ts +0 -358
package/test/unit/pipeline/event-bus.test.ts +0 -105
package/test/unit/pipeline/routing-partial-override.test.ts +0 -121
package/test/unit/pipeline/runner-retry.test.ts +0 -89
package/test/unit/pipeline/stages/autofix.test.ts +0 -97
package/test/unit/pipeline/stages/completion-review-gate.test.ts +0 -218
package/test/unit/pipeline/stages/execution-ambiguity.test.ts +0 -311
package/test/unit/pipeline/stages/execution-merge-conflict.test.ts +0 -218
package/test/unit/pipeline/stages/rectify.test.ts +0 -101
package/test/unit/pipeline/stages/regression-stage.test.ts +0 -69
package/test/unit/pipeline/stages/review.test.ts +0 -201
package/test/unit/pipeline/stages/routing-idempotence.test.ts +0 -139
package/test/unit/pipeline/stages/routing-initial-complexity.test.ts +0 -321
package/test/unit/pipeline/stages/routing-persistence.test.ts +0 -380
package/test/unit/pipeline/stages/verify.test.ts +0 -267
package/test/unit/pipeline/subscribers/events-writer.test.ts +0 -227
package/test/unit/pipeline/subscribers/hooks.test.ts +0 -84
package/test/unit/pipeline/subscribers/interaction.test.ts +0 -313
package/test/unit/pipeline/subscribers/registry.test.ts +0 -149
package/test/unit/pipeline/subscribers/reporters.test.ts +0 -90
package/test/unit/pipeline/verify-smart-runner.test.ts +0 -345
package/test/unit/prd-auto-default.test.ts +0 -291
package/test/unit/prd-failure-category.test.ts +0 -177
package/test/unit/prd-get-next-story.test.ts +0 -215
package/test/unit/precheck/checks-warnings.test.ts +0 -114
package/test/unit/precheck-checks.test.ts +0 -841
package/test/unit/precheck-story-size-gate.test.ts +0 -288
package/test/unit/precheck-types.test.ts +0 -143
package/test/unit/prompts/builder.test.ts +0 -258
package/test/unit/prompts/loader.test.ts +0 -355
package/test/unit/prompts/sections/conventions.test.ts +0 -30
package/test/unit/prompts/sections/isolation.test.ts +0 -35
package/test/unit/prompts/sections/role-task.test.ts +0 -40
package/test/unit/prompts/sections/sections.test.ts +0 -238
package/test/unit/prompts/sections/story.test.ts +0 -45
package/test/unit/prompts/sections/verdict.test.ts +0 -58
package/test/unit/prompts.test.ts +0 -476
package/test/unit/queue.test.ts +0 -237
package/test/unit/rectification.test.ts +0 -285
package/test/unit/registry.test.ts +0 -288
package/test/unit/review/runner.test.ts +0 -117
package/test/unit/routing/content-hash.test.ts +0 -99
package/test/unit/routing/routing-stability.test.ts +0 -208
package/test/unit/routing/strategies/llm.test.ts +0 -306
package/test/unit/routing-advanced.test.ts +0 -313
package/test/unit/routing-core.test.ts +0 -341
package/test/unit/routing-strategies.test.ts +0 -440
package/test/unit/storyid-events.test.ts +0 -213
package/test/unit/tdd-verdict.test.ts +0 -492
package/test/unit/test-output-parser.test.ts +0 -377
package/test/unit/ui/tui-controls.test.ts +0 -335
package/test/unit/ui/tui-cost-and-pty.test.ts +0 -190
package/test/unit/ui/tui-layout.test.ts +0 -379
package/test/unit/ui/tui-stories.test.ts +0 -333
package/test/unit/unit-isolation.test.ts +0 -135
package/test/unit/utils/git.test.ts +0 -50
package/test/unit/utils/path-security.test.ts +0 -47
package/test/unit/utils-helpers.test.ts +0 -318
package/test/unit/verdict.test.ts +0 -325
package/test/unit/verification/orchestrator-types.test.ts +0 -54
package/test/unit/verification/orchestrator.test.ts +0 -66
package/test/unit/verification/smart-runner-config.test.ts +0 -163
package/test/unit/verification/smart-runner-discovery.test.ts +0 -354
package/test/unit/verification/smart-runner.test.ts +0 -262
package/test/unit/verification/strategies/acceptance.test.ts +0 -33
package/test/unit/verification/strategies/regression.test.ts +0 -87
package/test/unit/verification/strategies/scoped.test.ts +0 -100
package/test/unit/worktree-manager.test.ts +0 -159
package/tsconfig.json +0 -27

package/docs/20260217-v03-post-impl-review.md DELETED Viewed

@@ -1,589 +0,0 @@
-# Deep Code Review: ngent v0.3.0
-**Date:** 2026-02-17
-**Reviewer:** Subrina (AI)
-**Version:** 0.3.0-dev
-**Files:** 65 TypeScript files (src: ~7,172 LOC, test: ~7,757 LOC)
-**Baseline:** 342 tests passing, 881 assertions, TypeScript strict mode
----
-## Overall Grade: A- (88/100)
-The v0.3 pipeline refactor represents a significant architectural improvement, successfully decomposing the monolithic runner into composable stages while maintaining backward compatibility. The new constitution, analyze, and review modules are well-designed with strong type safety and comprehensive test coverage. However, several medium-priority issues around JSDoc coverage, error handling consistency, and incomplete verify stage logic prevent this from achieving an A grade.
-**Key Strengths:**
-- Clean pipeline architecture with proper separation of concerns
-- Excellent test coverage for new modules (constitution: 100%, review: 100%, pipeline: 90%+)
-- Strong type safety with discriminated unions for pipeline results
-- Proper integration between new and existing systems
-**Areas for Improvement:**
-- Incomplete verify stage (placeholder with TODO)
-- JSDoc coverage gaps in pipeline stages (~40%)
-- Inconsistent error handling patterns between stages
-- Missing integration tests for full pipeline execution with all stages
----
-## Findings
-### 🔴 CRITICAL
-None. The codebase is production-ready from a security and reliability standpoint.
----
-### 🟡 HIGH
-#### BUG-7: Verify Stage is a No-Op Placeholder
-**Severity:** HIGH | **Category:** Bug
-**File:** `src/pipeline/stages/verify.ts:18-25`
-```typescript
-export const verifyStage: PipelineStage = {
-  name: "verify",
-  enabled: () => true,
-  async execute(_ctx: PipelineContext): Promise<StageResult> {
-    // TODO: Add verification logic here
-    // - Run tests
-    // - Check build
-    // - Validate output
-    return { action: "continue" };
-  },
-};
-```
-**Risk:** The verify stage is currently a no-op that always passes. This means agent output is never validated before being marked as passed. Stories could be marked complete even if tests fail or builds break.
-**Fix:** Implement verification logic:
-1. Run `bun test` in the workdir
-2. Check exit code
-3. Return `{ action: "fail", reason: "Tests failed" }` if exit code !== 0
-4. Consider adding build verification for TypeScript projects
-**Priority:** P0 — This is a critical gap in the execution pipeline.
----
-#### ENH-6: Pipeline Stages Have Inconsistent Error Handling
-**Severity:** HIGH | **Category:** Enhancement
-**File:** Multiple pipeline stages
-```typescript
-// Constitution stage: returns continue even if loading fails silently
-if (result) {
-  ctx.constitution = result.content;
-  // ...logs...
-}
-// No else — just continues without constitution
-// Execution stage: returns fail with clear reason
-if (!ctx.prompt) {
-  return { action: "fail", reason: "Prompt not built (prompt stage skipped?)" };
-}
-```
-**Risk:** Inconsistent error handling makes it hard to debug pipeline failures. Some stages silently continue on errors, others fail explicitly. This can lead to confusing behavior where a story fails for unclear reasons.
-**Fix:** Establish consistent patterns:
-1. **Soft failures** (constitution missing, context empty) → continue with warning log
-2. **Hard failures** (no agent, invalid config) → return `{ action: "fail", reason: "..." }`
-3. Document these patterns in a `PIPELINE.md` guide
-**Priority:** P1 — Affects debugging experience and maintainability.
----
-### 🟡 MEDIUM
-#### ENH-7: Missing JSDoc on Pipeline Stages (~40% coverage)
-**Severity:** MEDIUM | **Category:** Enhancement
-**File:** `src/pipeline/stages/*.ts`
-```typescript
-// ✗ No JSDoc
-export const queueCheckStage: PipelineStage = {
-  name: "queue-check",
-  enabled: () => true,
-  async execute(ctx: PipelineContext): Promise<StageResult> {
-    // ...
-  },
-};
-// ✓ Should have JSDoc
-/**
- * Queue Check Stage
- *
- * Checks for queue commands (PAUSE/ABORT/SKIP) before executing a story.
- * Processes commands atomically and updates PRD accordingly.
- *
- * @returns
- * - `continue`: No queue commands, proceed
- * - `pause`: PAUSE/ABORT command found, stop execution
- * - `skip`: SKIP command removed all stories from batch
- *
- * @example
- * ```ts
- * // User writes: echo "PAUSE" > .queue.txt
- * const result = await queueCheckStage.execute(ctx);
- * // result: { action: "pause", reason: "User requested pause via .queue.txt" }
- * ```
- */
-```
-**Impact:** New contributors need to read implementation code to understand stage behavior. Missing examples make it hard to understand stage interactions.
-**Fix:** Add JSDoc to all 9 pipeline stages with:
-- Brief description (1-2 sentences)
-- Return value documentation (all possible actions)
-- Example showing stage behavior in context
-**Priority:** P2 — Documentation gap, but code is readable.
----
-#### TYPE-3: Constitution Stage Uses Loose Type Conversion
-**Severity:** MEDIUM | **Category:** Type Safety
-**File:** `src/pipeline/stages/prompt.ts:22-30`
-```typescript
-// Convert constitution string to ConstitutionResult if present
-const constitution: ConstitutionResult | undefined = ctx.constitution
-  ? {
-      content: ctx.constitution,
-      tokens: Math.ceil(ctx.constitution.length / 4), // ⚠️ Duplicates estimation logic
-      originalTokens: Math.ceil(ctx.constitution.length / 4),
-      truncated: false,
-    }
-  : undefined;
-```
-**Risk:**
-1. Duplicates token estimation logic (should use `estimateTokens()` from constitution module)
-2. Uses 1 token ≈ 4 chars, but constitution loader uses 1 token ≈ 3 chars (inconsistent)
-3. If context stores `ConstitutionResult` instead of `string`, this conversion is unnecessary
-**Fix:**
-1. Store `ConstitutionResult | undefined` in `PipelineContext.constitution` instead of `string | undefined`
-2. Update constitution stage to assign the full result object
-3. Remove conversion logic from prompt stage
-**Priority:** P2 — Type inconsistency, but functionally correct.
----
-#### BUG-8: Pipeline Runner Doesn't Preserve Context Mutations Across Stages
-**Severity:** MEDIUM | **Category:** Bug
-**File:** `src/pipeline/runner.ts:48-127`
-```typescript
-export async function runPipeline(
-  stages: PipelineStage[],
-  context: PipelineContext,
-): Promise<PipelineRunResult> {
-  for (const stage of stages) {
-    // ...
-    result = await stage.execute(context); // ⚠️ Stages mutate context in-place
-  }
-  // ...
-  return {
-    success: true,
-    finalAction: "complete",
-    context, // ⚠️ Returns mutated context, but contract is unclear
-  };
-}
-```
-**Risk:** Stages mutate the context object in-place. The function signature doesn't make it clear whether the input `context` is mutated or a new context is returned. This could cause subtle bugs if callers expect immutability.
-**Fix:**
-1. Document mutation contract in JSDoc: "Stages mutate the context in-place. The returned context is the same object, mutated."
-2. Consider cloning context before pipeline execution for safer API (if mutation is unintended)
-3. Add integration test verifying context mutations are preserved
-**Priority:** P2 — Potential footgun, but current usage is correct.
----
-#### PERF-4: Prompt Stage Recreates ConstitutionResult on Every Execution
-**Severity:** MEDIUM | **Category:** Performance
-**File:** `src/pipeline/stages/prompt.ts:22-30`
-```typescript
-async execute(ctx: PipelineContext): Promise<StageResult> {
-  // ⚠️ Re-creates ConstitutionResult every time even though content is static
-  const constitution: ConstitutionResult | undefined = ctx.constitution
-    ? {
-        content: ctx.constitution,
-        tokens: Math.ceil(ctx.constitution.length / 4),
-        originalTokens: Math.ceil(ctx.constitution.length / 4),
-        truncated: false,
-      }
-    : undefined;
-  // ...
-}
-```
-**Impact:** Constitution is loaded once per feature, but prompt stage recreates the result object on every story. For a 100-story feature, this wastes allocation cycles.
-**Fix:** Store `ConstitutionResult` in context (see TYPE-3) so prompt stage can use it directly without reconstruction.
-**Priority:** P3 — Micro-optimization, but aligns with TYPE-3 fix.
----
-#### ENH-8: No Integration Test for Full Pipeline with All Stages
-**Severity:** MEDIUM | **Category:** Enhancement
-**File:** `test/pipeline.test.ts`
-**Current coverage:**
-- ✓ Pipeline runner logic (continue/skip/fail/escalate/pause)
-- ✓ Individual stage unit tests (constitution, review)
-- ✗ Full pipeline execution with all 9 stages
-**Missing:** An integration test that:
-1. Sets up a real workdir with package.json, src/, test/
-2. Runs `runPipeline(defaultPipeline, realContext)`
-3. Verifies all stages execute in order
-4. Checks context accumulation (constitution → context → prompt → agentResult → reviewResult)
-**Fix:** Add `test/pipeline-integration.test.ts`:
-```typescript
-test("full pipeline execution with all stages", async () => {
-  const ctx = createRealTestContext(); // Real files, not mocks
-  const result = await runPipeline(defaultPipeline, ctx);
-  expect(result.success).toBe(true);
-  expect(result.context.constitution).toBeDefined();
-  expect(result.context.prompt).toBeDefined();
-  expect(result.context.agentResult).toBeDefined();
-  // etc.
-});
-```
-**Priority:** P2 — Increases confidence in pipeline integration.
----
-#### STYLE-4: Magic Number for Constitution Token Estimation Inconsistency
-**Severity:** MEDIUM | **Category:** Style
-**File:** `src/pipeline/stages/prompt.ts:26` vs `src/constitution/loader.ts:21`
-```typescript
-// constitution/loader.ts
-export function estimateTokens(text: string): number {
-  return Math.ceil(text.length / 3); // 1 token ≈ 3 chars
-}
-// pipeline/stages/prompt.ts
-tokens: Math.ceil(ctx.constitution.length / 4), // ⚠️ 1 token ≈ 4 chars
-```
-**Risk:** Inconsistent token estimation can lead to underestimation in prompt stage, potentially hitting model context limits unexpectedly.
-**Fix:** Always use `estimateTokens()` from constitution module. Extract as named constant if different heuristic is intentional:
-```typescript
-const CONSERVATIVE_TOKEN_ESTIMATE = 4; // chars per token (more conservative than 3)
-```
-**Priority:** P2 — Consistency issue with functional impact.
----
-### 🟢 LOW
-#### ENH-9: Plan Command Doesn't Validate Spec Template Output
-**Severity:** LOW | **Category:** Enhancement
-**File:** `src/cli/plan.ts:50-132`
-```typescript
-// In interactive mode, assume agent wrote the spec
-if (interactive) {
-  if (result.specContent) {
-    await Bun.write(outputPath, result.specContent);
-  } else {
-    // If agent wrote directly, verify it exists
-    if (!existsSync(outputPath)) { // ⚠️ No format validation
-      throw new Error(`Interactive planning completed but spec not found at ${outputPath}`);
-    }
-  }
-}
-```
-**Impact:** Plan mode checks if spec file exists but doesn't validate it follows the template format. Agent could write invalid markdown or skip required sections (Problem, Requirements, Acceptance Criteria).
-**Fix:** Add optional spec validation:
-1. Parse output markdown
-2. Check for required sections: `# Feature:`, `## Problem`, `## Requirements`, `## Acceptance Criteria`
-3. Warn if sections are missing (don't fail, since agent may use different structure)
-**Priority:** P3 — Nice-to-have validation, but agent output is typically well-structured.
----
-#### STYLE-5: Analyze Classifier Uses `any` for LLM Response Parsing
-**Severity:** LOW | **Category:** Type Safety
-**File:** `src/analyze/classifier.ts:105-127`
-```typescript
-// Extract text from response
-const textContent = response.content.find((c: any) => c.type === "text"); // ⚠️ any
-if (!textContent || textContent.type !== "text") {
-  throw new Error("No text response from LLM");
-}
-// Map to StoryClassification[]
-const classifications: StoryClassification[] = parsed.map((item: any) => ({ // ⚠️ any
-  storyId: item.storyId,
-  complexity: validateComplexity(item.complexity),
-  // ...
-}));
-```
-**Risk:** Using `any` bypasses type checking. If Anthropic SDK changes response structure, this code could fail at runtime without TypeScript catching it.
-**Fix:** Define proper types:
-```typescript
-interface AnthropicTextContent {
-  type: "text";
-  text: string;
-}
-interface LLMClassificationItem {
-  storyId: string;
-  complexity: string;
-  relevantFiles: unknown;
-  reasoning: unknown;
-  estimatedLOC: unknown;
-  risks: unknown;
-}
-```
-**Priority:** P3 — Low risk since Anthropic SDK is stable, but better type safety is always preferred.
----
-#### ENH-10: Pipeline Doesn't Log Which Stages Were Skipped
-**Severity:** LOW | **Category:** Enhancement
-**File:** `src/pipeline/runner.ts:52-56`
-```typescript
-for (const stage of stages) {
-  // Skip disabled stages
-  if (!stage.enabled(context)) {
-    continue; // ⚠️ Silent skip — user doesn't know why stage didn't run
-  }
-  // ...
-}
-```
-**Impact:** If a stage is disabled (e.g., `reviewStage` when `config.review.enabled = false`), the pipeline silently skips it. Users may be confused why review didn't run.
-**Fix:** Add debug logging for skipped stages:
-```typescript
-if (!stage.enabled(context)) {
-  console.log(chalk.dim(`   → Stage "${stage.name}" skipped (disabled)`));
-  continue;
-}
-```
-**Priority:** P3 — Improves observability but not critical.
----
-#### STYLE-6: Queue Check Stage Mutates Context Stories Array
-**Severity:** LOW | **Category:** Style
-**File:** `src/pipeline/stages/queue-check.ts:68`
-```typescript
-// Remove from batch
-ctx.stories = ctx.stories.filter((s) => s.id !== cmd.storyId); // ⚠️ Mutation
-```
-**Risk:** Mutating `ctx.stories` directly could cause confusion if other code expects the original batch to remain unchanged.
-**Fix:** Follow immutability principles:
-```typescript
-// Create new array instead of mutating
-ctx.stories = ctx.stories.filter((s) => s.id !== cmd.storyId);
-// ✓ Already immutable (filter returns new array), but could be clearer:
-const updatedStories = ctx.stories.filter((s) => s.id !== cmd.storyId);
-ctx.stories = updatedStories;
-```
-**Note:** Current code is actually fine (filter returns new array), but the assignment pattern could be clearer.
-**Priority:** P4 — Code works correctly, just a style preference.
----
-#### ENH-11: No Dry-Run Support for Review Stage
-**Severity:** LOW | **Category:** Enhancement
-**File:** `src/pipeline/stages/review.ts:16-29`
-```typescript
-async execute(ctx: PipelineContext): Promise<StageResult> {
-  console.log(chalk.cyan("\n   → Running review phase..."));
-  const reviewResult = await runReview(ctx.config.review, ctx.workdir); // ⚠️ Always runs, even in dry-run mode
-  // ...
-}
-```
-**Impact:** In dry-run mode, review stage still executes `bun test`, `bun run typecheck`, etc. This makes dry runs slow and may fail on incomplete code.
-**Fix:** Check for dry-run flag in context:
-```typescript
-if (ctx.config.execution.dryRun) {
-  console.log(chalk.yellow("   [DRY RUN] Would run review phase"));
-  return { action: "continue" };
-}
-```
-**Note:** PipelineContext doesn't currently have a `dryRun` flag. This would need to be added.
-**Priority:** P4 — Minor UX improvement for dry runs.
----
-#### TYPE-4: Routing Stage Console Logs Duplicate Logic
-**Severity:** LOW | **Category:** Style
-**File:** `src/pipeline/stages/routing.ts:32-45`
-```typescript
-const isBatch = ctx.stories.length > 1;
-if (isBatch) {
-  console.log(
-    chalk.dim(
-      `   Complexity: ${routing.complexity} | Model: ${routing.modelTier} | TDD: ${routing.testStrategy}`,
-    ),
-  );
-} else {
-  console.log(
-    chalk.dim(
-      `   Complexity: ${routing.complexity} | Model: ${routing.modelTier} | TDD: ${routing.testStrategy}`,
-    ),
-  );
-  console.log(chalk.dim(`   Routing: ${routing.reasoning}`));
-}
-```
-**Issue:** Both branches log identical strings. Could be simplified:
-```typescript
-console.log(
-  chalk.dim(
-    `   Complexity: ${routing.complexity} | Model: ${routing.modelTier} | TDD: ${routing.testStrategy}`,
-  ),
-);
-if (!isBatch) {
-  console.log(chalk.dim(`   Routing: ${routing.reasoning}`));
-}
-```
-**Priority:** P4 — Code clarity, no functional impact.
----
-## Priority Fix Order
-| Priority | ID | Effort | Description |
-|:---|:---|:---|:---|
-| **P0** | BUG-7 | M | Implement verify stage logic (run tests, check build) |
-| **P1** | ENH-6 | L | Document and standardize error handling patterns across pipeline stages |
-| **P1** | ENH-7 | M | Add JSDoc to all 9 pipeline stages with examples |
-| **P2** | TYPE-3 | S | Store ConstitutionResult in context, remove prompt stage conversion |
-| **P2** | BUG-8 | S | Document context mutation contract in runPipeline JSDoc |
-| **P2** | ENH-8 | M | Add full pipeline integration test with all stages |
-| **P2** | STYLE-4 | S | Fix token estimation inconsistency (use estimateTokens() everywhere) |
-| **P3** | ENH-9 | M | Add optional spec validation to plan command |
-| **P3** | STYLE-5 | S | Replace `any` with proper types in analyze classifier |
-| **P3** | ENH-10 | S | Log skipped stages for observability |
-| **P4** | STYLE-6 | — | (No action needed — code is correct) |
-| **P4** | ENH-11 | S | Add dry-run support to review stage |
-| **P4** | TYPE-4 | S | Simplify routing stage logging |
-**Effort:** S = Small (<1hr), M = Medium (1-4hrs), L = Large (>4hrs)
----
-## Dimension Scores
-### Security: 20/20 ✓
-- ✓ No hardcoded secrets or credentials
-- ✓ Input validation on all boundaries (queue commands, spec parsing)
-- ✓ Command injection prevention in review runner (using spawn with args array)
-- ✓ Path traversal protection via config path-security module
-- ✓ No eval or dynamic code execution
-- ✓ Hook security validation from v0.2 still in place
-**Notes:** Pipeline stages properly delegate to existing security-vetted modules (hooks, agents, prd). No new security concerns introduced.
-### Reliability: 17/20
-- ✓ Comprehensive error handling in pipeline runner (try/catch, stage failures)
-- ✓ Proper resource cleanup (no leaked streams, timers, or file handles)
-- ✓ Atomic queue file handling from v0.2 maintained
-- ✗ **BUG-7:** Verify stage is a no-op (doesn't actually verify anything)
-- ✗ **ENH-6:** Inconsistent error handling patterns across stages
-- ⚠️ **BUG-8:** Context mutation contract unclear
-**Deductions:** -3 for verify stage gap, -0.5 for inconsistent error patterns, -0.5 for mutation documentation gap.
-### API Design: 18/20
-- ✓ Clean pipeline abstraction with composable stages
-- ✓ Well-defined stage interface (PipelineStage with enabled/execute)
-- ✓ Discriminated union for StageResult (exhaustiveness checking)
-- ✓ Consistent naming conventions (queueCheckStage, routingStage, etc.)
-- ✓ Good separation of concerns (each stage has single responsibility)
-- ✗ **TYPE-3:** Constitution type inconsistency (string vs ConstitutionResult)
-- ✗ **ENH-7:** Missing JSDoc on 60% of pipeline stages
-**Deductions:** -1 for type inconsistency, -1 for documentation gaps.
-### Code Quality: 16/20
-- ✓ Excellent test coverage (constitution: 100%, review: 100%, pipeline: 90%+)
-- ✓ No dead code or commented-out blocks
-- ✓ Files are appropriately sized (<400 lines for all pipeline stages)
-- ✓ Consistent code style (Biome formatting)
-- ✗ **STYLE-4:** Magic number inconsistency (token estimation)
-- ✗ **STYLE-5:** Use of `any` in classifier LLM response parsing
-- ✗ **TYPE-4:** Duplicate logging logic in routing stage
-- ✗ **ENH-8:** Missing integration test for full pipeline
-**Deductions:** -2 for missing integration test, -1 for any usage, -1 for magic number inconsistency.
-### Best Practices: 17/20
-- ✓ Follows established v0.2 patterns (hooks, routing, PRD management)
-- ✓ Proper use of TypeScript features (discriminated unions, exhaustiveness checks)
-- ✓ Clear module boundaries with barrel exports
-- ✓ Good abstraction (pipeline runner is framework-agnostic)
-- ✗ **ENH-6:** Inconsistent error handling (some stages silent fail, others don't)
-- ✗ **ENH-10:** No observability for skipped stages
-- ✗ **BUG-8:** Mutation contract unclear
-**Deductions:** -2 for inconsistent patterns, -1 for observability gap.
----
-## Summary
-The v0.3 pipeline refactor is a **strong architectural improvement** that successfully decomposes the monolithic runner into composable, testable stages. The new modules (constitution, analyze, review) are well-designed with excellent test coverage and proper integration.
-**Critical gap:** The verify stage is currently a placeholder (BUG-7). This must be implemented before v0.3 ships, as it's a core part of the quality gate.
-**Recommended path forward:**
-1. **Immediate (P0):** Implement verify stage with test execution
-2. **Before v0.3 release (P1):** Add pipeline stage JSDoc and standardize error handling
-3. **Post-v0.3 (P2-P4):** Address type inconsistencies, add integration tests, improve observability
-**Grade justification:**
-- Security: Excellent (20/20)
-- Reliability: Very good, one critical gap (17/20)
-- API Design: Very good, minor documentation gap (18/20)
-- Code Quality: Good, missing integration tests (16/20)
-- Best Practices: Good, inconsistent patterns (17/20)
-**Total: 88/100 (A-)**
-With BUG-7 fixed and ENH-6/ENH-7 addressed, this would easily achieve an **A (90+)**.

package/docs/20260217-v04-impl-plan.md DELETED Viewed

@@ -1,86 +0,0 @@
-# v0.4 Implementation Plan: Acceptance Validation
-**Date:** 2026-02-17
-**Branch:** master
-## Complexity Assessment
-- **Files touched:** 8+ (new module + pipeline stage + analyze integration + config + CLI + tests)
-- **LOC:** ~400-600 new
-- **Architectural impact:** New pipeline stage, new analyze output, new retry loop in runner
-- **Test strategy:** test-after (internal modules, not public API)
-## Phase 1: Acceptance test generator
-### 1a: Acceptance module
-**File:** `src/acceptance/generator.ts` (NEW)
-- Parse spec.md acceptance criteria (extract AC-N lines)
-- Build LLM prompt: ACs + codebase context → test file
-- Parse LLM response → write `acceptance.test.ts`
-- Fallback: generate skeleton tests with TODO if LLM fails
-### 1b: Integration with analyze
-**File:** `src/cli/analyze.ts`
-- After decompose, call acceptance test generator
-- Write tests to `ngent/features/<name>/acceptance.test.ts`
-- Config check: `acceptance.generateTests`
-### 1c: Config schema update
-**File:** `src/config/schema.ts`
-- Add `acceptance` config block: `enabled`, `maxRetries`, `generateTests`, `testPath`
-### Tests
-- AC parser extracts criteria from spec markdown
-- Generator produces valid test structure
-- Config validation
-**Commit:** `feat(acceptance): generate acceptance tests from spec ACs`
-## Phase 2: Acceptance pipeline stage
-### 2a: Acceptance stage
-**File:** `src/pipeline/stages/acceptance.ts` (NEW)
-- Only runs when all stories are complete (check prd status)
-- Spawns `bun test acceptance.test.ts` in workdir
-- Parses test output: which ACs passed/failed
-- Returns `continue` if all pass, `fail` with details if any fail
-### 2b: Register in default pipeline
-**File:** `src/pipeline/stages/index.ts`
-- Add `acceptanceStage` after `completionStage`
-### Tests
-- Stage skips when stories still pending
-- Stage runs and parses test results
-- Pass/fail detection
-**Commit:** `feat(acceptance): add acceptance validation pipeline stage`
-## Phase 3: Self-correcting fix loop
-### 3a: Fix story generator
-**File:** `src/acceptance/fix-generator.ts` (NEW)
-- Input: failed ACs + test output + related stories + source code
-- LLM call: generate fix story descriptions
-- Output: FixStory objects with id, title, relatedStories, description
-### 3b: Fix loop in runner
-**File:** `src/execution/runner.ts`
-- After acceptance stage fails: generate fix stories, append to prd
-- Re-run pipeline for fix stories only
-- Re-run acceptance tests
-- Max `config.acceptance.maxRetries` loops
-- If still failing: pause and report to human
-### 3c: Accept override command
-**File:** `src/cli/accept.ts` (NEW)
-- `ngent accept --override AC-2 "reason"`
-- Stores in prd.json `acceptanceOverrides`
-- Acceptance stage skips overridden ACs
-### Tests
-- Fix story generation from failed ACs
-- Retry loop respects maxRetries
-- Override skips specified ACs
-- Full integration: fail → fix → pass
-**Commit:** `feat(acceptance): add self-correcting fix loop with human override`
-## Test Strategy
-- Mode: test-after
-- Run `bun test && bun run typecheck` after each phase