@nathapp/nax 0.27.1 → 0.29.0

Files changed (383)

package/CHANGELOG.md +13 -2
package/dist/nax.js +72691 -0
package/package.json +12 -4
package/src/cli/config.ts +42 -1
package/src/cli/prompts.ts +18 -6
package/src/config/defaults.ts +2 -0
package/src/config/schemas.ts +11 -0
package/src/config/types.ts +8 -0
package/src/context/builder.ts +10 -1
package/src/pipeline/stages/execution.ts +5 -0
package/src/pipeline/stages/prompt.ts +13 -4
package/src/precheck/checks-warnings.ts +37 -0
package/src/precheck/checks.ts +1 -0
package/src/precheck/index.ts +14 -7
package/src/prompts/builder.ts +178 -0
package/src/prompts/index.ts +2 -0
package/src/prompts/loader.ts +43 -0
package/src/prompts/sections/conventions.ts +15 -0
package/src/prompts/sections/index.ts +11 -0
package/src/prompts/sections/isolation.ts +24 -0
package/src/prompts/sections/role-task.ts +34 -0
package/src/prompts/sections/story.ts +13 -0
package/src/prompts/sections/verdict.ts +70 -0
package/src/prompts/templates/implementer.ts +6 -0
package/src/prompts/templates/single-session.ts +6 -0
package/src/prompts/templates/test-writer.ts +6 -0
package/src/prompts/templates/verifier.ts +6 -0
package/src/prompts/types.ts +21 -0
package/src/review/runner.ts +6 -1
package/src/tdd/session-runner.ts +12 -12
package/src/version.ts +2 -1
package/.claude/rules/01-project-conventions.md +0 -34
package/.claude/rules/02-test-architecture.md +0 -39
package/.claude/rules/03-test-writing.md +0 -58
package/.claude/rules/04-forbidden-patterns.md +0 -29
package/.claude/settings.json +0 -15
package/.githooks/pre-commit +0 -16
package/.gitlab-ci.yml +0 -103
package/.mcp.json +0 -8
package/BRIEF.md +0 -140
package/CLAUDE.md +0 -143
package/US-007-IMPLEMENTATION.md +0 -139
package/biome.json +0 -14
package/bun.lock +0 -163
package/bunfig.toml +0 -12
package/docker-compose.test.yml +0 -15
package/docs/20260216-fix-plan-context-review.md +0 -56
package/docs/20260216-relentless-vs-ngent-comparison.md +0 -208
package/docs/20260216-v02-plan.md +0 -136
package/docs/20260216-v02-review.md +0 -685
package/docs/20260217-dogfood-findings.md +0 -56
package/docs/20260217-p2-plus-plan.md +0 -117
package/docs/20260217-partial-fixes-plan.md +0 -62
package/docs/20260217-plan-analyze-spec.md +0 -117
package/docs/20260217-post-impl-review.md +0 -1137
package/docs/20260217-quick-wins-plan.md +0 -66
package/docs/20260217-split-runner-plan.md +0 -75
package/docs/20260217-v03-impl-plan.md +0 -80
package/docs/20260217-v03-post-impl-review.md +0 -589
package/docs/20260217-v04-impl-plan.md +0 -86
package/docs/20260217-v05-post-impl-review.md +0 -850
package/docs/20260217-v06-post-impl-review.md +0 -817
package/docs/20260218-adr003-port-plan.md +0 -151
package/docs/20260218-review-adr003-verification.md +0 -175
package/docs/20260219-fix-plan-bug16-19.md +0 -79
package/docs/20260219-fix-plan-bug20-22.md +0 -114
package/docs/20260219-plan-llm-routing.md +0 -116
package/docs/20260219-review-bug20-22-fixes.md +0 -135
package/docs/20260219-routing-baseline-keyword.md +0 -63
package/docs/20260220-plan-structured-logging-p1.md +0 -80
package/docs/20260220-plan-structured-logging-p2.md +0 -37
package/docs/20260220-review-llm-routing.md +0 -180
package/docs/20260220-review-post-fix-llm-routing.md +0 -70
package/docs/20260221-fix-plan-relevantfiles-split.md +0 -101
package/docs/20260221-fix-plan-routing-mode.md +0 -125
package/docs/20260221-review-v0.9-implementation.md +0 -379
package/docs/20260222-fix-plan-v091-routing-isolation.md +0 -197
package/docs/20260223-fix-plan-prompt-audit.md +0 -62
package/docs/20260224-nax-roadmap-phases.md +0 -189
package/docs/20260225-phase2-llm-service-layer.md +0 -401
package/docs/20260225-review-v0.10.1.md +0 -187
package/docs/20260303-v010-implementation-plan.md +0 -165
package/docs/20260304-review-nax.md +0 -492
package/docs/CLAUDE.md.bak +0 -191
package/docs/ROADMAP.md +0 -364
package/docs/SPEC-rectification.md +0 -0
package/docs/SPEC.md +0 -324
package/docs/US-001-plugin-loading-verification.md +0 -152
package/docs/adr/ADR-005-implementation-plan.md +0 -655
package/docs/adr/ADR-005-pipeline-re-architecture.md +0 -464
package/docs/architecture-analysis.md +0 -1076
package/docs/bugs/BUG-21-escalation-null-attempts.md +0 -48
package/docs/bugs-from-dogfood-run-c.md +0 -243
package/docs/code-review-20260228.md +0 -612
package/docs/code-review-v0.15.0.md +0 -629
package/docs/hook-lifecycle-test-plan.md +0 -149
package/docs/releases/v0.11.0-and-earlier.md +0 -20
package/docs/releases/v0.12.0.md +0 -15
package/docs/releases/v0.13.0.md +0 -14
package/docs/releases/v0.14.0.md +0 -20
package/docs/releases/v0.14.1.md +0 -36
package/docs/releases/v0.14.2.md +0 -51
package/docs/releases/v0.14.3.md +0 -174
package/docs/releases/v0.14.4.md +0 -94
package/docs/releases/v0.15.0.md +0 -502
package/docs/releases/v0.15.1.md +0 -170
package/docs/releases/v0.15.3.md +0 -193
package/docs/specs/bug-039-orphan-processes.md +0 -131
package/docs/specs/bug-040-review-rectification.md +0 -82
package/docs/specs/bug-041-cross-story-test-isolation.md +0 -88
package/docs/specs/bug-042-verifier-failure-capture.md +0 -117
package/docs/specs/bun-pty-migration.md +0 -171
package/docs/specs/central-run-registry.md +0 -116
package/docs/specs/feat-010-smart-runner-git-history.md +0 -96
package/docs/specs/feat-011-file-context-strategy.md +0 -73
package/docs/specs/feat-012-tdd-writer-tier.md +0 -79
package/docs/specs/feat-013-test-after-review.md +0 -89
package/docs/specs/feat-014-heartbeat-observability.md +0 -127
package/docs/specs/status-file-consolidation.md +0 -93
package/docs/specs/status-file-v0.10.1.md +0 -812
package/docs/specs/trigger-completion.md +0 -145
package/docs/specs/verification-architecture-v2.md +0 -343
package/docs/tdd/strategies.md +0 -97
package/docs/v0.10-global-config.md +0 -206
package/docs/v0.10-plugin-system.md +0 -415
package/docs/v0.10-prompt-optimizer.md +0 -234
package/docs/v0.3-spec.md +0 -244
package/docs/v0.4-spec.md +0 -140
package/docs/v0.5-spec.md +0 -237
package/docs/v0.6-spec.md +0 -371
package/docs/v0.7-spec.md +0 -177
package/docs/v0.8-llm-routing.md +0 -206
package/docs/v0.8-structured-logging.md +0 -132
package/docs/v0.9.3-prompt-audit.md +0 -112
package/examples/plugins/console-reporter/index.test.ts +0 -207
package/examples/plugins/console-reporter/index.ts +0 -110
package/memory/topic/feat-010-baseref.md +0 -28
package/memory/topic/feat-013-test-after-deprecation.md +0 -22
package/nax/config.json +0 -154
package/nax/features/bug-039-medium/prd.json +0 -45
package/nax/features/bugfix-v0171/prd.json +0 -52
package/nax/features/central-run-registry/prd.json +0 -105
package/nax/features/config-management/prd.json +0 -108
package/nax/features/config-management/progress.txt +0 -5
package/nax/features/diagnose/acceptance.test.ts +0 -414
package/nax/features/diagnose/prd.json +0 -41
package/nax/features/nax-compliance/prd.json +0 -52
package/nax/features/nax-compliance/progress.txt +0 -1
package/nax/features/orchestration-fixes/prd.json +0 -89
package/nax/features/orchestration-fixes/progress.txt +0 -1
package/nax/features/plugin-integration/US-007-VERIFICATION.md +0 -259
package/nax/features/plugin-integration/prd.json +0 -208
package/nax/features/plugin-integration/progress.txt +0 -5
package/nax/features/post-rearch-bugfix/prd.json +0 -137
package/nax/features/precheck/prd.json +0 -205
package/nax/features/precheck/progress.txt +0 -15
package/nax/features/review-quality/prd.json +0 -55
package/nax/features/routing-persistence/prd.json +0 -104
package/nax/features/routing-persistence/progress.txt +0 -1
package/nax/features/smart-test-runner/plan.md +0 -7
package/nax/features/smart-test-runner/prd.json +0 -203
package/nax/features/smart-test-runner/progress.txt +0 -13
package/nax/features/smart-test-runner/spec.md +0 -7
package/nax/features/smart-test-runner/tasks.md +0 -8
package/nax/features/status-file-consolidation/prd.json +0 -106
package/nax/features/structured-logging/prd.json +0 -199
package/nax/features/trigger-completion/prd.json +0 -150
package/nax/features/trigger-completion/progress.txt +0 -7
package/nax/features/unlock/prd.json +0 -36
package/nax/features/v0.18.3-execution-reliability/prd.json +0 -80
package/nax/features/v0.18.3-execution-reliability/progress.txt +0 -3
package/nax/features/v0.19.0-hardening/plan.md +0 -7
package/nax/features/v0.19.0-hardening/prd.json +0 -84
package/nax/features/v0.19.0-hardening/progress.txt +0 -7
package/nax/features/v0.19.0-hardening/spec.md +0 -18
package/nax/features/v0.19.0-hardening/tasks.md +0 -8
package/nax/features/verify-v2/prd.json +0 -79
package/nax/features/verify-v2/progress.txt +0 -3
package/nax/status.json +0 -36
package/test/COVERAGE-GAPS.md +0 -333
package/test/e2e/cm-003-default-view.test.ts +0 -195
package/test/e2e/plan-analyze-run.test.ts +0 -902
package/test/helpers/helpers.test.ts +0 -295
package/test/helpers/timeout.ts +0 -42
package/test/integration/US-002-TEST-SUMMARY.md +0 -107
package/test/integration/US-003-TEST-SUMMARY.md +0 -149
package/test/integration/US-004-TEST-SUMMARY.md +0 -106
package/test/integration/US-005-TEST-SUMMARY.md +0 -138
package/test/integration/US-007-TEST-SUMMARY.md +0 -100
package/test/integration/cli/agent-validation.test.ts +0 -439
package/test/integration/cli/cli-config-default-edge-cases.test.ts +0 -223
package/test/integration/cli/cli-config-default-view.test.ts +0 -230
package/test/integration/cli/cli-config-diff.test.ts +0 -461
package/test/integration/cli/cli-config.test.ts +0 -737
package/test/integration/cli/cli-diagnose.test.ts +0 -595
package/test/integration/cli/cli-logs.test.ts +0 -346
package/test/integration/cli/cli-plugins.test.ts +0 -679
package/test/integration/cli/cli-precheck.test.ts +0 -372
package/test/integration/cli/cli-run-headless.test.ts +0 -174
package/test/integration/cli/cli.test.ts +0 -76
package/test/integration/cli/precheck-integration.test.ts +0 -476
package/test/integration/cli/precheck-orchestrator.test.ts +0 -247
package/test/integration/cli/precheck.test.ts +0 -806
package/test/integration/config/config-loader.test.ts +0 -266
package/test/integration/config/config.test.ts +0 -444
package/test/integration/config/merger.test.ts +0 -466
package/test/integration/config/paths.test.ts +0 -52
package/test/integration/config/security-loader.test.ts +0 -83
package/test/integration/context/context-integration.test.ts +0 -703
package/test/integration/context/context-path-security.test.ts +0 -173
package/test/integration/context/context-provider-injection.test.ts +0 -507
package/test/integration/context/context-verification-integration.test.ts +0 -296
package/test/integration/context/s5-greenfield-fallback.test.ts +0 -298
package/test/integration/execution/execution-isolation.test.ts +0 -143
package/test/integration/execution/execution.test.ts +0 -634
package/test/integration/execution/feature-status-write.test.ts +0 -302
package/test/integration/execution/parallel.test.ts +0 -251
package/test/integration/execution/prd-pause.test.ts +0 -205
package/test/integration/execution/prd-resolvers.test.ts +0 -186
package/test/integration/execution/progress.test.ts +0 -34
package/test/integration/execution/runner-batching.test.ts +0 -682
package/test/integration/execution/runner-config-plugins.test.ts +0 -462
package/test/integration/execution/runner-escalation.test.ts +0 -561
package/test/integration/execution/runner-fixes.test.ts +0 -400
package/test/integration/execution/runner-plugin-integration.test.ts +0 -544
package/test/integration/execution/runner-queue-and-attempts.test.ts +0 -476
package/test/integration/execution/status-file-integration.test.ts +0 -289
package/test/integration/execution/status-file.test.ts +0 -380
package/test/integration/execution/status-writer.test.ts +0 -447
package/test/integration/execution/story-id-in-events.test.ts +0 -274
package/test/integration/interaction/interaction-chain-pipeline.test.ts +0 -476
package/test/integration/pipeline/hooks.test.ts +0 -363
package/test/integration/pipeline/pipeline-acceptance.test.ts +0 -303
package/test/integration/pipeline/pipeline-events.test.ts +0 -476
package/test/integration/pipeline/pipeline.test.ts +0 -660
package/test/integration/pipeline/reporter-lifecycle.test.ts +0 -862
package/test/integration/pipeline/verify-stage.test.ts +0 -286
package/test/integration/plan/analyze-integration.test.ts +0 -262
package/test/integration/plan/analyze-scanner.test.ts +0 -132
package/test/integration/plan/logger.test.ts +0 -461
package/test/integration/plan/plan.test.ts +0 -157
package/test/integration/plugins/config-integration.test.ts +0 -173
package/test/integration/plugins/config-resolution.test.ts +0 -523
package/test/integration/plugins/loader.test.ts +0 -644
package/test/integration/plugins/plugins-registry.test.ts +0 -747
package/test/integration/plugins/validator.test.ts +0 -564
package/test/integration/review/review-config-commands.test.ts +0 -320
package/test/integration/review/review-config-schema.test.ts +0 -117
package/test/integration/review/review-plugin-integration.test.ts +0 -729
package/test/integration/review/review.test.ts +0 -150
package/test/integration/routing/plugin-routing-advanced.test.ts +0 -461
package/test/integration/routing/plugin-routing-core.test.ts +0 -527
package/test/integration/routing/routing-stage-bug-021.test.ts +0 -275
package/test/integration/routing/routing-stage-greenfield.test.ts +0 -287
package/test/integration/tdd/tdd-cleanup.test.ts +0 -246
package/test/integration/tdd/tdd-orchestrator-core.test.ts +0 -565
package/test/integration/tdd/tdd-orchestrator-failureCategory.test.ts +0 -355
package/test/integration/tdd/tdd-orchestrator-fallback.test.ts +0 -311
package/test/integration/tdd/tdd-orchestrator-lite.test.ts +0 -289
package/test/integration/tdd/tdd-orchestrator-prompts.test.ts +0 -260
package/test/integration/tdd/tdd-orchestrator-verdict.test.ts +0 -536
package/test/integration/tmp/headless-test/test.jsonl +0 -30
package/test/integration/verification/test-scanner.test.ts +0 -403
package/test/integration/verification/verification-asset-check.test.ts +0 -143
package/test/integration/worktree/manager.test.ts +0 -218
package/test/integration/worktree/worktree-merge.test.ts +0 -341
package/test/manual/logging-formatter-demo.ts +0 -158
package/test/ui/tui-agent-panel.test.tsx +0 -99
package/test/ui/tui-pty-integration.test.tsx +0 -146
package/test/unit/acceptance.test.ts +0 -187
package/test/unit/agent-stderr-capture.test.ts +0 -147
package/test/unit/agents/claude.test.ts +0 -107
package/test/unit/analyze-classifier.test.ts +0 -216
package/test/unit/analyze.test.ts +0 -224
package/test/unit/auto-detect.test.ts +0 -250
package/test/unit/cli-status-project-level.test.ts +0 -283
package/test/unit/cli-status.test.ts +0 -418
package/test/unit/commands/common.test.ts +0 -321
package/test/unit/commands/logs.test.ts +0 -458
package/test/unit/commands/runs.test.ts +0 -303
package/test/unit/commands/unlock.test.ts +0 -320
package/test/unit/config/defaults.test.ts +0 -70
package/test/unit/config/quality-commands-schema.test.ts +0 -72
package/test/unit/config/regression-gate-schema.test.ts +0 -160
package/test/unit/config/smart-runner-flag.test.ts +0 -250
package/test/unit/constitution-generators.test.ts +0 -161
package/test/unit/constitution.test.ts +0 -210
package/test/unit/context/context-autodetect.test.ts +0 -297
package/test/unit/context/context-build.test.ts +0 -575
package/test/unit/context/context-coverage.test.ts +0 -236
package/test/unit/context/context-error.test.ts +0 -93
package/test/unit/context/context-estimate-tokens.test.ts +0 -201
package/test/unit/context/context-format.test.ts +0 -302
package/test/unit/context/context-isolation.test.ts +0 -267
package/test/unit/context/context-sort.test.ts +0 -93
package/test/unit/context/context-story.test.ts +0 -108
package/test/unit/context/prior-failures.test.ts +0 -463
package/test/unit/context.test.ts +0 -1726
package/test/unit/cost.test.ts +0 -231
package/test/unit/crash-recovery.test.ts +0 -309
package/test/unit/escalation.test.ts +0 -127
package/test/unit/execution/lifecycle/run-completion.test.ts +0 -240
package/test/unit/execution/lifecycle/run-regression.test.ts +0 -420
package/test/unit/execution/pid-registry.test.ts +0 -241
package/test/unit/execution/sequential-executor.test.ts +0 -235
package/test/unit/execution/sfc-004-dead-code-cleanup.test.ts +0 -89
package/test/unit/execution/structured-failure.test.ts +0 -415
package/test/unit/execution-logging-stderr.test.ts +0 -157
package/test/unit/execution-stage.test.ts +0 -123
package/test/unit/fix-generator.test.ts +0 -276
package/test/unit/formatters.test.ts +0 -468
package/test/unit/greenfield.test.ts +0 -180
package/test/unit/hooks/shell-security.test.ts +0 -40
package/test/unit/interaction/auto-plugin.test.ts +0 -162
package/test/unit/interaction/human-review-trigger.test.ts +0 -165
package/test/unit/interaction-network-failures.test.ts +0 -390
package/test/unit/interaction-plugins.test.ts +0 -472
package/test/unit/logging/formatter.test.ts +0 -456
package/test/unit/merge.test.ts +0 -269
package/test/unit/metrics/aggregator.test.ts +0 -164
package/test/unit/metrics/tracker.test.ts +0 -186
package/test/unit/metrics.test.ts +0 -276
package/test/unit/optimizer/noop.optimizer.test.ts +0 -125
package/test/unit/optimizer/rule-based.optimizer.test.ts +0 -358
package/test/unit/pipeline/event-bus.test.ts +0 -105
package/test/unit/pipeline/routing-partial-override.test.ts +0 -121
package/test/unit/pipeline/runner-retry.test.ts +0 -89
package/test/unit/pipeline/stages/autofix.test.ts +0 -97
package/test/unit/pipeline/stages/completion-review-gate.test.ts +0 -218
package/test/unit/pipeline/stages/execution-ambiguity.test.ts +0 -311
package/test/unit/pipeline/stages/execution-merge-conflict.test.ts +0 -218
package/test/unit/pipeline/stages/rectify.test.ts +0 -101
package/test/unit/pipeline/stages/regression-stage.test.ts +0 -69
package/test/unit/pipeline/stages/review.test.ts +0 -201
package/test/unit/pipeline/stages/routing-idempotence.test.ts +0 -139
package/test/unit/pipeline/stages/routing-initial-complexity.test.ts +0 -321
package/test/unit/pipeline/stages/routing-persistence.test.ts +0 -380
package/test/unit/pipeline/stages/verify.test.ts +0 -267
package/test/unit/pipeline/subscribers/events-writer.test.ts +0 -227
package/test/unit/pipeline/subscribers/hooks.test.ts +0 -84
package/test/unit/pipeline/subscribers/interaction.test.ts +0 -313
package/test/unit/pipeline/subscribers/registry.test.ts +0 -149
package/test/unit/pipeline/subscribers/reporters.test.ts +0 -90
package/test/unit/pipeline/verify-smart-runner.test.ts +0 -345
package/test/unit/prd-auto-default.test.ts +0 -291
package/test/unit/prd-failure-category.test.ts +0 -177
package/test/unit/prd-get-next-story.test.ts +0 -215
package/test/unit/precheck-checks.test.ts +0 -841
package/test/unit/precheck-story-size-gate.test.ts +0 -288
package/test/unit/precheck-types.test.ts +0 -143
package/test/unit/prompts.test.ts +0 -476
package/test/unit/queue.test.ts +0 -237
package/test/unit/rectification.test.ts +0 -285
package/test/unit/registry.test.ts +0 -288
package/test/unit/review/runner.test.ts +0 -117
package/test/unit/routing/content-hash.test.ts +0 -99
package/test/unit/routing/routing-stability.test.ts +0 -208
package/test/unit/routing/strategies/llm.test.ts +0 -306
package/test/unit/routing-advanced.test.ts +0 -313
package/test/unit/routing-core.test.ts +0 -341
package/test/unit/routing-strategies.test.ts +0 -440
package/test/unit/storyid-events.test.ts +0 -213
package/test/unit/tdd-verdict.test.ts +0 -492
package/test/unit/test-output-parser.test.ts +0 -377
package/test/unit/ui/tui-controls.test.ts +0 -335
package/test/unit/ui/tui-cost-and-pty.test.ts +0 -190
package/test/unit/ui/tui-layout.test.ts +0 -379
package/test/unit/ui/tui-stories.test.ts +0 -333
package/test/unit/unit-isolation.test.ts +0 -135
package/test/unit/utils/git.test.ts +0 -50
package/test/unit/utils/path-security.test.ts +0 -47
package/test/unit/utils-helpers.test.ts +0 -318
package/test/unit/verdict.test.ts +0 -325
package/test/unit/verification/orchestrator-types.test.ts +0 -54
package/test/unit/verification/orchestrator.test.ts +0 -66
package/test/unit/verification/smart-runner-config.test.ts +0 -163
package/test/unit/verification/smart-runner-discovery.test.ts +0 -354
package/test/unit/verification/smart-runner.test.ts +0 -262
package/test/unit/verification/strategies/acceptance.test.ts +0 -33
package/test/unit/verification/strategies/regression.test.ts +0 -87
package/test/unit/verification/strategies/scoped.test.ts +0 -100
package/test/unit/worktree-manager.test.ts +0 -159
package/tsconfig.json +0 -27

package/docs/v0.7-spec.md DELETED

@@ -1,177 +0,0 @@
-# nax v0.7 Specification
-**Date:** 2026-02-17
-**Status:** Draft
-## Theme: Test Context Injection
-v0.7 addresses test redundancy caused by isolated story sessions. Each agent session currently writes tests without knowing what prior stories already covered, leading to duplicate coverage.
-## Problem
-During dogfooding (bun-kv-store, 8 stories), we observed:
-- 6 tests for "name is required" (missing, empty, whitespace, not string, null, undefined)
-- Each story independently writes "comprehensive tests" for its area
-- Validation stories re-test what CRUD stories already covered
-- **Root cause:** Each session's prompt has zero visibility into existing test files
-## Solution: Test Context Injection
-Inject a summary of existing test files into each story's prompt so the agent knows what's already covered and avoids duplication.
-### How It Works
-Before the **prompt stage**, scan the project's test directory and generate a concise summary:
-```
-## Existing Test Coverage
-### test/store.test.ts (45 tests)
-- CRUD operations: create, read, update, delete
-- Error handling: missing key, invalid value
-- Batch operations: getMany, setMany
-### test/validation.test.ts (12 tests)
-- Input validation: name required, type checking
-- Size limits: max key length, max value size
-```
-This summary is injected into the prompt alongside the story context, constitution, and other context elements.
-### Context Element
-```typescript
-interface TestCoverageContext {
-  type: 'test-coverage';
-  priority: 85;           // Below constitution (95), above file context (50)
-  content: string;        // Formatted test summary
-  tokens: number;
-  source: string;         // e.g., "test/*.test.ts"
-}
-```
-### Summary Generation
-```typescript
-interface TestSummaryOptions {
-  /** Test directory to scan (default: auto-detect from config or common patterns) */
-  testDir?: string;
-  /** Glob pattern for test files (default: "**/*.test.{ts,js,tsx,jsx}") */
-  testPattern?: string;
-  /** Max tokens for the summary (default: 500) */
-  maxTokens?: number;
-  /** Summary detail level */
-  detail: 'names-only' | 'names-and-counts' | 'describe-blocks';
-}
-```
-**Detail levels:**
-- `names-only` — Just file names and test count: `test/store.test.ts (45 tests)`
-- `names-and-counts` — File names + top-level describe blocks with counts
-- `describe-blocks` — File names + describe blocks + test names (most expensive but most useful)
-**Default:** `names-and-counts` (good balance of info vs tokens)
-### Scanning Approach
-1. Find test files matching pattern
-2. For each file, extract `describe()` and `test()`/`it()` block names via regex (no AST parsing needed)
-3. Format as markdown summary
-4. Truncate to token budget
-### Prompt Injection
-Add to the story prompt:
-```
-## Existing Test Coverage
-The following tests already exist. DO NOT duplicate this coverage.
-Focus only on testing NEW behavior introduced by this story.
-### test/store.test.ts (45 tests)
-- CRUD operations (12 tests): create, read, update, delete, upsert
-- Validation (8 tests): required fields, type checks, size limits
-- Error handling (6 tests): not found, duplicate key, connection error
-...
-## Your Story
-US-007: Add input sanitization
-...
-```
-### Config
-```json
-{
-  "context": {
-    "testCoverage": {
-      "enabled": true,
-      "detail": "names-and-counts",
-      "maxTokens": 500,
-      "testDir": "test",
-      "testPattern": "**/*.test.{ts,js}"
-    }
-  }
-}
-```
-## Acceptance Criteria
-- [ ] Test files scanned and summarized before each story prompt
-- [ ] Summary injected into prompt at priority 85
-- [ ] `describe()` and `test()` names extracted via regex
-- [ ] Summary respects maxTokens budget
-- [ ] Config allows enabling/disabling and adjusting detail level
-- [ ] Summary updates between stories (reflects tests added by prior stories)
-- [ ] No performance regression (scanning should be <100ms for typical projects)
-## Implementation Plan
-### Phase 1: Test Scanner
-**Files:** `src/context/test-scanner.ts`
-- Scan test directory for test files
-- Extract describe/test block names via regex
-- Format as markdown summary
-- Respect token budget
-**Commit:** `feat(context): add test file scanner for coverage summary`
-### Phase 2: Context Integration
-**Files:** `src/context/builder.ts`, `src/pipeline/stages/context.ts`, `src/config/schema.ts`
-- Add TestCoverageContext element type
-- Wire scanner into context stage (runs before prompt assembly)
-- Add config schema for testCoverage settings
-- Summary refreshes between stories
-**Commit:** `feat(context): inject test coverage summary into story prompts`
-### Phase 3: Prompt Guidance
-**Files:** `src/execution/prompts.ts`
-- Add "DO NOT duplicate" instruction in prompt template
-- Reference existing coverage summary
-- Reinforce constitution test guidance
-**Commit:** `feat(prompts): add test dedup guidance referencing coverage summary`
-## Test Strategy
-- Mode: test-after
-- Unit tests: scanner regex extraction, summary formatting, token truncation
-- Integration: context builder includes test coverage element
-- Run `bun test && bun run typecheck` after each phase
-## Estimated Effort
-~300-400 LOC across 3 phases.
-## Measurement
-Compare v0.5.0 (no dedup) vs v0.7.0 (context injection) on the same dogfood project:
-| Metric | v0.5.0 | v0.7.0 |
-|:---|:---|:---|
-| Total tests generated | ? | ? |
-| Redundant tests | ? | ? |
-| Code quality grade | ? | ? |
-| Acceptance rate | ? | ? |
-| Total cost | ? | ? |
-| Total time | ? | ? |
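
The "Scanning Approach" steps in the deleted v0.7 spec (extract `describe()` and `test()`/`it()` names via regex, format a markdown summary, truncate to a token budget) can be sketched roughly as follows. This is a minimal illustration, not the code that shipped in nax: the names `extractBlocks`, `formatSummary`, and `truncateToBudget` are hypothetical, nested `describe` blocks are treated as flat groups, and the 4-characters-per-token estimate is an assumption.

```typescript
// Hypothetical sketch of the v0.7 "names-and-counts" scanner; not nax's actual API.
interface DescribeSummary {
  name: string;
  tests: string[];
}

// Matches describe("..."), test("..."), it("...") (and variants such as test.only)
// with single, double, or backtick quotes. Group 1 = kind, group 3 = block name.
const BLOCK_RE = /\b(describe|test|it)(?:\.\w+)?\s*\(\s*(["'`])((?:\\.|(?!\2).)*)\2/g;

// Step 2 of the spec: pull block names out of one test file's source text.
export function extractBlocks(source: string): DescribeSummary[] {
  const groups: DescribeSummary[] = [];
  let current: DescribeSummary | undefined;
  for (const match of source.matchAll(BLOCK_RE)) {
    const [, kind, , name] = match;
    if (kind === "describe") {
      // Simplification: every describe() starts a new flat group.
      current = { name, tests: [] };
      groups.push(current);
    } else {
      if (!current) {
        current = { name: "(top level)", tests: [] };
        groups.push(current);
      }
      current.tests.push(name);
    }
  }
  return groups;
}

// Step 3: format one file as a markdown summary with per-group counts.
export function formatSummary(file: string, groups: DescribeSummary[]): string {
  const total = groups.reduce((n, g) => n + g.tests.length, 0);
  const lines = [`### ${file} (${total} tests)`];
  for (const g of groups) {
    lines.push(`- ${g.name} (${g.tests.length} tests)`);
  }
  return lines.join("\n");
}

// Step 4: keep whole lines until the (assumed) ~4 chars/token budget is spent.
export function truncateToBudget(summary: string, maxTokens: number): string {
  const kept: string[] = [];
  let used = 0;
  for (const line of summary.split("\n")) {
    const cost = Math.ceil((line.length + 1) / 4);
    if (used + cost > maxTokens) break;
    kept.push(line);
    used += cost;
  }
  return kept.join("\n");
}
```

Truncating on line boundaries keeps the summary readable when the budget runs out. A regex scan like this can also pick up `test(` calls that appear inside string literals, which the spec accepts in exchange for skipping AST parsing.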

package/docs/v0.8-llm-routing.md DELETED

@@ -1,206 +0,0 @@
-# v0.8 — LLM-Enhanced Routing
-> Priority: **HIGH** — keyword routing causes costly misroutes (e.g., US-008 simple barrel exports → powerful + TDD due to "public api" keyword match).
-## Problem
-Keyword-based routing is brittle and expensive:
-1. **False positives**: "public api" in title → three-session-tdd even for simple barrel exports ($1.25 wasted)
-2. **False negatives**: Complex integration work without magic keywords → test-after on fast tier
-3. **No semantic understanding**: Can't assess *actual* implementation effort from acceptance criteria
-4. **Keyword overlap**: Security keywords fire on simple "add auth header to request" stories
-Evidence from dogfood runs:
-- Run D2: US-008 ("Export public API and create barrel exports") — simple task, keyword matched "public api" → powerful + three-session-tdd. Cost: $1.25. Should have been: fast + test-after (~$0.10).
-## Design
-### Config
-```json
-{
-  "routing": {
-    "strategy": "llm",
-    "llm": {
-      "model": "fast",
-      "fallbackToKeywords": true,
-      "maxInputTokens": 2000,
-      "cacheDecisions": true
-    }
-  }
-}
-```
-- `model`: Tier used for the routing LLM call itself (default: `fast` — routing should be cheap)
-- `fallbackToKeywords`: If LLM call fails (timeout, parse error), fall back to keyword strategy (default: `true`)
-- `maxInputTokens`: Token budget for story context sent to LLM (default: `2000`)
-- `cacheDecisions`: Cache routing decisions per story ID within a run (default: `true`)
-### LLM Prompt
-The LLM receives a structured prompt with the story and must return a JSON decision:
-```
-You are a code task router. Given a user story, classify its complexity and select the appropriate execution strategy.
-## Story
-Title: {title}
-Description: {description}
-Acceptance Criteria:
-{acceptanceCriteria as numbered list}
-Tags: {tags}
-## Available Tiers
-- fast: Simple changes, typos, config updates, boilerplate. <30 min of coding.
-- balanced: Standard features, moderate logic, straightforward tests. 30-90 min.
-- powerful: Complex architecture, security-critical, multi-file refactors, novel algorithms. >90 min.
-## Available Test Strategies
-- test-after: Write implementation first, add tests after. For straightforward work.
-- three-session-tdd: Separate test-writer → implementer → verifier sessions. For complex/critical work where test design matters.
-## Rules
-- Default to the CHEAPEST option that will succeed.
-- three-session-tdd ONLY when: (a) security/auth logic, (b) complex algorithms, (c) public API contracts that consumers depend on.
-- Simple barrel exports, re-exports, or index files are ALWAYS test-after + fast, regardless of keywords.
-- A story touching many files doesn't automatically mean complex — copy-paste refactors are simple.
-Respond with ONLY this JSON (no markdown, no explanation):
-{"complexity":"simple|medium|complex|expert","modelTier":"fast|balanced|powerful","testStrategy":"test-after|three-session-tdd","reasoning":"<one line>"}
-```
-### Implementation
-Modify `src/routing/strategies/llm.ts`:
-```typescript
-export const llmStrategy: RoutingStrategy = {
-  name: "llm",
-  async route(story: UserStory, context: RoutingContext): Promise<RoutingDecision | null> {
-    const config = context.config;
-    const llmConfig = config.routing.llm;
-    if (!llmConfig) return null;
-    // Check cache
-    if (llmConfig.cacheDecisions && cachedDecisions.has(story.id)) {
-      return cachedDecisions.get(story.id)!;
-    }
-    try {
-      const prompt = buildRoutingPrompt(story, config);
-      const modelId = config.models[llmConfig.model ?? "fast"];
-      const result = await callLlm(modelId, prompt, {
-        maxTokens: 200,
-        timeout: 15_000, // 15s hard limit — routing shouldn't block
-      });
-      const decision = parseRoutingResponse(result, story, config);
-      if (llmConfig.cacheDecisions) {
-        cachedDecisions.set(story.id, decision);
-      }
-      return decision;
-    } catch (err) {
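-      // `err` is `unknown` under strict TypeScript, so narrow it before reading `.message`
-      const message = err instanceof Error ? err.message : String(err);
-      logger.warn(`LLM routing failed for ${story.id}: ${message}`);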
-      return null; // Falls through to keyword strategy
-    }
-  },
-};
-```
-### LLM Call Mechanism
-nax already spawns Claude Code via `Bun.spawn`. For the routing LLM call, we need a **lightweight** approach:
-**Option A — Claude Code one-shot**: `claude -p "..." --model <model>` with 15s timeout.
-- Pro: Reuses existing infra, model aliases work.
-- Con: ~3-5s startup overhead per call. For 9 stories = 27-45s total.
-**Option B — Direct API call**: HTTP request to provider API (Anthropic/OpenAI/Google).
-- Pro: <1s per call, batch-friendly.
-- Con: Needs API key handling, provider-specific code.
-**Recommendation: Option A** for v0.8 (simplicity), with config option to batch all stories in one call:
-```
-// Single LLM call for all pending stories (batch mode)
-"Route these 9 stories:\n1. US-001: ...\n2. US-002: ...\n\nRespond with JSON array: [{id, complexity, modelTier, testStrategy, reasoning}]"
-```
-Batch mode cuts 9 calls → 1 call. ~5s total routing overhead.
-### Strategy Interface Change
-The current `RoutingStrategy.route()` is synchronous. LLM routing needs async:
-```typescript
-export interface RoutingStrategy {
-  readonly name: string;
-  route(story: UserStory, context: RoutingContext): RoutingDecision | null | Promise<RoutingDecision | null>;
-}
-```
-`StrategyChain.route()` becomes async (already called with `await` in `routeStory()`).
-### Error Handling
-| Failure | Behavior |
-|:---|:---|
-| LLM timeout (>15s) | Return null → keyword fallback |
-| JSON parse error | Return null → keyword fallback |
-| Invalid field values | Return null → keyword fallback |
-| LLM returns unknown complexity | Clamp to nearest valid value |
-| Any failure above | Logged via `logger.warn()` with story ID |
-### Logging
-```
-[routing] LLM classified US-008 as simple/fast/test-after: "Barrel export file, no logic to test"
-[routing] LLM routing failed for US-003: timeout after 15000ms, falling back to keyword
-```
-## Cost Analysis
-| Scenario | Keyword Cost | LLM Routing Cost | Savings |
-|:---|:---|:---|:---|
-| US-008 (barrel exports) | $1.25 (powerful+TDD) | $0.10 (fast+test-after) + $0.01 routing | **$1.14 saved** |
-| 9-story run (batch) | Variable | ~$0.02 routing overhead | Net positive if prevents 1+ misroute |
-LLM routing call: ~500 input tokens + 100 output tokens per story = ~$0.001 on Flash.
-Batch mode (9 stories): ~2000 input + 400 output = ~$0.003 total.
-**ROI: One prevented misroute (~$1.14) pays for roughly 400 batch routing calls at ~$0.003 each.**
-## Acceptance Criteria
-1. `config.routing.strategy = "llm"` activates LLM routing
-2. LLM strategy returns structured `RoutingDecision` with reasoning
-3. Falls back to keyword strategy on any LLM failure
-4. Batch mode: single LLM call routes all pending stories
-5. Routing decisions cached per story ID within a run
-6. Strategy chain async support (non-breaking — keyword still sync)
-7. Routing overhead < 10s for batch of 10 stories
-8. Logging: every LLM routing decision logged with reasoning
-## Files to Modify
-- `src/routing/strategies/llm.ts` — Main implementation
-- `src/routing/strategy.ts` — Make interface async-compatible
-- `src/routing/chain.ts` — `route()` becomes async
-- `src/config/schema.ts` — Add `LlmRoutingConfig` type
-- `src/config/defaults.ts` — Add LLM routing defaults
-- `test/routing/llm-strategy.test.ts` — Unit tests
-- `test/routing/chain.test.ts` — Update for async
-## Non-Goals (v0.8)
-- Direct API calls (Option B) — defer to v0.9+
-- Adaptive routing (learning from historical data) — existing stub, separate feature
-- Custom routing prompts — hardcoded prompt is fine for now
----
-*Created 2026-02-19*
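
The error-handling table in the removed LLM-routing spec (parse error or invalid fields return null, unknown complexity is clamped) can be sketched as a pure validation function for the batch response. This is a minimal sketch, not the package's implementation: `parseBatchRoutingResponse`, `BatchDecision`, the value sets in `COMPLEXITIES` and `MODEL_TIERS`, and the choice of `"medium"` as the clamp target are all assumptions made for illustration.

```typescript
// Hypothetical value sets; the real ones would come from nax's config schema.
const COMPLEXITIES = ["simple", "medium", "complex"] as const;
const MODEL_TIERS = ["fast", "balanced", "powerful"] as const;
type Complexity = (typeof COMPLEXITIES)[number];
type ModelTier = (typeof MODEL_TIERS)[number];

interface BatchDecision {
  id: string;
  complexity: Complexity;
  modelTier: ModelTier;
  testStrategy: string;
  reasoning: string;
}

// Type guard: true when `value` is one of the allowed string literals.
function isOneOf<T extends string>(allowed: readonly T[], value: unknown): value is T {
  return typeof value === "string" && (allowed as readonly string[]).indexOf(value) !== -1;
}

// Returns null on any failure so the caller can fall back to keyword routing.
function parseBatchRoutingResponse(raw: string, expectedIds: string[]): BatchDecision[] | null {
  let parsed: unknown;
  try {
    parsed = JSON.parse(raw);
  } catch {
    return null; // JSON parse error
  }
  if (!Array.isArray(parsed)) return null;

  const decisions: BatchDecision[] = [];
  const seen: string[] = [];
  for (const item of parsed) {
    if (typeof item !== "object" || item === null) return null;
    const { id, complexity, modelTier, testStrategy, reasoning } = item as Record<string, unknown>;
    // Reject unknown or duplicate story IDs.
    if (typeof id !== "string" || expectedIds.indexOf(id) === -1 || seen.indexOf(id) !== -1) return null;
    // Invalid field values invalidate the whole batch.
    if (!isOneOf(MODEL_TIERS, modelTier) || typeof testStrategy !== "string") return null;
    seen.push(id);
    decisions.push({
      id,
      // Unknown complexity is clamped rather than rejected (assumed target: "medium").
      complexity: isOneOf(COMPLEXITIES, complexity) ? complexity : "medium",
      modelTier,
      testStrategy,
      reasoning: typeof reasoning === "string" ? reasoning : "",
    });
  }
  // Every requested story must have exactly one decision.
  return decisions.length === expectedIds.length ? decisions : null;
}
```

Rejecting the whole batch on a single bad entry keeps the fallback rule simple: either every pending story gets an LLM decision, or all of them go through the keyword strategy.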

package/docs/v0.8-structured-logging.md DELETED

@@ -1,132 +0,0 @@
-# Feature: Structured Logging for nax v0.8
-## Problem
-nax currently uses raw `console.log` with chalk formatting throughout the codebase. Developers running `nax run` in headless mode have no way to:
-- Control verbosity (only see errors vs full debug output)
-- Get timing data per story/stage for performance analysis
-- Review token counts and API costs per story
-- Debug failures with full prompt/response dumps
-- Parse logs programmatically for CI/CD integration
-## Requirements
-**REQ-1: Log Levels**
-- Support 4 levels: `error`, `warn`, `info`, `debug`
-- Default level: `info`
-- CLI flags: `--verbose` (debug), `--quiet` (error+warn only), `--silent` (error only)
-- Environment variable override: `NAX_LOG_LEVEL=debug`
-**REQ-2: Structured Log Format**
-- Each log entry includes: `timestamp`, `level`, `stage`, `storyId`, `message`
-- Console output: human-readable with chalk (current style, but level-gated)
-- File output: JSON Lines (`.jsonl`) for machine parsing
-- File location: `nax/features/<name>/runs/<run-id>.jsonl`
-**REQ-3: Stage Lifecycle Events**
-- Emit structured events at each stage transition:
-  - `run.start` — feature name, story count, config
-  - `iteration.start` — iteration number, story id, complexity
-  - `context.built` — file count, token estimate
-  - `agent.start` — model, prompt size (chars/tokens), TDD strategy
-  - `agent.complete` — exit code, duration, stdout size, cost estimate
-  - `test.start` — test command
-  - `test.complete` — pass/fail, test count, duration
-  - `verification.start` — verification strategy
-  - `verification.complete` — pass/fail, issues found
-  - `story.complete` — story id, status, attempts, duration, cost
-  - `iteration.complete` — stories done this iteration
-  - `run.complete` — total stories, passed, failed, cost, duration
-**REQ-4: Per-Story Metrics**
-- Track and report per story: duration, API cost, token count (in/out), attempts, test count
-- Include in `prd.json` story metadata after run completes
-- Summary table at end of run (visible at `info` level)
-**REQ-5: Debug Mode**
-- `--debug` or `NAX_LOG_LEVEL=debug` enables:
-  - Full prompt text logged to file (not console)
-  - Full agent response logged to file
-  - Claude CLI command logged
-  - Environment variables passed to agent (sanitized — mask tokens)
-**REQ-6: Run History**
-- Each `nax run` creates a unique run ID (ISO timestamp or UUID)
-- Log file persisted at `nax/features/<name>/runs/<run-id>.jsonl`
-- Latest run symlinked as `nax/features/<name>/runs/latest.jsonl`
-- `nax runs list -f <feature>` lists past runs with summary
-- `nax runs show <run-id> -f <feature>` shows detailed run report
-**REQ-7: Logger API (Internal)**
-- Singleton logger instance, configured once at CLI entry
-- API: `logger.info(stage, storyId, message, data?)`, `logger.debug(...)`, etc.
-- Replace all `console.log` calls with logger calls
-- Logger writes to both console (filtered by level) and file (all levels)
-## Acceptance Criteria
-**AC-1:** `nax run -f foo --headless` shows same output as today at `info` level
-**AC-2:** `nax run -f foo --verbose` shows agent timing, token counts, and prompt sizes
-**AC-3:** `nax run -f foo --quiet` shows only warnings, errors, and final summary
-**AC-4:** After a run, `nax/features/foo/runs/latest.jsonl` contains structured events
-**AC-5:** Each JSONL line is valid JSON with `timestamp`, `level`, `stage`, `storyId` fields
-**AC-6:** `nax runs list -f foo` shows past runs with date, stories, cost, status
-**AC-7:** Per-story metrics (duration, cost, attempts) appear in the run summary table
-**AC-8:** Debug mode logs full prompts to file without printing to console
-**AC-9:** No `console.log` calls remain in src/ (all replaced with logger)
-## Technical Notes
-### Logger Implementation
-```typescript
-// src/logger.ts
-export type LogLevel = "error" | "warn" | "info" | "debug";
-export interface LogEntry {
-  timestamp: string;
-  level: LogLevel;
-  stage: string;
-  storyId?: string;
-  message: string;
-  data?: Record<string, unknown>;
-}
-export declare class Logger { // declaration only: method bodies are omitted in this spec
-  constructor(options: { level: LogLevel; filePath?: string });
-  error(stage: string, message: string, data?: Record<string, unknown>): void;
-  warn(stage: string, message: string, data?: Record<string, unknown>): void;
-  info(stage: string, message: string, data?: Record<string, unknown>): void;
-  debug(stage: string, message: string, data?: Record<string, unknown>): void;
-  withStory(storyId: string): StoryLogger; // scoped logger
-}
-```
-### Migration Strategy
-1. Create `src/logger.ts` with Logger class
-2. Add `--verbose`, `--quiet`, `--silent` flags to `bin/nax.ts`
-3. Replace `console.log` calls one module at a time
-4. Add stage events to orchestrator loop
-5. Add run history commands
-### Dependencies
-- None (use Bun built-in fs for file writing)
-- chalk remains for console formatting
-### File Structure
-```
-nax/features/<name>/runs/
-├── 2026-02-19T10-30-00Z.jsonl
-├── 2026-02-20T14-15-00Z.jsonl
-└── latest.jsonl -> 2026-02-20T14-15-00Z.jsonl
-```
-## Out of Scope
-- Remote log shipping (Datadog, Sentry, etc.)
-- Log rotation or cleanup policies
-- Real-time log streaming via WebSocket
-- Custom log formatters or plugins
-- Metrics dashboard or visualization
----
-*Spec created 2026-02-19 for nax v0.8*
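
REQ-1 and REQ-2 of the removed logging spec (four levels, CLI flags, `NAX_LOG_LEVEL`, JSON Lines output) can be sketched as three small pure functions. This is a sketch under stated assumptions, not the shipped logger: the helper names `resolveLevel`, `shouldLog`, and `toJsonLine` are hypothetical, and the spec does not say whether the environment variable or the CLI flags take precedence, so the env-first order here is an assumption.

```typescript
type LogLevel = "error" | "warn" | "info" | "debug";

// Lower rank = more severe. An entry is emitted when its rank <= the threshold's rank.
const RANK: Record<LogLevel, number> = { error: 0, warn: 1, info: 2, debug: 3 };

interface LogEntry {
  timestamp: string;
  level: LogLevel;
  stage: string;
  storyId?: string;
  message: string;
  data?: Record<string, unknown>;
}

function isLogLevel(value: unknown): value is LogLevel {
  return value === "error" || value === "warn" || value === "info" || value === "debug";
}

// Maps CLI flags and the NAX_LOG_LEVEL value to a threshold level.
function resolveLevel(
  flags: { verbose?: boolean; quiet?: boolean; silent?: boolean },
  envLevel?: string,
): LogLevel {
  if (isLogLevel(envLevel)) return envLevel; // assumed precedence: env wins over flags
  if (flags.silent) return "error"; // --silent: error only
  if (flags.quiet) return "warn"; // --quiet: error + warn
  if (flags.verbose) return "debug"; // --verbose: everything
  return "info"; // default level from REQ-1
}

// Console gating: file output would skip this check and record all levels (REQ-7).
function shouldLog(level: LogLevel, threshold: LogLevel): boolean {
  return RANK[level] <= RANK[threshold];
}

// One JSON object per line, as the .jsonl format in REQ-2 requires.
function toJsonLine(entry: LogEntry): string {
  return JSON.stringify(entry) + "\n";
}
```

A caller would compute the threshold once at CLI entry, for example `resolveLevel({ quiet: true }, envValue)`, and pass each entry through `shouldLog` before printing while always appending `toJsonLine(entry)` to the run file.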

package/docs/v0.9.3-prompt-audit.md DELETED

@@ -1,112 +0,0 @@
-# v0.9.3 — Prompt Audit & Context Isolation
-**Status:** Draft
-**Author:** Subrina
-**Date:** 2026-02-23
-**Base:** v0.9.2
-## Overview
-Add tooling to inspect and verify story-scoped prompt isolation. Ensures each story's agent prompt contains only context relevant to that story — no cross-story leakage.
-## Motivation
-- No way to inspect what prompts agents actually receive without running a full `nax run`
-- `generateTestCoverageSummary` scans the entire repo's test files, leaking context from other stories into unrelated story prompts
-- No automated test verifying that `buildContext()` properly isolates per-story context
-- Prompt inspection is critical for debugging routing, context, and the upcoming v0.10 prompt optimizer
-## Deliverables
-### 1. `nax prompts` CLI Command
-New subcommand that assembles prompts for all stories in a feature without executing agents.
-```bash
-# Dump all story prompts to stdout
-nax prompts -f core
-# Write to directory
-nax prompts -f core --out ./prompt-dump/
-# Single story
-nax prompts -f core --story US-003
-```
-**Pipeline stages executed:** routing → context → constitution → prompt (stops before execution).
-**Output per story:**
-```
-prompt-dump/
-  US-001.prompt.md    # Final assembled prompt
-  US-001.context.md   # Context markdown only (for isolation audit)
-  US-002.prompt.md
-  US-002.context.md
-  ...
-```
-Each file includes a YAML frontmatter header:
-```yaml
----
-storyId: US-003
-title: "Create health indicator interface"
-testStrategy: test-after
-modelTier: balanced
-contextTokens: 2450
-promptTokens: 3800
-dependencies: [US-001]
-contextElements:
-  - { type: progress, tokens: 45 }
-  - { type: story, storyId: US-003, tokens: 890 }
-  - { type: dependency, storyId: US-001, tokens: 720 }
-  - { type: test-coverage, tokens: 795 }
----
-```
-**For three-session-tdd stories:** outputs `US-001.test-writer.md`, `US-001.implementer.md`, `US-001.verifier.md`.
-### 2. `buildContext` Isolation Unit Tests
-Test that `buildContext()` for a given story only includes:
-- The current story
-- Declared dependency stories
-- Progress summary (counts only, no story details)
-- Test coverage (to be scoped — see #3)
-- Error context from current story only
-- File context from current story's `contextFiles` only
-**Negative assertions:**
-- No story IDs from non-dependency stories appear in output
-- No acceptance criteria from unrelated stories leak through
-- Progress summary contains only aggregate counts, not story titles
-### 3. Scoped Test Coverage Scanner
-Fix `generateTestCoverageSummary` to scope results to story-relevant files:
-**Current behavior:** Scans all test files in `testDir` → agent sees coverage from every story.
-**New behavior:**
-1. If story has `contextFiles` → derive test file patterns from source paths (e.g., `src/health.service.ts` → `test/health.service.spec.ts`)
-2. If no `contextFiles` → fall back to full scan (current behavior) with a warning logged
-3. Add `context.testCoverage.scopeToStory` config option (default: `true`)
-## User Stories
-| # | Title | Complexity | Test Strategy | Dependencies |
-|:--|:------|:-----------|:--------------|:-------------|
-| US-001 | `nax prompts` CLI command with file output | medium | test-after | — |
-| US-002 | `buildContext` isolation unit tests | simple | test-after | — |
-| US-003 | Scope test coverage scanner to story-relevant files | medium | test-after | — |
-## Non-Goals
-- No changes to prompt assembly logic (that's v0.10 optimizer territory)
-- No `--optimized` flag yet (depends on v0.10)
-- No changes to TDD orchestrator prompt builders (just audit them)
-## Compatibility
-- `nax prompts` is additive — new CLI command, no existing behavior changed
-- Test coverage scoping is behind config flag with backward-compatible default
-- No breaking changes