npm - @nathapp/nax - Versions diffs - 0.27.1 → 0.29.0 - Mend

@nathapp/nax 0.27.1 → 0.29.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (383) hide show

package/CHANGELOG.md +13 -2
package/dist/nax.js +72691 -0
package/package.json +12 -4
package/src/cli/config.ts +42 -1
package/src/cli/prompts.ts +18 -6
package/src/config/defaults.ts +2 -0
package/src/config/schemas.ts +11 -0
package/src/config/types.ts +8 -0
package/src/context/builder.ts +10 -1
package/src/pipeline/stages/execution.ts +5 -0
package/src/pipeline/stages/prompt.ts +13 -4
package/src/precheck/checks-warnings.ts +37 -0
package/src/precheck/checks.ts +1 -0
package/src/precheck/index.ts +14 -7
package/src/prompts/builder.ts +178 -0
package/src/prompts/index.ts +2 -0
package/src/prompts/loader.ts +43 -0
package/src/prompts/sections/conventions.ts +15 -0
package/src/prompts/sections/index.ts +11 -0
package/src/prompts/sections/isolation.ts +24 -0
package/src/prompts/sections/role-task.ts +34 -0
package/src/prompts/sections/story.ts +13 -0
package/src/prompts/sections/verdict.ts +70 -0
package/src/prompts/templates/implementer.ts +6 -0
package/src/prompts/templates/single-session.ts +6 -0
package/src/prompts/templates/test-writer.ts +6 -0
package/src/prompts/templates/verifier.ts +6 -0
package/src/prompts/types.ts +21 -0
package/src/review/runner.ts +6 -1
package/src/tdd/session-runner.ts +12 -12
package/src/version.ts +2 -1
package/.claude/rules/01-project-conventions.md +0 -34
package/.claude/rules/02-test-architecture.md +0 -39
package/.claude/rules/03-test-writing.md +0 -58
package/.claude/rules/04-forbidden-patterns.md +0 -29
package/.claude/settings.json +0 -15
package/.githooks/pre-commit +0 -16
package/.gitlab-ci.yml +0 -103
package/.mcp.json +0 -8
package/BRIEF.md +0 -140
package/CLAUDE.md +0 -143
package/US-007-IMPLEMENTATION.md +0 -139
package/biome.json +0 -14
package/bun.lock +0 -163
package/bunfig.toml +0 -12
package/docker-compose.test.yml +0 -15
package/docs/20260216-fix-plan-context-review.md +0 -56
package/docs/20260216-relentless-vs-ngent-comparison.md +0 -208
package/docs/20260216-v02-plan.md +0 -136
package/docs/20260216-v02-review.md +0 -685
package/docs/20260217-dogfood-findings.md +0 -56
package/docs/20260217-p2-plus-plan.md +0 -117
package/docs/20260217-partial-fixes-plan.md +0 -62
package/docs/20260217-plan-analyze-spec.md +0 -117
package/docs/20260217-post-impl-review.md +0 -1137
package/docs/20260217-quick-wins-plan.md +0 -66
package/docs/20260217-split-runner-plan.md +0 -75
package/docs/20260217-v03-impl-plan.md +0 -80
package/docs/20260217-v03-post-impl-review.md +0 -589
package/docs/20260217-v04-impl-plan.md +0 -86
package/docs/20260217-v05-post-impl-review.md +0 -850
package/docs/20260217-v06-post-impl-review.md +0 -817
package/docs/20260218-adr003-port-plan.md +0 -151
package/docs/20260218-review-adr003-verification.md +0 -175
package/docs/20260219-fix-plan-bug16-19.md +0 -79
package/docs/20260219-fix-plan-bug20-22.md +0 -114
package/docs/20260219-plan-llm-routing.md +0 -116
package/docs/20260219-review-bug20-22-fixes.md +0 -135
package/docs/20260219-routing-baseline-keyword.md +0 -63
package/docs/20260220-plan-structured-logging-p1.md +0 -80
package/docs/20260220-plan-structured-logging-p2.md +0 -37
package/docs/20260220-review-llm-routing.md +0 -180
package/docs/20260220-review-post-fix-llm-routing.md +0 -70
package/docs/20260221-fix-plan-relevantfiles-split.md +0 -101
package/docs/20260221-fix-plan-routing-mode.md +0 -125
package/docs/20260221-review-v0.9-implementation.md +0 -379
package/docs/20260222-fix-plan-v091-routing-isolation.md +0 -197
package/docs/20260223-fix-plan-prompt-audit.md +0 -62
package/docs/20260224-nax-roadmap-phases.md +0 -189
package/docs/20260225-phase2-llm-service-layer.md +0 -401
package/docs/20260225-review-v0.10.1.md +0 -187
package/docs/20260303-v010-implementation-plan.md +0 -165
package/docs/20260304-review-nax.md +0 -492
package/docs/CLAUDE.md.bak +0 -191
package/docs/ROADMAP.md +0 -364
package/docs/SPEC-rectification.md +0 -0
package/docs/SPEC.md +0 -324
package/docs/US-001-plugin-loading-verification.md +0 -152
package/docs/adr/ADR-005-implementation-plan.md +0 -655
package/docs/adr/ADR-005-pipeline-re-architecture.md +0 -464
package/docs/architecture-analysis.md +0 -1076
package/docs/bugs/BUG-21-escalation-null-attempts.md +0 -48
package/docs/bugs-from-dogfood-run-c.md +0 -243
package/docs/code-review-20260228.md +0 -612
package/docs/code-review-v0.15.0.md +0 -629
package/docs/hook-lifecycle-test-plan.md +0 -149
package/docs/releases/v0.11.0-and-earlier.md +0 -20
package/docs/releases/v0.12.0.md +0 -15
package/docs/releases/v0.13.0.md +0 -14
package/docs/releases/v0.14.0.md +0 -20
package/docs/releases/v0.14.1.md +0 -36
package/docs/releases/v0.14.2.md +0 -51
package/docs/releases/v0.14.3.md +0 -174
package/docs/releases/v0.14.4.md +0 -94
package/docs/releases/v0.15.0.md +0 -502
package/docs/releases/v0.15.1.md +0 -170
package/docs/releases/v0.15.3.md +0 -193
package/docs/specs/bug-039-orphan-processes.md +0 -131
package/docs/specs/bug-040-review-rectification.md +0 -82
package/docs/specs/bug-041-cross-story-test-isolation.md +0 -88
package/docs/specs/bug-042-verifier-failure-capture.md +0 -117
package/docs/specs/bun-pty-migration.md +0 -171
package/docs/specs/central-run-registry.md +0 -116
package/docs/specs/feat-010-smart-runner-git-history.md +0 -96
package/docs/specs/feat-011-file-context-strategy.md +0 -73
package/docs/specs/feat-012-tdd-writer-tier.md +0 -79
package/docs/specs/feat-013-test-after-review.md +0 -89
package/docs/specs/feat-014-heartbeat-observability.md +0 -127
package/docs/specs/status-file-consolidation.md +0 -93
package/docs/specs/status-file-v0.10.1.md +0 -812
package/docs/specs/trigger-completion.md +0 -145
package/docs/specs/verification-architecture-v2.md +0 -343
package/docs/tdd/strategies.md +0 -97
package/docs/v0.10-global-config.md +0 -206
package/docs/v0.10-plugin-system.md +0 -415
package/docs/v0.10-prompt-optimizer.md +0 -234
package/docs/v0.3-spec.md +0 -244
package/docs/v0.4-spec.md +0 -140
package/docs/v0.5-spec.md +0 -237
package/docs/v0.6-spec.md +0 -371
package/docs/v0.7-spec.md +0 -177
package/docs/v0.8-llm-routing.md +0 -206
package/docs/v0.8-structured-logging.md +0 -132
package/docs/v0.9.3-prompt-audit.md +0 -112
package/examples/plugins/console-reporter/index.test.ts +0 -207
package/examples/plugins/console-reporter/index.ts +0 -110
package/memory/topic/feat-010-baseref.md +0 -28
package/memory/topic/feat-013-test-after-deprecation.md +0 -22
package/nax/config.json +0 -154
package/nax/features/bug-039-medium/prd.json +0 -45
package/nax/features/bugfix-v0171/prd.json +0 -52
package/nax/features/central-run-registry/prd.json +0 -105
package/nax/features/config-management/prd.json +0 -108
package/nax/features/config-management/progress.txt +0 -5
package/nax/features/diagnose/acceptance.test.ts +0 -414
package/nax/features/diagnose/prd.json +0 -41
package/nax/features/nax-compliance/prd.json +0 -52
package/nax/features/nax-compliance/progress.txt +0 -1
package/nax/features/orchestration-fixes/prd.json +0 -89
package/nax/features/orchestration-fixes/progress.txt +0 -1
package/nax/features/plugin-integration/US-007-VERIFICATION.md +0 -259
package/nax/features/plugin-integration/prd.json +0 -208
package/nax/features/plugin-integration/progress.txt +0 -5
package/nax/features/post-rearch-bugfix/prd.json +0 -137
package/nax/features/precheck/prd.json +0 -205
package/nax/features/precheck/progress.txt +0 -15
package/nax/features/review-quality/prd.json +0 -55
package/nax/features/routing-persistence/prd.json +0 -104
package/nax/features/routing-persistence/progress.txt +0 -1
package/nax/features/smart-test-runner/plan.md +0 -7
package/nax/features/smart-test-runner/prd.json +0 -203
package/nax/features/smart-test-runner/progress.txt +0 -13
package/nax/features/smart-test-runner/spec.md +0 -7
package/nax/features/smart-test-runner/tasks.md +0 -8
package/nax/features/status-file-consolidation/prd.json +0 -106
package/nax/features/structured-logging/prd.json +0 -199
package/nax/features/trigger-completion/prd.json +0 -150
package/nax/features/trigger-completion/progress.txt +0 -7
package/nax/features/unlock/prd.json +0 -36
package/nax/features/v0.18.3-execution-reliability/prd.json +0 -80
package/nax/features/v0.18.3-execution-reliability/progress.txt +0 -3
package/nax/features/v0.19.0-hardening/plan.md +0 -7
package/nax/features/v0.19.0-hardening/prd.json +0 -84
package/nax/features/v0.19.0-hardening/progress.txt +0 -7
package/nax/features/v0.19.0-hardening/spec.md +0 -18
package/nax/features/v0.19.0-hardening/tasks.md +0 -8
package/nax/features/verify-v2/prd.json +0 -79
package/nax/features/verify-v2/progress.txt +0 -3
package/nax/status.json +0 -36
package/test/COVERAGE-GAPS.md +0 -333
package/test/e2e/cm-003-default-view.test.ts +0 -195
package/test/e2e/plan-analyze-run.test.ts +0 -902
package/test/helpers/helpers.test.ts +0 -295
package/test/helpers/timeout.ts +0 -42
package/test/integration/US-002-TEST-SUMMARY.md +0 -107
package/test/integration/US-003-TEST-SUMMARY.md +0 -149
package/test/integration/US-004-TEST-SUMMARY.md +0 -106
package/test/integration/US-005-TEST-SUMMARY.md +0 -138
package/test/integration/US-007-TEST-SUMMARY.md +0 -100
package/test/integration/cli/agent-validation.test.ts +0 -439
package/test/integration/cli/cli-config-default-edge-cases.test.ts +0 -223
package/test/integration/cli/cli-config-default-view.test.ts +0 -230
package/test/integration/cli/cli-config-diff.test.ts +0 -461
package/test/integration/cli/cli-config.test.ts +0 -737
package/test/integration/cli/cli-diagnose.test.ts +0 -595
package/test/integration/cli/cli-logs.test.ts +0 -346
package/test/integration/cli/cli-plugins.test.ts +0 -679
package/test/integration/cli/cli-precheck.test.ts +0 -372
package/test/integration/cli/cli-run-headless.test.ts +0 -174
package/test/integration/cli/cli.test.ts +0 -76
package/test/integration/cli/precheck-integration.test.ts +0 -476
package/test/integration/cli/precheck-orchestrator.test.ts +0 -247
package/test/integration/cli/precheck.test.ts +0 -806
package/test/integration/config/config-loader.test.ts +0 -266
package/test/integration/config/config.test.ts +0 -444
package/test/integration/config/merger.test.ts +0 -466
package/test/integration/config/paths.test.ts +0 -52
package/test/integration/config/security-loader.test.ts +0 -83
package/test/integration/context/context-integration.test.ts +0 -703
package/test/integration/context/context-path-security.test.ts +0 -173
package/test/integration/context/context-provider-injection.test.ts +0 -507
package/test/integration/context/context-verification-integration.test.ts +0 -296
package/test/integration/context/s5-greenfield-fallback.test.ts +0 -298
package/test/integration/execution/execution-isolation.test.ts +0 -143
package/test/integration/execution/execution.test.ts +0 -634
package/test/integration/execution/feature-status-write.test.ts +0 -302
package/test/integration/execution/parallel.test.ts +0 -251
package/test/integration/execution/prd-pause.test.ts +0 -205
package/test/integration/execution/prd-resolvers.test.ts +0 -186
package/test/integration/execution/progress.test.ts +0 -34
package/test/integration/execution/runner-batching.test.ts +0 -682
package/test/integration/execution/runner-config-plugins.test.ts +0 -462
package/test/integration/execution/runner-escalation.test.ts +0 -561
package/test/integration/execution/runner-fixes.test.ts +0 -400
package/test/integration/execution/runner-plugin-integration.test.ts +0 -544
package/test/integration/execution/runner-queue-and-attempts.test.ts +0 -476
package/test/integration/execution/status-file-integration.test.ts +0 -289
package/test/integration/execution/status-file.test.ts +0 -380
package/test/integration/execution/status-writer.test.ts +0 -447
package/test/integration/execution/story-id-in-events.test.ts +0 -274
package/test/integration/interaction/interaction-chain-pipeline.test.ts +0 -476
package/test/integration/pipeline/hooks.test.ts +0 -363
package/test/integration/pipeline/pipeline-acceptance.test.ts +0 -303
package/test/integration/pipeline/pipeline-events.test.ts +0 -476
package/test/integration/pipeline/pipeline.test.ts +0 -660
package/test/integration/pipeline/reporter-lifecycle.test.ts +0 -862
package/test/integration/pipeline/verify-stage.test.ts +0 -286
package/test/integration/plan/analyze-integration.test.ts +0 -262
package/test/integration/plan/analyze-scanner.test.ts +0 -132
package/test/integration/plan/logger.test.ts +0 -461
package/test/integration/plan/plan.test.ts +0 -157
package/test/integration/plugins/config-integration.test.ts +0 -173
package/test/integration/plugins/config-resolution.test.ts +0 -523
package/test/integration/plugins/loader.test.ts +0 -644
package/test/integration/plugins/plugins-registry.test.ts +0 -747
package/test/integration/plugins/validator.test.ts +0 -564
package/test/integration/review/review-config-commands.test.ts +0 -320
package/test/integration/review/review-config-schema.test.ts +0 -117
package/test/integration/review/review-plugin-integration.test.ts +0 -729
package/test/integration/review/review.test.ts +0 -150
package/test/integration/routing/plugin-routing-advanced.test.ts +0 -461
package/test/integration/routing/plugin-routing-core.test.ts +0 -527
package/test/integration/routing/routing-stage-bug-021.test.ts +0 -275
package/test/integration/routing/routing-stage-greenfield.test.ts +0 -287
package/test/integration/tdd/tdd-cleanup.test.ts +0 -246
package/test/integration/tdd/tdd-orchestrator-core.test.ts +0 -565
package/test/integration/tdd/tdd-orchestrator-failureCategory.test.ts +0 -355
package/test/integration/tdd/tdd-orchestrator-fallback.test.ts +0 -311
package/test/integration/tdd/tdd-orchestrator-lite.test.ts +0 -289
package/test/integration/tdd/tdd-orchestrator-prompts.test.ts +0 -260
package/test/integration/tdd/tdd-orchestrator-verdict.test.ts +0 -536
package/test/integration/tmp/headless-test/test.jsonl +0 -30
package/test/integration/verification/test-scanner.test.ts +0 -403
package/test/integration/verification/verification-asset-check.test.ts +0 -143
package/test/integration/worktree/manager.test.ts +0 -218
package/test/integration/worktree/worktree-merge.test.ts +0 -341
package/test/manual/logging-formatter-demo.ts +0 -158
package/test/ui/tui-agent-panel.test.tsx +0 -99
package/test/ui/tui-pty-integration.test.tsx +0 -146
package/test/unit/acceptance.test.ts +0 -187
package/test/unit/agent-stderr-capture.test.ts +0 -147
package/test/unit/agents/claude.test.ts +0 -107
package/test/unit/analyze-classifier.test.ts +0 -216
package/test/unit/analyze.test.ts +0 -224
package/test/unit/auto-detect.test.ts +0 -250
package/test/unit/cli-status-project-level.test.ts +0 -283
package/test/unit/cli-status.test.ts +0 -418
package/test/unit/commands/common.test.ts +0 -321
package/test/unit/commands/logs.test.ts +0 -458
package/test/unit/commands/runs.test.ts +0 -303
package/test/unit/commands/unlock.test.ts +0 -320
package/test/unit/config/defaults.test.ts +0 -70
package/test/unit/config/quality-commands-schema.test.ts +0 -72
package/test/unit/config/regression-gate-schema.test.ts +0 -160
package/test/unit/config/smart-runner-flag.test.ts +0 -250
package/test/unit/constitution-generators.test.ts +0 -161
package/test/unit/constitution.test.ts +0 -210
package/test/unit/context/context-autodetect.test.ts +0 -297
package/test/unit/context/context-build.test.ts +0 -575
package/test/unit/context/context-coverage.test.ts +0 -236
package/test/unit/context/context-error.test.ts +0 -93
package/test/unit/context/context-estimate-tokens.test.ts +0 -201
package/test/unit/context/context-format.test.ts +0 -302
package/test/unit/context/context-isolation.test.ts +0 -267
package/test/unit/context/context-sort.test.ts +0 -93
package/test/unit/context/context-story.test.ts +0 -108
package/test/unit/context/prior-failures.test.ts +0 -463
package/test/unit/context.test.ts +0 -1726
package/test/unit/cost.test.ts +0 -231
package/test/unit/crash-recovery.test.ts +0 -309
package/test/unit/escalation.test.ts +0 -127
package/test/unit/execution/lifecycle/run-completion.test.ts +0 -240
package/test/unit/execution/lifecycle/run-regression.test.ts +0 -420
package/test/unit/execution/pid-registry.test.ts +0 -241
package/test/unit/execution/sequential-executor.test.ts +0 -235
package/test/unit/execution/sfc-004-dead-code-cleanup.test.ts +0 -89
package/test/unit/execution/structured-failure.test.ts +0 -415
package/test/unit/execution-logging-stderr.test.ts +0 -157
package/test/unit/execution-stage.test.ts +0 -123
package/test/unit/fix-generator.test.ts +0 -276
package/test/unit/formatters.test.ts +0 -468
package/test/unit/greenfield.test.ts +0 -180
package/test/unit/hooks/shell-security.test.ts +0 -40
package/test/unit/interaction/auto-plugin.test.ts +0 -162
package/test/unit/interaction/human-review-trigger.test.ts +0 -165
package/test/unit/interaction-network-failures.test.ts +0 -390
package/test/unit/interaction-plugins.test.ts +0 -472
package/test/unit/logging/formatter.test.ts +0 -456
package/test/unit/merge.test.ts +0 -269
package/test/unit/metrics/aggregator.test.ts +0 -164
package/test/unit/metrics/tracker.test.ts +0 -186
package/test/unit/metrics.test.ts +0 -276
package/test/unit/optimizer/noop.optimizer.test.ts +0 -125
package/test/unit/optimizer/rule-based.optimizer.test.ts +0 -358
package/test/unit/pipeline/event-bus.test.ts +0 -105
package/test/unit/pipeline/routing-partial-override.test.ts +0 -121
package/test/unit/pipeline/runner-retry.test.ts +0 -89
package/test/unit/pipeline/stages/autofix.test.ts +0 -97
package/test/unit/pipeline/stages/completion-review-gate.test.ts +0 -218
package/test/unit/pipeline/stages/execution-ambiguity.test.ts +0 -311
package/test/unit/pipeline/stages/execution-merge-conflict.test.ts +0 -218
package/test/unit/pipeline/stages/rectify.test.ts +0 -101
package/test/unit/pipeline/stages/regression-stage.test.ts +0 -69
package/test/unit/pipeline/stages/review.test.ts +0 -201
package/test/unit/pipeline/stages/routing-idempotence.test.ts +0 -139
package/test/unit/pipeline/stages/routing-initial-complexity.test.ts +0 -321
package/test/unit/pipeline/stages/routing-persistence.test.ts +0 -380
package/test/unit/pipeline/stages/verify.test.ts +0 -267
package/test/unit/pipeline/subscribers/events-writer.test.ts +0 -227
package/test/unit/pipeline/subscribers/hooks.test.ts +0 -84
package/test/unit/pipeline/subscribers/interaction.test.ts +0 -313
package/test/unit/pipeline/subscribers/registry.test.ts +0 -149
package/test/unit/pipeline/subscribers/reporters.test.ts +0 -90
package/test/unit/pipeline/verify-smart-runner.test.ts +0 -345
package/test/unit/prd-auto-default.test.ts +0 -291
package/test/unit/prd-failure-category.test.ts +0 -177
package/test/unit/prd-get-next-story.test.ts +0 -215
package/test/unit/precheck-checks.test.ts +0 -841
package/test/unit/precheck-story-size-gate.test.ts +0 -288
package/test/unit/precheck-types.test.ts +0 -143
package/test/unit/prompts.test.ts +0 -476
package/test/unit/queue.test.ts +0 -237
package/test/unit/rectification.test.ts +0 -285
package/test/unit/registry.test.ts +0 -288
package/test/unit/review/runner.test.ts +0 -117
package/test/unit/routing/content-hash.test.ts +0 -99
package/test/unit/routing/routing-stability.test.ts +0 -208
package/test/unit/routing/strategies/llm.test.ts +0 -306
package/test/unit/routing-advanced.test.ts +0 -313
package/test/unit/routing-core.test.ts +0 -341
package/test/unit/routing-strategies.test.ts +0 -440
package/test/unit/storyid-events.test.ts +0 -213
package/test/unit/tdd-verdict.test.ts +0 -492
package/test/unit/test-output-parser.test.ts +0 -377
package/test/unit/ui/tui-controls.test.ts +0 -335
package/test/unit/ui/tui-cost-and-pty.test.ts +0 -190
package/test/unit/ui/tui-layout.test.ts +0 -379
package/test/unit/ui/tui-stories.test.ts +0 -333
package/test/unit/unit-isolation.test.ts +0 -135
package/test/unit/utils/git.test.ts +0 -50
package/test/unit/utils/path-security.test.ts +0 -47
package/test/unit/utils-helpers.test.ts +0 -318
package/test/unit/verdict.test.ts +0 -325
package/test/unit/verification/orchestrator-types.test.ts +0 -54
package/test/unit/verification/orchestrator.test.ts +0 -66
package/test/unit/verification/smart-runner-config.test.ts +0 -163
package/test/unit/verification/smart-runner-discovery.test.ts +0 -354
package/test/unit/verification/smart-runner.test.ts +0 -262
package/test/unit/verification/strategies/acceptance.test.ts +0 -33
package/test/unit/verification/strategies/regression.test.ts +0 -87
package/test/unit/verification/strategies/scoped.test.ts +0 -100
package/test/unit/worktree-manager.test.ts +0 -159
package/tsconfig.json +0 -27

package/test/helpers/helpers.test.ts DELETED Viewed

@@ -1,295 +0,0 @@
-import { afterEach, beforeEach, describe, expect, test } from "bun:test";
-import { mkdirSync, rmSync } from "node:fs";
-import path from "node:path";
-import { spawn } from "bun";
-import { acquireLock, formatProgress, releaseLock } from "../../src/execution/helpers";
-import type { StoryCounts } from "../../src/execution/helpers";
-describe("formatProgress", () => {
-  test("formats progress with all stories pending", () => {
-    const counts: StoryCounts = {
-      total: 12,
-      passed: 0,
-      failed: 0,
-      pending: 12,
-    };
-    const progress = formatProgress(counts, 0, 5.0, 0, 12);
-    expect(progress).toContain("0/12 stories");
-    expect(progress).toContain("0 passed");
-    expect(progress).toContain("0 failed");
-    expect(progress).toContain("$0.00/$5.00");
-    expect(progress).toContain("calculating...");
-  });
-  test("formats progress with some stories completed", () => {
-    const counts: StoryCounts = {
-      total: 12,
-      passed: 5,
-      failed: 1,
-      pending: 6,
-    };
-    // 10 minutes elapsed (600000 ms), 6 stories completed
-    // avg = 600000 / 6 = 100000 ms per story
-    // remaining = 6 stories * 100000 = 600000 ms = 10 minutes
-    const progress = formatProgress(counts, 0.45, 5.0, 600000, 12);
-    expect(progress).toContain("6/12 stories");
-    expect(progress).toContain("5 passed");
-    expect(progress).toContain("1 failed");
-    expect(progress).toContain("$0.45/$5.00");
-    expect(progress).toContain("~10 min remaining");
-  });
-  test("formats progress when all stories are complete", () => {
-    const counts: StoryCounts = {
-      total: 12,
-      passed: 10,
-      failed: 2,
-      pending: 0,
-    };
-    const progress = formatProgress(counts, 1.23, 5.0, 1200000, 12);
-    expect(progress).toContain("12/12 stories");
-    expect(progress).toContain("10 passed");
-    expect(progress).toContain("2 failed");
-    expect(progress).toContain("$1.23/$5.00");
-    expect(progress).toContain("complete");
-  });
-  test("calculates ETA correctly for fast stories", () => {
-    const counts: StoryCounts = {
-      total: 20,
-      passed: 10,
-      failed: 0,
-      pending: 10,
-    };
-    // 2 minutes elapsed (120000 ms) for 10 stories
-    // avg = 120000 / 10 = 12000 ms per story
-    // remaining = 10 stories * 12000 = 120000 ms = 2 minutes
-    const progress = formatProgress(counts, 0.5, 10.0, 120000, 20);
-    expect(progress).toContain("~2 min remaining");
-  });
-  test("rounds ETA to nearest minute", () => {
-    const counts: StoryCounts = {
-      total: 10,
-      passed: 3,
-      failed: 0,
-      pending: 7,
-    };
-    // 8.5 minutes elapsed (510000 ms) for 3 stories
-    // avg = 510000 / 3 = 170000 ms per story
-    // remaining = 7 stories * 170000 = 1190000 ms ≈ 19.8 minutes → rounds to 20
-    const progress = formatProgress(counts, 0.3, 5.0, 510000, 10);
-    expect(progress).toContain("~20 min remaining");
-  });
-  test("includes cost information with proper formatting", () => {
-    const counts: StoryCounts = {
-      total: 5,
-      passed: 2,
-      failed: 0,
-      pending: 3,
-    };
-    const progress = formatProgress(counts, 1.2345, 10.0, 300000, 5);
-    // Should round cost to 2 decimal places
-    expect(progress).toContain("$1.23/$10.00");
-  });
-  test("handles zero elapsed time gracefully", () => {
-    const counts: StoryCounts = {
-      total: 10,
-      passed: 0,
-      failed: 0,
-      pending: 10,
-    };
-    const progress = formatProgress(counts, 0, 5.0, 0, 10);
-    expect(progress).toContain("calculating...");
-    expect(progress).not.toContain("NaN");
-    expect(progress).not.toContain("Infinity");
-  });
-  test("includes all required progress indicators", () => {
-    const counts: StoryCounts = {
-      total: 10,
-      passed: 3,
-      failed: 1,
-      pending: 6,
-    };
-    const progress = formatProgress(counts, 0.5, 5.0, 300000, 10);
-    expect(progress).toContain("Progress:");
-    expect(progress).toContain("passed");
-    expect(progress).toContain("failed");
-    expect(progress).toContain("$");
-    expect(progress).toContain("min remaining");
-  });
-});
-describe("acquireLock and releaseLock", () => {
-  const testDir = path.join(import.meta.dir, ".test-locks");
-  const lockPath = path.join(testDir, "nax.lock");
-  beforeEach(() => {
-    // Create clean test directory
-    rmSync(testDir, { recursive: true, force: true });
-    mkdirSync(testDir, { recursive: true });
-  });
-  afterEach(() => {
-    // Clean up test directory
-    rmSync(testDir, { recursive: true, force: true });
-  });
-  test("acquires lock when no lock file exists", async () => {
-    const acquired = await acquireLock(testDir);
-    expect(acquired).toBe(true);
-    // Verify lock file was created
-    const lockFile = Bun.file(lockPath);
-    expect(await lockFile.exists()).toBe(true);
-    // Verify lock file contains current PID
-    const lockContent = await lockFile.text();
-    const lockData = JSON.parse(lockContent);
-    expect(lockData.pid).toBe(process.pid);
-    expect(typeof lockData.timestamp).toBe("number");
-    await releaseLock(testDir);
-  });
-  test("fails to acquire lock when another process holds it", async () => {
-    // First process acquires lock
-    const acquired1 = await acquireLock(testDir);
-    expect(acquired1).toBe(true);
-    // Second process tries to acquire lock
-    const acquired2 = await acquireLock(testDir);
-    expect(acquired2).toBe(false);
-    await releaseLock(testDir);
-  });
-  test("releases lock successfully", async () => {
-    await acquireLock(testDir);
-    await releaseLock(testDir);
-    // Verify lock file was deleted
-    const lockFile = Bun.file(lockPath);
-    expect(await lockFile.exists()).toBe(false);
-  });
-  test("can re-acquire lock after release", async () => {
-    const acquired1 = await acquireLock(testDir);
-    expect(acquired1).toBe(true);
-    await releaseLock(testDir);
-    const acquired2 = await acquireLock(testDir);
-    expect(acquired2).toBe(true);
-    await releaseLock(testDir);
-  });
-  test("removes stale lock when process is dead", async () => {
-    // Create a lock file with a fake PID that doesn't exist
-    const stalePid = 999999; // Very unlikely to be a real process
-    const staleLock = {
-      pid: stalePid,
-      timestamp: Date.now() - 60000, // 1 minute ago
-    };
-    await Bun.write(lockPath, JSON.stringify(staleLock));
-    // Try to acquire lock - should detect stale lock and remove it
-    const acquired = await acquireLock(testDir);
-    expect(acquired).toBe(true);
-    // Verify new lock file has current PID
-    const lockFile = Bun.file(lockPath);
-    const lockContent = await lockFile.text();
-    const lockData = JSON.parse(lockContent);
-    expect(lockData.pid).toBe(process.pid);
-    await releaseLock(testDir);
-  });
-  test("detects stale lock from OOM-killed process", async () => {
-    // Spawn a short-lived child process
-    const proc = spawn({
-      cmd: ["sleep", "0.1"],
-      stdout: "pipe",
-      stderr: "pipe",
-    });
-    // Get the child PID
-    const childPid = proc.pid;
-    // Wait for it to exit
-    await proc.exited;
-    // Create a lock file with the dead child's PID
-    const staleLock = {
-      pid: childPid,
-      timestamp: Date.now() - 60000, // 1 minute ago
-    };
-    await Bun.write(lockPath, JSON.stringify(staleLock));
-    // Now try to acquire lock - should detect child process is dead
-    const acquired = await acquireLock(testDir);
-    expect(acquired).toBe(true);
-    // Verify new lock has current PID
-    const lockFile = Bun.file(lockPath);
-    const lockContent = await lockFile.text();
-    const lockData = JSON.parse(lockContent);
-    expect(lockData.pid).toBe(process.pid);
-    await releaseLock(testDir);
-  });
-  test("does not remove lock when process is still alive", async () => {
-    // Create lock with current process PID
-    const validLock = {
-      pid: process.pid,
-      timestamp: Date.now() - 60000, // 1 minute ago
-    };
-    await Bun.write(lockPath, JSON.stringify(validLock));
-    // Try to acquire lock - should NOT remove it since process is alive
-    const acquired = await acquireLock(testDir);
-    expect(acquired).toBe(false);
-    // Verify lock still exists with same PID
-    const lockFile = Bun.file(lockPath);
-    const lockContent = await lockFile.text();
-    const lockData = JSON.parse(lockContent);
-    expect(lockData.pid).toBe(process.pid);
-  });
-  test("handles corrupted lock file gracefully", async () => {
-    // Create invalid JSON lock file
-    await Bun.write(lockPath, "not valid json");
-    // Should treat corrupt lock as stale and acquire successfully
-    const acquired = await acquireLock(testDir);
-    expect(acquired).toBe(true);
-  });
-  test("handles release when lock file doesn't exist", async () => {
-    // Should not throw when releasing non-existent lock
-    await expect(releaseLock(testDir)).resolves.toBeUndefined();
-  });
-});

package/test/helpers/timeout.ts DELETED Viewed

@@ -1,42 +0,0 @@
-/**
- * Test Timeout Helpers
- *
- * Utilities to prevent tests from hanging indefinitely.
- */
-/**
- * Wraps a promise with a hard timeout.
- * If the promise doesn't resolve within the timeout, rejects with a timeout error.
- *
- * @param promise The promise to wrap
- * @param timeoutMs Timeout in milliseconds
- * @param operation Description of the operation (for error messages)
- * @returns The promise result if it completes in time
- * @throws TimeoutError if the timeout is exceeded
- */
-export async function withTimeout<T>(promise: Promise<T>, timeoutMs: number, operation = "Operation"): Promise<T> {
-  return Promise.race([
-    promise,
-    new Promise<T>((_, reject) =>
-      setTimeout(() => reject(new Error(`${operation} timed out after ${timeoutMs}ms`)), timeoutMs),
-    ),
-  ]);
-}
-/**
- * Wraps a function call with a hard timeout.
- * Useful for wrapping synchronous or async functions that might hang.
- *
- * @param fn The function to execute
- * @param timeoutMs Timeout in milliseconds
- * @param operation Description of the operation (for error messages)
- * @returns The function result if it completes in time
- * @throws TimeoutError if the timeout is exceeded
- */
-export async function executeWithTimeout<T>(
-  fn: () => Promise<T> | T,
-  timeoutMs: number,
-  operation = "Operation",
-): Promise<T> {
-  return withTimeout(Promise.resolve(fn()), timeoutMs, operation);
-}

package/test/integration/US-002-TEST-SUMMARY.md DELETED Viewed

@@ -1,107 +0,0 @@
-# US-002 Test Summary: Context Provider Injection
-## Overview
-Created comprehensive test suite for US-002 that verifies context providers inject external data into agent prompts with proper token budget management.
-## Test File
-- **Location**: `test/integration/context-provider-injection.test.ts`
-- **Total Tests**: 20
-- **Passing**: 14 (features already implemented)
-- **Failing**: 6 (features not yet implemented)
-## Test Coverage by Acceptance Criteria
-### ✅ AC1: All registered context providers are called before agent execution
-**Status**: All tests passing (feature implemented)
-- ✓ Calls all registered context providers
-- ✓ Providers receive the current story
-- ✓ Works with no providers registered
-### ✅ AC2: Provider content appended under markdown section with label
-**Status**: All tests passing (feature implemented)
-- ✓ Appends provider content under labeled markdown section
-- ✓ Multiple providers create separate labeled sections
-- ✓ Provider content appended to existing context markdown
-### ❌ AC3: Total injected tokens respect token budget
-**Status**: 4 tests failing (feature NOT implemented correctly)
-**Issue**: Current implementation uses hardcoded `PLUGIN_CONTEXT_MAX_TOKENS = 20_000` instead of reading from `config.execution.contextProviderTokenBudget`
-Failing tests:
-- ✗ Respects default token budget of 2000 tokens when not configured
-- ✗ Respects custom token budget from config
-- ✗ Providers added in order until budget exhausted
-- ✗ Single provider exceeding budget is skipped
-### ✅ AC4: Provider errors caught, logged, and skipped
-**Status**: All tests passing (feature implemented)
-- ✓ Continues when a provider throws error
-- ✓ Handles all providers failing gracefully
-- ✓ Error in one provider doesn't affect others
-### ❌ AC5: Token budget configurable via execution.contextProviderTokenBudget
-**Status**: 2 tests failing (feature NOT implemented)
-**Issue**:
-1. `ExecutionConfig` type doesn't include `contextProviderTokenBudget` field
-2. `DEFAULT_CONFIG` doesn't set default value of 2000 tokens
-3. Context stage uses hardcoded value instead of reading from config
-Failing tests:
-- ✗ Default config includes contextProviderTokenBudget with default of 2000
-- ✗ Different projects can have different token budgets
-## Implementation Gaps
-### 1. Config Schema Missing Field
-**File**: `src/config/schema.ts`
-- Add `contextProviderTokenBudget: number` to `ExecutionConfig` interface
-- Add validation in `ExecutionConfigSchema` (Zod)
-- Set default value of 2000 in `DEFAULT_CONFIG.execution`
-### 2. Context Stage Uses Hardcoded Value
-**File**: `src/pipeline/stages/context.ts`
-- Line 32: `const PLUGIN_CONTEXT_MAX_TOKENS = 20_000;` (hardcoded)
-- Should read from: `ctx.config.execution.contextProviderTokenBudget`
-- Lines 62, 72: Replace `PLUGIN_CONTEXT_MAX_TOKENS` with config value
-## Test Execution
-```bash
-# Run US-002 tests only
-bun test ./test/integration/context-provider-injection.test.ts
-# Current results:
-# 14 pass, 6 fail, 46 expect() calls
-```
-## Next Steps for Implementer
-1. **Update ExecutionConfig interface** (src/config/schema.ts):
-   - Add `contextProviderTokenBudget: number` field
-   - Add Zod validation: `z.number().int().min(100).max(100000).default(2000)`
-   - Add to DEFAULT_CONFIG: `contextProviderTokenBudget: 2000`
-2. **Update context stage** (src/pipeline/stages/context.ts):
-   - Remove hardcoded `PLUGIN_CONTEXT_MAX_TOKENS` constant
-   - Read budget from `ctx.config.execution.contextProviderTokenBudget`
-   - Use configured value in budget checks (lines 62, 72)
-3. **Verify all tests pass**:
-   ```bash
-   bun test ./test/integration/context-provider-injection.test.ts
-   ```
-## Coverage Notes
-The test suite covers:
-- ✓ Provider registration and invocation
-- ✓ Markdown formatting with labels
-- ✓ Error handling and soft failures
-- ✓ Token budget enforcement (with config)
-- ✓ Multi-provider orchestration
-- ✓ Integration with existing PRD context
-- ✓ Built context element tracking
-All edge cases are covered per acceptance criteria.

package/test/integration/US-003-TEST-SUMMARY.md DELETED Viewed

@@ -1,149 +0,0 @@
-# US-003 Test Summary: Review Plugins Run After Built-in Verification
-**Story ID:** US-003
-**Date:** 2026-02-27
-**Status:** ✅ PASSED
-**Test File:** `test/integration/review-plugin-integration.test.ts`
-## Overview
-This test suite verifies that plugin reviewers are correctly integrated into the review pipeline stage, running after built-in checks and triggering appropriate retry/escalation on failure.
-## Test Results
-**Total Tests:** 19
-**Passed:** 19
-**Failed:** 0
-**Success Rate:** 100%
-## Acceptance Criteria Coverage
-### ✅ AC1: Plugin reviewers run after built-in checks pass
-| Test | Status |
-|------|--------|
-| Plugin reviewers execute when built-in checks pass | ✅ PASS |
-| Plugin reviewers do not run if built-in checks fail | ✅ PASS |
-| No plugin reviewers registered - continues normally | ✅ PASS |
-**Verification:** Plugin reviewers only execute after built-in checks succeed, preventing unnecessary work when code quality gates fail.
-### ✅ AC2: Each reviewer receives workdir and changed files
-| Test | Status |
-|------|--------|
-| Reviewer receives correct workdir | ✅ PASS |
-| Reviewer receives list of changed files | ✅ PASS |
-| Reviewer receives empty array when no files changed | ✅ PASS |
-**Verification:** Reviewers receive accurate context about the working directory and which files were modified, enabling targeted analysis.
-### ✅ AC3: Reviewer failure triggers retry/escalation
-| Test | Status |
-|------|--------|
-| Failing reviewer returns fail action | ✅ PASS |
-| Reviewer failure includes plugin name in reason | ✅ PASS |
-**Verification:** When a plugin reviewer fails, the pipeline returns a `fail` action with the plugin name in the failure reason, triggering the same retry/escalation logic as built-in check failures.
-### ✅ AC4: Reviewer output included in story result
-| Test | Status |
-|------|--------|
-| Passing reviewer output is captured | ✅ PASS |
-| Failing reviewer output is captured | ✅ PASS |
-**Verification:** All reviewer outputs (success and failure) are stored in `ctx.reviewResult.pluginReviewers`, providing debugging information and audit trail.
-### ✅ AC5: Exceptions count as failures
-| Test | Status |
-|------|--------|
-| Reviewer throwing exception counts as failure | ✅ PASS |
-| Exception message captured in output | ✅ PASS |
-| Non-Error exception converted to string | ✅ PASS |
-**Verification:** When a reviewer throws an exception, it's treated as a failure with the error message captured for debugging. The pipeline correctly handles both Error objects and primitive throws.
-### ✅ AC6: Multiple reviewers run sequentially with short-circuiting
-| Test | Status |
-|------|--------|
-| Multiple reviewers run in order when all pass | ✅ PASS |
-| First failure short-circuits remaining reviewers | ✅ PASS |
-| Exception short-circuits remaining reviewers | ✅ PASS |
-**Verification:** Reviewers execute sequentially in registration order. When one fails (or throws), subsequent reviewers are skipped, providing fail-fast behavior.
-### ✅ Edge Cases
-| Test | Status |
-|------|--------|
-| No plugins context - continues normally | ✅ PASS |
-| Reviewer returns empty output | ✅ PASS |
-| Reviewer without exitCode works | ✅ PASS |
-**Verification:** The implementation handles edge cases gracefully: missing plugin context, empty output strings, and optional exitCode field.
-## Implementation Verification
-### Key Files Modified
-1. **`src/pipeline/stages/review.ts`**
-   - Lines 77-155: Plugin reviewer execution logic
-   - Lines 35-53: `getChangedFiles()` helper function
-   - Correctly integrates plugin reviewers after built-in checks
-2. **`src/review/types.ts`**
-   - Lines 26-38: `PluginReviewerResult` interface
-   - Line 51: Extended `ReviewResult` with `pluginReviewers` field
-3. **`test/integration/review-plugin-integration.test.ts`**
-   - 722 lines of comprehensive test coverage
-   - Mock plugins and reviewers for isolated testing
-   - Git repository setup for realistic changed file detection
-### Type Safety
-- ✅ All TypeScript types correctly defined
-- ✅ No type errors (`bun run typecheck` passes)
-- ✅ Proper type guards and assertions
-### Error Handling
-- ✅ Exceptions caught and converted to failures
-- ✅ Error messages preserved for debugging
-- ✅ Non-Error throws handled correctly
-- ✅ Missing optional fields handled safely
-### Integration Points
-- ✅ Integrates with `PluginRegistry.getReviewers()`
-- ✅ Uses existing pipeline context structure
-- ✅ Follows established patterns from built-in checks
-- ✅ Compatible with retry/escalation logic
-## Performance Considerations
-- Reviewers run sequentially (not parallel) to prevent resource contention
-- Fail-fast behavior minimizes wasted computation
-- Changed files retrieved once and reused for all reviewers
-- No unnecessary git operations or file system scans
-## Conclusion
-**US-003 is fully implemented and verified.** All acceptance criteria are met with comprehensive test coverage. The implementation follows the codebase patterns, handles edge cases gracefully, and integrates seamlessly with the existing plugin system architecture.
-## Test Execution
-```bash
-$ bun test test/integration/review-plugin-integration.test.ts
- 19 pass
- 0 fail
- 51 expect() calls
-Ran 19 tests across 1 file. [1.71s]
-```
-**Final Status:** ✅ READY FOR PRODUCTION

package/test/integration/US-004-TEST-SUMMARY.md DELETED Viewed

@@ -1,106 +0,0 @@
-# US-004: Reporter plugins receive lifecycle events — Test Summary
-**Status:** ✅ PASSED
-**Date:** 2026-02-27
-**Commit:** 26181a1
-## Overview
-This story implements reporter lifecycle events that fire at appropriate points in the runner loop. All reporter calls are fire-and-forget (errors logged, never block pipeline).
-## Implementation Summary
-### Changes Made
-1. **Moved PRD initialization** (runner.ts:205)
-   - Moved `prd` declaration before try block to make it accessible in finally block
-   - Ensures `prd` is available for onRunEnd event even on failure/abort
-2. **Consolidated onRunEnd calls** (runner.ts:1417-1439)
-   - Moved onRunEnd reporter events to finally block
-   - Removed duplicate calls from success paths (parallel and sequential)
-   - Guarantees onRunEnd fires even when run fails or is aborted
-3. **Added dry-run onStoryComplete events** (runner.ts:666-684)
-   - Added missing onStoryComplete events for dry-run mode
-   - Ensures reporters receive events consistently across all execution modes
-### Key Design Decisions
-- **Finally block placement**: onRunEnd must fire even on exceptions, so it's placed in the finally block before teardown and lock release
-- **Error isolation**: Each reporter call is wrapped in try/catch to prevent one reporter's failure from affecting others
-- **Event ordering**: onRunEnd fires before plugin teardown to ensure reporters can still access plugin state
-## Test Results
-All 9 tests in `test/integration/reporter-lifecycle.test.ts` pass:
-### AC1: onRunStart fires once at run start ✅
-- Verified event contains: runId, feature, totalStories, startTime
-- Verified event fires exactly once per run
-### AC2: onStoryComplete fires after each story ✅
-- Verified event contains: runId, storyId, status, durationMs, cost, tier, testStrategy
-- Verified event fires for each story execution (including dry-run)
-- Verified correct status values (completed, failed, skipped, paused)
-### AC3: onRunEnd fires once at run end ✅
-- Verified event contains: runId, totalDurationMs, totalCost, storySummary
-- Verified storySummary contains: completed, failed, skipped, paused counts
-- Verified correct counts match PRD state
-### AC4: Reporter errors never block execution ✅
-- Verified failing reporter doesn't abort run
-- Verified run completes successfully despite reporter errors
-- Verified errors are logged (not thrown)
-### AC5: Multiple reporters all receive events ✅
-- Verified two reporters both receive onRunStart, onStoryComplete, onRunEnd
-- Verified second reporter receives events even if first reporter fails
-- Verified no short-circuiting on error (all reporters always execute)
-### AC6: Events fire even when run fails or is aborted ✅
-- Verified onRunStart and onRunEnd fire when stories are pre-failed
-- Verified onRunEnd fires in finally block (even on exception)
-- Verified storySummary reflects actual failure state
-## Additional Test Coverage
-- **onStoryComplete for different outcomes**: Verified events for completed, failed, skipped, paused stories
-- **Multiple stories**: Verified consistent runId across all events in same run
-- **Dry-run mode**: Verified reporters receive events in dry-run mode
-## Verification Command
-```bash
-bun test test/integration/reporter-lifecycle.test.ts
-```
-**Result:** 9 pass, 0 fail, 48 expect() calls
-## Integration with Existing Code
-- **US-001 (Plugin loading)**: Uses pluginRegistry.getReporters() to retrieve all loaded reporters
-- **US-002 (Context provider injection)**: No conflicts, reporters operate independently
-- **US-003 (Review plugins)**: No conflicts, different lifecycle hooks
-## Notes
-- Reporter events are fire-and-forget by design
-- All reporter methods are optional (IReporter interface)
-- Reporter errors are logged at WARN level (not ERROR) since they're non-critical
-- onRunEnd always fires in finally block, even if try block throws
-- PRD must be accessible in finally block, so it's initialized before try
-## Acceptance Criteria Status
-| AC | Description | Status |
-|----|-------------|--------|
-| 1  | onRunStart fires once at run start with runId, feature, totalStories, startTime | ✅ |
-| 2  | onStoryComplete fires after each story with storyId, status, durationMs, cost, tier, testStrategy | ✅ |
-| 3  | onRunEnd fires once at run end with runId, totalDurationMs, totalCost, storySummary counts | ✅ |
-| 4  | Reporter errors are caught and logged but never block execution | ✅ |
-| 5  | Multiple reporters all receive events (not short-circuited on error) | ✅ |
-| 6  | Events fire even when the run fails or is aborted (onRunEnd still fires) | ✅ |
-**Overall Status:** ✅ ALL ACCEPTANCE CRITERIA MET