@nathapp/nax 0.28.0 → 0.29.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (376)

package/CHANGELOG.md +13 -2
package/dist/nax.js +72691 -0
package/package.json +12 -4
package/src/cli/config.ts +3 -1
package/src/config/defaults.ts +1 -0
package/src/config/schemas.ts +1 -0
package/src/config/types.ts +1 -0
package/src/context/builder.ts +10 -1
package/src/prompts/sections/role-task.ts +4 -2
package/src/review/runner.ts +6 -1
package/src/version.ts +2 -1
package/.claude/rules/01-project-conventions.md +0 -34
package/.claude/rules/02-test-architecture.md +0 -39
package/.claude/rules/03-test-writing.md +0 -58
package/.claude/rules/04-forbidden-patterns.md +0 -29
package/.claude/settings.json +0 -15
package/.githooks/pre-commit +0 -16
package/.gitlab-ci.yml +0 -103
package/.mcp.json +0 -8
package/BRIEF.md +0 -140
package/CLAUDE.md +0 -143
package/US-007-IMPLEMENTATION.md +0 -139
package/biome.json +0 -14
package/bun.lock +0 -163
package/bunfig.toml +0 -12
package/docker-compose.test.yml +0 -15
package/docs/20260216-fix-plan-context-review.md +0 -56
package/docs/20260216-relentless-vs-ngent-comparison.md +0 -208
package/docs/20260216-v02-plan.md +0 -136
package/docs/20260216-v02-review.md +0 -685
package/docs/20260217-dogfood-findings.md +0 -56
package/docs/20260217-p2-plus-plan.md +0 -117
package/docs/20260217-partial-fixes-plan.md +0 -62
package/docs/20260217-plan-analyze-spec.md +0 -117
package/docs/20260217-post-impl-review.md +0 -1137
package/docs/20260217-quick-wins-plan.md +0 -66
package/docs/20260217-split-runner-plan.md +0 -75
package/docs/20260217-v03-impl-plan.md +0 -80
package/docs/20260217-v03-post-impl-review.md +0 -589
package/docs/20260217-v04-impl-plan.md +0 -86
package/docs/20260217-v05-post-impl-review.md +0 -850
package/docs/20260217-v06-post-impl-review.md +0 -817
package/docs/20260218-adr003-port-plan.md +0 -151
package/docs/20260218-review-adr003-verification.md +0 -175
package/docs/20260219-fix-plan-bug16-19.md +0 -79
package/docs/20260219-fix-plan-bug20-22.md +0 -114
package/docs/20260219-plan-llm-routing.md +0 -116
package/docs/20260219-review-bug20-22-fixes.md +0 -135
package/docs/20260219-routing-baseline-keyword.md +0 -63
package/docs/20260220-plan-structured-logging-p1.md +0 -80
package/docs/20260220-plan-structured-logging-p2.md +0 -37
package/docs/20260220-review-llm-routing.md +0 -180
package/docs/20260220-review-post-fix-llm-routing.md +0 -70
package/docs/20260221-fix-plan-relevantfiles-split.md +0 -101
package/docs/20260221-fix-plan-routing-mode.md +0 -125
package/docs/20260221-review-v0.9-implementation.md +0 -379
package/docs/20260222-fix-plan-v091-routing-isolation.md +0 -197
package/docs/20260223-fix-plan-prompt-audit.md +0 -62
package/docs/20260224-nax-roadmap-phases.md +0 -189
package/docs/20260225-phase2-llm-service-layer.md +0 -401
package/docs/20260225-review-v0.10.1.md +0 -187
package/docs/20260303-v010-implementation-plan.md +0 -165
package/docs/20260304-review-nax.md +0 -492
package/docs/CLAUDE.md.bak +0 -191
package/docs/ROADMAP.md +0 -390
package/docs/SPEC-rectification.md +0 -0
package/docs/SPEC.md +0 -324
package/docs/US-001-plugin-loading-verification.md +0 -152
package/docs/adr/ADR-005-implementation-plan.md +0 -655
package/docs/adr/ADR-005-pipeline-re-architecture.md +0 -464
package/docs/architecture-analysis.md +0 -1076
package/docs/bugs/BUG-21-escalation-null-attempts.md +0 -48
package/docs/bugs-from-dogfood-run-c.md +0 -243
package/docs/code-review-20260228.md +0 -612
package/docs/code-review-v0.15.0.md +0 -629
package/docs/hook-lifecycle-test-plan.md +0 -149
package/docs/releases/v0.11.0-and-earlier.md +0 -20
package/docs/releases/v0.12.0.md +0 -15
package/docs/releases/v0.13.0.md +0 -14
package/docs/releases/v0.14.0.md +0 -20
package/docs/releases/v0.14.1.md +0 -36
package/docs/releases/v0.14.2.md +0 -51
package/docs/releases/v0.14.3.md +0 -174
package/docs/releases/v0.14.4.md +0 -94
package/docs/releases/v0.15.0.md +0 -502
package/docs/releases/v0.15.1.md +0 -170
package/docs/releases/v0.15.3.md +0 -193
package/docs/specs/bug-039-orphan-processes.md +0 -131
package/docs/specs/bug-040-review-rectification.md +0 -82
package/docs/specs/bug-041-cross-story-test-isolation.md +0 -88
package/docs/specs/bug-042-verifier-failure-capture.md +0 -117
package/docs/specs/bun-pty-migration.md +0 -171
package/docs/specs/central-run-registry.md +0 -116
package/docs/specs/feat-010-smart-runner-git-history.md +0 -96
package/docs/specs/feat-011-file-context-strategy.md +0 -73
package/docs/specs/feat-012-tdd-writer-tier.md +0 -79
package/docs/specs/feat-013-test-after-review.md +0 -89
package/docs/specs/feat-014-heartbeat-observability.md +0 -127
package/docs/specs/status-file-consolidation.md +0 -93
package/docs/specs/status-file-v0.10.1.md +0 -812
package/docs/specs/trigger-completion.md +0 -145
package/docs/specs/verification-architecture-v2.md +0 -343
package/docs/tdd/strategies.md +0 -97
package/docs/v0.10-global-config.md +0 -206
package/docs/v0.10-plugin-system.md +0 -415
package/docs/v0.10-prompt-optimizer.md +0 -234
package/docs/v0.3-spec.md +0 -244
package/docs/v0.4-spec.md +0 -140
package/docs/v0.5-spec.md +0 -237
package/docs/v0.6-spec.md +0 -371
package/docs/v0.7-spec.md +0 -177
package/docs/v0.8-llm-routing.md +0 -206
package/docs/v0.8-structured-logging.md +0 -132
package/docs/v0.9.3-prompt-audit.md +0 -112
package/examples/plugins/console-reporter/index.test.ts +0 -207
package/examples/plugins/console-reporter/index.ts +0 -110
package/memory/topic/feat-010-baseref.md +0 -28
package/memory/topic/feat-013-test-after-deprecation.md +0 -22
package/nax/config.json +0 -154
package/nax/features/bug-039-medium/prd.json +0 -45
package/nax/features/bugfix-v0171/prd.json +0 -52
package/nax/features/central-run-registry/prd.json +0 -105
package/nax/features/config-management/prd.json +0 -108
package/nax/features/config-management/progress.txt +0 -5
package/nax/features/diagnose/acceptance.test.ts +0 -414
package/nax/features/diagnose/prd.json +0 -41
package/nax/features/nax-compliance/prd.json +0 -52
package/nax/features/nax-compliance/progress.txt +0 -1
package/nax/features/orchestration-fixes/prd.json +0 -89
package/nax/features/orchestration-fixes/progress.txt +0 -1
package/nax/features/plugin-integration/US-007-VERIFICATION.md +0 -259
package/nax/features/plugin-integration/prd.json +0 -208
package/nax/features/plugin-integration/progress.txt +0 -5
package/nax/features/post-rearch-bugfix/prd.json +0 -137
package/nax/features/precheck/prd.json +0 -205
package/nax/features/precheck/progress.txt +0 -15
package/nax/features/prompt-builder/prd.json +0 -152
package/nax/features/prompt-builder/progress.txt +0 -3
package/nax/features/review-quality/prd.json +0 -55
package/nax/features/routing-persistence/prd.json +0 -104
package/nax/features/routing-persistence/progress.txt +0 -1
package/nax/features/smart-test-runner/plan.md +0 -7
package/nax/features/smart-test-runner/prd.json +0 -203
package/nax/features/smart-test-runner/progress.txt +0 -13
package/nax/features/smart-test-runner/spec.md +0 -7
package/nax/features/smart-test-runner/tasks.md +0 -8
package/nax/features/status-file-consolidation/prd.json +0 -106
package/nax/features/structured-logging/prd.json +0 -199
package/nax/features/trigger-completion/prd.json +0 -150
package/nax/features/trigger-completion/progress.txt +0 -7
package/nax/features/unlock/prd.json +0 -36
package/nax/features/v0.18.3-execution-reliability/prd.json +0 -80
package/nax/features/v0.18.3-execution-reliability/progress.txt +0 -3
package/nax/features/v0.19.0-hardening/plan.md +0 -7
package/nax/features/v0.19.0-hardening/prd.json +0 -84
package/nax/features/v0.19.0-hardening/progress.txt +0 -7
package/nax/features/v0.19.0-hardening/spec.md +0 -18
package/nax/features/v0.19.0-hardening/tasks.md +0 -8
package/nax/features/verify-v2/prd.json +0 -79
package/nax/features/verify-v2/progress.txt +0 -3
package/nax/status.json +0 -36
package/test/COVERAGE-GAPS.md +0 -333
package/test/e2e/cm-003-default-view.test.ts +0 -195
package/test/e2e/plan-analyze-run.test.ts +0 -902
package/test/helpers/helpers.test.ts +0 -295
package/test/helpers/timeout.ts +0 -42
package/test/integration/US-002-TEST-SUMMARY.md +0 -107
package/test/integration/US-003-TEST-SUMMARY.md +0 -149
package/test/integration/US-004-TEST-SUMMARY.md +0 -106
package/test/integration/US-005-TEST-SUMMARY.md +0 -138
package/test/integration/US-007-TEST-SUMMARY.md +0 -100
package/test/integration/cli/agent-validation.test.ts +0 -439
package/test/integration/cli/cli-config-default-edge-cases.test.ts +0 -223
package/test/integration/cli/cli-config-default-view.test.ts +0 -230
package/test/integration/cli/cli-config-diff.test.ts +0 -461
package/test/integration/cli/cli-config-prompts-explain.test.ts +0 -74
package/test/integration/cli/cli-config.test.ts +0 -737
package/test/integration/cli/cli-diagnose.test.ts +0 -595
package/test/integration/cli/cli-logs.test.ts +0 -346
package/test/integration/cli/cli-plugins.test.ts +0 -679
package/test/integration/cli/cli-precheck.test.ts +0 -372
package/test/integration/cli/cli-run-headless.test.ts +0 -174
package/test/integration/cli/cli.test.ts +0 -76
package/test/integration/cli/precheck-integration.test.ts +0 -476
package/test/integration/cli/precheck-orchestrator.test.ts +0 -247
package/test/integration/cli/precheck.test.ts +0 -806
package/test/integration/config/config-loader.test.ts +0 -266
package/test/integration/config/config.test.ts +0 -444
package/test/integration/config/merger.test.ts +0 -466
package/test/integration/config/paths.test.ts +0 -52
package/test/integration/config/security-loader.test.ts +0 -83
package/test/integration/context/context-integration.test.ts +0 -703
package/test/integration/context/context-path-security.test.ts +0 -173
package/test/integration/context/context-provider-injection.test.ts +0 -507
package/test/integration/context/context-verification-integration.test.ts +0 -296
package/test/integration/context/s5-greenfield-fallback.test.ts +0 -298
package/test/integration/execution/execution-isolation.test.ts +0 -143
package/test/integration/execution/execution.test.ts +0 -634
package/test/integration/execution/feature-status-write.test.ts +0 -302
package/test/integration/execution/parallel.test.ts +0 -251
package/test/integration/execution/prd-pause.test.ts +0 -205
package/test/integration/execution/prd-resolvers.test.ts +0 -186
package/test/integration/execution/progress.test.ts +0 -34
package/test/integration/execution/runner-batching.test.ts +0 -682
package/test/integration/execution/runner-config-plugins.test.ts +0 -462
package/test/integration/execution/runner-escalation.test.ts +0 -561
package/test/integration/execution/runner-fixes.test.ts +0 -400
package/test/integration/execution/runner-plugin-integration.test.ts +0 -544
package/test/integration/execution/runner-queue-and-attempts.test.ts +0 -476
package/test/integration/execution/status-file-integration.test.ts +0 -289
package/test/integration/execution/status-file.test.ts +0 -380
package/test/integration/execution/status-writer.test.ts +0 -447
package/test/integration/execution/story-id-in-events.test.ts +0 -274
package/test/integration/interaction/interaction-chain-pipeline.test.ts +0 -476
package/test/integration/pipeline/hooks.test.ts +0 -363
package/test/integration/pipeline/pipeline-acceptance.test.ts +0 -303
package/test/integration/pipeline/pipeline-events.test.ts +0 -476
package/test/integration/pipeline/pipeline.test.ts +0 -660
package/test/integration/pipeline/reporter-lifecycle.test.ts +0 -862
package/test/integration/pipeline/verify-stage.test.ts +0 -286
package/test/integration/plan/analyze-integration.test.ts +0 -262
package/test/integration/plan/analyze-scanner.test.ts +0 -132
package/test/integration/plan/logger.test.ts +0 -461
package/test/integration/plan/plan.test.ts +0 -157
package/test/integration/plugins/config-integration.test.ts +0 -173
package/test/integration/plugins/config-resolution.test.ts +0 -523
package/test/integration/plugins/loader.test.ts +0 -644
package/test/integration/plugins/plugins-registry.test.ts +0 -747
package/test/integration/plugins/validator.test.ts +0 -564
package/test/integration/prompts/pb-004-migration.test.ts +0 -523
package/test/integration/review/review-config-commands.test.ts +0 -320
package/test/integration/review/review-config-schema.test.ts +0 -117
package/test/integration/review/review-plugin-integration.test.ts +0 -729
package/test/integration/review/review.test.ts +0 -150
package/test/integration/routing/plugin-routing-advanced.test.ts +0 -461
package/test/integration/routing/plugin-routing-core.test.ts +0 -527
package/test/integration/routing/routing-stage-bug-021.test.ts +0 -275
package/test/integration/routing/routing-stage-greenfield.test.ts +0 -287
package/test/integration/tdd/tdd-cleanup.test.ts +0 -246
package/test/integration/tdd/tdd-orchestrator-core.test.ts +0 -565
package/test/integration/tdd/tdd-orchestrator-failureCategory.test.ts +0 -355
package/test/integration/tdd/tdd-orchestrator-fallback.test.ts +0 -311
package/test/integration/tdd/tdd-orchestrator-lite.test.ts +0 -289
package/test/integration/tdd/tdd-orchestrator-prompts.test.ts +0 -260
package/test/integration/tdd/tdd-orchestrator-verdict.test.ts +0 -536
package/test/integration/tmp/headless-test/test.jsonl +0 -30
package/test/integration/verification/test-scanner.test.ts +0 -403
package/test/integration/verification/verification-asset-check.test.ts +0 -143
package/test/integration/worktree/manager.test.ts +0 -218
package/test/integration/worktree/worktree-merge.test.ts +0 -341
package/test/manual/logging-formatter-demo.ts +0 -158
package/test/ui/tui-agent-panel.test.tsx +0 -99
package/test/ui/tui-pty-integration.test.tsx +0 -146
package/test/unit/acceptance.test.ts +0 -187
package/test/unit/agent-stderr-capture.test.ts +0 -147
package/test/unit/agents/claude.test.ts +0 -107
package/test/unit/analyze-classifier.test.ts +0 -216
package/test/unit/analyze.test.ts +0 -224
package/test/unit/auto-detect.test.ts +0 -250
package/test/unit/cli-status-project-level.test.ts +0 -283
package/test/unit/cli-status.test.ts +0 -418
package/test/unit/commands/common.test.ts +0 -321
package/test/unit/commands/logs.test.ts +0 -458
package/test/unit/commands/runs.test.ts +0 -303
package/test/unit/commands/unlock.test.ts +0 -320
package/test/unit/config/defaults.test.ts +0 -70
package/test/unit/config/quality-commands-schema.test.ts +0 -72
package/test/unit/config/regression-gate-schema.test.ts +0 -160
package/test/unit/config/smart-runner-flag.test.ts +0 -250
package/test/unit/constitution-generators.test.ts +0 -161
package/test/unit/constitution.test.ts +0 -210
package/test/unit/context/context-autodetect.test.ts +0 -297
package/test/unit/context/context-build.test.ts +0 -575
package/test/unit/context/context-coverage.test.ts +0 -236
package/test/unit/context/context-error.test.ts +0 -93
package/test/unit/context/context-estimate-tokens.test.ts +0 -201
package/test/unit/context/context-format.test.ts +0 -302
package/test/unit/context/context-isolation.test.ts +0 -267
package/test/unit/context/context-sort.test.ts +0 -93
package/test/unit/context/context-story.test.ts +0 -108
package/test/unit/context/prior-failures.test.ts +0 -463
package/test/unit/context.test.ts +0 -1726
package/test/unit/cost.test.ts +0 -231
package/test/unit/crash-recovery.test.ts +0 -309
package/test/unit/escalation.test.ts +0 -127
package/test/unit/execution/lifecycle/run-completion.test.ts +0 -240
package/test/unit/execution/lifecycle/run-regression.test.ts +0 -420
package/test/unit/execution/pid-registry.test.ts +0 -241
package/test/unit/execution/sequential-executor.test.ts +0 -235
package/test/unit/execution/sfc-004-dead-code-cleanup.test.ts +0 -89
package/test/unit/execution/structured-failure.test.ts +0 -415
package/test/unit/execution-logging-stderr.test.ts +0 -157
package/test/unit/execution-stage.test.ts +0 -123
package/test/unit/fix-generator.test.ts +0 -276
package/test/unit/formatters.test.ts +0 -468
package/test/unit/greenfield.test.ts +0 -180
package/test/unit/hooks/shell-security.test.ts +0 -40
package/test/unit/interaction/auto-plugin.test.ts +0 -162
package/test/unit/interaction/human-review-trigger.test.ts +0 -165
package/test/unit/interaction-network-failures.test.ts +0 -390
package/test/unit/interaction-plugins.test.ts +0 -472
package/test/unit/logging/formatter.test.ts +0 -456
package/test/unit/merge.test.ts +0 -269
package/test/unit/metrics/aggregator.test.ts +0 -164
package/test/unit/metrics/tracker.test.ts +0 -186
package/test/unit/metrics.test.ts +0 -276
package/test/unit/optimizer/noop.optimizer.test.ts +0 -125
package/test/unit/optimizer/rule-based.optimizer.test.ts +0 -358
package/test/unit/pipeline/event-bus.test.ts +0 -105
package/test/unit/pipeline/routing-partial-override.test.ts +0 -121
package/test/unit/pipeline/runner-retry.test.ts +0 -89
package/test/unit/pipeline/stages/autofix.test.ts +0 -97
package/test/unit/pipeline/stages/completion-review-gate.test.ts +0 -218
package/test/unit/pipeline/stages/execution-ambiguity.test.ts +0 -311
package/test/unit/pipeline/stages/execution-merge-conflict.test.ts +0 -218
package/test/unit/pipeline/stages/rectify.test.ts +0 -101
package/test/unit/pipeline/stages/regression-stage.test.ts +0 -69
package/test/unit/pipeline/stages/review.test.ts +0 -201
package/test/unit/pipeline/stages/routing-idempotence.test.ts +0 -139
package/test/unit/pipeline/stages/routing-initial-complexity.test.ts +0 -321
package/test/unit/pipeline/stages/routing-persistence.test.ts +0 -380
package/test/unit/pipeline/stages/verify.test.ts +0 -267
package/test/unit/pipeline/subscribers/events-writer.test.ts +0 -227
package/test/unit/pipeline/subscribers/hooks.test.ts +0 -84
package/test/unit/pipeline/subscribers/interaction.test.ts +0 -313
package/test/unit/pipeline/subscribers/registry.test.ts +0 -149
package/test/unit/pipeline/subscribers/reporters.test.ts +0 -90
package/test/unit/pipeline/verify-smart-runner.test.ts +0 -345
package/test/unit/prd-auto-default.test.ts +0 -291
package/test/unit/prd-failure-category.test.ts +0 -177
package/test/unit/prd-get-next-story.test.ts +0 -215
package/test/unit/precheck/checks-warnings.test.ts +0 -114
package/test/unit/precheck-checks.test.ts +0 -841
package/test/unit/precheck-story-size-gate.test.ts +0 -288
package/test/unit/precheck-types.test.ts +0 -143
package/test/unit/prompts/builder.test.ts +0 -258
package/test/unit/prompts/loader.test.ts +0 -355
package/test/unit/prompts/sections/conventions.test.ts +0 -30
package/test/unit/prompts/sections/isolation.test.ts +0 -35
package/test/unit/prompts/sections/role-task.test.ts +0 -40
package/test/unit/prompts/sections/sections.test.ts +0 -238
package/test/unit/prompts/sections/story.test.ts +0 -45
package/test/unit/prompts/sections/verdict.test.ts +0 -58
package/test/unit/prompts.test.ts +0 -476
package/test/unit/queue.test.ts +0 -237
package/test/unit/rectification.test.ts +0 -285
package/test/unit/registry.test.ts +0 -288
package/test/unit/review/runner.test.ts +0 -117
package/test/unit/routing/content-hash.test.ts +0 -99
package/test/unit/routing/routing-stability.test.ts +0 -208
package/test/unit/routing/strategies/llm.test.ts +0 -306
package/test/unit/routing-advanced.test.ts +0 -313
package/test/unit/routing-core.test.ts +0 -341
package/test/unit/routing-strategies.test.ts +0 -440
package/test/unit/storyid-events.test.ts +0 -213
package/test/unit/tdd-verdict.test.ts +0 -492
package/test/unit/test-output-parser.test.ts +0 -377
package/test/unit/ui/tui-controls.test.ts +0 -335
package/test/unit/ui/tui-cost-and-pty.test.ts +0 -190
package/test/unit/ui/tui-layout.test.ts +0 -379
package/test/unit/ui/tui-stories.test.ts +0 -333
package/test/unit/unit-isolation.test.ts +0 -135
package/test/unit/utils/git.test.ts +0 -50
package/test/unit/utils/path-security.test.ts +0 -47
package/test/unit/utils-helpers.test.ts +0 -318
package/test/unit/verdict.test.ts +0 -325
package/test/unit/verification/orchestrator-types.test.ts +0 -54
package/test/unit/verification/orchestrator.test.ts +0 -66
package/test/unit/verification/smart-runner-config.test.ts +0 -163
package/test/unit/verification/smart-runner-discovery.test.ts +0 -354
package/test/unit/verification/smart-runner.test.ts +0 -262
package/test/unit/verification/strategies/acceptance.test.ts +0 -33
package/test/unit/verification/strategies/regression.test.ts +0 -87
package/test/unit/verification/strategies/scoped.test.ts +0 -100
package/test/unit/worktree-manager.test.ts +0 -159
package/tsconfig.json +0 -27

package/docs/specs/status-file-v0.10.1.md DELETED

@@ -1,812 +0,0 @@
-# Spec: v0.10.1 — Status File + TDD Escalation Retry
-**Version:** v0.10.1
-**Author:** Subrina
-**Date:** 2026-02-25
-**Status:** Draft
----
-## Summary
-Add a `--status-file <path>` flag to `nax run` that writes a machine-readable JSON status file, updated after each story completes. Enables external tools (CI/CD, orchestrators, dashboards) to monitor nax runs without parsing logs or aggregating hooks.
-## Motivation
-- **Log parsing is fragile** — format changes break consumers
-- **Hook aggregation has gaps** — if a hook fails, events are lost; no single source of truth
-- **nax already tracks this state** — `RunResult`, story counts, cost, PRD status are all in memory
-- **General-purpose** — useful for any integration, not just our orchestrator skill
-## Interface
-### CLI Flag
-```bash
-nax run -f <feature> --headless --status-file ./nax-status.json
-```
-| Flag | Type | Default | Description |
-|:-----|:-----|:--------|:------------|
-| `--status-file` | `string` | `undefined` | Path to write JSON status file. If not set, no file is written. |
-Relative paths resolved from `cwd` (same as `--headless` log behavior).
-### Status File Schema
-```typescript
-interface NaxStatusFile {
-  /** Schema version for forward compatibility */
-  version: 1;
-  /** Run metadata */
-  run: {
-    id: string;              // Run ID (e.g. "run-2026-02-25T10-00-00-000Z")
-    feature: string;         // Feature name
-    startedAt: string;       // ISO 8601
-    status: "running" | "completed" | "failed" | "stalled";
-    dryRun: boolean;
-  };
-  /** Aggregate progress */
-  progress: {
-    total: number;           // Total stories in PRD
-    passed: number;
-    failed: number;
-    paused: number;
-    blocked: number;
-    pending: number;         // total - passed - failed - paused - blocked
-  };
-  /** Cost tracking */
-  cost: {
-    spent: number;           // USD accumulated
-    limit: number | null;    // From config.execution.costLimit
-  };
-  /** Current story being processed (null if between stories) */
-  current: {
-    storyId: string;
-    title: string;
-    complexity: string;      // simple | medium | complex
-    tddStrategy: string;     // test-after | tdd-lite | three-session-tdd
-    model: string;           // Resolved model name
-    attempt: number;         // Current attempt (1-based)
-    phase: string;           // routing | test-write | implement | verify | review
-  } | null;
-  /** Iteration count */
-  iterations: number;
-  /** Last updated timestamp */
-  updatedAt: string;         // ISO 8601
-  /** Duration so far in ms */
-  durationMs: number;
-}
-```
-### Example Output
-```json
-{
-  "version": 1,
-  "run": {
-    "id": "run-2026-02-25T10-00-00-000Z",
-    "feature": "auth-refactor",
-    "startedAt": "2026-02-25T10:00:00Z",
-    "status": "running",
-    "dryRun": false
-  },
-  "progress": {
-    "total": 12,
-    "passed": 7,
-    "failed": 1,
-    "paused": 0,
-    "blocked": 1,
-    "pending": 3
-  },
-  "cost": {
-    "spent": 1.23,
-    "limit": 5.00
-  },
-  "current": {
-    "storyId": "US-008",
-    "title": "Add retry logic to queue handler",
-    "complexity": "medium",
-    "tddStrategy": "tdd-lite",
-    "model": "claude-sonnet-4-5-20250514",
-    "attempt": 1,
-    "phase": "implement"
-  },
-  "iterations": 8,
-  "updatedAt": "2026-02-25T10:15:32Z",
-  "durationMs": 932000
-}
-```
-## Implementation
-### Files to Change
-| File | Change |
-|:-----|:-------|
-| `src/execution/runner.ts` | Add `statusFile?: string` to `RunOptions`. Call `writeStatusFile()` at key points. |
-| `src/execution/status-file.ts` | **New file.** `writeStatusFile()` function — builds `NaxStatusFile` from run state, writes atomically. |
-| `src/main.ts` (or wherever CLI args are parsed) | Add `--status-file` option, pass to `RunOptions`. |
-### Write Points
-Status file is updated at these moments:
-1. **Run start** — initial state (all stories pending)
-2. **Story start** — update `current` with story info
-3. **Story complete/fail/pause** — update `progress` counts, clear `current`
-4. **Run end** — final state (`status: "completed"` or `"failed"`)
-### Atomic Writes
-Write to `<path>.tmp` then rename to `<path>` to prevent readers from seeing partial JSON:
-```typescript
-import { rename } from "node:fs/promises";
-async function writeStatusFile(path: string, status: NaxStatusFile): Promise<void> {
-  const tmpPath = `${path}.tmp`;
-  await Bun.write(tmpPath, JSON.stringify(status, null, 2));
-  await rename(tmpPath, path);
-}
-```
-### Integration with RunOptions
-```typescript
-// src/execution/runner.ts
-export interface RunOptions {
-  // ... existing fields
-  /** Path to write JSON status file (optional) */
-  statusFile?: string;
-}
-```
-### Progress Counting
-Derive from PRD state (already loaded):
-```typescript
-function countProgress(prd: PRD): NaxStatusFile["progress"] {
-  const stories = prd.stories;
-  const passed = stories.filter(s => s.status === "passed").length;
-  const failed = stories.filter(s => s.status === "failed").length;
-  const paused = stories.filter(s => s.status === "paused").length;
-  const blocked = stories.filter(s => s.status === "blocked").length;
-  const total = stories.length;
-  return { total, passed, failed, paused, blocked, pending: total - passed - failed - paused - blocked };
-}
-```
-### Cleanup
-The status file is **not** deleted on run end — it persists as a record of the last run. Consumers can check `run.status` to determine if the run is still active.
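A consumer of the status file might check whether a run is still active roughly as follows. This is a minimal sketch, assuming the consumer has already read the file contents as a string. `StatusSnapshot`, `isRunActive`, and `summarize` are hypothetical names that do not appear in the spec, and treating only `"running"` as active (so `"stalled"` counts as inactive) is an assumption.

```typescript
// Subset of the NaxStatusFile schema from the spec; only the fields this sketch reads.
interface StatusSnapshot {
  run: { status: "running" | "completed" | "failed" | "stalled" };
  progress: {
    total: number;
    passed: number;
    failed: number;
    paused: number;
    blocked: number;
    pending: number;
  };
}

// Hypothetical helper: a run counts as active only while run.status is "running".
// Whether "stalled" should also count as active is an assumption left to the consumer.
function isRunActive(snapshot: StatusSnapshot): boolean {
  return snapshot.run.status === "running";
}

// Hypothetical helper: one-line progress summary for a dashboard or CI log.
function summarize(snapshot: StatusSnapshot): string {
  const { passed, total, failed, pending } = snapshot.progress;
  return `${passed}/${total} passed, ${failed} failed, ${pending} pending`;
}

// Values copied from the spec's "Example Output" section.
const exampleSnapshot: StatusSnapshot = JSON.parse(`{
  "run": { "status": "running" },
  "progress": { "total": 12, "passed": 7, "failed": 1, "paused": 0, "blocked": 1, "pending": 3 }
}`);
```

Because the spec writes the file via a temporary path followed by a rename, a poller that re-reads the file on an interval should not observe partial JSON.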
-## Testing
-| Test | Description |
-|:-----|:------------|
-| `status-file.test.ts` | Unit: `writeStatusFile()` produces valid JSON, atomic write works |
-| `status-file.test.ts` | Unit: `countProgress()` correctly counts all states |
-| `runner.test.ts` | Integration: `--status-file` option flows through to `RunOptions` |
-| `runner.test.ts` | Integration: status file updates at each write point |
-| Manual | `--status-file` + `--dry-run` produces correct output |
-## Non-Goals
-- **Real-time streaming** — this is a polled file, not a websocket/SSE stream
-- **Historical run data** — status file represents current/last run only (hooks + events.jsonl cover history)
-- **`nax status --json` command** — future work, can read this file
-## Migration
-None. New optional flag, no breaking changes. If `--status-file` is not passed, behavior is identical to v0.10.0.
----
-# Feature 2: TDD Escalation Retry
-## Summary
-Three-session TDD currently hard-codes `pause` for all failures — isolation violations, session crashes, and test failures all result in the story being paused with no retry. This means TDD stories never benefit from the escalation system that test-after stories use.
-Change: TDD failures should follow the same escalation retry pattern as test-after. Only pause when all retry paths are exhausted.
-## Problem
-Current flow (all TDD failures):
-```
-TDD failure → needsHumanReview=true → execution stage returns "pause" → story paused → NO RETRY
-```
-test-after flow (for comparison):
-```
-Agent failure → execution stage returns "escalate" → runner bumps tier → retries → only fails after max attempts
-```
-## Proposed Retry Strategy
-TDD failures are classified into three categories with different retry paths:
-### Category 1: Isolation Violation (test-writer touches source)
-**Current:** Pause immediately.
-**Proposed:** Auto-downgrade to tdd-lite, then escalate.
-```
-three-session-tdd fails (isolation violation)
-  → Retry 1: three-session-tdd-lite (same tier, skip isolation for writer/implementer)
-    → Success? Done ✅
-    → Fail? Escalate to next tier
-      → Retry 2: tdd-lite + stronger model
-        → Success? Done ✅
-        → Fail? Continue escalation through tier chain
-          → All tiers exhausted → pause (needs human review) ⏸
-```
-**Note:** The zero-file fallback already does this for one specific case (test-writer creates no test files → auto-retry as lite). This generalizes that pattern to all isolation violations.
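The Category 1 retry sequence above can be sketched as a function that expands a tier chain into the ordered list of attempts. This is an illustration only: `Attempt` and `isolationRetryPlan` are hypothetical names, the tier labels in the usage line are placeholders, and the sketch assumes the first entry of the chain is the tier the story started on.

```typescript
// Hypothetical shape for one attempt in the retry sequence.
interface Attempt {
  strategy: "three-session-tdd" | "three-session-tdd-lite";
  tier: string;
}

// Expands a tier chain (weakest first) into the attempts described in Category 1:
// the initial strict attempt, a lite retry on the same tier, then a lite retry on each stronger tier.
// Once the returned list is exhausted, the spec's final fallback is to pause for human review.
function isolationRetryPlan(tiers: string[]): Attempt[] {
  const [first] = tiers;
  if (first === undefined) return [];
  const plan: Attempt[] = [{ strategy: "three-session-tdd", tier: first }];
  for (const tier of tiers) {
    plan.push({ strategy: "three-session-tdd-lite", tier });
  }
  return plan;
}

// Placeholder tier names, not taken from the spec.
const plan = isolationRetryPlan(["fast", "balanced"]);
```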
-### Category 2: Session Failure (agent crash, timeout, non-zero exit)
-**Current:** Pause immediately.
-**Proposed:** Escalate model tier (same as test-after).
-```
-TDD session fails (crash/timeout)
-  → Escalate to next model tier
-    → Retry with stronger model (same TDD strategy)
-      → Success? Done ✅
-      → Fail? Continue escalation
-        → All tiers exhausted → mark failed ❌
-```
-### Category 3: Tests Still Failing After All Sessions
-**Current:** Post-TDD verification runs. If tests fail → pause.
-**Proposed:** Escalate model tier.
-```
-All 3 sessions complete but tests still fail
-  → Escalate to next model tier
-    → Retry full TDD with stronger model
-      → Success? Done ✅
-      → Fail? Continue escalation
-        → All tiers exhausted → mark failed ❌
-```
-### Summary Table
-| Failure Type | Current Action | New Action | Final Fallback |
-|:-------------|:--------------|:-----------|:--------------|
-| Isolation violation | pause | Downgrade to lite → escalate | pause (human review) |
-| Zero test files created | lite retry (exists) | Keep existing + escalate | pause (human review) |
-| Session crash/timeout | pause | Escalate tier | fail |
-| Tests fail post-TDD | pause | Escalate tier | fail |
-| Verifier flags bad code | pause | Escalate tier | pause (human review) |
-**Why "pause" for isolation/verifier but "fail" for crashes?**
-- Isolation violations and verifier concerns suggest the code needs *human judgment* — the AI may be fundamentally misunderstanding the task.
-- Crashes and test failures are mechanical — a stronger model usually fixes them.
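The "Final Fallback" column of the summary table can be read as a lookup from failure type to terminal action once all tiers are exhausted. The sketch below is one way to encode it. `FailureType`, `FinalFallback`, and `finalFallback` are hypothetical names, and `"zero-test-files"` is a label invented here for the table's "Zero test files created" row; it is not one of the `failureCategory` values the spec defines.

```typescript
// Labels for the five rows of the summary table.
// "zero-test-files" is hypothetical; the other four match the spec's failureCategory values.
type FailureType =
  | "isolation-violation"
  | "zero-test-files"
  | "session-failure"
  | "tests-failing"
  | "verifier-rejected";

type FinalFallback = "pause" | "fail";

// Terminal action after every retry and tier has been tried, per the summary table.
function finalFallback(failure: FailureType): FinalFallback {
  switch (failure) {
    // Needs human judgment: the agent may be misunderstanding the task.
    case "isolation-violation":
    case "zero-test-files":
    case "verifier-rejected":
      return "pause";
    // Mechanical failures: nothing left to retry, so the story is marked failed.
    case "session-failure":
    case "tests-failing":
      return "fail";
  }
}
```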
-## Implementation
-### Changes to `ThreeSessionTddResult`
-Add a `failureCategory` field so the execution stage can differentiate:
-```typescript
-export interface ThreeSessionTddResult {
-  success: boolean;
-  sessions: TddSessionResult[];
-  needsHumanReview: boolean;
-  reviewReason?: string;
-  totalCost: number;
-  lite: boolean;
-  /** NEW: Categorize failure for retry routing */
-  failureCategory?: "isolation-violation" | "session-failure" | "tests-failing" | "verifier-rejected";
-}
-```
-### Changes to `execution.ts` (pipeline stage)
-Replace the blanket `pause` with category-based routing:
-```typescript
-// Current:
-if (tddResult.needsHumanReview) {
-  return { action: "pause", reason: tddResult.reviewReason };
-}
-// Proposed:
-if (!tddResult.success) {
-  switch (tddResult.failureCategory) {
-    case "isolation-violation":
-      // If already lite → escalate. If strict → retry as lite (same tier).
-      if (tddResult.lite) {
-        return { action: "escalate", reason: tddResult.reviewReason };
-      }
-      // Store flag in context so runner knows to downgrade strategy
-      ctx.retryAsLite = true;
-      return { action: "escalate", reason: `Isolation violation — downgrading to lite` };
-    case "session-failure":
-    case "tests-failing":
-      return { action: "escalate", reason: tddResult.reviewReason };
-    case "verifier-rejected":
-      // Escalate first, pause only after all tiers exhausted
-      return { action: "escalate", reason: tddResult.reviewReason };
-    default:
-      return { action: "pause", reason: tddResult.reviewReason };
-  }
-}
-```
-### Changes to `runner.ts` (escalation handler)
-When escalating a TDD story with `retryAsLite`, update the story's routing to use `three-session-tdd-lite`:
-```typescript
-case "escalate": {
-  // ... existing escalation logic ...
-  // NEW: If retryAsLite flag set, downgrade TDD strategy
-  if (pipelineResult.context?.retryAsLite && story.routing) {
-    story.routing.testStrategy = "three-session-tdd-lite";
-  }
-  // ... rest of escalation ...
-}
-```
-### Changes to `tdd/orchestrator.ts`
-Set `failureCategory` based on what went wrong:
-```typescript
-// After session 1 (test-writer) isolation failure:
-return {
-  success: false,
-  ...
-  failureCategory: "isolation-violation",
-};
-// After session crash/timeout:
-return {
-  success: false,
-  ...
-  failureCategory: "session-failure",
-};
-// After post-TDD verification fails:
-return {
-  success: false,
-  ...
-  failureCategory: "tests-failing",
-};
-```
-### Files to Change
-| File | Change |
-|:-----|:-------|
-| `src/tdd/types.ts` | Add `failureCategory` to `ThreeSessionTddResult` |
-| `src/tdd/orchestrator.ts` | Set `failureCategory` at each failure point |
-| `src/pipeline/stages/execution.ts` | Route by `failureCategory` instead of blanket `pause` |
-| `src/pipeline/types.ts` | Add `retryAsLite?: boolean` to `PipelineContext` |
-| `src/execution/runner.ts` | Handle `retryAsLite` flag in escalation case |
-### Testing
-| Test | Description |
-|:-----|:------------|
-| `tdd/orchestrator.test.ts` | Unit: each failure path sets correct `failureCategory` |
-| `pipeline/execution.test.ts` | Unit: isolation violation returns `escalate` (not `pause`) |
-| `pipeline/execution.test.ts` | Unit: lite isolation violation returns `escalate` |
-| `pipeline/execution.test.ts` | Unit: session failure returns `escalate` |
-| `execution/runner.test.ts` | Integration: TDD story escalates through tiers before failing |
-| `execution/runner.test.ts` | Integration: `retryAsLite` downgrades strategy on next attempt |
-| Manual | Run with intentionally strict project, verify lite downgrade + tier escalation |
-## Retry Budget
-Uses the existing escalation config (`autoMode.escalation.tierOrder`). Example:
-```json
-{
-  "autoMode": {
-    "escalation": {
-      "enabled": true,
-      "tierOrder": [
-        { "tier": "fast", "attempts": 2 },
-        { "tier": "balanced", "attempts": 2 },
-        { "tier": "powerful", "attempts": 1 }
-      ]
-    }
-  }
-}
-```
-For a strict TDD story with isolation violation:
-```
-Attempt 1: three-session-tdd @ fast           → isolation violation
-Attempt 2: three-session-tdd-lite @ fast      → tests fail
-Attempt 3: three-session-tdd-lite @ balanced  → tests fail
-Attempt 4: three-session-tdd-lite @ balanced  → tests fail
-Attempt 5: three-session-tdd-lite @ powerful  → success ✅ (or fail → pause)
-```
-Max cost is bounded by the existing tier budget. No new config needed.
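To make the bound concrete, the tier budget can be expanded into an ordered list with one entry per attempt. This is a sketch under assumptions: `TierBudget` and `attemptSchedule` are illustrative names, not taken from the nax source, and the input mirrors the `tierOrder` example above.

```typescript
// Shape of one entry in autoMode.escalation.tierOrder, as shown in the example config.
interface TierBudget {
  tier: string;
  attempts: number;
}

// Hypothetical helper: expands the budget into the tier used for each attempt, in order.
function attemptSchedule(tierOrder: TierBudget[]): string[] {
  const schedule: string[] = [];
  for (const { tier, attempts } of tierOrder) {
    for (let i = 0; i < attempts; i++) schedule.push(tier);
  }
  return schedule;
}

const tierOrder: TierBudget[] = [
  { tier: "fast", attempts: 2 },
  { tier: "balanced", attempts: 2 },
  { tier: "powerful", attempts: 1 },
];

// One entry per attempt; the length is the upper bound on attempts for a story.
const schedule = attemptSchedule(tierOrder);
```

For the example config the schedule has five entries, matching the five attempts in the walkthrough; a lite downgrade changes the strategy of an attempt but not the number of attempts.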
----
-# Feature 3: Structured Verifier Verdicts
-## Summary
-The verifier (session 3) is designed to judge whether the implementer's changes are legitimate — especially when the implementer modified test files. Currently, this judgment is implicit: the verifier runs as a regular agent, and the only signal is "did tests pass after verifier ran?" There's no structured verdict flowing back to the pipeline.
-Add structured output parsing to the verifier session so its judgment feeds into `failureCategory` and the escalation system.
-## Problem
-Current verifier prompt asks it to:
-1. Run tests and verify they pass
-2. Review implementation quality
-3. Check acceptance criteria
-4. **Check if implementer modified test files and judge legitimacy**
-5. Fix issues minimally
-But the result is just `{ success: boolean, estimatedCost: number }` — same as any agent session. The verifier's judgment about test modifications, code quality, and acceptance criteria is lost.
-**Consequences:**
-- If verifier finds illegitimate test modifications, it tries to fix them but we don't know *what* it found
-- If verifier can't fix the issue, it exits non-zero → treated same as a crash
-- No signal to differentiate "tests pass but code is bad" from "tests fail"
-- The `VerifierDecision` type exists in `types.ts` but is **never populated**
-## Proposed Solution
-### Structured Verdict File
-Instead of parsing agent stdout (fragile), the verifier writes a structured verdict file that the orchestrator reads after the session:
-```
-<workdir>/.nax-verifier-verdict.json
-```
-**Why a file?** Claude Code (the agent) can easily write files. Parsing structured output from stdout is unreliable with Claude Code since it mixes tool calls, thinking, and output.
-### Verdict Schema
-```typescript
-interface VerifierVerdict {
-  /** Schema version */
-  version: 1;
-  /** Overall approval */
-  approved: boolean;
-  /** Test results */
-  tests: {
-    /** Did all tests pass? */
-    allPassing: boolean;
-    /** Number of tests passing */
-    passCount: number;
-    /** Number of tests failing */
-    failCount: number;
-  };
-  /** Implementer test modification review */
-  testModifications: {
-    /** Were test files modified by implementer? */
-    detected: boolean;
-    /** List of modified test files */
-    files: string[];
-    /** Are the modifications legitimate? */
-    legitimate: boolean;
-    /** Reasoning for legitimacy judgment */
-    reasoning: string;
-  };
-  /** Acceptance criteria check */
-  acceptanceCriteria: {
-    /** All criteria met? */
-    allMet: boolean;
-    /** Per-criterion status */
-    criteria: Array<{
-      criterion: string;
-      met: boolean;
-      note?: string;
-    }>;
-  };
-  /** Code quality assessment */
-  quality: {
-    /** Overall quality: good | acceptable | poor */
-    rating: "good" | "acceptable" | "poor";
-    /** Issues found */
-    issues: string[];
-  };
-  /** Fixes applied by verifier */
-  fixes: string[];
-  /** Overall reasoning */
-  reasoning: string;
-}
-```
-### Updated Verifier Prompt
-```typescript
-export function buildVerifierPrompt(story: UserStory): string {
-  return `# Test-Driven Development — Session 3: Verify
-You are in the third session of a three-session TDD workflow. Tests and implementation are complete.
-**Story:** ${story.title}
-**Your tasks:**
-1. Run all tests and verify they pass
-2. Review the implementation for quality and correctness
-3. Check that the implementation meets all acceptance criteria
-4. Check if test files were modified by the implementer. If yes, verify the changes are legitimate fixes (e.g. fixing incorrect expectations) and NOT just loosening assertions to mask bugs.
-5. If any issues exist, fix them minimally
-**Acceptance Criteria:**
-${story.acceptanceCriteria.map((ac, i) => `${i + 1}. ${ac}`).join("\n")}
-**IMPORTANT — Write Verdict File:**
-After completing your review, write a JSON verdict file to \`.nax-verifier-verdict.json\` in the project root.
-\`\`\`json
-{
-  "version": 1,
-  "approved": true,
-  "tests": {
-    "allPassing": true,
-    "passCount": 15,
-    "failCount": 0
-  },
-  "testModifications": {
-    "detected": false,
-    "files": [],
-    "legitimate": true,
-    "reasoning": "No test files were modified by implementer"
-  },
-  "acceptanceCriteria": {
-    "allMet": true,
-    "criteria": [
-      { "criterion": "Criterion text", "met": true }
-    ]
-  },
-  "quality": {
-    "rating": "good",
-    "issues": []
-  },
-  "fixes": [],
-  "reasoning": "All tests pass, implementation is clean, all criteria met."
-}
-\`\`\`
-Set \`approved: false\` if:
-- Tests are failing and you cannot fix them
-- Implementer loosened test assertions to mask bugs (testModifications.legitimate = false)
-- Critical acceptance criteria are not met
-- Code quality is poor with security or correctness issues
-Set \`approved: true\` if:
-- All tests pass (or pass after your minimal fixes)
-- Implementation is clean and follows conventions
-- All acceptance criteria met
-- Any test modifications by implementer are legitimate fixes
-When done, commit any fixes with message: "fix: verify and adjust ${story.title}"`;
-}
-```
-### Orchestrator Changes
-After verifier session completes, read and parse the verdict file:
-```typescript
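-// At the top of tdd/orchestrator.ts (needed by the snippet below):
-import path from "node:path";
-import { unlink } from "node:fs/promises";
-// After session 3 completes, read the verdict file: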
-const verdictPath = path.join(workdir, ".nax-verifier-verdict.json");
-let verdict: VerifierVerdict | null = null;
-try {
-  const file = Bun.file(verdictPath);
-  if (await file.exists()) {
-    verdict = await file.json() as VerifierVerdict;
-    logger.info("tdd", "Verifier verdict loaded", {
-      storyId: story.id,
-      approved: verdict.approved,
-      testsAllPassing: verdict.tests.allPassing,
-      testModsDetected: verdict.testModifications.detected,
-      testModsLegitimate: verdict.testModifications.legitimate,
-      qualityRating: verdict.quality.rating,
-      allCriteriaMet: verdict.acceptanceCriteria.allMet,
-    });
-  } else {
-    logger.warn("tdd", "No verifier verdict file found — falling back to test-only check", {
-      storyId: story.id,
-    });
-  }
-} catch (err) {
-  logger.warn("tdd", "Failed to parse verifier verdict", {
-    storyId: story.id,
-    error: String(err),
-  });
-}
-// Clean up verdict file (don't leave it in the repo)
-try {
-  await unlink(verdictPath);
-} catch { /* ignore */ }
-```
-### Verdict → failureCategory Mapping
-```typescript
-function categorizeVerdict(
-  verdict: VerifierVerdict | null,
-  session3Success: boolean,
-  testsPass: boolean,
-): { success: boolean; failureCategory?: FailureCategory; reviewReason?: string } {
-  // No verdict file → fall back to existing behavior (test-only check)
-  if (!verdict) {
-    if (testsPass) return { success: true };
-    return {
-      success: false,
-      failureCategory: "tests-failing",
-      reviewReason: "Tests failing after all sessions (no verdict file)",
-    };
-  }
-  // Verdict: approved
-  if (verdict.approved) {
-    return { success: true };
-  }
-  // Verdict: not approved — classify why
-  // Illegitimate test modifications (implementer cheated)
-  if (verdict.testModifications.detected && !verdict.testModifications.legitimate) {
-    return {
-      success: false,
-      failureCategory: "verifier-rejected",
-      reviewReason: `Verifier rejected: illegitimate test modifications in ${verdict.testModifications.files.join(", ")}. ${verdict.testModifications.reasoning}`,
-    };
-  }
-  // Tests failing
-  if (!verdict.tests.allPassing) {
-    return {
-      success: false,
-      failureCategory: "tests-failing",
-      reviewReason: `Tests failing: ${verdict.tests.failCount} failures. ${verdict.reasoning}`,
-    };
-  }
-  // Acceptance criteria not met
-  if (!verdict.acceptanceCriteria.allMet) {
-    const unmet = verdict.acceptanceCriteria.criteria
-      .filter(c => !c.met)
-      .map(c => c.criterion);
-    return {
-      success: false,
-      failureCategory: "verifier-rejected",
-      reviewReason: `Acceptance criteria not met: ${unmet.join("; ")}`,
-    };
-  }
-  // Poor quality
-  if (verdict.quality.rating === "poor") {
-    return {
-      success: false,
-      failureCategory: "verifier-rejected",
-      reviewReason: `Poor code quality: ${verdict.quality.issues.join("; ")}`,
-    };
-  }
-  // Catch-all: verdict says not approved but no clear reason
-  return {
-    success: false,
-    failureCategory: "verifier-rejected",
-    reviewReason: verdict.reasoning || "Verifier rejected without specific reason",
-  };
-}
-```
-### Escalation Behavior per Verdict
-| Verdict Reason | failureCategory | Escalation Path |
-|:---------------|:---------------|:---------------|
-| Illegitimate test mods | `verifier-rejected` | Escalate tier → pause after all tiers |
-| Tests failing | `tests-failing` | Escalate tier → fail after all tiers |
-| Criteria not met | `verifier-rejected` | Escalate tier → pause after all tiers |
-| Poor quality | `verifier-rejected` | Escalate tier → pause after all tiers |
-| Approved | — | Success ✅ |
-| No verdict file | `tests-failing` if the fallback test check fails | Same as before |
-### Verdict File Lifecycle
-1. **Created by:** Verifier agent (session 3) writes `.nax-verifier-verdict.json`
-2. **Read by:** TDD orchestrator after session 3 completes
-3. **Deleted by:** TDD orchestrator after reading (not committed to git)
-4. **Fallback:** If file missing or unparseable, fall back to existing behavior (post-TDD test verification)
-### `.gitignore`
-Add to project `.gitignore` (or nax init template):
-```
-.nax-verifier-verdict.json
-```
-### Files to Change
-| File | Change |
-|:-----|:-------|
-| `src/tdd/types.ts` | Add `VerifierVerdict` interface |
-| `src/tdd/prompts.ts` | Update `buildVerifierPrompt()` with verdict file instructions |
-| `src/tdd/orchestrator.ts` | Read verdict file after session 3, map to `failureCategory` |
-| `src/tdd/verdict.ts` | **New file.** `readVerdict()`, `categorizeVerdict()`, `cleanupVerdict()` |
-### Testing
-| Test | Description |
-|:-----|:------------|
-| `tdd/verdict.test.ts` | Unit: `categorizeVerdict()` for all verdict combinations |
-| `tdd/verdict.test.ts` | Unit: missing verdict file falls back gracefully |
-| `tdd/verdict.test.ts` | Unit: malformed JSON falls back gracefully |
-| `tdd/orchestrator.test.ts` | Integration: verdict file read + cleanup after session 3 |
-| `tdd/orchestrator.test.ts` | Integration: illegitimate test mods → `verifier-rejected` |
-| Manual | Run TDD on a story, verify verdict file is written and consumed |
-### Robustness
-**What if the agent doesn't write the verdict file?**
-Fall back to existing behavior: run tests independently, check pass/fail. This is the same as v0.10.0. The verdict file is an enhancement, not a requirement.
-**What if the JSON is malformed?**
-Log warning, fall back to test-only check. Never crash.
-**What if the agent writes wrong data?**
-Validate required fields (`version`, `approved`, `tests`). Missing fields → fall back. The verdict is advisory — the independent test run is the ground truth for "tests pass."
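The required-field check described above could look roughly like the following. It is a sketch, not the shipped implementation: `MinimalVerdict`, `isRecord`, `hasRequiredVerdictFields`, and `parseVerdict` are hypothetical names, and only the three fields named above (`version`, `approved`, `tests`) are validated.

```typescript
// Minimal shape that must be present before a verdict is trusted.
interface MinimalVerdict {
  version: 1;
  approved: boolean;
  tests: { allPassing: boolean; passCount: number; failCount: number };
}

function isRecord(value: unknown): value is Record<string, unknown> {
  return typeof value === "object" && value !== null && !Array.isArray(value);
}

// Hypothetical type guard: true only when version, approved, and tests are well-formed.
function hasRequiredVerdictFields(value: unknown): value is MinimalVerdict {
  if (!isRecord(value)) return false;
  if (value.version !== 1) return false;
  if (typeof value.approved !== "boolean") return false;
  const tests = value.tests;
  if (!isRecord(tests)) return false;
  return (
    typeof tests.allPassing === "boolean" &&
    typeof tests.passCount === "number" &&
    typeof tests.failCount === "number"
  );
}

// Returns null on malformed JSON or missing fields, so the caller can
// fall back to the test-only check instead of crashing.
function parseVerdict(raw: string): MinimalVerdict | null {
  try {
    const parsed: unknown = JSON.parse(raw);
    return hasRequiredVerdictFields(parsed) ? parsed : null;
  } catch {
    return null;
  }
}
```

Returning `null` for every failure mode keeps a single fallback path: a missing file, malformed JSON, and incomplete data all lead to the same test-only check.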
----
-# v0.10.1 Summary
-Three features, cohesive release:
-| Feature | Files Changed | Effort | Dependency |
-|:--------|:-------------|:-------|:-----------|
-| 1. `--status-file` | 3 (new `status-file.ts`, modify `runner.ts`, CLI) | Medium | None |
-| 2. TDD Escalation Retry | 5 (types, orchestrator, execution stage, pipeline types, runner) | Medium | None |
-| 3. Structured Verifier Verdicts | 4 (types, prompts, orchestrator, new `verdict.ts`) | Medium | Feature 2 (feeds `failureCategory`) |
-**Total files:** 10 changed/new (some overlap — `types.ts` and `orchestrator.ts` touched by features 2+3).
-**Breaking changes:** None. All features are additive/optional.
-**Config changes:** None. Uses existing escalation config.
-### Implementation Order
-1. Feature 1 (`--status-file`) — independent, can ship alone
-2. Feature 2 (TDD escalation) — core retry logic
-3. Feature 3 (verifier verdicts) — builds on feature 2's `failureCategory`