npm - @nathapp/nax - Versions diffs - 0.28.0 → 0.30.0 - Mend

@nathapp/nax 0.28.0 → 0.30.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (385) hide show

package/CHANGELOG.md +23 -2
package/bin/nax.ts +2 -3
package/dist/nax.js +72753 -0
package/package.json +11 -3
package/src/cli/analyze.ts +2 -7
package/src/cli/config.ts +3 -1
package/src/config/defaults.ts +1 -0
package/src/config/schemas.ts +1 -0
package/src/config/types.ts +1 -0
package/src/context/builder.ts +10 -1
package/src/execution/lifecycle/headless-formatter.ts +2 -4
package/src/prompts/builder.ts +12 -69
package/src/prompts/sections/isolation.ts +38 -8
package/src/prompts/sections/role-task.ts +79 -17
package/src/review/runner.ts +6 -1
package/src/version.ts +2 -1
package/.claude/rules/01-project-conventions.md +0 -34
package/.claude/rules/02-test-architecture.md +0 -39
package/.claude/rules/03-test-writing.md +0 -58
package/.claude/rules/04-forbidden-patterns.md +0 -29
package/.claude/settings.json +0 -15
package/.githooks/pre-commit +0 -16
package/.gitlab-ci.yml +0 -103
package/.mcp.json +0 -8
package/BRIEF.md +0 -140
package/CLAUDE.md +0 -143
package/US-007-IMPLEMENTATION.md +0 -139
package/biome.json +0 -14
package/bun.lock +0 -163
package/bunfig.toml +0 -12
package/docker-compose.test.yml +0 -15
package/docs/20260216-fix-plan-context-review.md +0 -56
package/docs/20260216-relentless-vs-ngent-comparison.md +0 -208
package/docs/20260216-v02-plan.md +0 -136
package/docs/20260216-v02-review.md +0 -685
package/docs/20260217-dogfood-findings.md +0 -56
package/docs/20260217-p2-plus-plan.md +0 -117
package/docs/20260217-partial-fixes-plan.md +0 -62
package/docs/20260217-plan-analyze-spec.md +0 -117
package/docs/20260217-post-impl-review.md +0 -1137
package/docs/20260217-quick-wins-plan.md +0 -66
package/docs/20260217-split-runner-plan.md +0 -75
package/docs/20260217-v03-impl-plan.md +0 -80
package/docs/20260217-v03-post-impl-review.md +0 -589
package/docs/20260217-v04-impl-plan.md +0 -86
package/docs/20260217-v05-post-impl-review.md +0 -850
package/docs/20260217-v06-post-impl-review.md +0 -817
package/docs/20260218-adr003-port-plan.md +0 -151
package/docs/20260218-review-adr003-verification.md +0 -175
package/docs/20260219-fix-plan-bug16-19.md +0 -79
package/docs/20260219-fix-plan-bug20-22.md +0 -114
package/docs/20260219-plan-llm-routing.md +0 -116
package/docs/20260219-review-bug20-22-fixes.md +0 -135
package/docs/20260219-routing-baseline-keyword.md +0 -63
package/docs/20260220-plan-structured-logging-p1.md +0 -80
package/docs/20260220-plan-structured-logging-p2.md +0 -37
package/docs/20260220-review-llm-routing.md +0 -180
package/docs/20260220-review-post-fix-llm-routing.md +0 -70
package/docs/20260221-fix-plan-relevantfiles-split.md +0 -101
package/docs/20260221-fix-plan-routing-mode.md +0 -125
package/docs/20260221-review-v0.9-implementation.md +0 -379
package/docs/20260222-fix-plan-v091-routing-isolation.md +0 -197
package/docs/20260223-fix-plan-prompt-audit.md +0 -62
package/docs/20260224-nax-roadmap-phases.md +0 -189
package/docs/20260225-phase2-llm-service-layer.md +0 -401
package/docs/20260225-review-v0.10.1.md +0 -187
package/docs/20260303-v010-implementation-plan.md +0 -165
package/docs/20260304-review-nax.md +0 -492
package/docs/CLAUDE.md.bak +0 -191
package/docs/ROADMAP.md +0 -390
package/docs/SPEC-rectification.md +0 -0
package/docs/SPEC.md +0 -324
package/docs/US-001-plugin-loading-verification.md +0 -152
package/docs/adr/ADR-005-implementation-plan.md +0 -655
package/docs/adr/ADR-005-pipeline-re-architecture.md +0 -464
package/docs/architecture-analysis.md +0 -1076
package/docs/bugs/BUG-21-escalation-null-attempts.md +0 -48
package/docs/bugs-from-dogfood-run-c.md +0 -243
package/docs/code-review-20260228.md +0 -612
package/docs/code-review-v0.15.0.md +0 -629
package/docs/hook-lifecycle-test-plan.md +0 -149
package/docs/releases/v0.11.0-and-earlier.md +0 -20
package/docs/releases/v0.12.0.md +0 -15
package/docs/releases/v0.13.0.md +0 -14
package/docs/releases/v0.14.0.md +0 -20
package/docs/releases/v0.14.1.md +0 -36
package/docs/releases/v0.14.2.md +0 -51
package/docs/releases/v0.14.3.md +0 -174
package/docs/releases/v0.14.4.md +0 -94
package/docs/releases/v0.15.0.md +0 -502
package/docs/releases/v0.15.1.md +0 -170
package/docs/releases/v0.15.3.md +0 -193
package/docs/specs/bug-039-orphan-processes.md +0 -131
package/docs/specs/bug-040-review-rectification.md +0 -82
package/docs/specs/bug-041-cross-story-test-isolation.md +0 -88
package/docs/specs/bug-042-verifier-failure-capture.md +0 -117
package/docs/specs/bun-pty-migration.md +0 -171
package/docs/specs/central-run-registry.md +0 -116
package/docs/specs/feat-010-smart-runner-git-history.md +0 -96
package/docs/specs/feat-011-file-context-strategy.md +0 -73
package/docs/specs/feat-012-tdd-writer-tier.md +0 -79
package/docs/specs/feat-013-test-after-review.md +0 -89
package/docs/specs/feat-014-heartbeat-observability.md +0 -127
package/docs/specs/status-file-consolidation.md +0 -93
package/docs/specs/status-file-v0.10.1.md +0 -812
package/docs/specs/trigger-completion.md +0 -145
package/docs/specs/verification-architecture-v2.md +0 -343
package/docs/tdd/strategies.md +0 -97
package/docs/v0.10-global-config.md +0 -206
package/docs/v0.10-plugin-system.md +0 -415
package/docs/v0.10-prompt-optimizer.md +0 -234
package/docs/v0.3-spec.md +0 -244
package/docs/v0.4-spec.md +0 -140
package/docs/v0.5-spec.md +0 -237
package/docs/v0.6-spec.md +0 -371
package/docs/v0.7-spec.md +0 -177
package/docs/v0.8-llm-routing.md +0 -206
package/docs/v0.8-structured-logging.md +0 -132
package/docs/v0.9.3-prompt-audit.md +0 -112
package/examples/plugins/console-reporter/index.test.ts +0 -207
package/examples/plugins/console-reporter/index.ts +0 -110
package/memory/topic/feat-010-baseref.md +0 -28
package/memory/topic/feat-013-test-after-deprecation.md +0 -22
package/nax/config.json +0 -154
package/nax/features/bug-039-medium/prd.json +0 -45
package/nax/features/bugfix-v0171/prd.json +0 -52
package/nax/features/central-run-registry/prd.json +0 -105
package/nax/features/config-management/prd.json +0 -108
package/nax/features/config-management/progress.txt +0 -5
package/nax/features/diagnose/acceptance.test.ts +0 -414
package/nax/features/diagnose/prd.json +0 -41
package/nax/features/nax-compliance/prd.json +0 -52
package/nax/features/nax-compliance/progress.txt +0 -1
package/nax/features/orchestration-fixes/prd.json +0 -89
package/nax/features/orchestration-fixes/progress.txt +0 -1
package/nax/features/plugin-integration/US-007-VERIFICATION.md +0 -259
package/nax/features/plugin-integration/prd.json +0 -208
package/nax/features/plugin-integration/progress.txt +0 -5
package/nax/features/post-rearch-bugfix/prd.json +0 -137
package/nax/features/precheck/prd.json +0 -205
package/nax/features/precheck/progress.txt +0 -15
package/nax/features/prompt-builder/prd.json +0 -152
package/nax/features/prompt-builder/progress.txt +0 -3
package/nax/features/review-quality/prd.json +0 -55
package/nax/features/routing-persistence/prd.json +0 -104
package/nax/features/routing-persistence/progress.txt +0 -1
package/nax/features/smart-test-runner/plan.md +0 -7
package/nax/features/smart-test-runner/prd.json +0 -203
package/nax/features/smart-test-runner/progress.txt +0 -13
package/nax/features/smart-test-runner/spec.md +0 -7
package/nax/features/smart-test-runner/tasks.md +0 -8
package/nax/features/status-file-consolidation/prd.json +0 -106
package/nax/features/structured-logging/prd.json +0 -199
package/nax/features/trigger-completion/prd.json +0 -150
package/nax/features/trigger-completion/progress.txt +0 -7
package/nax/features/unlock/prd.json +0 -36
package/nax/features/v0.18.3-execution-reliability/prd.json +0 -80
package/nax/features/v0.18.3-execution-reliability/progress.txt +0 -3
package/nax/features/v0.19.0-hardening/plan.md +0 -7
package/nax/features/v0.19.0-hardening/prd.json +0 -84
package/nax/features/v0.19.0-hardening/progress.txt +0 -7
package/nax/features/v0.19.0-hardening/spec.md +0 -18
package/nax/features/v0.19.0-hardening/tasks.md +0 -8
package/nax/features/verify-v2/prd.json +0 -79
package/nax/features/verify-v2/progress.txt +0 -3
package/nax/status.json +0 -36
package/src/prompts/templates/implementer.ts +0 -6
package/src/prompts/templates/single-session.ts +0 -6
package/src/prompts/templates/test-writer.ts +0 -6
package/src/prompts/templates/verifier.ts +0 -6
package/test/COVERAGE-GAPS.md +0 -333
package/test/e2e/cm-003-default-view.test.ts +0 -195
package/test/e2e/plan-analyze-run.test.ts +0 -902
package/test/helpers/helpers.test.ts +0 -295
package/test/helpers/timeout.ts +0 -42
package/test/integration/US-002-TEST-SUMMARY.md +0 -107
package/test/integration/US-003-TEST-SUMMARY.md +0 -149
package/test/integration/US-004-TEST-SUMMARY.md +0 -106
package/test/integration/US-005-TEST-SUMMARY.md +0 -138
package/test/integration/US-007-TEST-SUMMARY.md +0 -100
package/test/integration/cli/agent-validation.test.ts +0 -439
package/test/integration/cli/cli-config-default-edge-cases.test.ts +0 -223
package/test/integration/cli/cli-config-default-view.test.ts +0 -230
package/test/integration/cli/cli-config-diff.test.ts +0 -461
package/test/integration/cli/cli-config-prompts-explain.test.ts +0 -74
package/test/integration/cli/cli-config.test.ts +0 -737
package/test/integration/cli/cli-diagnose.test.ts +0 -595
package/test/integration/cli/cli-logs.test.ts +0 -346
package/test/integration/cli/cli-plugins.test.ts +0 -679
package/test/integration/cli/cli-precheck.test.ts +0 -372
package/test/integration/cli/cli-run-headless.test.ts +0 -174
package/test/integration/cli/cli.test.ts +0 -76
package/test/integration/cli/precheck-integration.test.ts +0 -476
package/test/integration/cli/precheck-orchestrator.test.ts +0 -247
package/test/integration/cli/precheck.test.ts +0 -806
package/test/integration/config/config-loader.test.ts +0 -266
package/test/integration/config/config.test.ts +0 -444
package/test/integration/config/merger.test.ts +0 -466
package/test/integration/config/paths.test.ts +0 -52
package/test/integration/config/security-loader.test.ts +0 -83
package/test/integration/context/context-integration.test.ts +0 -703
package/test/integration/context/context-path-security.test.ts +0 -173
package/test/integration/context/context-provider-injection.test.ts +0 -507
package/test/integration/context/context-verification-integration.test.ts +0 -296
package/test/integration/context/s5-greenfield-fallback.test.ts +0 -298
package/test/integration/execution/execution-isolation.test.ts +0 -143
package/test/integration/execution/execution.test.ts +0 -634
package/test/integration/execution/feature-status-write.test.ts +0 -302
package/test/integration/execution/parallel.test.ts +0 -251
package/test/integration/execution/prd-pause.test.ts +0 -205
package/test/integration/execution/prd-resolvers.test.ts +0 -186
package/test/integration/execution/progress.test.ts +0 -34
package/test/integration/execution/runner-batching.test.ts +0 -682
package/test/integration/execution/runner-config-plugins.test.ts +0 -462
package/test/integration/execution/runner-escalation.test.ts +0 -561
package/test/integration/execution/runner-fixes.test.ts +0 -400
package/test/integration/execution/runner-plugin-integration.test.ts +0 -544
package/test/integration/execution/runner-queue-and-attempts.test.ts +0 -476
package/test/integration/execution/status-file-integration.test.ts +0 -289
package/test/integration/execution/status-file.test.ts +0 -380
package/test/integration/execution/status-writer.test.ts +0 -447
package/test/integration/execution/story-id-in-events.test.ts +0 -274
package/test/integration/interaction/interaction-chain-pipeline.test.ts +0 -476
package/test/integration/pipeline/hooks.test.ts +0 -363
package/test/integration/pipeline/pipeline-acceptance.test.ts +0 -303
package/test/integration/pipeline/pipeline-events.test.ts +0 -476
package/test/integration/pipeline/pipeline.test.ts +0 -660
package/test/integration/pipeline/reporter-lifecycle.test.ts +0 -862
package/test/integration/pipeline/verify-stage.test.ts +0 -286
package/test/integration/plan/analyze-integration.test.ts +0 -262
package/test/integration/plan/analyze-scanner.test.ts +0 -132
package/test/integration/plan/logger.test.ts +0 -461
package/test/integration/plan/plan.test.ts +0 -157
package/test/integration/plugins/config-integration.test.ts +0 -173
package/test/integration/plugins/config-resolution.test.ts +0 -523
package/test/integration/plugins/loader.test.ts +0 -644
package/test/integration/plugins/plugins-registry.test.ts +0 -747
package/test/integration/plugins/validator.test.ts +0 -564
package/test/integration/prompts/pb-004-migration.test.ts +0 -523
package/test/integration/review/review-config-commands.test.ts +0 -320
package/test/integration/review/review-config-schema.test.ts +0 -117
package/test/integration/review/review-plugin-integration.test.ts +0 -729
package/test/integration/review/review.test.ts +0 -150
package/test/integration/routing/plugin-routing-advanced.test.ts +0 -461
package/test/integration/routing/plugin-routing-core.test.ts +0 -527
package/test/integration/routing/routing-stage-bug-021.test.ts +0 -275
package/test/integration/routing/routing-stage-greenfield.test.ts +0 -287
package/test/integration/tdd/tdd-cleanup.test.ts +0 -246
package/test/integration/tdd/tdd-orchestrator-core.test.ts +0 -565
package/test/integration/tdd/tdd-orchestrator-failureCategory.test.ts +0 -355
package/test/integration/tdd/tdd-orchestrator-fallback.test.ts +0 -311
package/test/integration/tdd/tdd-orchestrator-lite.test.ts +0 -289
package/test/integration/tdd/tdd-orchestrator-prompts.test.ts +0 -260
package/test/integration/tdd/tdd-orchestrator-verdict.test.ts +0 -536
package/test/integration/tmp/headless-test/test.jsonl +0 -30
package/test/integration/verification/test-scanner.test.ts +0 -403
package/test/integration/verification/verification-asset-check.test.ts +0 -143
package/test/integration/worktree/manager.test.ts +0 -218
package/test/integration/worktree/worktree-merge.test.ts +0 -341
package/test/manual/logging-formatter-demo.ts +0 -158
package/test/ui/tui-agent-panel.test.tsx +0 -99
package/test/ui/tui-pty-integration.test.tsx +0 -146
package/test/unit/acceptance.test.ts +0 -187
package/test/unit/agent-stderr-capture.test.ts +0 -147
package/test/unit/agents/claude.test.ts +0 -107
package/test/unit/analyze-classifier.test.ts +0 -216
package/test/unit/analyze.test.ts +0 -224
package/test/unit/auto-detect.test.ts +0 -250
package/test/unit/cli-status-project-level.test.ts +0 -283
package/test/unit/cli-status.test.ts +0 -418
package/test/unit/commands/common.test.ts +0 -321
package/test/unit/commands/logs.test.ts +0 -458
package/test/unit/commands/runs.test.ts +0 -303
package/test/unit/commands/unlock.test.ts +0 -320
package/test/unit/config/defaults.test.ts +0 -70
package/test/unit/config/quality-commands-schema.test.ts +0 -72
package/test/unit/config/regression-gate-schema.test.ts +0 -160
package/test/unit/config/smart-runner-flag.test.ts +0 -250
package/test/unit/constitution-generators.test.ts +0 -161
package/test/unit/constitution.test.ts +0 -210
package/test/unit/context/context-autodetect.test.ts +0 -297
package/test/unit/context/context-build.test.ts +0 -575
package/test/unit/context/context-coverage.test.ts +0 -236
package/test/unit/context/context-error.test.ts +0 -93
package/test/unit/context/context-estimate-tokens.test.ts +0 -201
package/test/unit/context/context-format.test.ts +0 -302
package/test/unit/context/context-isolation.test.ts +0 -267
package/test/unit/context/context-sort.test.ts +0 -93
package/test/unit/context/context-story.test.ts +0 -108
package/test/unit/context/prior-failures.test.ts +0 -463
package/test/unit/context.test.ts +0 -1726
package/test/unit/cost.test.ts +0 -231
package/test/unit/crash-recovery.test.ts +0 -309
package/test/unit/escalation.test.ts +0 -127
package/test/unit/execution/lifecycle/run-completion.test.ts +0 -240
package/test/unit/execution/lifecycle/run-regression.test.ts +0 -420
package/test/unit/execution/pid-registry.test.ts +0 -241
package/test/unit/execution/sequential-executor.test.ts +0 -235
package/test/unit/execution/sfc-004-dead-code-cleanup.test.ts +0 -89
package/test/unit/execution/structured-failure.test.ts +0 -415
package/test/unit/execution-logging-stderr.test.ts +0 -157
package/test/unit/execution-stage.test.ts +0 -123
package/test/unit/fix-generator.test.ts +0 -276
package/test/unit/formatters.test.ts +0 -468
package/test/unit/greenfield.test.ts +0 -180
package/test/unit/hooks/shell-security.test.ts +0 -40
package/test/unit/interaction/auto-plugin.test.ts +0 -162
package/test/unit/interaction/human-review-trigger.test.ts +0 -165
package/test/unit/interaction-network-failures.test.ts +0 -390
package/test/unit/interaction-plugins.test.ts +0 -472
package/test/unit/logging/formatter.test.ts +0 -456
package/test/unit/merge.test.ts +0 -269
package/test/unit/metrics/aggregator.test.ts +0 -164
package/test/unit/metrics/tracker.test.ts +0 -186
package/test/unit/metrics.test.ts +0 -276
package/test/unit/optimizer/noop.optimizer.test.ts +0 -125
package/test/unit/optimizer/rule-based.optimizer.test.ts +0 -358
package/test/unit/pipeline/event-bus.test.ts +0 -105
package/test/unit/pipeline/routing-partial-override.test.ts +0 -121
package/test/unit/pipeline/runner-retry.test.ts +0 -89
package/test/unit/pipeline/stages/autofix.test.ts +0 -97
package/test/unit/pipeline/stages/completion-review-gate.test.ts +0 -218
package/test/unit/pipeline/stages/execution-ambiguity.test.ts +0 -311
package/test/unit/pipeline/stages/execution-merge-conflict.test.ts +0 -218
package/test/unit/pipeline/stages/rectify.test.ts +0 -101
package/test/unit/pipeline/stages/regression-stage.test.ts +0 -69
package/test/unit/pipeline/stages/review.test.ts +0 -201
package/test/unit/pipeline/stages/routing-idempotence.test.ts +0 -139
package/test/unit/pipeline/stages/routing-initial-complexity.test.ts +0 -321
package/test/unit/pipeline/stages/routing-persistence.test.ts +0 -380
package/test/unit/pipeline/stages/verify.test.ts +0 -267
package/test/unit/pipeline/subscribers/events-writer.test.ts +0 -227
package/test/unit/pipeline/subscribers/hooks.test.ts +0 -84
package/test/unit/pipeline/subscribers/interaction.test.ts +0 -313
package/test/unit/pipeline/subscribers/registry.test.ts +0 -149
package/test/unit/pipeline/subscribers/reporters.test.ts +0 -90
package/test/unit/pipeline/verify-smart-runner.test.ts +0 -345
package/test/unit/prd-auto-default.test.ts +0 -291
package/test/unit/prd-failure-category.test.ts +0 -177
package/test/unit/prd-get-next-story.test.ts +0 -215
package/test/unit/precheck/checks-warnings.test.ts +0 -114
package/test/unit/precheck-checks.test.ts +0 -841
package/test/unit/precheck-story-size-gate.test.ts +0 -288
package/test/unit/precheck-types.test.ts +0 -143
package/test/unit/prompts/builder.test.ts +0 -258
package/test/unit/prompts/loader.test.ts +0 -355
package/test/unit/prompts/sections/conventions.test.ts +0 -30
package/test/unit/prompts/sections/isolation.test.ts +0 -35
package/test/unit/prompts/sections/role-task.test.ts +0 -40
package/test/unit/prompts/sections/sections.test.ts +0 -238
package/test/unit/prompts/sections/story.test.ts +0 -45
package/test/unit/prompts/sections/verdict.test.ts +0 -58
package/test/unit/prompts.test.ts +0 -476
package/test/unit/queue.test.ts +0 -237
package/test/unit/rectification.test.ts +0 -285
package/test/unit/registry.test.ts +0 -288
package/test/unit/review/runner.test.ts +0 -117
package/test/unit/routing/content-hash.test.ts +0 -99
package/test/unit/routing/routing-stability.test.ts +0 -208
package/test/unit/routing/strategies/llm.test.ts +0 -306
package/test/unit/routing-advanced.test.ts +0 -313
package/test/unit/routing-core.test.ts +0 -341
package/test/unit/routing-strategies.test.ts +0 -440
package/test/unit/storyid-events.test.ts +0 -213
package/test/unit/tdd-verdict.test.ts +0 -492
package/test/unit/test-output-parser.test.ts +0 -377
package/test/unit/ui/tui-controls.test.ts +0 -335
package/test/unit/ui/tui-cost-and-pty.test.ts +0 -190
package/test/unit/ui/tui-layout.test.ts +0 -379
package/test/unit/ui/tui-stories.test.ts +0 -333
package/test/unit/unit-isolation.test.ts +0 -135
package/test/unit/utils/git.test.ts +0 -50
package/test/unit/utils/path-security.test.ts +0 -47
package/test/unit/utils-helpers.test.ts +0 -318
package/test/unit/verdict.test.ts +0 -325
package/test/unit/verification/orchestrator-types.test.ts +0 -54
package/test/unit/verification/orchestrator.test.ts +0 -66
package/test/unit/verification/smart-runner-config.test.ts +0 -163
package/test/unit/verification/smart-runner-discovery.test.ts +0 -354
package/test/unit/verification/smart-runner.test.ts +0 -262
package/test/unit/verification/strategies/acceptance.test.ts +0 -33
package/test/unit/verification/strategies/regression.test.ts +0 -87
package/test/unit/verification/strategies/scoped.test.ts +0 -100
package/test/unit/worktree-manager.test.ts +0 -159
package/tsconfig.json +0 -27

package/docs/code-review-v0.15.0.md DELETED Viewed

@@ -1,629 +0,0 @@
-# Code Review: v0.15.0 Interactive Pipeline
-**Review Date:** 2026-02-28
-**Reviewed By:** Claude Code (Sonnet 4.5)
-**Scope:** All files changed between v0.14.4 (6d27bd7) and HEAD (6fe168a)
----
-## Overall Grade: B+
-**Summary:** The v0.15.0 Interactive Pipeline implementation is well-structured with good separation of concerns. The interaction module follows clean architecture principles with a plugin-based design. However, there are several CRITICAL security and reliability issues that must be fixed immediately, plus architectural violations (files over 400 lines) that need addressing.
-**Strengths:**
-- Clean plugin architecture for interaction system
-- Good type safety throughout interaction module
-- Proper separation between CLI, Telegram, Webhook, and Auto plugins
-- Unified verification layer eliminates duplication
-- Test coverage for critical paths
-**Weaknesses:**
-- Multiple files exceed 400-line limit (violates CLAUDE.md)
-- Missing error handling for network failures in Telegram/Webhook plugins
-- No input validation for malformed webhook callbacks
-- JSON.parse without try-catch in several locations
-- Auto plugin security rule not enforced via config validation
-- Missing tests for edge cases (network failures, malformed input, race conditions)
----
-## Critical Findings
-| ID | Severity | File | Line | Description | Fix |
-|:---|:---|:---|:---|:---|:---|
-| SEC-001 | CRITICAL | `src/interaction/plugins/webhook.ts` | 158 | JSON.parse without try-catch when handling webhook callbacks. Malformed JSON can crash the server. | Wrap in try-catch, return 400 Bad Request on parse error |
-| SEC-002 | CRITICAL | `src/interaction/plugins/telegram.ts` | 79 | No error handling for fetch failure when sending messages. Network errors can crash the plugin. | Add try-catch, throw descriptive error |
-| SEC-003 | CRITICAL | `src/interaction/plugins/telegram.ts` | 244 | No error handling for getUpdates fetch failure. Can cause infinite loop on network errors. | Add try-catch with exponential backoff |
-| SEC-004 | CRITICAL | `src/interaction/plugins/auto.ts` | 72-73 | Security-review never-auto-approve rule is code-based, not config-enforced. Can be accidentally removed. | Add to config schema validation, enforce at chain level |
-| REL-001 | CRITICAL | `src/interaction/chain.ts` | 74-82 | Catch block swallows ALL errors (not just timeout). Plugin crashes are silently converted to timeout responses. | Only catch timeout-specific errors, re-throw others |
-| REL-002 | HIGH | `src/interaction/plugins/webhook.ts` | 80-90 | Polling loop has no exponential backoff. Can cause high CPU usage on stuck requests. | Add exponential backoff with max delay |
-| REL-003 | HIGH | `src/interaction/plugins/telegram.ts` | 96-111 | Polling loop has no exponential backoff. Can hammer Telegram API and get rate limited. | Add exponential backoff (start 1s, max 5s) |
-| TYPE-001 | HIGH | `src/interaction/plugins/webhook.ts` | 117, 127 | Double `as unknown as` casts to work around Bun.serve typing. Loses type safety. | Add proper type definitions for Bun.serve return type |
-| ARCH-001 | HIGH | Multiple files | - | 15 files exceed 400-line limit, violating CLAUDE.md hard requirement. | Split files as documented below |
-| LOG-001 | MEDIUM | `src/interaction/plugins/telegram.ts` | 79-82 | Telegram API error response not logged. Silent failures are hard to debug. | Log error response body before throwing |
-| LOG-002 | MEDIUM | `src/interaction/plugins/webhook.ts` | 72-74 | Webhook POST failure not logged with response body. | Log response body before throwing |
-| TEST-001 | MEDIUM | `test/unit/interaction-plugins.test.ts` | - | No tests for network failures, malformed input, or timeout edge cases. | Add failure scenario tests |
-| TEST-002 | MEDIUM | `test/unit/interaction-plugins.test.ts` | - | Auto plugin LLM call not mocked. Real LLM calls in tests are slow and flaky. | Mock Bun.spawn for LLM calls |
-| MEM-001 | LOW | `src/interaction/plugins/telegram.ts` | 43 | `pendingMessages` Map grows unbounded. Never cleaned up on timeout. | Add cleanup in sendTimeoutMessage |
-| MEM-002 | LOW | `src/interaction/plugins/webhook.ts` | 29 | `pendingResponses` Map grows unbounded. | Add cleanup in cancel() method |
----
-## Files Exceeding 400-Line Limit (ARCH-001)
-**CRITICAL:** CLAUDE.md mandates **400 lines maximum** per file. The following files violate this:
-| File | Lines | Recommended Split |
-|:---|---:|:---|
-| `src/config/schema.ts` | 853 | Split into: `schema-core.ts` (types), `schema-routing.ts`, `schema-interaction.ts`, `schema-validation.ts` |
-| `src/agents/claude.ts` | 820 | Split into: `claude-adapter.ts`, `claude-session.ts`, `claude-parser.ts` |
-| `src/tdd/orchestrator.ts` | 743 | Split into: `orchestrator.ts` (main loop), `session-manager.ts`, `verdict-handler.ts` |
-| `src/execution/sequential-executor.ts` | 648 | Split into: `executor.ts`, `story-runner.ts`, `retry-handler.ts` |
-| `src/cli/diagnose.ts` | 638 | Split into: `diagnose.ts`, `checks.ts`, `formatters.ts` |
-| `src/execution/post-verify.ts` | 584 | Split into: `post-verify.ts`, `rectification.ts`, `escalation-decision.ts` |
-| `src/context/builder.ts` | 576 | Split into: `builder.ts`, `providers.ts`, `test-coverage.ts` |
-| `src/cli/analyze.ts` | 568 | Split into: `analyze.ts`, `metrics.ts`, `reports.ts` |
-| `src/precheck/checks.ts` | 548 | Split into: `checks.ts`, `validators.ts`, `git-checks.ts` |
-| `src/cli/status.ts` | 519 | Split into: `status.ts`, `formatters.ts`, `progress.ts` |
-| `src/execution/helpers.ts` | 450 | Split into: `story-filters.ts`, `batch-helpers.ts`, `status-helpers.ts` |
-| `src/execution/escalation/tier-escalation.ts` | 439 | Split into: `tier-escalation.ts`, `cost-calculator.ts` |
-| `src/routing/strategies/llm.ts` | 432 | Split into: `llm-router.ts`, `batch-router.ts`, `cache.ts` |
-| `src/agents/types.ts` | 430 | Split into: `agent-types.ts`, `session-types.ts`, `result-types.ts` |
-| `src/execution/parallel.ts` | 404 | OK (close to limit, watch carefully) |
-**Action Required:** These files MUST be split before v0.15.0 release. This is a blocking requirement per CLAUDE.md.
----
-## Security Analysis
-### Input Validation
-**FAIL:** Webhook plugin does not validate incoming callback structure.
-```typescript
-// src/interaction/plugins/webhook.ts:158 (VULNERABLE)
-const response = JSON.parse(body) as InteractionResponse;
-this.pendingResponses.set(requestId, response);
-```
-**Attack Vector:**
-- Attacker sends `{"malicious": "payload"}` to webhook callback
-- JSON.parse succeeds but object doesn't match InteractionResponse
-- Type assertion `as InteractionResponse` bypasses type checking
-- Invalid response stored in Map, causes undefined behavior later
-**Fix:** Add Zod schema validation:
-```typescript
-import { z } from "zod";
-const InteractionResponseSchema = z.object({
-  requestId: z.string(),
-  action: z.enum(["approve", "reject", "choose", "input", "skip", "abort"]),
-  value: z.string().optional(),
-  respondedBy: z.string().optional(),
-  respondedAt: z.number(),
-});
-// In handleRequest():
-try {
-  const parsed = JSON.parse(body);
-  const response = InteractionResponseSchema.parse(parsed);
-  this.pendingResponses.set(requestId, response);
-} catch (err) {
-  return new Response("Bad Request: Invalid response format", { status: 400 });
-}
-```
-### Credential Handling
-**PASS:** Telegram bot token and webhook secrets are stored correctly:
-- Read from env vars or config (never hardcoded)
-- HMAC verification uses timing-safe comparison
-- Secrets not logged
-**Recommendation:** Add config validation to reject empty secrets:
-```typescript
-// src/config/schema.ts
-interaction: {
-  config: {
-    secret: z.string().min(32).optional(), // Enforce minimum secret length
-  }
-}
-```
-### SSRF Protection
-**N/A:** Webhook URL is user-configured (not from untrusted input). No SSRF risk.
-### Auto Plugin Security Rule
-**FAIL:** Security-review never-auto-approve rule is enforced in code only:
-```typescript
-// src/interaction/plugins/auto.ts:72-74
-if (request.metadata?.trigger === "security-review") {
-  return undefined; // Escalate to human
-}
-```
-**Issue:** This can be accidentally removed during refactoring.
-**Fix:** Enforce at config schema level:
-```typescript
-// src/config/schema.ts
-triggers: {
-  "security-review": z.object({
-    enabled: z.boolean(),
-    autoApprove: z.literal(false), // NEVER allow auto-approve for security
-  })
-}
-```
----
-## Reliability Analysis
-### Error Handling
-**FAIL:** Network errors are not handled properly.
-**Telegram Plugin (Critical):**
-```typescript
-// src/interaction/plugins/telegram.ts:68 (VULNERABLE)
-const response = await fetch(`https://api.telegram.org/bot${this.botToken}/sendMessage`, {
-  method: "POST",
-  headers: { "Content-Type": "application/json" },
-  body: JSON.stringify({...}),
-});
-const data = (await response.json()) as { ok: boolean; result: TelegramMessage };
-if (!data.ok) {
-  throw new Error("Failed to send Telegram message");
-}
-```
-**Issues:**
-1. `fetch()` can throw on network errors (connection refused, DNS failure, timeout)
-2. `response.json()` can throw on malformed JSON
-3. `data.ok` check assumes `data` is defined
-4. No retry logic for transient failures
-**Fix:**
-```typescript
-try {
-  const response = await fetch(`https://api.telegram.org/bot${this.botToken}/sendMessage`, {
-    method: "POST",
-    headers: { "Content-Type": "application/json" },
-    body: JSON.stringify({...}),
-  });
-  if (!response.ok) {
-    const errorBody = await response.text();
-    throw new Error(`Telegram API error (${response.status}): ${errorBody}`);
-  }
-  const data = await response.json();
-  if (!data.ok) {
-    throw new Error(`Telegram API returned ok=false: ${JSON.stringify(data)}`);
-  }
-  this.pendingMessages.set(request.id, data.result.message_id);
-} catch (err) {
-  const msg = err instanceof Error ? err.message : String(err);
-  throw new Error(`Failed to send Telegram message: ${msg}`);
-}
-```
-**Webhook Plugin (Critical):**
-Same issues as Telegram. Apply similar fix pattern.
-### Race Conditions
-**PASS:** No obvious race conditions found. Interaction chain is single-threaded per request.
-**Potential Issue:** Webhook server starts on first `receive()` call, but multiple concurrent calls could race:
-```typescript
-// src/interaction/plugins/webhook.ts:109
-private async startServer(): Promise<void> {
-  if (this.server) return; // Already running
-  const port = this.config.callbackPort ?? 8765;
-  this.server = Bun.serve({...}) as unknown as Server;
-}
-```
-**Race:** Two concurrent `receive()` calls could both check `if (this.server)` before either sets it.
-**Fix:** Use a mutex or Promise-based lock:
-```typescript
-private serverStartPromise: Promise<void> | null = null;
-private async startServer(): Promise<void> {
-  if (this.server) return;
-  if (this.serverStartPromise) {
-    await this.serverStartPromise;
-    return;
-  }
-  this.serverStartPromise = (async () => {
-    const port = this.config.callbackPort ?? 8765;
-    this.server = Bun.serve({...}) as unknown as Server;
-  })();
-  await this.serverStartPromise;
-  this.serverStartPromise = null;
-}
-```
-### Memory Leaks
-**MEDIUM:** Two Maps grow unbounded:
-- `TelegramInteractionPlugin.pendingMessages` (Line 42)
-- `WebhookInteractionPlugin.pendingResponses` (Line 29)
-**Issue:** When a request times out, the entry is never removed from the Map.
-**Fix:**
-```typescript
-// In sendTimeoutMessage() / cancel():
-this.pendingMessages.delete(requestId);
-this.pendingResponses.delete(requestId);
-```
-Already implemented in `sendTimeoutMessage()` for Telegram (line 331), but not in `cancel()` for Webhook.
----
-## Test Coverage Gaps
-### Current Coverage
-**Good:**
-- ✅ Plugin initialization (with/without config, env vars)
-- ✅ Config validation (missing required fields)
-- ✅ Auto plugin security-review rejection
-**Missing:**
-- ❌ Network failure scenarios (Telegram API down, webhook unreachable)
-- ❌ Malformed responses (invalid JSON, wrong structure)
-- ❌ Timeout edge cases (request expires during polling)
-- ❌ Concurrent request handling
-- ❌ Memory leak verification (Map cleanup)
-- ❌ Auto plugin LLM call (currently untested, would make real API calls)
-### Recommended Additional Tests
-```typescript
-describe("TelegramInteractionPlugin - Error Handling", () => {
-  test("should handle network failure gracefully", async () => {
-    const plugin = new TelegramInteractionPlugin();
-    await plugin.init({ botToken: "token", chatId: "123" });
-    // Mock fetch to throw network error
-    global.fetch = async () => { throw new Error("ECONNREFUSED") };
-    const request = { /* ... */ };
-    await expect(plugin.send(request)).rejects.toThrow("Failed to send Telegram message");
-  });
-  test("should handle malformed API response", async () => {
-    // Mock fetch to return invalid JSON
-    global.fetch = async () => new Response("not json");
-    // ... test
-  });
-  test("should clean up pendingMessages on timeout", async () => {
-    // ... verify Map is empty after timeout
-  });
-});
-describe("WebhookInteractionPlugin - Security", () => {
-  test("should reject malformed callback payload", async () => {
-    const plugin = new WebhookInteractionPlugin();
-    await plugin.init({ url: "http://example.com" });
-    const malformed = { malicious: "payload" };
-    const response = await plugin.handleRequest(
-      new Request("http://localhost:8765/nax/interact/test-id", {
-        method: "POST",
-        body: JSON.stringify(malformed),
-      })
-    );
-    expect(response.status).toBe(400);
-  });
-  test("should reject callback without HMAC when secret configured", async () => {
-    // ... test
-  });
-});
-describe("AutoInteractionPlugin - LLM", () => {
-  test("should make correct LLM decision (mocked)", async () => {
-    // Mock Bun.spawn to return fake LLM response
-    const originalSpawn = Bun.spawn;
-    Bun.spawn = (cmd, opts) => {
-      const mockStdout = new ReadableStream({
-        start(controller) {
-          controller.enqueue(new TextEncoder().encode(
-            JSON.stringify({
-              action: "approve",
-              confidence: 0.8,
-              reasoning: "test"
-            })
-          ));
-          controller.close();
-        }
-      });
-      return { stdout: mockStdout, stderr: new ReadableStream(), exited: Promise.resolve(0) };
-    };
-    // ... test decision logic
-    Bun.spawn = originalSpawn; // Restore
-  });
-});
-```
----
-## Architecture Compliance
-### Plugin Chain Escalation
-**Question:** Does the plugin chain correctly handle escalation when all plugins fail?
-**Answer:** **PARTIAL FAIL**
-Current behavior:
-- `InteractionChain.receive()` catches ALL errors and returns timeout response
-- If primary plugin throws, it's converted to timeout (action: "skip")
-- No escalation to secondary plugins
-**Expected behavior:**
-- Try primary plugin
-- On failure, try next plugin in chain (by priority)
-- Only return timeout if all plugins fail OR timeout reached
-**Current code:**
-```typescript
-// src/interaction/chain.ts:63-82
-async receive(requestId: string, timeout?: number): Promise<InteractionResponse> {
-  const plugin = this.getPrimary();
-  if (!plugin) {
-    throw new Error("No interaction plugin registered");
-  }
-  const timeoutMs = timeout ?? this.config.defaultTimeout;
-  try {
-    const response = await plugin.receive(requestId, timeoutMs);
-    return response;
-  } catch (err) {
-    // BUG: All errors converted to timeout, no fallback to other plugins
-    return {
-      requestId,
-      action: "skip",
-      respondedBy: "timeout",
-      respondedAt: Date.now(),
-    };
-  }
-}
-```
-**Fix:** Implement plugin fallback cascade:
-```typescript
-async receive(requestId: string, timeout?: number): Promise<InteractionResponse> {
-  const timeoutMs = timeout ?? this.config.defaultTimeout;
-  const errors: Error[] = [];
-  // Try each plugin in priority order
-  for (const entry of this.plugins) {
-    try {
-      const response = await entry.plugin.receive(requestId, timeoutMs);
-      return response;
-    } catch (err) {
-      errors.push(err instanceof Error ? err : new Error(String(err)));
-      // Continue to next plugin
-    }
-  }
-  // All plugins failed
-  throw new Error(
-    `All interaction plugins failed: ${errors.map(e => e.message).join("; ")}`
-  );
-}
-```
-### State Persistence
-**Question:** Does state persistence correctly serialize/deserialize all runner state?
-**Answer:** **PASS**
-- `RunState` interface covers all necessary fields (line 11-41)
-- Serialization uses JSON.stringify with pretty-printing (line 48)
-- Deserialization has error handling for corrupted files (line 68-70)
-- File operations use Bun-native APIs correctly
-**Recommendation:** Add Zod schema validation for loaded state:
-```typescript
-import { z } from "zod";
-const RunStateSchema = z.object({
-  feature: z.string(),
-  prdPath: z.string(),
-  iteration: z.number(),
-  totalCost: z.number(),
-  storiesCompleted: z.number(),
-  pendingInteractions: z.array(z.any()), // Use InteractionRequestSchema
-  completedInteractions: z.array(z.any()),
-  pausedAt: z.number(),
-  pauseReason: z.string(),
-  currentStoryId: z.string().optional(),
-  currentTier: z.string().optional(),
-  currentModel: z.string().optional(),
-  metadata: z.record(z.unknown()).optional(),
-});
-export async function deserializeRunState(featureDir: string): Promise<RunState | null> {
-  try {
-    const file = Bun.file(stateFile);
-    const exists = await file.exists();
-    if (!exists) return null;
-    const json = await file.text();
-    const parsed = JSON.parse(json);
-    const state = RunStateSchema.parse(parsed); // Validate before returning
-    return state as RunState;
-  } catch (err) {
-    // Log validation error for debugging
-    console.error("Invalid run state file:", err);
-    return null;
-  }
-}
-```
-### Config Schema Validation
-**Question:** Are all config schema additions validated with Zod?
-**Answer:** **PARTIAL PASS**
-New `InteractionConfig` interface exists (line 289-304) but NOT in Zod schema.
-**Current issue:**
-```typescript
-// src/config/schema.ts:289-304
-export interface InteractionConfig {
-  plugin: string;
-  config?: Record<string, unknown>;
-  defaults: { timeout: number; fallback: string };
-  triggers: Partial<Record<string, boolean | { enabled: boolean; fallback?: string; timeout?: number }>>;
-}
-```
-This is a **TypeScript interface only** — no runtime validation!
-**Fix:** Add Zod schema:
-```typescript
-const InteractionConfigSchema = z.object({
-  plugin: z.enum(["cli", "telegram", "webhook", "auto"]),
-  config: z.record(z.unknown()).optional(),
-  defaults: z.object({
-    timeout: z.number().min(1000).max(3600000), // 1s to 1hr
-    fallback: z.enum(["continue", "skip", "escalate", "abort"]),
-  }),
-  triggers: z.record(
-    z.union([
-      z.boolean(),
-      z.object({
-        enabled: z.boolean(),
-        fallback: z.enum(["continue", "skip", "escalate", "abort"]).optional(),
-        timeout: z.number().min(1000).optional(),
-      }),
-    ])
-  ).partial(),
-});
-// In main config schema:
-export const NaxConfigSchema = z.object({
-  // ... existing fields
-  interaction: InteractionConfigSchema.optional(),
-});
-```
----
-## Top 5 Fixes (Priority Order)
-### 1. Fix Webhook JSON.parse Vulnerability (SEC-001)
-**File:** `src/interaction/plugins/webhook.ts:158`
-**Impact:** CRITICAL — Can crash server on malformed input
-**Effort:** 15 minutes
-Add try-catch + Zod validation:
-```typescript
-try {
-  const parsed = JSON.parse(body);
-  const response = InteractionResponseSchema.parse(parsed);
-  this.pendingResponses.set(requestId, response);
-} catch (err) {
-  return new Response("Bad Request", { status: 400 });
-}
-```
-### 2. Add Network Error Handling to Telegram Plugin (SEC-002, SEC-003)
-**File:** `src/interaction/plugins/telegram.ts:68, 235`
-**Impact:** CRITICAL — Can crash plugin on network failures
-**Effort:** 30 minutes
-Wrap all fetch() calls in try-catch with descriptive errors.
-### 3. Fix InteractionChain Error Swallowing (REL-001)
-**File:** `src/interaction/chain.ts:74-82`
-**Impact:** CRITICAL — Masks real errors as timeouts
-**Effort:** 20 minutes
-Implement plugin fallback cascade (see Architecture section).
-### 4. Add Config Schema Validation for Interaction (SEC-004)
-**File:** `src/config/schema.ts`
-**Impact:** HIGH — Runtime validation missing
-**Effort:** 30 minutes
-Add Zod schemas for InteractionConfig and all trigger configs.
-### 5. Split Files Over 400 Lines (ARCH-001)
-**Files:** 14 files (see table above)
-**Impact:** HIGH — Violates CLAUDE.md hard requirement
-**Effort:** 4-6 hours
-Start with largest offenders:
-1. `config/schema.ts` (853 lines) → 4 files
-2. `agents/claude.ts` (820 lines) → 3 files
-3. `tdd/orchestrator.ts` (743 lines) → 3 files
----
-## Conclusion
-The v0.15.0 Interactive Pipeline implementation demonstrates solid engineering with clean separation of concerns and a well-designed plugin architecture. However, **several CRITICAL security and reliability issues must be fixed before release**.
-**Blocking Issues for Release:**
-1. ✅ Test coverage is adequate (10/10 tests pass)
-2. ❌ **SEC-001, SEC-002, SEC-003** — Network error handling (CRITICAL)
-3. ❌ **REL-001** — Error swallowing in chain (CRITICAL)
-4. ❌ **ARCH-001** — 14 files exceed 400 lines (CRITICAL per CLAUDE.md)
-**Recommended Release Plan:**
-1. Fix all CRITICAL findings (1-3 above) — **2 hours**
-2. Fix HIGH findings (config validation, type casts) — **1 hour**
-3. Split 3 largest files (config, agents, tdd) — **3 hours**
-4. Add missing tests for network failures — **2 hours**
-5. Re-run full test suite + typecheck — **30 minutes**
-6. **Total:** ~8-9 hours to production-ready
-**Post-Release Backlog:**
-- Split remaining 11 files over 400 lines
-- Add comprehensive integration tests
-- Implement exponential backoff for polling loops
-- Add Prometheus metrics for interaction success/failure rates
----
-**Reviewer Signature:** Claude Sonnet 4.5
-**Review Completed:** 2026-02-28