npm - maestro-flow - Versions diffs - 0.5.3 → 0.5.31 - Mend

maestro-flow 0.5.3 → 0.5.31

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (259)

package/.agents/skills/learn-follow/SKILL.md +114 -114
package/.agents/skills/learn-investigate/SKILL.md +138 -139
package/.agents/skills/learn-second-opinion/SKILL.md +105 -109
package/.agents/skills/maestro/SKILL.md +2 -10
package/.agents/skills/maestro-amend/SKILL.md +152 -152
package/.agents/skills/maestro-analyze/SKILL.md +201 -252
package/.agents/skills/maestro-blueprint/SKILL.md +175 -190
package/.agents/skills/maestro-brainstorm/SKILL.md +196 -200
package/.agents/skills/maestro-collab/SKILL.md +159 -159
package/.agents/skills/maestro-companion/SKILL.md +517 -517
package/.agents/skills/maestro-composer/SKILL.md +173 -164
package/.agents/skills/maestro-execute/SKILL.md +169 -170
package/.agents/skills/maestro-fork/SKILL.md +97 -96
package/.agents/skills/maestro-grill/SKILL.md +161 -162
package/.agents/skills/maestro-guard/SKILL.md +93 -92
package/.agents/skills/maestro-impeccable/SKILL.md +296 -253
package/.agents/skills/maestro-init/SKILL.md +117 -118
package/.agents/skills/maestro-merge/SKILL.md +73 -66
package/.agents/skills/maestro-milestone-audit/SKILL.md +4 -10
package/.agents/skills/maestro-milestone-complete/SKILL.md +6 -7
package/.agents/skills/maestro-milestone-release/SKILL.md +122 -131
package/.agents/skills/maestro-next/SKILL.md +241 -245
package/.agents/skills/maestro-overlay/SKILL.md +176 -166
package/.agents/skills/maestro-plan/SKILL.md +211 -197
package/.agents/skills/maestro-player/SKILL.md +167 -167
package/.agents/skills/maestro-quick/SKILL.md +69 -63
package/.agents/skills/maestro-ralph/SKILL.md +2 -36
package/.agents/skills/maestro-ralph-beta/SKILL.md +861 -872
package/.agents/skills/maestro-ralph-execute/SKILL.md +234 -234
package/.agents/skills/maestro-roadmap/SKILL.md +159 -172
package/.agents/skills/maestro-swarm-workflow/SKILL.md +229 -250
package/.agents/skills/maestro-tools-execute/SKILL.md +108 -103
package/.agents/skills/maestro-tools-register/SKILL.md +148 -143
package/.agents/skills/maestro-ui-codify/SKILL.md +103 -86
package/.agents/skills/maestro-universal-workflow/SKILL.md +534 -547
package/.agents/skills/maestro-update/SKILL.md +109 -106
package/.agents/skills/manage-codebase-rebuild/SKILL.md +73 -71
package/.agents/skills/manage-harvest/SKILL.md +83 -81
package/.agents/skills/manage-issue/SKILL.md +59 -60
package/.agents/skills/manage-issue-discover/SKILL.md +70 -68
package/.agents/skills/manage-kg-extractors/SKILL.md +130 -0
package/.agents/skills/manage-knowhow/SKILL.md +70 -66
package/.agents/skills/manage-knowhow-capture/SKILL.md +79 -69
package/.agents/skills/manage-knowledge-audit/SKILL.md +91 -74
package/.agents/skills/manage-status/SKILL.md +52 -42
package/.agents/skills/manage-wiki/SKILL.md +69 -58
package/.agents/skills/odyssey-debug/SKILL.md +445 -459
package/.agents/skills/odyssey-improve/SKILL.md +477 -491
package/.agents/skills/odyssey-planex/SKILL.md +576 -587
package/.agents/skills/odyssey-review-test-fix/SKILL.md +400 -413
package/.agents/skills/odyssey-ui/SKILL.md +431 -448
package/.agents/skills/quality-auto-test/SKILL.md +140 -123
package/.agents/skills/quality-debug/SKILL.md +145 -106
package/.agents/skills/quality-refactor/SKILL.md +91 -53
package/.agents/skills/quality-retrospective/SKILL.md +109 -63
package/.agents/skills/quality-review/SKILL.md +141 -114
package/.agents/skills/quality-sync/SKILL.md +74 -38
package/.agents/skills/quality-test/SKILL.md +133 -103
package/.agents/skills/security-audit/SKILL.md +217 -166
package/.agents/skills/spec-add/SKILL.md +66 -59
package/.agents/skills/spec-load/SKILL.md +68 -68
package/.agents/skills/spec-remove/SKILL.md +42 -42
package/.agents/skills/spec-setup/SKILL.md +38 -41
package/.agy/skills/learn-follow/SKILL.md +114 -114
package/.agy/skills/learn-investigate/SKILL.md +138 -139
package/.agy/skills/learn-second-opinion/SKILL.md +105 -109
package/.agy/skills/maestro/SKILL.md +2 -10
package/.agy/skills/maestro-amend/SKILL.md +152 -152
package/.agy/skills/maestro-analyze/SKILL.md +201 -252
package/.agy/skills/maestro-blueprint/SKILL.md +175 -190
package/.agy/skills/maestro-brainstorm/SKILL.md +196 -200
package/.agy/skills/maestro-collab/SKILL.md +159 -159
package/.agy/skills/maestro-companion/SKILL.md +517 -517
package/.agy/skills/maestro-composer/SKILL.md +173 -164
package/.agy/skills/maestro-execute/SKILL.md +169 -170
package/.agy/skills/maestro-fork/SKILL.md +97 -96
package/.agy/skills/maestro-grill/SKILL.md +161 -162
package/.agy/skills/maestro-guard/SKILL.md +93 -92
package/.agy/skills/maestro-impeccable/SKILL.md +296 -253
package/.agy/skills/maestro-init/SKILL.md +117 -118
package/.agy/skills/maestro-merge/SKILL.md +73 -66
package/.agy/skills/maestro-milestone-audit/SKILL.md +4 -10
package/.agy/skills/maestro-milestone-complete/SKILL.md +6 -7
package/.agy/skills/maestro-milestone-release/SKILL.md +122 -131
package/.agy/skills/maestro-next/SKILL.md +241 -245
package/.agy/skills/maestro-overlay/SKILL.md +176 -166
package/.agy/skills/maestro-plan/SKILL.md +211 -197
package/.agy/skills/maestro-player/SKILL.md +167 -167
package/.agy/skills/maestro-quick/SKILL.md +69 -63
package/.agy/skills/maestro-ralph/SKILL.md +2 -36
package/.agy/skills/maestro-ralph-beta/SKILL.md +861 -872
package/.agy/skills/maestro-ralph-execute/SKILL.md +234 -234
package/.agy/skills/maestro-roadmap/SKILL.md +159 -172
package/.agy/skills/maestro-swarm-workflow/SKILL.md +229 -250
package/.agy/skills/maestro-tools-execute/SKILL.md +108 -103
package/.agy/skills/maestro-tools-register/SKILL.md +148 -143
package/.agy/skills/maestro-ui-codify/SKILL.md +103 -86
package/.agy/skills/maestro-universal-workflow/SKILL.md +534 -547
package/.agy/skills/maestro-update/SKILL.md +109 -106
package/.agy/skills/manage-codebase-rebuild/SKILL.md +73 -71
package/.agy/skills/manage-harvest/SKILL.md +83 -81
package/.agy/skills/manage-issue/SKILL.md +59 -60
package/.agy/skills/manage-issue-discover/SKILL.md +70 -68
package/.agy/skills/manage-kg-extractors/SKILL.md +130 -0
package/.agy/skills/manage-knowhow/SKILL.md +70 -66
package/.agy/skills/manage-knowhow-capture/SKILL.md +79 -69
package/.agy/skills/manage-knowledge-audit/SKILL.md +91 -74
package/.agy/skills/manage-status/SKILL.md +52 -42
package/.agy/skills/manage-wiki/SKILL.md +69 -58
package/.agy/skills/odyssey-debug/SKILL.md +445 -459
package/.agy/skills/odyssey-improve/SKILL.md +477 -491
package/.agy/skills/odyssey-planex/SKILL.md +576 -587
package/.agy/skills/odyssey-review-test-fix/SKILL.md +400 -413
package/.agy/skills/odyssey-ui/SKILL.md +431 -448
package/.agy/skills/quality-auto-test/SKILL.md +140 -123
package/.agy/skills/quality-debug/SKILL.md +145 -106
package/.agy/skills/quality-refactor/SKILL.md +91 -53
package/.agy/skills/quality-retrospective/SKILL.md +109 -63
package/.agy/skills/quality-review/SKILL.md +141 -114
package/.agy/skills/quality-sync/SKILL.md +74 -38
package/.agy/skills/quality-test/SKILL.md +133 -103
package/.agy/skills/security-audit/SKILL.md +217 -166
package/.agy/skills/spec-add/SKILL.md +66 -59
package/.agy/skills/spec-load/SKILL.md +68 -68
package/.agy/skills/spec-remove/SKILL.md +42 -42
package/.agy/skills/spec-setup/SKILL.md +38 -41
package/.claude/commands/learn-follow.md +127 -127
package/.claude/commands/learn-investigate.md +151 -152
package/.claude/commands/learn-second-opinion.md +118 -122
package/.claude/commands/maestro-amend.md +164 -164
package/.claude/commands/maestro-analyze.md +215 -266
package/.claude/commands/maestro-blueprint.md +189 -204
package/.claude/commands/maestro-brainstorm.md +209 -213
package/.claude/commands/maestro-collab.md +172 -172
package/.claude/commands/maestro-companion.md +531 -531
package/.claude/commands/maestro-composer.md +188 -179
package/.claude/commands/maestro-execute.md +183 -184
package/.claude/commands/maestro-fork.md +111 -110
package/.claude/commands/maestro-grill.md +175 -176
package/.claude/commands/maestro-guard.md +103 -102
package/.claude/commands/maestro-impeccable.md +311 -268
package/.claude/commands/maestro-init.md +130 -131
package/.claude/commands/maestro-merge.md +87 -80
package/.claude/commands/maestro-milestone-audit.md +4 -10
package/.claude/commands/maestro-milestone-complete.md +6 -7
package/.claude/commands/maestro-milestone-release.md +136 -145
package/.claude/commands/maestro-next.md +253 -257
package/.claude/commands/maestro-overlay.md +188 -178
package/.claude/commands/maestro-plan.md +225 -211
package/.claude/commands/maestro-player.md +182 -182
package/.claude/commands/maestro-quick.md +83 -77
package/.claude/commands/maestro-ralph-beta.md +875 -886
package/.claude/commands/maestro-ralph-execute.md +247 -247
package/.claude/commands/maestro-ralph.md +2 -36
package/.claude/commands/maestro-roadmap.md +173 -186
package/.claude/commands/maestro-swarm-workflow.md +243 -264
package/.claude/commands/maestro-tools-execute.md +122 -117
package/.claude/commands/maestro-tools-register.md +162 -157
package/.claude/commands/maestro-ui-codify.md +117 -100
package/.claude/commands/maestro-universal-workflow.md +548 -561
package/.claude/commands/maestro-update.md +122 -119
package/.claude/commands/maestro.md +2 -10
package/.claude/commands/manage-codebase-rebuild.md +87 -85
package/.claude/commands/manage-harvest.md +97 -95
package/.claude/commands/manage-issue-discover.md +83 -81
package/.claude/commands/manage-issue.md +72 -73
package/.claude/commands/manage-kg-extractors.md +128 -0
package/.claude/commands/manage-knowhow-capture.md +92 -82
package/.claude/commands/manage-knowhow.md +83 -79
package/.claude/commands/manage-knowledge-audit.md +105 -88
package/.claude/commands/manage-status.md +62 -52
package/.claude/commands/manage-wiki.md +82 -71
package/.claude/commands/odyssey-debug.md +459 -473
package/.claude/commands/odyssey-improve.md +491 -505
package/.claude/commands/odyssey-planex.md +590 -601
package/.claude/commands/odyssey-review-test-fix.md +414 -427
package/.claude/commands/odyssey-ui.md +445 -462
package/.claude/commands/quality-auto-test.md +153 -136
package/.claude/commands/quality-debug.md +159 -120
package/.claude/commands/quality-refactor.md +105 -67
package/.claude/commands/quality-retrospective.md +123 -77
package/.claude/commands/quality-review.md +155 -128
package/.claude/commands/quality-sync.md +88 -52
package/.claude/commands/quality-test.md +147 -117
package/.claude/commands/security-audit.md +230 -179
package/.claude/commands/spec-add.md +77 -70
package/.claude/commands/spec-load.md +78 -78
package/.claude/commands/spec-remove.md +55 -55
package/.claude/commands/spec-setup.md +49 -52
package/dist/src/cli.js +1 -1
package/dist/src/cli.js.map +1 -1
package/dist/src/commands/kg.d.ts.map +1 -1
package/dist/src/commands/kg.js +11 -5
package/dist/src/commands/kg.js.map +1 -1
package/dist/src/graph/kg/extraction/code/code-extractor.d.ts +2 -0
package/dist/src/graph/kg/extraction/code/code-extractor.d.ts.map +1 -1
package/dist/src/graph/kg/extraction/code/code-extractor.js +32 -3
package/dist/src/graph/kg/extraction/code/code-extractor.js.map +1 -1
package/dist/src/graph/kg/extraction/code/plugin-engine.d.ts +35 -0
package/dist/src/graph/kg/extraction/code/plugin-engine.d.ts.map +1 -0
package/dist/src/graph/kg/extraction/code/plugin-engine.js +573 -0
package/dist/src/graph/kg/extraction/code/plugin-engine.js.map +1 -0
package/dist/src/graph/kg/extraction/code/plugin-types.d.ts +95 -0
package/dist/src/graph/kg/extraction/code/plugin-types.d.ts.map +1 -0
package/dist/src/graph/kg/extraction/code/plugin-types.js +5 -0
package/dist/src/graph/kg/extraction/code/plugin-types.js.map +1 -0
package/dist/src/graph/kg/extraction/orchestrator.d.ts.map +1 -1
package/dist/src/graph/kg/extraction/orchestrator.js +17 -5
package/dist/src/graph/kg/extraction/orchestrator.js.map +1 -1
package/dist/src/graph/kg/schema.sql +16 -11
package/dist/src/graph/kg/surface/cli.d.ts.map +1 -1
package/dist/src/graph/kg/surface/cli.js +153 -56
package/dist/src/graph/kg/surface/cli.js.map +1 -1
package/dist/src/hooks/workspace.d.ts +4 -2
package/dist/src/hooks/workspace.d.ts.map +1 -1
package/dist/src/hooks/workspace.js +6 -2
package/dist/src/hooks/workspace.js.map +1 -1
package/package.json +91 -91
package/workflows/analyze.md +25 -49
package/workflows/auto-test.md +699 -699
package/workflows/blueprint.md +403 -431
package/workflows/brainstorm.md +54 -195
package/workflows/business-test.md +570 -570
package/workflows/claude-instructions.md +23 -51
package/workflows/codex-instructions.md +27 -77
package/workflows/coding-philosophy.md +69 -69
package/workflows/command-authoring.md +823 -823
package/workflows/debug.md +43 -98
package/workflows/delegate-usage.md +39 -241
package/workflows/execute.md +4 -53
package/workflows/grill.md +12 -56
package/workflows/harvest.md +22 -68
package/workflows/init.md +148 -148
package/workflows/instruction-authoring-guide.md +97 -0
package/workflows/issue-execute.md +110 -110
package/workflows/issue-gaps-analyze.codex.md +260 -260
package/workflows/issue-gaps-analyze.md +216 -216
package/workflows/issue-plan.md +110 -110
package/workflows/issue.md +338 -346
package/workflows/knowhow.md +0 -32
package/workflows/learn.md +277 -277
package/workflows/maestro-chain-execute.md +20 -20
package/workflows/refactor.md +22 -44
package/workflows/retrospective.md +16 -65
package/workflows/review.md +446 -486
package/workflows/roadmap.md +35 -132
package/workflows/skill-authoring.md +265 -265
package/workflows/spec-generate.md +470 -470
package/workflows/specs-remove.md +104 -104
package/workflows/sync.md +11 -41
package/workflows/test-gen.md +226 -226
package/workflows/test.md +385 -475
package/workflows/ui-design.md +391 -391
package/workflows/ui-style.md +199 -199
package/workflows/wiki-connect.md +151 -151
package/workflows/wiki-digest.md +178 -178
package/workflows/wiki-manage.md +109 -109
package/workflows/cli-tools-usage.md +0 -252
package/workflows/delegate-protocol.codex.md +0 -65

package/.agents/skills/quality-auto-test/SKILL.md CHANGED

@@ -13,126 +13,143 @@ allowed-tools:
 ---
 <!-- Open-standard mirror generated by scripts/build-agents-standard.mjs — do not edit; re-run after editing .claude/ source. -->
-<purpose>
-Run unified automated testing via CSV layer pipeline. Reads project state to auto-select the optimal scenario source — PRD specs (when spec package exists), coverage gaps (when Nyquist audit found gaps), or code exploration (default). All sources converge into a CSV pipeline: discover infrastructure → plan → build scenarios.csv → write tests per layer (spawn_agents_on_csv parallel) → execute → diagnose failures (spawn_agents_on_csv parallel) → iterate → report.
-Key mechanisms:
-- **Intelligent routing**: Reads `.tests/`, `.workflow/blueprint/`, `verification.json` to auto-select source — no mode flag needed
-- **CSV parallel test writing**: Per-layer `spawn_agents_on_csv` — each agent writes one test file independently
-- **CSV parallel failure diagnosis**: Failed scenarios dispatched via `spawn_agents_on_csv` for classification + fix
-- **Unified iteration engine**: Nested inner loop (fix test_defects via diagnosis CSV, max 3/layer) + outer loop (adaptive strategy, max N iterations)
-- **Layers as waves**: L0→L1→L2→L3 sequential (fail-fast on critical), scenarios within layer parallel
-- **Discovery board**: `discoveries.ndjson` shared across all agents/iterations (append-only)
-- **Degenerate modes**: `--max-iter 1` = single-pass generation; default = full iterative cycle
-- **Session persistence**: CSV state + state.json survive context resets, resume from any point
-</purpose>
-<required_reading>
-@~/.maestro/workflows/auto-test.md
-</required_reading>
-<context>
-Phase or task: $ARGUMENTS (required — phase number)
-**Flags:**
-- `--max-iter N` — Maximum outer iterations (default: 5). Set to 1 for single-pass generation only.
-- `--layer L` — Start from or restrict to specific layer (L0|L1|L2|L3)
-- `--dry-run` — Generate test plan only, do not execute
-- `--re-run` — Re-run only previously failed/blocked scenarios
-**Intelligent routing** (auto-detected from project state):
-| Priority | Condition | Route | Reference skill |
-|----------|-----------|-------|-----------------|
-| 1 | Active session exists (state.json status=running) | Resume | — |
-| 2 | --re-run flag + previous failures | Re-run | — |
-| 3 | Spec package exists (REQ-*.md) | spec | quality-business-test (separate skill) |
-| 4 | Nyquist gaps exist (verification.json) | gap | quality-test-gen (separate skill) |
-| 5 | Default | code | quality-integration-test (separate skill) |
-Flags, artifact context resolution, and output formats defined in workflow auto-test.md.
-### Pre-load context (before test generation)
-1. **Test specs + tools**: Run `maestro spec load --category test` to load test conventions (framework, patterns, naming). Apply to all generated tests.
-2. **Coding specs**: Run `maestro spec load --category coding` to understand coding patterns for accurate test targeting.
-3. **Role Knowledge**:
-   - Browse: `maestro search --category test`
-   - Load task-relevant entries: `maestro wiki load <id1> [id2...]`
-4. All are optional — proceed without if unavailable.
-</context>
-<execution>
-Follow '~/.maestro/workflows/auto-test.md' completely.
-**Command-specific extensions (not in workflow):**
-**Review findings integration** (from related review artifacts):
-- Extract critical/high findings as additional test scenarios, marked `source: "review_finding"`
-- When review verdict is "BLOCK" and review-finding tests fail, suggest quality-debug
-**Debug root cause integration** (from related debug artifacts):
-- Generate regression test scenarios from confirmed root causes, marked `source: "debug_root_cause"`
-**Register artifact on completion:**
-```
-Append to state.json.artifacts[]:
-{
-  id: nextArtifactId(artifacts, "test"),  // TST-001
-  type: "test",
-  milestone: current_milestone,
-  phase: target_phase,
-  scope: "phase",
-  path: "scratch/{YYYYMMDD}-auto-test-P{N}-{slug}",
-  status: issues == 0 ? "completed" : "failed",
-  depends_on: exec_art.id,
-  harvested: false,
-  created_at: start_time,
-  completed_at: now()
-}
-```
-**Next-step routing on completion:**
-- Converged (>=95%) → `/quality-review {phase}`
-- All requirements verified (spec source) → `/maestro-milestone-audit`
-- Bugs discovered → `/quality-debug --from-uat {phase}`
-- Max iter, >80% → `/quality-test {phase}` for manual UAT
-- Max iter, <80% → `/quality-debug {phase}`
-- Coverage still low → `/quality-auto-test {phase} --layer {missing}`
-- Re-run all pass → `/quality-review {phase}`
-- Single pass, all pass → `/quality-test {phase}`
-</execution>
-<error_codes>
-| Code | Severity | Condition | Recovery |
-|------|----------|-----------|----------|
-| E001 | error | Phase argument required (no active sessions) | Prompt user for phase number |
-| E002 | error | Phase not found in artifact registry | Check state.json artifacts |
-| E003 | error | No test framework detected | Install test framework or configure test runner |
-| W001 | warning | One or more test scenarios failed | Auto-iterate or suggest fix options |
-| W002 | warning | Max iterations reached without convergence | Review reflection-log.md, suggest debug |
-| W003 | warning | Degraded spec mode (no full spec package) | Consider running maestro-roadmap --mode full |
-</error_codes>
-<success_criteria>
-- [ ] Phase resolved from artifact registry
-- [ ] Route auto-selected from project state (spec/gap/code)
-- [ ] Active sessions checked, resume offered if applicable
-- [ ] Scenarios extracted and normalized to unified format
-- [ ] Test infrastructure discovered (framework, patterns, conventions)
-- [ ] test-plan.json generated with layer distribution
-- [ ] User confirmed plan (or --dry-run stopped here)
-- [ ] Tests written following RED-GREEN methodology and existing patterns
-- [ ] Tests executed progressively (L0→L3) with fail-fast on critical
-- [ ] Iteration engine ran (inner: test_defect fix, outer: strategy adjust)
-- [ ] state.json, report.json, reflection-log.md written
-- [ ] Test confidence scored per iteration (Step 7.5) with 5-dimension factor model
-- [ ] Convergence check includes confidence >= 60% alongside pass_rate threshold
-- [ ] Pressure pass completed on highest-pass-rate layer before completion
-- [ ] report.json includes confidence section
-- [ ] index.json updated with auto_test section
-- [ ] If spec source: traceability matrix built, traceability.md written
-- [ ] If failures: issues auto-created in issues.jsonl
-- [ ] If gap source: validation.json gaps updated (MISSING→COVERED)
-- [ ] Next step routed based on convergence status
-</success_criteria>
+<purpose>
+Unified automated testing via CSV layer pipeline. Auto-selects scenario source from project state (specs / coverage gaps / code exploration), then: discover → plan → build CSV → write tests (parallel) → execute → diagnose failures (parallel) → iterate → report.
+Layers L0→L3 sequential (fail-fast), scenarios within layer parallel. `--max-iter 1` = single-pass; default = full iterative cycle.
+</purpose>
+<required_reading>
+@~/.maestro/workflows/auto-test.md
+</required_reading>
+<context>
+Phase or task: $ARGUMENTS (required — phase number)
+**Flags:**
+- `--max-iter N` — Maximum outer iterations (default: 5). Set to 1 for single-pass generation only.
+- `--layer L` — Start from or restrict to specific layer (L0|L1|L2|L3)
+- `--dry-run` — Generate test plan only, do not execute
+- `--re-run` — Re-run only previously failed/blocked scenarios
+**Intelligent routing** (auto-detected from project state):
+| Priority | Condition | Route | Reference skill |
+|----------|-----------|-------|-----------------|
+| 1 | Active session exists (state.json status=running) | Resume | — |
+| 2 | --re-run flag + previous failures | Re-run | — |
+| 3 | Spec package exists (REQ-*.md) | spec | quality-business-test (separate skill) |
+| 4 | Nyquist gaps exist (verification.json) | gap | quality-test-gen (separate skill) |
+| 5 | Default | code | quality-integration-test (separate skill) |
+Flags, artifact context resolution, and output formats defined in workflow auto-test.md.
+### Pre-load context (before test generation)
+1. **Test specs + tools**: Run `maestro spec load --category test` to load test conventions (framework, patterns, naming). Apply to all generated tests.
+2. **Coding specs**: Run `maestro spec load --category coding` to understand coding patterns for accurate test targeting.
+3. **Role Knowledge**:
+   - Browse: `maestro search --category test`
+   - Load task-relevant entries: `maestro wiki load <id1> [id2...]`
+4. All are optional — proceed without if unavailable.
+</context>
+<execution>
+Follow '~/.maestro/workflows/auto-test.md' completely.
+### Phase Gates (MANDATORY, BLOCKING)
+**GATE 1: Setup → Plan** (Route Selection → CSV Generation)
+- REQUIRED: Phase resolved from artifact registry. E001/E002 if missing.
+- REQUIRED: Route auto-selected (spec/gap/code) from project state.
+- REQUIRED: Test infrastructure discovered (framework, patterns, conventions).
+- BLOCKED if missing: cannot generate test plan without route and framework.
+**GATE 2: Plan → Write** (CSV → Test Generation)
+- REQUIRED: test-plan.json generated with layer distribution (L0→L3).
+- REQUIRED: User confirmed plan (unless `--dry-run` stops here).
+- BLOCKED if plan missing or rejected: do not write tests.
+**GATE 3: Write → Execute** (Test Generation → Execution)
+- REQUIRED: All planned test files written following existing patterns.
+- REQUIRED: Tests follow RED-GREEN methodology.
+- BLOCKED if tests incomplete: finish writing before execution.
+**GATE 4: Execute → Report** (Iteration → Completion)
+- REQUIRED: Progressive execution completed (L0→L3, fail-fast on critical).
+- REQUIRED: Iteration engine ran (inner: test_defect fix, outer: strategy adjust).
+- REQUIRED: Confidence scored with 5-dimension factor model (>= 60%).
+- REQUIRED: Pressure pass completed on highest-pass-rate layer.
+- BLOCKED if iteration incomplete: continue iterating before reporting.
+**Command-specific extensions (not in workflow):**
+**Review findings integration** (from related review artifacts):
+- Extract critical/high findings as additional test scenarios, marked `source: "review_finding"`
+- When review verdict is "BLOCK" and review-finding tests fail, suggest quality-debug
+**Debug root cause integration** (from related debug artifacts):
+- Generate regression test scenarios from confirmed root causes, marked `source: "debug_root_cause"`
+**Register artifact on completion:**
+```
+Append to state.json.artifacts[]:
+{
+  id: nextArtifactId(artifacts, "test"),  // TST-001
+  type: "test",
+  milestone: current_milestone,
+  phase: target_phase,
+  scope: "phase",
+  path: "scratch/{YYYYMMDD}-auto-test-P{N}-{slug}",
+  status: issues == 0 ? "completed" : "failed",
+  depends_on: exec_art.id,
+  harvested: false,
+  created_at: start_time,
+  completed_at: now()
+}
+```
+**Next-step routing on completion:**
+- Converged (>=95%) → `/quality-review {phase}`
+- All requirements verified (spec source) → `/maestro-milestone-audit`
+- Bugs discovered → `/quality-debug --from-uat {phase}`
+- Max iter, >80% → `/quality-test {phase}` for manual UAT
+- Max iter, <80% → `/quality-debug {phase}`
+- Coverage still low → `/quality-auto-test {phase} --layer {missing}`
+- Re-run all pass → `/quality-review {phase}`
+- Single pass, all pass → `/quality-test {phase}`
+</execution>
+<error_codes>
+| Code | Severity | Condition | Recovery |
+|------|----------|-----------|----------|
+| E001 | error | Phase argument required (no active sessions) | Prompt user for phase number |
+| E002 | error | Phase not found in artifact registry | Check state.json artifacts |
+| E003 | error | No test framework detected | Install test framework or configure test runner |
+| W001 | warning | One or more test scenarios failed | Auto-iterate or suggest fix options |
+| W002 | warning | Max iterations reached without convergence | Review reflection-log.md, suggest debug |
+| W003 | warning | Degraded spec mode (no full spec package) | Consider running maestro-roadmap --mode full |
+</error_codes>
+<success_criteria>
+- [ ] Phase resolved from artifact registry
+- [ ] Route auto-selected from project state (spec/gap/code)
+- [ ] Active sessions checked, resume offered if applicable
+- [ ] Scenarios extracted and normalized to unified format
+- [ ] Test infrastructure discovered (framework, patterns, conventions)
+- [ ] test-plan.json generated with layer distribution
+- [ ] User confirmed plan (or --dry-run stopped here)
+- [ ] Tests written following RED-GREEN methodology and existing patterns
+- [ ] Tests executed progressively (L0→L3) with fail-fast on critical
+- [ ] Iteration engine ran (inner: test_defect fix, outer: strategy adjust)
+- [ ] state.json, report.json, reflection-log.md written
+- [ ] Test confidence scored per iteration (Step 7.5) with 5-dimension factor model
+- [ ] Convergence check includes confidence >= 60% alongside pass_rate threshold
+- [ ] Pressure pass completed on highest-pass-rate layer before completion
+- [ ] report.json includes confidence section
+- [ ] index.json updated with auto_test section
+- [ ] If spec source: traceability matrix built, traceability.md written
+- [ ] If failures: issues auto-created in issues.jsonl
+- [ ] If gap source: validation.json gaps updated (MISSING→COVERED)
+- [ ] Next step routed based on convergence status
+</success_criteria>
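
Note on the hunk above: the routing table and the `nextArtifactId(artifacts, "test")  // TST-001` call are only described in prose and pseudocode in SKILL.md. The following is a minimal TypeScript sketch of how they could behave, not the package's actual implementation. The `ProjectState` fields, the `Route` names, the `PREFIX` map, and the "highest existing number plus one" numbering rule are all assumptions made for illustration; only the priority order and the `TST-001` / `DBG-001` id shapes come from the diff.

```typescript
// Hypothetical summary of the project state the routing table inspects.
interface ProjectState {
  sessionStatus?: "running" | "completed" | "failed"; // state.json status
  reRun: boolean;           // --re-run flag was passed
  previousFailures: number; // failed/blocked scenarios from an earlier run
  hasSpecPackage: boolean;  // REQ-*.md files exist
  hasNyquistGaps: boolean;  // verification.json reports coverage gaps
}

type Route = "resume" | "re-run" | "spec" | "gap" | "code";

// Walk the table top to bottom; the first matching condition wins.
function selectRoute(s: ProjectState): Route {
  if (s.sessionStatus === "running") return "resume";    // priority 1
  if (s.reRun && s.previousFailures > 0) return "re-run"; // priority 2
  if (s.hasSpecPackage) return "spec";                    // priority 3
  if (s.hasNyquistGaps) return "gap";                     // priority 4
  return "code";                                          // priority 5 (default)
}

interface Artifact {
  id: string;
  type: string;
}

// Assumed type-to-prefix mapping; only TST and DBG appear in the diff.
const PREFIX: Record<string, string> = { test: "TST", debug: "DBG" };

// Assumed numbering rule: highest existing number for the prefix, plus one,
// zero-padded to three digits.
function nextArtifactId(artifacts: Artifact[], type: string): string {
  const prefix = PREFIX[type];
  if (prefix === undefined) {
    throw new Error("unknown artifact type: " + type);
  }
  const pattern = new RegExp("^" + prefix + "-(\\d+)$");
  let max = 0;
  for (const a of artifacts) {
    const m = pattern.exec(a.id);
    if (m) {
      max = Math.max(max, Number(m[1]));
    }
  }
  let digits = String(max + 1);
  while (digits.length < 3) {
    digits = "0" + digits;
  }
  return prefix + "-" + digits;
}
```

Under these assumptions, a project with both a spec package and Nyquist gaps routes to `spec`, because priority 3 is checked before priority 4, and the first test artifact in an empty registry gets the id `TST-001`.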

package/.agents/skills/quality-debug/SKILL.md CHANGED

@@ -14,109 +14,148 @@ allowed-tools:
 ---
 <!-- Open-standard mirror generated by scripts/build-agents-standard.mjs — do not edit; re-run after editing .claude/ source. -->
-<purpose>
-Debug issues using scientific method with subagent isolation and persistent debug state. Three entry modes (standalone, from-UAT, parallel) and structured root cause collection with UAT feedback loop. Full algorithm defined in workflow debug.md.
-</purpose>
-<required_reading>
-@~/.maestro/workflows/debug.md
-</required_reading>
-<context>
-User's issue: $ARGUMENTS
-**Flags:**
-- `--from-uat <phase>` -- Read gaps from phase's uat.md as pre-filled symptoms
-- `--parallel` -- Spawn parallel debug agents (one per gap cluster)
-**All context via state.json.artifacts[]:**
-```
-related = artifacts.filter(a =>
-  a.phase === target_phase && a.milestone === current_milestone
-).sort_by(completed_at asc)
-```
-Each artifact's type determines its outputs at `.workflow/{a.path}/`:
-- **execute** → .summaries/, .task/ (source of code changes)
-- **review** → review.json (findings guide hypothesis formation)
-- **debug** → understanding.md, evidence.ndjson (prior investigations, avoid re-investigation)
-- **test** → uat.md (--from-uat gap source), .tests/
-Extract conclusions from related artifacts that may affect this debug session — review findings guide investigation direction, prior debug avoids redundant work.
-### Pre-load (optional, proceed without)
-- Codebase docs: `.workflow/codebase/ARCHITECTURE.md` → module boundaries
-- Wiki: `maestro search "<symptom keywords>" --json` → prior investigations
-- Specs: `maestro spec load --category debug --keyword "<symptom>"` → known issues/workarounds
-- Role knowledge: `maestro search --category debug` → select relevant → `maestro wiki load`
-**Output**: `DEBUG_DIR = .workflow/scratch/{YYYYMMDD}-debug-P{N}-{slug}/` (P{N} = phase number when phase-scoped; omit for standalone). Output directory rules defined in workflow debug.md Step 4.
-</context>
-<execution>
-Follow '~/.maestro/workflows/debug.md' completely.
-**Register artifact on completion (phase-scoped only):**
-```
-Append to state.json.artifacts[]:
-{
-  id: nextArtifactId(artifacts, "debug"),  // DBG-001
-  type: "debug",
-  milestone: current_milestone,
-  phase: target_phase,
-  scope: "phase",
-  path: "scratch/{YYYYMMDD}-debug-P{N}-{slug}",
-  status: all_diagnosed ? "completed" : "failed",
-  depends_on: triggering_review_id || exec_art.id,
-  harvested: false,
-  created_at: start_time,
-  completed_at: now()
-}
-```
-### Post-debug Knowledge Inquiry
-| Condition | Ask | Route |
-|-----------|-----|-------|
-| Recurring root cause pattern (seen in prior debug) | "Document in debug-notes.md?" | spec-add debug |
-| Non-obvious fix / workaround | "Record as learning?" | spec-add learning |
-| Root cause = architectural boundary violation | "Update architecture-constraints.md?" | spec-add arch |
-On confirm → `invoke_skill("spec-add", "<category> <content> --description \"<summary>\"")`.
-**Next-step routing on completion:**
-- Root cause found, fix needed → `/maestro-plan {phase} --gaps`
-- Root cause found (from UAT), auto-fix → `/quality-test {phase} --auto-fix`
-- Inconclusive, need more info → `/quality-debug {issue} -c` (resume session)
-- Standalone fix already applied → `/maestro-execute {phase}`
-</execution>
-<error_codes>
-| Code | Severity | Condition | Recovery |
-|------|----------|-----------|----------|
-| E001 | error | Issue description required (no arguments, no active sessions) | Check arguments format, re-run with correct input |
-| E002 | error | UAT file not found for --from-uat phase | Verify UAT file exists for specified phase |
-| W001 | warning | Existing debug session found, offer resume | Review existing sessions, choose resume or new |
-| W002 | warning | Checkpoint reached, user input needed | Provide requested input to continue |
-| W003 | warning | Some gaps inconclusive, partial diagnosis | Review partial results, retry inconclusive gaps |
-</error_codes>
-<success_criteria>
-- [ ] Input parsed: standalone, --from-uat, or --parallel mode determined
-- [ ] Active sessions checked and resume offered if applicable
-- [ ] Symptoms gathered (interactive) or loaded from UAT (pre-filled)
-- [ ] Debug output directory created (phase .debug/ or scratch/)
-- [ ] Debug agent(s) spawned with full symptom context
-- [ ] If --parallel: one agent per gap cluster, all concurrent
-- [ ] evidence.ndjson written with structured NDJSON entries
-- [ ] understanding.md tracks evolving understanding per cluster
-- [ ] Root causes collected with fix_direction and affected_files
-- [ ] Multi-factor confidence scored per gap (Step 7.0) replacing simple high/medium/low
-- [ ] Readiness gate checked before ROOT CAUSE declaration
-- [ ] Pressure pass completed on confirmed hypothesis
-- [ ] Confidence table appended to understanding.md
-- [ ] If --from-uat: uat.md gaps updated with diagnosis artifacts
-- [ ] Results unified into diagnosis summary with confidence section
-- [ ] Next step routed (plan --gaps + execute if fix needed, verify if fix applied, resume if inconclusive)
-</success_criteria>
+<purpose>
+Debug issues using the scientific method, with subagent isolation and persistent debug state. Supports three entry modes (standalone, from-UAT, parallel) and structured root-cause collection with a UAT feedback loop. The full algorithm is defined in the workflow file debug.md.
+</purpose>
+<required_reading>
+@~/.maestro/workflows/debug.md
+</required_reading>
+<context>
+User's issue: $ARGUMENTS
+**Flags:**
+- `--from-uat <phase>` -- Read gaps from phase's uat.md as pre-filled symptoms
+- `--parallel` -- Spawn parallel debug agents (one per gap cluster)
+**All context via state.json.artifacts[]:**
+```
+related = artifacts.filter(a =>
+  a.phase === target_phase && a.milestone === current_milestone
+).sort_by(completed_at asc)
+```
+Each artifact's type determines its outputs at `.workflow/{a.path}/`:
+- **execute** → .summaries/, .task/ (source of code changes)
+- **review** → review.json (findings guide hypothesis formation)
+- **debug** → understanding.md, evidence.ndjson (prior investigations, avoid re-investigation)
+- **test** → uat.md (--from-uat gap source), .tests/
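The artifact lookup and the type-to-output mapping above can be sketched in TypeScript roughly as follows. This is an illustrative sketch only: the `Artifact` interface, the `OUTPUTS` table, and the helper names `relatedArtifacts` and `outputPaths` are assumptions made for the example, and `completed_at` is assumed to be an ISO 8601 string (the source does not state its format).

```typescript
// Hypothetical artifact shape; only the fields the lookup needs are modeled.
type ArtifactType = "execute" | "review" | "debug" | "test";

interface Artifact {
  id: string;
  type: ArtifactType;
  milestone: string;
  phase: number;
  path: string;         // relative to .workflow/
  completed_at: string; // assumed ISO 8601 timestamp
}

// Outputs per artifact type, copied from the list above.
const OUTPUTS: Record<ArtifactType, string[]> = {
  execute: [".summaries/", ".task/"],
  review: ["review.json"],
  debug: ["understanding.md", "evidence.ndjson"],
  test: ["uat.md", ".tests/"],
};

// Artifacts for one phase of one milestone, oldest completion first.
// filter() returns a new array, so the input is not mutated by sort().
function relatedArtifacts(
  artifacts: Artifact[],
  phase: number,
  milestone: string,
): Artifact[] {
  return artifacts
    .filter((a) => a.phase === phase && a.milestone === milestone)
    .sort((a, b) => Date.parse(a.completed_at) - Date.parse(b.completed_at));
}

// Expand an artifact into the output locations implied by its type.
function outputPaths(a: Artifact): string[] {
  return OUTPUTS[a.type].map((name) => `.workflow/${a.path}/${name}`);
}
```

Keeping the mapping in one table means a new artifact type only needs one new entry, and the compiler flags any type that lacks one.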
+### Pre-load (optional, proceed without)
+- Codebase docs: `.workflow/codebase/ARCHITECTURE.md` → module boundaries
+- Wiki: `maestro search "<symptom keywords>" --json` → prior investigations
+- Specs: `maestro spec load --category debug --keyword "<symptom>"` → known issues/workarounds
+- Role knowledge: `maestro search --category debug` → select relevant → `maestro wiki load`
+**Output**: `DEBUG_DIR = .workflow/scratch/{YYYYMMDD}-debug-P{N}-{slug}/` (P{N} = phase number when phase-scoped; omit for standalone). Output directory rules defined in workflow debug.md Step 4.
+</context>
+<execution>
+Follow '~/.maestro/workflows/debug.md' completely.
+### Phase Gates (MANDATORY, BLOCKING)
+**GATE 1: Input → Investigation**
+- REQUIRED: Symptoms gathered (interactive) or loaded from UAT (--from-uat).
+- REQUIRED: Debug output directory created.
+- BLOCKED if missing: cannot investigate without symptom baseline.
+**GATE 2: Investigation → Diagnosis**
+- REQUIRED: Debug agent(s) spawned with full symptom context.
+- REQUIRED: evidence.ndjson written with structured entries.
+- REQUIRED: understanding.md tracks evolving understanding.
+- BLOCKED if incomplete: continue investigation before declaring root cause.
+**GATE 3: Diagnosis → Completion**
+- REQUIRED: Root causes collected with fix_direction and affected_files.
+- REQUIRED: Multi-factor confidence scored per gap.
+- REQUIRED: Readiness gate checked and pressure pass completed.
+- BLOCKED if inconclusive: resume session or escalate.
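One way to picture the three blocking gates is as an ordered checklist evaluated against a progress record. The sketch below is only a model of that idea: the `DebugState` field names and the `firstBlockedGate` helper are hypothetical and do not come from the workflow file, which remains the authority on how gates are enforced.

```typescript
// Hypothetical progress flags, one per REQUIRED item in the gates above.
interface DebugState {
  symptomsGathered: boolean;
  debugDirCreated: boolean;
  agentsSpawned: boolean;
  evidenceWritten: boolean;
  understandingTracked: boolean;
  rootCausesCollected: boolean;
  confidenceScored: boolean;
  readinessAndPressureDone: boolean;
}

interface Gate {
  name: string;
  required: (keyof DebugState)[];
}

// Gates in the order they must be passed.
const GATES: Gate[] = [
  { name: "GATE 1: Input → Investigation", required: ["symptomsGathered", "debugDirCreated"] },
  {
    name: "GATE 2: Investigation → Diagnosis",
    required: ["agentsSpawned", "evidenceWritten", "understandingTracked"],
  },
  {
    name: "GATE 3: Diagnosis → Completion",
    required: ["rootCausesCollected", "confidenceScored", "readinessAndPressureDone"],
  },
];

// Returns the first gate that blocks, with its unmet requirements,
// or null when every gate passes.
function firstBlockedGate(
  state: DebugState,
): { gate: string; missing: string[] } | null {
  for (const gate of GATES) {
    const missing = gate.required.filter((key) => !state[key]);
    if (missing.length > 0) {
      return { gate: gate.name, missing };
    }
  }
  return null;
}
```

Because the gates are checked in order, a later gate is never reported while an earlier one is still unmet, which mirrors the blocking behavior described above.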
+**Register artifact on completion (phase-scoped only):**
+```
+Append to state.json.artifacts[]:
+{
+  id: nextArtifactId(artifacts, "debug"),  // DBG-001
+  type: "debug",
+  milestone: current_milestone,
+  phase: target_phase,
+  scope: "phase",
+  path: "scratch/{YYYYMMDD}-debug-P{N}-{slug}",
+  status: all_diagnosed ? "completed" : "failed",
+  depends_on: triggering_review_id || exec_art.id,
+  harvested: false,
+  created_at: start_time,
+  completed_at: now()
+}
+```
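The registration block calls `nextArtifactId(artifacts, "debug")` and annotates the result as `DBG-001`, but its implementation is not shown here. A minimal sketch of what such a helper might do, assuming IDs are a type prefix plus a zero-padded three-digit counter, is below. Only the `debug` to `DBG` prefix is taken from the source; the numbering scheme and the `ArtifactRef` type are assumptions.

```typescript
// Hypothetical minimal artifact reference; only the id is inspected.
interface ArtifactRef {
  id: string;
}

// Only the "debug" -> "DBG" prefix appears in the source;
// prefixes for other artifact types are not shown and are omitted here.
const PREFIXES: Record<string, string> = { debug: "DBG" };

// Assumed scheme: find the highest existing number for the prefix, add one,
// and zero-pad to three digits.
function nextArtifactId(artifacts: ArtifactRef[], type: string): string {
  const prefix = PREFIXES[type];
  if (prefix === undefined) {
    throw new Error(`no ID prefix known for type "${type}"`);
  }
  const highest = artifacts
    .filter((a) => a.id.startsWith(`${prefix}-`))
    .map((a) => Number.parseInt(a.id.slice(prefix.length + 1), 10))
    .filter((n) => !Number.isNaN(n))
    .reduce((max, n) => Math.max(max, n), 0);
  return `${prefix}-${String(highest + 1).padStart(3, "0")}`;
}
```

Taking the maximum rather than the count avoids reusing an ID when earlier artifacts have been removed from the list.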
+### Post-debug Knowledge Inquiry
+| Condition | Ask | Route |
+|-----------|-----|-------|
+| Recurring root cause pattern (seen in prior debug) | "Document in debug-notes.md?" | spec-add debug |
+| Non-obvious fix / workaround | "Record as learning?" | spec-add learning |
+| Root cause = architectural boundary violation | "Update architecture-constraints.md?" | spec-add arch |
+On confirm → `invoke_skill("spec-add", "<category> <content> --description \"<summary>\"")`.
+</execution>
+<completion>
+### Standalone report
+```
+--- COMPLETION STATUS ---
+STATUS: DONE|DONE_WITH_CONCERNS|NEEDS_RETRY
+CONCERNS: {description if applicable}
+--- END STATUS ---
+```
+### Ralph-invoked completion
+End the step by calling the CLI (no text block output):
+```
+maestro ralph complete <idx> --status {STATUS} [--evidence {path}]
+```
+### Next-step routing
+| Condition | Suggestion |
+|-----------|-----------|
+| Root cause found, fix needed | `/maestro-plan {phase} --gaps` |
+| Root cause found (from UAT), auto-fix | `/quality-test {phase} --auto-fix` |
+| Inconclusive, need more info | `/quality-debug {issue} -c` (resume) |
+| Standalone fix already applied | `/maestro-execute {phase}` |
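The routing table above maps each outcome to a suggested command. As a rough sketch, it could be written as a function. The `Outcome` labels and the `nextStep` name are hypothetical; the command strings are copied from the table.

```typescript
// Hypothetical labels for the four conditions in the routing table.
type Outcome =
  | "fix_needed"
  | "fix_needed_from_uat"
  | "inconclusive"
  | "fix_applied";

// Returns the suggested follow-up command for a debug outcome.
function nextStep(outcome: Outcome, phase: string, issue: string): string {
  switch (outcome) {
    case "fix_needed":
      return `/maestro-plan ${phase} --gaps`;
    case "fix_needed_from_uat":
      return `/quality-test ${phase} --auto-fix`;
    case "inconclusive":
      return `/quality-debug ${issue} -c`; // resume the existing session
    case "fix_applied":
      return `/maestro-execute ${phase}`;
  }
}
```

With a closed union type, the compiler reports an error if a new outcome is added without a matching route.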
+</completion>
+<error_codes>
+| Code | Severity | Condition | Recovery |
+|------|----------|-----------|----------|
+| E001 | error | Issue description required (no arguments, no active sessions) | Check arguments format, re-run with correct input |
+| E002 | error | UAT file not found for --from-uat phase | Verify UAT file exists for specified phase |
+| W001 | warning | Existing debug session found, offer resume | Review existing sessions, choose resume or new |
+| W002 | warning | Checkpoint reached, user input needed | Provide requested input to continue |
+| W003 | warning | Some gaps inconclusive, partial diagnosis | Review partial results, retry inconclusive gaps |
+</error_codes>
+<success_criteria>
+- [ ] Input parsed: standalone, --from-uat, or --parallel mode determined
+- [ ] Active sessions checked and resume offered if applicable
+- [ ] Symptoms gathered (interactive) or loaded from UAT (pre-filled)
+- [ ] Debug output directory created (phase .debug/ or scratch/)
+- [ ] Debug agent(s) spawned with full symptom context
+- [ ] If --parallel: one agent per gap cluster, all concurrent
+- [ ] evidence.ndjson written with structured NDJSON entries
+- [ ] understanding.md tracks evolving understanding per cluster
+- [ ] Root causes collected with fix_direction and affected_files
+- [ ] Multi-factor confidence scored per gap (Step 7.0) replacing simple high/medium/low
+- [ ] Readiness gate checked before ROOT CAUSE declaration
+- [ ] Pressure pass completed on confirmed hypothesis
+- [ ] Confidence table appended to understanding.md
+- [ ] If --from-uat: uat.md gaps updated with diagnosis artifacts
+- [ ] Results unified into diagnosis summary with confidence section
+- [ ] Next step routed (plan --gaps + execute if fix needed, verify if fix applied, resume if inconclusive)
+</success_criteria>