npm - maestro-flow - Versions diffs - 0.3.10 → 0.3.11 - Mend

maestro-flow 0.3.10 → 0.3.11

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (184) hide show

package/.claude/agents/workflow-collab-planner.md +1 -1
package/.claude/agents/workflow-executor.md +1 -1
package/.claude/agents/workflow-plan-checker.md +1 -1
package/.claude/agents/workflow-planner.md +1 -1
package/.claude/commands/learn-decompose.md +176 -176
package/.claude/commands/learn-follow.md +167 -167
package/.claude/commands/learn-retro.md +1 -1
package/.claude/commands/maestro-coordinate.md +1 -3
package/.claude/commands/manage-harvest.md +131 -131
package/.claude/commands/manage-issue.md +2 -2
package/.claude/commands/spec-add.md +67 -56
package/.claude/commands/spec-load.md +66 -64
package/.claude/commands/spec-setup.md +5 -9
package/.codex/skills/learn-decompose/SKILL.md +119 -0
package/.codex/skills/learn-follow/SKILL.md +83 -0
package/.codex/skills/learn-investigate/SKILL.md +83 -0
package/.codex/skills/learn-retro/SKILL.md +83 -0
package/.codex/skills/learn-second-opinion/SKILL.md +86 -0
package/.codex/skills/maestro/SKILL.md +151 -279
package/.codex/skills/maestro-analyze/SKILL.md +59 -71
package/.codex/skills/maestro-brainstorm/SKILL.md +452 -463
package/.codex/skills/maestro-chain/SKILL.md +95 -110
package/.codex/skills/maestro-coordinate/SKILL.md +68 -234
package/.codex/skills/maestro-execute/SKILL.md +435 -446
package/.codex/skills/maestro-fork/SKILL.md +68 -0
package/.codex/skills/maestro-init/SKILL.md +171 -167
package/.codex/skills/maestro-learn/SKILL.md +80 -0
package/.codex/skills/maestro-link-coordinate/SKILL.md +224 -220
package/.codex/skills/maestro-merge/SKILL.md +62 -0
package/.codex/skills/maestro-milestone-audit/SKILL.md +108 -103
package/.codex/skills/maestro-milestone-complete/SKILL.md +155 -149
package/.codex/skills/maestro-milestone-release/SKILL.md +70 -0
package/.codex/skills/maestro-overlay/SKILL.md +188 -185
package/.codex/skills/maestro-plan/SKILL.md +58 -69
package/.codex/skills/maestro-quick/SKILL.md +26 -23
package/.codex/skills/maestro-roadmap/SKILL.md +65 -73
package/.codex/skills/maestro-spec-generate/SKILL.md +66 -74
package/.codex/skills/maestro-ui-design/SKILL.md +34 -31
package/.codex/skills/maestro-verify/SKILL.md +556 -566
package/.codex/skills/manage-codebase-rebuild/SKILL.md +397 -405
package/.codex/skills/manage-codebase-refresh/SKILL.md +93 -82
package/.codex/skills/manage-harvest/SKILL.md +82 -0
package/.codex/skills/manage-issue/SKILL.md +80 -65
package/.codex/skills/manage-issue-discover/SKILL.md +491 -503
package/.codex/skills/manage-learn/SKILL.md +190 -186
package/.codex/skills/manage-memory/SKILL.md +95 -72
package/.codex/skills/manage-memory-capture/SKILL.md +99 -86
package/.codex/skills/manage-status/SKILL.md +102 -89
package/.codex/skills/quality-business-test/SKILL.md +228 -223
package/.codex/skills/quality-debug/SKILL.md +54 -66
package/.codex/skills/quality-integration-test/SKILL.md +532 -544
package/.codex/skills/quality-refactor/SKILL.md +197 -191
package/.codex/skills/quality-retrospective/SKILL.md +512 -505
package/.codex/skills/quality-review/SKILL.md +93 -105
package/.codex/skills/quality-sync/SKILL.md +101 -89
package/.codex/skills/quality-test/SKILL.md +202 -198
package/.codex/skills/quality-test-gen/SKILL.md +93 -104
package/.codex/skills/spec-add/SKILL.md +58 -39
package/.codex/skills/spec-load/SKILL.md +45 -40
package/.codex/skills/spec-map/SKILL.md +180 -182
package/.codex/skills/spec-setup/SKILL.md +94 -76
package/.codex/skills/team-coordinate/SKILL.md +346 -357
package/.codex/skills/team-executor/SKILL.md +70 -112
package/.codex/skills/team-lifecycle-v4/SKILL.md +311 -299
package/.codex/skills/team-quality-assurance/SKILL.md +234 -227
package/.codex/skills/team-review/SKILL.md +232 -225
package/.codex/skills/team-tech-debt/SKILL.md +78 -100
package/.codex/skills/team-testing/SKILL.md +242 -235
package/.codex/skills/wiki-connect/SKILL.md +75 -0
package/.codex/skills/wiki-digest/SKILL.md +87 -0
package/README.md +7 -4
package/README.zh-CN.md +7 -4
package/dashboard/dist-server/dashboard/src/server/routes/specs.d.ts +1 -1
package/dashboard/dist-server/dashboard/src/server/routes/specs.js +75 -30
package/dashboard/dist-server/dashboard/src/server/routes/specs.js.map +1 -1
package/dashboard/dist-server/dashboard/src/server/state/event-bus.d.ts +5 -0
package/dashboard/dist-server/dashboard/src/server/state/event-bus.js +5 -0
package/dashboard/dist-server/dashboard/src/server/state/event-bus.js.map +1 -1
package/dashboard/dist-server/dashboard/src/shared/constants.js +5 -0
package/dashboard/dist-server/dashboard/src/shared/constants.js.map +1 -1
package/dashboard/dist-server/dashboard/src/shared/team-types.d.ts +21 -0
package/dashboard/dist-server/dashboard/src/shared/team-types.js.map +1 -1
package/dashboard/dist-server/dashboard/src/shared/types.d.ts +3 -2
package/dashboard/dist-server/dashboard/src/shared/ws-protocol.d.ts +1 -1
package/dashboard/dist-server/dashboard/src/shared/ws-protocol.js.map +1 -1
package/dashboard/dist-server/src/hooks/constants.d.ts +2 -0
package/dashboard/dist-server/src/hooks/constants.js +2 -0
package/dashboard/dist-server/src/hooks/constants.js.map +1 -1
package/dist/src/commands/collab.js +4 -4
package/dist/src/commands/collab.js.map +1 -1
package/dist/src/commands/hooks.d.ts.map +1 -1
package/dist/src/commands/hooks.js +66 -1
package/dist/src/commands/hooks.js.map +1 -1
package/dist/src/commands/spec.d.ts.map +1 -1
package/dist/src/commands/spec.js +7 -2
package/dist/src/commands/spec.js.map +1 -1
package/dist/src/hooks/constants.d.ts +2 -0
package/dist/src/hooks/constants.d.ts.map +1 -1
package/dist/src/hooks/constants.js +2 -0
package/dist/src/hooks/constants.js.map +1 -1
package/dist/src/hooks/guards/index.d.ts +1 -0
package/dist/src/hooks/guards/index.d.ts.map +1 -1
package/dist/src/hooks/guards/index.js +1 -0
package/dist/src/hooks/guards/index.js.map +1 -1
package/dist/src/hooks/guards/spec-validator.d.ts +25 -0
package/dist/src/hooks/guards/spec-validator.d.ts.map +1 -0
package/dist/src/hooks/guards/spec-validator.js +66 -0
package/dist/src/hooks/guards/spec-validator.js.map +1 -0
package/dist/src/hooks/keyword-spec-injector.d.ts +21 -0
package/dist/src/hooks/keyword-spec-injector.d.ts.map +1 -0
package/dist/src/hooks/keyword-spec-injector.js +96 -0
package/dist/src/hooks/keyword-spec-injector.js.map +1 -0
package/dist/src/hooks/plugins/spec-injection-plugin.d.ts +2 -1
package/dist/src/hooks/plugins/spec-injection-plugin.d.ts.map +1 -1
package/dist/src/hooks/plugins/spec-injection-plugin.js +21 -12
package/dist/src/hooks/plugins/spec-injection-plugin.js.map +1 -1
package/dist/src/hooks/spec-bridge.d.ts +40 -0
package/dist/src/hooks/spec-bridge.d.ts.map +1 -0
package/dist/src/hooks/spec-bridge.js +97 -0
package/dist/src/hooks/spec-bridge.js.map +1 -0
package/dist/src/hooks/spec-injector.d.ts.map +1 -1
package/dist/src/hooks/spec-injector.js +18 -12
package/dist/src/hooks/spec-injector.js.map +1 -1
package/dist/src/team/phase-orchestrator.d.ts +52 -0
package/dist/src/team/phase-orchestrator.d.ts.map +1 -0
package/dist/src/team/phase-orchestrator.js +165 -0
package/dist/src/team/phase-orchestrator.js.map +1 -0
package/dist/src/team/phase-types.d.ts +51 -0
package/dist/src/team/phase-types.d.ts.map +1 -0
package/dist/src/team/phase-types.js +41 -0
package/dist/src/team/phase-types.js.map +1 -0
package/dist/src/tools/index.d.ts.map +1 -1
package/dist/src/tools/index.js +6 -0
package/dist/src/tools/index.js.map +1 -1
package/dist/src/tools/spec-entry-parser.d.ts +56 -0
package/dist/src/tools/spec-entry-parser.d.ts.map +1 -0
package/dist/src/tools/spec-entry-parser.js +196 -0
package/dist/src/tools/spec-entry-parser.js.map +1 -0
package/dist/src/tools/spec-init.d.ts.map +1 -1
package/dist/src/tools/spec-init.js +66 -92
package/dist/src/tools/spec-init.js.map +1 -1
package/dist/src/tools/spec-keyword-index.d.ts +30 -0
package/dist/src/tools/spec-keyword-index.d.ts.map +1 -0
package/dist/src/tools/spec-keyword-index.js +101 -0
package/dist/src/tools/spec-keyword-index.js.map +1 -0
package/dist/src/tools/spec-loader.d.ts +3 -3
package/dist/src/tools/spec-loader.d.ts.map +1 -1
package/dist/src/tools/spec-loader.js +49 -23
package/dist/src/tools/spec-loader.js.map +1 -1
package/dist/src/tools/team-agents.d.ts +27 -0
package/dist/src/tools/team-agents.d.ts.map +1 -0
package/dist/src/tools/team-agents.js +362 -0
package/dist/src/tools/team-agents.js.map +1 -0
package/dist/src/tools/team-mailbox.d.ts +40 -0
package/dist/src/tools/team-mailbox.d.ts.map +1 -0
package/dist/src/tools/team-mailbox.js +384 -0
package/dist/src/tools/team-mailbox.js.map +1 -0
package/dist/src/tools/team-msg.d.ts +17 -8
package/dist/src/tools/team-msg.d.ts.map +1 -1
package/dist/src/tools/team-msg.js +110 -13
package/dist/src/tools/team-msg.js.map +1 -1
package/dist/src/tools/team-tasks-mcp.d.ts +27 -0
package/dist/src/tools/team-tasks-mcp.d.ts.map +1 -0
package/dist/src/tools/team-tasks-mcp.js +408 -0
package/dist/src/tools/team-tasks-mcp.js.map +1 -0
package/package.json +2 -1
package/workflows/analyze.md +816 -816
package/workflows/brainstorm.md +471 -471
package/workflows/codebase-rebuild.md +332 -332
package/workflows/codebase-refresh.md +240 -240
package/workflows/execute.md +1 -1
package/workflows/harvest.md +420 -420
package/workflows/integration-test.md +343 -343
package/workflows/issue-discover.md +414 -414
package/workflows/map.md +111 -111
package/workflows/milestone-complete.md +176 -176
package/workflows/plan.md +1 -1
package/workflows/quick.md +497 -497
package/workflows/refactor.md +300 -300
package/workflows/roadmap.md +335 -335
package/workflows/spec-generate.md +640 -640
package/workflows/specs-add.md +46 -81
package/workflows/specs-load.md +15 -17
package/workflows/specs-setup.md +40 -161

package/.codex/skills/quality-test/SKILL.md CHANGED Viewed

@@ -1,198 +1,202 @@
----
-name: quality-test
-description: Conversational UAT with session persistence, auto-diagnosis, and gap-plan closure loop. Interactive testing flow with severity inference and parallel debug agents.
-argument-hint: "<phase> [--auto-fix] [--session ID]"
-allowed-tools: Read, Write, Edit, Bash, Glob, Grep, Agent, AskUserQuestion
----
-## Auto Mode
-No auto mode -- UAT is inherently interactive. `--auto-fix` only automates gap closure, not test execution.
-# Test (UAT)
-## Usage
-```bash
-$quality-test "3"                    # test phase 3
-$quality-test "3 --smoke"            # smoke tests first, then UAT
-$quality-test "3 --auto-fix"         # auto-trigger gap-fix loop on failures
-$quality-test "--session 04-comments"  # resume specific session
-```
-**Flags**:
-- `<phase>`: Phase number or scratch task ID
-- `--smoke`: Run cold-start smoke tests before UAT
-- `--auto-fix`: Auto-trigger gap-fix loop (plan --gaps -> execute -> re-verify) on failures
-- `--session ID`: Resume a specific UAT session
-**Output**: `{target_dir}/uat.md` + `.tests/test-plan.json` + `.tests/test-results.json` + `.tests/coverage-report.json`
----
-## Overview
-Conversational UAT: present expected behavior one test at a time, user confirms or describes issues. Severity inferred from natural language (never asked). Session persists in `uat.md` across context resets. Failed tests trigger parallel debug agent diagnosis and optional gap-fix closure.
-**Philosophy**: Show expected, ask if reality matches.
----
-## Implementation
-### Step 1: Resolve Target
-1. Parse `$ARGUMENTS` for phase number, scratch task ID, or flags
-2. **Phase mode**: set `PHASE_DIR = .workflow/phases/{NN}-{slug}/`
-3. **Scratch mode**: set `SCRATCH_DIR = .workflow/scratch/{id}/`
-4. Validate target exists and has `verification.json` -- if missing: **E002**
-### Step 2: Check Active Sessions
-```bash
-find .workflow/phases -name "uat.md" -type f 2>/dev/null | head -5
-find .workflow/scratch -name "uat.md" -type f 2>/dev/null | head -5
-```
-- If active sessions exist and no target specified: display session table, ask user to resume or start new
-- If `--session ID` specified: resume that session directly (skip to Step 9)
-- If session exists for target: offer resume or restart
-### Step 3: Smoke Tests (if --smoke)
-Run basic sanity checks (app starts, routes respond, build clean, deps installed).
-If any smoke fails: **E003** -- abort, suggest Skill({ skill: "quality-debug" })
-### Step 4: Load Verification Context
-Read from target directory: verification.json, validation.json, index.json, plan.json, `.summaries/TASK-*.md`. Build testable list from user-observable outcomes.
-### Step 5: Design Test Scenarios
-Create scenarios from testables (id T-001, name, category, expected behavior, requirement_ref). Focus on USER-OBSERVABLE outcomes. Write `{target_dir}/.tests/test-plan.json`.
-### Step 6: Create UAT File
-Archive previous `uat.md` to `.history/` if exists.
-Write `{target_dir}/uat.md` with frontmatter (status, target, started), Current Test section, Tests section (all pending), Summary counters, empty Gaps section.
-### Step 7: Present Test (Interactive Loop)
-Present one test at a time:
-```
-------------------------------------------------------------
-  TEST {number}/{total}: {name}
-------------------------------------------------------------
-Expected behavior:
-{expected}
-------------------------------------------------------------
-> Type "pass" or describe what's wrong
-------------------------------------------------------------
-```
-Wait for user response (plain text).
-### Step 8: Process Response
-| Response | Action |
-|----------|--------|
-| empty, "yes", "y", "ok", "pass", "next" | Mark as pass |
-| "skip", "can't test", "n/a" | Mark as skipped |
-| Anything else | Log as issue, infer severity |
-**Severity inference** (never ask):
-- "crashes", "error", "fails completely" -> blocker
-- "doesn't work", "wrong behavior", "broken" -> major
-- "works but...", "slow", "minor issue" -> minor
-- "color", "spacing", "typo" -> cosmetic
-- Default: major
-**On issue**: auto-create issue in `.workflow/issues/issues.jsonl` with back-reference.
-**Batched writes**: write to file on issue, every 5 passes, or completion.
-If more tests: update Current Test, loop to Step 7.
-If done: go to Step 10.
-### Step 9: Resume From File
-Read `uat.md`, find first `result: [pending]` test, announce progress, continue from there (go to Step 7).
-### Step 10: Complete Session
-1. Update `uat.md` frontmatter: status -> "complete"
-2. Archive previous result artifacts to `.history/`
-3. Write `.tests/test-results.json` and `.tests/coverage-report.json`
-4. Update `index.json` with UAT results
-5. If no issues: go to Step 13
-6. If issues found: go to Step 11
-### Step 11: Auto-Diagnose
-Cluster related gaps by component/area. Spawn one debug Agent per cluster:
-```
-Agent({
-  subagent_type: "general-purpose",
-  description: "Diagnose UAT gap cluster: {cluster_name}",
-  prompt: "Investigate UAT failures. Gaps: {gap list}. Find root cause, fix direction, affected files, evidence (file:line).",
-  run_in_background: false
-})
-```
-Update `uat.md` gaps with diagnosis results (root_cause, fix_direction, affected_files).
-### Step 12: Gap Closure Decision
-**If `--auto-fix`**: execute gap-fix loop directly.
-**Otherwise**: present diagnosis summary and offer options:
-1. Auto-fix (plan --gaps -> execute -> re-verify, max 2 iterations)
-2. Debug deep -- Skill({ skill: "quality-debug" })
-3. Plan fixes -- Skill({ skill: "maestro-plan", args: "--gaps" })
-4. Manual fix
-Update issue lifecycle during gap-fix loop (registered -> planning -> executing -> completed/failed).
-### Step 13: Report
-```
-=== UAT RESULTS ===
-Target:      {target}
-Smoke Tests: {smoke_count} run, {smoke_pass} passed
-UAT Tests:   {total} total
-  Passed:    {passed}
-  Issues:    {issues} ({blocker_count} blockers, {major_count} major)
-  Skipped:   {skipped}
-Diagnosis:   {diagnosed_count}/{issues} gaps diagnosed
-Auto-fix:    {fixed_count} gaps resolved
-Next steps:
-  {suggested_next_command}
-```
----
-## Error Handling
-| Code | Severity | Condition | Recovery |
-|------|----------|-----------|----------|
-| E001 | error | Phase or task target required | Prompt user for phase number |
-| E002 | error | Phase not verified (no verification.json) | Suggest Skill({ skill: "maestro-verify" }) |
-| E003 | error | Smoke test failed (app won't start) | Suggest Skill({ skill: "quality-debug" }) |
-| W001 | warning | Test scenarios failed | Auto-diagnose, suggest fix options |
-| W002 | warning | Coverage below threshold | Suggest Skill({ skill: "quality-test-gen" }) |
----
-## Core Rules
-- **One test at a time** -- never batch-present tests
-- **Never ask severity** -- always infer from natural language
-- **Session persistence** -- uat.md survives context resets, resume from any point
-- **Batched writes** -- minimize file I/O (on issue, every 5 passes, completion)
-- **Gap-fix loop max 2 iterations** -- prevent infinite loops
-- **Agent calls use `run_in_background: false`** for synchronous execution
-- **Auto-create issues** in `.workflow/issues/issues.jsonl` for every failed test
+---
+name: quality-test
+description: Conversational UAT with session persistence, auto-diagnosis, and gap-plan closure loop. Interactive testing flow with severity inference and parallel debug agents.
+argument-hint: "<phase> [--auto-fix] [--session ID]"
+allowed-tools: Read, Write, Edit, Bash, Glob, Grep, Agent, AskUserQuestion
+---
+<purpose>
+Conversational UAT: present expected behavior one test at a time, user confirms or describes issues. Severity inferred from natural language (never asked). Session persists in `uat.md` across context resets. Failed tests trigger parallel debug agent diagnosis and optional gap-fix closure.
+**Philosophy**: Show expected, ask if reality matches.
+</purpose>
+<context>
+$ARGUMENTS -- phase number or scratch task ID, plus optional flags.
+**Usage**:
+```bash
+$quality-test "3"                    # test phase 3
+$quality-test "3 --smoke"            # smoke tests first, then UAT
+$quality-test "3 --auto-fix"         # auto-trigger gap-fix loop on failures
+$quality-test "--session 04-comments"  # resume specific session
+```
+**Flags**:
+- `<phase>`: Phase number or scratch task ID
+- `--smoke`: Run cold-start smoke tests before UAT
+- `--auto-fix`: Auto-trigger gap-fix loop (plan --gaps -> execute -> re-verify) on failures
+- `--session ID`: Resume a specific UAT session
+No auto mode -- UAT is inherently interactive. `--auto-fix` only automates gap closure, not test execution.
+**Output**: `{target_dir}/uat.md` + `.tests/test-plan.json` + `.tests/test-results.json` + `.tests/coverage-report.json`
+</context>
+<invariants>
+1. **One test at a time** -- never batch-present tests
+2. **Never ask severity** -- always infer from natural language
+3. **Session persistence** -- uat.md survives context resets, resume from any point
+4. **Batched writes** -- minimize file I/O (on issue, every 5 passes, completion)
+5. **Gap-fix loop max 2 iterations** -- prevent infinite loops
+6. **Agent calls use `run_in_background: false`** for synchronous execution
+7. **Auto-create issues** in `.workflow/issues/issues.jsonl` for every failed test
+</invariants>
+<execution>
+### Step 1: Resolve Target
+1. Parse `$ARGUMENTS` for phase number, scratch task ID, or flags
+2. **Phase mode**: set `PHASE_DIR = .workflow/phases/{NN}-{slug}/`
+3. **Scratch mode**: set `SCRATCH_DIR = .workflow/scratch/{id}/`
+4. Validate target exists and has `verification.json` -- if missing: **E002**
+### Step 2: Check Active Sessions
+```bash
+find .workflow/phases -name "uat.md" -type f 2>/dev/null | head -5
+find .workflow/scratch -name "uat.md" -type f 2>/dev/null | head -5
+```
+- If active sessions exist and no target specified: display session table, ask user to resume or start new
+- If `--session ID` specified: resume that session directly (skip to Step 9)
+- If session exists for target: offer resume or restart
+### Step 3: Smoke Tests (if --smoke)
+Run basic sanity checks (app starts, routes respond, build clean, deps installed).
+If any smoke fails: **E003** -- abort, suggest Skill({ skill: "quality-debug" })
+### Step 4: Load Verification Context
+Read from target directory: verification.json, validation.json, index.json, plan.json, `.summaries/TASK-*.md`. Build testable list from user-observable outcomes.
+### Step 5: Design Test Scenarios
+Create scenarios from testables (id T-001, name, category, expected behavior, requirement_ref). Focus on USER-OBSERVABLE outcomes. Write `{target_dir}/.tests/test-plan.json`.
+### Step 6: Create UAT File
+Archive previous `uat.md` to `.history/` if exists.
+Write `{target_dir}/uat.md` with frontmatter (status, target, started), Current Test section, Tests section (all pending), Summary counters, empty Gaps section.
+### Step 7: Present Test (Interactive Loop)
+Present one test at a time:
+```
+------------------------------------------------------------
+  TEST {number}/{total}: {name}
+------------------------------------------------------------
+Expected behavior:
+{expected}
+------------------------------------------------------------
+> Type "pass" or describe what's wrong
+------------------------------------------------------------
+```
+Wait for user response (plain text).
+### Step 8: Process Response
+| Response | Action |
+|----------|--------|
+| empty, "yes", "y", "ok", "pass", "next" | Mark as pass |
+| "skip", "can't test", "n/a" | Mark as skipped |
+| Anything else | Log as issue, infer severity |
+**Severity inference** (never ask):
+- "crashes", "error", "fails completely" -> blocker
+- "doesn't work", "wrong behavior", "broken" -> major
+- "works but...", "slow", "minor issue" -> minor
+- "color", "spacing", "typo" -> cosmetic
+- Default: major
+**On issue**: auto-create issue in `.workflow/issues/issues.jsonl` with back-reference.
+**Batched writes**: write to file on issue, every 5 passes, or completion.
+If more tests: update Current Test, loop to Step 7.
+If done: go to Step 10.
+### Step 9: Resume From File
+Read `uat.md`, find first `result: [pending]` test, announce progress, continue from there (go to Step 7).
+### Step 10: Complete Session
+1. Update `uat.md` frontmatter: status -> "complete"
+2. Archive previous result artifacts to `.history/`
+3. Write `.tests/test-results.json` and `.tests/coverage-report.json`
+4. Update `index.json` with UAT results
+5. If no issues: go to Step 13
+6. If issues found: go to Step 11
+### Step 11: Auto-Diagnose
+Cluster related gaps by component/area. Spawn one debug Agent per cluster:
+```
+Agent({
+  subagent_type: "general-purpose",
+  description: "Diagnose UAT gap cluster: {cluster_name}",
+  prompt: "Investigate UAT failures. Gaps: {gap list}. Find root cause, fix direction, affected files, evidence (file:line).",
+  run_in_background: false
+})
+```
+Update `uat.md` gaps with diagnosis results (root_cause, fix_direction, affected_files).
+### Step 12: Gap Closure Decision
+**If `--auto-fix`**: execute gap-fix loop directly.
+**Otherwise**: present diagnosis summary and offer options:
+1. Auto-fix (plan --gaps -> execute -> re-verify, max 2 iterations)
+2. Debug deep -- Skill({ skill: "quality-debug" })
+3. Plan fixes -- Skill({ skill: "maestro-plan", args: "--gaps" })
+4. Manual fix
+Update issue lifecycle during gap-fix loop (registered -> planning -> executing -> completed/failed).
+### Step 13: Report
+```
+=== UAT RESULTS ===
+Target:      {target}
+Smoke Tests: {smoke_count} run, {smoke_pass} passed
+UAT Tests:   {total} total
+  Passed:    {passed}
+  Issues:    {issues} ({blocker_count} blockers, {major_count} major)
+  Skipped:   {skipped}
+Diagnosis:   {diagnosed_count}/{issues} gaps diagnosed
+Auto-fix:    {fixed_count} gaps resolved
+Next steps:
+  {suggested_next_command}
+```
+</execution>
+<error_codes>
+| Code | Severity | Condition | Recovery |
+|------|----------|-----------|----------|
+| E001 | error | Phase or task target required | Prompt user for phase number |
+| E002 | error | Phase not verified (no verification.json) | Suggest Skill({ skill: "maestro-verify" }) |
+| E003 | error | Smoke test failed (app won't start) | Suggest Skill({ skill: "quality-debug" }) |
+| W001 | warning | Test scenarios failed | Auto-diagnose, suggest fix options |
+| W002 | warning | Coverage below threshold | Suggest Skill({ skill: "quality-test-gen" }) |
+</error_codes>
+<success_criteria>
+- [ ] Target resolved and verification context loaded
+- [ ] Test scenarios designed from user-observable outcomes
+- [ ] UAT file created with session persistence
+- [ ] Tests presented one at a time, severity inferred (never asked)
+- [ ] Issues auto-created for all failures
+- [ ] Diagnosis completed for failed test clusters
+- [ ] Gap closure offered (auto-fix or manual options)
+- [ ] Final report with pass/fail counts and next steps
+</success_criteria>