npm - maestro-flow - Versions diffs - 0.3.9 → 0.3.11 - Mend

maestro-flow 0.3.9 → 0.3.11

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (281) hide show

package/.claude/agents/workflow-collab-planner.md +1 -1
package/.claude/agents/workflow-executor.md +1 -1
package/.claude/agents/workflow-plan-checker.md +1 -1
package/.claude/agents/workflow-planner.md +1 -1
package/.claude/commands/learn-decompose.md +176 -176
package/.claude/commands/learn-follow.md +167 -167
package/.claude/commands/learn-retro.md +1 -1
package/.claude/commands/maestro-analyze.md +46 -3
package/.claude/commands/maestro-coordinate.md +1 -3
package/.claude/commands/maestro-execute.md +14 -0
package/.claude/commands/maestro-plan.md +16 -0
package/.claude/commands/manage-harvest.md +131 -131
package/.claude/commands/manage-issue-discover.md +2 -2
package/.claude/commands/manage-issue.md +5 -5
package/.claude/commands/spec-add.md +67 -56
package/.claude/commands/spec-load.md +66 -64
package/.claude/commands/spec-setup.md +5 -9
package/.codex/skills/learn-decompose/SKILL.md +119 -0
package/.codex/skills/learn-follow/SKILL.md +83 -0
package/.codex/skills/learn-investigate/SKILL.md +83 -0
package/.codex/skills/learn-retro/SKILL.md +83 -0
package/.codex/skills/learn-second-opinion/SKILL.md +86 -0
package/.codex/skills/maestro/SKILL.md +335 -0
package/.codex/skills/maestro-analyze/SKILL.md +84 -75
package/.codex/skills/maestro-brainstorm/SKILL.md +452 -463
package/.codex/skills/maestro-chain/SKILL.md +233 -0
package/.codex/skills/maestro-coordinate/SKILL.md +167 -278
package/.codex/skills/maestro-execute/SKILL.md +435 -438
package/.codex/skills/maestro-fork/SKILL.md +68 -0
package/.codex/skills/maestro-init/SKILL.md +171 -167
package/.codex/skills/maestro-learn/SKILL.md +80 -0
package/.codex/skills/maestro-link-coordinate/SKILL.md +224 -220
package/.codex/skills/maestro-merge/SKILL.md +62 -0
package/.codex/skills/maestro-milestone-audit/SKILL.md +108 -103
package/.codex/skills/maestro-milestone-complete/SKILL.md +155 -149
package/.codex/skills/maestro-milestone-release/SKILL.md +70 -0
package/.codex/skills/maestro-overlay/SKILL.md +188 -185
package/.codex/skills/maestro-plan/SKILL.md +66 -69
package/.codex/skills/maestro-quick/SKILL.md +26 -23
package/.codex/skills/maestro-roadmap/SKILL.md +65 -73
package/.codex/skills/maestro-spec-generate/SKILL.md +66 -74
package/.codex/skills/maestro-ui-design/SKILL.md +34 -31
package/.codex/skills/maestro-verify/SKILL.md +556 -566
package/.codex/skills/manage-codebase-rebuild/SKILL.md +397 -405
package/.codex/skills/manage-codebase-refresh/SKILL.md +93 -82
package/.codex/skills/manage-harvest/SKILL.md +82 -0
package/.codex/skills/manage-issue/SKILL.md +80 -65
package/.codex/skills/manage-issue-discover/SKILL.md +491 -503
package/.codex/skills/manage-learn/SKILL.md +190 -186
package/.codex/skills/manage-memory/SKILL.md +95 -72
package/.codex/skills/manage-memory-capture/SKILL.md +99 -86
package/.codex/skills/manage-status/SKILL.md +102 -89
package/.codex/skills/quality-business-test/SKILL.md +228 -223
package/.codex/skills/quality-debug/SKILL.md +54 -66
package/.codex/skills/quality-integration-test/SKILL.md +532 -544
package/.codex/skills/quality-refactor/SKILL.md +197 -191
package/.codex/skills/quality-retrospective/SKILL.md +512 -505
package/.codex/skills/quality-review/SKILL.md +93 -105
package/.codex/skills/quality-sync/SKILL.md +101 -89
package/.codex/skills/quality-test/SKILL.md +202 -198
package/.codex/skills/quality-test-gen/SKILL.md +93 -104
package/.codex/skills/spec-add/SKILL.md +58 -39
package/.codex/skills/spec-load/SKILL.md +45 -40
package/.codex/skills/spec-map/SKILL.md +180 -182
package/.codex/skills/spec-setup/SKILL.md +94 -76
package/.codex/skills/team-coordinate/SKILL.md +346 -357
package/.codex/skills/team-executor/SKILL.md +70 -112
package/.codex/skills/team-lifecycle-v4/SKILL.md +311 -299
package/.codex/skills/team-quality-assurance/SKILL.md +234 -227
package/.codex/skills/team-review/SKILL.md +232 -225
package/.codex/skills/team-tech-debt/SKILL.md +78 -100
package/.codex/skills/team-testing/SKILL.md +242 -235
package/.codex/skills/wiki-connect/SKILL.md +75 -0
package/.codex/skills/wiki-digest/SKILL.md +87 -0
package/README.md +14 -11
package/README.zh-CN.md +14 -11
package/chains/issue-lifecycle.json +13 -13
package/chains/singles/issue-analyze.json +3 -3
package/chains/singles/issue-execute.json +3 -3
package/chains/singles/issue-plan.json +3 -3
package/dashboard/dist-server/dashboard/src/server/commander/commander-agent.js +2 -2
package/dashboard/dist-server/dashboard/src/server/commander/commander-agent.js.map +1 -1
package/dashboard/dist-server/dashboard/src/server/coordinator/chain-map.js +3 -3
package/dashboard/dist-server/dashboard/src/server/coordinator/chain-map.js.map +1 -1
package/dashboard/dist-server/dashboard/src/server/routes/issues.js +34 -0
package/dashboard/dist-server/dashboard/src/server/routes/issues.js.map +1 -1
package/dashboard/dist-server/dashboard/src/server/routes/specs.d.ts +1 -1
package/dashboard/dist-server/dashboard/src/server/routes/specs.js +75 -30
package/dashboard/dist-server/dashboard/src/server/routes/specs.js.map +1 -1
package/dashboard/dist-server/dashboard/src/server/state/event-bus.d.ts +5 -0
package/dashboard/dist-server/dashboard/src/server/state/event-bus.js +5 -0
package/dashboard/dist-server/dashboard/src/server/state/event-bus.js.map +1 -1
package/dashboard/dist-server/dashboard/src/server/ws/handlers/execution-handler.js +2 -3
package/dashboard/dist-server/dashboard/src/server/ws/handlers/execution-handler.js.map +1 -1
package/dashboard/dist-server/dashboard/src/shared/constants.js +5 -0
package/dashboard/dist-server/dashboard/src/shared/constants.js.map +1 -1
package/dashboard/dist-server/dashboard/src/shared/issue-types.d.ts +5 -0
package/dashboard/dist-server/dashboard/src/shared/issue-types.js.map +1 -1
package/dashboard/dist-server/dashboard/src/shared/normalize-task.d.ts +2 -0
package/dashboard/dist-server/dashboard/src/shared/normalize-task.js +75 -0
package/dashboard/dist-server/dashboard/src/shared/normalize-task.js.map +1 -0
package/dashboard/dist-server/dashboard/src/shared/team-types.d.ts +21 -0
package/dashboard/dist-server/dashboard/src/shared/team-types.js.map +1 -1
package/dashboard/dist-server/dashboard/src/shared/types.d.ts +3 -2
package/dashboard/dist-server/dashboard/src/shared/ws-protocol.d.ts +1 -1
package/dashboard/dist-server/dashboard/src/shared/ws-protocol.js.map +1 -1
package/dashboard/dist-server/src/hooks/constants.d.ts +92 -12
package/dashboard/dist-server/src/hooks/constants.js +151 -16
package/dashboard/dist-server/src/hooks/constants.js.map +1 -1
package/dashboard/dist-server/src/types/index.d.ts +5 -0
package/dist/src/commands/collab.d.ts +1 -34
package/dist/src/commands/collab.d.ts.map +1 -1
package/dist/src/commands/collab.js +8 -76
package/dist/src/commands/collab.js.map +1 -1
package/dist/src/commands/hooks.d.ts +5 -1
package/dist/src/commands/hooks.d.ts.map +1 -1
package/dist/src/commands/hooks.js +115 -10
package/dist/src/commands/hooks.js.map +1 -1
package/dist/src/commands/install-ui/InstallConfirm.d.ts +3 -1
package/dist/src/commands/install-ui/InstallConfirm.d.ts.map +1 -1
package/dist/src/commands/install-ui/InstallConfirm.js +3 -1
package/dist/src/commands/install-ui/InstallConfirm.js.map +1 -1
package/dist/src/commands/install-ui/InstallExecution.d.ts.map +1 -1
package/dist/src/commands/install-ui/InstallExecution.js +5 -1
package/dist/src/commands/install-ui/InstallExecution.js.map +1 -1
package/dist/src/commands/install-ui/InstallFlow.d.ts.map +1 -1
package/dist/src/commands/install-ui/InstallFlow.js +7 -3
package/dist/src/commands/install-ui/InstallFlow.js.map +1 -1
package/dist/src/commands/install-ui/StatuslineConfig.d.ts +6 -1
package/dist/src/commands/install-ui/StatuslineConfig.d.ts.map +1 -1
package/dist/src/commands/install-ui/StatuslineConfig.js +27 -5
package/dist/src/commands/install-ui/StatuslineConfig.js.map +1 -1
package/dist/src/commands/spec.d.ts.map +1 -1
package/dist/src/commands/spec.js +7 -2
package/dist/src/commands/spec.js.map +1 -1
package/dist/src/hooks/__tests__/statusline-visual-test.d.ts +7 -0
package/dist/src/hooks/__tests__/statusline-visual-test.d.ts.map +1 -0
package/dist/src/hooks/__tests__/statusline-visual-test.js +236 -0
package/dist/src/hooks/__tests__/statusline-visual-test.js.map +1 -0
package/dist/src/hooks/constants.d.ts +92 -12
package/dist/src/hooks/constants.d.ts.map +1 -1
package/dist/src/hooks/constants.js +151 -16
package/dist/src/hooks/constants.js.map +1 -1
package/dist/src/hooks/guards/index.d.ts +2 -0
package/dist/src/hooks/guards/index.d.ts.map +1 -1
package/dist/src/hooks/guards/index.js +2 -0
package/dist/src/hooks/guards/index.js.map +1 -1
package/dist/src/hooks/guards/preflight-guard.d.ts +29 -0
package/dist/src/hooks/guards/preflight-guard.d.ts.map +1 -0
package/dist/src/hooks/guards/preflight-guard.js +95 -0
package/dist/src/hooks/guards/preflight-guard.js.map +1 -0
package/dist/src/hooks/guards/spec-validator.d.ts +25 -0
package/dist/src/hooks/guards/spec-validator.d.ts.map +1 -0
package/dist/src/hooks/guards/spec-validator.js +66 -0
package/dist/src/hooks/guards/spec-validator.js.map +1 -0
package/dist/src/hooks/index.d.ts +1 -0
package/dist/src/hooks/index.d.ts.map +1 -1
package/dist/src/hooks/index.js +1 -0
package/dist/src/hooks/index.js.map +1 -1
package/dist/src/hooks/keyword-spec-injector.d.ts +21 -0
package/dist/src/hooks/keyword-spec-injector.d.ts.map +1 -0
package/dist/src/hooks/keyword-spec-injector.js +96 -0
package/dist/src/hooks/keyword-spec-injector.js.map +1 -0
package/dist/src/hooks/plugins/spec-injection-plugin.d.ts +2 -1
package/dist/src/hooks/plugins/spec-injection-plugin.d.ts.map +1 -1
package/dist/src/hooks/plugins/spec-injection-plugin.js +21 -12
package/dist/src/hooks/plugins/spec-injection-plugin.js.map +1 -1
package/dist/src/hooks/preflight-core.d.ts +37 -0
package/dist/src/hooks/preflight-core.d.ts.map +1 -0
package/dist/src/hooks/preflight-core.js +86 -0
package/dist/src/hooks/preflight-core.js.map +1 -0
package/dist/src/hooks/spec-bridge.d.ts +40 -0
package/dist/src/hooks/spec-bridge.d.ts.map +1 -0
package/dist/src/hooks/spec-bridge.js +97 -0
package/dist/src/hooks/spec-bridge.js.map +1 -0
package/dist/src/hooks/spec-injector.d.ts.map +1 -1
package/dist/src/hooks/spec-injector.js +18 -12
package/dist/src/hooks/spec-injector.js.map +1 -1
package/dist/src/hooks/statusline.d.ts +8 -17
package/dist/src/hooks/statusline.d.ts.map +1 -1
package/dist/src/hooks/statusline.js +269 -112
package/dist/src/hooks/statusline.js.map +1 -1
package/dist/src/i18n/locales/en.d.ts.map +1 -1
package/dist/src/i18n/locales/en.js +5 -0
package/dist/src/i18n/locales/en.js.map +1 -1
package/dist/src/i18n/locales/zh.d.ts.map +1 -1
package/dist/src/i18n/locales/zh.js +5 -0
package/dist/src/i18n/locales/zh.js.map +1 -1
package/dist/src/i18n/types.d.ts +5 -0
package/dist/src/i18n/types.d.ts.map +1 -1
package/dist/src/team/phase-orchestrator.d.ts +52 -0
package/dist/src/team/phase-orchestrator.d.ts.map +1 -0
package/dist/src/team/phase-orchestrator.js +165 -0
package/dist/src/team/phase-orchestrator.js.map +1 -0
package/dist/src/team/phase-types.d.ts +51 -0
package/dist/src/team/phase-types.d.ts.map +1 -0
package/dist/src/team/phase-types.js +41 -0
package/dist/src/team/phase-types.js.map +1 -0
package/dist/src/tools/collab-adapter.d.ts +17 -0
package/dist/src/tools/collab-adapter.d.ts.map +1 -1
package/dist/src/tools/collab-adapter.js +138 -0
package/dist/src/tools/collab-adapter.js.map +1 -1
package/dist/src/tools/index.d.ts.map +1 -1
package/dist/src/tools/index.js +6 -0
package/dist/src/tools/index.js.map +1 -1
package/dist/src/tools/merge-validator.d.ts +24 -0
package/dist/src/tools/merge-validator.d.ts.map +1 -0
package/dist/src/tools/merge-validator.js +220 -0
package/dist/src/tools/merge-validator.js.map +1 -0
package/dist/src/tools/spec-entry-parser.d.ts +56 -0
package/dist/src/tools/spec-entry-parser.d.ts.map +1 -0
package/dist/src/tools/spec-entry-parser.js +196 -0
package/dist/src/tools/spec-entry-parser.js.map +1 -0
package/dist/src/tools/spec-init.d.ts.map +1 -1
package/dist/src/tools/spec-init.js +66 -92
package/dist/src/tools/spec-init.js.map +1 -1
package/dist/src/tools/spec-keyword-index.d.ts +30 -0
package/dist/src/tools/spec-keyword-index.d.ts.map +1 -0
package/dist/src/tools/spec-keyword-index.js +101 -0
package/dist/src/tools/spec-keyword-index.js.map +1 -0
package/dist/src/tools/spec-loader.d.ts +3 -3
package/dist/src/tools/spec-loader.d.ts.map +1 -1
package/dist/src/tools/spec-loader.js +49 -23
package/dist/src/tools/spec-loader.js.map +1 -1
package/dist/src/tools/team-agents.d.ts +27 -0
package/dist/src/tools/team-agents.d.ts.map +1 -0
package/dist/src/tools/team-agents.js +362 -0
package/dist/src/tools/team-agents.js.map +1 -0
package/dist/src/tools/team-mailbox.d.ts +40 -0
package/dist/src/tools/team-mailbox.d.ts.map +1 -0
package/dist/src/tools/team-mailbox.js +384 -0
package/dist/src/tools/team-mailbox.js.map +1 -0
package/dist/src/tools/team-msg.d.ts +17 -8
package/dist/src/tools/team-msg.d.ts.map +1 -1
package/dist/src/tools/team-msg.js +110 -13
package/dist/src/tools/team-msg.js.map +1 -1
package/dist/src/tools/team-tasks-mcp.d.ts +27 -0
package/dist/src/tools/team-tasks-mcp.d.ts.map +1 -0
package/dist/src/tools/team-tasks-mcp.js +408 -0
package/dist/src/tools/team-tasks-mcp.js.map +1 -0
package/dist/src/types/index.d.ts +5 -0
package/dist/src/types/index.d.ts.map +1 -1
package/package.json +2 -1
package/templates/cli/prompts/workflow-skill-conflict-patterns.txt +3 -3
package/templates/cli/prompts/workflow-skill-lessons-learned.txt +3 -3
package/templates/search-tools.md +1 -1
package/workflows/analyze.md +816 -816
package/workflows/brainstorm.md +471 -471
package/workflows/cli-tools-usage.md +44 -27
package/workflows/codebase-rebuild.md +332 -332
package/workflows/codebase-refresh.md +240 -240
package/workflows/delegate-usage.md +3 -3
package/workflows/execute.md +1 -1
package/workflows/harvest.md +420 -420
package/workflows/integration-test.md +343 -343
package/workflows/issue-analyze.md +6 -2
package/workflows/issue-discover.md +414 -414
package/workflows/issue-execute.md +6 -3
package/workflows/issue-plan.md +5 -2
package/workflows/maestro-coordinate.codex.md +281 -470
package/workflows/maestro-coordinate.md +14 -14
package/workflows/maestro-link-coordinate.md +2 -2
package/workflows/maestro.codex.md +710 -0
package/workflows/maestro.md +10 -11
package/workflows/map.md +111 -111
package/workflows/milestone-complete.md +176 -176
package/workflows/plan.md +1 -1
package/workflows/quick.md +497 -497
package/workflows/refactor.md +300 -300
package/workflows/retrospective.md +1 -1
package/workflows/roadmap.md +335 -335
package/workflows/spec-generate.md +640 -640
package/workflows/specs-add.md +46 -81
package/workflows/specs-load.md +15 -17
package/workflows/specs-setup.md +40 -161
package/.claude/commands/manage-issue-analyze.md +0 -62
package/.claude/commands/manage-issue-execute.md +0 -73
package/.claude/commands/manage-issue-plan.md +0 -62
package/.codex/skills/manage-issue-analyze/SKILL.md +0 -207
package/.codex/skills/manage-issue-execute/SKILL.md +0 -200
package/.codex/skills/manage-issue-plan/SKILL.md +0 -186

package/.codex/skills/quality-integration-test/SKILL.md CHANGED Viewed

@@ -1,544 +1,532 @@
----
-name: quality-integration-test
-description: Self-iterating integration test cycle via CSV wave pipeline. Progressive L0-L3 layers in linear pipeline topology with reflection-driven adaptive strategy engine. Replaces quality-integration-test command.
-argument-hint: "[-y|--yes] [-c|--concurrency N] [--continue] \"<phase> [--max-iterations N] [--target-coverage N]\""
-allowed-tools: spawn_agents_on_csv, Read, Write, Edit, Bash, Glob, Grep, AskUserQuestion
----
-## Auto Mode
-When `--yes` or `-y`: Auto-confirm test plan, skip interactive validation, use defaults for layer detection.
-# Maestro Integration Test (CSV Wave)
-## Usage
-```bash
-$quality-integration-test "3"
-$quality-integration-test -c 4 "3 --max-iterations 8"
-$quality-integration-test -y "3 --target-coverage 90"
-$quality-integration-test --continue "integration-test-phase3-20260318"
-```
-**Flags**:
-- `-y, --yes`: Skip all confirmations (auto mode)
-- `-c, --concurrency N`: Max concurrent agents within each wave (default: 4)
-- `--continue`: Resume existing session
-**Output Directory**: `.workflow/.csv-wave/{session-id}/`
-**Core Output**: `tasks.csv` (master state) + `results.csv` (final) + `discoveries.ndjson` (shared exploration) + `context.md` (human-readable report) + `summary.json` (structured output for downstream)
----
-## Overview
-Linear pipeline test execution using `spawn_agents_on_csv`. Progressive L0 → L1 → L2 → L3 layers where each layer depends on the previous passing. Self-iterating 6-phase cycle (Explore → Design → Develop → Test → Reflect → Adjust) with adaptive strategy engine.
-**Core workflow**: Explore Codebase → Design Test Plan → Progressive Layer Execution → Reflect → Adjust Strategy → Iterate
-```
-┌─────────────────────────────────────────────────────────────────────────┐
-│              INTEGRATION TEST CSV WAVE WORKFLOW                          │
-├─────────────────────────────────────────────────────────────────────────┤
-│                                                                          │
-│  Phase 1: Exploration → CSV                                              │
-│     ├─ Resolve phase directory from arguments                            │
-│     ├─ Explore codebase for integration points                           │
-│     ├─ Discover test infrastructure and existing tests                   │
-│     ├─ Load pre-generated tests from quality-test-gen                    │
-│     ├─ Design L0-L3 test plan                                            │
-│     ├─ Generate tasks.csv with rows per layer + module                   │
-│     └─ User validates test plan (skip if -y)                             │
-│                                                                          │
-│  Phase 2: Wave Execution Engine (Linear Pipeline)                        │
-│     ├─ Wave 1: L0 Static Analysis                                        │
-│     │   ├─ Type checking (tsc --noEmit)                                  │
-│     │   ├─ Linting (eslint / ruff)                                       │
-│     │   └─ Results: pass/fail per check                                  │
-│     ├─ Wave 2: L1 Unit Tests (parallel per module)                       │
-│     │   ├─ Each module agent runs unit tests independently               │
-│     │   ├─ Discoveries shared (test commands, fixtures)                  │
-│     │   └─ Results: tests_passed + tests_failed per module               │
-│     ├─ Wave 3: L2 Integration Tests                                      │
-│     │   ├─ Cross-module + API + DB tests                                 │
-│     │   ├─ Uses L1 context for test commands and patterns                │
-│     │   └─ Results: tests_passed + tests_failed + coverage               │
-│     ├─ Wave 4: L3 E2E Tests                                              │
-│     │   ├─ Full user flow tests                                          │
-│     │   ├─ Uses L2 context for integration points                        │
-│     │   └─ Results: tests_passed + tests_failed + coverage               │
-│     └─ discoveries.ndjson shared across all waves (append-only)          │
-│                                                                          │
-│  Phase 3: Reflect + Iterate                                              │
-│     ├─ Calculate overall pass rate                                       │
-│     ├─ Reflect on results (what worked, what failed, patterns)           │
-│     ├─ Adjust strategy (conservative/aggressive/surgical/reflective)     │
-│     ├─ If pass_rate < target: iterate (back to Phase 2)                  │
-│     ├─ If pass_rate >= target OR max_iterations: finalize                │
-│     ├─ Export results.csv + summary.json                                 │
-│     ├─ Generate context.md + reflection-log.md                           │
-│     └─ Display summary with next steps                                   │
-│                                                                          │
-└─────────────────────────────────────────────────────────────────────────┘
-```
----
-## CSV Schema
-### tasks.csv (Master State)
-```csv
-id,title,description,test_layer,test_scope,deps,context_from,wave,status,findings,tests_passed,tests_failed,coverage,error
-"1","L0 Type Check","Run TypeScript type checking with tsc --noEmit. Report all type errors with file:line references.","L0-static","src/**/*.ts","","","1","","","","","",""
-"2","L0 Lint","Run ESLint on all source files. Report errors and warnings with file:line references.","L0-static","src/**/*.ts","","","1","","","","","",""
-"3","L1 Auth Module","Run unit tests for auth module: token verification, session management, password hashing. Isolated tests with mocked dependencies.","L1-unit","src/auth/**/*.ts","1;2","1;2","2","","","","","",""
-"4","L1 API Module","Run unit tests for API module: route handlers, middleware, validators. Isolated tests with mocked DB.","L1-unit","src/api/**/*.ts","1;2","1;2","2","","","","","",""
-"5","L1 Utils Module","Run unit tests for utility functions: validation, formatting, helpers. Pure function tests.","L1-unit","src/utils/**/*.ts","1;2","1;2","2","","","","","",""
-"6","L2 API Integration","Run integration tests: API endpoints with real middleware chain, DB fixtures, cross-module data flow.","L2-integration","src/api/**/*.ts;src/auth/**/*.ts","3;4;5","3;4;5","3","","","","","",""
-"7","L2 DB Integration","Run integration tests: database queries, migrations, transaction handling with test DB.","L2-integration","src/db/**/*.ts","3;4;5","3;4;5","3","","","","","",""
-"8","L3 User Flows","Run E2E tests: login flow, CRUD operations, error handling. Full browser/process execution.","L3-e2e","src/**/*.ts","6;7","6;7","4","","","","","",""
-```
-**Columns**:
-| Column | Phase | Description |
-|--------|-------|-------------|
-| `id` | Input | Unique task identifier (string) |
-| `title` | Input | Short task title |
-| `description` | Input | Detailed test execution instructions for this layer/scope |
-| `test_layer` | Input | Test layer: L0-static/L1-unit/L2-integration/L3-e2e |
-| `test_scope` | Input | Semicolon-separated file/module globs to test |
-| `deps` | Input | Semicolon-separated dependency task IDs (previous layer tasks) |
-| `context_from` | Input | Semicolon-separated task IDs whose findings this task needs |
-| `wave` | Computed | Wave number: 1=L0, 2=L1, 3=L2, 4=L3 |
-| `status` | Output | `pending` → `completed` / `failed` / `skipped` |
-| `findings` | Output | Key findings summary: failures, patterns, coverage notes (max 500 chars) |
-| `tests_passed` | Output | Count of passing tests |
-| `tests_failed` | Output | Count of failing tests |
-| `coverage` | Output | Coverage percentage for this scope (e.g., `87.5%`) |
-| `error` | Output | Error message if failed |
-### Per-Wave CSV (Temporary)
-Each wave generates `wave-{N}.csv` with extra `prev_context` column populated from predecessor findings.
----
-## Output Artifacts
-| File | Purpose | Lifecycle |
-|------|---------|-----------|
-| `tasks.csv` | Master state — all tasks with status/findings | Updated after each wave |
-| `wave-{N}.csv` | Per-wave input (temporary) | Created before wave, deleted after |
-| `results.csv` | Final export of all task results | Created in Phase 3 |
-| `discoveries.ndjson` | Shared exploration board | Append-only, carries across waves |
-| `context.md` | Human-readable integration test report | Created in Phase 3 |
-| `summary.json` | Structured output for downstream commands | Created in Phase 3 |
-| `reflection-log.md` | Per-iteration reflection history | Append-only across iterations |
----
-## Session Structure
-```
-.workflow/.csv-wave/integration-test-{phase}-{date}/
-├── tasks.csv
-├── results.csv
-├── discoveries.ndjson
-├── context.md
-├── summary.json
-├── reflection-log.md
-├── state.json
-├── iteration-{N}/
-│   ├── wave-{N}.csv (temporary)
-│   └── test-results.json
-└── wave-{N}.csv (temporary)
-```
----
-## Implementation
-### Session Initialization
-```javascript
-const getUtc8ISOString = () => new Date(Date.now() + 8 * 60 * 60 * 1000).toISOString()
-// Parse flags
-const AUTO_YES = $ARGUMENTS.includes('--yes') || $ARGUMENTS.includes('-y')
-const continueMode = $ARGUMENTS.includes('--continue')
-const concurrencyMatch = $ARGUMENTS.match(/(?:--concurrency|-c)\s+(\d+)/)
-const maxConcurrency = concurrencyMatch ? parseInt(concurrencyMatch[1]) : 4
-// Parse integration-test-specific flags
-const maxIterMatch = $ARGUMENTS.match(/--max-iterations\s+(\d+)/)
-const maxIterations = maxIterMatch ? parseInt(maxIterMatch[1]) : 5
-const coverageMatch = $ARGUMENTS.match(/--target-coverage\s+(\d+)/)
-const targetCoverage = coverageMatch ? parseInt(coverageMatch[1]) : 95
-// Clean phase text
-const phaseArg = $ARGUMENTS
-  .replace(/--yes|-y|--continue|--concurrency\s+\d+|-c\s+\d+|--max-iterations\s+\d+|--target-coverage\s+\d+/g, '')
-  .trim()
-const dateStr = getUtc8ISOString().substring(0, 10).replace(/-/g, '')
-const sessionId = `integration-test-phase${phaseArg}-${dateStr}`
-const sessionFolder = `.workflow/.csv-wave/${sessionId}`
-Bash(`mkdir -p ${sessionFolder}`)
-// Initialize state.json
-const state = {
-  phase: phaseArg,
-  started_at: getUtc8ISOString(),
-  current_iteration: 0,
-  max_iterations: maxIterations,
-  strategy: "conservative",
-  current_layer: "L0",
-  pass_rates: [],
-  convergence_threshold: targetCoverage,
-  status: "running"
-}
-Write(`${sessionFolder}/state.json`, JSON.stringify(state, null, 2))
-// Initialize reflection-log.md
-Write(`${sessionFolder}/reflection-log.md`,
-  `# Integration Test Reflection Log\nPhase: ${phaseArg}\nStarted: ${getUtc8ISOString()}\n\n## Iterations\n`)
-```
----
-### Phase 1: Exploration → CSV
-**Objective**: Explore codebase, discover integration points, design L0-L3 test plan, generate tasks.csv.
-**Decomposition Rules**:
-1. **Phase resolution**: Resolve `{phaseArg}` to `.workflow/phases/{NN}-{slug}/`
-2. **Codebase exploration**:
-   - Cross-module imports and dependencies
-   - API endpoints and route definitions
-   - Database interactions and queries
-   - External service integrations
-   - Event flows and message passing
-3. **Test infrastructure discovery**:
-   - Detect frameworks (jest/vitest/pytest, playwright/cypress)
-   - Find existing integration and E2E tests
-   - Identify test utilities, fixtures, DB seed scripts
-4. **Pre-generated test loading**:
-   Check `{phase_dir}/.tests/test-gen-report.json` for tests from `quality-test-gen`. Merge integration/e2e tests into plan (execute but don't re-generate).
-5. **Layer design**:
-| Layer | Wave | Tasks | Content |
-|-------|------|-------|---------|
-| L0 | 1 | 1-2 | Type check + lint commands |
-| L1 | 2 | 1 per module | Unit tests per discovered module (parallel) |
-| L2 | 3 | 1-3 | Integration tests (API, DB, cross-module) |
-| L3 | 4 | 1-2 | E2E tests (user flows) |
-6. **Dependency wiring**: L1 depends on L0, L2 depends on L1, L3 depends on L2.
-7. **CSV generation**: Rows for all layers with correct wave assignments and deps.
-**User validation**: Display layer breakdown with test counts (skip if AUTO_YES).
----
-### Phase 2: Wave Execution Engine
-**Objective**: Execute test layers wave-by-wave via spawn_agents_on_csv. Progressive — each layer requires previous to pass.
-#### Wave 1: L0 Static Analysis
-1. Read master `tasks.csv`
-2. Filter rows where `wave == 1` AND `status == pending`
-3. No prev_context needed (first wave)
-4. Write `wave-1.csv`
-5. Execute:
-```javascript
-spawn_agents_on_csv({
-  csv_path: `${sessionFolder}/wave-1.csv`,
-  id_column: "id",
-  instruction: buildL0Instruction(sessionFolder),
-  max_concurrency: maxConcurrency,
-  max_runtime_seconds: 300,
-  output_csv_path: `${sessionFolder}/wave-1-results.csv`,
-  output_schema: {
-    type: "object",
-    properties: {
-      id: { type: "string" },
-      status: { type: "string", enum: ["completed", "failed"] },
-      findings: { type: "string" },
-      tests_passed: { type: "string" },
-      tests_failed: { type: "string" },
-      coverage: { type: "string" },
-      error: { type: "string" }
-    },
-    required: ["id", "status", "findings"]
-  }
-})
-```
-6. Read `wave-1-results.csv`, merge into master `tasks.csv`
-7. Delete `wave-1.csv`
-8. **Gate check**: If all L0 tasks failed, skip remaining waves for this iteration
-#### Wave 2: L1 Unit Tests (Parallel per Module)
-1. Read master `tasks.csv`
-2. Filter rows where `wave == 2` AND `status == pending`
-3. Check deps — all L0 tasks must be completed (not failed)
-4. Build `prev_context` from L0 findings:
-   ```
-   [Task 1: L0 Type Check] Clean — 0 type errors
-   [Task 2: L0 Lint] 3 warnings in auth module (non-blocking)
-   ```
-5. Write `wave-2.csv` with `prev_context` column
-6. Execute `spawn_agents_on_csv` for L1 agents (parallel per module)
-7. Merge results into master `tasks.csv`
-8. Delete `wave-2.csv`
-9. **Gate check**: If all L1 tasks failed, skip L2 and L3
-#### Wave 3: L2 Integration Tests
-1. Read master `tasks.csv`
-2. Filter rows where `wave == 3` AND `status == pending`
-3. Build `prev_context` from L1 findings (test commands used, failures found, coverage gaps)
-4. Write `wave-3.csv` with `prev_context`
-5. Execute `spawn_agents_on_csv` for L2 agents
-6. Merge results, delete temp CSV
-7. **Gate check**: If all L2 tasks failed, skip L3
-#### Wave 4: L3 E2E Tests
-1. Read master `tasks.csv`
-2. Filter rows where `wave == 4` AND `status == pending`
-3. Build `prev_context` from L2 findings (integration points tested, coverage levels)
-4. Write `wave-4.csv` with `prev_context`
-5. Execute `spawn_agents_on_csv` for L3 agents
-6. Merge results, delete temp CSV
----
-### Phase 3: Reflect + Iterate
-**Objective**: Evaluate results, reflect, adjust strategy, iterate or finalize.
-#### Step 3a: Calculate Pass Rate
-Aggregate across all layers:
-```
-overall_pass_rate = total_passed / (total_passed + total_failed) * 100
-```
-Record in `state.json.pass_rates[]`.
-#### Step 3b: Reflect
-Analyze iteration results:
-- Which tests failed and why?
-- Is pass rate improving, plateauing, or regressing?
-- Are failures clustered in one layer/module or spread out?
-- Is the current strategy working?
-Append to `reflection-log.md`:
-```markdown
-## Iteration {N}
-Strategy: {strategy_name}
-Pass rate: {rate}% (previous: {prev_rate}%)
-Delta: {+/-}%
-### What worked
-- {observation}
-### What failed
-- {test}: {reason}
-### Pattern detected
-- {pattern, e.g., "all failures in auth module"}
-### Strategy assessment
-- Current strategy: {effective|ineffective|partially_effective}
-- Recommendation: {keep|switch_to_X}
-```
-#### Step 3c: Adjust Strategy (Adaptive Strategy Engine)
-| Condition | Strategy | Behavior |
-|-----------|----------|----------|
-| Iteration 1-2 | Conservative | Fix obvious failures, don't refactor |
-| Pass rate >80% AND failures similar to previous | Aggressive | Batch-fix related failures together |
-| New regressions appeared | Surgical | Revert last changes, fix regression only |
-| Stuck 3+ iterations (rate not improving) | Reflective | Step back, re-analyze root cause pattern |
-**Strategy transitions**:
-```
-Conservative -> (pass rate >80%) -> Aggressive
-Aggressive -> (regression) -> Surgical
-Surgical -> (regression fixed) -> Aggressive
-Any -> (stuck 3+ iters) -> Reflective
-Reflective -> (new insight) -> Conservative (restart approach)
-```
-Update `state.json` with new strategy and iteration count.
-#### Step 3d: Convergence Check
-- If `overall_pass_rate >= target_coverage`: **CONVERGED** → finalize
-- If `iteration >= max_iterations`: **MAX_ITER_REACHED** → finalize
-- Otherwise: **ITERATE** → reset pending tasks for failing layers, go back to Phase 2
-#### Step 3e: Finalize
-1. Read final master `tasks.csv`
-2. Export as `results.csv`
-3. Build `summary.json`:
-```json
-{
-  "phase": "<phase>",
-  "completed_at": "<ISO>",
-  "session_id": "<session-id>",
-  "iterations": 3,
-  "final_pass_rate": 97.5,
-  "converged": true,
-  "convergence_threshold": 95,
-  "strategy_history": ["conservative", "conservative", "aggressive"],
-  "layers": {
-    "L0": { "status": "pass" },
-    "L1": { "total": 15, "passed": 15, "failed": 0, "pass_rate": 100.0 },
-    "L2": { "total": 8, "passed": 7, "failed": 1, "pass_rate": 87.5 },
-    "L3": { "total": 4, "passed": 4, "failed": 0, "pass_rate": 100.0 }
-  },
-  "bugs_discovered": [],
-  "regressions_fixed": []
-}
-```
-4. Generate `context.md`:
-```markdown
-# Integration Test Report — Phase {phase}
-## Summary
-- Iterations: {N}/{max_iter}
-- Converged: {yes/no} (threshold: {threshold}%)
-- Final pass rate: {rate}%
-- Strategy: {final_strategy} (transitioned {N} times)
-## Layer Results
-| Layer | Status | Passed | Failed | Pass Rate | Coverage |
-|-------|--------|--------|--------|-----------|----------|
-| L0 Static | {pass/fail} | — | — | — | — |
-| L1 Unit | {status} | {P} | {F} | {rate}% | {cov}% |
-| L2 Integration | {status} | {P} | {F} | {rate}% | {cov}% |
-| L3 E2E | {status} | {P} | {F} | {rate}% | {cov}% |
-## Iteration History
-| Iter | Strategy | Pass Rate | Delta | Action |
-|------|----------|-----------|-------|--------|
-| 1 | conservative | 72.0% | — | fixed 3 type errors |
-| 2 | conservative | 85.5% | +13.5% | fixed auth test fixtures |
-| 3 | aggressive | 97.5% | +12.0% | batch-fixed API tests |
-## Reflection Summary
-{key insights from reflection-log.md}
-## Bugs Discovered
-{list of bugs found during testing}
-## Next Steps
-{suggested_next_command}
-```
-5. Copy `summary.json` to phase `.tests/integration/` directory.
-6. Update `index.json` with integration test status.
-7. Display summary.
-**Next step routing**:
-| Result | Suggestion |
-|--------|------------|
-| Converged (>=target%) | `maestro-verify {phase}` to update validation |
-| Max iter, >80% | `quality-test {phase}` for manual UAT on remaining gaps |
-| Max iter, <80% | `quality-debug` for deep investigation |
-| Bugs discovered | `maestro-plan {phase} --gaps` to plan fixes |
----
-## Shared Discovery Board Protocol
-### Standard Discovery Types
-| Type | Dedup Key | Data Schema | Description |
-|------|-----------|-------------|-------------|
-| `code_pattern` | `data.name` | `{name, file, description}` | Reusable code pattern found |
-| `integration_point` | `data.file` | `{file, description, exports[]}` | Module connection point |
-| `convention` | singleton | `{naming, imports, formatting}` | Project code conventions |
-| `blocker` | `data.issue` | `{issue, severity, impact}` | Blocking issue found |
-| `tech_stack` | singleton | `{framework, language, tools[]}` | Technology stack info |
-### Domain Discovery Types
-| Type | Dedup Key | Data Schema | Description |
-|------|-----------|-------------|-------------|
-| `test_command` | `data.layer` | `{layer, command, flags, cwd}` | Working test command for a layer |
-| `test_fixture` | `data.name` | `{name, file, setup, teardown}` | Shared test fixture or DB seed |
-| `coverage_gap` | `data.module` | `{module, layer, uncovered_areas[]}` | Coverage gap in a module |
-| `regression` | `data.test` | `{test, file, previous_status, current_status}` | Test that regressed |
-| `flaky_test` | `data.test` | `{test, file, fail_rate, pattern}` | Intermittently failing test |
-### Protocol
-1. **Read** `{session_folder}/discoveries.ndjson` before own test execution
-2. **Skip covered**: If discovery of same type + dedup key exists, skip
-3. **Write immediately**: Append findings as found
-4. **Append-only**: Never modify or delete
-5. **Deduplicate**: Check before writing
-```bash
-echo '{"ts":"<ISO>","worker":"{id}","type":"test_command","data":{"layer":"L1","command":"npx vitest run --reporter=verbose","flags":"--testPathPattern=unit","cwd":"."}}' >> {session_folder}/discoveries.ndjson
-```
----
-## Error Handling
-| Error | Resolution |
-|-------|------------|
-| Phase directory not found | Abort with error: "Phase {N} not found" |
-| No test framework detected | Abort with error: "No test framework detected (E003)" |
-| L0 static analysis fails | Record failures, proceed to L1 (type errors are informational) |
-| All tasks in a layer failed | Gate check: skip subsequent layers for this iteration |
-| Agent timeout | Mark as failed, continue with remaining agents in wave |
-| Max iterations without convergence | Finalize with current results, warn (W001) |
-| Regression detected | Switch to Surgical strategy (W002) |
-| Stuck 3+ iterations | Switch to Reflective strategy (W003) |
-| CSV parse error | Validate format, show line number |
-| discoveries.ndjson corrupt | Ignore malformed lines |
-| Continue mode: no session found | List available sessions |
-| state.json missing on resume | Rebuild from tasks.csv status column |
----
-## Core Rules
-1. **Start Immediately**: First action is session initialization, then Phase 1
-2. **Wave Order is Sacred**: Never execute wave N+1 before wave N completes and results are merged
-3. **Progressive Layers**: L0 → L1 → L2 → L3 — each layer gates the next
-4. **CSV is Source of Truth**: Master tasks.csv holds all state
-5. **Context Propagation**: prev_context built from master CSV, not from memory
-6. **Discovery Board is Append-Only**: Never clear, modify, or recreate discoveries.ndjson
-7. **Self-Iterating**: Loop until convergence or max iterations — do not stop after one pass
-8. **Strategy is Adaptive**: Apply the strategy engine rules for transitions, never stay on a failing strategy
-9. **Reflect Before Adjusting**: Always log reflection before changing strategy
-10. **Cleanup Temp Files**: Remove wave-{N}.csv after results are merged
-11. **DO NOT STOP**: Continuous execution until convergence or max iterations reached
+---
+name: quality-integration-test
+description: Self-iterating integration test cycle via CSV wave pipeline. Progressive L0-L3 layers in linear pipeline topology with reflection-driven adaptive strategy engine. Replaces quality-integration-test command.
+argument-hint: "[-y|--yes] [-c|--concurrency N] [--continue] \"<phase> [--max-iterations N] [--target-coverage N]\""
+allowed-tools: spawn_agents_on_csv, Read, Write, Edit, Bash, Glob, Grep, AskUserQuestion
+---
+<purpose>
+Linear pipeline test execution using `spawn_agents_on_csv`. Progressive L0 -> L1 -> L2 -> L3 layers where each layer depends on the previous passing. Self-iterating 6-phase cycle (Explore -> Design -> Develop -> Test -> Reflect -> Adjust) with adaptive strategy engine.
+**Core workflow**: Explore Codebase -> Design Test Plan -> Progressive Layer Execution -> Reflect -> Adjust Strategy -> Iterate
+```
++-------------------------------------------------------------------------+
+|              INTEGRATION TEST CSV WAVE WORKFLOW                          |
++-------------------------------------------------------------------------+
+|                                                                          |
+|  Phase 1: Exploration -> CSV                                             |
+|     +-- Resolve phase directory from arguments                           |
+|     +-- Explore codebase for integration points                          |
+|     +-- Discover test infrastructure and existing tests                  |
+|     +-- Load pre-generated tests from quality-test-gen                   |
+|     +-- Design L0-L3 test plan                                           |
+|     +-- Generate tasks.csv with rows per layer + module                  |
+|     +-- User validates test plan (skip if -y)                            |
+|                                                                          |
+|  Phase 2: Wave Execution Engine (Linear Pipeline)                        |
+|     +-- Wave 1: L0 Static Analysis                                       |
+|     |   +-- Type checking (tsc --noEmit)                                 |
+|     |   +-- Linting (eslint / ruff)                                      |
+|     |   +-- Results: pass/fail per check                                 |
+|     +-- Wave 2: L1 Unit Tests (parallel per module)                      |
+|     |   +-- Each module agent runs unit tests independently              |
+|     |   +-- Discoveries shared (test commands, fixtures)                 |
+|     |   +-- Results: tests_passed + tests_failed per module              |
+|     +-- Wave 3: L2 Integration Tests                                     |
+|     |   +-- Cross-module + API + DB tests                                |
+|     |   +-- Uses L1 context for test commands and patterns               |
+|     |   +-- Results: tests_passed + tests_failed + coverage              |
+|     +-- Wave 4: L3 E2E Tests                                             |
+|     |   +-- Full user flow tests                                         |
+|     |   +-- Uses L2 context for integration points                       |
+|     |   +-- Results: tests_passed + tests_failed + coverage              |
+|     +-- discoveries.ndjson shared across all waves (append-only)         |
+|                                                                          |
+|  Phase 3: Reflect + Iterate                                              |
+|     +-- Calculate overall pass rate                                      |
+|     +-- Reflect on results (what worked, what failed, patterns)          |
+|     +-- Adjust strategy (conservative/aggressive/surgical/reflective)    |
+|     +-- If pass_rate < target: iterate (back to Phase 2)                 |
+|     +-- If pass_rate >= target OR max_iterations: finalize               |
+|     +-- Export results.csv + summary.json                                |
+|     +-- Generate context.md + reflection-log.md                          |
+|     +-- Display summary with next steps                                  |
+|                                                                          |
++-------------------------------------------------------------------------+
+```
+</purpose>
+<context>
+```bash
+$quality-integration-test "3"
+$quality-integration-test -c 4 "3 --max-iterations 8"
+$quality-integration-test -y "3 --target-coverage 90"
+$quality-integration-test --continue "integration-test-phase3-20260318"
+```
+**Flags**:
+- `-y, --yes`: Skip all confirmations (auto mode)
+- `-c, --concurrency N`: Max concurrent agents within each wave (default: 4)
+- `--continue`: Resume existing session
+When `--yes` or `-y`: Auto-confirm test plan, skip interactive validation, use defaults for layer detection.
+**Output Directory**: `.workflow/.csv-wave/{session-id}/`
+**Core Output**: `tasks.csv` (master state) + `results.csv` (final) + `discoveries.ndjson` (shared exploration) + `context.md` (human-readable report) + `summary.json` (structured output for downstream)
+</context>
+<csv_schema>
+### tasks.csv (Master State)
+```csv
+id,title,description,test_layer,test_scope,deps,context_from,wave,status,findings,tests_passed,tests_failed,coverage,error
+"1","L0 Type Check","Run TypeScript type checking with tsc --noEmit. Report all type errors with file:line references.","L0-static","src/**/*.ts","","","1","","","","","",""
+"2","L0 Lint","Run ESLint on all source files. Report errors and warnings with file:line references.","L0-static","src/**/*.ts","","","1","","","","","",""
+"3","L1 Auth Module","Run unit tests for auth module: token verification, session management, password hashing. Isolated tests with mocked dependencies.","L1-unit","src/auth/**/*.ts","1;2","1;2","2","","","","","",""
+"4","L1 API Module","Run unit tests for API module: route handlers, middleware, validators. Isolated tests with mocked DB.","L1-unit","src/api/**/*.ts","1;2","1;2","2","","","","","",""
+"5","L1 Utils Module","Run unit tests for utility functions: validation, formatting, helpers. Pure function tests.","L1-unit","src/utils/**/*.ts","1;2","1;2","2","","","","","",""
+"6","L2 API Integration","Run integration tests: API endpoints with real middleware chain, DB fixtures, cross-module data flow.","L2-integration","src/api/**/*.ts;src/auth/**/*.ts","3;4;5","3;4;5","3","","","","","",""
+"7","L2 DB Integration","Run integration tests: database queries, migrations, transaction handling with test DB.","L2-integration","src/db/**/*.ts","3;4;5","3;4;5","3","","","","","",""
+"8","L3 User Flows","Run E2E tests: login flow, CRUD operations, error handling. Full browser/process execution.","L3-e2e","src/**/*.ts","6;7","6;7","4","","","","","",""
+```
+**Columns**:
+| Column | Phase | Description |
+|--------|-------|-------------|
+| `id` | Input | Unique task identifier (string) |
+| `title` | Input | Short task title |
+| `description` | Input | Detailed test execution instructions for this layer/scope |
+| `test_layer` | Input | Test layer: L0-static/L1-unit/L2-integration/L3-e2e |
+| `test_scope` | Input | Semicolon-separated file/module globs to test |
+| `deps` | Input | Semicolon-separated dependency task IDs (previous layer tasks) |
+| `context_from` | Input | Semicolon-separated task IDs whose findings this task needs |
+| `wave` | Computed | Wave number: 1=L0, 2=L1, 3=L2, 4=L3 |
+| `status` | Output | `pending` -> `completed` / `failed` / `skipped` |
+| `findings` | Output | Key findings summary: failures, patterns, coverage notes (max 500 chars) |
+| `tests_passed` | Output | Count of passing tests |
+| `tests_failed` | Output | Count of failing tests |
+| `coverage` | Output | Coverage percentage for this scope (e.g., `87.5%`) |
+| `error` | Output | Error message if failed |
+### Per-Wave CSV (Temporary)
+Each wave generates `wave-{N}.csv` with extra `prev_context` column populated from predecessor findings.
+### Output Artifacts
+| File | Purpose | Lifecycle |
+|------|---------|-----------|
+| `tasks.csv` | Master state -- all tasks with status/findings | Updated after each wave |
+| `wave-{N}.csv` | Per-wave input (temporary) | Created before wave, deleted after |
+| `results.csv` | Final export of all task results | Created in Phase 3 |
+| `discoveries.ndjson` | Shared exploration board | Append-only, carries across waves |
+| `context.md` | Human-readable integration test report | Created in Phase 3 |
+| `summary.json` | Structured output for downstream commands | Created in Phase 3 |
+| `reflection-log.md` | Per-iteration reflection history | Append-only across iterations |
+### Session Structure
+```
+.workflow/.csv-wave/integration-test-{phase}-{date}/
++-- tasks.csv
++-- results.csv
++-- discoveries.ndjson
++-- context.md
++-- summary.json
++-- reflection-log.md
++-- state.json
++-- iteration-{N}/
+|   +-- wave-{N}.csv (temporary)
+|   +-- test-results.json
++-- wave-{N}.csv (temporary)
+```
+</csv_schema>
+<invariants>
+1. **Start Immediately**: First action is session initialization, then Phase 1
+2. **Wave Order is Sacred**: Never execute wave N+1 before wave N completes and results are merged
+3. **Progressive Layers**: L0 -> L1 -> L2 -> L3 -- each layer gates the next
+4. **CSV is Source of Truth**: Master tasks.csv holds all state
+5. **Context Propagation**: prev_context built from master CSV, not from memory
+6. **Discovery Board is Append-Only**: Never clear, modify, or recreate discoveries.ndjson
+7. **Self-Iterating**: Loop until convergence or max iterations -- do not stop after one pass
+8. **Strategy is Adaptive**: Apply the strategy engine rules for transitions, never stay on a failing strategy
+9. **Reflect Before Adjusting**: Always log reflection before changing strategy
+10. **Cleanup Temp Files**: Remove wave-{N}.csv after results are merged
+11. **DO NOT STOP**: Continuous execution until convergence or max iterations reached
+</invariants>
+<execution>
+### Session Initialization
+```javascript
+const getUtc8ISOString = () => new Date(Date.now() + 8 * 60 * 60 * 1000).toISOString()
+// Parse flags
+const AUTO_YES = $ARGUMENTS.includes('--yes') || $ARGUMENTS.includes('-y')
+const continueMode = $ARGUMENTS.includes('--continue')
+const concurrencyMatch = $ARGUMENTS.match(/(?:--concurrency|-c)\s+(\d+)/)
+const maxConcurrency = concurrencyMatch ? parseInt(concurrencyMatch[1]) : 4
+// Parse integration-test-specific flags
+const maxIterMatch = $ARGUMENTS.match(/--max-iterations\s+(\d+)/)
+const maxIterations = maxIterMatch ? parseInt(maxIterMatch[1]) : 5
+const coverageMatch = $ARGUMENTS.match(/--target-coverage\s+(\d+)/)
+const targetCoverage = coverageMatch ? parseInt(coverageMatch[1]) : 95
+// Clean phase text
+const phaseArg = $ARGUMENTS
+  .replace(/--yes|-y|--continue|--concurrency\s+\d+|-c\s+\d+|--max-iterations\s+\d+|--target-coverage\s+\d+/g, '')
+  .trim()
+const dateStr = getUtc8ISOString().substring(0, 10).replace(/-/g, '')
+const sessionId = `integration-test-phase${phaseArg}-${dateStr}`
+const sessionFolder = `.workflow/.csv-wave/${sessionId}`
+Bash(`mkdir -p ${sessionFolder}`)
+// Initialize state.json
+const state = {
+  phase: phaseArg,
+  started_at: getUtc8ISOString(),
+  current_iteration: 0,
+  max_iterations: maxIterations,
+  strategy: "conservative",
+  current_layer: "L0",
+  pass_rates: [],
+  convergence_threshold: targetCoverage,
+  status: "running"
+}
+Write(`${sessionFolder}/state.json`, JSON.stringify(state, null, 2))
+// Initialize reflection-log.md
+Write(`${sessionFolder}/reflection-log.md`,
+  `# Integration Test Reflection Log\nPhase: ${phaseArg}\nStarted: ${getUtc8ISOString()}\n\n## Iterations\n`)
+```
+### Phase 1: Exploration -> CSV
+**Objective**: Explore codebase, discover integration points, design L0-L3 test plan, generate tasks.csv.
+**Decomposition Rules**:
+1. **Phase resolution**: Resolve `{phaseArg}` to `.workflow/phases/{NN}-{slug}/`
+2. **Codebase exploration**:
+   - Cross-module imports and dependencies
+   - API endpoints and route definitions
+   - Database interactions and queries
+   - External service integrations
+   - Event flows and message passing
+3. **Test infrastructure discovery**:
+   - Detect frameworks (jest/vitest/pytest, playwright/cypress)
+   - Find existing integration and E2E tests
+   - Identify test utilities, fixtures, DB seed scripts
+4. **Pre-generated test loading**:
+   Check `{phase_dir}/.tests/test-gen-report.json` for tests from `quality-test-gen`. Merge integration/e2e tests into plan (execute but don't re-generate).
+5. **Layer design**:
+| Layer | Wave | Tasks | Content |
+|-------|------|-------|---------|
+| L0 | 1 | 1-2 | Type check + lint commands |
+| L1 | 2 | 1 per module | Unit tests per discovered module (parallel) |
+| L2 | 3 | 1-3 | Integration tests (API, DB, cross-module) |
+| L3 | 4 | 1-2 | E2E tests (user flows) |
+6. **Dependency wiring**: L1 depends on L0, L2 depends on L1, L3 depends on L2.
+7. **CSV generation**: Rows for all layers with correct wave assignments and deps.
+**User validation**: Display layer breakdown with test counts (skip if AUTO_YES).
+### Phase 2: Wave Execution Engine
+**Objective**: Execute test layers wave-by-wave via spawn_agents_on_csv. Progressive -- each layer requires previous to pass.
+#### Wave 1: L0 Static Analysis
+1. Read master `tasks.csv`
+2. Filter rows where `wave == 1` AND `status == pending`
+3. No prev_context needed (first wave)
+4. Write `wave-1.csv`
+5. Execute:
+```javascript
+spawn_agents_on_csv({
+  csv_path: `${sessionFolder}/wave-1.csv`,
+  id_column: "id",
+  instruction: buildL0Instruction(sessionFolder),
+  max_concurrency: maxConcurrency,
+  max_runtime_seconds: 300,
+  output_csv_path: `${sessionFolder}/wave-1-results.csv`,
+  output_schema: {
+    type: "object",
+    properties: {
+      id: { type: "string" },
+      status: { type: "string", enum: ["completed", "failed"] },
+      findings: { type: "string" },
+      tests_passed: { type: "string" },
+      tests_failed: { type: "string" },
+      coverage: { type: "string" },
+      error: { type: "string" }
+    },
+    required: ["id", "status", "findings"]
+  }
+})
+```
+6. Read `wave-1-results.csv`, merge into master `tasks.csv`
+7. Delete `wave-1.csv`
+8. **Gate check**: If all L0 tasks failed, skip remaining waves for this iteration
+#### Wave 2: L1 Unit Tests (Parallel per Module)
+1. Read master `tasks.csv`
+2. Filter rows where `wave == 2` AND `status == pending`
+3. Check deps -- all L0 tasks must be completed (not failed)
+4. Build `prev_context` from L0 findings:
+   ```
+   [Task 1: L0 Type Check] Clean -- 0 type errors
+   [Task 2: L0 Lint] 3 warnings in auth module (non-blocking)
+   ```
+5. Write `wave-2.csv` with `prev_context` column
+6. Execute `spawn_agents_on_csv` for L1 agents (parallel per module)
+7. Merge results into master `tasks.csv`
+8. Delete `wave-2.csv`
+9. **Gate check**: If all L1 tasks failed, skip L2 and L3
+#### Wave 3: L2 Integration Tests
+1. Read master `tasks.csv`
+2. Filter rows where `wave == 3` AND `status == pending`
+3. Build `prev_context` from L1 findings (test commands used, failures found, coverage gaps)
+4. Write `wave-3.csv` with `prev_context`
+5. Execute `spawn_agents_on_csv` for L2 agents
+6. Merge results, delete temp CSV
+7. **Gate check**: If all L2 tasks failed, skip L3
+#### Wave 4: L3 E2E Tests
+1. Read master `tasks.csv`
+2. Filter rows where `wave == 4` AND `status == pending`
+3. Build `prev_context` from L2 findings (integration points tested, coverage levels)
+4. Write `wave-4.csv` with `prev_context`
+5. Execute `spawn_agents_on_csv` for L3 agents
+6. Merge results, delete temp CSV
+### Phase 3: Reflect + Iterate
+**Objective**: Evaluate results, reflect, adjust strategy, iterate or finalize.
+#### Step 3a: Calculate Pass Rate
+Aggregate across all layers:
+```
+overall_pass_rate = total_passed / (total_passed + total_failed) * 100
+```
+Record in `state.json.pass_rates[]`.
+#### Step 3b: Reflect
+Analyze iteration results:
+- Which tests failed and why?
+- Is pass rate improving, plateauing, or regressing?
+- Are failures clustered in one layer/module or spread out?
+- Is the current strategy working?
+Append to `reflection-log.md`:
+```markdown
+## Iteration {N}
+Strategy: {strategy_name}
+Pass rate: {rate}% (previous: {prev_rate}%)
+Delta: {+/-}%
+### What worked
+- {observation}
+### What failed
+- {test}: {reason}
+### Pattern detected
+- {pattern, e.g., "all failures in auth module"}
+### Strategy assessment
+- Current strategy: {effective|ineffective|partially_effective}
+- Recommendation: {keep|switch_to_X}
+```
+#### Step 3c: Adjust Strategy (Adaptive Strategy Engine)
+| Condition | Strategy | Behavior |
+|-----------|----------|----------|
+| Iteration 1-2 | Conservative | Fix obvious failures, don't refactor |
+| Pass rate >80% AND failures similar to previous | Aggressive | Batch-fix related failures together |
+| New regressions appeared | Surgical | Revert last changes, fix regression only |
+| Stuck 3+ iterations (rate not improving) | Reflective | Step back, re-analyze root cause pattern |
+**Strategy transitions**:
+```
+Conservative -> (pass rate >80%) -> Aggressive
+Aggressive -> (regression) -> Surgical
+Surgical -> (regression fixed) -> Aggressive
+Any -> (stuck 3+ iters) -> Reflective
+Reflective -> (new insight) -> Conservative (restart approach)
+```
+Update `state.json` with new strategy and iteration count.
+#### Step 3d: Convergence Check
+- If `overall_pass_rate >= target_coverage`: **CONVERGED** -> finalize
+- If `iteration >= max_iterations`: **MAX_ITER_REACHED** -> finalize
+- Otherwise: **ITERATE** -> reset pending tasks for failing layers, go back to Phase 2
+#### Step 3e: Finalize
+1. Read final master `tasks.csv`
+2. Export as `results.csv`
+3. Build `summary.json`:
+```json
+{
+  "phase": "<phase>",
+  "completed_at": "<ISO>",
+  "session_id": "<session-id>",
+  "iterations": 3,
+  "final_pass_rate": 97.5,
+  "converged": true,
+  "convergence_threshold": 95,
+  "strategy_history": ["conservative", "conservative", "aggressive"],
+  "layers": {
+    "L0": { "status": "pass" },
+    "L1": { "total": 15, "passed": 15, "failed": 0, "pass_rate": 100.0 },
+    "L2": { "total": 8, "passed": 7, "failed": 1, "pass_rate": 87.5 },
+    "L3": { "total": 4, "passed": 4, "failed": 0, "pass_rate": 100.0 }
+  },
+  "bugs_discovered": [],
+  "regressions_fixed": []
+}
+```
+4. Generate `context.md`:
+```markdown
+# Integration Test Report -- Phase {phase}
+## Summary
+- Iterations: {N}/{max_iter}
+- Converged: {yes/no} (threshold: {threshold}%)
+- Final pass rate: {rate}%
+- Strategy: {final_strategy} (transitioned {N} times)
+## Layer Results
+| Layer | Status | Passed | Failed | Pass Rate | Coverage |
+|-------|--------|--------|--------|-----------|----------|
+| L0 Static | {pass/fail} | -- | -- | -- | -- |
+| L1 Unit | {status} | {P} | {F} | {rate}% | {cov}% |
+| L2 Integration | {status} | {P} | {F} | {rate}% | {cov}% |
+| L3 E2E | {status} | {P} | {F} | {rate}% | {cov}% |
+## Iteration History
+| Iter | Strategy | Pass Rate | Delta | Action |
+|------|----------|-----------|-------|--------|
+| 1 | conservative | 72.0% | -- | fixed 3 type errors |
+| 2 | conservative | 85.5% | +13.5% | fixed auth test fixtures |
+| 3 | aggressive | 97.5% | +12.0% | batch-fixed API tests |
+## Reflection Summary
+{key insights from reflection-log.md}
+## Bugs Discovered
+{list of bugs found during testing}
+## Next Steps
+{suggested_next_command}
+```
+5. Copy `summary.json` to phase `.tests/integration/` directory.
+6. Update `index.json` with integration test status.
+7. Display summary.
+**Next step routing**:
+| Result | Suggestion |
+|--------|------------|
+| Converged (>=target%) | `maestro-verify {phase}` to update validation |
+| Max iter, >80% | `quality-test {phase}` for manual UAT on remaining gaps |
+| Max iter, <80% | `quality-debug` for deep investigation |
+| Bugs discovered | `maestro-plan {phase} --gaps` to plan fixes |
+### Shared Discovery Board Protocol
+#### Standard Discovery Types
+| Type | Dedup Key | Data Schema | Description |
+|------|-----------|-------------|-------------|
+| `code_pattern` | `data.name` | `{name, file, description}` | Reusable code pattern found |
+| `integration_point` | `data.file` | `{file, description, exports[]}` | Module connection point |
+| `convention` | singleton | `{naming, imports, formatting}` | Project code conventions |
+| `blocker` | `data.issue` | `{issue, severity, impact}` | Blocking issue found |
+| `tech_stack` | singleton | `{framework, language, tools[]}` | Technology stack info |
+#### Domain Discovery Types
+| Type | Dedup Key | Data Schema | Description |
+|------|-----------|-------------|-------------|
+| `test_command` | `data.layer` | `{layer, command, flags, cwd}` | Working test command for a layer |
+| `test_fixture` | `data.name` | `{name, file, setup, teardown}` | Shared test fixture or DB seed |
+| `coverage_gap` | `data.module` | `{module, layer, uncovered_areas[]}` | Coverage gap in a module |
+| `regression` | `data.test` | `{test, file, previous_status, current_status}` | Test that regressed |
+| `flaky_test` | `data.test` | `{test, file, fail_rate, pattern}` | Intermittently failing test |
+#### Protocol
+1. **Read** `{session_folder}/discoveries.ndjson` before own test execution
+2. **Skip covered**: If discovery of same type + dedup key exists, skip
+3. **Write immediately**: Append findings as found
+4. **Append-only**: Never modify or delete
+5. **Deduplicate**: Check before writing
+```bash
+echo '{"ts":"<ISO>","worker":"{id}","type":"test_command","data":{"layer":"L1","command":"npx vitest run --reporter=verbose","flags":"--testPathPattern=unit","cwd":"."}}' >> {session_folder}/discoveries.ndjson
+```
+</execution>
+<error_codes>
+| Error | Resolution |
+|-------|------------|
+| Phase directory not found | Abort with error: "Phase {N} not found" |
+| No test framework detected | Abort with error: "No test framework detected (E003)" |
+| L0 static analysis fails | Record failures, proceed to L1 (type errors are informational) |
+| All tasks in a layer failed | Gate check: skip subsequent layers for this iteration |
+| Agent timeout | Mark as failed, continue with remaining agents in wave |
+| Max iterations without convergence | Finalize with current results, warn (W001) |
+| Regression detected | Switch to Surgical strategy (W002) |
+| Stuck 3+ iterations | Switch to Reflective strategy (W003) |
+| CSV parse error | Validate format, show line number |
+| discoveries.ndjson corrupt | Ignore malformed lines |
+| Continue mode: no session found | List available sessions |
+| state.json missing on resume | Rebuild from tasks.csv status column |
+</error_codes>
+<success_criteria>
+- [ ] Session initialized with state.json and reflection-log.md
+- [ ] tasks.csv generated with correct layer/wave assignments and dependencies
+- [ ] All waves executed sequentially (L0 -> L1 -> L2 -> L3) with gate checks
+- [ ] Reflection logged after each iteration with strategy assessment
+- [ ] Strategy engine transitions applied correctly based on pass rates
+- [ ] Convergence reached or max iterations exhausted
+- [ ] results.csv, summary.json, and context.md generated
+- [ ] Temporary wave-{N}.csv files cleaned up after merge
+- [ ] discoveries.ndjson maintained as append-only across all waves
+</success_criteria>