npm - maestro-flow - Versions diffs - 0.3.29 → 0.3.31 - Mend

maestro-flow 0.3.29 → 0.3.31

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (17) hide show

package/.claude/commands/maestro-init.md +3 -3
package/.claude/commands/maestro-plan.md +2 -2
package/.claude/commands/maestro-ralph-execute.md +55 -21
package/.claude/commands/maestro.md +13 -11
package/.claude/commands/quality-business-test.md +3 -3
package/.claude/commands/quality-retrospective.md +2 -2
package/.codex/skills/maestro/SKILL.md +4 -4
package/.codex/skills/maestro-init/SKILL.md +9 -9
package/.codex/skills/maestro-link-coordinate/SKILL.md +4 -4
package/.codex/skills/maestro-milestone-complete/SKILL.md +3 -2
package/.codex/skills/maestro-plan/SKILL.md +2 -2
package/.codex/skills/maestro-ralph/SKILL.md +199 -56
package/.codex/skills/maestro-ralph-execute/SKILL.md +25 -1
package/.codex/skills/quality-business-test/SKILL.md +6 -6
package/.codex/skills/quality-retrospective/SKILL.md +5 -5
package/.codex/skills/quality-test/SKILL.md +2 -2
package/package.json +1 -1

package/.claude/commands/maestro-init.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 name: maestro-init
 description: Initialize project with auto state detection (empty/code/existing)
-argument-hint: "[--auto] [--from-brainstorm SESSION-ID]"
+argument-hint: "[-y] [--from-brainstorm SESSION-ID]"
 allowed-tools:
   - Read
   - Write
@@ -24,7 +24,7 @@ Initialize a new project through auto state detection and unified flow. Invoked
 <context>
 **Flags:**
-- `--auto` -- Automatic mode. After config questions, runs research without further interaction. Expects idea document via @ reference.
+- `-y` -- Automatic mode. After config questions, runs research without further interaction. Expects idea document via @ reference.
 - `--from-brainstorm SESSION-ID` -- Import from a brainstorm session. Reads guidance-specification.md to pre-fill project vision, goals, constraints, and terminology. Skips interactive questioning.
 **Load project state if exists:**
@@ -61,7 +61,7 @@ Other commands:
 <error_codes>
 | Code | Severity | Condition | Recovery |
 |------|----------|-----------|----------|
-| E001 | error | No arguments provided when --auto requires @ reference | Check arguments format, re-run with correct input |
+| E001 | error | No arguments provided when -y requires @ reference | Check arguments format, re-run with correct input |
 | E002 | error | .workflow/ already exists for greenfield init | Check .workflow/ directory state, resolve conflicts |
 | E003 | error | Brainstorm session not found (--from-brainstorm) | Check arguments format, re-run with correct input |
 | W001 | warning | Research agent failed, continuing with partial results | Retry research or proceed with partial results |

package/.claude/commands/maestro-plan.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 name: maestro-plan
 description: Explore, clarify, plan, check, and confirm a phase execution plan
-argument-hint: "[phase] [--collab] [--spec SPEC-xxx] [--auto] [--gaps] [--dir <path>] [--revise [instructions]] [--check <plan-dir>]"
+argument-hint: "[phase] [--collab] [--spec SPEC-xxx] [-y] [--gaps] [--dir <path>] [--revise [instructions]] [--check <plan-dir>]"
 allowed-tools:
   - Read
   - Write
@@ -36,7 +36,7 @@ All plan output goes to `.workflow/scratch/{YYYYMMDD}-plan-[P{N}-|M{N}-]{slug}/`
 <context>
 $ARGUMENTS — phase number, or no args for milestone-wide planning, with optional flags.
-Scope routing, base flags (`--collab`, `--spec`, `--auto`, `--gaps`, `--dir`), output directory format, and artifact registration are defined in workflow plan.md.
+Scope routing, base flags (`--collab`, `--spec`, `-y`, `--gaps`, `--dir`), output directory format, and artifact registration are defined in workflow plan.md.
 **Command-level flags** (extensions beyond workflow base):
 - `--revise [instructions]` -- See workflow plan.md § Revise Mode

package/.claude/commands/maestro-ralph-execute.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 name: maestro-ralph-execute
 description: Single-step executor — find next pending command in ralph session, execute by type (decision/skill/cli), hand off to next iteration
-argument-hint: "[session-id]"
+argument-hint: "[-y] [session-id]"
 allowed-tools:
   - Read
   - Write
@@ -27,11 +27,19 @@ Mutual invocation with `/maestro-ralph` forms a persistent self-perpetuating wor
 </purpose>
 <context>
-$ARGUMENTS — optional session ID. If omitted, finds latest running ralph session.
+$ARGUMENTS — optional `-y` flag + optional session ID. If session ID omitted, finds latest running ralph session.
+**Flag parsing:**
+```
+Parse $ARGUMENTS:
+  Contains "-y" or "--yes" → auto = true, remove flag from remaining args
+  Remaining → session_id (if matches ralph-* pattern)
+```
 **Session discovery:**
 Scan `.workflow/.ralph/ralph-*/status.json` for `status == "running"`, sorted by `created_at` descending.
-If $ARGUMENTS matches a session ID pattern, use that specific session.
+If remaining args match a session ID pattern, use that specific session.
+Also read `session.auto` from status.json — if `true`, treat as `-y` even if flag not passed（ralph 已写入）。
 </context>
 <execution>
@@ -52,12 +60,12 @@ If no session found:
   End.
 ```
-Read status.json → extract: `id`, `commands[]`, `current`, `status`, `phase`.
+Read status.json → extract: `id`, `steps[]`, `current_step`, `status`, `phase`.
 ## Step 2: Find Next Pending Command
 ```
-next = commands.find(cmd => cmd.status == "pending")
+next = steps.find(step => step.status == "pending")
 If no pending command:
   → Step 5 (Complete)
@@ -123,15 +131,15 @@ Write status.json
 Display step banner:
 ```
 ------------------------------------------------------------
-  [{next.index}/{commands.length - 1}] {next.skill} [{next.type}]
+  [{next.index}/{steps.length - 1}] {next.skill} [{next.type}]
 ------------------------------------------------------------
   Args: {next.args}
   {next.type == "decision" ? "Retry: " + JSON.parse(next.args).retry_count + "/" + JSON.parse(next.args).max_retries : ""}
 ```
-**Context weight hint** (after 4+ completed steps):
+**Context weight hint** (after 4+ completed steps, skip if auto):
 ```
-If completed_count >= 4:
+If completed_count >= 4 && !auto:
   Display: ⚡ 已执行 {completed_count} 步，上下文较重。可 /maestro-ralph continue 在新上下文恢复。
 ```
@@ -148,7 +156,7 @@ Skill({ skill: "maestro-ralph" })
 Ralph will:
 1. Detect the running decision node in status.json
 2. Evaluate execution results (verify gaps, test failures, etc.)
-3. Optionally expand commands[] with fix loops
+3. Optionally expand steps[] with fix loops
 4. Mark the decision node completed
 5. Call `Skill("maestro-ralph-execute")` to resume
@@ -158,8 +166,25 @@ Ralph will:
 Synchronous in-session execution.
+**`-y` auto flag 传播：** 当 `auto == true` 时，按传播表对目标 skill 附加 auto flag：
+```
+auto_flag_map = {
+  "maestro-init": "-y",
+  "maestro-analyze": "-y",
+  "maestro-brainstorm": "-y",
+  "maestro-roadmap": "-y",
+  "maestro-plan": "-y",
+  "maestro-execute": "-y",
+  "quality-business-test": "-y",
+  "quality-test": "-y --auto-fix",
+  "maestro-milestone-complete": "-y"
+}
+flag = auto_flag_map[next.skill] || ""
+effective_args = flag ? `${next.args} ${flag}` : next.args
+```
 ```
-Skill({ skill: next.skill, args: next.args })
+Skill({ skill: next.skill, args: effective_args })
 ```
 On success:
@@ -179,10 +204,18 @@ next.completed_at = new Date().toISOString()
 Write status.json
 Display: [N/total] ✗ {next.skill} failed: {error}
-AskUserQuestion: "retry / skip / abort"
-  retry → reset next.status = "pending", next.error = null → Skill("maestro-ralph-execute")
-  skip  → next.status = "skipped" → Skill("maestro-ralph-execute")
-  abort → status.status = "paused" → Write status.json → End.
+If auto:
+  If not next.retried:
+    next.retried = true, next.status = "pending", next.error = null → retry once
+  Else:
+    next.status = "skipped" → continue (auto-skip)
+    Display: [N/total] ⏭ {next.skill} auto-skipped after retry
+Else:
+  AskUserQuestion: "retry / skip / abort"
+    retry → reset next.status = "pending", next.error = null → Skill("maestro-ralph-execute")
+    skip  → next.status = "skipped" → Skill("maestro-ralph-execute")
+    abort → status.status = "paused" → Write status.json → End.
 ```
 Then hand off:
@@ -232,8 +265,7 @@ next.status = "failed"
 next.error = "{error details}"
 Write status.json
-AskUserQuestion: "retry / skip / abort"
-  (same as 4b failure handling)
+(same as 4b failure handling: auto → retry once then skip; else → AskUserQuestion)
 ```
 Then hand off:
@@ -243,7 +275,7 @@ Skill({ skill: "maestro-ralph-execute" })
 ## Step 5: Complete Session
-When no pending commands remain:
+When no pending steps remain:
 ```
 status.status = "completed"
@@ -260,7 +292,7 @@ Display completion report:
   Phase:    {phase}
   Steps:    {completed}/{total}
-  {commands.map(cmd => {
+  {steps.map(cmd => {
     icon = cmd.status == "completed" ? "✓" :
            cmd.status == "skipped"   ? "—" :
            cmd.status == "failed"    ? "✗" : " "
@@ -287,12 +319,14 @@ Display completion report:
 <success_criteria>
 - [ ] Session discovery finds latest running ralph session
-- [ ] Pending command correctly identified from commands[]
+- [ ] `-y` flag parsed from args OR inherited from session.auto
+- [ ] Pending step correctly identified from steps[]
 - [ ] decision nodes hand off to maestro-ralph via Skill()
 - [ ] skill nodes execute synchronously via Skill() and self-invoke next
+- [ ] `-y` auto flag 按传播表附加到目标 skill args
 - [ ] cli nodes use maestro delegate with run_in_background + stop pattern
 - [ ] status.json updated after every status change (resume-safe)
-- [ ] Failure handling offers retry/skip/abort
+- [ ] auto 模式：失败重试一次后 auto-skip；非 auto：AskUserQuestion retry/skip/abort
 - [ ] Completion report shows all steps with status icons
-- [ ] Self-invocation chain continues until all commands complete
+- [ ] Self-invocation chain continues until all steps complete
 </success_criteria>

package/.claude/commands/maestro.md CHANGED Viewed

@@ -65,21 +65,23 @@ When `-y` is active, maestro propagates auto flags to downstream commands. Only
 | Command | Auto Flag | Effect |
 |---------|-----------|--------|
+| maestro-init | `-y` | Skip interactive questioning |
 | maestro-analyze | `-y` | Skip interactive scoping, auto-deepen |
 | maestro-brainstorm | `-y` | Skip interactive questions, use defaults |
 | maestro-roadmap | `-y` | Skip interactive questions, use defaults (create/revise/review) |
 | maestro-ui-design | `-y` | Skip interactive selection, pick top variant |
-| maestro-plan | `--auto` | Skip interactive clarification |
-| maestro-roadmap --mode full | `-y` | Skip interactive questions, use defaults |
-| maestro-execute | *(none)* | No auto flag — executes all tasks normally |
-| maestro-verify | *(none)* | No auto flag — runs full verification |
-| quality-review | *(none)* | No auto flag — auto-detects level, runs fully |
-| quality-test | `--auto-fix` | Auto-trigger gap-fix loop on failures |
-| quality-test-gen | *(none)* | No auto flag — generates tests normally |
-| quality-debug | *(none)* | No auto flag — runs diagnosis normally |
-| quality-retrospective | `--auto-yes` | Accept all routing recommendations (spec/note/issue) without prompting |
-| maestro-milestone-audit | *(none)* | No auto flag — validates milestone readiness |
-| manage-learn | *(none)* | No auto flag — pure file operation, no prompts |
+| maestro-plan | `-y` | Skip confirmations and clarification |
+| maestro-execute | `-y` | Skip confirmations, blocked auto-continue |
+| maestro-verify | *(none)* | No interactive prompts |
+| quality-business-test | `-y` | Skip plan confirmation |
+| quality-review | *(none)* | No interactive prompts, auto-detects level |
+| quality-test | `-y --auto-fix` | Auto-trigger gap-fix loop on failures |
+| quality-test-gen | *(none)* | No interactive prompts |
+| quality-debug | *(none)* | No interactive prompts |
+| quality-retrospective | `-y` | Accept all routing recommendations without prompting |
+| maestro-milestone-audit | *(none)* | No interactive prompts |
+| maestro-milestone-complete | `-y` | Skip knowledge promotion inquiry |
+| manage-learn | *(none)* | No interactive prompts |
 Commands not listed (manage-*, spec-*, milestone-*) have no auto flags and execute as-is.

package/.claude/commands/quality-business-test.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 name: quality-business-test
 description: PRD-forward business testing with requirement traceability, fixture generation, and multi-layer execution
-argument-hint: "<phase> [--spec SPEC-xxx] [--layer L1|L2|L3] [--gen-code] [--dry-run] [--re-run] [--auto]"
+argument-hint: "<phase> [--spec SPEC-xxx] [--layer L1|L2|L3] [--gen-code] [--dry-run] [--re-run] [-y]"
 allowed-tools:
   - Read
   - Write
@@ -37,7 +37,7 @@ Phase: $ARGUMENTS (required -- phase number)
 - `--gen-code` -- Generate framework-specific test classes (JUnit/RestAssured, supertest/vitest, pytest/httpx)
 - `--dry-run` -- Extract scenarios and fixtures only, don't execute
 - `--re-run` -- Re-run only previously failed/blocked scenarios
-- `--auto` -- Skip interactive confirmations
+- `-y` -- Skip interactive confirmations
 **Layer definitions:**
@@ -96,7 +96,7 @@ Follow '~/.maestro/workflows/business-test.md' completely.
 - [ ] RFC 2119 keywords mapped to test priorities
 - [ ] Test fixtures generated (valid/invalid/boundary per REQ data model)
 - [ ] business-test-plan.json written with layer distribution
-- [ ] User confirmed plan (or --auto skipped confirmation)
+- [ ] User confirmed plan (or -y skipped confirmation)
 - [ ] Test code generated if --gen-code (framework-appropriate)
 - [ ] L1 executed with Generator-Critic loop (max 3 iterations)
 - [ ] L2 executed if no L1 critical failures

package/.claude/commands/quality-retrospective.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 name: quality-retrospective
 description: Multi-lens 复盘 of completed phase(s); routes insights to spec/note/issue stores and the lessons library
-argument-hint: "[phase|N..M] [--lens technical|process|quality|decision] [--all] [--no-route] [--compare N] [--auto-yes]"
+argument-hint: "[phase|N..M] [--lens technical|process|quality|decision] [--all] [--no-route] [--compare N] [-y]"
 allowed-tools:
   - Read
   - Write
@@ -30,7 +30,7 @@ Post-execution multi-perspective retrospective (复盘) for completed phases. Co
 <context>
 Arguments: $ARGUMENTS
-Modes (scan/single/range/all), flags (--lens, --no-route, --compare, --auto-yes), and storage paths defined in workflow retrospective.md Argument Shape and Stages 1-7.
+Modes (scan/single/range/all), flags (--lens, --no-route, --compare, -y), and storage paths defined in workflow retrospective.md Argument Shape and Stages 1-7.
 </context>
 <execution>

package/.codex/skills/maestro/SKILL.md CHANGED Viewed

@@ -189,10 +189,10 @@ functions.update_plan({
 | Skill | Flag |
 |-------|------|
-| `maestro-analyze`, `maestro-brainstorm`, `maestro-ui-design`, `maestro-roadmap` | `-y` |
-| `maestro-plan` | `--auto` |
-| `quality-test` | `--auto-fix` |
-| `quality-retrospective` | `--auto-yes` |
+| `maestro-init`, `maestro-analyze`, `maestro-brainstorm`, `maestro-ui-design`, `maestro-roadmap` | `-y` |
+| `maestro-plan`, `maestro-execute`, `maestro-milestone-complete` | `-y` |
+| `quality-business-test`, `quality-retrospective` | `-y` |
+| `quality-test` | `-y --auto-fix` |
 **`buildSkillCall(step, ctx)`**: Replace placeholders `{phase}`, `{description}`, `{issue_id}`, `{plan_dir}`, `{analysis_dir}`, `{brainstorm_dir}`, `{spec_session_id}` in `step.args` with corresponding `ctx` values. Append auto-yes flag if applicable. Return `$<skill> <args>`.

package/.codex/skills/maestro-init/SKILL.md CHANGED Viewed

@@ -1,26 +1,26 @@
 ---
 name: maestro-init
 description: Initialize project with auto state detection — creates .workflow/ directory, project.md, state.json, config.json, and specs/
-argument-hint: "[--auto] [--from-brainstorm SESSION-ID]"
+argument-hint: "[-y] [--from-brainstorm SESSION-ID]"
 allowed-tools: Read, Write, Edit, Bash, Glob, Grep, AskUserQuestion
 ---
 <purpose>
 Sequential project setup skill. Detects project state (empty/code/existing), gathers project information through deep questioning or document extraction, then creates the `.workflow/` directory structure. No parallel agents — single sequential flow.
-When `--auto`: After config questions, run research without further interaction. Expects idea document via @ reference.
+When `-y`: After config questions, run research without further interaction. Expects idea document via @ reference.
 </purpose>
 <context>
 ```bash
 $maestro-init ""
-$maestro-init "--auto"
+$maestro-init "-y"
 $maestro-init "--from-brainstorm 20260318-brainstorm-auth"
 ```
 **Flags**:
-- `--auto`: Skip interactive questioning; extract from provided document
+- `-y`: Skip interactive questioning; extract from provided document
 - `--from-brainstorm SESSION-ID`: Import vision/goals/constraints from brainstorm guidance-specification.md
 **Output**: `.workflow/` directory with project.md, state.json, config.json, specs/
@@ -29,7 +29,7 @@ $maestro-init "--from-brainstorm 20260318-brainstorm-auth"
 <invariants>
 1. **Never create roadmap** — init only creates .workflow/ structure; roadmap is a separate step
-2. **Deep questioning over speed** — follow threads, ask clarifying questions (unless --auto)
+2. **Deep questioning over speed** — follow threads, ask clarifying questions (unless -y)
 3. **Detect, don't assume** — scan for existing files, package managers, frameworks before asking
 4. **Templates are source of truth** — always read templates before writing files
 5. **Idempotent check** — if .workflow/ exists, refuse to overwrite (E002)
@@ -40,7 +40,7 @@ $maestro-init "--from-brainstorm 20260318-brainstorm-auth"
 ### Step 1: Parse Arguments
 Extract flags from arguments:
-- `--auto` flag presence
+- `-y` flag presence
 - `--from-brainstorm SESSION-ID` value
 - Remaining text as project description
@@ -60,7 +60,7 @@ Classify as:
 - Extract: vision, goals, constraints, terminology, tech decisions
 - Skip interactive questioning
-**If `--auto`**:
+**If `-y`**:
 - Extract project info from provided document/@ reference
 - Minimal interactive questions (confirm core value only)
@@ -94,7 +94,7 @@ Initialize from template with `current_milestone: null`, `status: "initialized"`
 ### Step 8: Write config.json
-Configuration questions (or defaults for --auto): granularity (fine/medium/coarse), workflow agents (enable/disable), gate preferences. Write to `.workflow/config.json`.
+Configuration questions (or defaults for -y): granularity (fine/medium/coarse), workflow agents (enable/disable), gate preferences. Write to `.workflow/config.json`.
 ### Step 9: Initialize specs/
@@ -110,7 +110,7 @@ Display created files and next steps: `$maestro-roadmap --mode full` (full spec)
 | Code | Severity | Description | Recovery |
 |------|----------|-------------|----------|
-| E001 | error | No arguments when --auto requires document | Ask user for document reference |
+| E001 | error | No arguments when -y requires document | Ask user for document reference |
 | E002 | error | .workflow/ already exists | Show status, suggest manage-status |
 | E003 | error | Brainstorm session not found | List available sessions |
 | W001 | warning | Could not detect tech stack | Continue with manual input |

package/.codex/skills/maestro-link-coordinate/SKILL.md CHANGED Viewed

@@ -150,10 +150,10 @@ Set `state.status` to completed/failed based on `node.status`. Record final hist
 | Skill | Flag |
 |-------|------|
-| `maestro-analyze`, `maestro-brainstorm`, `maestro-ui-design`, `maestro-roadmap` | `-y` |
-| `maestro-plan` | `--auto` |
-| `quality-test` | `--auto-fix` |
-| `quality-retrospective` | `--auto-yes` |
+| `maestro-init`, `maestro-analyze`, `maestro-brainstorm`, `maestro-ui-design`, `maestro-roadmap` | `-y` |
+| `maestro-plan`, `maestro-execute`, `maestro-milestone-complete` | `-y` |
+| `quality-business-test`, `quality-retrospective` | `-y` |
+| `quality-test` | `-y --auto-fix` |
 **buildSkillCall(node, ctx, autoMode)**: Substitute `{phase}`, `{description}`, `{issue_id}`, `{milestone_num}` from context into `node.args`. If autoMode, append auto flag from `node.auto_flag` or AUTO_FLAG_MAP. Return `$${node.cmd} ${resolvedArgs}`.

package/.codex/skills/maestro-milestone-complete/SKILL.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 name: maestro-milestone-complete
 description: Archive completed milestone scratch artifacts to milestones/ dir, move artifact entries to milestone_history, extract learnings, advance state.
-argument-hint: "[milestone] [--force]"
+argument-hint: "[milestone] [--force] [-y]"
 allowed-tools: Read, Write, Edit, Bash, Glob, Grep
 ---
@@ -49,7 +49,8 @@ Read `.summaries/` and `reflection-log.md` from execute artifacts. Extract patte
 2. **Convention drift**: Compare summaries against `coding-conventions.md` and `architecture-constraints.md` -- ask if conventions need updating
 3. **Wiki island check**: Auto-trigger `wiki-connect --fix` to link new knowledge
-If user confirms promotion, append `<spec-entry>` to target category file preserving original date and source.
+If `-y`: auto-accept all promotions without asking.
+If not `-y`: ask user for confirmation. If user confirms, append `<spec-entry>` to target category file preserving original date and source.
 ### Step 4: Archive Artifact Entries

package/.codex/skills/maestro-plan/SKILL.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 name: maestro-plan
 description: Exploration-driven planning via CSV wave pipeline. Wave 1 runs parallel codebase exploration agents, Wave 2 synthesizes explorations into plan.json + TASK-*.json. Replaces maestro-plan command.
-argument-hint: "[-y|--yes] [-c|--concurrency N] [--continue] \"<phase> [--auto] [--dir <path>] [--gaps] [--spec SPEC-xxx] [--collab]\""
+argument-hint: "[-y|--yes] [-c|--concurrency N] [--continue] \"<phase> [--dir <path>] [--gaps] [--spec SPEC-xxx] [--collab]\""
 allowed-tools: spawn_agents_on_csv, Read, Write, Edit, Bash, Glob, Grep, AskUserQuestion
 ---
@@ -51,7 +51,7 @@ Wave-based planning using `spawn_agents_on_csv`. Wave 1 explores codebase contex
 <context>
 ```bash
 $maestro-plan "3"
-$maestro-plan -y "3 --auto"
+$maestro-plan -y "3"
 $maestro-plan -c 4 "3 --spec SPEC-001"
 $maestro-plan "3 --gaps"
 $maestro-plan "3 --dir .workflow/scratch/quick-nav-fix"

package/.codex/skills/maestro-ralph/SKILL.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 name: maestro-ralph
 description: Closed-loop lifecycle decision engine — read state, infer position, build adaptive chain, execute via CSV waves, STOP at decision nodes for re-evaluation
-argument-hint: "\"intent\" | status | continue | execute"
+argument-hint: "\"intent\" [-y] | status | continue | execute"
 allowed-tools: spawn_agents_on_csv, Read, Write, Edit, Bash, Glob, Grep, AskUserQuestion
 ---
@@ -21,7 +21,7 @@ Key difference from maestro coordinator:
 Three node types in the chain:
 - **decision**: Barrier that STOPS execution. Ralph re-reads result files, decides whether to expand chain.
 - **skill**: Executed via `spawn_agents_on_csv`. Barrier skills (analyze, plan, execute, brainstorm) run solo. Non-barriers can parallel.
-- **cli**: Executed via `spawn_agents_on_csv` with delegate wrapper.
+- **cli**: Executed via `maestro delegate` (轻量替代，如 quick 模式的 review)。单步执行，不进 CSV wave。
 Session at `.workflow/.ralph/ralph-{YYYYMMDD-HHmmss}/status.json`.
 </purpose>
@@ -36,19 +36,52 @@ $ARGUMENTS — intent text, or keywords.
 otherwise             → handleNew(). Start from Phase 1.
 ```
+**Flags:**
+- `-y` / `--yes` — Auto mode: skip confirmation, decision nodes auto-evaluate并继续（不 STOP），错误自动重试一次后跳过。`-y` 存入 `session.auto`，传播到 ralph-execute 及下游 skill。
+**`-y` 传播链：**
+```
+ralph -y → session.auto = true
+         → wave CSV skill_call 附加 -y: $maestro-ralph-execute -y "$skill_call"
+           → ralph-execute 解析 -y，附加到目标 skill: $maestro-plan -y 1
+```
+**`-y` 下游传播表：**
+| Skill | 附加 Flag | 效果 |
+|-------|-----------|------|
+| maestro-init | `-y` | 跳过交互提问 |
+| maestro-analyze | `-y` | 跳过交互 scoping |
+| maestro-brainstorm | `-y` | 跳过交互提问 |
+| maestro-roadmap | `-y` | 跳过交互选择 |
+| maestro-plan | `-y` | 跳过确认和澄清 |
+| maestro-execute | `-y` | 跳过确认，blocked 自动继续 |
+| maestro-verify | *(none)* | 无交互，正常执行 |
+| quality-business-test | `-y` | 跳过计划确认 |
+| quality-review | *(none)* | 无交互确认，自动检测级别 |
+| quality-test | `-y --auto-fix` | 自动触发 gap-fix loop |
+| quality-test-gen | *(none)* | 无交互，正常生成 |
+| quality-debug | *(none)* | 无交互确认，正常诊断 |
+| maestro-milestone-audit | *(none)* | 无交互，正常执行 |
+| maestro-milestone-complete | `-y` | 跳过 knowledge promotion 交互 |
+未列出的命令无 auto flag，原样执行。
 **Decision-node detection (for execute mode):**
 If status.json has a pending decision node as next step → Phase 2b (evaluate), not Phase 2a (spawn).
 </context>
 <invariants>
-1. **ALL skills via spawn_agents_on_csv**: Coordinator NEVER executes skills directly.
-2. **Decision nodes STOP execution**: After processing a decision node, coordinator writes status.json and STOPS. User must call `$maestro-ralph execute` to resume.
+1. **Skills via spawn_agents_on_csv, CLI via delegate**: Coordinator NEVER executes skills directly. CLI steps use `maestro delegate`.
+2. **Decision nodes STOP execution**: After processing a decision node, coordinator writes status.json and STOPS. User must call `$maestro-ralph execute` to resume. **例外：`-y` 模式下 decision 自动评估后继续，不 STOP（post-debug-escalate 除外）。**
 3. **Barrier = solo wave**: barrier skills (analyze, plan, execute, brainstorm, roadmap) always run alone.
 4. **Non-barriers can parallel**: consecutive non-barrier + non-decision steps grouped into one wave.
-5. **Decision = barrier + stop**: decision node is always solo AND halts the loop.
+5. **Decision = barrier + conditional stop**: decision node is always solo. 默认 STOP；`-y` 模式自动继续。
 6. **Wave-by-wave**: never start wave N+1 before wave N results are read.
 7. **Coordinator owns context**: sub-agents never read prior results.
-8. **Abort on failure**: failed step → mark remaining skipped → pause session.
+8. **Abort on failure**: failed step → `-y` 模式重试一次后跳过并继续；非 `-y` 模式 mark remaining skipped → pause session.
+9. **Quality mode governs steps**: quality_mode (full/standard/quick) 决定哪些质量步骤被包含。
+10. **passed_gates skip**: 重试循环中已通过的质量门不重复执行（除非代码变更影响了其检查范围）。
 </invariants>
 <execution>
@@ -116,7 +149,7 @@ When latest is "verify", read result files to refine position:
   resolve_artifact_dir(latest_verify_artifact)
   Read verification.json from that dir:
     gaps[] non-empty or passed == false         → "verify-failed" (needs fix loop)
-    passed == true, no review.json              → "business-test"
+    passed == true, no review.json              → "post-verify" (chain builder 按 quality_mode 决定下一步)
     has review.json with verdict == "BLOCK"     → "review-failed"
     has review.json with verdict != "BLOCK"     → "test"
     has uat.md with status == "complete", all passed → "milestone-audit"
@@ -133,29 +166,85 @@ Fallback: glob .workflow/scratch/*-P{phase}-*/ sorted by date DESC, take first
 ### 1c: Build command sequence
-**Lifecycle stages** (full pipeline):
-```
-Stage              Skill                         Barrier  Decision After
-──────────────────────────────────────────────────────────────────────────
-brainstorm         maestro-brainstorm "{intent}" yes      — (0→1 only)
-init               maestro-init                  no       —
-roadmap            maestro-roadmap "{intent}"    yes      —
-analyze            maestro-analyze {phase}       yes      —
-plan               maestro-plan {phase}          yes      —
-execute            maestro-execute {phase}       yes      —
-verify             maestro-verify {phase}        no       decision:post-verify
-business-test      quality-business-test {phase}  no       decision:post-business-test
-review             quality-review {phase}        no       decision:post-review
-test-gen           quality-test-gen {phase}      no       —
-test               quality-test {phase}          no       decision:post-test
-milestone-audit    maestro-milestone-audit       no       —
-milestone-complete maestro-milestone-complete    no       decision:post-milestone
+**Quality pipeline modes** (`quality_mode` in session):
+| Mode | 含义 | 质量步骤 |
+|------|------|----------|
+| `full` | 全量质量管线 | verify → business-test → review → test-gen → test |
+| `standard` | 标准管线（默认） | verify → review → test（跳过 business-test、test-gen 按条件） |
+| `quick` | 轻量验证 | verify → CLI-review（跳过 business-test、test-gen、test） |
+Mode 选择逻辑（Phase 1a 后自动推断，可被用户覆盖）：
+```
+有 requirements/REQ-*.md 且 phase scope == "phase" → full
+其他场景                                           → standard
+用户显式指定                                        → 覆盖自动推断
+```
+**Lifecycle stages** (带条件的完整管线):
+```
+Stage              Skill                          Barrier  Decision After          Condition
+───────────────────────────────────────────────────────────────────────────────────────────────
+brainstorm         maestro-brainstorm "{intent}"  yes      —                       0→1 only
+init               maestro-init                   no       —                       always
+roadmap            maestro-roadmap "{intent}"     yes      —                       always
+analyze            maestro-analyze {phase}        yes      —                       always
+plan               maestro-plan {phase}           yes      —                       always
+execute            maestro-execute {phase}        yes      —                       always
+verify             maestro-verify {phase}         no       decision:post-verify    always
+business-test      quality-business-test {phase}  no       decision:post-biz-test  full only ①
+review             quality-review {phase}         no       decision:post-review    full/standard ②
+  └─ CLI alt       delegate --role review         —        decision:post-review    quick ②
+test-gen           quality-test-gen {phase}       no       —                       full; standard 按条件 ③
+test               quality-test {phase}           no       decision:post-test      full/standard ④
+milestone-audit    maestro-milestone-audit        no       —                       always
+milestone-complete maestro-milestone-complete     no       decision:post-milestone always
+```
+**条件说明：**
+- ① `business-test`: 仅 full 模式。与 `quality-test` 有 40% 重叠（PRD 正向 vs 代码反向），full 模式两者互补覆盖，standard/quick 模式省略
+- ② `review`: full/standard 用完整 skill spawn（6 维度并行）；quick 模式改用 CLI delegate（轻量代码审查）
+- ③ `test-gen`: full 模式始终执行；standard 模式仅在 `validation.json` 覆盖率 < 80% 或不存在时执行
+- ④ `test`: full/standard 执行；quick 模式跳过（依赖 verify + CLI-review 即可）
+**CLI review 替代（quick 模式）：**
+```json
+{
+  "type": "cli",
+  "skill": "maestro delegate",
+  "args": "\"review changed files in phase {phase}\" --role review --mode analysis --rule analysis-review-code-quality",
+  "output_file": "{artifact_dir}/review.json"
+}
+```
+CLI review 输出需符合 review.json schema（verdict + issues[]），供 post-review 决策节点消费。
+**条件步骤的链构建：**
+```
+buildSteps(position, target, quality_mode):
+  steps = lifecycle_stages[position..target]
+  # 按 quality_mode 过滤
+  if quality_mode != "full":
+    remove business-test + decision:post-biz-test
+  if quality_mode == "quick":
+    replace review skill → CLI review
+    remove test-gen
+    remove test + decision:post-test
+  if quality_mode == "standard":
+    # test-gen 延迟决定：在 post-verify 决策后检查覆盖率
+    mark test-gen as conditional: "check_coverage"
+  return steps
 ```
 Generate `steps[]` from current position to target. Decision nodes use:
 ```json
 { "type": "decision", "skill": "maestro-ralph", "args": "{\"decision\":\"post-verify\",\"retry_count\":0,\"max_retries\":2}" }
 ```
+Conditional steps use:
+```json
+{ "type": "skill", "skill": "quality-test-gen {phase}", "condition": "check_coverage", "threshold": 80 }
+```
 ### 1d: Create session
@@ -170,7 +259,9 @@ Write `.workflow/.ralph/ralph-{YYYYMMDD-HHmmss}/status.json`:
   "target": "milestone-complete",
   "phase": null,
   "milestone": null,
-  "auto_mode": false,
+  "auto": false,
+  "quality_mode": "standard",
+  "passed_gates": [],
   "context": { "plan_dir": null, "analysis_dir": null, "brainstorm_dir": null },
   "steps": [...],
   "waves": [],
@@ -187,18 +278,25 @@ Write `.workflow/.ralph/ralph-{YYYYMMDD-HHmmss}/status.json`:
 ============================================================
   Position:  {position} (Phase {N}, {milestone})
   Target:    milestone-complete
+  Quality:   {quality_mode} (full|standard|quick)
   Steps:     {total} ({decision_count} decision points)
   [ ] 0. maestro-plan {phase}              [skill/barrier]
   [ ] 1. maestro-execute {phase}           [skill/barrier]
   [ ] 2. maestro-verify {phase}            [skill]
   [ ] 3. ◆ post-verify                     [decision] ← STOP
-  [ ] 4. quality-business-test {phase}     [skill]
+  [ ] 4. quality-review {phase}            [skill]        ← standard
+  [ ] 4. quality-review {phase}            [cli/delegate] ← quick
+  [ ] 5. ◆ post-review                     [decision] ← STOP
   ...
+  ── skipped (standard mode) ──────────────────────────────
+  [~] _. quality-business-test {phase}     [skip: standard]
+  [?] _. quality-test-gen {phase}          [conditional: coverage < 80%]
 ============================================================
 ```
-If not auto_mode: AskUserQuestion → Proceed / Cancel
+If not auto: AskUserQuestion → Proceed / Cancel / Change quality mode
+If auto (`-y`): skip confirmation, proceed directly
 ### 1f: Fall through to Phase 2
@@ -225,7 +323,7 @@ Sort by created_at DESC
 For the decision type, find the relevant artifact:
   post-verify        → latest type=="verify" artifact
-  post-business-test → same dir as verify (business-test writes to same artifact dir)
+  post-biz-test      → same dir as verify (business-test writes to same artifact dir)
   post-review        → latest artifact dir → review.json
   post-test          → latest artifact dir → uat.md + .tests/test-results.json
@@ -234,6 +332,9 @@ artifact_dir = resolve_artifact_dir(artifact)
 **Evaluate by decision type:**
+> **passed_gates 机制**：session.passed_gates[] 记录已通过的质量门。重试循环中跳过已通过的门，避免重复执行。
+> 当代码被修改（debug+plan+execute）后，清除 passed_gates 中被影响的门（verify 始终重新执行）。
 **post-verify:**
 ```
 Read {artifact_dir}/verification.json
@@ -250,10 +351,17 @@ If gaps found (passed == false or gaps[].length > 0):
     → Display: ◆ post-verify: gaps detected, inserting debug+fix loop (retry {N}/{max})
 If no gaps (passed == true):
+  → Add "verify" to passed_gates
+  → 条件检查 test-gen（standard 模式）：
+    Read {artifact_dir}/validation.json
+    If coverage < 80% or validation.json not found:
+      activate conditional test-gen step (set condition = "met")
+    Else:
+      skip test-gen step (set status = "skipped")
   → No insertion, proceed
 ```
-**post-business-test:**
+**post-biz-test (仅 full 模式):**
 ```
 Read {artifact_dir}/business-test-results.json or scan for business test output
 Check: failures[] or passed field
@@ -262,12 +370,14 @@ If failures found:
   If meta.retry_count >= meta.max_retries:
     → Insert: [quality-debug --from-business-test {phase}, decision:post-debug-escalate]
   Else:
+    → Clear passed_gates (code will change)
     → Insert: [quality-debug --from-business-test {phase},
                maestro-plan --gaps {phase}, maestro-execute {phase},
                maestro-verify {phase}, decision:post-verify(retry:0),
-               quality-business-test {phase}, decision:post-business-test(retry+1)]
+               quality-business-test {phase}, decision:post-biz-test(retry+1)]
 If all pass:
+  → Add "business-test" to passed_gates
   → No insertion, proceed
 ```
@@ -280,15 +390,18 @@ If verdict == "BLOCK" or any issue.severity == "critical":
   If meta.retry_count >= meta.max_retries:
     → Insert: [quality-debug "{block_summary}", decision:post-debug-escalate]
   Else:
+    → Clear passed_gates (code will change)
     → Insert: [quality-debug "{block_issues}",
                maestro-plan --gaps {phase}, maestro-execute {phase},
                quality-review {phase}, decision:post-review(retry+1)]
+    注：review 失败只重跑 review，不回滚到 verify（verify 已通过且代码仅修复 review 问题）
 If verdict == "PASS" or "WARN":
+  → Add "review" to passed_gates
   → No insertion, proceed
 ```
-**post-test:**
+**post-test (仅 full/standard 模式):**
 ```
 Read {artifact_dir}/uat.md (parse frontmatter + gap sections)
 Read {artifact_dir}/.tests/test-results.json if exists
@@ -297,15 +410,19 @@ If failures found (any test result != pass, or gaps with severity >= high):
   If meta.retry_count >= meta.max_retries:
     → Insert: [quality-debug --from-uat {phase}, decision:post-debug-escalate]
   Else:
+    → Clear passed_gates (code will change)
+    → 轻量重试：仅重新执行 verify + 未通过的质量门
     → Insert: [quality-debug --from-uat {phase},
                maestro-plan --gaps {phase}, maestro-execute {phase},
                maestro-verify {phase}, decision:post-verify(retry:0),
-               quality-business-test {phase}, decision:post-business-test(retry:0),
-               quality-review {phase}, decision:post-review(retry:0),
-               quality-test-gen {phase}, quality-test {phase},
-               decision:post-test(retry+1)]
+               // 对 passed_gates 中的每个门：对比修改文件列表与该门的检查范围
+               //   有交集 → 重新插入该门 + 对应 decision
+               //   无交集 → 跳过（不插入）
+               quality-test {phase}, decision:post-test(retry+1)]
+    注：不再重新插入整条管线。verify 始终重跑（代码已变），其余门按影响范围判断。
 If all pass:
+  → Add "test" to passed_gates
   → No insertion, proceed
 ```
@@ -319,22 +436,19 @@ If next milestone found:
   first_phase = next_m.phases[0]
   Update ralph session: milestone = next_m.name, phase = first_phase
-  → Insert full lifecycle for next milestone:
+  → Reset passed_gates = []
+  → Re-infer quality_mode for next milestone (check REQ-*.md existence)
+  → Insert lifecycle for next milestone (按 quality_mode 过滤):
     [maestro-analyze {first_phase} [barrier],
      maestro-plan {first_phase} [barrier],
      maestro-execute {first_phase} [barrier],
      maestro-verify {first_phase},
      decision:post-verify(retry:0),
-     quality-business-test {first_phase},
-     decision:post-business-test(retry:0),
-     quality-review {first_phase},
-     decision:post-review(retry:0),
-     quality-test-gen {first_phase},
-     quality-test {first_phase},
-     decision:post-test(retry:0),
+     ...quality steps per quality_mode (see 1c buildSteps)...,
      maestro-milestone-audit,
      maestro-milestone-complete,
      decision:post-milestone]
+  注：使用 buildSteps() 按当前 quality_mode 生成质量步骤，不硬编码完整管线
   → Display: ◆ post-milestone: {completed_m.name} done → advancing to {next_m.name} Phase {first_phase}
@@ -357,13 +471,17 @@ After evaluation:
 2. Reindex steps if inserted
 3. Write status.json
 4. Display: `◆ Decision: {type} → {outcome}`
-5. Fall through to Phase 2c (continue executing next steps)
+5. **STOP 判定：**
+   - `post-debug-escalate` → 始终 STOP（无论 `-y` 与否）
+   - `auto == true` (`-y`) → 不 STOP，直接 fall through to Phase 2c
+   - `auto == false` → STOP。Display: `⏸ 到达决策节点。使用 $maestro-ralph execute 继续。`
 ### 2c: Build and Execute Next Wave
 **While pending non-decision steps remain:**
 1. **buildNextWave**: Take first pending step.
+   - If conditional step with condition not met → mark "skipped", advance to next
    - If barrier → solo wave
    - If non-barrier → collect consecutive non-barrier, non-decision steps
    - Stop at first decision node (it will be processed in next `execute` call)
@@ -377,15 +495,32 @@ After evaluation:
    {analysis_dir}→ status.context.analysis_dir
    ```
-3. **Write wave CSV**: `{sessionDir}/wave-{N}.csv`
+3. **Route by step type:**
+   **type == "skill"** → Write wave CSV: `{sessionDir}/wave-{N}.csv`
    Each row spawns a `$maestro-ralph-execute` agent with the target skill_call as argument:
    ```csv
    id,skill_call,topic
    "3","$maestro-ralph-execute \"$maestro-verify 1\"","Ralph step 3/14: verify phase 1"
    ```
+   当 `session.auto == true` 时，skill_call 附加 `-y`：
+   ```csv
+   "3","$maestro-ralph-execute -y \"$maestro-verify 1\"","Ralph step 3/14: verify phase 1"
+   ```
+   ralph-execute 解析 `-y` 后，按传播表对目标 skill 附加对应 auto flag。
    The inner `$maestro-verify 1` is the actual skill; `$maestro-ralph-execute` is the worker wrapper.
-4. **Spawn**:
+   **type == "cli"** → CLI delegate 执行（quick 模式 review 等）：
+   ```
+   Bash({
+     command: 'maestro delegate "{step.args}" --mode analysis',
+     run_in_background: true
+   })
+   ```
+   等待回调 → `maestro delegate output <id>` → 解析输出写入 `{artifact_dir}/{output_file}`
+   CLI 步骤始终单步执行，不进 CSV wave。
+4. **Spawn** (仅 skill 类型):
    ```
    spawn_agents_on_csv({
      csv_path: "{sessionDir}/wave-{N}.csv",
@@ -398,7 +533,7 @@ After evaluation:
    })
    ```
-5. **Read results**: Update step status from results CSV
+5. **Read results**: Update step status from results CSV (skill) or delegate output (cli)
 6. **Barrier check**: If wave was a barrier skill, read artifacts, update context:
    | Barrier | Read | Update |
@@ -413,8 +548,9 @@ After evaluation:
 8. **Failure check**: Any step failed → mark remaining skipped, pause session, STOP
-9. **Decision check**: If next pending step is a decision node → STOP.
-   Display: `⏸ 到达决策节点: {decision_type}。使用 $maestro-ralph execute 继续。`
+9. **Decision check**: If next pending step is a decision node:
+   - `auto == true` → 不 STOP，直接进入 Phase 2b 评估该决策节点，然后继续循环
+   - `auto == false` → STOP。Display: `⏸ 到达决策节点: {decision_type}。使用 $maestro-ralph execute 继续。`
 10. **Continue**: If next pending is not decision, loop back to step 1
@@ -451,15 +587,17 @@ Write status.json
   RALPH COMPLETE
 ============================================================
   Session:  {id}
+  Quality:  {quality_mode}
   Phase:    {phase} → {milestone}
   Waves:    {wave_count} executed
-  Steps:    {completed}/{total}
+  Steps:    {completed}/{total} ({skipped} skipped)
   [✓] 0. maestro-plan 1            [W1]
   [✓] 1. maestro-execute 1         [W2]
   [✓] 2. maestro-verify 1          [W3]
   [✓] 3. ◆ post-verify             [decision: no gaps]
-  [✓] 4. quality-business-test 1   [W4]
+  [~] 4. quality-business-test 1   [skipped: standard mode]
+  [✓] 5. quality-review 1          [W4]
   ...
   Resume: $maestro-ralph execute
@@ -479,11 +617,12 @@ id,skill_call,topic
 "4","$maestro-ralph-execute \"$quality-business-test 1\"","Ralph step 4/14: business test phase 1"
 ```
-- `skill_call` column: always `$maestro-ralph-execute "<inner_skill_call>"`
+- `skill_call` column: `$maestro-ralph-execute [-y] "<inner_skill_call>"`（`session.auto` 时附加 `-y`）
 - `topic` column: human-readable step description
 - Non-barrier + non-decision steps can be grouped in one wave CSV with multiple rows
 - Barrier steps always solo (one row per CSV)
 - Decision steps are NEVER in CSV — processed by ralph directly
+- CLI steps (type=="cli") are NEVER in CSV — processed by ralph via maestro delegate
 </csv_schema>
 <error_codes>
@@ -504,16 +643,20 @@ id,skill_call,topic
 - [ ] state.json artifacts correctly read with actual schema (type, path, scope, milestone, depends_on)
 - [ ] Lifecycle position inferred from artifacts + result files (verification.json, review.json, uat.md)
 - [ ] Artifact dir resolved via resolve_artifact_dir() with fallback globs
-- [ ] Full quality pipeline: verify → business-test → review → test-gen → test
-- [ ] Decision nodes at: post-verify, post-business-test, post-review, post-test, post-milestone
+- [ ] Quality mode (full/standard/quick) 正确推断并影响步骤生成
+- [ ] Conditional steps: business-test 仅 full 模式，test-gen 按覆盖率条件
+- [ ] CLI 替代: quick 模式 review 走 delegate 而非 skill spawn
+- [ ] Decision nodes at: post-verify, post-biz-test (full only), post-review, post-test (full/standard), post-milestone
 - [ ] Every decision failure path starts with quality-debug before plan --gaps
+- [ ] passed_gates[] 正确追踪，重试时跳过已通过的质量门
+- [ ] 重试循环轻量化：post-test 失败不重跑整条管线，仅重跑未通过的门
 - [ ] retry_count tracked per decision node, max_retries enforced
 - [ ] Max retries → post-debug-escalate → session paused for human intervention
-- [ ] All skills via spawn_agents_on_csv (through ralph-execute) — coordinator never executes directly
+- [ ] Skills via spawn_agents_on_csv, CLI via delegate — coordinator never executes directly
 - [ ] Decision nodes STOP execution — user must call `execute` to resume
 - [ ] Barrier skills run solo, non-barriers grouped in parallel waves
 - [ ] Placeholder args resolved before CSV assembly ({phase}, {intent}, {scratch_dir})
-- [ ] post-milestone inserts next milestone lifecycle with recursive post-milestone
+- [ ] post-milestone 用 buildSteps() 生成下一个 milestone 的步骤（按 quality_mode）
 - [ ] status.json persisted after every wave
 - [ ] Command insertion + reindex works correctly after decision expansion
 </success_criteria>

package/.codex/skills/maestro-ralph-execute/SKILL.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 name: maestro-ralph-execute
 description: Single-step skill executor — spawned by maestro-ralph via CSV, reads ralph session context, executes one skill command, reports result
-argument-hint: "<skill_call>"
+argument-hint: "[-y] <skill_call>"
 allowed-tools: Read, Write, Edit, Bash, Glob, Grep
 ---
@@ -62,6 +62,10 @@ Read-only for this agent. Provides:
 ## Step 1: Parse skill_call
 ```
+Parse $ARGUMENTS:
+  Contains "-y" or "--yes" → auto = true, remove flag from remaining args
+  Remaining → skill_call
 Extract from skill_call:
   skill_name = text between $ and first space (e.g. "maestro-plan")
   skill_args = remainder after first space (e.g. "1")
@@ -71,6 +75,8 @@ If skill_call is empty or malformed:
   → End.
 ```
+Also read `session.auto` from ralph status.json — if `true`, treat as `-y` even if flag not passed.
 ## Step 2: Load ralph session context
 ```
@@ -134,6 +140,24 @@ maestro-verify, maestro-milestone-audit, maestro-milestone-complete:
 ## Step 4: Execute skill
+**`-y` auto flag 传播：** 当 `auto == true` 时，按传播表附加 flag：
+```
+auto_flag_map = {
+  "maestro-init": "-y",
+  "maestro-analyze": "-y",
+  "maestro-brainstorm": "-y",
+  "maestro-roadmap": "-y",
+  "maestro-plan": "-y",
+  "maestro-execute": "-y",
+  "quality-business-test": "-y",
+  "quality-test": "-y --auto-fix",
+  "quality-retrospective": "-y",
+  "maestro-milestone-complete": "-y"
+}
+flag = auto_flag_map[skill_name] || ""
+skill_args = flag ? `${skill_args} ${flag}` : skill_args
+```
 ```
 Read .codex/skills/{skill_name}/SKILL.md to understand the skill
 Execute the skill with enriched skill_args as $ARGUMENTS

package/.codex/skills/quality-business-test/SKILL.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 name: quality-business-test
 description: PRD-forward business testing with requirement traceability, multi-layer execution (L1 Interface -> L2 Business Rule -> L3 Scenario), fixture generation, and feedback loop.
-argument-hint: "<phase> [--spec SPEC-xxx] [--layer L1|L2|L3] [--gen-code] [--dry-run] [--re-run] [--auto]"
+argument-hint: "<phase> [--spec SPEC-xxx] [--layer L1|L2|L3] [--gen-code] [--dry-run] [--re-run] [-y]"
 allowed-tools: Read, Write, Edit, Bash, Glob, Grep, Agent, AskUserQuestion
 ---
@@ -37,7 +37,7 @@ $quality-business-test "3 --gen-code"               # generate framework-specifi
 $quality-business-test "3 --dry-run"                # extract scenarios only, don't execute
 $quality-business-test "3 --re-run"                 # re-run only previously failed scenarios
 $quality-business-test "3 --spec SPEC-auth-2026-04" # explicit spec reference
-$quality-business-test "3 --auto"                   # skip plan confirmation
+$quality-business-test "3 -y"                   # skip plan confirmation
 ```
 **Flags**:
@@ -47,9 +47,9 @@ $quality-business-test "3 --auto"                   # skip plan confirmation
 - `--gen-code`: Generate framework-specific test classes (JUnit/RestAssured, supertest/vitest, pytest/httpx)
 - `--dry-run`: Extract scenarios and fixtures only, don't execute
 - `--re-run`: Re-run only previously failed/blocked scenarios
-- `--auto`: Skip interactive confirmations
+- `-y`: Skip interactive confirmations
-`--auto` skips interactive confirmation of test plan. `--dry-run` extracts scenarios only without execution.
+`-y` skips interactive confirmation of test plan. `--dry-run` extracts scenarios only without execution.
 **Output**: `{artifact_dir}/.tests/business/business-test-plan.json` + `business-test-report.json` + `business-test-summary.md`
 </context>
@@ -125,7 +125,7 @@ Three tiers:
 1. Archive previous `business-test-plan.json` to `.history/` if exists
 2. Write `.tests/business/business-test-plan.json` with scenarios, fixtures, mock_contracts, requirement_coverage_plan
 3. Display plan summary (scenario counts per layer, fixture counts, requirement coverage)
-4. If not `--auto`: wait for user confirmation (yes/edit/cancel)
+4. If not `-y`: wait for user confirmation (yes/edit/cancel)
 5. If `--dry-run`: stop here, report plan
 ### Step 5: Generate Test Code (if --gen-code)
@@ -209,7 +209,7 @@ Map each result to `REQ-NNN:AC-N`. Per AC: `passed` (all scenarios pass), `faile
 - [ ] Phase resolved and spec package loaded (or degraded mode activated)
 - [ ] Business test scenarios extracted from PRD acceptance criteria
 - [ ] Fixtures generated for all layers
-- [ ] Test plan written and confirmed (or --auto/--dry-run)
+- [ ] Test plan written and confirmed (or -y/--dry-run)
 - [ ] Tests executed progressively L1 -> L2 -> L3 with fail-fast
 - [ ] Traceability matrix maps every result to REQ-NNN:AC-N
 - [ ] Reports generated (JSON + summary markdown)

package/.codex/skills/quality-retrospective/SKILL.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 name: quality-retrospective
 description: Multi-lens 复盘 (retrospective) for completed phases. Context-Agent Fork loads phase artifacts once; four parallel lens agents (technical, process, quality, decision) analyze independently; synthesizer distills insights; outputs are routed to spec stubs, knowhow tips, issues, and lessons.jsonl.
-argument-hint: "[phase|N..M] [--lens technical|process|quality|decision] [--all] [--no-route] [--compare N] [--auto-yes]"
+argument-hint: "[phase|N..M] [--lens technical|process|quality|decision] [--all] [--no-route] [--compare N] [-y]"
 allowed-tools: Read, Write, Edit, Bash, Glob, Grep
 ---
@@ -50,7 +50,7 @@ $quality-retrospective "3"
 $quality-retrospective "2..4"
 $quality-retrospective "--all"
 $quality-retrospective "3 --lens technical --no-route"
-$quality-retrospective "3 --compare 2 --auto-yes"
+$quality-retrospective "3 --compare 2 -y"
 ```
 **Flags**:
@@ -61,9 +61,9 @@ $quality-retrospective "3 --compare 2 --auto-yes"
 - `--lens <name>` -- restrict to one lens (repeatable): `technical|process|quality|decision`
 - `--no-route` -- produce retrospective.{md,json} only; skip auto-creation of spec/note/issue
 - `--compare <M>` -- emit a delta section vs phase M's prior retrospective
-- `--auto-yes` -- accept all routing recommendations without prompting
+- `-y` -- accept all routing recommendations without prompting
-When `--auto-yes`: Accept all routing recommendations without prompting. Route all insights automatically.
+When `-y`: Accept all routing recommendations without prompting. Route all insights automatically.
 **Storage written**:
 - `{target_dir}/retrospective.md` -- human-readable record (target_dir resolved via state.json artifact registry to `.workflow/scratch/{YYYYMMDD}-{type}-{slug}/`)
@@ -124,7 +124,7 @@ Each artifact's type determines its outputs at `.workflow/{a.path}/`:
 6. **Stable INS-ids**: `INS-{8hex}` from `hash(phase_num + lens + title)` -- re-runs do not create duplicates
 7. **Archive before overwrite**: Move existing retrospective.{md,json} to `.history/` with timestamp before writing new ones
 8. **Spec learnings.md backward-compat**: Append to it only if it already exists -- never create it
-9. **Route confirmation**: Unless `--auto-yes`, present routing table and ask per-group before writing spec/issue/knowhow
+9. **Route confirmation**: Unless `-y`, present routing table and ask per-group before writing spec/issue/knowhow
 10. **Lessons always written**: Append to `lessons.jsonl` regardless of `--no-route` -- routing only controls spec/issue/knowhow creation
 </invariants>

package/.codex/skills/quality-test/SKILL.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 name: quality-test
 description: Conversational UAT with session persistence, auto-diagnosis, and gap-plan closure loop. Interactive testing flow with severity inference and parallel debug agents.
-argument-hint: "<phase> [--auto-fix] [--session ID]"
+argument-hint: "<phase> [-y] [--auto-fix] [--session ID]"
 allowed-tools: Read, Write, Edit, Bash, Glob, Grep, Agent, AskUserQuestion
 ---
@@ -29,7 +29,7 @@ $quality-test "--session 04-comments"  # resume specific session
 - `--auto-fix`: Auto-trigger gap-fix loop (plan --gaps -> execute -> re-verify) on failures
 - `--session ID`: Resume a specific UAT session
-No auto mode -- UAT is inherently interactive. `--auto-fix` only automates gap closure, not test execution.
+`-y` implies `--auto-fix`。UAT 执行本身保持交互（展示预期 → 确认），`-y` 仅自动化 gap closure loop。
 **Output**: `{target_dir}/uat.md` + `.tests/test-plan.json` + `.tests/test-results.json` + `.tests/coverage-report.json`
 </context>

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "maestro-flow",
-  "version": "0.3.29",
+  "version": "0.3.31",
   "description": "Workflow orchestration CLI with MCP endpoint support and extensible architecture",
   "type": "module",
   "imports": {