maestro-flow 0.3.38 → 0.3.40

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (187) hide show
  1. package/.claude/agents/workflow-analyzer.md +2 -0
  2. package/.claude/agents/workflow-debugger.md +2 -0
  3. package/.claude/agents/workflow-executor.md +2 -0
  4. package/.claude/agents/workflow-integration-checker.md +1 -0
  5. package/.claude/agents/workflow-nyquist-auditor.md +1 -0
  6. package/.claude/agents/workflow-planner.md +2 -0
  7. package/.claude/agents/workflow-reviewer.md +2 -0
  8. package/.claude/agents/workflow-verifier.md +2 -0
  9. package/.claude/commands/learn-decompose.md +176 -176
  10. package/.claude/commands/learn-follow.md +167 -167
  11. package/.claude/commands/learn-investigate.md +221 -221
  12. package/.claude/commands/learn-retro.md +303 -303
  13. package/.claude/commands/learn-second-opinion.md +167 -167
  14. package/.claude/commands/maestro-amend.md +300 -300
  15. package/.claude/commands/maestro-analyze.md +130 -126
  16. package/.claude/commands/maestro-brainstorm.md +104 -100
  17. package/.claude/commands/maestro-composer.md +354 -354
  18. package/.claude/commands/maestro-execute.md +120 -114
  19. package/.claude/commands/maestro-fork.md +86 -86
  20. package/.claude/commands/maestro-init.md +78 -78
  21. package/.claude/commands/maestro-learn.md +140 -140
  22. package/.claude/commands/maestro-link-coordinate.md +1 -1
  23. package/.claude/commands/maestro-merge.md +61 -61
  24. package/.claude/commands/maestro-milestone-release.md +96 -96
  25. package/.claude/commands/maestro-overlay.md +178 -178
  26. package/.claude/commands/maestro-plan.md +154 -138
  27. package/.claude/commands/maestro-player.md +404 -404
  28. package/.claude/commands/maestro-quick.md +56 -56
  29. package/.claude/commands/maestro-ralph-execute.md +7 -18
  30. package/.claude/commands/maestro-ralph.md +48 -12
  31. package/.claude/commands/maestro-roadmap.md +1 -1
  32. package/.claude/commands/maestro-ui-design.md +93 -93
  33. package/.claude/commands/maestro-update.md +176 -176
  34. package/.claude/commands/maestro-verify.md +96 -90
  35. package/.claude/commands/maestro.md +121 -121
  36. package/.claude/commands/manage-codebase-rebuild.md +75 -75
  37. package/.claude/commands/manage-codebase-refresh.md +57 -57
  38. package/.claude/commands/manage-harvest.md +94 -94
  39. package/.claude/commands/manage-issue-discover.md +77 -77
  40. package/.claude/commands/manage-issue.md +73 -73
  41. package/.claude/commands/manage-knowhow-capture.md +193 -193
  42. package/.claude/commands/manage-knowhow.md +77 -77
  43. package/.claude/commands/manage-learn.md +67 -67
  44. package/.claude/commands/manage-status.md +51 -51
  45. package/.claude/commands/manage-wiki.md +62 -62
  46. package/.claude/commands/quality-auto-test.md +5 -1
  47. package/.claude/commands/quality-debug.md +125 -115
  48. package/.claude/commands/quality-refactor.md +55 -55
  49. package/.claude/commands/quality-retrospective.md +78 -78
  50. package/.claude/commands/quality-review.md +114 -108
  51. package/.claude/commands/quality-sync.md +51 -51
  52. package/.claude/commands/quality-test.md +107 -103
  53. package/.claude/commands/spec-add.md +49 -49
  54. package/.claude/commands/spec-load.md +51 -51
  55. package/.claude/commands/spec-remove.md +51 -51
  56. package/.claude/commands/spec-setup.md +51 -51
  57. package/.claude/commands/wiki-connect.md +62 -62
  58. package/.claude/commands/wiki-digest.md +69 -69
  59. package/.codex/skills/learn-decompose/SKILL.md +113 -113
  60. package/.codex/skills/learn-follow/SKILL.md +1 -1
  61. package/.codex/skills/learn-investigate/SKILL.md +83 -83
  62. package/.codex/skills/learn-retro/SKILL.md +83 -83
  63. package/.codex/skills/learn-second-opinion/SKILL.md +86 -86
  64. package/.codex/skills/maestro/SKILL.md +304 -304
  65. package/.codex/skills/maestro-analyze/SKILL.md +11 -3
  66. package/.codex/skills/maestro-brainstorm/SKILL.md +451 -397
  67. package/.codex/skills/maestro-composer/SKILL.md +213 -213
  68. package/.codex/skills/maestro-execute/SKILL.md +346 -318
  69. package/.codex/skills/maestro-fork/SKILL.md +56 -2
  70. package/.codex/skills/maestro-init/SKILL.md +40 -16
  71. package/.codex/skills/maestro-learn/SKILL.md +80 -80
  72. package/.codex/skills/maestro-link-coordinate/SKILL.md +257 -257
  73. package/.codex/skills/maestro-merge/SKILL.md +1 -1
  74. package/.codex/skills/maestro-milestone-audit/SKILL.md +1 -1
  75. package/.codex/skills/maestro-milestone-complete/SKILL.md +40 -9
  76. package/.codex/skills/maestro-milestone-release/SKILL.md +70 -70
  77. package/.codex/skills/maestro-overlay/SKILL.md +1 -1
  78. package/.codex/skills/maestro-plan/SKILL.md +29 -4
  79. package/.codex/skills/maestro-player/SKILL.md +323 -323
  80. package/.codex/skills/maestro-quick/SKILL.md +1 -1
  81. package/.codex/skills/maestro-ralph/SKILL.md +693 -578
  82. package/.codex/skills/maestro-roadmap/SKILL.md +518 -468
  83. package/.codex/skills/maestro-ui-design/SKILL.md +109 -12
  84. package/.codex/skills/maestro-verify/SKILL.md +27 -9
  85. package/.codex/skills/manage-codebase-rebuild/SKILL.md +3 -2
  86. package/.codex/skills/manage-codebase-refresh/SKILL.md +1 -1
  87. package/.codex/skills/manage-harvest/SKILL.md +91 -91
  88. package/.codex/skills/manage-issue/SKILL.md +19 -6
  89. package/.codex/skills/manage-issue-discover/SKILL.md +1 -1
  90. package/.codex/skills/manage-knowhow/SKILL.md +95 -95
  91. package/.codex/skills/manage-knowhow-capture/SKILL.md +110 -110
  92. package/.codex/skills/manage-learn/SKILL.md +1 -1
  93. package/.codex/skills/manage-status/SKILL.md +1 -1
  94. package/.codex/skills/manage-wiki/SKILL.md +55 -55
  95. package/.codex/skills/quality-auto-test/SKILL.md +553 -547
  96. package/.codex/skills/quality-debug/SKILL.md +345 -334
  97. package/.codex/skills/quality-refactor/SKILL.md +1 -1
  98. package/.codex/skills/quality-retrospective/SKILL.md +292 -292
  99. package/.codex/skills/quality-review/SKILL.md +365 -364
  100. package/.codex/skills/quality-sync/SKILL.md +1 -1
  101. package/.codex/skills/quality-test/SKILL.md +507 -498
  102. package/.codex/skills/spec-add/SKILL.md +101 -101
  103. package/.codex/skills/spec-load/SKILL.md +77 -77
  104. package/.codex/skills/spec-map/SKILL.md +1 -1
  105. package/.codex/skills/spec-remove/SKILL.md +69 -69
  106. package/.codex/skills/spec-setup/SKILL.md +1 -1
  107. package/.codex/skills/team-coordinate/SKILL.md +2 -1
  108. package/.codex/skills/team-executor/SKILL.md +116 -115
  109. package/.codex/skills/team-lifecycle-v4/SKILL.md +2 -1
  110. package/.codex/skills/team-lifecycle-v4/instructions/agent-instruction.md +14 -6
  111. package/.codex/skills/team-lifecycle-v4/roles/analyst/role.md +16 -4
  112. package/.codex/skills/team-lifecycle-v4/roles/executor/commands/implement.md +7 -1
  113. package/.codex/skills/team-lifecycle-v4/roles/planner/role.md +16 -4
  114. package/.codex/skills/team-lifecycle-v4/roles/writer/role.md +8 -2
  115. package/.codex/skills/team-quality-assurance/SKILL.md +2 -1
  116. package/.codex/skills/team-quality-assurance/roles/scout/role.md +9 -2
  117. package/.codex/skills/team-review/SKILL.md +2 -1
  118. package/.codex/skills/team-review/roles/reviewer/role.md +10 -1
  119. package/.codex/skills/team-review/roles/scanner/role.md +10 -1
  120. package/.codex/skills/team-tech-debt/SKILL.md +144 -143
  121. package/.codex/skills/team-tech-debt/roles/executor/role.md +9 -5
  122. package/.codex/skills/team-tech-debt/roles/scanner/role.md +10 -0
  123. package/.codex/skills/team-tech-debt/roles/validator/role.md +8 -2
  124. package/.codex/skills/team-testing/SKILL.md +2 -1
  125. package/.codex/skills/team-testing/roles/executor/role.md +8 -2
  126. package/.codex/skills/team-testing/roles/generator/role.md +8 -2
  127. package/.codex/skills/wiki-connect/SKILL.md +73 -73
  128. package/.codex/skills/wiki-digest/SKILL.md +87 -87
  129. package/dashboard/dist-server/dashboard/src/server/agents/claude-code-adapter.js +4 -0
  130. package/dashboard/dist-server/dashboard/src/server/agents/claude-code-adapter.js.map +1 -1
  131. package/dashboard/dist-server/dashboard/src/server/agents/codex-cli-adapter.js +6 -0
  132. package/dashboard/dist-server/dashboard/src/server/agents/codex-cli-adapter.js.map +1 -1
  133. package/dashboard/dist-server/shared/agent-types.d.ts +2 -0
  134. package/dashboard/dist-server/src/agents/cli-agent-runner.d.ts +2 -0
  135. package/dashboard/dist-server/src/agents/cli-agent-runner.js +4 -0
  136. package/dashboard/dist-server/src/agents/cli-agent-runner.js.map +1 -1
  137. package/dashboard/dist-server/src/commands/delegate.d.ts +2 -0
  138. package/dashboard/dist-server/src/commands/delegate.js +18 -0
  139. package/dashboard/dist-server/src/commands/delegate.js.map +1 -1
  140. package/dashboard/dist-server/src/config/cli-tools-config.d.ts +6 -0
  141. package/dashboard/dist-server/src/config/cli-tools-config.js +2 -0
  142. package/dashboard/dist-server/src/config/cli-tools-config.js.map +1 -1
  143. package/dist/shared/agent-types.d.ts +2 -0
  144. package/dist/shared/agent-types.d.ts.map +1 -1
  145. package/dist/src/agents/cli-agent-runner.d.ts +2 -0
  146. package/dist/src/agents/cli-agent-runner.d.ts.map +1 -1
  147. package/dist/src/agents/cli-agent-runner.js +4 -0
  148. package/dist/src/agents/cli-agent-runner.js.map +1 -1
  149. package/dist/src/commands/config.d.ts.map +1 -1
  150. package/dist/src/commands/config.js +29 -1
  151. package/dist/src/commands/config.js.map +1 -1
  152. package/dist/src/commands/delegate.d.ts +2 -0
  153. package/dist/src/commands/delegate.d.ts.map +1 -1
  154. package/dist/src/commands/delegate.js +18 -0
  155. package/dist/src/commands/delegate.js.map +1 -1
  156. package/dist/src/commands/launcher.d.ts.map +1 -1
  157. package/dist/src/commands/launcher.js +27 -4
  158. package/dist/src/commands/launcher.js.map +1 -1
  159. package/dist/src/config/cli-tools-config.d.ts +6 -0
  160. package/dist/src/config/cli-tools-config.d.ts.map +1 -1
  161. package/dist/src/config/cli-tools-config.js +2 -0
  162. package/dist/src/config/cli-tools-config.js.map +1 -1
  163. package/dist/src/core/overlay/applier.d.ts.map +1 -1
  164. package/dist/src/core/overlay/applier.js +65 -5
  165. package/dist/src/core/overlay/applier.js.map +1 -1
  166. package/dist/src/core/overlay/loader.d.ts.map +1 -1
  167. package/dist/src/core/overlay/loader.js +9 -4
  168. package/dist/src/core/overlay/loader.js.map +1 -1
  169. package/dist/src/core/overlay/types.d.ts +2 -0
  170. package/dist/src/core/overlay/types.d.ts.map +1 -1
  171. package/dist/src/core/overlay/types.js +2 -0
  172. package/dist/src/core/overlay/types.js.map +1 -1
  173. package/dist/src/tui/tools-ui/ToolsDashboard.d.ts.map +1 -1
  174. package/dist/src/tui/tools-ui/ToolsDashboard.js +1 -1
  175. package/dist/src/tui/tools-ui/ToolsDashboard.js.map +1 -1
  176. package/dist/src/tui/tools-ui/ToolsOverview.d.ts.map +1 -1
  177. package/dist/src/tui/tools-ui/ToolsOverview.js +51 -4
  178. package/dist/src/tui/tools-ui/ToolsOverview.js.map +1 -1
  179. package/package.json +1 -1
  180. package/shared/agent-types.ts +2 -0
  181. package/workflows/analyze.md +24 -2
  182. package/workflows/auto-test.md +12 -0
  183. package/workflows/brainstorm.md +11 -1
  184. package/workflows/debug.md +13 -4
  185. package/workflows/delegate-protocol.codex.md +65 -0
  186. package/workflows/plan.md +14 -4
  187. package/workflows/test.md +10 -0
@@ -1,56 +1,56 @@
1
- ---
2
- name: maestro-quick
3
- description: Execute a quick task with workflow guarantees but skip optional agents
4
- argument-hint: "[description] [--full] [--discuss]"
5
- allowed-tools:
6
- - Read
7
- - Write
8
- - Edit
9
- - Bash
10
- - Glob
11
- - Grep
12
- - Task
13
- - AskUserQuestion
14
- ---
15
- <purpose>
16
- Execute small, ad-hoc tasks with workflow guarantees (atomic commits, state tracking) using a shortened pipeline. Invoked for tasks that are well-understood and do not require full phase-level planning. Produces scratch task directory with plan, execution results, and optional verification. Flags --discuss and --full enable additional pipeline stages.
17
- </purpose>
18
-
19
- <required_reading>
20
- @~/.maestro/workflows/quick.md
21
- </required_reading>
22
-
23
- <context>
24
- $ARGUMENTS
25
-
26
- Parse for:
27
- - `--full` flag -- Enables plan-checking (max 2 iterations) and post-execution verification
28
- - `--discuss` flag -- Decision extraction before planning (gray areas, Locked/Free/Deferred classification)
29
- - Remaining text as task description
30
- </context>
31
-
32
- <execution>
33
- Follow '~/.maestro/workflows/quick.md' completely.
34
-
35
- **Next-step routing on completion:**
36
- - Task done, --full verification passed → /manage-status
37
- - Task done, verification found gaps → /quality-debug {issue}
38
- - Task done, want to sync docs → /quality-sync
39
- - Need a full phase workflow instead → /maestro-plan {phase}
40
- </execution>
41
-
42
- <error_codes>
43
- | Code | Severity | Condition | Recovery |
44
- |------|----------|-----------|----------|
45
- | E001 | error | Task description required (no text provided) | Check arguments format, re-run with correct input |
46
- | E002 | error | Scratch directory creation failed | Check disk space and .workflow/ permissions |
47
- | W001 | warning | Verification found minor gaps | Review gaps and determine if they need fixing |
48
- </error_codes>
49
-
50
- <success_criteria>
51
- - [ ] Scratch task directory created under .workflow/scratch/
52
- - [ ] plan.json written with task definitions
53
- - [ ] All tasks executed with summaries written
54
- - [ ] state.json updated with scratch task entry
55
- - [ ] Commit created with task changes
56
- </success_criteria>
1
+ ---
2
+ name: maestro-quick
3
+ description: Quick task execution, skip optional agents
4
+ argument-hint: "[description] [--full] [--discuss]"
5
+ allowed-tools:
6
+ - Read
7
+ - Write
8
+ - Edit
9
+ - Bash
10
+ - Glob
11
+ - Grep
12
+ - Task
13
+ - AskUserQuestion
14
+ ---
15
+ <purpose>
16
+ Execute small, ad-hoc tasks with workflow guarantees (atomic commits, state tracking) using a shortened pipeline. Invoked for tasks that are well-understood and do not require full phase-level planning. Produces scratch task directory with plan, execution results, and optional verification. Flags --discuss and --full enable additional pipeline stages.
17
+ </purpose>
18
+
19
+ <required_reading>
20
+ @~/.maestro/workflows/quick.md
21
+ </required_reading>
22
+
23
+ <context>
24
+ $ARGUMENTS
25
+
26
+ Parse for:
27
+ - `--full` flag -- Enables plan-checking (max 2 iterations) and post-execution verification
28
+ - `--discuss` flag -- Decision extraction before planning (gray areas, Locked/Free/Deferred classification)
29
+ - Remaining text as task description
30
+ </context>
31
+
32
+ <execution>
33
+ Follow '~/.maestro/workflows/quick.md' completely.
34
+
35
+ **Next-step routing on completion:**
36
+ - Task done, --full verification passed → /manage-status
37
+ - Task done, verification found gaps → /quality-debug {issue}
38
+ - Task done, want to sync docs → /quality-sync
39
+ - Need a full phase workflow instead → /maestro-plan {phase}
40
+ </execution>
41
+
42
+ <error_codes>
43
+ | Code | Severity | Condition | Recovery |
44
+ |------|----------|-----------|----------|
45
+ | E001 | error | Task description required (no text provided) | Check arguments format, re-run with correct input |
46
+ | E002 | error | Scratch directory creation failed | Check disk space and .workflow/ permissions |
47
+ | W001 | warning | Verification found minor gaps | Review gaps and determine if they need fixing |
48
+ </error_codes>
49
+
50
+ <success_criteria>
51
+ - [ ] Scratch task directory created under .workflow/scratch/
52
+ - [ ] plan.json written with task definitions
53
+ - [ ] All tasks executed with summaries written
54
+ - [ ] state.json updated with scratch task entry
55
+ - [ ] Commit created with task changes
56
+ </success_criteria>
@@ -1,6 +1,6 @@
1
1
  ---
2
2
  name: maestro-ralph-execute
3
- description: Single-step executor — find next pending step in session, execute by type (decision/internal/external), hand off to next iteration
3
+ description: Execute next pending step in ralph session
4
4
  argument-hint: "[-y] [session-id]"
5
5
  allowed-tools:
6
6
  - Read
@@ -130,11 +130,6 @@ Display step banner:
130
130
 
131
131
  If decision node: also show `Retry: {retry_count}/{max_retries}` from parsed args.
132
132
 
133
- Context weight hint (non-auto only, after 4+ completed steps):
134
- ```
135
- ⚡ 已执行 {completed_count} 步,上下文较重。可 /maestro-ralph continue 在新上下文恢复。
136
- ```
137
-
138
133
  ## Step 5: Execute by Type
139
134
 
140
135
  ### 5a. decision node (ralph-only)
@@ -189,13 +184,8 @@ HARD RULE: external nodes ALWAYS delegate to `claude` — only Claude Code can e
189
184
 
190
185
  ```
191
186
  Bash({
192
- command: `maestro delegate "Execute: /${next.skill} ${next.args}
193
-
194
- You are a delegate session within a ralph/maestro pipeline.
195
- Your task: invoke the slash command /${next.skill} with args: ${next.args}
196
- Use Skill({ skill: \"${next.skill}\", args: \"${next.args}\" }) to execute it.
197
- Do NOT reimplement the skill logic manually — invoke the actual command.
198
- All artifact outputs follow the skill's own conventions." --to claude --mode write`,
187
+ command: `maestro delegate "Execute: Skill({ skill: \"${next.skill}\", args: \"${next.args}\" })
188
+ Do NOT reimplement — invoke the skill command directly." --to claude --mode write`,
199
189
  run_in_background: true,
200
190
  timeout: 600000
201
191
  })
@@ -245,10 +235,10 @@ If not next.retried:
245
235
  next.retried = true, next.status = "pending", next.error = null
246
236
  Write status.json → Skill("maestro-ralph-execute") // retry once
247
237
  Else:
248
- next.status = "skipped"
238
+ status.status = "paused"
249
239
  Write status.json
250
- Display: [{next.index}/{total}] {next.skill} auto-skipped after retry
251
- → Skill("maestro-ralph-execute") // continue
240
+ Display: [{next.index}/{total}] {next.skill} 重试后仍失败,会话已暂停。请检查后 /maestro-ralph continue 恢复。
241
+ End.
252
242
  ```
253
243
 
254
244
  **Interactive mode (non-auto):**
@@ -302,7 +292,6 @@ Type badges: `◆` decision, `⚡` external, (none) internal.
302
292
  | E002 | error | Session status.json corrupt | Show path, suggest manual check |
303
293
  | E003 | error | CLI delegate failed + user abort | Mark paused, suggest resume |
304
294
  | W001 | warning | Step completed with warnings | Log and continue |
305
- | W002 | warning | Context heavy (step >= 4) | Hint: /maestro-ralph continue |
306
295
  </error_codes>
307
296
 
308
297
  <success_criteria>
@@ -316,7 +305,7 @@ Type badges: `◆` decision, `⚡` external, (none) internal.
316
305
  - [ ] external nodes use maestro delegate --to claude with run_in_background + STOP pattern
317
306
  - [ ] Context propagation: output signals update status.json.context
318
307
  - [ ] status.json updated after every status change (resume-safe)
319
- - [ ] Auto mode: retry once then skip; interactive: AskUserQuestion retry/skip/abort
308
+ - [ ] Auto mode: retry once then pause; interactive: AskUserQuestion retry/skip/abort
320
309
  - [ ] Completion report shows all steps with status icons and type badges
321
310
  - [ ] Self-invocation chain continues until all steps complete or session paused
322
311
  </success_criteria>
@@ -1,6 +1,6 @@
1
1
  ---
2
2
  name: maestro-ralph
3
- description: Closed-loop lifecycle decision engine — read project state, infer position, build adaptive command chain with decision/internal/external nodes
3
+ description: Adaptive lifecycle engine — infer state, build command chain
4
4
  argument-hint: "[-y] \"intent\" | status | continue"
5
5
  allowed-tools:
6
6
  - Read
@@ -112,6 +112,12 @@ Also check: `.workflow/roadmap.md` existence, `.workflow/scratch/` for result fi
112
112
 
113
113
  ### 2.2: Infer lifecycle position
114
114
 
115
+ **Phase 0 — Intent-based override:**
116
+
117
+ If intent matches brainstorm pattern (contains "brainstorm", "头脑风暴", "探索", "ideate", or "设计思路"), position = `brainstorm` regardless of project state.
118
+
119
+ Chain for existing project: `brainstorm → roadmap → analyze → ...` (skip init if `.workflow/state.json` exists).
120
+
115
121
  **Phase 1 — Bootstrap detection:**
116
122
 
117
123
  | Condition | Position | Chain starts at |
@@ -164,7 +170,7 @@ Generate steps from `lifecycle_position` to target (default: `milestone-complete
164
170
 
165
171
  | Stage | Skill command | Type | Decision after |
166
172
  |-------|--------------|------|----------------|
167
- | brainstorm | `maestro-brainstorm "{intent}"` | external | — (0→1 only) |
173
+ | brainstorm | `maestro-brainstorm "{intent}"` | external | — |
168
174
  | init | `maestro-init` | internal | — |
169
175
  | roadmap | `maestro-roadmap "{intent}"` | internal | — |
170
176
  | analyze | `maestro-analyze {phase}` | external | — |
@@ -316,14 +322,32 @@ For quality-gate decisions (post-verify, post-business-test, post-review, post-t
316
322
  | Decision type | Files to include |
317
323
  |---------------|-----------------|
318
324
  | post-verify | `{artifact_dir}/verification.json` |
319
- | post-business-test | `{artifact_dir}/business-test-results.json` |
325
+ | post-business-test | `{artifact_dir}/.tests/auto-test/report.json` |
320
326
  | post-review | `{artifact_dir}/review.json` |
321
327
  | post-test | `{artifact_dir}/uat.md`, `{artifact_dir}/.tests/test-results.json` |
322
328
 
329
+ **Confidence-aware evaluation**:
330
+
331
+ Before delegating, check if artifact contains a confidence section (added by downstream commands):
332
+ - `verification.json` → `confidence.overall` (from maestro-verify)
333
+ - `report.json` → `confidence.overall` (from quality-auto-test)
334
+ - `review.json` → may contain dimension confidence (from quality-review)
335
+ - `uat.md` → confidence summary section (from quality-test)
336
+
337
+ If confidence data found, include in delegate prompt as additional signal:
338
+ ```
339
+ 已有置信度评估: 整体 {overall}%, 最弱维度: {weakest} ({score}%)
340
+ ```
341
+
342
+ **Confidence-based verdict bias**: When artifact confidence is available:
343
+ - confidence < 60% → bias toward "fix" even if surface status looks clean (hidden quality gaps)
344
+ - confidence 60-95% → use delegate verdict as-is
345
+ - confidence > 95% → bias toward "proceed" (strong evidence of quality)
346
+
323
347
  ```
324
348
  Bash({
325
349
  command: `maestro delegate "PURPOSE: 评估 ${meta.decision} 质量门结果,判断是否通过
326
- TASK: 读取结果文件 | 分析通过/失败状态 | 评估问题严重性 | 给出下一步建议
350
+ TASK: 读取结果文件 | 分析通过/失败状态 | 评估问题严重性 | 检查置信度评分 | 给出下一步建议
327
351
  MODE: analysis
328
352
  CONTEXT: @${result_files}
329
353
  EXPECTED: 严格按以下格式输出:
@@ -332,8 +356,10 @@ STATUS: proceed | fix | escalate
332
356
  REASON: 一句话解释
333
357
  GAP_SUMMARY: 具体问题描述(仅 fix/escalate 时填写,用于传递给 quality-debug)
334
358
  CONFIDENCE: high | medium | low
359
+ CONFIDENCE_SCORE: 0-100(从结果文件中读取置信度分数,无则估算)
360
+ WEAKEST_DIMENSION: 最弱维度名称
335
361
  ---END---
336
- CONSTRAINTS: 只评估不修改 | STATUS 三选一 | 如果 retry ${meta.retry_count}/${meta.max_retries} 已达上限且仍有问题则必须 escalate" --role analyze --mode analysis`,
362
+ CONSTRAINTS: 只评估不修改 | STATUS 三选一 | 置信度 < 60% 倾向 fix | 如果 retry ${meta.retry_count}/${meta.max_retries} 已达上限且仍有问题则必须 escalate" --role analyze --mode analysis`,
337
363
  run_in_background: true
338
364
  })
339
365
  STOP — wait for callback.
@@ -346,12 +372,20 @@ STOP — wait for callback.
346
372
  Parse structured response:
347
373
  ```
348
374
  Extract between ---VERDICT--- and ---END---:
349
- verdict.status = "proceed" | "fix" | "escalate"
350
- verdict.reason = string
351
- verdict.gap_summary = string (context for quality-debug)
352
- verdict.confidence = "high" | "medium" | "low"
375
+ verdict.status = "proceed" | "fix" | "escalate"
376
+ verdict.reason = string
377
+ verdict.gap_summary = string (context for quality-debug)
378
+ verdict.confidence = "high" | "medium" | "low"
379
+ verdict.confidence_score = 0-100 (numeric, from artifact or estimated)
380
+ verdict.weakest_dimension = string (weakest confidence dimension)
353
381
 
354
382
  If parse fails → fallback: treat as "fix" with generic gap_summary
383
+
384
+ Confidence-based verdict adjustment (after parse, before apply):
385
+ If verdict.confidence_score < 60 AND verdict.status == "proceed":
386
+ → Override to "fix", reason += " (置信度不足: {score}%,{weakest_dimension} 需加强)"
387
+ If verdict.confidence_score > 95 AND verdict.status == "fix" AND retry_count > 0:
388
+ → Suggest "proceed" override, reason += " (置信度充分: {score}%,建议通过)"
355
389
  ```
356
390
 
357
391
  **Apply verdict:**
@@ -497,9 +531,11 @@ End.
497
531
  - [ ] Full quality pipeline generated: verify → business-test → review → test-gen → test
498
532
  - [ ] Decision nodes inserted after: post-verify, post-business-test, post-review, post-test, post-milestone
499
533
  - [ ] Quality-gate decisions delegated via `maestro delegate --role analyze --mode analysis`
500
- - [ ] Delegate verdict parsed: STATUS / REASON / GAP_SUMMARY / CONFIDENCE
501
- - [ ] `-y` mode: auto-follow delegate verdict without user confirmation
502
- - [ ] Interactive mode: display recommendation + AskUserQuestion with override options
534
+ - [ ] Delegate verdict parsed: STATUS / REASON / GAP_SUMMARY / CONFIDENCE / CONFIDENCE_SCORE / WEAKEST_DIMENSION
535
+ - [ ] Confidence-based verdict adjustment applied (< 60% bias fix, > 95% bias proceed)
536
+ - [ ] Artifact confidence sections read when available (verification.json, report.json, uat.md)
537
+ - [ ] `-y` mode: auto-follow adjusted verdict without user confirmation
538
+ - [ ] Interactive mode: display recommendation with confidence score + AskUserQuestion with override options
503
539
  - [ ] Delegate failure fallback: treat as "fix" verdict
504
540
  - [ ] gap_summary from delegate passed to quality-debug as context
505
541
  - [ ] Fix-loop templates applied per decision type with retry_count increment
@@ -1,6 +1,6 @@
1
1
  ---
2
2
  name: maestro-roadmap
3
- description: Roadmap generation with dual mode — light (requirement→roadmap) or full (requirement→spec package→roadmap)
3
+ description: Generate roadmap from requirements (light or full mode)
4
4
  argument-hint: "<requirement> [--mode light|full] [-y] [-c] [-m progressive|direct|auto] [--from-brainstorm SESSION-ID] [--revise [instructions]] [--review]"
5
5
  allowed-tools:
6
6
  - Read
@@ -1,93 +1,93 @@
1
- ---
2
- name: maestro-ui-design
3
- description: Generate UI design prototypes with multiple styles via ui-ux-pro-max, user selects winner, solidify as code reference
4
- argument-hint: "<phase|topic> [--styles N] [--stack <stack>] [--targets <pages>] [--layouts N] [--refine] [--persist] [--full] [-y]"
5
- allowed-tools:
6
- - Read
7
- - Write
8
- - Edit
9
- - Bash
10
- - Glob
11
- - Grep
12
- - Agent
13
- - AskUserQuestion
14
- ---
15
- <purpose>
16
- Generate UI design prototypes for a phase or topic. Two workflow paths, auto-selected by skill availability:
17
-
18
- 1. **Primary (ui-style.md):** Delegates design to ui-ux-pro-max skill. Generates multiple style variants via `--design-system`, user selects, solidifies as code reference. Lightweight and fast.
19
- 2. **Fallback (ui-design.md):** Self-contained 4-layer pipeline (style → animation → layout → assembly) with 6D attribute space, OKLCH tokens, layout templates, and full prototype matrix. Used when ui-ux-pro-max is unavailable or `--full` is requested.
20
-
21
- Both paths produce the same output contract: MASTER.md + design-tokens.json + animation-tokens.json + selection.json for downstream plan/execute consumption.
22
-
23
- Position in pipeline: analyze -> **ui-design** -> plan -> execute -> verify
24
- </purpose>
25
-
26
- <deferred_reading>
27
- - [ui-style.md](~/.maestro/workflows/ui-style.md) — read when SKILL_PATH found (primary path)
28
- - [ui-design.md](~/.maestro/workflows/ui-design.md) — read when SKILL_PATH empty or --full (fallback path)
29
- - [index.json](~/.maestro/templates/index.json) — read when updating phase metadata
30
- - [scratch-index.json](~/.maestro/templates/scratch-index.json) — read when operating in scratch mode
31
- </deferred_reading>
32
-
33
- <context>
34
- $ARGUMENTS — phase number for phase mode, topic text for scratch mode, with optional flags.
35
-
36
- Flags, workflow routing, scope modes, and output artifacts defined in the routed workflow (ui-style.md or ui-design.md).
37
-
38
- **Phase mode** (number): resolves phase directory, reads context.md/brainstorm for requirements.
39
- **Scratch mode** (text): creates `.workflow/scratch/{YYYYMMDD}-ui-design-{slug}/` for standalone exploration.
40
- </context>
41
-
42
- <execution>
43
- ## Workflow Routing
44
-
45
- Detect ui-ux-pro-max skill availability and route to the appropriate workflow:
46
-
47
- - **`--full` flag present** → Follow '~/.maestro/workflows/ui-design.md' completely (forced full pipeline)
48
- - **ui-ux-pro-max found** → Follow '~/.maestro/workflows/ui-style.md' completely (lightweight delegation)
49
- - **ui-ux-pro-max not found** → Follow '~/.maestro/workflows/ui-design.md' completely (self-contained fallback)
50
-
51
- Skill detection logic, report format, and complete pipeline steps defined in the routed workflow file.
52
-
53
- **Next-step routing on completion:**
54
- - Plan with design reference → /maestro-plan {phase}
55
- - Refine selected design → /maestro-ui-design {phase} --refine
56
- - Analyze before planning → /maestro-analyze {phase}
57
- </execution>
58
-
59
- <error_codes>
60
- | Code | Severity | Description | Stage |
61
- |------|----------|-------------|-------|
62
- | E001 | error | Phase or topic argument required | parse_input |
63
- | E002 | error | Phase directory not found | parse_input |
64
- | E003 | error | Python not available (both paths need Python for ui-ux-pro-max or agent fallback) | setup |
65
- | E004 | error | --refine requires existing design-ref/ | parse_input |
66
- | W001 | warning | Design system generation returned partial results | generate |
67
- | W002 | warning | Prototype rendering failed for one variant | render |
68
- | W003 | warning | No context.md found, using phase goal only | context |
69
- | W004 | warning | ui-ux-pro-max not found, falling back to full pipeline | routing |
70
- </error_codes>
71
-
72
- <success_criteria>
73
- **Both paths (common):**
74
- - [ ] Requirements extracted from phase context (context.md, brainstorm, spec, or user input)
75
- - [ ] N style variants generated with contrasting design directions
76
- - [ ] User selected preferred variant (or auto-selected in -y mode)
77
- - [ ] MASTER.md written with complete design system specification
78
- - [ ] design-tokens.json written with production-ready tokens (OKLCH colors, component_styles)
79
- - [ ] animation-tokens.json written (duration, easing, transitions, keyframes)
80
- - [ ] selection.json recorded with choice metadata
81
- - [ ] index.json updated with design_ref status
82
-
83
- **ui-style.md path (primary):**
84
- - [ ] ui-ux-pro-max --design-system called with product/industry/style keywords
85
- - [ ] Tokens extracted from ui-ux-pro-max output into structured JSON
86
-
87
- **ui-design.md path (--full or fallback):**
88
- - [ ] 6D attribute space used for maximum contrast between variants
89
- - [ ] Layout templates generated per target (dom_structure + css_layout_rules)
90
- - [ ] HTML prototypes assembled: styles x layouts x targets
91
- - [ ] compare.html generated as interactive matrix viewer
92
- </success_criteria>
93
- </output>
1
+ ---
2
+ name: maestro-ui-design
3
+ description: Generate UI design prototypes, select and solidify as code
4
+ argument-hint: "<phase|topic> [--styles N] [--stack <stack>] [--targets <pages>] [--layouts N] [--refine] [--persist] [--full] [-y]"
5
+ allowed-tools:
6
+ - Read
7
+ - Write
8
+ - Edit
9
+ - Bash
10
+ - Glob
11
+ - Grep
12
+ - Agent
13
+ - AskUserQuestion
14
+ ---
15
+ <purpose>
16
+ Generate UI design prototypes for a phase or topic. Two workflow paths, auto-selected by skill availability:
17
+
18
+ 1. **Primary (ui-style.md):** Delegates design to ui-ux-pro-max skill. Generates multiple style variants via `--design-system`, user selects, solidifies as code reference. Lightweight and fast.
19
+ 2. **Fallback (ui-design.md):** Self-contained 4-layer pipeline (style → animation → layout → assembly) with 6D attribute space, OKLCH tokens, layout templates, and full prototype matrix. Used when ui-ux-pro-max is unavailable or `--full` is requested.
20
+
21
+ Both paths produce the same output contract: MASTER.md + design-tokens.json + animation-tokens.json + selection.json for downstream plan/execute consumption.
22
+
23
+ Position in pipeline: analyze -> **ui-design** -> plan -> execute -> verify
24
+ </purpose>
25
+
26
+ <deferred_reading>
27
+ - [ui-style.md](~/.maestro/workflows/ui-style.md) — read when SKILL_PATH found (primary path)
28
+ - [ui-design.md](~/.maestro/workflows/ui-design.md) — read when SKILL_PATH empty or --full (fallback path)
29
+ - [index.json](~/.maestro/templates/index.json) — read when updating phase metadata
30
+ - [scratch-index.json](~/.maestro/templates/scratch-index.json) — read when operating in scratch mode
31
+ </deferred_reading>
32
+
33
+ <context>
34
+ $ARGUMENTS — phase number for phase mode, topic text for scratch mode, with optional flags.
35
+
36
+ Flags, workflow routing, scope modes, and output artifacts defined in the routed workflow (ui-style.md or ui-design.md).
37
+
38
+ **Phase mode** (number): resolves phase directory, reads context.md/brainstorm for requirements.
39
+ **Scratch mode** (text): creates `.workflow/scratch/{YYYYMMDD}-ui-design-{slug}/` for standalone exploration.
40
+ </context>
41
+
42
+ <execution>
43
+ ## Workflow Routing
44
+
45
+ Detect ui-ux-pro-max skill availability and route to the appropriate workflow:
46
+
47
+ - **`--full` flag present** → Follow '~/.maestro/workflows/ui-design.md' completely (forced full pipeline)
48
+ - **ui-ux-pro-max found** → Follow '~/.maestro/workflows/ui-style.md' completely (lightweight delegation)
49
+ - **ui-ux-pro-max not found** → Follow '~/.maestro/workflows/ui-design.md' completely (self-contained fallback)
50
+
51
+ Skill detection logic, report format, and complete pipeline steps defined in the routed workflow file.
52
+
53
+ **Next-step routing on completion:**
54
+ - Plan with design reference → /maestro-plan {phase}
55
+ - Refine selected design → /maestro-ui-design {phase} --refine
56
+ - Analyze before planning → /maestro-analyze {phase}
57
+ </execution>
58
+
59
+ <error_codes>
60
+ | Code | Severity | Description | Stage |
61
+ |------|----------|-------------|-------|
62
+ | E001 | error | Phase or topic argument required | parse_input |
63
+ | E002 | error | Phase directory not found | parse_input |
64
+ | E003 | error | Python not available (both paths need Python for ui-ux-pro-max or agent fallback) | setup |
65
+ | E004 | error | --refine requires existing design-ref/ | parse_input |
66
+ | W001 | warning | Design system generation returned partial results | generate |
67
+ | W002 | warning | Prototype rendering failed for one variant | render |
68
+ | W003 | warning | No context.md found, using phase goal only | context |
69
+ | W004 | warning | ui-ux-pro-max not found, falling back to full pipeline | routing |
70
+ </error_codes>
71
+
72
+ <success_criteria>
73
+ **Both paths (common):**
74
+ - [ ] Requirements extracted from phase context (context.md, brainstorm, spec, or user input)
75
+ - [ ] N style variants generated with contrasting design directions
76
+ - [ ] User selected preferred variant (or auto-selected in -y mode)
77
+ - [ ] MASTER.md written with complete design system specification
78
+ - [ ] design-tokens.json written with production-ready tokens (OKLCH colors, component_styles)
79
+ - [ ] animation-tokens.json written (duration, easing, transitions, keyframes)
80
+ - [ ] selection.json recorded with choice metadata
81
+ - [ ] index.json updated with design_ref status
82
+
83
+ **ui-style.md path (primary):**
84
+ - [ ] ui-ux-pro-max --design-system called with product/industry/style keywords
85
+ - [ ] Tokens extracted from ui-ux-pro-max output into structured JSON
86
+
87
+ **ui-design.md path (--full or fallback):**
88
+ - [ ] 6D attribute space used for maximum contrast between variants
89
+ - [ ] Layout templates generated per target (dom_structure + css_layout_rules)
90
+ - [ ] HTML prototypes assembled: styles x layouts x targets
91
+ - [ ] compare.html generated as interactive matrix viewer
92
+ </success_criteria>
93
+ </output>