agy-superpowers 5.0.5
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/LICENSE +21 -0
- package/README.md +180 -0
- package/bin/init.js +58 -0
- package/package.json +36 -0
- package/template/agent/.shared/update-superpowers.sh +171 -0
- package/template/agent/agents/code-reviewer.md +48 -0
- package/template/agent/rules/superpowers.md +52 -0
- package/template/agent/skills/brainstorming/SKILL.md +164 -0
- package/template/agent/skills/brainstorming/scripts/frame-template.html +214 -0
- package/template/agent/skills/brainstorming/scripts/helper.js +88 -0
- package/template/agent/skills/brainstorming/scripts/server.cjs +338 -0
- package/template/agent/skills/brainstorming/scripts/start-server.sh +153 -0
- package/template/agent/skills/brainstorming/scripts/stop-server.sh +55 -0
- package/template/agent/skills/brainstorming/spec-document-reviewer-prompt.md +49 -0
- package/template/agent/skills/brainstorming/visual-companion.md +286 -0
- package/template/agent/skills/dispatching-parallel-agents/SKILL.md +182 -0
- package/template/agent/skills/executing-plans/SKILL.md +70 -0
- package/template/agent/skills/finishing-a-development-branch/SKILL.md +200 -0
- package/template/agent/skills/receiving-code-review/SKILL.md +213 -0
- package/template/agent/skills/requesting-code-review/SKILL.md +105 -0
- package/template/agent/skills/requesting-code-review/code-reviewer.md +146 -0
- package/template/agent/skills/subagent-driven-development/SKILL.md +277 -0
- package/template/agent/skills/subagent-driven-development/code-quality-reviewer-prompt.md +26 -0
- package/template/agent/skills/subagent-driven-development/implementer-prompt.md +113 -0
- package/template/agent/skills/subagent-driven-development/spec-reviewer-prompt.md +61 -0
- package/template/agent/skills/systematic-debugging/CREATION-LOG.md +119 -0
- package/template/agent/skills/systematic-debugging/SKILL.md +296 -0
- package/template/agent/skills/systematic-debugging/condition-based-waiting-example.ts +158 -0
- package/template/agent/skills/systematic-debugging/condition-based-waiting.md +115 -0
- package/template/agent/skills/systematic-debugging/defense-in-depth.md +122 -0
- package/template/agent/skills/systematic-debugging/find-polluter.sh +63 -0
- package/template/agent/skills/systematic-debugging/root-cause-tracing.md +169 -0
- package/template/agent/skills/systematic-debugging/test-academic.md +14 -0
- package/template/agent/skills/systematic-debugging/test-pressure-1.md +58 -0
- package/template/agent/skills/systematic-debugging/test-pressure-2.md +68 -0
- package/template/agent/skills/systematic-debugging/test-pressure-3.md +69 -0
- package/template/agent/skills/test-driven-development/SKILL.md +371 -0
- package/template/agent/skills/test-driven-development/testing-anti-patterns.md +299 -0
- package/template/agent/skills/using-git-worktrees/SKILL.md +218 -0
- package/template/agent/skills/using-superpowers/SKILL.md +115 -0
- package/template/agent/skills/using-superpowers/references/codex-tools.md +25 -0
- package/template/agent/skills/using-superpowers/references/gemini-tools.md +33 -0
- package/template/agent/skills/verification-before-completion/SKILL.md +139 -0
- package/template/agent/skills/writing-plans/SKILL.md +145 -0
- package/template/agent/skills/writing-plans/plan-document-reviewer-prompt.md +49 -0
- package/template/agent/skills/writing-skills/SKILL.md +655 -0
- package/template/agent/skills/writing-skills/anthropic-best-practices.md +1150 -0
- package/template/agent/skills/writing-skills/examples/CLAUDE_MD_TESTING.md +189 -0
- package/template/agent/skills/writing-skills/graphviz-conventions.dot +172 -0
- package/template/agent/skills/writing-skills/persuasion-principles.md +187 -0
- package/template/agent/skills/writing-skills/render-graphs.js +168 -0
- package/template/agent/skills/writing-skills/testing-skills-with-subagents.md +384 -0
- package/template/agent/superpowers-version.json +5 -0
- package/template/agent/workflows/brainstorm.md +20 -0
- package/template/agent/workflows/code-review.md +45 -0
- package/template/agent/workflows/debug.md +17 -0
- package/template/agent/workflows/execute-plan.md +25 -0
- package/template/agent/workflows/update-superpowers.md +47 -0
- package/template/agent/workflows/write-plan.md +17 -0
|
@@ -0,0 +1,286 @@
|
|
|
1
|
+
# Visual Companion Guide
|
|
2
|
+
|
|
3
|
+
Browser-based visual brainstorming companion for showing mockups, diagrams, and options.
|
|
4
|
+
|
|
5
|
+
## When to Use
|
|
6
|
+
|
|
7
|
+
Decide per-question, not per-session. The test: **would the user understand this better by seeing it than reading it?**
|
|
8
|
+
|
|
9
|
+
**Use the browser** when the content itself is visual:
|
|
10
|
+
|
|
11
|
+
- **UI mockups** — wireframes, layouts, navigation structures, component designs
|
|
12
|
+
- **Architecture diagrams** — system components, data flow, relationship maps
|
|
13
|
+
- **Side-by-side visual comparisons** — comparing two layouts, two color schemes, two design directions
|
|
14
|
+
- **Design polish** — when the question is about look and feel, spacing, visual hierarchy
|
|
15
|
+
- **Spatial relationships** — state machines, flowcharts, entity relationships rendered as diagrams
|
|
16
|
+
|
|
17
|
+
**Use the terminal** when the content is text or tabular:
|
|
18
|
+
|
|
19
|
+
- **Requirements and scope questions** — "what does X mean?", "which features are in scope?"
|
|
20
|
+
- **Conceptual A/B/C choices** — picking between approaches described in words
|
|
21
|
+
- **Tradeoff lists** — pros/cons, comparison tables
|
|
22
|
+
- **Technical decisions** — API design, data modeling, architectural approach selection
|
|
23
|
+
- **Clarifying questions** — anything where the answer is words, not a visual preference
|
|
24
|
+
|
|
25
|
+
A question *about* a UI topic is not automatically a visual question. "What kind of wizard do you want?" is conceptual — use the terminal. "Which of these wizard layouts feels right?" is visual — use the browser.
|
|
26
|
+
|
|
27
|
+
## How It Works
|
|
28
|
+
|
|
29
|
+
The server watches a directory for HTML files and serves the newest one to the browser. You write HTML content, the user sees it in their browser and can click to select options. Selections are recorded to a `.events` file that you read on your next turn.
|
|
30
|
+
|
|
31
|
+
**Content fragments vs full documents:** If your HTML file starts with `<!DOCTYPE` or `<html`, the server serves it as-is (just injects the helper script). Otherwise, the server automatically wraps your content in the frame template — adding the header, CSS theme, selection indicator, and all interactive infrastructure. **Write content fragments by default.** Only write full documents when you need complete control over the page.
|
|
32
|
+
|
|
33
|
+
## Starting a Session
|
|
34
|
+
|
|
35
|
+
```bash
|
|
36
|
+
# Start server with persistence (mockups saved to project)
|
|
37
|
+
scripts/start-server.sh --project-dir /path/to/project
|
|
38
|
+
|
|
39
|
+
# Returns: {"type":"server-started","port":52341,"url":"http://localhost:52341",
|
|
40
|
+
# "screen_dir":"/path/to/project/.superpowers/brainstorm/12345-1706000000"}
|
|
41
|
+
```
|
|
42
|
+
|
|
43
|
+
Save `screen_dir` from the response. Tell user to open the URL.
|
|
44
|
+
|
|
45
|
+
**Finding connection info:** The server writes its startup JSON to `$SCREEN_DIR/.server-info`. If you launched the server in the background and didn't capture stdout, read that file to get the URL and port. When using `--project-dir`, check `<project>/.superpowers/brainstorm/` for the session directory.
|
|
46
|
+
|
|
47
|
+
**Note:** Pass the project root as `--project-dir` so mockups persist in `.superpowers/brainstorm/` and survive server restarts. Without it, files go to `/tmp` and get cleaned up. Remind the user to add `.superpowers/` to `.gitignore` if it's not already there.
|
|
48
|
+
|
|
49
|
+
**Launching the server by platform:**
|
|
50
|
+
|
|
51
|
+
**Claude Code (macOS / Linux):**
|
|
52
|
+
```bash
|
|
53
|
+
# Default mode works — the script backgrounds the server itself
|
|
54
|
+
scripts/start-server.sh --project-dir /path/to/project
|
|
55
|
+
```
|
|
56
|
+
|
|
57
|
+
**Claude Code (Windows):**
|
|
58
|
+
```bash
|
|
59
|
+
# Windows auto-detects and uses foreground mode, which blocks the tool call.
|
|
60
|
+
# Use run_in_background: true on the Bash tool call so the server survives
|
|
61
|
+
# across conversation turns.
|
|
62
|
+
scripts/start-server.sh --project-dir /path/to/project
|
|
63
|
+
```
|
|
64
|
+
When calling this via the Bash tool, set `run_in_background: true`. Then read `$SCREEN_DIR/.server-info` on the next turn to get the URL and port.
|
|
65
|
+
|
|
66
|
+
**Codex:**
|
|
67
|
+
```bash
|
|
68
|
+
# Codex reaps background processes. The script auto-detects CODEX_CI and
|
|
69
|
+
# switches to foreground mode. Run it normally — no extra flags needed.
|
|
70
|
+
scripts/start-server.sh --project-dir /path/to/project
|
|
71
|
+
```
|
|
72
|
+
|
|
73
|
+
**Gemini CLI:**
|
|
74
|
+
```bash
|
|
75
|
+
# Use --foreground and set is_background: true on your shell tool call
|
|
76
|
+
# so the process survives across turns
|
|
77
|
+
scripts/start-server.sh --project-dir /path/to/project --foreground
|
|
78
|
+
```
|
|
79
|
+
|
|
80
|
+
**Other environments:** The server must keep running in the background across conversation turns. If your environment reaps detached processes, use `--foreground` and launch the command with your platform's background execution mechanism.
|
|
81
|
+
|
|
82
|
+
If the URL is unreachable from your browser (common in remote/containerized setups), bind a non-loopback host:
|
|
83
|
+
|
|
84
|
+
```bash
|
|
85
|
+
scripts/start-server.sh \
|
|
86
|
+
--project-dir /path/to/project \
|
|
87
|
+
--host 0.0.0.0 \
|
|
88
|
+
--url-host localhost
|
|
89
|
+
```
|
|
90
|
+
|
|
91
|
+
Use `--url-host` to control what hostname is printed in the returned URL JSON.
|
|
92
|
+
|
|
93
|
+
## The Loop
|
|
94
|
+
|
|
95
|
+
1. **Check server is alive**, then **write HTML** to a new file in `screen_dir`:
|
|
96
|
+
- Before each write, check that `$SCREEN_DIR/.server-info` exists. If it doesn't (or `.server-stopped` exists), the server has shut down — restart it with `start-server.sh` before continuing. The server auto-exits after 30 minutes of inactivity.
|
|
97
|
+
- Use semantic filenames: `platform.html`, `visual-style.html`, `layout.html`
|
|
98
|
+
- **Never reuse filenames** — each screen gets a fresh file
|
|
99
|
+
- Use Write tool — **never use cat/heredoc** (dumps noise into terminal)
|
|
100
|
+
- Server automatically serves the newest file
|
|
101
|
+
|
|
102
|
+
2. **Tell user what to expect and end your turn:**
|
|
103
|
+
- Remind them of the URL (every step, not just first)
|
|
104
|
+
- Give a brief text summary of what's on screen (e.g., "Showing 3 layout options for the homepage")
|
|
105
|
+
- Ask them to respond in the terminal: "Take a look and let me know what you think. Click to select an option if you'd like."
|
|
106
|
+
|
|
107
|
+
3. **On your next turn** — after the user responds in the terminal:
|
|
108
|
+
- Read `$SCREEN_DIR/.events` if it exists — this contains the user's browser interactions (clicks, selections) as JSON lines
|
|
109
|
+
- Merge with the user's terminal text to get the full picture
|
|
110
|
+
- The terminal message is the primary feedback; `.events` provides structured interaction data
|
|
111
|
+
|
|
112
|
+
4. **Iterate or advance** — if feedback changes current screen, write a new file (e.g., `layout-v2.html`). Only move to the next question when the current step is validated.
|
|
113
|
+
|
|
114
|
+
5. **Unload when returning to terminal** — when the next step doesn't need the browser (e.g., a clarifying question, a tradeoff discussion), push a waiting screen to clear the stale content:
|
|
115
|
+
|
|
116
|
+
```html
|
|
117
|
+
<!-- filename: waiting.html (or waiting-2.html, etc.) -->
|
|
118
|
+
<div style="display:flex;align-items:center;justify-content:center;min-height:60vh">
|
|
119
|
+
<p class="subtitle">Continuing in terminal...</p>
|
|
120
|
+
</div>
|
|
121
|
+
```
|
|
122
|
+
|
|
123
|
+
This prevents the user from staring at a resolved choice while the conversation has moved on. When the next visual question comes up, push a new content file as usual.
|
|
124
|
+
|
|
125
|
+
6. Repeat until done.
|
|
126
|
+
|
|
127
|
+
## Writing Content Fragments
|
|
128
|
+
|
|
129
|
+
Write just the content that goes inside the page. The server wraps it in the frame template automatically (header, theme CSS, selection indicator, and all interactive infrastructure).
|
|
130
|
+
|
|
131
|
+
**Minimal example:**
|
|
132
|
+
|
|
133
|
+
```html
|
|
134
|
+
<h2>Which layout works better?</h2>
|
|
135
|
+
<p class="subtitle">Consider readability and visual hierarchy</p>
|
|
136
|
+
|
|
137
|
+
<div class="options">
|
|
138
|
+
<div class="option" data-choice="a" onclick="toggleSelect(this)">
|
|
139
|
+
<div class="letter">A</div>
|
|
140
|
+
<div class="content">
|
|
141
|
+
<h3>Single Column</h3>
|
|
142
|
+
<p>Clean, focused reading experience</p>
|
|
143
|
+
</div>
|
|
144
|
+
</div>
|
|
145
|
+
<div class="option" data-choice="b" onclick="toggleSelect(this)">
|
|
146
|
+
<div class="letter">B</div>
|
|
147
|
+
<div class="content">
|
|
148
|
+
<h3>Two Column</h3>
|
|
149
|
+
<p>Sidebar navigation with main content</p>
|
|
150
|
+
</div>
|
|
151
|
+
</div>
|
|
152
|
+
</div>
|
|
153
|
+
```
|
|
154
|
+
|
|
155
|
+
That's it. No `<html>`, no CSS, no `<script>` tags needed. The server provides all of that.
|
|
156
|
+
|
|
157
|
+
## CSS Classes Available
|
|
158
|
+
|
|
159
|
+
The frame template provides these CSS classes for your content:
|
|
160
|
+
|
|
161
|
+
### Options (A/B/C choices)
|
|
162
|
+
|
|
163
|
+
```html
|
|
164
|
+
<div class="options">
|
|
165
|
+
<div class="option" data-choice="a" onclick="toggleSelect(this)">
|
|
166
|
+
<div class="letter">A</div>
|
|
167
|
+
<div class="content">
|
|
168
|
+
<h3>Title</h3>
|
|
169
|
+
<p>Description</p>
|
|
170
|
+
</div>
|
|
171
|
+
</div>
|
|
172
|
+
</div>
|
|
173
|
+
```
|
|
174
|
+
|
|
175
|
+
**Multi-select:** Add `data-multiselect` to the container to let users select multiple options. Each click toggles the item. The indicator bar shows the count.
|
|
176
|
+
|
|
177
|
+
```html
|
|
178
|
+
<div class="options" data-multiselect>
|
|
179
|
+
<!-- same option markup — users can select/deselect multiple -->
|
|
180
|
+
</div>
|
|
181
|
+
```
|
|
182
|
+
|
|
183
|
+
### Cards (visual designs)
|
|
184
|
+
|
|
185
|
+
```html
|
|
186
|
+
<div class="cards">
|
|
187
|
+
<div class="card" data-choice="design1" onclick="toggleSelect(this)">
|
|
188
|
+
<div class="card-image"><!-- mockup content --></div>
|
|
189
|
+
<div class="card-body">
|
|
190
|
+
<h3>Name</h3>
|
|
191
|
+
<p>Description</p>
|
|
192
|
+
</div>
|
|
193
|
+
</div>
|
|
194
|
+
</div>
|
|
195
|
+
```
|
|
196
|
+
|
|
197
|
+
### Mockup container
|
|
198
|
+
|
|
199
|
+
```html
|
|
200
|
+
<div class="mockup">
|
|
201
|
+
<div class="mockup-header">Preview: Dashboard Layout</div>
|
|
202
|
+
<div class="mockup-body"><!-- your mockup HTML --></div>
|
|
203
|
+
</div>
|
|
204
|
+
```
|
|
205
|
+
|
|
206
|
+
### Split view (side-by-side)
|
|
207
|
+
|
|
208
|
+
```html
|
|
209
|
+
<div class="split">
|
|
210
|
+
<div class="mockup"><!-- left --></div>
|
|
211
|
+
<div class="mockup"><!-- right --></div>
|
|
212
|
+
</div>
|
|
213
|
+
```
|
|
214
|
+
|
|
215
|
+
### Pros/Cons
|
|
216
|
+
|
|
217
|
+
```html
|
|
218
|
+
<div class="pros-cons">
|
|
219
|
+
<div class="pros"><h4>Pros</h4><ul><li>Benefit</li></ul></div>
|
|
220
|
+
<div class="cons"><h4>Cons</h4><ul><li>Drawback</li></ul></div>
|
|
221
|
+
</div>
|
|
222
|
+
```
|
|
223
|
+
|
|
224
|
+
### Mock elements (wireframe building blocks)
|
|
225
|
+
|
|
226
|
+
```html
|
|
227
|
+
<div class="mock-nav">Logo | Home | About | Contact</div>
|
|
228
|
+
<div style="display: flex;">
|
|
229
|
+
<div class="mock-sidebar">Navigation</div>
|
|
230
|
+
<div class="mock-content">Main content area</div>
|
|
231
|
+
</div>
|
|
232
|
+
<button class="mock-button">Action Button</button>
|
|
233
|
+
<input class="mock-input" placeholder="Input field">
|
|
234
|
+
<div class="placeholder">Placeholder area</div>
|
|
235
|
+
```
|
|
236
|
+
|
|
237
|
+
### Typography and sections
|
|
238
|
+
|
|
239
|
+
- `h2` — page title
|
|
240
|
+
- `h3` — section heading
|
|
241
|
+
- `.subtitle` — secondary text below title
|
|
242
|
+
- `.section` — content block with bottom margin
|
|
243
|
+
- `.label` — small uppercase label text
|
|
244
|
+
|
|
245
|
+
## Browser Events Format
|
|
246
|
+
|
|
247
|
+
When the user clicks options in the browser, their interactions are recorded to `$SCREEN_DIR/.events` (one JSON object per line). The file is cleared automatically when you push a new screen.
|
|
248
|
+
|
|
249
|
+
```jsonl
|
|
250
|
+
{"type":"click","choice":"a","text":"Option A - Simple Layout","timestamp":1706000101}
|
|
251
|
+
{"type":"click","choice":"c","text":"Option C - Complex Grid","timestamp":1706000108}
|
|
252
|
+
{"type":"click","choice":"b","text":"Option B - Hybrid","timestamp":1706000115}
|
|
253
|
+
```
|
|
254
|
+
|
|
255
|
+
The full event stream shows the user's exploration path — they may click multiple options before settling. The last `choice` event is typically the final selection, but the pattern of clicks can reveal hesitation or preferences worth asking about.
|
|
256
|
+
|
|
257
|
+
If `.events` doesn't exist, the user didn't interact with the browser — use only their terminal text.
|
|
258
|
+
|
|
259
|
+
## Design Tips
|
|
260
|
+
|
|
261
|
+
- **Scale fidelity to the question** — wireframes for layout, polish for polish questions
|
|
262
|
+
- **Explain the question on each page** — "Which layout feels more professional?" not just "Pick one"
|
|
263
|
+
- **Iterate before advancing** — if feedback changes current screen, write a new version
|
|
264
|
+
- **2-4 options max** per screen
|
|
265
|
+
- **Use real content when it matters** — for a photography portfolio, use actual images (Unsplash). Placeholder content obscures design issues.
|
|
266
|
+
- **Keep mockups simple** — focus on layout and structure, not pixel-perfect design
|
|
267
|
+
|
|
268
|
+
## File Naming
|
|
269
|
+
|
|
270
|
+
- Use semantic names: `platform.html`, `visual-style.html`, `layout.html`
|
|
271
|
+
- Never reuse filenames — each screen must be a new file
|
|
272
|
+
- For iterations: append version suffix like `layout-v2.html`, `layout-v3.html`
|
|
273
|
+
- Server serves newest file by modification time
|
|
274
|
+
|
|
275
|
+
## Cleaning Up
|
|
276
|
+
|
|
277
|
+
```bash
|
|
278
|
+
scripts/stop-server.sh $SCREEN_DIR
|
|
279
|
+
```
|
|
280
|
+
|
|
281
|
+
If the session used `--project-dir`, mockup files persist in `.superpowers/brainstorm/` for later reference. Only `/tmp` sessions get deleted on stop.
|
|
282
|
+
|
|
283
|
+
## Reference
|
|
284
|
+
|
|
285
|
+
- Frame template (CSS reference): `scripts/frame-template.html`
|
|
286
|
+
- Helper script (client-side): `scripts/helper.js`
|
|
@@ -0,0 +1,182 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: dispatching-parallel-agents
|
|
3
|
+
description: Use when facing 2+ independent tasks that can be worked on without shared state or sequential dependencies
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# Dispatching Parallel Agents
|
|
7
|
+
|
|
8
|
+
## Overview
|
|
9
|
+
|
|
10
|
+
You delegate tasks to specialized agents with isolated context. By precisely crafting their instructions and context, you ensure they stay focused and succeed at their task. They should never inherit your session's context or history — you construct exactly what they need. This also preserves your own context for coordination work.
|
|
11
|
+
|
|
12
|
+
When you have multiple unrelated failures (different test files, different subsystems, different bugs), investigating them sequentially wastes time. Each investigation is independent and can happen in parallel.
|
|
13
|
+
|
|
14
|
+
**Core principle:** Dispatch one agent per independent problem domain. Let them work concurrently.
|
|
15
|
+
|
|
16
|
+
## When to Use
|
|
17
|
+
|
|
18
|
+
```dot
|
|
19
|
+
digraph when_to_use {
|
|
20
|
+
"Multiple failures?" [shape=diamond];
|
|
21
|
+
"Are they independent?" [shape=diamond];
|
|
22
|
+
"Single agent investigates all" [shape=box];
|
|
23
|
+
"One agent per problem domain" [shape=box];
|
|
24
|
+
"Can they work in parallel?" [shape=diamond];
|
|
25
|
+
"Sequential agents" [shape=box];
|
|
26
|
+
"Parallel dispatch" [shape=box];
|
|
27
|
+
|
|
28
|
+
"Multiple failures?" -> "Are they independent?" [label="yes"];
|
|
29
|
+
"Are they independent?" -> "Single agent investigates all" [label="no - related"];
|
|
30
|
+
"Are they independent?" -> "Can they work in parallel?" [label="yes"];
|
|
31
|
+
"Can they work in parallel?" -> "Parallel dispatch" [label="yes"];
|
|
32
|
+
"Can they work in parallel?" -> "Sequential agents" [label="no - shared state"];
|
|
33
|
+
}
|
|
34
|
+
```
|
|
35
|
+
|
|
36
|
+
**Use when:**
|
|
37
|
+
- 3+ test files failing with different root causes
|
|
38
|
+
- Multiple subsystems broken independently
|
|
39
|
+
- Each problem can be understood without context from others
|
|
40
|
+
- No shared state between investigations
|
|
41
|
+
|
|
42
|
+
**Don't use when:**
|
|
43
|
+
- Failures are related (fix one might fix others)
|
|
44
|
+
- Need to understand full system state
|
|
45
|
+
- Agents would interfere with each other
|
|
46
|
+
|
|
47
|
+
## The Pattern
|
|
48
|
+
|
|
49
|
+
### 1. Identify Independent Domains
|
|
50
|
+
|
|
51
|
+
Group failures by what's broken:
|
|
52
|
+
- File A tests: Tool approval flow
|
|
53
|
+
- File B tests: Batch completion behavior
|
|
54
|
+
- File C tests: Abort functionality
|
|
55
|
+
|
|
56
|
+
Each domain is independent - fixing tool approval doesn't affect abort tests.
|
|
57
|
+
|
|
58
|
+
### 2. Create Focused Agent Tasks
|
|
59
|
+
|
|
60
|
+
Each agent gets:
|
|
61
|
+
- **Specific scope:** One test file or subsystem
|
|
62
|
+
- **Clear goal:** Make these tests pass
|
|
63
|
+
- **Constraints:** Don't change other code
|
|
64
|
+
- **Expected output:** Summary of what you found and fixed
|
|
65
|
+
|
|
66
|
+
### 3. Dispatch in Parallel
|
|
67
|
+
|
|
68
|
+
```typescript
|
|
69
|
+
// In Claude Code / AI environment
|
|
70
|
+
Task("Fix agent-tool-abort.test.ts failures")
|
|
71
|
+
Task("Fix batch-completion-behavior.test.ts failures")
|
|
72
|
+
Task("Fix tool-approval-race-conditions.test.ts failures")
|
|
73
|
+
// All three run concurrently
|
|
74
|
+
```
|
|
75
|
+
|
|
76
|
+
### 4. Review and Integrate
|
|
77
|
+
|
|
78
|
+
When agents return:
|
|
79
|
+
- Read each summary
|
|
80
|
+
- Verify fixes don't conflict
|
|
81
|
+
- Run full test suite
|
|
82
|
+
- Integrate all changes
|
|
83
|
+
|
|
84
|
+
## Agent Prompt Structure
|
|
85
|
+
|
|
86
|
+
Good agent prompts are:
|
|
87
|
+
1. **Focused** - One clear problem domain
|
|
88
|
+
2. **Self-contained** - All context needed to understand the problem
|
|
89
|
+
3. **Specific about output** - What should the agent return?
|
|
90
|
+
|
|
91
|
+
```markdown
|
|
92
|
+
Fix the 3 failing tests in src/agents/agent-tool-abort.test.ts:
|
|
93
|
+
|
|
94
|
+
1. "should abort tool with partial output capture" - expects 'interrupted at' in message
|
|
95
|
+
2. "should handle mixed completed and aborted tools" - fast tool aborted instead of completed
|
|
96
|
+
3. "should properly track pendingToolCount" - expects 3 results but gets 0
|
|
97
|
+
|
|
98
|
+
These are timing/race condition issues. Your task:
|
|
99
|
+
|
|
100
|
+
1. Read the test file and understand what each test verifies
|
|
101
|
+
2. Identify root cause - timing issues or actual bugs?
|
|
102
|
+
3. Fix by:
|
|
103
|
+
- Replacing arbitrary timeouts with event-based waiting
|
|
104
|
+
- Fixing bugs in abort implementation if found
|
|
105
|
+
- Adjusting test expectations if testing changed behavior
|
|
106
|
+
|
|
107
|
+
Do NOT just increase timeouts - find the real issue.
|
|
108
|
+
|
|
109
|
+
Return: Summary of what you found and what you fixed.
|
|
110
|
+
```
|
|
111
|
+
|
|
112
|
+
## Common Mistakes
|
|
113
|
+
|
|
114
|
+
**❌ Too broad:** "Fix all the tests" - agent gets lost
|
|
115
|
+
**✅ Specific:** "Fix agent-tool-abort.test.ts" - focused scope
|
|
116
|
+
|
|
117
|
+
**❌ No context:** "Fix the race condition" - agent doesn't know where
|
|
118
|
+
**✅ Context:** Paste the error messages and test names
|
|
119
|
+
|
|
120
|
+
**❌ No constraints:** Agent might refactor everything
|
|
121
|
+
**✅ Constraints:** "Do NOT change production code" or "Fix tests only"
|
|
122
|
+
|
|
123
|
+
**❌ Vague output:** "Fix it" - you don't know what changed
|
|
124
|
+
**✅ Specific:** "Return summary of root cause and changes"
|
|
125
|
+
|
|
126
|
+
## When NOT to Use
|
|
127
|
+
|
|
128
|
+
**Related failures:** Fixing one might fix others - investigate together first
|
|
129
|
+
**Need full context:** Understanding requires seeing entire system
|
|
130
|
+
**Exploratory debugging:** You don't know what's broken yet
|
|
131
|
+
**Shared state:** Agents would interfere (editing same files, using same resources)
|
|
132
|
+
|
|
133
|
+
## Real Example from Session
|
|
134
|
+
|
|
135
|
+
**Scenario:** 6 test failures across 3 files after major refactoring
|
|
136
|
+
|
|
137
|
+
**Failures:**
|
|
138
|
+
- agent-tool-abort.test.ts: 3 failures (timing issues)
|
|
139
|
+
- batch-completion-behavior.test.ts: 2 failures (tools not executing)
|
|
140
|
+
- tool-approval-race-conditions.test.ts: 1 failure (execution count = 0)
|
|
141
|
+
|
|
142
|
+
**Decision:** Independent domains - abort logic separate from batch completion separate from race conditions
|
|
143
|
+
|
|
144
|
+
**Dispatch:**
|
|
145
|
+
```
|
|
146
|
+
Agent 1 → Fix agent-tool-abort.test.ts
|
|
147
|
+
Agent 2 → Fix batch-completion-behavior.test.ts
|
|
148
|
+
Agent 3 → Fix tool-approval-race-conditions.test.ts
|
|
149
|
+
```
|
|
150
|
+
|
|
151
|
+
**Results:**
|
|
152
|
+
- Agent 1: Replaced timeouts with event-based waiting
|
|
153
|
+
- Agent 2: Fixed event structure bug (threadId in wrong place)
|
|
154
|
+
- Agent 3: Added wait for async tool execution to complete
|
|
155
|
+
|
|
156
|
+
**Integration:** All fixes independent, no conflicts, full suite green
|
|
157
|
+
|
|
158
|
+
**Time saved:** 3 problems solved in parallel vs sequentially
|
|
159
|
+
|
|
160
|
+
## Key Benefits
|
|
161
|
+
|
|
162
|
+
1. **Parallelization** - Multiple investigations happen simultaneously
|
|
163
|
+
2. **Focus** - Each agent has narrow scope, less context to track
|
|
164
|
+
3. **Independence** - Agents don't interfere with each other
|
|
165
|
+
4. **Speed** - 3 problems solved in time of 1
|
|
166
|
+
|
|
167
|
+
## Verification
|
|
168
|
+
|
|
169
|
+
After agents return:
|
|
170
|
+
1. **Review each summary** - Understand what changed
|
|
171
|
+
2. **Check for conflicts** - Did agents edit same code?
|
|
172
|
+
3. **Run full suite** - Verify all fixes work together
|
|
173
|
+
4. **Spot check** - Agents can make systematic errors
|
|
174
|
+
|
|
175
|
+
## Real-World Impact
|
|
176
|
+
|
|
177
|
+
From debugging session (2025-10-03):
|
|
178
|
+
- 6 failures across 3 files
|
|
179
|
+
- 3 agents dispatched in parallel
|
|
180
|
+
- All investigations completed concurrently
|
|
181
|
+
- All fixes integrated successfully
|
|
182
|
+
- Zero conflicts between agent changes
|
|
@@ -0,0 +1,70 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: executing-plans
|
|
3
|
+
description: Use when you have a written implementation plan to execute in a separate session with review checkpoints
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# Executing Plans
|
|
7
|
+
|
|
8
|
+
## Overview
|
|
9
|
+
|
|
10
|
+
Load plan, review critically, execute all tasks, report when complete.
|
|
11
|
+
|
|
12
|
+
**Announce at start:** "I'm using the executing-plans skill to implement this plan."
|
|
13
|
+
|
|
14
|
+
**Note:** Tell your human partner that Superpowers works much better with access to subagents. The quality of its work will be significantly higher if run on a platform with subagent support (such as Claude Code or Codex). If subagents are available, use superpowers:subagent-driven-development instead of this skill.
|
|
15
|
+
|
|
16
|
+
## The Process
|
|
17
|
+
|
|
18
|
+
### Step 1: Load and Review Plan
|
|
19
|
+
1. Read plan file
|
|
20
|
+
2. Review critically - identify any questions or concerns about the plan
|
|
21
|
+
3. If concerns: Raise them with your human partner before starting
|
|
22
|
+
4. If no concerns: Create TodoWrite and proceed
|
|
23
|
+
|
|
24
|
+
### Step 2: Execute Tasks
|
|
25
|
+
|
|
26
|
+
For each task:
|
|
27
|
+
1. Mark as in_progress
|
|
28
|
+
2. Follow each step exactly (plan has bite-sized steps)
|
|
29
|
+
3. Run verifications as specified
|
|
30
|
+
4. Mark as completed
|
|
31
|
+
|
|
32
|
+
### Step 3: Complete Development
|
|
33
|
+
|
|
34
|
+
After all tasks complete and verified:
|
|
35
|
+
- Announce: "I'm using the finishing-a-development-branch skill to complete this work."
|
|
36
|
+
- **REQUIRED SUB-SKILL:** Use superpowers:finishing-a-development-branch
|
|
37
|
+
- Follow that skill to verify tests, present options, execute choice
|
|
38
|
+
|
|
39
|
+
## When to Stop and Ask for Help
|
|
40
|
+
|
|
41
|
+
**STOP executing immediately when:**
|
|
42
|
+
- Hit a blocker (missing dependency, test fails, instruction unclear)
|
|
43
|
+
- Plan has critical gaps preventing starting
|
|
44
|
+
- You don't understand an instruction
|
|
45
|
+
- Verification fails repeatedly
|
|
46
|
+
|
|
47
|
+
**Ask for clarification rather than guessing.**
|
|
48
|
+
|
|
49
|
+
## When to Revisit Earlier Steps
|
|
50
|
+
|
|
51
|
+
**Return to Review (Step 1) when:**
|
|
52
|
+
- Partner updates the plan based on your feedback
|
|
53
|
+
- Fundamental approach needs rethinking
|
|
54
|
+
|
|
55
|
+
**Don't force through blockers** - stop and ask.
|
|
56
|
+
|
|
57
|
+
## Remember
|
|
58
|
+
- Review plan critically first
|
|
59
|
+
- Follow plan steps exactly
|
|
60
|
+
- Don't skip verifications
|
|
61
|
+
- Reference skills when plan says to
|
|
62
|
+
- Stop when blocked, don't guess
|
|
63
|
+
- Never start implementation on main/master branch without explicit user consent
|
|
64
|
+
|
|
65
|
+
## Integration
|
|
66
|
+
|
|
67
|
+
**Required workflow skills:**
|
|
68
|
+
- **superpowers:using-git-worktrees** - REQUIRED: Set up isolated workspace before starting
|
|
69
|
+
- **superpowers:writing-plans** - Creates the plan this skill executes
|
|
70
|
+
- **superpowers:finishing-a-development-branch** - Complete development after all tasks
|