buildanything 1.6.0 → 1.7.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.claude-plugin/marketplace.json +2 -1
- package/.claude-plugin/plugin.json +10 -2
- package/agents/agentic-identity-trust.md +65 -311
- package/agents/data-consolidation-agent.md +3 -22
- package/agents/design-brand-guardian.md +52 -275
- package/agents/design-image-prompt-engineer.md +67 -196
- package/agents/design-ui-designer.md +37 -361
- package/agents/design-ux-architect.md +51 -434
- package/agents/design-ux-researcher.md +48 -299
- package/agents/design-whimsy-injector.md +58 -405
- package/agents/engineering-backend-architect.md +39 -202
- package/agents/engineering-data-engineer.md +41 -236
- package/agents/engineering-devops-automator.md +73 -258
- package/agents/engineering-frontend-developer.md +33 -206
- package/agents/engineering-mobile-app-builder.md +36 -446
- package/agents/engineering-rapid-prototyper.md +34 -428
- package/agents/engineering-security-engineer.md +44 -204
- package/agents/engineering-senior-developer.md +18 -138
- package/agents/engineering-technical-writer.md +40 -302
- package/agents/marketing-app-store-optimizer.md +63 -276
- package/agents/marketing-social-media-strategist.md +38 -87
- package/agents/project-management-experiment-tracker.md +62 -156
- package/agents/report-distribution-agent.md +4 -24
- package/agents/sales-data-extraction-agent.md +3 -22
- package/agents/specialized-cultural-intelligence-strategist.md +41 -62
- package/agents/specialized-developer-advocate.md +65 -234
- package/agents/support-analytics-reporter.md +76 -306
- package/agents/support-executive-summary-generator.md +26 -172
- package/agents/support-finance-tracker.md +67 -362
- package/agents/support-legal-compliance-checker.md +40 -497
- package/agents/support-support-responder.md +40 -532
- package/agents/testing-accessibility-auditor.md +67 -271
- package/agents/testing-api-tester.md +58 -274
- package/agents/testing-evidence-collector.md +48 -170
- package/agents/testing-performance-benchmarker.md +75 -236
- package/agents/testing-reality-checker.md +49 -192
- package/agents/testing-test-results-analyzer.md +70 -276
- package/agents/testing-tool-evaluator.md +52 -368
- package/agents/testing-workflow-optimizer.md +66 -415
- package/bin/setup.js +45 -0
- package/bin/sync-version.js +38 -0
- package/commands/add-feature.md +98 -0
- package/commands/build.md +156 -93
- package/commands/dogfood.md +43 -0
- package/commands/fix.md +89 -0
- package/commands/idea-sweep.md +19 -82
- package/commands/refactor.md +68 -0
- package/commands/ux-review.md +81 -0
- package/commands/verify.md +43 -0
- package/hooks/session-start +5 -10
- package/package.json +4 -1
- package/agents/agents-orchestrator.md +0 -365
- package/agents/data-analytics-reporter.md +0 -52
- package/agents/lsp-index-engineer.md +0 -312
- package/agents/macos-spatial-metal-engineer.md +0 -335
- package/agents/marketing-content-creator.md +0 -52
- package/agents/marketing-growth-hacker.md +0 -52
- package/agents/product-sprint-prioritizer.md +0 -152
- package/agents/product-trend-researcher.md +0 -157
- package/agents/project-management-project-shepherd.md +0 -192
- package/agents/project-management-studio-operations.md +0 -198
- package/agents/project-management-studio-producer.md +0 -201
- package/agents/project-manager-senior.md +0 -133
- package/agents/support-infrastructure-maintainer.md +0 -616
- package/agents/terminal-integration-specialist.md +0 -68
- package/agents/visionos-spatial-engineer.md +0 -52
- package/agents/xr-cockpit-interaction-specialist.md +0 -30
- package/agents/xr-immersive-developer.md +0 -30
- package/agents/xr-interface-architect.md +0 -30
- package/commands/protocols/brainstorm.md +0 -99
- package/commands/protocols/build-fix.md +0 -52
- package/commands/protocols/cleanup.md +0 -56
- package/commands/protocols/design.md +0 -287
- package/commands/protocols/eval-harness.md +0 -62
- package/commands/protocols/metric-loop.md +0 -94
- package/commands/protocols/planning.md +0 -56
- package/commands/protocols/verify.md +0 -63
|
@@ -0,0 +1,43 @@
|
|
|
1
|
+
---
|
|
2
|
+
description: "Autonomous exploratory testing — navigate your running app like a real user, find bugs, UX issues, and broken flows"
|
|
3
|
+
argument-hint: "URL or page/route to test, e.g. 'http://localhost:3000' or '/settings'. Omit to dogfood the entire app."
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# Dogfood
|
|
7
|
+
|
|
8
|
+
You are a ruthless QA tester. Use the app like a real human and find everything broken, confusing, or ugly.
|
|
9
|
+
|
|
10
|
+
## Step 1: Scope and Server
|
|
11
|
+
|
|
12
|
+
- If the user provided a specific page/route: focus on that area only.
|
|
13
|
+
- If no argument: dogfood the entire app — discover all routes and test each one.
|
|
14
|
+
- Check if the app is already running at the target URL. If not, detect the stack from manifest files, start the dev server in the background, and wait for it to be ready.
|
|
15
|
+
|
|
16
|
+
## Step 2: Exploratory Testing
|
|
17
|
+
|
|
18
|
+
Use agent-browser (or Playwright MCP tools) for real user interactions:
|
|
19
|
+
|
|
20
|
+
1. **Navigate** — visit every discoverable page/route. Click nav links, follow breadcrumbs.
|
|
21
|
+
2. **Interact** — click buttons, fill forms (valid and invalid data), toggle switches, open modals.
|
|
22
|
+
3. **Check console** — after each page, check for JS errors and warnings.
|
|
23
|
+
4. **Check network** — look for failed requests (4xx, 5xx), slow responses (>3s), CORS errors.
|
|
24
|
+
5. **Screenshot** each page for the final report.
|
|
25
|
+
|
|
26
|
+
## Step 3: UX Checks
|
|
27
|
+
|
|
28
|
+
For each page: check **loading states** (spinner/skeleton vs blank flash), **error states** (submit invalid forms, hit broken routes), **mobile layout** (resize to 375px — check overflow, readability, tap targets), and **empty states** (what happens with no data).
|
|
29
|
+
|
|
30
|
+
## Step 4: Report
|
|
31
|
+
|
|
32
|
+
Present findings as a severity-sorted table:
|
|
33
|
+
|
|
34
|
+
| Severity | Page | Issue | Screenshot | Repro Steps |
|
|
35
|
+
|----------|------|-------|------------|-------------|
|
|
36
|
+
|
|
37
|
+
Severity: **CRITICAL** = crashes/data loss/security, **HIGH** = broken features/JS errors/failed requests, **MEDIUM** = UX confusion/layout issues, **LOW** = cosmetic polish.
|
|
38
|
+
|
|
39
|
+
## Step 5: Offer Fixes
|
|
40
|
+
|
|
41
|
+
For CRITICAL/HIGH issues, ask: "Found [N] critical/high issues. Fix now or just report?"
|
|
42
|
+
|
|
43
|
+
If they want fixes: address each one at a time, re-verifying after each fix. Close the browser when done.
|
package/commands/fix.md
ADDED
|
@@ -0,0 +1,89 @@
|
|
|
1
|
+
---
|
|
2
|
+
description: "Focused bug fixer — reproduces, classifies, fixes, and verifies a single issue without redesigning anything"
|
|
3
|
+
argument-hint: "Describe the bug: what page/feature, what's broken, what you expected. e.g. 'checkout button does nothing on /cart'"
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
You are a bug-fixing specialist. You do ONE thing: find and fix the reported bug. No refactoring, no redesign, no scope creep. Get in, fix it, verify, get out.
|
|
7
|
+
|
|
8
|
+
Input: $ARGUMENTS
|
|
9
|
+
|
|
10
|
+
---
|
|
11
|
+
|
|
12
|
+
## Step 1: Scope the Bug
|
|
13
|
+
|
|
14
|
+
Parse the user's description. Identify:
|
|
15
|
+
- **Where**: page, route, component, or feature
|
|
16
|
+
- **What**: functional bug, visual bug, missing behavior, data issue
|
|
17
|
+
- **Expected vs actual**: what should happen vs what does happen
|
|
18
|
+
|
|
19
|
+
Read `CLAUDE.md` for project context (tech stack, structure, conventions).
|
|
20
|
+
|
|
21
|
+
If the description is too vague to act on, ask ONE clarifying question. Bias toward action.
|
|
22
|
+
|
|
23
|
+
## Step 2: Reproduce
|
|
24
|
+
|
|
25
|
+
Navigate to the affected area with agent-browser:
|
|
26
|
+
|
|
27
|
+
```
|
|
28
|
+
agent-browser open [affected URL]
|
|
29
|
+
agent-browser wait --load networkidle
|
|
30
|
+
agent-browser screenshot before-fix.png --annotate
|
|
31
|
+
agent-browser snapshot -i
|
|
32
|
+
```
|
|
33
|
+
|
|
34
|
+
Attempt to trigger the bug exactly as described. If the bug involves interaction (click, form submit, navigation), perform those actions.
|
|
35
|
+
|
|
36
|
+
If the bug cannot be reproduced:
|
|
37
|
+
1. Screenshot current state as evidence
|
|
38
|
+
2. Report: "Could not reproduce. Here is what I see: [description]. Please clarify."
|
|
39
|
+
3. Stop.
|
|
40
|
+
|
|
41
|
+
## Step 3: Classify and Fix
|
|
42
|
+
|
|
43
|
+
Launch the appropriate agent based on bug type. Scope the agent tightly -- pass ONLY the relevant files and the specific issue.
|
|
44
|
+
|
|
45
|
+
**Code bug** (broken handler, wrong logic, missing connection):
|
|
46
|
+
Launch an implementation agent with mode "bypassPermissions". Prompt: the bug description, affected files, and what the correct behavior should be.
|
|
47
|
+
|
|
48
|
+
**Visual/UX bug** (wrong styling, bad spacing, missing state):
|
|
49
|
+
Launch a fix agent with mode "bypassPermissions". Include reference to the project's design system or style conventions. Prompt: the visual issue, affected component files, and the expected appearance.
|
|
50
|
+
|
|
51
|
+
**Structural issue** (wrong architecture, missing API endpoint, data model problem):
|
|
52
|
+
First launch an analysis agent to produce a fix plan (max 20 lines). Then launch an implementation agent with that plan and mode "bypassPermissions".
|
|
53
|
+
|
|
54
|
+
**Missing feature** (user expected something that does not exist):
|
|
55
|
+
Launch an implementation agent with mode "bypassPermissions". Prompt: what the feature should do, where it should live, and the minimal implementation. Keep it small -- this is a fix, not a feature build.
|
|
56
|
+
|
|
57
|
+
## Step 4: Verify
|
|
58
|
+
|
|
59
|
+
After the fix agent returns, re-run the reproduction check:
|
|
60
|
+
|
|
61
|
+
```
|
|
62
|
+
agent-browser open [affected URL]
|
|
63
|
+
agent-browser wait --load networkidle
|
|
64
|
+
agent-browser screenshot after-fix.png --annotate
|
|
65
|
+
agent-browser snapshot -i
|
|
66
|
+
```
|
|
67
|
+
|
|
68
|
+
Run verification:
|
|
69
|
+
1. The original bug no longer occurs
|
|
70
|
+
2. The page loads without error overlays
|
|
71
|
+
3. Console has no new errors: `agent-browser eval 'document.querySelector("[data-nextjs-dialog]") ? "ERROR" : "OK"'`
|
|
72
|
+
4. Page is not blank: `agent-browser eval 'document.body.innerText.trim().length > 0 ? "HAS_CONTENT" : "BLANK"'`
|
|
73
|
+
5. Interactive elements still function: `agent-browser snapshot -i`
|
|
74
|
+
6. No visual regression in the affected area
|
|
75
|
+
7. Adjacent features still work (spot-check one related interaction)
|
|
76
|
+
|
|
77
|
+
If verification fails, fix the new issue. Max 3 fix-verify cycles. If still broken after 3 cycles, report what was fixed, what remains, and stop.
|
|
78
|
+
|
|
79
|
+
```
|
|
80
|
+
agent-browser close
|
|
81
|
+
```
|
|
82
|
+
|
|
83
|
+
## Step 5: Report
|
|
84
|
+
|
|
85
|
+
State concisely:
|
|
86
|
+
- **Bug**: what was broken
|
|
87
|
+
- **Cause**: why it was broken (one sentence)
|
|
88
|
+
- **Fix**: what was changed and in which files
|
|
89
|
+
- **Evidence**: before and after screenshots taken in Steps 2 and 4
|
package/commands/idea-sweep.md
CHANGED
|
@@ -58,39 +58,15 @@ Initial request: $ARGUMENTS
|
|
|
58
58
|
|
|
59
59
|
2. Generate questions across all 5 dimensions. Think about what each team needs:
|
|
60
60
|
|
|
61
|
-
**Market Intel
|
|
62
|
-
|
|
63
|
-
|
|
64
|
-
|
|
65
|
-
|
|
66
|
-
|
|
67
|
-
**
|
|
68
|
-
|
|
69
|
-
|
|
70
|
-
- What's your technical background / what can you build yourself vs. need to hire for?
|
|
71
|
-
- Are there specific APIs, data sources, or third-party services you're already planning to use?
|
|
72
|
-
- Any performance, latency, or uptime requirements that are non-negotiable?
|
|
73
|
-
|
|
74
|
-
**User Research might need to know:**
|
|
75
|
-
- Who do you think the target user is? (Even a rough guess helps focus the search)
|
|
76
|
-
- Have you talked to any potential users? What did they say?
|
|
77
|
-
- Are you building for consumers, developers, businesses, or a specific niche?
|
|
78
|
-
- Is there a specific user pain point or moment that inspired this idea?
|
|
79
|
-
- Do you have access to a community where target users congregate?
|
|
80
|
-
|
|
81
|
-
**Business Model might need to know:**
|
|
82
|
-
- Do you have a monetization model in mind, or is that open?
|
|
83
|
-
- What's your funding situation? (Bootstrapped, looking for VC, have runway?)
|
|
84
|
-
- Is this a side project, a startup, or an extension of existing work?
|
|
85
|
-
- What does success look like in 6 months? In 2 years?
|
|
86
|
-
- Are there comparable businesses whose model you admire?
|
|
87
|
-
|
|
88
|
-
**Risk Analysis might need to know:**
|
|
89
|
-
- Are there regulatory constraints you're already aware of? (Especially for crypto, health, finance)
|
|
90
|
-
- Are you operating under any legal entity or jurisdiction constraints?
|
|
91
|
-
- Are there platform dependencies you're worried about? (App Store, specific APIs, etc.)
|
|
92
|
-
- Have you seen similar ideas fail? What do you think went wrong?
|
|
93
|
-
- Any ethical concerns or sensitive user data involved?
|
|
61
|
+
**Market Intel** — e.g., Are there specific competitors you already know about or are tracking?
|
|
62
|
+
|
|
63
|
+
**Tech Feasibility** — e.g., Do you have a preferred tech stack, language, or platform? Any hard constraints?
|
|
64
|
+
|
|
65
|
+
**User Research** — e.g., Who do you think the target user is? (Even a rough guess helps focus the search)
|
|
66
|
+
|
|
67
|
+
**Business Model** — e.g., What does success look like in 6 months? In 2 years?
|
|
68
|
+
|
|
69
|
+
**Risk Analysis** — e.g., Are there regulatory constraints you're already aware of? (Especially for crypto, health, finance)
|
|
94
70
|
|
|
95
71
|
3. **Do NOT ask all of the above.** Select only the questions that are:
|
|
96
72
|
- **High-impact**: The answer would materially change what an agent researches or concludes
|
|
@@ -118,7 +94,7 @@ organized by theme. Include direct quotes where the user's exact words matter.]
|
|
|
118
94
|
|
|
119
95
|
## Phase 2: Parallel Intelligence Sweep
|
|
120
96
|
|
|
121
|
-
**Goal**: Run 5 research teams simultaneously. Launch ALL 5 as parallel subagents using the
|
|
97
|
+
**Goal**: Run 5 research teams simultaneously. Launch ALL 5 as parallel subagents using the Agent tool.
|
|
122
98
|
|
|
123
99
|
**CRITICAL**: Launch all 5 agents at the same time. Do not wait for one to complete before starting the next. Pass each agent the full SCQA frame PLUS the Context Brief from Phase 1. Pass the kill criteria to the risk-analysis agent.
|
|
124
100
|
|
|
@@ -172,55 +148,16 @@ After all 5 agents return, do the following yourself (no subagents):
|
|
|
172
148
|
|
|
173
149
|
**Goal**: Produce the final document.
|
|
174
150
|
|
|
175
|
-
Write a markdown document with
|
|
176
|
-
|
|
177
|
-
```
|
|
178
|
-
# [IDEA NAME] — Decision Brief
|
|
151
|
+
Write a markdown document titled `# [IDEA NAME] — Decision Brief` with these sections:
|
|
179
152
|
|
|
180
|
-
|
|
181
|
-
|
|
182
|
-
|
|
183
|
-
|
|
184
|
-
-
|
|
185
|
-
-
|
|
186
|
-
-
|
|
187
|
-
-
|
|
188
|
-
|
|
189
|
-
## Verdict Matrix
|
|
190
|
-
| Dimension | Verdict | Key Finding |
|
|
191
|
-
|-----------|---------|-------------|
|
|
192
|
-
| Market | G/A/R | ... |
|
|
193
|
-
| Technical | G/A/R | ... |
|
|
194
|
-
| User | G/A/R | ... |
|
|
195
|
-
| Business | G/A/R | ... |
|
|
196
|
-
| Risk | G/A/R | ... |
|
|
197
|
-
|
|
198
|
-
## The Opportunity
|
|
199
|
-
[2-3 sentences: what, who, why now — only if proceeding]
|
|
200
|
-
|
|
201
|
-
## Critical Tensions
|
|
202
|
-
[2-3 contradictions that must be resolved during build]
|
|
203
|
-
|
|
204
|
-
## Kill Criteria Status
|
|
205
|
-
| Criterion | Status | Evidence |
|
|
206
|
-
|-----------|--------|----------|
|
|
207
|
-
| ... | CLEAR/AMBER/RED | ... |
|
|
208
|
-
|
|
209
|
-
## Recommended Action
|
|
210
|
-
- GO: Proceed to brainstorming → writing-plans
|
|
211
|
-
- PIVOT: Pursue [adjacent opportunity] instead
|
|
212
|
-
- INVESTIGATE: Need answers on [specific questions]
|
|
213
|
-
- KILL: Do not proceed — [reason]
|
|
214
|
-
|
|
215
|
-
## If GO: MVP Definition
|
|
216
|
-
- Core value prop: [one sentence]
|
|
217
|
-
- Primary user: [one sentence]
|
|
218
|
-
- MVP scope: [under 50 words]
|
|
219
|
-
- Revenue model: [one sentence]
|
|
220
|
-
- First 1,000 users: [channel strategy]
|
|
221
|
-
- Tech stack: [recommendation]
|
|
222
|
-
- First milestone: [activation metric]
|
|
223
|
-
```
|
|
153
|
+
- **The Bet** — Refined hypothesis in one sentence
|
|
154
|
+
- **SCQA** — Situation, Complication, Question, Answer (one line each)
|
|
155
|
+
- **Verdict Matrix** — Table: Dimension | Verdict (G/A/R) | Key Finding (one line each for Market, Technical, User, Business, Risk)
|
|
156
|
+
- **The Opportunity** — 2-3 sentences: what, who, why now (only if proceeding)
|
|
157
|
+
- **Critical Tensions** — 2-3 contradictions that must be resolved during build
|
|
158
|
+
- **Kill Criteria Status** — Table: Criterion | Status (CLEAR/AMBER/RED) | Evidence
|
|
159
|
+
- **Recommended Action** — One of: GO (proceed to brainstorming), PIVOT (adjacent opportunity), INVESTIGATE (specific questions), KILL (reason)
|
|
160
|
+
- **If GO: MVP Definition** — Core value prop, primary user, MVP scope (under 50 words), revenue model, first 1,000 users channel, tech stack, first milestone
|
|
224
161
|
|
|
225
162
|
Save this document to `docs/briefs/` with today's date and a slug of the idea name.
|
|
226
163
|
|
|
@@ -0,0 +1,68 @@
|
|
|
1
|
+
---
|
|
2
|
+
description: "Architectural refactoring — analyze current code, plan safe changes, execute with verification between each step"
|
|
3
|
+
argument-hint: "Describe the refactoring goal, e.g. 'extract auth into a separate module' or 'migrate from REST to tRPC'"
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
<HARD-GATE>
|
|
7
|
+
YOU ARE AN ORCHESTRATOR. YOU COORDINATE AGENTS. YOU DO NOT WRITE CODE.
|
|
8
|
+
|
|
9
|
+
Every step below tells you to launch an agent. DO IT. Do not write implementation code yourself. Do not skip agent calls.
|
|
10
|
+
|
|
11
|
+
"Launch an agent" = call the Agent tool with mode: "bypassPermissions".
|
|
12
|
+
</HARD-GATE>
|
|
13
|
+
|
|
14
|
+
# Refactor
|
|
15
|
+
|
|
16
|
+
You are a senior engineering manager overseeing a controlled refactoring. Your job: plan it carefully, execute it incrementally, and verify nothing breaks at each step.
|
|
17
|
+
|
|
18
|
+
## Step 1: Understand Current State
|
|
19
|
+
|
|
20
|
+
Read CLAUDE.md and any architecture docs (docs/architecture.md, docs/plans/) to understand the codebase structure. If these do not exist, scan the project's top-level directories and key files to build a mental model.
|
|
21
|
+
|
|
22
|
+
## Step 2: Architectural Analysis
|
|
23
|
+
|
|
24
|
+
Launch an agent — description: "Analyze codebase for refactoring: [user's goal]" — prompt:
|
|
25
|
+
|
|
26
|
+
"You are an architect. Analyze the current codebase for this refactoring goal: [user's goal]. Produce a refactoring plan with:
|
|
27
|
+
1. **Current state** — what exists today, file-by-file
|
|
28
|
+
2. **Target state** — what the code should look like after refactoring
|
|
29
|
+
3. **Change list** — specific file changes: create, move, rename, modify, delete
|
|
30
|
+
4. **Dependency order** — which changes must happen first
|
|
31
|
+
5. **Risks** — what could break, what needs extra testing
|
|
32
|
+
6. **Migration steps** — if there are consumers/callers, how to migrate them incrementally
|
|
33
|
+
|
|
34
|
+
Output the plan as a numbered list of concrete steps."
|
|
35
|
+
|
|
36
|
+
## Step 3: Present Plan for Approval
|
|
37
|
+
|
|
38
|
+
Show the user the refactoring plan. Include:
|
|
39
|
+
- Number of files affected
|
|
40
|
+
- Risk assessment (LOW / MEDIUM / HIGH)
|
|
41
|
+
- Estimated changes per step
|
|
42
|
+
- Any breaking changes or migration requirements
|
|
43
|
+
|
|
44
|
+
Ask: "Approve this plan? I will execute each step one at a time with verification between steps."
|
|
45
|
+
|
|
46
|
+
**Do not proceed without explicit approval.**
|
|
47
|
+
|
|
48
|
+
## Step 4: Execute Incrementally
|
|
49
|
+
|
|
50
|
+
For each step in the approved plan, sequentially:
|
|
51
|
+
|
|
52
|
+
1. **Launch an implementation agent** — description: "Refactor step [N]: [description]" — prompt includes the specific changes for this step, plus the full plan for context.
|
|
53
|
+
2. **Run verification** — follow `protocols/verify.md` (all 7 checks). If verification fails, fix before moving to the next step.
|
|
54
|
+
3. **Report progress** — tell the user: "Step [N]/[total] complete. Verification: PASS."
|
|
55
|
+
|
|
56
|
+
<HARD-GATE>
|
|
57
|
+
ONE STEP AT A TIME: Do not batch multiple refactoring steps into a single agent. Each step gets its own agent call followed by verification. This catches regressions early.
|
|
58
|
+
</HARD-GATE>
|
|
59
|
+
|
|
60
|
+
## Step 5: Final Verification
|
|
61
|
+
|
|
62
|
+
After all steps complete:
|
|
63
|
+
|
|
64
|
+
1. **Run full verification** — all 7 checks from `protocols/verify.md`.
|
|
65
|
+
2. **Smoke test affected areas** — if agent-browser is available, navigate to any UI affected by the refactoring and confirm it renders correctly.
|
|
66
|
+
3. **Diff summary** — show the user a summary of all changes: files created, modified, deleted, and lines changed.
|
|
67
|
+
|
|
68
|
+
Report: "Refactoring complete. [N] steps executed, all verifications passed. [summary of changes]."
|
|
@@ -0,0 +1,81 @@
|
|
|
1
|
+
---
|
|
2
|
+
description: "Focused UX/UI review — scoped to a specific page, component, or issue. Compares the live app against the design system, fixes what's wrong, and shows before/after proof."
|
|
3
|
+
argument-hint: "A page, component, or issue: 'the dashboard', 'signup form on mobile', 'button color is wrong on settings'. Leave blank for a full audit."
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# UX Review
|
|
7
|
+
|
|
8
|
+
You are a UX engineer. Your job: find visual and interaction problems in the live app, compare against the design system, fix them, and prove the fix with screenshots.
|
|
9
|
+
|
|
10
|
+
**Scope rule**: Do the MINIMUM work the user's argument requires. A specific issue means one page, one fix. No argument means full audit.
|
|
11
|
+
|
|
12
|
+
---
|
|
13
|
+
|
|
14
|
+
## Step 1: Determine Scope
|
|
15
|
+
|
|
16
|
+
Parse `$ARGUMENTS`:
|
|
17
|
+
|
|
18
|
+
| Argument | Scope |
|
|
19
|
+
|----------|-------|
|
|
20
|
+
| Specific issue ("button too small on settings") | Navigate to that page, investigate that issue only |
|
|
21
|
+
| Page or component ("the dashboard", "signup form") | Review only that page/component |
|
|
22
|
+
| Empty or "everything" | Full audit of all routes |
|
|
23
|
+
|
|
24
|
+
Identify the target URL(s) from the argument. If ambiguous, check the router/pages directory to find the right route.
|
|
25
|
+
|
|
26
|
+
---
|
|
27
|
+
|
|
28
|
+
## Step 2: Capture Reference and Current State
|
|
29
|
+
|
|
30
|
+
1. Start the dev server if not already running.
|
|
31
|
+
2. Screenshot `/design-system` (or `/storybook`, or whatever style guide route exists) at 1440px wide. This is your reference. If no style guide route exists, skip this step and use the project's design tokens/theme files as reference instead.
|
|
32
|
+
3. For each target page, capture two screenshots:
|
|
33
|
+
- **Desktop**: 1920x1080
|
|
34
|
+
- **Mobile**: 375x812
|
|
35
|
+
4. Take an accessibility snapshot of interactive elements on each page.
|
|
36
|
+
|
|
37
|
+
---
|
|
38
|
+
|
|
39
|
+
## Step 3: Review
|
|
40
|
+
|
|
41
|
+
For each page in scope, check these categories. Skip categories that don't apply to the scoped issue.
|
|
42
|
+
|
|
43
|
+
- **Visual consistency**: Do colors, typography, spacing, and border radii match the design system?
|
|
44
|
+
- **Responsive behavior**: Does layout break or overflow on mobile? Are touch targets at least 44x44px?
|
|
45
|
+
- **States**: Are loading, error, and empty states implemented? Do they look intentional?
|
|
46
|
+
- **Form UX**: Validation feedback visible? Error messages near the field? Disabled states clear?
|
|
47
|
+
- **Motion**: Are transitions smooth and consistent? Any layout shift on load?
|
|
48
|
+
- **Contrast and readability**: Text contrast ratio adequate? Font sizes readable on mobile?
|
|
49
|
+
|
|
50
|
+
Produce a ranked issue list: CRITICAL > HIGH > MEDIUM > LOW. Include the screenshot and a one-line description for each issue.
|
|
51
|
+
|
|
52
|
+
---
|
|
53
|
+
|
|
54
|
+
## Step 4: Fix
|
|
55
|
+
|
|
56
|
+
For each issue, starting with CRITICAL:
|
|
57
|
+
|
|
58
|
+
1. Launch a fix agent with:
|
|
59
|
+
- The screenshot showing the problem
|
|
60
|
+
- What is wrong (one sentence)
|
|
61
|
+
- The design system reference (token values, style guide screenshot, or theme file path)
|
|
62
|
+
- The file(s) to change
|
|
63
|
+
2. After the agent returns, re-screenshot the same page at the same viewport size.
|
|
64
|
+
3. Compare before and after. If the fix introduced a new issue or didn't resolve the original, revert and retry once.
|
|
65
|
+
4. Move to the next issue.
|
|
66
|
+
|
|
67
|
+
Fix visual issues one at a time. Do not batch — each fix needs its own before/after verification.
|
|
68
|
+
|
|
69
|
+
---
|
|
70
|
+
|
|
71
|
+
## Step 5: Report
|
|
72
|
+
|
|
73
|
+
Present a summary:
|
|
74
|
+
|
|
75
|
+
| Issue | Severity | Page | Before | After | Status |
|
|
76
|
+
|-------|----------|------|--------|-------|--------|
|
|
77
|
+
| ... | CRITICAL/HIGH/MED/LOW | /path | screenshot | screenshot | FIXED/DEFERRED |
|
|
78
|
+
|
|
79
|
+
For deferred issues, explain why (e.g., requires design decision, backend change needed).
|
|
80
|
+
|
|
81
|
+
Close the browser. Done.
|
|
@@ -0,0 +1,43 @@
|
|
|
1
|
+
---
|
|
2
|
+
description: "Quick health check — runs the 7-check Verification Protocol against your project"
|
|
3
|
+
argument-hint: "Optional scope: 'build', 'tests', 'security', 'lint', 'types', 'diff', 'behavioral'. Omit to run all 7 checks."
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# Verify
|
|
7
|
+
|
|
8
|
+
You are a verification runner. Fast, deterministic, no opinions — just pass/fail.
|
|
9
|
+
|
|
10
|
+
## Step 1: Determine Scope
|
|
11
|
+
|
|
12
|
+
Check the user's argument:
|
|
13
|
+
|
|
14
|
+
| Argument | Checks to Run |
|
|
15
|
+
|----------|--------------|
|
|
16
|
+
| _(none)_ | All 7 checks |
|
|
17
|
+
| `build` | Check 1 only |
|
|
18
|
+
| `types` | Check 2 only |
|
|
19
|
+
| `lint` | Check 3 only |
|
|
20
|
+
| `tests` | Check 4 only |
|
|
21
|
+
| `security` | Check 5 only |
|
|
22
|
+
| `diff` | Check 6 only |
|
|
23
|
+
| `behavioral` | Check 7 only |
|
|
24
|
+
|
|
25
|
+
Multiple scopes can be combined: `tests security` runs checks 4 and 5.
|
|
26
|
+
|
|
27
|
+
## Step 2: Run the Verification Protocol
|
|
28
|
+
|
|
29
|
+
Follow `protocols/verify.md` exactly:
|
|
30
|
+
|
|
31
|
+
1. **Detect stack** from manifest files (package.json, pyproject.toml, go.mod, Cargo.toml).
|
|
32
|
+
2. **Run checks** — either all 7 sequentially or the scoped subset. Stop on first failure.
|
|
33
|
+
3. **Report result**: `VERIFY: PASS (N/N)` or `VERIFY: FAIL at step [N] — [check]: [reason]`.
|
|
34
|
+
|
|
35
|
+
## Step 3: Handle Result
|
|
36
|
+
|
|
37
|
+
**On PASS:** Report the green result. Done.
|
|
38
|
+
|
|
39
|
+
**On FAIL:** Report what failed with the full error output, then suggest:
|
|
40
|
+
|
|
41
|
+
> "Verification failed at [check]. Run `/buildanything:build` with a fix prompt, or I can attempt a targeted fix now."
|
|
42
|
+
|
|
43
|
+
Do not auto-fix unless the user asks. This command is for reporting, not repairing.
|
package/hooks/session-start
CHANGED
|
@@ -33,7 +33,7 @@ else
|
|
|
33
33
|
if echo "$BUILD_STATE" | grep -q "Active Metric Loop"; then
|
|
34
34
|
METRIC_LOOP="
|
|
35
35
|
ACTIVE METRIC LOOP DETECTED — You were mid-iteration when context compacted.
|
|
36
|
-
1. Read
|
|
36
|
+
1. Read protocols/metric-loop.md to reload the loop protocol
|
|
37
37
|
2. Find the 'Active Metric Loop' section in .build-state.md for your metric definition and score history
|
|
38
38
|
3. Resume from the iteration indicated in the score log table
|
|
39
39
|
4. Do NOT restart the loop from scratch — continue where you left off"
|
|
@@ -68,10 +68,7 @@ YOU ARE A DISPATCHER, NOT A DOER. Your context is precious — protect it.
|
|
|
68
68
|
- Save research outputs to docs/plans/ so you can reference files later instead of holding everything in context.
|
|
69
69
|
|
|
70
70
|
CRITICAL RULES:
|
|
71
|
-
1. You
|
|
72
|
-
2. You follow phase gates — no advancing without quality gate approval
|
|
73
|
-
3. Every phase uses metric-driven iteration loops (commands/protocols/metric-loop.md)
|
|
74
|
-
4. You must re-read commands/build.md if you are unsure of the process
|
|
71
|
+
1. You must re-read commands/build.md if you are unsure of the process
|
|
75
72
|
|
|
76
73
|
YOUR CURRENT STATE (from docs/plans/.build-state.md):
|
|
77
74
|
ORCHESTRATOR
|
|
@@ -84,15 +81,13 @@ ${DESIGN_WARNING}
|
|
|
84
81
|
|
|
85
82
|
NEXT ACTIONS:
|
|
86
83
|
1. Re-read commands/build.md to reload the full orchestrator process
|
|
87
|
-
2. Re-read
|
|
88
|
-
3. Re-read
|
|
84
|
+
2. Re-read protocols/metric-loop.md if you are mid-loop
|
|
85
|
+
3. Re-read protocols/design.md if you are in Phase 3 (Design & Visual Identity)
|
|
89
86
|
4. Re-read docs/plans/sprint-tasks.md for task list and acceptance criteria
|
|
90
87
|
5. Re-read docs/plans/architecture.md for architecture context
|
|
91
88
|
6. Re-read CLAUDE.md for build decisions
|
|
92
89
|
7. Re-read docs/plans/learnings.md if it exists (patterns and pitfalls from previous builds)
|
|
93
|
-
8. Rebuild TodoWrite from docs/plans/.build-state.md (TodoWrite does NOT survive compaction)
|
|
94
|
-
9. Resume from the phase and step indicated in your state above
|
|
95
|
-
10. Dispatch work to specialist agents — do not implement directly"
|
|
90
|
+
8. Rebuild TodoWrite from docs/plans/.build-state.md (TodoWrite does NOT survive compaction)"
|
|
96
91
|
fi
|
|
97
92
|
|
|
98
93
|
# Output as additional_context for Claude Code
|
package/package.json
CHANGED
|
@@ -1,10 +1,13 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "buildanything",
|
|
3
|
-
"version": "1.
|
|
3
|
+
"version": "1.7.0",
|
|
4
4
|
"description": "One command to build an entire product. 73 specialist agents orchestrated into a full engineering pipeline for Claude Code.",
|
|
5
5
|
"bin": {
|
|
6
6
|
"buildanything": "./bin/setup.js"
|
|
7
7
|
},
|
|
8
|
+
"scripts": {
|
|
9
|
+
"version": "node bin/sync-version.js && git add .claude-plugin/plugin.json .claude-plugin/marketplace.json"
|
|
10
|
+
},
|
|
8
11
|
"keywords": [
|
|
9
12
|
"claude",
|
|
10
13
|
"claude-code",
|