buildanything 1.7.0 → 1.7.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -12,7 +12,7 @@
12
12
  "name": "buildanything",
13
13
  "source": "./",
14
14
  "description": "Full product build pipeline with 73 specialist agents orchestrated across architecture, implementation, testing, and hardening phases. Includes /build (full factory) and /idea-sweep (parallel research).",
15
- "version": "1.7.0"
15
+ "version": "1.7.1"
16
16
  }
17
17
  ]
18
18
  }
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "buildanything",
3
- "version": "1.7.0",
3
+ "version": "1.7.1",
4
4
  "description": "One command to build an entire product. 73 specialist agents orchestrated into a full engineering pipeline — from idea to shipped, tested, reviewed code.",
5
5
  "author": {
6
6
  "name": "Sujit"
package/README.md CHANGED
@@ -2,17 +2,18 @@
2
2
 
3
3
  **One command to build an entire product.**
4
4
 
5
- buildanything is a Claude Code plugin that orchestrates 73 specialist AI agents into a full engineering pipeline. You describe what you want to build. buildanything handles architecture, implementation, testing, code review, security audit, accessibility, and documentation — the same process that teams at Meta, Google, and Stripe run, compressed into one session.
5
+ buildanything is a Claude Code plugin that orchestrates 55 specialist AI agents into a full engineering pipeline — from idea to shipped, tested, reviewed code. You describe what you want to build. buildanything handles research, design, architecture, implementation, behavioral testing, and hardening.
6
6
 
7
7
  No agent expertise required. No manual coordination. Just `/build`.
8
8
 
9
9
  ## Install
10
10
 
11
- **One command:**
12
11
  ```
13
12
  npx buildanything
14
13
  ```
15
14
 
15
+ This installs the plugin, companion plugins, and [agent-browser](https://github.com/vercel-labs/agent-browser) for behavioral testing.
16
+
16
17
  **Or manually in Claude Code:**
17
18
  ```
18
19
  /plugin marketplace add sujitmeka/buildanything
@@ -23,95 +24,104 @@ npx buildanything
23
24
 
24
25
  ### `/build` — Full Product Pipeline
25
26
 
26
- Takes a brainstormed idea and builds it. Runs 5 phases with quality gates between each:
27
-
28
- 1. **Architecture** — Backend Architect, UX Architect, Security Engineer, and code-architect design the system. Sprint Prioritizer and Senior Project Manager break it into ordered tasks with acceptance criteria.
27
+ Takes an idea and builds it through 8 phases with quality gates, metric-driven iteration loops, and behavioral verification:
29
28
 
30
- 2. **Foundation** — DevOps Automator and Frontend Developer scaffold the project. UX Architect lays down the design system.
29
+ 0. **Context & Pre-Flight** — Scans for existing work, checks prerequisites, initializes state.
30
+ 1. **Brainstorm & Research** — Interactive brainstorming + 5 parallel research agents (market, tech, user, business, risk). Writes CLAUDE.md as the product brain.
31
+ 2. **Architecture & Planning** — 4 parallel architecture agents + sprint planning with user journeys, NFRs, and behavioral acceptance criteria per task.
32
+ 3. **Design & Visual Identity** — Competitive research via Playwright, living style guide with every component rendered and interactive, visual QA scoring at 80/100. Anti-AI-template detection.
33
+ 4. **Foundation** — Scaffolding, design system from style guide, acceptance test stubs (TDD-style — tests fail until features are built).
34
+ 5. **Build** — Per-task: implement → cleanup → metric loop → behavioral smoke test (agent-browser) → 7-check verification. Each task verified before the next starts.
35
+ 6. **Harden** — 5 parallel audits (API, performance, accessibility, security, UX quality) → eval harness → metric loop → E2E tests from user journeys → autonomous dogfooding → fake data detector → Reality Checker with fix-and-retest loops.
36
+ 7. **Ship** — Requirements coverage report, documentation, learnings capture.
31
37
 
32
- 3. **Build** — Each task goes through Dev→Test→Review loops. Frontend Developer, Backend Architect, or AI Engineer implement. Evidence Collector verifies. code-reviewer and silent-failure-hunter review. Failed tasks loop back with feedback, max 3 retries before escalating to you.
38
+ ```
39
+ /build a prediction market maker for Polymarket
40
+ /build docs/plans/my-design.md --autonomous
41
+ ```
33
42
 
34
- 4. **Harden**API Tester, Performance Benchmarker, Accessibility Auditor, and Security Engineer stress-test the full product. code-simplifier and type-design-analyzer clean up. Reality Checker gives the final verdict (defaults to NEEDS WORK).
43
+ ### `/idea-sweep`Parallel Research Sweep
35
44
 
36
- 5. **Ship** Technical Writer documents everything. Clean commits. Completion report.
45
+ 5 research agents evaluate an idea in parallel. Outputs a decision brief: GO / PIVOT / INVESTIGATE / KILL.
37
46
 
38
47
  ```
39
- /build autonomous prediction market maker for Polymarket
40
- /build docs/plans/2025-06-15-my-idea-design.md
48
+ /idea-sweep AI-powered building code compliance checker
41
49
  ```
42
50
 
43
- ### `/idea-sweep` — Parallel Research Sweep
51
+ ### Post-Build Commands
44
52
 
45
- Takes a raw idea and runs 5 research teams in parallel to decide if it's worth building:
53
+ | Command | Use case |
54
+ |---|---|
55
+ | `/fix` | "The submit button doesn't work" — scoped bug fixing with agent-browser verification |
56
+ | `/ux-review` | "The dashboard feels cramped" — UX audit against the living style guide, mobile checks |
57
+ | `/add-feature` | "Add a pause button" — mini build cycle using existing design system and architecture |
58
+ | `/dogfood` | "Test everything" — autonomous exploratory testing of the running app |
59
+ | `/verify` | "Does it all pass?" — quick 7-check health check |
60
+ | `/refactor` | "Change the auth to OAuth" — architect plans the change, then incremental execution |
46
61
 
47
- - **market-intel**TAM/SAM/SOM, competitive landscape, timing
48
- - **tech-feasibility** — Architecture sketch, hard problems, build vs buy, MVP scope
49
- - **user-research** — Persona, JTBD, current alternatives, behavioral barriers
50
- - **business-model** — Revenue model, unit economics, growth loops, moat
51
- - **risk-analysis** — Regulatory, security, dependencies, failure modes
62
+ All commands are argument-driventhey scope themselves to what you describe.
52
63
 
53
- Outputs a decision brief: GO / PIVOT / INVESTIGATE / KILL.
64
+ ## How It Works
54
65
 
55
- ```
56
- /idea-sweep AI-powered building code compliance checker
57
- ```
66
+ ### Behavioral Verification (agent-browser)
58
67
 
59
- ## The 73 Agents
68
+ Every UI task is smoke tested after implementation. [agent-browser](https://github.com/vercel-labs/agent-browser) opens the app, clicks buttons, fills forms, and collects evidence:
60
69
 
61
- buildanything includes agents from [agency-agents](https://github.com/msitarzewski/agency-agents) and custom research agents, organized into specialist divisions:
70
+ - **Snapshot diffs** verifies DOM actually changes when you click something
71
+ - **Network inspection** — catches failed API calls and missing endpoints
72
+ - **Console errors** — catches uncaught JS exceptions
73
+ - **Annotated screenshots** — labeled visual evidence (Claude is multimodal)
74
+ - **HAR capture** — full network traffic for fake data analysis
62
75
 
63
- ### Design (8)
64
- Brand Guardian · Image Prompt Engineer · Inclusive Visuals Specialist · UI Designer · UX Architect · UX Researcher · Visual Storyteller · Whimsy Injector
76
+ If a button doesn't work, the smoke test catches it immediately — not in Phase 6.
65
77
 
66
- ### Engineering (11)
67
- AI Engineer · Autonomous Optimization Architect · Backend Architect · Data Engineer · DevOps Automator · Frontend Developer · Mobile App Builder · Rapid Prototyper · Security Engineer · Senior Developer · Technical Writer
78
+ ### Living Style Guide
68
79
 
69
- ### Marketing (11)
70
- App Store Optimizer · Content Creator · Growth Hacker · Instagram Curator · Reddit Community Builder · Social Media Strategist · TikTok Strategist · Twitter Engager · WeChat Official Account Manager · Xiaohongshu Specialist · Zhihu Strategist
80
+ Phase 3 builds a rendered, interactive style guide at `/design-system` with every component in every state. This ships with the product and is referenced at every stage:
71
81
 
72
- ### Product (4)
73
- Behavioral Nudge Engine · Feedback Synthesizer · Sprint Prioritizer · Trend Researcher
82
+ - Phase 4: Design system tokens match the style guide
83
+ - Phase 5: Implementation agents reference it for UI tasks
84
+ - Phase 6: UX audit compares every page against it
74
85
 
75
- ### Project Management (5)
76
- Experiment Tracker · Project Shepherd · Senior Project Manager · Studio Operations · Studio Producer
86
+ ### Feedback Loops
77
87
 
78
- ### Spatial Computing (6)
79
- macOS Spatial/Metal Engineer · Terminal Integration Specialist · visionOS Spatial Engineer · XR Cockpit Interaction Specialist · XR Immersive Developer · XR Interface Architect
88
+ Every testing step feeds back into development:
80
89
 
81
- ### Specialized (9)
82
- Agentic Identity & Trust Architect · Agents Orchestrator · Cultural Intelligence Strategist · Data Analytics Reporter · Data Consolidation Agent · Developer Advocate · LSP/Index Engineer · Report Distribution Agent · Sales Data Extraction Agent
90
+ - Smoke test fails → fix agent + re-test (max 2 cycles)
91
+ - Dogfood finds issues classify + fix + re-dogfood (max 3 cycles)
92
+ - Fake data detected → fix agent replaces with real API calls (max 2 cycles)
93
+ - Reality Checker says NEEDS WORK → classify issues + fix + re-verify + re-check (max 2 cycles)
83
94
 
84
- ### Support (6)
85
- Analytics Reporter · Executive Summary Generator · Finance Tracker · Infrastructure Maintainer · Legal Compliance Checker · Support Responder
95
+ Nothing gets logged and ignored.
86
96
 
87
- ### Testing (8)
88
- Accessibility Auditor · API Tester · Evidence Collector · Performance Benchmarker · Reality Checker · Test Results Analyzer · Tool Evaluator · Workflow Optimizer
97
+ ## The 55 Agents
89
98
 
90
- ### Research (5)
91
- market-intel · tech-feasibility · user-research · business-model · risk-analysis
99
+ ### Design (8)
100
+ Brand Guardian · Image Prompt Engineer · Inclusive Visuals Specialist · UI Designer · UX Architect · UX Researcher · Visual Storyteller · Whimsy Injector
92
101
 
93
- ## Works With
102
+ ### Engineering (9)
103
+ AI Engineer · Autonomous Optimization Architect · Backend Architect · Data Engineer · DevOps Automator · Frontend Developer · Mobile App Builder · Rapid Prototyper · Security Engineer · Senior Developer · Technical Writer
94
104
 
95
- buildanything is designed to work alongside Claude Code's built-in plugins:
105
+ ### Marketing (8)
106
+ App Store Optimizer · Instagram Curator · Reddit Community Builder · Social Media Strategist · TikTok Strategist · Twitter Engager · WeChat Official Account Manager · Xiaohongshu Specialist · Zhihu Strategist
96
107
 
97
- - **feature-dev** — buildanything's `/build` command invokes `code-architect`, `code-explorer`, and `code-reviewer` from this plugin
98
- - **pr-review-toolkit** `silent-failure-hunter`, `code-simplifier`, `type-design-analyzer`, `comment-analyzer` are used in hardening
99
- - **code-review** — Used for final code review passes
100
- - **commit-commands** — Used for clean git commits during the pipeline
108
+ ### Product (2)
109
+ Behavioral Nudge Engine · Feedback Synthesizer
101
110
 
102
- Install these from the official Anthropic marketplace for the full experience:
103
- ```
104
- /plugin install feature-dev@claude-plugin-directory
105
- /plugin install pr-review-toolkit@claude-plugin-directory
106
- /plugin install code-review@claude-plugin-directory
107
- /plugin install commit-commands@claude-plugin-directory
108
- ```
111
+ ### Project Management (1)
112
+ Experiment Tracker
113
+
114
+ ### Specialized (4)
115
+ Agentic Identity & Trust Architect · Cultural Intelligence Strategist · Developer Advocate · Data Consolidation Agent · Report Distribution Agent · Sales Data Extraction Agent
116
+
117
+ ### Support (5)
118
+ Analytics Reporter · Executive Summary Generator · Finance Tracker · Legal Compliance Checker · Support Responder
109
119
 
110
- ## Credits
120
+ ### Testing (7)
121
+ Accessibility Auditor · API Tester · Evidence Collector · Performance Benchmarker · Reality Checker · Test Results Analyzer · Tool Evaluator · Workflow Optimizer
111
122
 
112
- - Agent definitions from [agency-agents](https://github.com/msitarzewski/agency-agents) by Mike Sitarzewski
113
- - Orchestration patterns inspired by [agency-agents](https://github.com/msitarzewski/agency-agents)
114
- - Claude Code plugin architecture by [Anthropic](https://github.com/anthropics/claude-code)
123
+ ### Research (5)
124
+ market-intel · tech-feasibility · user-research · business-model · risk-analysis
115
125
 
116
126
  ## License
117
127
 
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "buildanything",
3
- "version": "1.7.0",
3
+ "version": "1.7.1",
4
4
  "description": "One command to build an entire product. 73 specialist agents orchestrated into a full engineering pipeline for Claude Code.",
5
5
  "bin": {
6
6
  "buildanything": "./bin/setup.js"