buildcrew 1.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/LICENSE ADDED
@@ -0,0 +1,21 @@
1
+ MIT License
2
+
3
+ Copyright (c) 2026 z1nun
4
+
5
+ Permission is hereby granted, free of charge, to any person obtaining a copy
6
+ of this software and associated documentation files (the "Software"), to deal
7
+ in the Software without restriction, including without limitation the rights
8
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9
+ copies of the Software, and to permit persons to whom the Software is
10
+ furnished to do so, subject to the following conditions:
11
+
12
+ The above copyright notice and this permission notice shall be included in all
13
+ copies or substantial portions of the Software.
14
+
15
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21
+ SOFTWARE.
package/README.ko.md ADDED
@@ -0,0 +1,236 @@
1
+ # buildcrew
2
+
3
+ Claude Code를 위한 11개 AI 에이전트 — 9개 운영 모드로 자동 오케스트레이션되는 개발 팀.
4
+
5
+ ```bash
6
+ npx buildcrew
7
+ ```
8
+
9
+ > [English](README.md) | **한국어**
10
+
11
+ ---
12
+
13
+ ## 왜 buildcrew인가?
14
+
15
+ AI 코딩 에이전트는 강력하지만 구조 없이는 일관성 없는 결과를 냅니다. buildcrew는 Claude Code에 다음을 제공합니다:
16
+
17
+ - **팀** — 명확한 역할을 가진 11개 전문 에이전트
18
+ - **프로세스** — 품질 게이트와 반복을 갖춘 순차 파이프라인
19
+ - **하네스** — 프로젝트 컨텍스트, 규칙, 도메인 지식이 모든 에이전트에 내장
20
+ - **오케스트레이터** — `@constitution`에게 말하면 자동으로 적절한 에이전트 배치
21
+
22
+ ```
23
+ 나: @constitution 유저 대시보드 추가해줘
24
+ 크루: 기획자 → 디자이너 → 개발자 → QA → 브라우저 QA → 리뷰어 → 배포
25
+ ```
26
+
27
+ 외부 의존성 없음. 런타임 없음. 바이너리 없음. 그냥 마크다운.
28
+
29
+ ---
30
+
31
+ ## 시작하기
32
+
33
+ ```bash
34
+ # 1. 에이전트 설치
35
+ npx buildcrew
36
+
37
+ # 2. 프로젝트 하네스 설정 (인터랙티브)
38
+ npx buildcrew init
39
+
40
+ # 3. 추가 컨텍스트 (선택)
41
+ npx buildcrew add erd
42
+ npx buildcrew add architecture
43
+ npx buildcrew add design-system
44
+
45
+ # 4. 사용
46
+ @constitution 유저 대시보드 추가해줘
47
+ ```
48
+
49
+ ---
50
+
51
+ ## 하네스 엔지니어링
52
+
53
+ 하네스는 "범용 AI 출력"과 "내 프로젝트에 맞는 코드"의 차이를 만듭니다.
54
+
55
+ ```bash
56
+ npx buildcrew init
57
+ ```
58
+
59
+ 인터랙티브 설정으로 핵심 하네스 파일 생성:
60
+
61
+ ```
62
+ [1/3] 프로젝트 컨텍스트
63
+ 프로젝트명, 설명, 기술 스택 (자동 감지), 배포 타겟, 프로덕션 URL
64
+
65
+ [2/3] 팀 규칙
66
+ 코딩 컨벤션, 우선순위, 금지사항, 품질 기준, 리뷰 규칙
67
+
68
+ [3/3] 도메인 지식
69
+ 업종, 유저 타입, 도메인 용어, 비즈니스 룰
70
+ ```
71
+
72
+ ### 템플릿으로 추가 컨텍스트 제공
73
+
74
+ ```bash
75
+ npx buildcrew add erd # DB 스키마 & 관계
76
+ npx buildcrew add architecture # 시스템 아키텍처
77
+ npx buildcrew add api-spec # API 엔드포인트 & 계약
78
+ npx buildcrew add design-system # 색상, 타이포, 컴포넌트
79
+ npx buildcrew add glossary # 도메인 용어 & 유저 역할
80
+ npx buildcrew add user-flow # 유저 여정 & 페이지 맵
81
+ npx buildcrew add env-vars # 환경변수 가이드
82
+ ```
83
+
84
+ ### 열린 구조
85
+
86
+ 템플릿에 제한되지 않습니다. `.claude/harness/`에 아무 `.md` 파일이나 추가하세요:
87
+
88
+ ```
89
+ .claude/harness/
90
+ ├── project.md ← 핵심 (init으로 생성)
91
+ ├── rules.md ← 핵심 (init으로 생성)
92
+ ├── erd.md ← 템플릿 (add로 추가)
93
+ ├── architecture.md ← 템플릿 (add로 추가)
94
+ ├── 내맘대로.md ← 직접 작성 (에이전트가 읽음)
95
+ └── 무엇이든.md ← 아무 .md 파일 가능
96
+ ```
97
+
98
+ ### 하네스 파일 → 에이전트 라우팅
99
+
100
+ | 파일 | 읽는 에이전트 |
101
+ |------|-------------|
102
+ | `project.md`, `rules.md` | 모든 에이전트 |
103
+ | `erd.md`, `architecture.md`, `api-spec.md` | developer, reviewer, security-auditor, investigator |
104
+ | `design-system.md` | designer |
105
+ | `glossary.md`, `user-flow.md` | planner, designer, browser-qa |
106
+ | `env-vars.md` | developer, security-auditor |
107
+ | 커스텀 파일 | reviewer, security-auditor (전부 읽음) |
108
+
109
+ ### 하네스 관리
110
+
111
+ ```bash
112
+ npx buildcrew harness # 하네스 파일 상태 확인
113
+ npx buildcrew add # 사용 가능한 템플릿 목록
114
+ npx buildcrew add erd # 특정 템플릿 추가
115
+ npx buildcrew init --force # 핵심 파일 재생성
116
+ ```
117
+
118
+ ---
119
+
120
+ ## 에이전트
121
+
122
+ ### 빌드 팀
123
+
124
+ | 에이전트 | 모델 | 역할 |
125
+ |---------|------|------|
126
+ | **planner** | opus | 6가지 강제 질문 + 4관점 자체 리뷰 (CEO, 엔지니어링, 디자인, QA). 관점별 1-10점 채점. |
127
+ | **designer** | opus | 웹에서 UI/UX 레퍼런스 수집, Playwright로 실제 사이트 스크린샷, Figma MCP 통합, 프로덕션 React/Next.js 컴포넌트 작성. AI 슬롭 블랙리스트. |
128
+ | **developer** | sonnet | 기획서 + 디자인 + 하네스 규칙에 따라 구현. |
129
+
130
+ ### 품질 팀
131
+
132
+ | 에이전트 | 모델 | 역할 |
133
+ |---------|------|------|
134
+ | **qa-tester** | sonnet | 코드 레벨 검증 — 타입, 린트, 빌드, 수용 기준. |
135
+ | **browser-qa** | sonnet | Playwright MCP 실제 브라우저 테스트 — 유저 플로우, 반응형, 콘솔, 건강 점수 (0-100). |
136
+ | **reviewer** | opus | 4관점 병렬 리뷰 (보안, 성능, 테스트, 유지보수) + 적대적 리뷰 + 자동 수정. |
137
+ | **health-checker** | sonnet | 코드 품질 대시보드 — 7개 카테고리 가중 점수 (0-10) + 트렌드. |
138
+
139
+ ### 보안 & 운영 팀
140
+
141
+ | 에이전트 | 모델 | 역할 |
142
+ |---------|------|------|
143
+ | **security-auditor** | opus | OWASP Top 10 + STRIDE 위협 모델. 10단계 감사. |
144
+ | **canary-monitor** | sonnet | 배포 후 프로덕션 헬스 — 페이지, API, 콘솔, 성능 비교. |
145
+ | **shipper** | sonnet | 릴리즈 파이프라인 — 테스트 → 버전 → 체인지로그 → PR. |
146
+
147
+ ### 전문가
148
+
149
+ | 에이전트 | 모델 | 역할 |
150
+ |---------|------|------|
151
+ | **investigator** | sonnet | 근본 원인 디버깅. 4단계 조사. 무관한 코드 수정 동결. |
152
+
153
+ ---
154
+
155
+ ## 9가지 운영 모드
156
+
157
+ `@constitution`에게 자연스럽게 말하세요. 모드를 자동 감지합니다.
158
+
159
+ | 모드 | 예시 | 파이프라인 |
160
+ |------|------|----------|
161
+ | **Feature** | "유저 대시보드 추가해줘" | 기획 → 디자인 → 개발 → QA → 브라우저 QA → 리뷰 |
162
+ | **Project Audit** | "프로젝트 전체 점검해줘" | 스캔 → 우선순위 → 수정 → 검증 (반복) |
163
+ | **Browser QA** | "브라우저 테스트해줘" | Playwright 테스트 + 건강 점수 |
164
+ | **Security** | "보안 점검해줘" | OWASP + STRIDE + 시크릿 + 의존성 |
165
+ | **Debug** | "왜 로그인이 안 돼?" | 4단계 근본 원인 조사 |
166
+ | **Health** | "헬스체크 돌려줘" | 품질 대시보드 (타입, 린트, 의존성, i18n) |
167
+ | **Canary** | "배포 확인해줘" | 프로덕션 모니터링 |
168
+ | **Review** | "코드 리뷰해줘" | 멀티 전문가 + 적대적 + 자동 수정 |
169
+ | **Ship** | "배포해줘" | 테스트 → 버전 → 체인지로그 → PR |
170
+
171
+ ### 모드 체이닝
172
+
173
+ Constitution이 다음 모드를 자동 제안:
174
+ - Feature 완료 → Ship 제안
175
+ - Ship 완료 → Canary 제안
176
+ - Canary CRITICAL → Debug 트리거
177
+
178
+ ---
179
+
180
+ ## 기능 파이프라인
181
+
182
+ 각 기능이 완전한 문서 체인을 생성:
183
+
184
+ ```
185
+ .claude/pipeline/{기능명}/
186
+ ├── 01-plan.md 요구사항 + 4관점 리뷰 점수
187
+ ├── 02-references.md 실제 사이트에서 수집한 UI/UX 레퍼런스
188
+ ├── 02-design.md 디자인 결정 + 컴포넌트 스펙
189
+ ├── 03-dev-notes.md 구현 노트 + 변경 파일
190
+ ├── 04-qa-report.md 수용 기준 검증
191
+ ├── 05-browser-qa.md 건강 점수 + 스크린샷 + 플로우
192
+ ├── 06-review.md 리뷰 발견사항 + 자동 수정
193
+ └── 07-ship.md PR URL + 릴리즈 노트
194
+ ```
195
+
196
+ ---
197
+
198
+ ## CLI 레퍼런스
199
+
200
+ | 명령어 | 설명 |
201
+ |--------|------|
202
+ | `npx buildcrew` | 11개 에이전트 설치 |
203
+ | `npx buildcrew init` | 프로젝트 하네스 설정 (인터랙티브) |
204
+ | `npx buildcrew init --force` | 하네스 재생성 |
205
+ | `npx buildcrew add` | 사용 가능한 템플릿 목록 |
206
+ | `npx buildcrew add <type>` | 하네스 템플릿 추가 |
207
+ | `npx buildcrew harness` | 하네스 파일 상태 확인 |
208
+ | `npx buildcrew --force` | 기존 에이전트 덮어쓰기 |
209
+ | `npx buildcrew --list` | 에이전트 목록 (모델 포함) |
210
+ | `npx buildcrew --uninstall` | 에이전트 제거 |
211
+ | `npx buildcrew --version` | 버전 확인 |
212
+
213
+ ## 요구사항
214
+
215
+ - **필수**: [Claude Code](https://claude.ai/code) CLI
216
+ - **선택**: [Playwright MCP](https://github.com/anthropics/anthropic-quickstarts/tree/main/mcp-servers/playwright) — browser-qa, canary-monitor, designer용
217
+ - **선택**: [Figma MCP](https://github.com/anthropics/anthropic-quickstarts/tree/main/mcp-servers/figma) — designer용
218
+
219
+ ```bash
220
+ # 실제 브라우저 테스트 활성화
221
+ claude mcp add playwright -- npx @anthropic-ai/mcp-server-playwright
222
+ ```
223
+
224
+ ## 커스터마이징
225
+
226
+ 모든 파일 수정 가능:
227
+
228
+ ```
229
+ .claude/agents/ 에이전트 정의 — 역할, 도구, 지시사항, 모델 수정
230
+ .claude/harness/ 프로젝트 컨텍스트 — 언제든 수정, 아무 .md 추가
231
+ .claude/pipeline/ 출력 문서 — 기능별 자동 생성
232
+ ```
233
+
234
+ ## 라이선스
235
+
236
+ MIT
package/README.md ADDED
@@ -0,0 +1,276 @@
1
+ # buildcrew
2
+
3
+ > **English** | [한국어](README.ko.md)
4
+
5
+ 11 AI agents for Claude Code — auto-orchestrated dev team with 9 operating modes.
6
+
7
+ ```bash
8
+ npx buildcrew
9
+ ```
10
+
11
+ ---
12
+
13
+ ## Why buildcrew?
14
+
15
+ AI coding agents are powerful, but without structure they produce inconsistent results. buildcrew gives Claude Code:
16
+
17
+ - **A team** — 11 specialized agents, each with a clear role
18
+ - **A process** — sequential pipeline with quality gates and iteration
19
+ - **A harness** — your project context, rules, and domain knowledge baked in
20
+ - **An orchestrator** — just talk to `@constitution`, it routes automatically
21
+
22
+ ```
23
+ You: @constitution Add user authentication
24
+ Crew: Planner → Designer → Developer → QA → Browser QA → Reviewer → Ship
25
+ ```
26
+
27
+ No external dependencies. No runtime. No binaries. Just Markdown.
28
+
29
+ ---
30
+
31
+ ## Getting Started
32
+
33
+ ```bash
34
+ # 1. Install agents
35
+ npx buildcrew
36
+
37
+ # 2. Set up project harness (interactive)
38
+ npx buildcrew init
39
+
40
+ # 3. Add more context (optional)
41
+ npx buildcrew add erd
42
+ npx buildcrew add architecture
43
+ npx buildcrew add design-system
44
+
45
+ # 4. Start working
46
+ @constitution Add user dashboard
47
+ ```
48
+
49
+ ---
50
+
51
+ ## Harness Engineering
52
+
53
+ The harness is what separates "generic AI output" from "code that fits your project."
54
+
55
+ ```bash
56
+ npx buildcrew init
57
+ ```
58
+
59
+ Interactive setup creates core harness files:
60
+
61
+ ```
62
+ [1/3] Project Context
63
+ Project name, description, tech stack (auto-detected), deploy target, production URL
64
+
65
+ [2/3] Team Rules
66
+ Coding conventions, priorities, what to avoid, quality standards, review rules
67
+
68
+ [3/3] Domain Knowledge
69
+ Industry, user types, domain terms, business rules
70
+ ```
71
+
72
+ ### Add more context with templates
73
+
74
+ ```bash
75
+ npx buildcrew add erd # Database schema & relationships
76
+ npx buildcrew add architecture # System architecture overview
77
+ npx buildcrew add api-spec # API endpoints & contracts
78
+ npx buildcrew add design-system # Colors, typography, components
79
+ npx buildcrew add glossary # Domain terms & user roles
80
+ npx buildcrew add user-flow # User journeys & page map
81
+ npx buildcrew add env-vars # Environment variables guide
82
+ ```
83
+
84
+ ### The harness is open
85
+
86
+ Not limited to templates. Create any `.md` file in `.claude/harness/`:
87
+
88
+ ```
89
+ .claude/harness/
90
+ ├── project.md ← core (auto by init)
91
+ ├── rules.md ← core (auto by init)
92
+ ├── erd.md ← template
93
+ ├── architecture.md ← template
94
+ ├── design-system.md ← template
95
+ ├── glossary.md ← template
96
+ ├── my-custom-notes.md ← your own file (agents read it too)
97
+ └── anything.md ← any .md file works
98
+ ```
99
+
100
+ All agents read harness files before every task. Constitution routes the right files to the right agents:
101
+
102
+ | File | Routed to |
103
+ |------|-----------|
104
+ | `project.md`, `rules.md` | All agents |
105
+ | `erd.md`, `architecture.md`, `api-spec.md` | developer, reviewer, security-auditor, investigator |
106
+ | `design-system.md` | designer |
107
+ | `glossary.md`, `user-flow.md` | planner, designer, browser-qa |
108
+ | `env-vars.md` | developer, security-auditor |
109
+ | Custom files | reviewer, security-auditor (read ALL) |
110
+
111
+ ### Manage the harness
112
+
113
+ ```bash
114
+ npx buildcrew harness # Show status of all harness files
115
+ npx buildcrew add # List available templates
116
+ npx buildcrew add erd # Add a specific template
117
+ npx buildcrew init --force # Regenerate core files from scratch
118
+ ```
119
+
120
+ ---
121
+
122
+ ## Agents
123
+
124
+ ### Build Team
125
+
126
+ | Agent | Model | Role |
127
+ |-------|-------|------|
128
+ | **planner** | opus | 6 Forcing Questions + 4-Lens Self-Review (CEO, Engineering, Design, QA). Produces battle-tested plans scored 1-10 per lens. |
129
+ | **designer** | opus | Searches web for UI/UX references, screenshots real sites with Playwright, integrates Figma MCP, writes production React/Next.js components. AI slop blacklist enforced. |
130
+ | **developer** | sonnet | Implements features following plan + design + harness conventions. Self-reviews before handoff. |
131
+
132
+ ### Quality Team
133
+
134
+ | Agent | Model | Role |
135
+ |-------|-------|------|
136
+ | **qa-tester** | sonnet | Code-level verification — types, lint, build, acceptance criteria check. |
137
+ | **browser-qa** | sonnet | Real browser testing via Playwright MCP — user flows, responsive (375/768/1440px), console errors, network, health score (0-100). |
138
+ | **reviewer** | opus | 4-specialist parallel review (security, performance, testing, maintainability) + adversarial pass + auto-fix. Fix-first approach. |
139
+ | **health-checker** | sonnet | Code quality dashboard — 7-category weighted score (0-10), trend tracking, top 5 actionable items. |
140
+
141
+ ### Security & Ops
142
+
143
+ | Agent | Model | Role |
144
+ |-------|-------|------|
145
+ | **security-auditor** | opus | OWASP Top 10 + STRIDE threat model. 10-phase audit. Secrets scan, dependency audit, AI/LLM security. Confidence gate (8/10 standard, 2/10 comprehensive). |
146
+ | **canary-monitor** | sonnet | Post-deploy health — page availability, API status, console errors, performance vs baseline. |
147
+ | **shipper** | sonnet | Release pipeline — pre-flight checks → version bump → changelog → commit → push → PR creation. |
148
+
149
+ ### Specialist
150
+
151
+ | Agent | Model | Role |
152
+ |-------|-------|------|
153
+ | **investigator** | sonnet | Root cause debugging. 4 phases: investigate → hypothesize → test → fix. Edit freeze on unrelated code. "No fix without root cause." |
154
+
155
+ ---
156
+
157
+ ## 9 Operating Modes
158
+
159
+ Talk to `@constitution` naturally. It auto-detects the mode.
160
+
161
+ | Mode | Example | What happens |
162
+ |------|---------|-------------|
163
+ | **Feature** | "Add user dashboard" | Plan → Design → Dev → QA → Browser QA → Review |
164
+ | **Project Audit** | "full project audit" | Scan → Prioritize → Fix → Verify (loop) |
165
+ | **Browser QA** | "browser qa localhost:3000" | Playwright tests flows, responsive, console |
166
+ | **Security** | "security audit" | OWASP + STRIDE + secrets + deps |
167
+ | **Debug** | "debug: login broken" | 4-phase root cause investigation |
168
+ | **Health** | "health check" | Quality dashboard (types, lint, deps, i18n) |
169
+ | **Canary** | "canary https://myapp.com" | Post-deploy production monitoring |
170
+ | **Review** | "code review" | Multi-specialist analysis + auto-fix |
171
+ | **Ship** | "ship" | Test → version → changelog → PR |
172
+
173
+ ### Mode chaining
174
+
175
+ Constitution auto-suggests the next mode:
176
+ - Feature complete → Ship
177
+ - Ship complete → Canary
178
+ - Canary CRITICAL → Debug
179
+
180
+ ---
181
+
182
+ ## Feature Pipeline
183
+
184
+ Each feature generates a full document chain:
185
+
186
+ ```
187
+ .claude/pipeline/{feature}/
188
+ ├── 01-plan.md Requirements + 4-lens review scores
189
+ ├── 02-references.md Curated UI/UX references from real sites
190
+ ├── 02-design.md Design decisions + component specs
191
+ ├── 03-dev-notes.md Implementation notes + files changed
192
+ ├── 04-qa-report.md Acceptance criteria verification
193
+ ├── 05-browser-qa.md Health score + screenshots + flow results
194
+ ├── 06-review.md Review findings + auto-fixes applied
195
+ └── 07-ship.md PR URL + release notes
196
+ ```
197
+
198
+ ---
199
+
200
+ ## CLI Reference
201
+
202
+ | Command | Description |
203
+ |---------|-------------|
204
+ | `npx buildcrew` | Install 11 agents to `.claude/agents/` |
205
+ | `npx buildcrew init` | Set up project harness (interactive) |
206
+ | `npx buildcrew init --force` | Regenerate harness from scratch |
207
+ | `npx buildcrew add` | List available harness templates |
208
+ | `npx buildcrew add <type>` | Add a harness template (erd, architecture, etc.) |
209
+ | `npx buildcrew harness` | Show status of all harness files |
210
+ | `npx buildcrew --force` | Overwrite existing agent files |
211
+ | `npx buildcrew --list` | List all agents with models |
212
+ | `npx buildcrew --uninstall` | Remove installed agents |
213
+ | `npx buildcrew --version` | Show version |
214
+
215
+ ## Requirements
216
+
217
+ - **Required**: [Claude Code](https://claude.ai/code) CLI
218
+ - **Optional**: [Playwright MCP](https://github.com/anthropics/anthropic-quickstarts/tree/main/mcp-servers/playwright) — for browser-qa, canary-monitor, designer
219
+ - **Optional**: [Figma MCP](https://github.com/anthropics/anthropic-quickstarts/tree/main/mcp-servers/figma) — for designer
220
+
221
+ ```bash
222
+ # Enable real browser testing
223
+ claude mcp add playwright -- npx @anthropic-ai/mcp-server-playwright
224
+ ```
225
+
226
+ ## Customization
227
+
228
+ Every file is editable:
229
+
230
+ ```
231
+ .claude/agents/ Agent definitions — edit roles, tools, instructions, model
232
+ .claude/harness/ Project context — edit anytime, add any .md file
233
+ .claude/pipeline/ Output documents — auto-generated per feature
234
+ ```
235
+
236
+ Change an agent's model:
237
+ ```yaml
238
+ # In any agent .md file
239
+ model: opus # or sonnet, haiku
240
+ ```
241
+
242
+ ## Architecture
243
+
244
+ ```
245
+ @constitution (orchestrator, opus)
246
+
247
+ ├─ reads .claude/harness/*.md (project context)
248
+
249
+ ├─ detects mode from user message
250
+
251
+ ├─ dispatches agents with relevant harness context
252
+
253
+ └─ enforces quality gates + iteration
254
+
255
+ ├── Build: planner → designer → developer
256
+ ├── Quality: qa-tester → browser-qa → reviewer
257
+ ├── Sec/Ops: security-auditor, canary-monitor, shipper
258
+ └── Debug: investigator
259
+ ```
260
+
261
+ ## Compared to gstack
262
+
263
+ | | buildcrew | gstack |
264
+ |---|-----------|--------|
265
+ | Install | `npx buildcrew` | `git clone` + Bun + `./setup` |
266
+ | Orchestration | Auto (constitution routes) | Manual (`/qa`, `/review`, ...) |
267
+ | Harness | `init` + `add` + custom .md | Manual CLAUDE.md |
268
+ | Dependencies | None | Bun, Playwright binary (~58MB) |
269
+ | Browser testing | Playwright MCP | Custom Playwright daemon |
270
+ | Pipeline docs | Auto-generated chain (01→07) | Per-skill output |
271
+ | Agents | 11 (5 opus, 6 sonnet) | 34 |
272
+ | Customization | Edit .md directly, never overwritten | May overwrite on update |
273
+
274
+ ## License
275
+
276
+ MIT
@@ -0,0 +1,130 @@
1
+ ---
2
+ name: browser-qa
3
+ description: Browser QA agent - performs real browser testing using Playwright MCP, captures screenshots, tests user flows, checks console errors, and verifies responsive design
4
+ model: sonnet
5
+ tools:
6
+ - Read
7
+ - Write
8
+ - Glob
9
+ - Grep
10
+ - Bash
11
+ - mcp__playwright__browser_navigate
12
+ - mcp__playwright__browser_click
13
+ - mcp__playwright__browser_fill_form
14
+ - mcp__playwright__browser_snapshot
15
+ - mcp__playwright__browser_take_screenshot
16
+ - mcp__playwright__browser_evaluate
17
+ - mcp__playwright__browser_console_messages
18
+ - mcp__playwright__browser_hover
19
+ - mcp__playwright__browser_press_key
20
+ - mcp__playwright__browser_wait_for
21
+ - mcp__playwright__browser_tabs
22
+ - mcp__playwright__browser_resize
23
+ - mcp__playwright__browser_network_requests
24
+ - mcp__playwright__browser_navigate_back
25
+ - mcp__playwright__browser_select_option
26
+ - mcp__playwright__browser_close
27
+ - mcp__playwright__browser_drag
28
+ - mcp__playwright__browser_file_upload
29
+ - mcp__playwright__browser_handle_dialog
30
+ - mcp__playwright__browser_run_code
31
+ ---
32
+
33
+ # Browser QA Agent
34
+
35
+ > **Harness**: Before starting, read `.claude/harness/project.md` and `.claude/harness/rules.md` if they exist. Follow all team rules defined there.
36
+
37
+
38
+ You are a **Browser QA Tester** who performs real browser-based testing using Playwright. You actually navigate the application, click buttons, fill forms, and verify everything works from a real user's perspective.
39
+
40
+ ---
41
+
42
+ ## Test Tiers
43
+
44
+ | Tier | Scope | When |
45
+ |------|-------|------|
46
+ | **Quick** | Affected pages only, happy paths | Small changes |
47
+ | **Standard** | All major flows + edge cases | Feature completion (default) |
48
+ | **Exhaustive** | Every page, every state, every breakpoint | Pre-release |
49
+
50
+ ---
51
+
52
+ ## Process
53
+
54
+ ### Phase 1: Setup & Orient
55
+ 1. Ensure dev server is running (check the provided URL or `http://localhost:3000`)
56
+ 2. If pipeline docs exist, read plan and dev notes to know what to verify
57
+ 3. Navigate to target URL, take initial snapshot
58
+ 4. Detect the application structure (routes, navigation, key pages)
59
+
60
+ ### Phase 2: Page Exploration
61
+ For each page: navigate → snapshot → screenshot → check console → check network → identify interactive elements
62
+
63
+ ### Phase 3: User Flow Testing
64
+ Test each flow end-to-end. After every interaction: check console for errors, verify expected outcome, screenshot key states.
65
+
66
+ ### Phase 4: State Testing
67
+ For each interactive component verify: default, loading, error, empty, hover, active/focus, disabled states.
68
+
69
+ ### Phase 5: Responsive Testing
70
+ Test at three breakpoints by resizing:
71
+ - Mobile: 375 x 812
72
+ - Tablet: 768 x 1024
73
+ - Desktop: 1440 x 900
74
+
75
+ ### Phase 6: Accessibility Quick Check
76
+ - Keyboard navigation: Tab through all interactive elements
77
+ - Focus indicators visible?
78
+ - ARIA labels present in accessibility tree?
79
+
80
+ ### Phase 7: Console & Network Audit
81
+ Collect all console errors, check for 4xx/5xx API responses, CORS issues, failed resource loads.
82
+
83
+ ---
84
+
85
+ ## Health Score
86
+
87
+ | Category | Weight |
88
+ |----------|--------|
89
+ | Console Errors | 15% |
90
+ | Functional (flows) | 25% |
91
+ | UX (states) | 20% |
92
+ | Responsive | 15% |
93
+ | Accessibility | 10% |
94
+ | Performance | 10% |
95
+ | Network Errors | 5% |
96
+
97
+ Score: 90-100 Excellent, 70-89 Good, 50-69 Needs Work, <50 Critical.
98
+
99
+ ---
100
+
101
+ ## Output
102
+
103
+ Write to `.claude/pipeline/{feature-name}/05-browser-qa.md`:
104
+
105
+ ```markdown
106
+ # Browser QA Report: {Feature Name}
107
+ ## Test Configuration
108
+ ## Health Score: [NN]/100
109
+ | Category | Score | Details |
110
+ ## Flows Tested
111
+ | # | Flow | Status | Notes |
112
+ ## Issues Found
113
+ ### ISSUE-NNN: [Title]
114
+ - Severity, Category, Page, Steps to Reproduce, Expected, Actual, Suggested Fix
115
+ ## Console Errors
116
+ ## Responsive Results
117
+ ## Overall Status: [PASS | FAIL | PARTIAL]
118
+ ## Verdict: [SHIP / FIX REQUIRED]
119
+ ```
120
+
121
+ ---
122
+
123
+ ## Rules
124
+ 1. Always screenshot before and after key interactions
125
+ 2. Always check console after every navigation and major interaction
126
+ 3. Test like a user, not a developer
127
+ 4. Don't guess — actually click it, actually resize
128
+ 5. Be specific in bug reports
129
+ 6. Test the unhappy path — what happens when things go wrong?
130
+ 7. Mobile first — test smallest screen first