buildcrew 1.0.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/LICENSE +21 -0
- package/README.ko.md +236 -0
- package/README.md +276 -0
- package/agents/browser-qa.md +130 -0
- package/agents/canary-monitor.md +96 -0
- package/agents/constitution.md +546 -0
- package/agents/designer.md +321 -0
- package/agents/developer.md +60 -0
- package/agents/health-checker.md +108 -0
- package/agents/investigator.md +100 -0
- package/agents/planner.md +281 -0
- package/agents/qa-tester.md +83 -0
- package/agents/reviewer.md +97 -0
- package/agents/security-auditor.md +120 -0
- package/agents/shipper.md +94 -0
- package/bin/setup.js +429 -0
- package/package.json +37 -0
package/LICENSE
ADDED
|
@@ -0,0 +1,21 @@
|
|
|
1
|
+
MIT License
|
|
2
|
+
|
|
3
|
+
Copyright (c) 2026 z1nun
|
|
4
|
+
|
|
5
|
+
Permission is hereby granted, free of charge, to any person obtaining a copy
|
|
6
|
+
of this software and associated documentation files (the "Software"), to deal
|
|
7
|
+
in the Software without restriction, including without limitation the rights
|
|
8
|
+
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
|
|
9
|
+
copies of the Software, and to permit persons to whom the Software is
|
|
10
|
+
furnished to do so, subject to the following conditions:
|
|
11
|
+
|
|
12
|
+
The above copyright notice and this permission notice shall be included in all
|
|
13
|
+
copies or substantial portions of the Software.
|
|
14
|
+
|
|
15
|
+
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
|
|
16
|
+
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
|
|
17
|
+
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
|
|
18
|
+
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
|
|
19
|
+
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
|
|
20
|
+
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
|
|
21
|
+
SOFTWARE.
|
package/README.ko.md
ADDED
|
@@ -0,0 +1,236 @@
|
|
|
1
|
+
# buildcrew
|
|
2
|
+
|
|
3
|
+
Claude Code를 위한 11개 AI 에이전트 — 9개 운영 모드로 자동 오케스트레이션되는 개발 팀.
|
|
4
|
+
|
|
5
|
+
```bash
|
|
6
|
+
npx buildcrew
|
|
7
|
+
```
|
|
8
|
+
|
|
9
|
+
> [English](README.md) | **한국어**
|
|
10
|
+
|
|
11
|
+
---
|
|
12
|
+
|
|
13
|
+
## 왜 buildcrew인가?
|
|
14
|
+
|
|
15
|
+
AI 코딩 에이전트는 강력하지만 구조 없이는 일관성 없는 결과를 냅니다. buildcrew는 Claude Code에 다음을 제공합니다:
|
|
16
|
+
|
|
17
|
+
- **팀** — 명확한 역할을 가진 11개 전문 에이전트
|
|
18
|
+
- **프로세스** — 품질 게이트와 반복을 갖춘 순차 파이프라인
|
|
19
|
+
- **하네스** — 프로젝트 컨텍스트, 규칙, 도메인 지식이 모든 에이전트에 내장
|
|
20
|
+
- **오케스트레이터** — `@constitution`에게 말하면 자동으로 적절한 에이전트 배치
|
|
21
|
+
|
|
22
|
+
```
|
|
23
|
+
나: @constitution 유저 대시보드 추가해줘
|
|
24
|
+
크루: 기획자 → 디자이너 → 개발자 → QA → 브라우저 QA → 리뷰어 → 배포
|
|
25
|
+
```
|
|
26
|
+
|
|
27
|
+
외부 의존성 없음. 런타임 없음. 바이너리 없음. 그냥 마크다운.
|
|
28
|
+
|
|
29
|
+
---
|
|
30
|
+
|
|
31
|
+
## 시작하기
|
|
32
|
+
|
|
33
|
+
```bash
|
|
34
|
+
# 1. 에이전트 설치
|
|
35
|
+
npx buildcrew
|
|
36
|
+
|
|
37
|
+
# 2. 프로젝트 하네스 설정 (인터랙티브)
|
|
38
|
+
npx buildcrew init
|
|
39
|
+
|
|
40
|
+
# 3. 추가 컨텍스트 (선택)
|
|
41
|
+
npx buildcrew add erd
|
|
42
|
+
npx buildcrew add architecture
|
|
43
|
+
npx buildcrew add design-system
|
|
44
|
+
|
|
45
|
+
# 4. 사용
|
|
46
|
+
@constitution 유저 대시보드 추가해줘
|
|
47
|
+
```
|
|
48
|
+
|
|
49
|
+
---
|
|
50
|
+
|
|
51
|
+
## 하네스 엔지니어링
|
|
52
|
+
|
|
53
|
+
하네스는 "범용 AI 출력"과 "내 프로젝트에 맞는 코드"의 차이를 만듭니다.
|
|
54
|
+
|
|
55
|
+
```bash
|
|
56
|
+
npx buildcrew init
|
|
57
|
+
```
|
|
58
|
+
|
|
59
|
+
인터랙티브 설정으로 핵심 하네스 파일 생성:
|
|
60
|
+
|
|
61
|
+
```
|
|
62
|
+
[1/3] 프로젝트 컨텍스트
|
|
63
|
+
프로젝트명, 설명, 기술 스택 (자동 감지), 배포 타겟, 프로덕션 URL
|
|
64
|
+
|
|
65
|
+
[2/3] 팀 규칙
|
|
66
|
+
코딩 컨벤션, 우선순위, 금지사항, 품질 기준, 리뷰 규칙
|
|
67
|
+
|
|
68
|
+
[3/3] 도메인 지식
|
|
69
|
+
업종, 유저 타입, 도메인 용어, 비즈니스 룰
|
|
70
|
+
```
|
|
71
|
+
|
|
72
|
+
### 템플릿으로 추가 컨텍스트 제공
|
|
73
|
+
|
|
74
|
+
```bash
|
|
75
|
+
npx buildcrew add erd # DB 스키마 & 관계
|
|
76
|
+
npx buildcrew add architecture # 시스템 아키텍처
|
|
77
|
+
npx buildcrew add api-spec # API 엔드포인트 & 계약
|
|
78
|
+
npx buildcrew add design-system # 색상, 타이포, 컴포넌트
|
|
79
|
+
npx buildcrew add glossary # 도메인 용어 & 유저 역할
|
|
80
|
+
npx buildcrew add user-flow # 유저 여정 & 페이지 맵
|
|
81
|
+
npx buildcrew add env-vars # 환경변수 가이드
|
|
82
|
+
```
|
|
83
|
+
|
|
84
|
+
### 열린 구조
|
|
85
|
+
|
|
86
|
+
템플릿에 제한되지 않습니다. `.claude/harness/`에 아무 `.md` 파일이나 추가하세요:
|
|
87
|
+
|
|
88
|
+
```
|
|
89
|
+
.claude/harness/
|
|
90
|
+
├── project.md ← 핵심 (init으로 생성)
|
|
91
|
+
├── rules.md ← 핵심 (init으로 생성)
|
|
92
|
+
├── erd.md ← 템플릿 (add로 추가)
|
|
93
|
+
├── architecture.md ← 템플릿 (add로 추가)
|
|
94
|
+
├── 내맘대로.md ← 직접 작성 (에이전트가 읽음)
|
|
95
|
+
└── 무엇이든.md ← 아무 .md 파일 가능
|
|
96
|
+
```
|
|
97
|
+
|
|
98
|
+
### 하네스 파일 → 에이전트 라우팅
|
|
99
|
+
|
|
100
|
+
| 파일 | 읽는 에이전트 |
|
|
101
|
+
|------|-------------|
|
|
102
|
+
| `project.md`, `rules.md` | 모든 에이전트 |
|
|
103
|
+
| `erd.md`, `architecture.md`, `api-spec.md` | developer, reviewer, security-auditor, investigator |
|
|
104
|
+
| `design-system.md` | designer |
|
|
105
|
+
| `glossary.md`, `user-flow.md` | planner, designer, browser-qa |
|
|
106
|
+
| `env-vars.md` | developer, security-auditor |
|
|
107
|
+
| 커스텀 파일 | reviewer, security-auditor (전부 읽음) |
|
|
108
|
+
|
|
109
|
+
### 하네스 관리
|
|
110
|
+
|
|
111
|
+
```bash
|
|
112
|
+
npx buildcrew harness # 하네스 파일 상태 확인
|
|
113
|
+
npx buildcrew add # 사용 가능한 템플릿 목록
|
|
114
|
+
npx buildcrew add erd # 특정 템플릿 추가
|
|
115
|
+
npx buildcrew init --force # 핵심 파일 재생성
|
|
116
|
+
```
|
|
117
|
+
|
|
118
|
+
---
|
|
119
|
+
|
|
120
|
+
## 에이전트
|
|
121
|
+
|
|
122
|
+
### 빌드 팀
|
|
123
|
+
|
|
124
|
+
| 에이전트 | 모델 | 역할 |
|
|
125
|
+
|---------|------|------|
|
|
126
|
+
| **planner** | opus | 6가지 강제 질문 + 4관점 자체 리뷰 (CEO, 엔지니어링, 디자인, QA). 관점별 1-10점 채점. |
|
|
127
|
+
| **designer** | opus | 웹에서 UI/UX 레퍼런스 수집, Playwright로 실제 사이트 스크린샷, Figma MCP 통합, 프로덕션 React/Next.js 컴포넌트 작성. AI 슬롭 블랙리스트. |
|
|
128
|
+
| **developer** | sonnet | 기획서 + 디자인 + 하네스 규칙에 따라 구현. |
|
|
129
|
+
|
|
130
|
+
### 품질 팀
|
|
131
|
+
|
|
132
|
+
| 에이전트 | 모델 | 역할 |
|
|
133
|
+
|---------|------|------|
|
|
134
|
+
| **qa-tester** | sonnet | 코드 레벨 검증 — 타입, 린트, 빌드, 수용 기준. |
|
|
135
|
+
| **browser-qa** | sonnet | Playwright MCP 실제 브라우저 테스트 — 유저 플로우, 반응형, 콘솔, 건강 점수 (0-100). |
|
|
136
|
+
| **reviewer** | opus | 4관점 병렬 리뷰 (보안, 성능, 테스트, 유지보수) + 적대적 리뷰 + 자동 수정. |
|
|
137
|
+
| **health-checker** | sonnet | 코드 품질 대시보드 — 7개 카테고리 가중 점수 (0-10) + 트렌드. |
|
|
138
|
+
|
|
139
|
+
### 보안 & 운영 팀
|
|
140
|
+
|
|
141
|
+
| 에이전트 | 모델 | 역할 |
|
|
142
|
+
|---------|------|------|
|
|
143
|
+
| **security-auditor** | opus | OWASP Top 10 + STRIDE 위협 모델. 10단계 감사. |
|
|
144
|
+
| **canary-monitor** | sonnet | 배포 후 프로덕션 헬스 — 페이지, API, 콘솔, 성능 비교. |
|
|
145
|
+
| **shipper** | sonnet | 릴리즈 파이프라인 — 테스트 → 버전 → 체인지로그 → PR. |
|
|
146
|
+
|
|
147
|
+
### 전문가
|
|
148
|
+
|
|
149
|
+
| 에이전트 | 모델 | 역할 |
|
|
150
|
+
|---------|------|------|
|
|
151
|
+
| **investigator** | sonnet | 근본 원인 디버깅. 4단계 조사. 무관한 코드 수정 동결. |
|
|
152
|
+
|
|
153
|
+
---
|
|
154
|
+
|
|
155
|
+
## 9가지 운영 모드
|
|
156
|
+
|
|
157
|
+
`@constitution`에게 자연스럽게 말하세요. 모드를 자동 감지합니다.
|
|
158
|
+
|
|
159
|
+
| 모드 | 예시 | 파이프라인 |
|
|
160
|
+
|------|------|----------|
|
|
161
|
+
| **Feature** | "유저 대시보드 추가해줘" | 기획 → 디자인 → 개발 → QA → 브라우저 QA → 리뷰 |
|
|
162
|
+
| **Project Audit** | "프로젝트 전체 점검해줘" | 스캔 → 우선순위 → 수정 → 검증 (반복) |
|
|
163
|
+
| **Browser QA** | "브라우저 테스트해줘" | Playwright 테스트 + 건강 점수 |
|
|
164
|
+
| **Security** | "보안 점검해줘" | OWASP + STRIDE + 시크릿 + 의존성 |
|
|
165
|
+
| **Debug** | "왜 로그인이 안 돼?" | 4단계 근본 원인 조사 |
|
|
166
|
+
| **Health** | "헬스체크 돌려줘" | 품질 대시보드 (타입, 린트, 의존성, i18n) |
|
|
167
|
+
| **Canary** | "배포 확인해줘" | 프로덕션 모니터링 |
|
|
168
|
+
| **Review** | "코드 리뷰해줘" | 멀티 전문가 + 적대적 + 자동 수정 |
|
|
169
|
+
| **Ship** | "배포해줘" | 테스트 → 버전 → 체인지로그 → PR |
|
|
170
|
+
|
|
171
|
+
### 모드 체이닝
|
|
172
|
+
|
|
173
|
+
Constitution이 다음 모드를 자동 제안:
|
|
174
|
+
- Feature 완료 → Ship 제안
|
|
175
|
+
- Ship 완료 → Canary 제안
|
|
176
|
+
- Canary CRITICAL → Debug 트리거
|
|
177
|
+
|
|
178
|
+
---
|
|
179
|
+
|
|
180
|
+
## 기능 파이프라인
|
|
181
|
+
|
|
182
|
+
각 기능이 완전한 문서 체인을 생성:
|
|
183
|
+
|
|
184
|
+
```
|
|
185
|
+
.claude/pipeline/{기능명}/
|
|
186
|
+
├── 01-plan.md 요구사항 + 4관점 리뷰 점수
|
|
187
|
+
├── 02-references.md 실제 사이트에서 수집한 UI/UX 레퍼런스
|
|
188
|
+
├── 02-design.md 디자인 결정 + 컴포넌트 스펙
|
|
189
|
+
├── 03-dev-notes.md 구현 노트 + 변경 파일
|
|
190
|
+
├── 04-qa-report.md 수용 기준 검증
|
|
191
|
+
├── 05-browser-qa.md 건강 점수 + 스크린샷 + 플로우
|
|
192
|
+
├── 06-review.md 리뷰 발견사항 + 자동 수정
|
|
193
|
+
└── 07-ship.md PR URL + 릴리즈 노트
|
|
194
|
+
```
|
|
195
|
+
|
|
196
|
+
---
|
|
197
|
+
|
|
198
|
+
## CLI 레퍼런스
|
|
199
|
+
|
|
200
|
+
| 명령어 | 설명 |
|
|
201
|
+
|--------|------|
|
|
202
|
+
| `npx buildcrew` | 11개 에이전트 설치 |
|
|
203
|
+
| `npx buildcrew init` | 프로젝트 하네스 설정 (인터랙티브) |
|
|
204
|
+
| `npx buildcrew init --force` | 하네스 재생성 |
|
|
205
|
+
| `npx buildcrew add` | 사용 가능한 템플릿 목록 |
|
|
206
|
+
| `npx buildcrew add <type>` | 하네스 템플릿 추가 |
|
|
207
|
+
| `npx buildcrew harness` | 하네스 파일 상태 확인 |
|
|
208
|
+
| `npx buildcrew --force` | 기존 에이전트 덮어쓰기 |
|
|
209
|
+
| `npx buildcrew --list` | 에이전트 목록 (모델 포함) |
|
|
210
|
+
| `npx buildcrew --uninstall` | 에이전트 제거 |
|
|
211
|
+
| `npx buildcrew --version` | 버전 확인 |
|
|
212
|
+
|
|
213
|
+
## 요구사항
|
|
214
|
+
|
|
215
|
+
- **필수**: [Claude Code](https://claude.ai/code) CLI
|
|
216
|
+
- **선택**: [Playwright MCP](https://github.com/anthropics/anthropic-quickstarts/tree/main/mcp-servers/playwright) — browser-qa, canary-monitor, designer용
|
|
217
|
+
- **선택**: [Figma MCP](https://github.com/anthropics/anthropic-quickstarts/tree/main/mcp-servers/figma) — designer용
|
|
218
|
+
|
|
219
|
+
```bash
|
|
220
|
+
# 실제 브라우저 테스트 활성화
|
|
221
|
+
claude mcp add playwright -- npx @anthropic-ai/mcp-server-playwright
|
|
222
|
+
```
|
|
223
|
+
|
|
224
|
+
## 커스터마이징
|
|
225
|
+
|
|
226
|
+
모든 파일 수정 가능:
|
|
227
|
+
|
|
228
|
+
```
|
|
229
|
+
.claude/agents/ 에이전트 정의 — 역할, 도구, 지시사항, 모델 수정
|
|
230
|
+
.claude/harness/ 프로젝트 컨텍스트 — 언제든 수정, 아무 .md 추가
|
|
231
|
+
.claude/pipeline/ 출력 문서 — 기능별 자동 생성
|
|
232
|
+
```
|
|
233
|
+
|
|
234
|
+
## 라이선스
|
|
235
|
+
|
|
236
|
+
MIT
|
package/README.md
ADDED
|
@@ -0,0 +1,276 @@
|
|
|
1
|
+
# buildcrew
|
|
2
|
+
|
|
3
|
+
> **English** | [한국어](README.ko.md)
|
|
4
|
+
|
|
5
|
+
11 AI agents for Claude Code — auto-orchestrated dev team with 9 operating modes.
|
|
6
|
+
|
|
7
|
+
```bash
|
|
8
|
+
npx buildcrew
|
|
9
|
+
```
|
|
10
|
+
|
|
11
|
+
---
|
|
12
|
+
|
|
13
|
+
## Why buildcrew?
|
|
14
|
+
|
|
15
|
+
AI coding agents are powerful, but without structure they produce inconsistent results. buildcrew gives Claude Code:
|
|
16
|
+
|
|
17
|
+
- **A team** — 11 specialized agents, each with a clear role
|
|
18
|
+
- **A process** — sequential pipeline with quality gates and iteration
|
|
19
|
+
- **A harness** — your project context, rules, and domain knowledge baked in
|
|
20
|
+
- **An orchestrator** — just talk to `@constitution`, it routes automatically
|
|
21
|
+
|
|
22
|
+
```
|
|
23
|
+
You: @constitution Add user authentication
|
|
24
|
+
Crew: Planner → Designer → Developer → QA → Browser QA → Reviewer → Ship
|
|
25
|
+
```
|
|
26
|
+
|
|
27
|
+
No external dependencies. No runtime. No binaries. Just Markdown.
|
|
28
|
+
|
|
29
|
+
---
|
|
30
|
+
|
|
31
|
+
## Getting Started
|
|
32
|
+
|
|
33
|
+
```bash
|
|
34
|
+
# 1. Install agents
|
|
35
|
+
npx buildcrew
|
|
36
|
+
|
|
37
|
+
# 2. Set up project harness (interactive)
|
|
38
|
+
npx buildcrew init
|
|
39
|
+
|
|
40
|
+
# 3. Add more context (optional)
|
|
41
|
+
npx buildcrew add erd
|
|
42
|
+
npx buildcrew add architecture
|
|
43
|
+
npx buildcrew add design-system
|
|
44
|
+
|
|
45
|
+
# 4. Start working
|
|
46
|
+
@constitution Add user dashboard
|
|
47
|
+
```
|
|
48
|
+
|
|
49
|
+
---
|
|
50
|
+
|
|
51
|
+
## Harness Engineering
|
|
52
|
+
|
|
53
|
+
The harness is what separates "generic AI output" from "code that fits your project."
|
|
54
|
+
|
|
55
|
+
```bash
|
|
56
|
+
npx buildcrew init
|
|
57
|
+
```
|
|
58
|
+
|
|
59
|
+
Interactive setup creates core harness files:
|
|
60
|
+
|
|
61
|
+
```
|
|
62
|
+
[1/3] Project Context
|
|
63
|
+
Project name, description, tech stack (auto-detected), deploy target, production URL
|
|
64
|
+
|
|
65
|
+
[2/3] Team Rules
|
|
66
|
+
Coding conventions, priorities, what to avoid, quality standards, review rules
|
|
67
|
+
|
|
68
|
+
[3/3] Domain Knowledge
|
|
69
|
+
Industry, user types, domain terms, business rules
|
|
70
|
+
```
|
|
71
|
+
|
|
72
|
+
### Add more context with templates
|
|
73
|
+
|
|
74
|
+
```bash
|
|
75
|
+
npx buildcrew add erd # Database schema & relationships
|
|
76
|
+
npx buildcrew add architecture # System architecture overview
|
|
77
|
+
npx buildcrew add api-spec # API endpoints & contracts
|
|
78
|
+
npx buildcrew add design-system # Colors, typography, components
|
|
79
|
+
npx buildcrew add glossary # Domain terms & user roles
|
|
80
|
+
npx buildcrew add user-flow # User journeys & page map
|
|
81
|
+
npx buildcrew add env-vars # Environment variables guide
|
|
82
|
+
```
|
|
83
|
+
|
|
84
|
+
### The harness is open
|
|
85
|
+
|
|
86
|
+
Not limited to templates. Create any `.md` file in `.claude/harness/`:
|
|
87
|
+
|
|
88
|
+
```
|
|
89
|
+
.claude/harness/
|
|
90
|
+
├── project.md ← core (auto by init)
|
|
91
|
+
├── rules.md ← core (auto by init)
|
|
92
|
+
├── erd.md ← template
|
|
93
|
+
├── architecture.md ← template
|
|
94
|
+
├── design-system.md ← template
|
|
95
|
+
├── glossary.md ← template
|
|
96
|
+
├── my-custom-notes.md ← your own file (agents read it too)
|
|
97
|
+
└── anything.md ← any .md file works
|
|
98
|
+
```
|
|
99
|
+
|
|
100
|
+
All agents read harness files before every task. Constitution routes the right files to the right agents:
|
|
101
|
+
|
|
102
|
+
| File | Routed to |
|
|
103
|
+
|------|-----------|
|
|
104
|
+
| `project.md`, `rules.md` | All agents |
|
|
105
|
+
| `erd.md`, `architecture.md`, `api-spec.md` | developer, reviewer, security-auditor, investigator |
|
|
106
|
+
| `design-system.md` | designer |
|
|
107
|
+
| `glossary.md`, `user-flow.md` | planner, designer, browser-qa |
|
|
108
|
+
| `env-vars.md` | developer, security-auditor |
|
|
109
|
+
| Custom files | reviewer, security-auditor (read ALL) |
|
|
110
|
+
|
|
111
|
+
### Manage the harness
|
|
112
|
+
|
|
113
|
+
```bash
|
|
114
|
+
npx buildcrew harness # Show status of all harness files
|
|
115
|
+
npx buildcrew add # List available templates
|
|
116
|
+
npx buildcrew add erd # Add a specific template
|
|
117
|
+
npx buildcrew init --force # Regenerate core files from scratch
|
|
118
|
+
```
|
|
119
|
+
|
|
120
|
+
---
|
|
121
|
+
|
|
122
|
+
## Agents
|
|
123
|
+
|
|
124
|
+
### Build Team
|
|
125
|
+
|
|
126
|
+
| Agent | Model | Role |
|
|
127
|
+
|-------|-------|------|
|
|
128
|
+
| **planner** | opus | 6 Forcing Questions + 4-Lens Self-Review (CEO, Engineering, Design, QA). Produces battle-tested plans scored 1-10 per lens. |
|
|
129
|
+
| **designer** | opus | Searches web for UI/UX references, screenshots real sites with Playwright, integrates Figma MCP, writes production React/Next.js components. AI slop blacklist enforced. |
|
|
130
|
+
| **developer** | sonnet | Implements features following plan + design + harness conventions. Self-reviews before handoff. |
|
|
131
|
+
|
|
132
|
+
### Quality Team
|
|
133
|
+
|
|
134
|
+
| Agent | Model | Role |
|
|
135
|
+
|-------|-------|------|
|
|
136
|
+
| **qa-tester** | sonnet | Code-level verification — types, lint, build, acceptance criteria check. |
|
|
137
|
+
| **browser-qa** | sonnet | Real browser testing via Playwright MCP — user flows, responsive (375/768/1440px), console errors, network, health score (0-100). |
|
|
138
|
+
| **reviewer** | opus | 4-specialist parallel review (security, performance, testing, maintainability) + adversarial pass + auto-fix. Fix-first approach. |
|
|
139
|
+
| **health-checker** | sonnet | Code quality dashboard — 7-category weighted score (0-10), trend tracking, top 5 actionable items. |
|
|
140
|
+
|
|
141
|
+
### Security & Ops
|
|
142
|
+
|
|
143
|
+
| Agent | Model | Role |
|
|
144
|
+
|-------|-------|------|
|
|
145
|
+
| **security-auditor** | opus | OWASP Top 10 + STRIDE threat model. 10-phase audit. Secrets scan, dependency audit, AI/LLM security. Confidence gate (8/10 standard, 2/10 comprehensive). |
|
|
146
|
+
| **canary-monitor** | sonnet | Post-deploy health — page availability, API status, console errors, performance vs baseline. |
|
|
147
|
+
| **shipper** | sonnet | Release pipeline — pre-flight checks → version bump → changelog → commit → push → PR creation. |
|
|
148
|
+
|
|
149
|
+
### Specialist
|
|
150
|
+
|
|
151
|
+
| Agent | Model | Role |
|
|
152
|
+
|-------|-------|------|
|
|
153
|
+
| **investigator** | sonnet | Root cause debugging. 4 phases: investigate → hypothesize → test → fix. Edit freeze on unrelated code. "No fix without root cause." |
|
|
154
|
+
|
|
155
|
+
---
|
|
156
|
+
|
|
157
|
+
## 9 Operating Modes
|
|
158
|
+
|
|
159
|
+
Talk to `@constitution` naturally. It auto-detects the mode.
|
|
160
|
+
|
|
161
|
+
| Mode | Example | What happens |
|
|
162
|
+
|------|---------|-------------|
|
|
163
|
+
| **Feature** | "Add user dashboard" | Plan → Design → Dev → QA → Browser QA → Review |
|
|
164
|
+
| **Project Audit** | "full project audit" | Scan → Prioritize → Fix → Verify (loop) |
|
|
165
|
+
| **Browser QA** | "browser qa localhost:3000" | Playwright tests flows, responsive, console |
|
|
166
|
+
| **Security** | "security audit" | OWASP + STRIDE + secrets + deps |
|
|
167
|
+
| **Debug** | "debug: login broken" | 4-phase root cause investigation |
|
|
168
|
+
| **Health** | "health check" | Quality dashboard (types, lint, deps, i18n) |
|
|
169
|
+
| **Canary** | "canary https://myapp.com" | Post-deploy production monitoring |
|
|
170
|
+
| **Review** | "code review" | Multi-specialist analysis + auto-fix |
|
|
171
|
+
| **Ship** | "ship" | Test → version → changelog → PR |
|
|
172
|
+
|
|
173
|
+
### Mode chaining
|
|
174
|
+
|
|
175
|
+
Constitution auto-suggests the next mode:
|
|
176
|
+
- Feature complete → Ship
|
|
177
|
+
- Ship complete → Canary
|
|
178
|
+
- Canary CRITICAL → Debug
|
|
179
|
+
|
|
180
|
+
---
|
|
181
|
+
|
|
182
|
+
## Feature Pipeline
|
|
183
|
+
|
|
184
|
+
Each feature generates a full document chain:
|
|
185
|
+
|
|
186
|
+
```
|
|
187
|
+
.claude/pipeline/{feature}/
|
|
188
|
+
├── 01-plan.md Requirements + 4-lens review scores
|
|
189
|
+
├── 02-references.md Curated UI/UX references from real sites
|
|
190
|
+
├── 02-design.md Design decisions + component specs
|
|
191
|
+
├── 03-dev-notes.md Implementation notes + files changed
|
|
192
|
+
├── 04-qa-report.md Acceptance criteria verification
|
|
193
|
+
├── 05-browser-qa.md Health score + screenshots + flow results
|
|
194
|
+
├── 06-review.md Review findings + auto-fixes applied
|
|
195
|
+
└── 07-ship.md PR URL + release notes
|
|
196
|
+
```
|
|
197
|
+
|
|
198
|
+
---
|
|
199
|
+
|
|
200
|
+
## CLI Reference
|
|
201
|
+
|
|
202
|
+
| Command | Description |
|
|
203
|
+
|---------|-------------|
|
|
204
|
+
| `npx buildcrew` | Install 11 agents to `.claude/agents/` |
|
|
205
|
+
| `npx buildcrew init` | Set up project harness (interactive) |
|
|
206
|
+
| `npx buildcrew init --force` | Regenerate harness from scratch |
|
|
207
|
+
| `npx buildcrew add` | List available harness templates |
|
|
208
|
+
| `npx buildcrew add <type>` | Add a harness template (erd, architecture, etc.) |
|
|
209
|
+
| `npx buildcrew harness` | Show status of all harness files |
|
|
210
|
+
| `npx buildcrew --force` | Overwrite existing agent files |
|
|
211
|
+
| `npx buildcrew --list` | List all agents with models |
|
|
212
|
+
| `npx buildcrew --uninstall` | Remove installed agents |
|
|
213
|
+
| `npx buildcrew --version` | Show version |
|
|
214
|
+
|
|
215
|
+
## Requirements
|
|
216
|
+
|
|
217
|
+
- **Required**: [Claude Code](https://claude.ai/code) CLI
|
|
218
|
+
- **Optional**: [Playwright MCP](https://github.com/anthropics/anthropic-quickstarts/tree/main/mcp-servers/playwright) — for browser-qa, canary-monitor, designer
|
|
219
|
+
- **Optional**: [Figma MCP](https://github.com/anthropics/anthropic-quickstarts/tree/main/mcp-servers/figma) — for designer
|
|
220
|
+
|
|
221
|
+
```bash
|
|
222
|
+
# Enable real browser testing
|
|
223
|
+
claude mcp add playwright -- npx @anthropic-ai/mcp-server-playwright
|
|
224
|
+
```
|
|
225
|
+
|
|
226
|
+
## Customization
|
|
227
|
+
|
|
228
|
+
Every file is editable:
|
|
229
|
+
|
|
230
|
+
```
|
|
231
|
+
.claude/agents/ Agent definitions — edit roles, tools, instructions, model
|
|
232
|
+
.claude/harness/ Project context — edit anytime, add any .md file
|
|
233
|
+
.claude/pipeline/ Output documents — auto-generated per feature
|
|
234
|
+
```
|
|
235
|
+
|
|
236
|
+
Change an agent's model:
|
|
237
|
+
```yaml
|
|
238
|
+
# In any agent .md file
|
|
239
|
+
model: opus # or sonnet, haiku
|
|
240
|
+
```
|
|
241
|
+
|
|
242
|
+
## Architecture
|
|
243
|
+
|
|
244
|
+
```
|
|
245
|
+
@constitution (orchestrator, opus)
|
|
246
|
+
│
|
|
247
|
+
├─ reads .claude/harness/*.md (project context)
|
|
248
|
+
│
|
|
249
|
+
├─ detects mode from user message
|
|
250
|
+
│
|
|
251
|
+
├─ dispatches agents with relevant harness context
|
|
252
|
+
│
|
|
253
|
+
└─ enforces quality gates + iteration
|
|
254
|
+
│
|
|
255
|
+
├── Build: planner → designer → developer
|
|
256
|
+
├── Quality: qa-tester → browser-qa → reviewer
|
|
257
|
+
├── Sec/Ops: security-auditor, canary-monitor, shipper
|
|
258
|
+
└── Debug: investigator
|
|
259
|
+
```
|
|
260
|
+
|
|
261
|
+
## Compared to gstack
|
|
262
|
+
|
|
263
|
+
| | buildcrew | gstack |
|
|
264
|
+
|---|-----------|--------|
|
|
265
|
+
| Install | `npx buildcrew` | `git clone` + Bun + `./setup` |
|
|
266
|
+
| Orchestration | Auto (constitution routes) | Manual (`/qa`, `/review`, ...) |
|
|
267
|
+
| Harness | `init` + `add` + custom .md | Manual CLAUDE.md |
|
|
268
|
+
| Dependencies | None | Bun, Playwright binary (~58MB) |
|
|
269
|
+
| Browser testing | Playwright MCP | Custom Playwright daemon |
|
|
270
|
+
| Pipeline docs | Auto-generated chain (01→07) | Per-skill output |
|
|
271
|
+
| Agents | 11 (5 opus, 6 sonnet) | 34 |
|
|
272
|
+
| Customization | Edit .md directly, never overwritten | May overwrite on update |
|
|
273
|
+
|
|
274
|
+
## License
|
|
275
|
+
|
|
276
|
+
MIT
|
|
@@ -0,0 +1,130 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: browser-qa
|
|
3
|
+
description: Browser QA agent - performs real browser testing using Playwright MCP, captures screenshots, tests user flows, checks console errors, and verifies responsive design
|
|
4
|
+
model: sonnet
|
|
5
|
+
tools:
|
|
6
|
+
- Read
|
|
7
|
+
- Write
|
|
8
|
+
- Glob
|
|
9
|
+
- Grep
|
|
10
|
+
- Bash
|
|
11
|
+
- mcp__playwright__browser_navigate
|
|
12
|
+
- mcp__playwright__browser_click
|
|
13
|
+
- mcp__playwright__browser_fill_form
|
|
14
|
+
- mcp__playwright__browser_snapshot
|
|
15
|
+
- mcp__playwright__browser_take_screenshot
|
|
16
|
+
- mcp__playwright__browser_evaluate
|
|
17
|
+
- mcp__playwright__browser_console_messages
|
|
18
|
+
- mcp__playwright__browser_hover
|
|
19
|
+
- mcp__playwright__browser_press_key
|
|
20
|
+
- mcp__playwright__browser_wait_for
|
|
21
|
+
- mcp__playwright__browser_tabs
|
|
22
|
+
- mcp__playwright__browser_resize
|
|
23
|
+
- mcp__playwright__browser_network_requests
|
|
24
|
+
- mcp__playwright__browser_navigate_back
|
|
25
|
+
- mcp__playwright__browser_select_option
|
|
26
|
+
- mcp__playwright__browser_close
|
|
27
|
+
- mcp__playwright__browser_drag
|
|
28
|
+
- mcp__playwright__browser_file_upload
|
|
29
|
+
- mcp__playwright__browser_handle_dialog
|
|
30
|
+
- mcp__playwright__browser_run_code
|
|
31
|
+
---
|
|
32
|
+
|
|
33
|
+
# Browser QA Agent
|
|
34
|
+
|
|
35
|
+
> **Harness**: Before starting, read `.claude/harness/project.md` and `.claude/harness/rules.md` if they exist. Follow all team rules defined there.
|
|
36
|
+
|
|
37
|
+
|
|
38
|
+
You are a **Browser QA Tester** who performs real browser-based testing using Playwright. You actually navigate the application, click buttons, fill forms, and verify everything works from a real user's perspective.
|
|
39
|
+
|
|
40
|
+
---
|
|
41
|
+
|
|
42
|
+
## Test Tiers
|
|
43
|
+
|
|
44
|
+
| Tier | Scope | When |
|
|
45
|
+
|------|-------|------|
|
|
46
|
+
| **Quick** | Affected pages only, happy paths | Small changes |
|
|
47
|
+
| **Standard** | All major flows + edge cases | Feature completion (default) |
|
|
48
|
+
| **Exhaustive** | Every page, every state, every breakpoint | Pre-release |
|
|
49
|
+
|
|
50
|
+
---
|
|
51
|
+
|
|
52
|
+
## Process
|
|
53
|
+
|
|
54
|
+
### Phase 1: Setup & Orient
|
|
55
|
+
1. Ensure dev server is running (check the provided URL or `http://localhost:3000`)
|
|
56
|
+
2. If pipeline docs exist, read plan and dev notes to know what to verify
|
|
57
|
+
3. Navigate to target URL, take initial snapshot
|
|
58
|
+
4. Detect the application structure (routes, navigation, key pages)
|
|
59
|
+
|
|
60
|
+
### Phase 2: Page Exploration
|
|
61
|
+
For each page: navigate → snapshot → screenshot → check console → check network → identify interactive elements
|
|
62
|
+
|
|
63
|
+
### Phase 3: User Flow Testing
|
|
64
|
+
Test each flow end-to-end. After every interaction: check console for errors, verify expected outcome, screenshot key states.
|
|
65
|
+
|
|
66
|
+
### Phase 4: State Testing
|
|
67
|
+
For each interactive component verify: default, loading, error, empty, hover, active/focus, disabled states.
|
|
68
|
+
|
|
69
|
+
### Phase 5: Responsive Testing
|
|
70
|
+
Test at three breakpoints by resizing:
|
|
71
|
+
- Mobile: 375 x 812
|
|
72
|
+
- Tablet: 768 x 1024
|
|
73
|
+
- Desktop: 1440 x 900
|
|
74
|
+
|
|
75
|
+
### Phase 6: Accessibility Quick Check
|
|
76
|
+
- Keyboard navigation: Tab through all interactive elements
|
|
77
|
+
- Focus indicators visible?
|
|
78
|
+
- ARIA labels present in accessibility tree?
|
|
79
|
+
|
|
80
|
+
### Phase 7: Console & Network Audit
|
|
81
|
+
Collect all console errors, check for 4xx/5xx API responses, CORS issues, failed resource loads.
|
|
82
|
+
|
|
83
|
+
---
|
|
84
|
+
|
|
85
|
+
## Health Score
|
|
86
|
+
|
|
87
|
+
| Category | Weight |
|
|
88
|
+
|----------|--------|
|
|
89
|
+
| Console Errors | 15% |
|
|
90
|
+
| Functional (flows) | 25% |
|
|
91
|
+
| UX (states) | 20% |
|
|
92
|
+
| Responsive | 15% |
|
|
93
|
+
| Accessibility | 10% |
|
|
94
|
+
| Performance | 10% |
|
|
95
|
+
| Network Errors | 5% |
|
|
96
|
+
|
|
97
|
+
Score: 90-100 Excellent, 70-89 Good, 50-69 Needs Work, <50 Critical.
|
|
98
|
+
|
|
99
|
+
---
|
|
100
|
+
|
|
101
|
+
## Output
|
|
102
|
+
|
|
103
|
+
Write to `.claude/pipeline/{feature-name}/05-browser-qa.md`:
|
|
104
|
+
|
|
105
|
+
```markdown
|
|
106
|
+
# Browser QA Report: {Feature Name}
|
|
107
|
+
## Test Configuration
|
|
108
|
+
## Health Score: [NN]/100
|
|
109
|
+
| Category | Score | Details |
|
|
110
|
+
## Flows Tested
|
|
111
|
+
| # | Flow | Status | Notes |
|
|
112
|
+
## Issues Found
|
|
113
|
+
### ISSUE-NNN: [Title]
|
|
114
|
+
- Severity, Category, Page, Steps to Reproduce, Expected, Actual, Suggested Fix
|
|
115
|
+
## Console Errors
|
|
116
|
+
## Responsive Results
|
|
117
|
+
## Overall Status: [PASS | FAIL | PARTIAL]
|
|
118
|
+
## Verdict: [SHIP / FIX REQUIRED]
|
|
119
|
+
```
|
|
120
|
+
|
|
121
|
+
---
|
|
122
|
+
|
|
123
|
+
## Rules
|
|
124
|
+
1. Always screenshot before and after key interactions
|
|
125
|
+
2. Always check console after every navigation and major interaction
|
|
126
|
+
3. Test like a user, not a developer
|
|
127
|
+
4. Don't guess — actually click it, actually resize
|
|
128
|
+
5. Be specific in bug reports
|
|
129
|
+
6. Test the unhappy path — what happens when things go wrong?
|
|
130
|
+
7. Mobile first — test smallest screen first
|