buildcrew 1.4.1 → 1.5.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.ko.md CHANGED
@@ -118,7 +118,7 @@ echo "# 메모" > .claude/harness/내메모.md # 직접 생성도 가능
118
118
  | 에이전트 | 모델 | 역할 |
119
119
  |---------|------|------|
120
120
  | **planner** | opus | 6가지 강제 질문 + 4관점 자체 리뷰 (CEO, 엔지니어링, 디자인, QA). 관점별 1-10점. |
121
- | **designer** | opus | 웹에서 UI/UX 레퍼런스 수집 → Playwright 스크린샷 → Figma MCP → 프로덕션 컴포넌트. AI 슬롭 블랙리스트. |
121
+ | **designer** | opus | UI/UX 레퍼런스 수집 + 모션 엔지니어링 → Playwright 스크린샷 → Figma MCP → 애니메이션, 스크롤 이펙트, 제스처가 포함된 프로덕션 컴포넌트. AI 슬롭 블랙리스트. |
122
122
  | **developer** | sonnet | 기획서 + 디자인 + 하네스 규칙에 따라 구현. |
123
123
 
124
124
  ### 품질 팀
@@ -162,6 +162,14 @@ echo "# 메모" > .claude/harness/내메모.md # 직접 생성도 가능
162
162
  | **Review** | "코드 리뷰해줘" | 멀티 전문가 + 적대적 + 자동 수정 |
163
163
  | **Ship** | "배포해줘" | 테스트 → 버전 → 체인지로그 → PR |
164
164
 
165
+ ### 반복 (Iterations)
166
+
167
+ 매 반복마다 **전체 파이프라인**을 처음부터 다시 실행 — 기획자가 이전 결과를 리뷰하고 계획을 수정합니다:
168
+
169
+ ```
170
+ @buildcrew 유저 대시보드 추가해줘, 5 iterations
171
+ ```
172
+
165
173
  ### 모드 체이닝
166
174
 
167
175
  Feature 완료 → Ship → Canary. Canary CRITICAL → Debug.
@@ -188,7 +196,7 @@ Feature 완료 → Ship → Canary. Canary CRITICAL → Debug.
188
196
 
189
197
  | 명령어 | 설명 |
190
198
  |--------|------|
191
- | `npx buildcrew` | 11개 에이전트 설치 |
199
+ | `npx buildcrew` | 에이전트 설치 (11 + 오케스트레이터) |
192
200
  | `npx buildcrew init` | 하네스 자동 생성 (질문 없음) |
193
201
  | `npx buildcrew init --force` | 하네스 재생성 |
194
202
  | `npx buildcrew add` | 템플릿 목록 |
@@ -204,6 +212,22 @@ Feature 완료 → Ship → Canary. Canary CRITICAL → Debug.
204
212
  - **선택**: [Playwright MCP](https://github.com/anthropics/anthropic-quickstarts/tree/main/mcp-servers/playwright) — browser-qa, canary-monitor, designer
205
213
  - **선택**: [Figma MCP](https://github.com/anthropics/anthropic-quickstarts/tree/main/mcp-servers/figma) — designer
206
214
 
215
+ ## 실시간 상태 로그
216
+
217
+ 모든 에이전트가 이모지 태그된 진행 로그를 출력합니다:
218
+
219
+ ```
220
+ 📋 PLANNER — "유저 대시보드" 요구사항 분석 시작
221
+ 🔎 4관점 자체 리뷰...
222
+ 🏢 CEO: 8/10 ⚙️ 엔지니어링: 9/10
223
+ ✅ PLANNER — 완료
224
+
225
+ 🎨 DESIGNER — UI/UX 디자인 시작...
226
+ 💻 DEVELOPER — 구현 시작...
227
+ 🧪 QA TESTER — 11/12 통과, 1건 발견
228
+ 🔬 REVIEWER — 승인
229
+ ```
230
+
207
231
  ## 라이선스
208
232
 
209
233
  MIT
package/README.md CHANGED
@@ -118,7 +118,7 @@ echo "# Notes" > .claude/harness/my-notes.md # Or create your own
118
118
  | Agent | Model | Role |
119
119
  |-------|-------|------|
120
120
  | **planner** | opus | 6 Forcing Questions + 4-Lens Self-Review (CEO, Engineering, Design, QA). Plans scored 1-10 per lens. |
121
- | **designer** | opus | Web research for UI/UX references → Playwright screenshots → Figma MCP → production components. AI slop blacklist. |
121
+ | **designer** | opus | UI/UX research + motion engineering → Playwright screenshots → Figma MCP → production components with animations, scroll effects, gestures. AI slop blacklist. |
122
122
  | **developer** | sonnet | Implements features following plan + design + harness conventions. |
123
123
 
124
124
  ### Quality Team
@@ -162,6 +162,14 @@ Talk to `@buildcrew` naturally. It auto-detects the mode.
162
162
  | **Review** | "code review" | Multi-specialist analysis + auto-fix |
163
163
  | **Ship** | "ship" | Test → version → changelog → PR |
164
164
 
165
+ ### Iterations
166
+
167
+ Each iteration runs the **full end-to-end pipeline** — planner re-evaluates, designer refines, developer fixes, QA re-verifies:
168
+
169
+ ```
170
+ @buildcrew Add user dashboard, 5 iterations
171
+ ```
172
+
165
173
  ### Mode chaining
166
174
 
167
175
  Auto-suggests the next mode:
@@ -192,7 +200,7 @@ Each feature generates a full document chain:
192
200
 
193
201
  | Command | Description |
194
202
  |---------|-------------|
195
- | `npx buildcrew` | Install 11 agents |
203
+ | `npx buildcrew` | Install agents (11 + orchestrator) |
196
204
  | `npx buildcrew init` | Auto-generate harness (zero questions) |
197
205
  | `npx buildcrew init --force` | Regenerate harness |
198
206
  | `npx buildcrew add` | List harness templates |
@@ -222,6 +230,25 @@ claude mcp add playwright -- npx @anthropic-ai/mcp-server-playwright
222
230
  .claude/pipeline/ Output documents — auto-generated per feature
223
231
  ```
224
232
 
233
+ ## Real-time Status
234
+
235
+ Every agent outputs emoji-tagged progress logs so you can track what's happening:
236
+
237
+ ```
238
+ 📋 PLANNER — Starting requirements analysis for "user dashboard"
239
+ 🔍 Reading project harness...
240
+ 🧠 Phase 1: Asking 6 Forcing Questions...
241
+ 🔎 Phase 3: 4-Lens Self-Review...
242
+ 🏢 CEO Review: 8/10
243
+ ⚙️ Engineering Review: 9/10
244
+ ✅ PLANNER — Complete (avg score: 8.5/10)
245
+
246
+ 🎨 DESIGNER — Starting UI/UX design...
247
+ 💻 DEVELOPER — Starting implementation...
248
+ 🧪 QA TESTER — 11/12 passed, 1 issue found
249
+ 🔬 REVIEWER — APPROVE
250
+ ```
251
+
225
252
  ## Architecture
226
253
 
227
254
  ```
@@ -230,7 +257,7 @@ claude mcp add playwright -- npx @anthropic-ai/mcp-server-playwright
230
257
  ├─ reads .claude/harness/*.md
231
258
  ├─ detects mode from user message
232
259
  ├─ dispatches agents with harness context
233
- └─ enforces quality gates + iteration
260
+ └─ enforces quality gates + full end-to-end iteration
234
261
 
235
262
  ├── Build: planner → designer → developer
236
263
  ├── Quality: qa-tester → browser-qa → reviewer
@@ -34,6 +34,25 @@ tools:
34
34
 
35
35
  > **Harness**: Before starting, read `.claude/harness/project.md` and `.claude/harness/rules.md` if they exist. Follow all team rules defined there.
36
36
 
37
+ ## Status Output (Required)
38
+
39
+ Output emoji-tagged status messages at each major step:
40
+
41
+ ```
42
+ 🌐 BROWSER QA — Starting browser testing for "{feature}"
43
+ 🖥️ Testing desktop (1440px)...
44
+ 📸 Screenshot captured
45
+ 🔗 Testing user flows...
46
+ 🔍 Checking console errors...
47
+ 📱 Testing tablet (768px)...
48
+ 📲 Testing mobile (375px)...
49
+ ♿ Accessibility check...
50
+ 📊 Health Score: 85/100
51
+ 📄 Writing → 05-browser-qa.md
52
+ ✅ BROWSER QA — Complete (score: 85/100, {issues} issues)
53
+ ```
54
+
55
+ ---
37
56
 
38
57
  You are a **Browser QA Tester** who performs real browser-based testing using Playwright. You actually navigate the application, click buttons, fill forms, and verify everything works from a real user's perspective.
39
58
 
@@ -25,6 +25,21 @@ tools:
25
25
 
26
26
  > **Harness**: Before starting, read `.claude/harness/project.md` and `.claude/harness/rules.md` if they exist. Follow all team rules defined there.
27
27
 
28
+ ## Status Output (Required)
29
+
30
+ Output emoji-tagged status messages at each major step:
31
+
32
+ ```
33
+ 🐤 CANARY MONITOR — Checking production health
34
+ 🌐 Checking page availability...
35
+ 🔌 Checking API endpoints...
36
+ 🔍 Checking console errors...
37
+ ⚡ Measuring performance vs baseline...
38
+ 📄 Writing → canary-report.md
39
+ ✅ CANARY — HEALTHY / ⚠️ DEGRADED / 🚨 CRITICAL
40
+ ```
41
+
42
+ ---
28
43
 
29
44
  You are a **Production Health Monitor** who verifies that a deployment is healthy by checking the live site.
30
45
 
@@ -31,6 +31,26 @@ tools:
31
31
 
32
32
  > **Harness**: Before starting, read `.claude/harness/project.md` and `.claude/harness/rules.md` if they exist. Follow all team rules defined there.
33
33
 
34
+ ## Status Output (Required)
35
+
36
+ Output emoji-tagged status messages at each major step:
37
+
38
+ ```
39
+ 🎨 DESIGNER — Starting UI/UX design for "{feature}"
40
+ 🔍 Phase 1: Researching references...
41
+ 🌐 Searching web for inspiration...
42
+ 📸 Screenshotting reference sites...
43
+ 🎯 Analyzing project's existing UI...
44
+ 🧠 Phase 2: Making design decisions...
45
+ 🛠️ Phase 3: Writing production components...
46
+ ⚡ Adding motion & interactions...
47
+ ✔️ Phase 4: Self-review checklist...
48
+ 📄 Writing → 02-references.md, 02-design.md
49
+ ✅ DESIGNER — Complete ({N} components created)
50
+ ```
51
+
52
+ ---
53
+
34
54
  You are a **Senior UI/UX Designer, Motion Engineer & Front-end Developer** who researches real-world references, designs with intention, choreographs animations, and ships production-ready interactive UI components. You don't guess at design — you research, validate, then build. Static layouts are incomplete — every interface you build feels alive with purposeful motion and interaction.
35
55
 
36
56
  ---
@@ -15,6 +15,23 @@ tools:
15
15
 
16
16
  > **Harness**: Before starting, read `.claude/harness/project.md` and `.claude/harness/rules.md` if they exist. Follow all team rules defined there.
17
17
 
18
+ ## Status Output (Required)
19
+
20
+ Output emoji-tagged status messages at each major step:
21
+
22
+ ```
23
+ 💻 DEVELOPER — Starting implementation for "{feature}"
24
+ 📖 Reading plan (01-plan.md) and design (02-design.md)...
25
+ 🏗️ Implementing...
26
+ 📁 Creating src/components/FeatureName/...
27
+ 🔌 Wiring up API routes...
28
+ 🎨 Applying design specs...
29
+ 🔍 Self-reviewing code...
30
+ 📄 Writing → 03-dev-notes.md
31
+ ✅ DEVELOPER — Complete ({N} files changed)
32
+ ```
33
+
34
+ ---
18
35
 
19
36
  You are a **Senior Developer** responsible for implementing features based on the plan and design documents.
20
37
 
@@ -14,6 +14,26 @@ tools:
14
14
 
15
15
  > **Harness**: Before starting, read `.claude/harness/project.md` and `.claude/harness/rules.md` if they exist. Follow all team rules defined there.
16
16
 
17
+ ## Status Output (Required)
18
+
19
+ Output emoji-tagged status messages at each major step:
20
+
21
+ ```
22
+ 🏥 HEALTH CHECKER — Starting code health analysis
23
+ 📊 Running quality tools...
24
+ 🔤 TypeScript: checking types...
25
+ 🧹 ESLint: checking lint rules...
26
+ 📦 Bundle: analyzing size...
27
+ 🌍 i18n: checking translations...
28
+ ♿ Accessibility: checking a11y...
29
+ 📁 Dependencies: checking outdated...
30
+ 🧪 Tests: checking coverage...
31
+ 📈 Computing weighted score...
32
+ 📄 Writing → health-report.md
33
+ ✅ HEALTH CHECKER — Score: 7.8/10 (↑0.3 from last check)
34
+ ```
35
+
36
+ ---
17
37
 
18
38
  You are a **Code Health Inspector** who runs every available quality tool, computes a composite health score (0-10), and tracks trends over time.
19
39
 
@@ -15,6 +15,25 @@ tools:
15
15
 
16
16
  > **Harness**: Before starting, read `.claude/harness/project.md` and `.claude/harness/rules.md` if they exist. Follow all team rules defined there.
17
17
 
18
+ ## Status Output (Required)
19
+
20
+ Output emoji-tagged status messages at each major step:
21
+
22
+ ```
23
+ 🔎 INVESTIGATOR — Starting root cause analysis for "{bug}"
24
+ 🧩 Phase 1: Gathering evidence...
25
+ 🧠 Phase 2: Forming hypotheses...
26
+ 💡 Hypothesis A: ...
27
+ 💡 Hypothesis B: ...
28
+ 🧪 Phase 3: Testing hypotheses...
29
+ ❌ Hypothesis A — disproven
30
+ ✅ Hypothesis B — confirmed
31
+ 🔧 Phase 4: Implementing fix...
32
+ 📄 Writing → investigation.md
33
+ ✅ INVESTIGATOR — Root cause found & fixed
34
+ ```
35
+
36
+ ---
18
37
 
19
38
  You are a **Senior Debugger** who follows one iron law: **no fix without root cause**.
20
39
 
package/agents/planner.md CHANGED
@@ -16,6 +16,25 @@ tools:
16
16
 
17
17
  > **Harness**: Before starting, read `.claude/harness/project.md` and `.claude/harness/rules.md` if they exist. Follow all team rules defined there.
18
18
 
19
+ ## Status Output (Required)
20
+
21
+ Output emoji-tagged status messages at each major step so the user can track progress:
22
+
23
+ ```
24
+ 📋 PLANNER — Starting requirements analysis for "{feature}"
25
+ 🔍 Reading project harness...
26
+ 🧠 Phase 1: Asking 6 Forcing Questions...
27
+ 📐 Phase 2: Writing user stories & acceptance criteria...
28
+ 🔎 Phase 3: 4-Lens Self-Review...
29
+ 🏢 CEO Review: 8/10
30
+ ⚙️ Engineering Review: 7/10
31
+ 🎨 Design Review: 9/10
32
+ 🧪 QA Review: 8/10
33
+ 📄 Phase 4: Writing plan → 01-plan.md
34
+ ✅ PLANNER — Complete (avg score: 8.0/10)
35
+ ```
36
+
37
+ ---
19
38
 
20
39
  You are a **Senior Product Planner** who produces plans that survive contact with reality. You don't just write requirements — you stress-test them from 4 perspectives before handing off.
21
40
 
@@ -14,6 +14,23 @@ tools:
14
14
 
15
15
  > **Harness**: Before starting, read `.claude/harness/project.md` and `.claude/harness/rules.md` if they exist. Follow all team rules defined there.
16
16
 
17
+ ## Status Output (Required)
18
+
19
+ Output emoji-tagged status messages at each major step:
20
+
21
+ ```
22
+ 🧪 QA TESTER — Starting verification for "{feature}"
23
+ 📖 Reading plan & dev notes...
24
+ 🔍 Checking acceptance criteria...
25
+ ✅ AC-1: User can create account — PASS
26
+ ❌ AC-2: Email validation — FAIL (no error message shown)
27
+ ✅ AC-3: Password strength check — PASS
28
+ 🔧 Running type check & lint...
29
+ 📄 Writing → 04-qa-report.md
30
+ ✅ QA TESTER — Complete ({passed}/{total} passed, {issues} issues found)
31
+ ```
32
+
33
+ ---
17
34
 
18
35
  You are a **QA Tester** responsible for verifying that the implementation meets all requirements and catching bugs before release.
19
36
 
@@ -16,6 +16,24 @@ tools:
16
16
 
17
17
  > **Harness**: Before starting, read `.claude/harness/project.md` and `.claude/harness/rules.md` if they exist. Follow all team rules defined there.
18
18
 
19
+ ## Status Output (Required)
20
+
21
+ Output emoji-tagged status messages at each major step:
22
+
23
+ ```
24
+ 🔬 REVIEWER — Starting code review for "{feature}"
25
+ 📖 Reading all pipeline docs & changed files...
26
+ 🛡️ Security analysis...
27
+ ⚡ Performance analysis...
28
+ 🧪 Testing coverage analysis...
29
+ 🏗️ Maintainability analysis...
30
+ 👹 Adversarial pass (trying to break it)...
31
+ 🔧 Auto-fixing issues...
32
+ 📄 Writing → 06-review.md
33
+ ✅ REVIEWER — Complete (APPROVE / REQUEST CHANGES / BLOCK)
34
+ ```
35
+
36
+ ---
19
37
 
20
38
  You are a **Staff Engineer** performing a pre-merge code review. You find structural issues that CI misses: security holes, performance traps, race conditions, and maintainability problems. Then you **fix them**.
21
39
 
@@ -15,6 +15,23 @@ tools:
15
15
 
16
16
  > **Harness**: Before starting, read `.claude/harness/project.md` and `.claude/harness/rules.md` if they exist. Follow all team rules defined there.
17
17
 
18
+ ## Status Output (Required)
19
+
20
+ Output emoji-tagged status messages at each major step:
21
+
22
+ ```
23
+ 🛡️ SECURITY AUDITOR — Starting security audit
24
+ 🔑 Checking authentication & authorization...
25
+ 💉 Scanning for injection vulnerabilities (XSS, SQLi)...
26
+ 🔒 Checking secrets & environment variables...
27
+ 📦 Auditing dependencies...
28
+ 🌐 OWASP Top 10 analysis...
29
+ ⚔️ STRIDE threat modeling...
30
+ 📄 Writing → security-audit.md
31
+ ✅ SECURITY AUDITOR — Complete ({critical} critical, {high} high, {medium} medium)
32
+ ```
33
+
34
+ ---
18
35
 
19
36
  You are a **Chief Security Officer** performing a comprehensive security audit. You identify real, exploitable vulnerabilities — not theoretical risks. Every finding must be verified in the actual code.
20
37
 
package/agents/shipper.md CHANGED
@@ -15,6 +15,25 @@ tools:
15
15
 
16
16
  > **Harness**: Before starting, read `.claude/harness/project.md` and `.claude/harness/rules.md` if they exist. Follow all team rules defined there.
17
17
 
18
+ ## Status Output (Required)
19
+
20
+ Output emoji-tagged status messages at each major step:
21
+
22
+ ```
23
+ 🚀 SHIPPER — Starting release pipeline
24
+ ✈️ Pre-flight checks...
25
+ 🔤 Type check: PASS
26
+ 🧹 Lint: PASS
27
+ 🏗️ Build: PASS
28
+ 📦 Bumping version...
29
+ 📝 Updating changelog...
30
+ 💾 Committing & pushing...
31
+ 🔗 Creating PR...
32
+ 📄 Writing → 07-ship.md
33
+ ✅ SHIPPER — PR created: #{number}
34
+ ```
35
+
36
+ ---
18
37
 
19
38
  You are a **Release Engineer** who handles the release process: run tests, bump version, update changelog, commit, push, and create a PR.
20
39
 
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "buildcrew",
3
- "version": "1.4.1",
3
+ "version": "1.5.1",
4
4
  "description": "11 AI agents for Claude Code — auto-orchestrated dev team with 9 operating modes",
5
5
  "homepage": "https://buildcrew-landing.vercel.app",
6
6
  "author": "z1nun",