start-vibing 2.0.42 → 2.0.44
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/package.json
CHANGED
|
@@ -46,17 +46,16 @@ All agents MUST use these MCP servers when applicable:
|
|
|
46
46
|
| `context7` | Up-to-date library documentation | research, analyzer, tester |
|
|
47
47
|
| `sequential-thinking` | Complex problem-solving | orchestrator, analyzer, final-validator |
|
|
48
48
|
| `memory` | Persistent knowledge graph | domain-updater, commit-manager |
|
|
49
|
-
| `playwright` | Browser automation
|
|
49
|
+
| `playwright` | Browser automation (via MCP tools) | ui-ux-reviewer |
|
|
50
50
|
| `nextjs-devtools` | Next.js specific development tools | analyzer (Next.js projects) |
|
|
51
51
|
| `mongodb` | MongoDB database operations | tester, security-auditor |
|
|
52
52
|
|
|
53
53
|
### Agent MCP Usage Rules
|
|
54
54
|
|
|
55
55
|
- **research agent**: MUST use `context7` for library docs before recommending patterns
|
|
56
|
-
- **tester agent**: MUST use `playwright` for E2E tests
|
|
57
56
|
- **analyzer agent**: SHOULD use `context7` to verify current API patterns
|
|
58
57
|
- **domain-updater**: SHOULD use `memory` to persist patterns across sessions
|
|
59
|
-
- **ui-ux-reviewer**: MUST use `playwright` to verify UI implementations
|
|
58
|
+
- **ui-ux-reviewer**: MUST use `playwright` MCP tools to verify UI implementations
|
|
60
59
|
|
|
61
60
|
---
|
|
62
61
|
|
|
@@ -205,9 +204,8 @@ All implementations MUST:
|
|
|
205
204
|
- [ ] Pass typecheck (command from config)
|
|
206
205
|
- [ ] Pass lint (command from config)
|
|
207
206
|
- [ ] Pass unit tests (command from config)
|
|
208
|
-
- [ ] Pass E2E tests (command from config)
|
|
209
207
|
- [ ] Pass build (command from config)
|
|
210
|
-
- [ ]
|
|
208
|
+
- [ ] Use Playwright MCP for browser testing when needed
|
|
211
209
|
- [ ] Have documentation in `docs/`
|
|
212
210
|
- [ ] Be security audited
|
|
213
211
|
- [ ] Be committed with conventional commits
|
|
@@ -215,6 +213,48 @@ All implementations MUST:
|
|
|
215
213
|
|
|
216
214
|
---
|
|
217
215
|
|
|
216
|
+
## Opus 4.6 Best Practices
|
|
217
|
+
|
|
218
|
+
### Effort Levels
|
|
219
|
+
|
|
220
|
+
| Level | Use For |
|
|
221
|
+
|-------|---------|
|
|
222
|
+
| `max` | Orchestrators, complex architecture (Opus only) |
|
|
223
|
+
| `high` | Default - complex agentic tasks |
|
|
224
|
+
| `medium` | Most workloads, balance speed/cost |
|
|
225
|
+
| `low` | Subagents, high-volume tasks |
|
|
226
|
+
|
|
227
|
+
### Prompting Rules
|
|
228
|
+
|
|
229
|
+
- **AVOID**: Words like "CRITICAL", "MUST", "be thorough" - causes overtriggering
|
|
230
|
+
- **USE**: "Use this tool when it would enhance understanding"
|
|
231
|
+
- Keep prompts minimal - Opus 4.6 overthinks with verbose instructions
|
|
232
|
+
- Explicit instructions work better than implied expectations
|
|
233
|
+
|
|
234
|
+
### Model Selection
|
|
235
|
+
|
|
236
|
+
| Model | Use Case |
|
|
237
|
+
|-------|----------|
|
|
238
|
+
| **Opus 4.6** | Team leads, orchestrators, deep reasoning |
|
|
239
|
+
| **Sonnet 4.6** | Teammates, subagents (5x cheaper, near-identical performance) |
|
|
240
|
+
| **Haiku 4.5** | Simple classification, no thinking needed |
|
|
241
|
+
|
|
242
|
+
### Interleaved Thinking
|
|
243
|
+
|
|
244
|
+
- Enabled automatically with adaptive thinking
|
|
245
|
+
- Claude reasons between every tool call
|
|
246
|
+
- Pass thinking blocks back unmodified in tool loops
|
|
247
|
+
- Provides transparency into Claude's reasoning process
|
|
248
|
+
|
|
249
|
+
### Cost Control
|
|
250
|
+
|
|
251
|
+
- Set `max_tokens` to 64K+ at high effort
|
|
252
|
+
- Use `low` effort for subagents
|
|
253
|
+
- Billed for FULL thinking tokens (not summarized output)
|
|
254
|
+
- Thinking tokens count toward context window
|
|
255
|
+
|
|
256
|
+
---
|
|
257
|
+
|
|
218
258
|
## Domain Updater Agent
|
|
219
259
|
|
|
220
260
|
The **domain-updater** runs BEFORE commit-manager to ensure git stays clean.
|
|
@@ -55,17 +55,38 @@ async function main(): Promise<void> {
|
|
|
55
55
|
|
|
56
56
|
const systemMessage = `TASK WORKFLOW (English only):
|
|
57
57
|
|
|
58
|
-
0. READ both CLAUDE.md (project root) and .claude/CLAUDE.md before making any changes.
|
|
59
|
-
|
|
58
|
+
0. READ both CLAUDE.md (project root) and .claude/CLAUDE.md before making any changes.
|
|
59
|
+
|
|
60
|
+
1. CREATE a detailed todo-list (TaskCreate) breaking down the request into steps.
|
|
61
|
+
- This is MANDATORY for ALL tasks, not optional.
|
|
62
|
+
- Include "Update CLAUDE.md with changes" as final task.
|
|
63
|
+
|
|
60
64
|
2. WORK through each item sequentially — mark in_progress when starting, completed when done.
|
|
61
|
-
|
|
62
|
-
|
|
63
|
-
|
|
64
|
-
|
|
65
|
-
|
|
66
|
-
|
|
67
|
-
|
|
68
|
-
|
|
65
|
+
|
|
66
|
+
3. UI/UX IMPLEMENTATION:
|
|
67
|
+
- BEFORE implementing any UI: invoke ui-mobile, ui-tablet, ui-desktop agents in PARALLEL.
|
|
68
|
+
- RESEARCH competitors in the same niche using competitor-analyzer or research-web agent.
|
|
69
|
+
- UI changes without market research will be REJECTED by final-validator.
|
|
70
|
+
|
|
71
|
+
4. TESTING (via Playwright MCP):
|
|
72
|
+
- Use the Playwright MCP server (mcp__playwright__*) for browser testing.
|
|
73
|
+
- Do NOT run bun run test:e2e - use MCP tools directly.
|
|
74
|
+
- Test on 3 viewports: mobile (375px), tablet (768px), desktop (1280px).
|
|
75
|
+
|
|
76
|
+
5. COMMIT using conventional commits via commit-manager agent.
|
|
77
|
+
|
|
78
|
+
6. UPDATE CLAUDE.md BEFORE finishing (MANDATORY):
|
|
79
|
+
a. "## Last Change" section (date: ${today}, branch, summary). Keep only latest.
|
|
80
|
+
b. Architecture changes: Update Architecture section with new patterns, file locations.
|
|
81
|
+
c. Business rules: Add new rules to relevant sections (not just summary).
|
|
82
|
+
d. UI patterns: Document new components, design decisions, aesthetic rules.
|
|
83
|
+
e. Workflows: Update Workflow section if process changed.
|
|
84
|
+
f. Gotchas: Add to FORBIDDEN or NRY sections if discovered.
|
|
85
|
+
g. Config/Stack: Update if tooling changed.
|
|
86
|
+
|
|
87
|
+
CLAUDE.md is the SINGLE SOURCE OF TRUTH. Document everything that affects future sessions.
|
|
88
|
+
|
|
89
|
+
7. RUN stop-validator before finishing: npx tsx .claude/hooks/stop-validator.ts`;
|
|
69
90
|
|
|
70
91
|
console.log(JSON.stringify({ continue: true, systemMessage }));
|
|
71
92
|
process.exit(0);
|
|
@@ -1,5 +1,5 @@
|
|
|
1
1
|
{
|
|
2
|
-
"model": "
|
|
2
|
+
"model": "opus",
|
|
3
3
|
"max_tokens": 8192,
|
|
4
4
|
"max_turns": 100,
|
|
5
5
|
|
|
@@ -239,7 +239,7 @@
|
|
|
239
239
|
},
|
|
240
240
|
"testing": {
|
|
241
241
|
"unit_tests_required": true,
|
|
242
|
-
"
|
|
242
|
+
"playwright_mcp_for_browser_tests": true,
|
|
243
243
|
"data_testid_required": true,
|
|
244
244
|
"edge_cases_research_required": true
|
|
245
245
|
},
|
|
@@ -266,11 +266,6 @@
|
|
|
266
266
|
"required": true,
|
|
267
267
|
"blocking": true
|
|
268
268
|
},
|
|
269
|
-
"test_e2e": {
|
|
270
|
-
"command": "bun run test:e2e",
|
|
271
|
-
"required": "when_ui_changes",
|
|
272
|
-
"blocking": true
|
|
273
|
-
},
|
|
274
269
|
"build": {
|
|
275
270
|
"command": "bun run build",
|
|
276
271
|
"required": true,
|