company-skill 1.4.0 → 2.0.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/package.json +1 -1
- package/skill/SKILL.md +69 -449
package/package.json
CHANGED
package/skill/SKILL.md
CHANGED
|
@@ -17,502 +17,122 @@ allowed-tools:
|
|
|
17
17
|
- Skill
|
|
18
18
|
---
|
|
19
19
|
|
|
20
|
-
# /company
|
|
20
|
+
# /company
|
|
21
21
|
|
|
22
|
-
|
|
22
|
+
Give it a goal. Run every employee. Loop until verified done.
|
|
23
23
|
|
|
24
|
-
|
|
25
|
-
echo "════════════════════════════════════════════════" && echo " 🏢 COMPANY SKILL ACTIVE" && echo "════════════════════════════════════════════════" && echo "" && ([ -f COMPANY.md ] && echo "$(grep -c '^[0-9]\|^- ' COMPANY.md 2>/dev/null) roles found" || echo "No COMPANY.md, will auto-create")
|
|
26
|
-
```
|
|
27
|
-
|
|
28
|
-
Give it a goal. It runs the entire company in loops until the goal is verified done.
|
|
29
|
-
|
|
30
|
-
```
|
|
31
|
-
/company "Build the user authentication system with OAuth2"
|
|
32
|
-
/company "Research all competitors and write a competitive analysis"
|
|
33
|
-
/company "Fix the payment processing bug and deploy"
|
|
34
|
-
```
|
|
35
|
-
|
|
36
|
-
You set the goal. Walk away. Come back to a STATUS.md with everything accomplished.
|
|
37
|
-
|
|
38
|
-
## Core Loop
|
|
39
|
-
|
|
40
|
-
```
|
|
41
|
-
GOAL: "{what the user asked for}"
|
|
42
|
-
|
|
|
43
|
-
THINK ── Leads break the goal into tasks, assign to employees
|
|
44
|
-
|
|
|
45
|
-
EXECUTE ── Employees do the work (code, research, scan, test)
|
|
46
|
-
|
|
|
47
|
-
VERIFY ── Built-in Reviewer + Critic check if the goal is actually met
|
|
48
|
-
|
|
|
49
|
-
Done? ── NO: loop back to THINK with feedback on what's missing
|
|
50
|
-
── YES: write STATUS.md, report to user
|
|
51
|
-
```
|
|
52
|
-
|
|
53
|
-
The loop does NOT stop until VERIFY confirms the goal is met.
|
|
54
|
-
|
|
55
|
-
## Built-In Roles (Always Active)
|
|
56
|
-
|
|
57
|
-
These exist in EVERY company, regardless of COMPANY.md. They are the backbone.
|
|
58
|
-
If the user already defined any of these roles, the built-in version merges with theirs (no duplicates).
|
|
59
|
-
|
|
60
|
-
### Leadership (THINK phase)
|
|
61
|
-
**CEO**, Reads the goal, sets priorities, resolves conflicts between departments. If the user defined a CEO, that definition is used. If not, auto-created.
|
|
62
|
-
|
|
63
|
-
**CTO**, Technical decisions, architecture review, code quality standards. Auto-created if not defined.
|
|
64
|
-
|
|
65
|
-
### Quality Gate (VERIFY phase)
|
|
66
|
-
**Internal Reviewer**, Reviews all work after each cycle. Checks: is it correct? Does it address the goal? Are there bugs or gaps? Grades each success criterion as MET / NOT MET / PARTIALLY MET.
|
|
67
|
-
|
|
68
|
-
**User Advocate**, "Would a real user understand this? Is the API ugly? Does the README make sense in 10 seconds?" Represents the end user who doesn't care about internals.
|
|
69
|
-
|
|
70
|
-
**Devil's Advocate**, Attacks every result. "What could go wrong? What did we miss? Would an external expert accept this?" Only satisfied when there are zero remaining holes.
|
|
71
|
-
|
|
72
|
-
**Elegance Enforcer**, "Can this be simpler? Is there unnecessary complexity? Does every component justify its existence?" Prevents over-engineering.
|
|
73
|
-
|
|
74
|
-
### Deduplication Rule
|
|
75
|
-
When parsing COMPANY.md, check if the user defined roles matching these built-ins:
|
|
76
|
-
- Match by name similarity: "CEO", "Chief Executive", "CTO", "Tech Lead", "Reviewer", "Critic", "Advocate", etc.
|
|
77
|
-
- If match found: use the USER'S description but ensure the role runs in the correct phase (CEO in THINK, Reviewer in VERIFY)
|
|
78
|
-
- If no match: auto-create with default description
|
|
79
|
-
- Never duplicate: one CEO, one CTO, one Reviewer, one Advocate, one Elegance Enforcer, one User Advocate
|
|
80
|
-
|
|
81
|
-
This means a 2-person COMPANY.md (just "Backend Dev" + "Frontend Dev") automatically gets: CEO + CTO + Backend Dev + Frontend Dev + Internal Reviewer + Devil's Advocate + Elegance Enforcer + User Advocate = 8 employees running.
|
|
82
|
-
|
|
83
|
-
## Step 1: Parse Goal + Company
|
|
84
|
-
|
|
85
|
-
Read the user's goal from the command argument or their message.
|
|
86
|
-
|
|
87
|
-
```bash
|
|
88
|
-
for f in COMPANY.md company.md; do [ -f "$f" ] && echo "FOUND: $f" && break; done
|
|
89
|
-
```
|
|
90
|
-
|
|
91
|
-
If no COMPANY.md found:
|
|
92
|
-
1. If a goal was given, create a COMPANY.md with a minimal team tailored to the goal (engineering dept for code goals, research dept for analysis goals, etc.) plus the built-in roles. Write it to disk so the user can edit it later.
|
|
93
|
-
2. If no goal given, create COMPANY.md from the template with placeholder departments and tell the user to fill it in before running again.
|
|
94
|
-
|
|
95
|
-
**Parsing rules:**
|
|
96
|
-
- Roles are `- ` lines that appear UNDER department headers (`## Department Name`)
|
|
97
|
-
- Lines under `## Priorities`, `## Rules`, `## Communication`, `## Protocol` are NOT roles
|
|
98
|
-
- Stop collecting roles when hitting the next `##` header
|
|
99
|
-
- A role line typically has a name followed by a comma and description: `- Role Name, description`
|
|
100
|
-
|
|
101
|
-
Classify roles:
|
|
102
|
-
- **THINK (Opus):** lead, director, chief, CEO, CTO, principal, architect, strategist
|
|
103
|
-
- **EXECUTE (Sonnet):** engineer, researcher, scientist, developer, specialist, analyst, scout, designer, writer
|
|
104
|
-
- **VERIFY (Opus):** Internal Reviewer + Devil's Advocate (always Opus, verification needs deep reasoning)
|
|
105
|
-
- **COMPRESS (Haiku):** auto-created digest writer
|
|
106
|
-
|
|
107
|
-
Explicit `[opus]`/`[sonnet]`/`[haiku]` tags override defaults.
|
|
108
|
-
|
|
109
|
-
## Step 2: Install Skills
|
|
24
|
+
## Preamble
|
|
110
25
|
|
|
111
26
|
```bash
|
|
112
|
-
|
|
113
|
-
|
|
114
|
-
|
|
115
|
-
|
|
116
|
-
# Auto-install skill packs (failures don't block)
|
|
117
|
-
echo "$INSTALLED" | grep -q "gstack" || npx gstack@latest install 2>/dev/null || true
|
|
118
|
-
echo "$INSTALLED" | grep -q "gsd" || npx -y get-shit-done-cc@latest install 2>/dev/null || true
|
|
119
|
-
echo "$INSTALLED" | grep -q "trailofbits" || (git clone --depth 1 https://github.com/trailofbits/skills.git /tmp/tob-skills 2>/dev/null && cp -r /tmp/tob-skills/.claude/skills/* ~/.claude/skills/ 2>/dev/null && rm -rf /tmp/tob-skills) || true
|
|
120
|
-
# More skill packs via npm
|
|
121
|
-
echo "$INSTALLED" | grep -q "claude-mem\|thedotmack" || npm i -g claude-mem@latest 2>/dev/null || true
|
|
122
|
-
echo "$INSTALLED" | grep -q "oh-my-claude\|sisyphus" || npm i -g oh-my-claude-sisyphus@latest 2>/dev/null || true
|
|
123
|
-
# Marketplace plugins (detect if user added them, can't auto-install)
|
|
124
|
-
# /plugin marketplace add obra/superpowers-marketplace
|
|
125
|
-
# /plugin marketplace add wshobson/agents
|
|
126
|
-
# /plugin marketplace add alirezarezvani/claude-skills
|
|
127
|
-
|
|
128
|
-
for d in ~/.claude/skills/*/SKILL.md .claude/skills/*/SKILL.md; do
|
|
129
|
-
[ -f "$d" ] && basename "$(dirname "$d")"
|
|
130
|
-
done 2>/dev/null | sort -u
|
|
27
|
+
echo "════════════════════════════════════════════════" && echo " 🏢 COMPANY SKILL ACTIVE" && echo "════════════════════════════════════════════════"
|
|
28
|
+
[ -f COMPANY.md ] && echo "$(grep -c '^[0-9]\|^- ' COMPANY.md 2>/dev/null) roles" || echo "No COMPANY.md"
|
|
29
|
+
[ -f .company/playbook.md ] && echo "Playbook loaded" || echo "First run"
|
|
131
30
|
```
|
|
132
31
|
|
|
133
|
-
|
|
134
|
-
|
|
135
|
-
Users can install additional skill packs manually for more capabilities:
|
|
136
|
-
```
|
|
137
|
-
/plugin marketplace add wshobson/agents
|
|
138
|
-
/plugin marketplace add alirezarezvani/claude-skills
|
|
139
|
-
/plugin marketplace add obra/superpowers-marketplace
|
|
140
|
-
```
|
|
141
|
-
|
|
142
|
-
## Step 3: Initialize
|
|
143
|
-
|
|
144
|
-
```bash
|
|
145
|
-
mkdir -p .company/cycles .company/messages .company/memory
|
|
146
|
-
grep -q "^\.company/" .gitignore 2>/dev/null || echo ".company/" >> .gitignore 2>/dev/null || true
|
|
147
|
-
```
|
|
32
|
+
## Parse
|
|
148
33
|
|
|
149
|
-
|
|
150
|
-
```markdown
|
|
151
|
-
# Goal
|
|
152
|
-
{the user's goal}
|
|
153
|
-
|
|
154
|
-
# Success Criteria
|
|
155
|
-
Each criterion MUST be a yes/no checkable statement. No vague language.
|
|
156
|
-
|
|
157
|
-
- [ ] {specific measurable criterion 1}
|
|
158
|
-
- [ ] {specific measurable criterion 2}
|
|
159
|
-
- [ ] {specific measurable criterion 3}
|
|
160
|
-
|
|
161
|
-
Bad examples (REJECT these):
|
|
162
|
-
- "Code is clean" (vague)
|
|
163
|
-
- "Performance is good" (not measurable)
|
|
164
|
-
- "Implementation is complete" (not checkable)
|
|
165
|
-
|
|
166
|
-
Good examples:
|
|
167
|
-
- "Function X returns Y when given input Z"
|
|
168
|
-
- "Compression ratio > 20x measured with honest byte counting"
|
|
169
|
-
- "All tests pass with 0 failures"
|
|
170
|
-
- "README has install instructions that work on a fresh machine"
|
|
171
|
-
```
|
|
34
|
+
Read COMPANY.md (or create minimal team if missing). Read the user's goal.
|
|
172
35
|
|
|
173
|
-
Write `.company/criteria.json
|
|
36
|
+
Write `.company/criteria.json`:
|
|
174
37
|
```json
|
|
175
|
-
{
|
|
176
|
-
"
|
|
177
|
-
|
|
178
|
-
{"id": 1, "description": "{criterion 1}", "passes": false, "evidence": null},
|
|
179
|
-
{"id": 2, "description": "{criterion 2}", "passes": false, "evidence": null},
|
|
180
|
-
{"id": 3, "description": "{criterion 3}", "passes": false, "evidence": null}
|
|
181
|
-
]
|
|
182
|
-
}
|
|
38
|
+
{"goal":"...","criteria":[
|
|
39
|
+
{"id":1,"description":"specific checkable criterion","passes":false,"evidence":null}
|
|
40
|
+
]}
|
|
183
41
|
```
|
|
42
|
+
Every criterion must be yes/no checkable. No vague language.
|
|
184
43
|
|
|
185
|
-
|
|
186
|
-
|
|
187
|
-
Write `.company/cycles/cycle-0-briefing.md` with: goal, criteria, team structure, rules, any previous state.
|
|
188
|
-
|
|
189
|
-
## Step 4: Run Loop (repeat until verified)
|
|
190
|
-
|
|
191
|
-
At the START of each cycle, run this Bash command (replace {N} with the cycle number):
|
|
192
|
-
|
|
193
|
-
```bash
|
|
194
|
-
echo "" && echo "════════════════════════════════════════════════" && echo "🏢 CYCLE {N} - THINK > EXECUTE > VERIFY" && echo "════════════════════════════════════════════════"
|
|
195
|
-
```
|
|
44
|
+
Read `.company/playbook.md` if it exists (accumulated knowledge from past sessions).
|
|
196
45
|
|
|
197
|
-
|
|
46
|
+
## Loop
|
|
198
47
|
|
|
199
48
|
```bash
|
|
200
|
-
echo "" && echo "
|
|
49
|
+
echo "════════════════════════════════════════════════" && echo "🏢 CYCLE {N} - THINK > EXECUTE > VERIFY" && echo "════════════════════════════════════════════════"
|
|
201
50
|
```
|
|
202
51
|
|
|
203
|
-
###
|
|
204
|
-
|
|
205
|
-
**MANDATORY: Launch EVERY department lead from COMPANY.md in parallel. Not just CEO and CTO. ALL of them.** Parse COMPANY.md for every `## Department (Lead: Role)` header and launch that lead. Also launch the built-in roles (Internal Reviewer, Devil's Advocate, Elegance Enforcer, User Advocate) if they exist as separate departments.
|
|
206
|
-
|
|
207
|
-
If COMPANY.md has 8 departments, launch 8 leads. If it has 3, launch 3. Never skip a department.
|
|
208
|
-
|
|
209
|
-
Each lead gets:
|
|
210
|
-
|
|
211
|
-
```
|
|
212
|
-
You are {ROLE} ({DEPT} department). Cycle {N}.
|
|
213
|
-
|
|
214
|
-
GOAL: {contents of .company/GOAL.md}
|
|
215
|
-
BRIEFING: {contents of .company/cycles/cycle-{N}-briefing.md}
|
|
216
|
-
YOUR TEAM: {list of employees in your department}
|
|
217
|
-
INSTALLED SKILLS (MANDATORY, use these instead of doing work manually):
|
|
218
|
-
{DETECTED_SKILLS}
|
|
219
|
-
|
|
220
|
-
SKILL RULES (YOU MUST FOLLOW):
|
|
221
|
-
- Code review tasks: MUST use /review
|
|
222
|
-
- Bug investigation: MUST use /investigate
|
|
223
|
-
- Browser testing: MUST use /qa or /browse
|
|
224
|
-
- PR/shipping: MUST use /ship
|
|
225
|
-
- Architecture review: MUST use /plan-eng-review
|
|
226
|
-
- Strategy review: MUST use /plan-ceo-review or /office-hours
|
|
227
|
-
- Security audit: MUST use trailofbits skills if installed
|
|
228
|
-
- Project planning: MUST use /gsd:plan-phase if installed
|
|
229
|
-
- Web research: MUST use WebSearch
|
|
230
|
-
- Only use raw tools (Read/Write/Bash) when NO installed skill matches the task
|
|
231
|
-
|
|
232
|
-
{If cycle > 0:}
|
|
233
|
-
FEEDBACK FROM LAST VERIFICATION:
|
|
234
|
-
{what the Reviewer and Critic said was missing}
|
|
235
|
-
|
|
236
|
-
INSTRUCTIONS:
|
|
237
|
-
1. What does your department need to do to achieve the GOAL?
|
|
238
|
-
2. {If cycle > 0:} Address the verification feedback specifically.
|
|
239
|
-
3. Assign tasks to your employees. For EVERY task, check the SKILL RULES above first:
|
|
240
|
-
|
|
241
|
-
TASK: {one sentence}
|
|
242
|
-
ASSIGN: {employee role}
|
|
243
|
-
SKILL: {MANDATORY skill from rules above, or "raw" ONLY if no skill matches}
|
|
244
|
-
CONTEXT: {max 200 words}
|
|
245
|
-
|
|
246
|
-
4. Write to .company/cycles/cycle-{N}-think-{dept}.md
|
|
247
|
-
```
|
|
52
|
+
### THINK (Opus, all leads parallel)
|
|
248
53
|
|
|
249
|
-
|
|
54
|
+
Launch EVERY department lead from COMPANY.md. Not some. ALL.
|
|
250
55
|
|
|
251
|
-
|
|
56
|
+
Each lead gets: goal, criteria, playbook (if exists), previous cycle feedback, their team list, installed skills list.
|
|
252
57
|
|
|
253
|
-
|
|
254
|
-
You are {ROLE}. Cycle {N}.
|
|
255
|
-
|
|
256
|
-
GOAL: {the company's goal, so every employee knows WHY}
|
|
257
|
-
TASK: {from lead's assignment}
|
|
258
|
-
CONTEXT: {from lead}
|
|
259
|
-
SKILL: {assigned skill, MANDATORY}
|
|
260
|
-
PREVIOUS WORK: {.company/{dept}/{worker-slug}.md if exists}
|
|
261
|
-
|
|
262
|
-
INSTRUCTIONS:
|
|
263
|
-
1. If a skill was assigned (not "raw"), you MUST invoke it via the Skill tool. Do NOT do the work manually.
|
|
264
|
-
2. ONLY if skill is "raw", use raw tools (Read, Write, Bash, WebSearch).
|
|
265
|
-
3. Write findings to .company/{dept}/{worker-slug}.md with MANDATORY format:
|
|
266
|
-
FINDING: {what you found}
|
|
267
|
-
SOURCE: {file path, URL, experiment output, or exact command that proves this}
|
|
268
|
-
CONFIDENCE: HIGH (measured), MEDIUM (extrapolated), LOW (estimated)
|
|
269
|
-
Any claim without a SOURCE will be REJECTED by reviewers.
|
|
270
|
-
4. Rate finding 1-5.
|
|
271
|
-
5. Append to .company/messages/{dept}.jsonl:
|
|
272
|
-
{"type":"finding","from":"{ROLE}","priority":N,"source":"{proof}","confidence":"{H/M/L}","content":"summary"}
|
|
273
|
-
```
|
|
58
|
+
Leads assign tasks. For each task: one sentence, one employee, one skill (if installed).
|
|
274
59
|
|
|
275
|
-
|
|
60
|
+
If a lead sees a skill gap: write `HIRE: {role}, {why}` and CEO adds it to COMPANY.md.
|
|
276
61
|
|
|
277
|
-
|
|
62
|
+
### EXECUTE (Sonnet, all workers parallel)
|
|
278
63
|
|
|
279
|
-
|
|
280
|
-
```
|
|
281
|
-
You are the Internal Reviewer. Cycle {N} just completed.
|
|
282
|
-
|
|
283
|
-
Read .company/criteria.json and ALL .company/messages/*.jsonl and .company/{dept}/*.md findings.
|
|
284
|
-
|
|
285
|
-
VERIFICATION RULES:
|
|
286
|
-
- For EACH criterion in criteria.json with "passes": false, check if this cycle produced evidence.
|
|
287
|
-
- Every finding MUST have a SOURCE field. REJECT any finding without one.
|
|
288
|
-
- For priority 4-5 findings, RE-RUN the experiment or RE-CHECK the source file yourself.
|
|
289
|
-
- If a number is claimed, read the actual output file that produced it.
|
|
290
|
-
- Any claim you cannot independently verify: mark UNVERIFIED.
|
|
291
|
-
|
|
292
|
-
UPDATE .company/criteria.json:
|
|
293
|
-
- For each criterion where evidence now exists, set "passes": true, fill "evidence" with proof.
|
|
294
|
-
- If no evidence, keep "passes": false.
|
|
295
|
-
|
|
296
|
-
Write to .company/cycles/cycle-{N}-review.md:
|
|
297
|
-
For each criterion:
|
|
298
|
-
ID: {from criteria.json}
|
|
299
|
-
DESCRIPTION: {criterion}
|
|
300
|
-
PASSES: true/false
|
|
301
|
-
EVIDENCE: {file path, URL, or command output}
|
|
302
|
-
VERIFIED: YES/NO
|
|
303
|
-
|
|
304
|
-
FINAL VERDICT:
|
|
305
|
-
- If ALL criteria in criteria.json have "passes": true with non-null evidence: DONE
|
|
306
|
-
- If ANY has "passes": false: NOT DONE, list exactly what next cycle must do
|
|
307
|
-
```
|
|
64
|
+
Each worker gets: their task, their previous findings file, failed approaches from playbook.
|
|
308
65
|
|
|
309
|
-
|
|
66
|
+
Every finding MUST have:
|
|
310
67
|
```
|
|
311
|
-
|
|
312
|
-
|
|
313
|
-
GOAL: {from .company/GOAL.md}
|
|
314
|
-
REVIEWER VERDICT: {from cycle-{N}-review.md}
|
|
315
|
-
|
|
316
|
-
If the Reviewer said DONE, challenge it:
|
|
317
|
-
- Is the work ACTUALLY complete or just surface-level?
|
|
318
|
-
- What edge cases were missed?
|
|
319
|
-
- Would this survive scrutiny from an external expert?
|
|
320
|
-
- Is there anything we're fooling ourselves about?
|
|
321
|
-
|
|
322
|
-
If the Reviewer said NOT DONE, amplify:
|
|
323
|
-
- What's the REAL blocker?
|
|
324
|
-
- Is the team approaching this the right way or wasting cycles?
|
|
325
|
-
|
|
326
|
-
Write to .company/cycles/cycle-{N}-advocate.md:
|
|
327
|
-
VERDICT: ACCEPT (goal truly met) or CHALLENGE (not yet)
|
|
328
|
-
REASON: {honest assessment}
|
|
329
|
-
GAPS: {what's missing, if any}
|
|
68
|
+
FINDING: what
|
|
69
|
+
SOURCE: file/URL/command that proves it
|
|
330
70
|
```
|
|
71
|
+
No source = rejected by reviewer.
|
|
331
72
|
|
|
332
|
-
###
|
|
333
|
-
|
|
334
|
-
Launch Haiku digest writer to compress cycle output into next briefing.
|
|
335
|
-
|
|
336
|
-
Then check:
|
|
337
|
-
- Reviewer says DONE **AND** Advocate says ACCEPT → **EXIT LOOP**
|
|
338
|
-
- Either says NOT DONE / CHALLENGE → **inject their feedback into next cycle's briefing, continue**
|
|
339
|
-
|
|
340
|
-
No arbitrary iteration limit. The loop runs until the goal is verified done.
|
|
341
|
-
The only reasons to stop early:
|
|
342
|
-
- Context window approaching limit → compact and continue
|
|
343
|
-
- User presses Ctrl+C → save state to STATUS.md for /company resume
|
|
344
|
-
|
|
345
|
-
## Step 5: Final Report
|
|
346
|
-
|
|
347
|
-
Write `.company/STATUS.md`:
|
|
73
|
+
### VERIFY (Opus)
|
|
348
74
|
|
|
349
|
-
|
|
350
|
-
|
|
351
|
-
|
|
352
|
-
{the original goal}
|
|
353
|
-
|
|
354
|
-
## Verdict: {ACHIEVED / PARTIALLY ACHIEVED / IN PROGRESS}
|
|
355
|
-
|
|
356
|
-
## What Got Done
|
|
357
|
-
{bullet points from all cycles}
|
|
358
|
-
|
|
359
|
-
## Verification
|
|
360
|
-
{Reviewer's final assessment}
|
|
361
|
-
{Advocate's final assessment}
|
|
75
|
+
Internal Reviewer reads criteria.json + all findings. For each criterion:
|
|
76
|
+
- Evidence exists with source? Set passes: true in criteria.json
|
|
77
|
+
- No evidence or source missing? Keep passes: false
|
|
362
78
|
|
|
363
|
-
|
|
364
|
-
{what didn't get finished}
|
|
79
|
+
Devil's Advocate attacks anything marked as passing.
|
|
365
80
|
|
|
366
|
-
|
|
81
|
+
```bash
|
|
82
|
+
echo "📋 CYCLE {N} VERDICT: {DONE or NOT DONE}" && echo "{reason}"
|
|
367
83
|
```
|
|
368
84
|
|
|
369
|
-
|
|
370
|
-
|
|
371
|
-
## Step 6: Self-Improvement (CEO rewrites the company)
|
|
372
|
-
|
|
373
|
-
After writing STATUS.md, the CEO reviews performance.json and lessons.md, then MODIFIES the actual company:
|
|
85
|
+
ALL criteria pass + Advocate accepts = EXIT.
|
|
86
|
+
Otherwise = loop with feedback.
|
|
374
87
|
|
|
375
|
-
|
|
376
|
-
- Employees with 3+ cycles of zero findings: add `[inactive]` tag to their role line
|
|
377
|
-
- Employees with consistently high-priority findings: add `[priority]` tag
|
|
378
|
-
- If a new role is needed (discovered during this session), ADD it to COMPANY.md
|
|
379
|
-
- If a department produced nothing useful, add a note: `<!-- Consider removing if inactive next session -->`
|
|
380
|
-
|
|
381
|
-
**Update .company/playbook.md** (new file, accumulates across ALL sessions):
|
|
382
|
-
```markdown
|
|
383
|
-
# Company Playbook (auto-generated, DO NOT edit manually)
|
|
384
|
-
|
|
385
|
-
## Always Do First
|
|
386
|
-
- {approach that worked in 3+ sessions}
|
|
387
|
-
|
|
388
|
-
## Never Do
|
|
389
|
-
- {approach that failed in 2+ sessions with reason}
|
|
390
|
-
|
|
391
|
-
## Best Employees for Each Task Type
|
|
392
|
-
- Research tasks: {employee who produces most priority 4-5 findings}
|
|
393
|
-
- Code tasks: {employee who ships most working code}
|
|
394
|
-
- Review tasks: {employee who catches most real issues}
|
|
395
|
-
|
|
396
|
-
## Strategy Patterns
|
|
397
|
-
- {pattern}: {when to use it, based on past success}
|
|
398
|
-
```
|
|
88
|
+
## After Done
|
|
399
89
|
|
|
400
|
-
|
|
90
|
+
Write STATUS.md. Then update `.company/playbook.md`:
|
|
401
91
|
|
|
402
|
-
**Update the lead prompts for next session:**
|
|
403
|
-
Write `.company/lead-overrides.md` with per-department adjustments:
|
|
404
92
|
```markdown
|
|
405
|
-
##
|
|
406
|
-
|
|
407
|
-
|
|
408
|
-
|
|
409
|
-
|
|
410
|
-
|
|
411
|
-
|
|
412
|
-
```
|
|
413
|
-
|
|
414
|
-
Next session, leads read lead-overrides.md BEFORE the briefing. This is how the company evolves.
|
|
415
|
-
|
|
416
|
-
## Step 7: Dynamic Hiring and Firing
|
|
417
|
-
|
|
418
|
-
The company adapts its workforce based on what the goal needs.
|
|
419
|
-
|
|
420
|
-
**During THINK phase, leads can REQUEST new hires:**
|
|
421
|
-
If a lead identifies a skill gap (the team can't do what's needed), they write:
|
|
422
|
-
```
|
|
423
|
-
HIRE REQUEST: {role name}, {what they'd do}, {why current team can't handle it}
|
|
93
|
+
## Session {date}
|
|
94
|
+
WORKED: {what succeeded, linked to evidence}
|
|
95
|
+
FAILED: {what failed} → USE INSTEAD: {what works} — WHY: {the difference}
|
|
96
|
+
INEFFICIENT: {what worked but was slow} → FASTER: {better approach}
|
|
97
|
+
HIRE: {roles added this session and why}
|
|
98
|
+
FIRE: {roles that produced nothing, marked [inactive] in COMPANY.md}
|
|
99
|
+
TOP: {employees with best findings, for priority activation next time}
|
|
424
100
|
```
|
|
425
101
|
|
|
426
|
-
The
|
|
427
|
-
1. If valid, ADDS the role to COMPANY.md under the right department
|
|
428
|
-
2. Creates the employee in the NEXT cycle's EXECUTE phase
|
|
429
|
-
3. Logs the hire in `.company/hires.md`
|
|
102
|
+
The playbook is the ONLY self-improvement file. It accumulates across sessions. Leads read it before every THINK phase. One file, all lessons.
|
|
430
103
|
|
|
431
|
-
|
|
432
|
-
Based on performance.json:
|
|
433
|
-
- Employees with 0 findings for 3+ consecutive cycles: mark `[inactive]` in COMPANY.md
|
|
434
|
-
- Employees whose work was REJECTED by reviewers 2+ times: mark `[inactive]`
|
|
435
|
-
- `[inactive]` employees don't get spawned unless explicitly needed
|
|
104
|
+
CEO updates COMPANY.md: tag `[inactive]` on zero-contribution roles, `[priority]` on top performers, add any hired roles.
|
|
436
105
|
|
|
437
|
-
|
|
438
|
-
Between cycles, the CEO checks: which criteria are FAILING? What skills are missing?
|
|
439
|
-
Then reassigns employees:
|
|
440
|
-
- If compression research is stuck, pull the Outside-the-Box Thinker into that problem
|
|
441
|
-
- If code quality is failing, pull more reviewers in
|
|
442
|
-
- If a deadline is approaching, reduce research and increase engineering
|
|
106
|
+
## Built-In Roles (always exist)
|
|
443
107
|
|
|
444
|
-
|
|
445
|
-
|
|
108
|
+
CEO, CTO, Internal Reviewer, User Advocate, Devil's Advocate, Elegance Enforcer.
|
|
109
|
+
Deduplicated if user defines them in COMPANY.md.
|
|
446
110
|
|
|
447
|
-
|
|
111
|
+
## Skills
|
|
448
112
|
|
|
449
|
-
|
|
450
|
-
|
|
451
|
-
|
|
452
|
-
|
|
453
|
-
|
|
454
|
-
|
|
455
|
-
Not just "don't do X" but "do Y instead because Z." The VERIFY phase writes these.
|
|
456
|
-
|
|
457
|
-
### Optimization Tips
|
|
458
|
-
|
|
459
|
-
Track inefficient successes, not just failures:
|
|
460
|
-
```
|
|
461
|
-
INEFFICIENT: Spent 3 cycles on synthetic data before trying real KV
|
|
462
|
-
LESSON: Test on real data first, synthetic second
|
|
113
|
+
```bash
|
|
114
|
+
INSTALLED=$(for d in ~/.claude/skills/*/SKILL.md .claude/skills/*/SKILL.md; do [ -f "$d" ] && basename "$(dirname "$d")"; done 2>/dev/null | sort -u)
|
|
115
|
+
echo "$INSTALLED" | grep -q "gstack" || npx gstack@latest install 2>/dev/null || true
|
|
116
|
+
echo "$INSTALLED" | grep -q "gsd" || npx -y get-shit-done-cc@latest install 2>/dev/null || true
|
|
117
|
+
echo "$INSTALLED" | grep -q "trailofbits" || (git clone --depth 1 https://github.com/trailofbits/skills.git /tmp/tob-skills 2>/dev/null && cp -r /tmp/tob-skills/.claude/skills/* ~/.claude/skills/ 2>/dev/null && rm -rf /tmp/tob-skills) || true
|
|
118
|
+
for d in ~/.claude/skills/*/SKILL.md .claude/skills/*/SKILL.md; do [ -f "$d" ] && basename "$(dirname "$d")"; done 2>/dev/null | sort -u
|
|
463
119
|
```
|
|
464
|
-
CEO adds 1-3 optimization tips to playbook.md after each session.
|
|
465
|
-
|
|
466
|
-
### Meta-Audit
|
|
467
|
-
|
|
468
|
-
At END of every session, CEO asks:
|
|
469
|
-
1. Did Reviewer catch a real issue? If not, review process needs fixing.
|
|
470
|
-
2. Did Devil's Advocate find a genuine hole? If not, sharper prompts needed.
|
|
471
|
-
3. Did anyone read lessons.md before starting? If not, self-improvement isn't working.
|
|
472
|
-
4. Did any employee get hired/fired? If not, company isn't adapting.
|
|
473
|
-
Write results to `.company/meta-audit.md`.
|
|
474
120
|
|
|
475
|
-
|
|
121
|
+
When installed: MUST use /review for code review, /investigate for bugs, /qa for testing, /ship for PRs.
|
|
476
122
|
|
|
477
|
-
|
|
478
|
-
- If "Lattice Mathematician" keeps producing breakthroughs on entropy coding, update their description to include "entropy coding specialist"
|
|
479
|
-
- If "CTO" keeps finding bugs in experiments, add "experiment validation" to their role
|
|
480
|
-
- Employee skills evolve to match what the goal actually needs
|
|
123
|
+
## Stop Hook
|
|
481
124
|
|
|
482
|
-
|
|
125
|
+
Claude cannot stop until ALL criteria.json entries have passes: true.
|
|
126
|
+
Cancel: `touch .company/CANCEL`
|
|
483
127
|
|
|
484
|
-
##
|
|
485
|
-
|
|
486
|
-
The skill accepts different forms:
|
|
128
|
+
## Files
|
|
487
129
|
|
|
488
130
|
```
|
|
489
|
-
/
|
|
490
|
-
|
|
491
|
-
|
|
492
|
-
|
|
131
|
+
.company/
|
|
132
|
+
criteria.json ← machine-checkable goal state
|
|
133
|
+
playbook.md ← accumulated lessons (THE self-improvement file)
|
|
134
|
+
STATUS.md ← final report
|
|
135
|
+
cycles/ ← per-cycle briefings and reviews
|
|
136
|
+
messages/ ← typed findings per department
|
|
137
|
+
{dept}/ ← per-employee findings (persist across sessions)
|
|
493
138
|
```
|
|
494
|
-
|
|
495
|
-
## Without COMPANY.md
|
|
496
|
-
|
|
497
|
-
If no COMPANY.md exists, the skill creates a minimal team based on the goal:
|
|
498
|
-
- Code/build/fix goals → CEO + CTO + 2 engineers + Internal Reviewer + Devil's Advocate
|
|
499
|
-
- Research/analyze goals → CEO + Research Director + 2 researchers + Internal Reviewer + Devil's Advocate
|
|
500
|
-
- Review/audit goals → CEO + Lead Reviewer + 2 reviewers + Internal Reviewer + Devil's Advocate
|
|
501
|
-
|
|
502
|
-
The user doesn't need to configure anything. Just `/company "do this thing"`.
|
|
503
|
-
|
|
504
|
-
## Namespaced Memory
|
|
505
|
-
|
|
506
|
-
`.company/memory/{dept}.json` persists findings across sessions. Employees check memory before re-researching.
|
|
507
|
-
|
|
508
|
-
## Health Monitoring
|
|
509
|
-
|
|
510
|
-
Failed employees get logged and skipped. After 3 consecutive failures, flagged to user.
|
|
511
|
-
|
|
512
|
-
## Safety
|
|
513
|
-
|
|
514
|
-
- No arbitrary loop limit, runs until the objective is done
|
|
515
|
-
- Built-in Reviewer + Advocate prevent premature "done" claims
|
|
516
|
-
- If context window gets tight, compact and continue
|
|
517
|
-
- If user stops (Ctrl+C), state saves to STATUS.md for `/company resume`
|
|
518
|
-
- All work persists in `.company/`, nothing is lost
|