create-claude-cabinet 0.22.0 → 0.23.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/package.json
CHANGED
|
@@ -344,7 +344,27 @@ in the debrief report:
|
|
|
344
344
|
> different destinations (memory/feedback vs finding database). Both
|
|
345
345
|
> feed the enforcement pipeline, but through different channels.
|
|
346
346
|
|
|
347
|
-
### 10.
|
|
347
|
+
### 10. Methodology Capture (core)
|
|
348
|
+
|
|
349
|
+
Read `phases/methodology-capture.md`. This is an **instruction phase**
|
|
350
|
+
shipped with CC — it tells Claude to detect when the session produced
|
|
351
|
+
load-bearing methodology (new skill, multi-phase gate scripts, convention
|
|
352
|
+
docs, calibration harnesses, multi-subagent choreography) and capture
|
|
353
|
+
the layer above individual lessons: the reasoning chain with skeptic's
|
|
354
|
+
counters AND a narrateable summary for external audiences.
|
|
355
|
+
|
|
356
|
+
Most sessions skip silently (detection heuristics fire only on core
|
|
357
|
+
methodology work — not bug fixes or small refactors). When they fire,
|
|
358
|
+
the user gets a single four-choice question (internal record, narrative
|
|
359
|
+
summary, both, or neither). Output lands in `.claude/methodology/` as
|
|
360
|
+
two files (`<date>-<slug>-internal.md` and `<date>-<slug>-narrative.md`)
|
|
361
|
+
and is surfaced in the report.
|
|
362
|
+
|
|
363
|
+
**This phase should not be skipped.** It's how the project's load-bearing
|
|
364
|
+
methodology accumulates a defensible record instead of living in
|
|
365
|
+
someone's head.
|
|
366
|
+
|
|
367
|
+
### 11. Upstream Feedback (core)
|
|
348
368
|
|
|
349
369
|
Read `phases/upstream-feedback.md`. This is an **instruction phase**
|
|
350
370
|
shipped with CC — it tells Claude to reflect on whether the session
|
|
@@ -361,7 +381,7 @@ what was confusing, what needed a workaround.
|
|
|
361
381
|
|
|
362
382
|
**This phase should not be skipped.** It's how CC learns from use.
|
|
363
383
|
|
|
364
|
-
###
|
|
384
|
+
### 12. Skill Discovery (core)
|
|
365
385
|
|
|
366
386
|
Silently reflect: did this session involve a workflow the user is
|
|
367
387
|
likely to repeat? Not every session produces one — most don't. But
|
|
@@ -392,7 +412,7 @@ generalizable pattern, cabinet member, or convention? If so, mention
|
|
|
392
412
|
`/cc-extract` as an option for proposing it upstream to CC. This is
|
|
393
413
|
rarer than project-specific skills.
|
|
394
414
|
|
|
395
|
-
###
|
|
415
|
+
### 13. Cabinet Check (core)
|
|
396
416
|
|
|
397
417
|
Silently check: is this project's expertise coverage still right?
|
|
398
418
|
|
|
@@ -415,7 +435,7 @@ members" or "committees"). If the user wants it, activate the member
|
|
|
415
435
|
via `committees-project.yaml`, `/seed`, or Tier 3 install. Track
|
|
416
436
|
declined suggestions so you don't re-suggest next session.
|
|
417
437
|
|
|
418
|
-
###
|
|
438
|
+
### 14. Capture Loose Ends (core)
|
|
419
439
|
|
|
420
440
|
Read `phases/loose-ends.md` for non-project items and environmental
|
|
421
441
|
concerns to capture before closing. Sessions generate non-project
|
|
@@ -439,14 +459,14 @@ Orient will query omega for items 1-3. Item 4 is checked via the
|
|
|
439
459
|
outbox. If debrief skips any of these, orient starts the next session
|
|
440
460
|
blind on that dimension.
|
|
441
461
|
|
|
442
|
-
###
|
|
462
|
+
### 15. Discover Custom Phases
|
|
443
463
|
|
|
444
464
|
After running the core phases above, check for any additional phase
|
|
445
465
|
files in `phases/` that the skeleton doesn't define. These are project-
|
|
446
466
|
specific extensions. Each custom phase file declares its position in
|
|
447
467
|
the workflow. Execute them at their declared position.
|
|
448
468
|
|
|
449
|
-
###
|
|
469
|
+
### 16. Present Report (presentation)
|
|
450
470
|
|
|
451
471
|
Read `phases/report.md` for how to present the debrief summary.
|
|
452
472
|
|
|
@@ -467,6 +487,7 @@ Read `phases/report.md` for how to present the debrief summary.
|
|
|
467
487
|
| `health-checks.md` | Skip | Session-end health checks |
|
|
468
488
|
| `record-lessons.md` | Default: ask what was learned | How to capture learnings |
|
|
469
489
|
| `audit-pattern-capture.md` | **Instruction: always runs** | Detect recurring audit findings, write to patterns-project.md |
|
|
490
|
+
| `methodology-capture.md` | **Instruction: always runs** | Detect methodology-level work; capture reasoning chain + narrative to `.claude/methodology/` |
|
|
470
491
|
| `upstream-feedback.md` | **Instruction: always runs** | Surface CC friction to source repo |
|
|
471
492
|
| `loose-ends.md` | Skip | Non-project items to capture |
|
|
472
493
|
| `report.md` | Default: brief summary | How to present the report |
|
|
@@ -480,7 +501,8 @@ skip presentation phases. Core phases always run.
|
|
|
480
501
|
- **Core phases** (always run): inventory, close-work,
|
|
481
502
|
cabinet-consultations, audit-pattern-capture, auto-maintenance,
|
|
482
503
|
update-state, health-checks, persist-work, record-lessons,
|
|
483
|
-
upstream-feedback, skill-discovery,
|
|
504
|
+
methodology-capture, upstream-feedback, skill-discovery,
|
|
505
|
+
cabinet-check, loose-ends
|
|
484
506
|
- **Presentation phases** (skippable): report
|
|
485
507
|
|
|
486
508
|
A project that wants a quick debrief variant skips the report and
|
|
@@ -0,0 +1,210 @@
|
|
|
1
|
+
# Methodology Capture — Instruction Phase
|
|
2
|
+
|
|
3
|
+
**Position:** After record-lessons, before upstream-feedback.
|
|
4
|
+
|
|
5
|
+
**This is an instruction phase** — it tells Claude when and how to
|
|
6
|
+
capture methodology-level artifacts from a session. It ships with CC
|
|
7
|
+
and should not be deleted or replaced with `skip: true`. If the phase
|
|
8
|
+
file is removed, the behavior described here still applies via the
|
|
9
|
+
default-from-SKILL.md mechanism.
|
|
10
|
+
|
|
11
|
+
## What this phase does
|
|
12
|
+
|
|
13
|
+
Standard debrief captures individual lessons ("user scope hints are
|
|
14
|
+
defensive signals"), discrete decisions ("used append-only history
|
|
15
|
+
table"), and documentation fixes. It does NOT capture the **reasoning
|
|
16
|
+
chain** or **self-critique layer** behind load-bearing methodology —
|
|
17
|
+
the kind of work where *how you did it* is as important as *what you
|
|
18
|
+
did*.
|
|
19
|
+
|
|
20
|
+
This phase fills that gap when the session's work warrants it. Most
|
|
21
|
+
sessions don't. The phase is gated on detection heuristics so bug
|
|
22
|
+
fixes and small refactors skip silently.
|
|
23
|
+
|
|
24
|
+
## Test for "core work"
|
|
25
|
+
|
|
26
|
+
Adapted from the source feedback that motivated this phase:
|
|
27
|
+
|
|
28
|
+
> Not every session, but when the work is core to the product's
|
|
29
|
+
> differentiation or legitimacy — the thing the founder would want
|
|
30
|
+
> on a pitch deck, the thing a customer would ask "how do I know
|
|
31
|
+
> this works?" about.
|
|
32
|
+
|
|
33
|
+
If the session produced methodology that could be asked about, defended,
|
|
34
|
+
demonstrated, or replicated — it's core work. If it was routine
|
|
35
|
+
application of existing methodology, it isn't.
|
|
36
|
+
|
|
37
|
+
## Workflow
|
|
38
|
+
|
|
39
|
+
### 1. Detect methodology-level signals (silent)
|
|
40
|
+
|
|
41
|
+
Scan the session for any of these signals. Fire only when one or more
|
|
42
|
+
match. Otherwise skip silently and proceed to upstream-feedback.
|
|
43
|
+
|
|
44
|
+
- **New skill created** in `.claude/skills/` or `templates/skills/`
|
|
45
|
+
(including new `phases/*.md` files that define orchestration logic,
|
|
46
|
+
not just content)
|
|
47
|
+
- **Multi-phase pib-db project** with actions named "Phase N…" where
|
|
48
|
+
N ≥ 2 (or equivalent work-tracker multi-phase structure)
|
|
49
|
+
- **Audit artifacts** written to structured run directories
|
|
50
|
+
(e.g., `audit-runs/<timestamp>/`)
|
|
51
|
+
- **Multi-subagent parallel choreography** used during this session
|
|
52
|
+
(two or more agents spawned in a single parallel batch with
|
|
53
|
+
coordinated roles)
|
|
54
|
+
- **Gate scripts with pass/fail thresholds** created this session
|
|
55
|
+
(exit codes, verification-breadcrumbs, blocking-phase markers)
|
|
56
|
+
- **Disposition registers or calibration harnesses** created this
|
|
57
|
+
session (files that record triage outcomes, calibration examples,
|
|
58
|
+
decision rationales for future reference)
|
|
59
|
+
- **New convention docs** under `templates/cabinet/` or equivalent
|
|
60
|
+
reference-docs location that encode cross-skill or cross-project
|
|
61
|
+
invariants
|
|
62
|
+
|
|
63
|
+
Detection should be based on evidence from this session — git diff,
|
|
64
|
+
files written, subagent spawn counts. Do not guess. If ambiguous,
|
|
65
|
+
err toward not-firing (false positives waste the user's attention;
|
|
66
|
+
false negatives only cost one methodology record).
|
|
67
|
+
|
|
68
|
+
### 2. Ask once (if signals fire)
|
|
69
|
+
|
|
70
|
+
Present the detected signals in a single sentence, then ask a
|
|
71
|
+
four-choice question. Do not ask open-ended questions. Do not batch
|
|
72
|
+
with other debrief prompts.
|
|
73
|
+
|
|
74
|
+
> "This session produced methodology work: [brief list of detected
|
|
75
|
+
> signals — e.g., 'a new skill, a 4-phase project, a gate script'].
|
|
76
|
+
> Capture:
|
|
77
|
+
> (a) **internal critique-edges record** (reasoning chain, skeptic's
|
|
78
|
+
> counters, off-ramps)
|
|
79
|
+
> (b) **narrateable summary** (200-400 words for external audiences)
|
|
80
|
+
> (c) **both**
|
|
81
|
+
> (d) **neither**"
|
|
82
|
+
|
|
83
|
+
If the user says (d), stop silently — the detection fired but the
|
|
84
|
+
user declined. Do not re-prompt next session for the same work.
|
|
85
|
+
|
|
86
|
+
### 3. Write the requested artifacts
|
|
87
|
+
|
|
88
|
+
Write to `.claude/methodology/YYYY-MM-DD[-slug]-<type>.md` where
|
|
89
|
+
`<type>` is `internal` or `narrative`. Slug is a short phrase derived
|
|
90
|
+
from the session's focus. If the directory doesn't exist, create it.
|
|
91
|
+
|
|
92
|
+
**Path override:** If `phases/methodology-capture.md` in the project
|
|
93
|
+
overrides the default location (e.g., some projects may want
|
|
94
|
+
`docs/methodology/` instead), respect the override.
|
|
95
|
+
|
|
96
|
+
### 3a. Internal critique-edges record
|
|
97
|
+
|
|
98
|
+
For each significant decision made this session, record:
|
|
99
|
+
|
|
100
|
+
```markdown
|
|
101
|
+
### [Decision in one sentence]
|
|
102
|
+
|
|
103
|
+
**Claim:** What we decided and why it's defensible.
|
|
104
|
+
|
|
105
|
+
**Evidence:** What we had in hand — measurements, prior cases,
|
|
106
|
+
documentation, stakeholder input.
|
|
107
|
+
|
|
108
|
+
**Skeptic's critique:** Where a thoughtful reader could press. What
|
|
109
|
+
would a contrarian cabinet member flag? What dismissals might be
|
|
110
|
+
too quick?
|
|
111
|
+
|
|
112
|
+
**Counter:** Our response to the critique — the reason the decision
|
|
113
|
+
still stands despite the pressure.
|
|
114
|
+
|
|
115
|
+
**Off-ramp:** What would change our mind? What observation, metric,
|
|
116
|
+
or shift in constraints would cause us to revisit this?
|
|
117
|
+
```
|
|
118
|
+
|
|
119
|
+
A session typically has 2-6 significant decisions worth this treatment.
|
|
120
|
+
Not every micro-choice. The test: "would a future session executing
|
|
121
|
+
this work benefit from knowing the skeptic's critique?"
|
|
122
|
+
|
|
123
|
+
### 3b. Narrateable summary
|
|
124
|
+
|
|
125
|
+
Write 200-400 words, external-audience-friendly. Readable by a
|
|
126
|
+
non-insider. No skill-specific jargon unless inlined with a
|
|
127
|
+
one-phrase gloss.
|
|
128
|
+
|
|
129
|
+
Structure (flexible — adapt to what the work is):
|
|
130
|
+
|
|
131
|
+
- **The problem** (1-2 sentences about the friction or opportunity)
|
|
132
|
+
- **The approach** (what methodology was applied or built)
|
|
133
|
+
- **Specifics and scale** (concrete numbers, files, components — enough
|
|
134
|
+
that a reader can sense the work's weight)
|
|
135
|
+
- **What makes this work** (the load-bearing mechanism, the non-obvious
|
|
136
|
+
insight)
|
|
137
|
+
- **How you'd know it's working** (what metric, signal, or demonstration
|
|
138
|
+
would confirm it)
|
|
139
|
+
|
|
140
|
+
Format so it could be pasted into a pitch deck section, an about page,
|
|
141
|
+
an investor update, a collaborator email, or a customer FAQ.
|
|
142
|
+
|
|
143
|
+
### 4. Integration with the debrief report
|
|
144
|
+
|
|
145
|
+
At the end of the report phase, surface any methodology artifacts
|
|
146
|
+
created this session:
|
|
147
|
+
|
|
148
|
+
> **Methodology captured this session:**
|
|
149
|
+
> - `.claude/methodology/2026-04-18-deferred-trigger-mechanism-internal.md` — 4 decisions with skeptic counters
|
|
150
|
+
> - `.claude/methodology/2026-04-18-deferred-trigger-mechanism-narrative.md` — 340 words
|
|
151
|
+
|
|
152
|
+
One-line preview plus file path for each. Don't paste the content —
|
|
153
|
+
the files are the artifact.
|
|
154
|
+
|
|
155
|
+
## Project overrides
|
|
156
|
+
|
|
157
|
+
Projects MAY override behavior by editing this phase file:
|
|
158
|
+
|
|
159
|
+
- **`skip: true`** — entire phase skipped, including detection.
|
|
160
|
+
- **Custom detection heuristics** — replace the signal list above with
|
|
161
|
+
project-specific triggers.
|
|
162
|
+
- **Custom templates** for internal/narrative records — some projects
|
|
163
|
+
may want different sections or formats.
|
|
164
|
+
- **Different output location** — e.g., `docs/methodology/` for
|
|
165
|
+
projects that want methodology visible to collaborators via
|
|
166
|
+
standard docs tooling.
|
|
167
|
+
|
|
168
|
+
The default (this file's content as shipped) should be reasonable for
|
|
169
|
+
any project doing occasional core methodology work.
|
|
170
|
+
|
|
171
|
+
## Calibration
|
|
172
|
+
|
|
173
|
+
**Without this phase (what happens today):**
|
|
174
|
+
Session produces a new multi-phase skill with gate scripts, a convention
|
|
175
|
+
doc, and a calibration harness. Debrief captures three discrete lessons
|
|
176
|
+
("user scope hints are defensive," "append-only history preserves
|
|
177
|
+
audit trail," etc.) and closes. Six weeks later, someone asks "why did
|
|
178
|
+
you build it this way instead of [X]?" — the answer is fully in
|
|
179
|
+
someone's head (maybe nobody's, if attention has moved on), and the
|
|
180
|
+
skeptic's critique that was considered and addressed during the work
|
|
181
|
+
isn't recoverable. The narrative layer that would let a stakeholder
|
|
182
|
+
understand what was done at a glance doesn't exist.
|
|
183
|
+
|
|
184
|
+
**With this phase:**
|
|
185
|
+
Same session fires the multi-phase + new-skill + convention-doc
|
|
186
|
+
signals. User picks (c) both. Claude writes:
|
|
187
|
+
- `internal.md` with 4 decisions, each including the skeptic's
|
|
188
|
+
critique and the off-ramp (so a future disagreement has ground
|
|
189
|
+
to stand on)
|
|
190
|
+
- `narrative.md` in 340 words, pastable into a roadmap email
|
|
191
|
+
When a stakeholder asks six weeks later, the artifact exists. When
|
|
192
|
+
the project revisits the decision, the off-ramps are legible.
|
|
193
|
+
|
|
194
|
+
The phase costs maybe 3-5 minutes per session it fires. It fires on
|
|
195
|
+
a minority of sessions. Most sessions are untouched.
|
|
196
|
+
|
|
197
|
+
## Relationship to adjacent phases
|
|
198
|
+
|
|
199
|
+
- **record-lessons** (step 9): captures individual lessons. Parallel
|
|
200
|
+
layer: granular, fact-shaped. This phase captures the layer above:
|
|
201
|
+
compositional, reasoning-shaped.
|
|
202
|
+
- **upstream-feedback** (step 11): captures CC friction. Different
|
|
203
|
+
input (what hurt) and destination (CC source repo feedback/).
|
|
204
|
+
- **skill-discovery** (step 12): asks whether the workflow should
|
|
205
|
+
become a skill. Different output (a skill file), different trigger
|
|
206
|
+
(repeatability, not depth).
|
|
207
|
+
|
|
208
|
+
These four phases together cover: discrete lessons (9), methodology
|
|
209
|
+
layer (this), friction (11), and workflow-worth-encoding (12). Each
|
|
210
|
+
fills a different bucket. None subsumes another.
|