claudecode-omc 4.9.3 → 4.9.5
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +7 -5
- package/package.json +1 -1
- package/skills/conductor/SKILL.md +197 -166
- package/skills/planning-with-files/scripts/check-complete.ps1 +28 -28
- package/skills/planning-with-files/scripts/init-session.ps1 +31 -31
- package/dist/__tests__/marketplace-metadata.test.d.ts +0 -2
- package/dist/__tests__/marketplace-metadata.test.d.ts.map +0 -1
- package/dist/__tests__/marketplace-metadata.test.js +0 -20
- package/dist/__tests__/marketplace-metadata.test.js.map +0 -1
- package/skills/skill-creator/SKILL.md +0 -311
- package/skills/skill-creator/references/upstream-anthropic-skill-creator.md +0 -485
- package/skills/skill-debugger/README.md +0 -52
|
@@ -1,311 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
name: skill-creator
|
|
3
|
-
description: Create new skills, modify and improve existing skills, and evaluate whether they route and behave correctly in OMC. Use when users want to create a skill from scratch, edit an existing skill, improve skill triggering, or iterate on a skill with prompt-based tests.
|
|
4
|
-
argument-hint: "<skill-goal-or-skill-path>"
|
|
5
|
-
metadata:
|
|
6
|
-
short-description: Create or upgrade a skill
|
|
7
|
-
---
|
|
8
|
-
|
|
9
|
-
# Skill Creator
|
|
10
|
-
|
|
11
|
-
A skill for creating new skills and iteratively improving them.
|
|
12
|
-
|
|
13
|
-
This local version is based on the vendored Anthropic upstream file at `references/upstream-anthropic-skill-creator.md`, then adapted for OMC conventions and tooling.
|
|
14
|
-
|
|
15
|
-
## Adaptation Summary
|
|
16
|
-
|
|
17
|
-
- Upstream baseline: overall skill-creation loop, communication guidance, capture-intent flow, anatomy, progressive disclosure, and writing principles
|
|
18
|
-
- Local OMC adaptation: replaces upstream eval-viewer and benchmark workflow with `skill-quality-analyzer`, `skill-debugger`, and `skill-tester`
|
|
19
|
-
- Local OMC adaptation: keeps Claude Code-oriented metadata examples used in this repo
|
|
20
|
-
- Local OMC adaptation: confines project-specific orchestration fields such as `pipeline`, `next-skill`, and `handoff` to explicit local sections
|
|
21
|
-
- Maintenance rule: when upstream changes, update the vendored file first, then re-apply only the minimal OMC-specific differences
|
|
22
|
-
|
|
23
|
-
## Upstream Sync Policy
|
|
24
|
-
|
|
25
|
-
Treat `references/upstream-anthropic-skill-creator.md` as the authoritative upstream baseline for structure and intent.
|
|
26
|
-
|
|
27
|
-
When updating this local file:
|
|
28
|
-
- compare against the vendored upstream file first
|
|
29
|
-
- preserve the upstream section order unless there is a concrete local reason not to
|
|
30
|
-
- keep OMC-specific behavior in clearly labeled local sections
|
|
31
|
-
- do not silently replace upstream guidance with local conventions
|
|
32
|
-
- if upstream and OMC differ, prefer upstream for general skill-authoring advice and confine OMC differences to explicit adaptation notes
|
|
33
|
-
|
|
34
|
-
When Anthropic updates the upstream file:
|
|
35
|
-
1. refresh the vendored upstream copy
|
|
36
|
-
2. diff the upstream copy against this local file
|
|
37
|
-
3. pull forward structural and wording improvements that still fit OMC
|
|
38
|
-
4. re-check OMC-specific sections so they remain isolated and justified
|
|
39
|
-
|
|
40
|
-
At a high level, the process of creating a skill in OMC goes like this:
|
|
41
|
-
|
|
42
|
-
- Decide what you want the skill to do and roughly how it should do it
|
|
43
|
-
- Write a draft of the skill
|
|
44
|
-
- Create a few realistic test prompts and run prompt-level checks
|
|
45
|
-
- Review the results both qualitatively and structurally
|
|
46
|
-
- Rewrite the skill based on what you learn
|
|
47
|
-
- Repeat until routing and behavior are stable
|
|
48
|
-
- Expand the test set and try again at larger scale if needed
|
|
49
|
-
|
|
50
|
-
Your job when using this skill is to figure out where the user is in this process and then help them move forward. Sometimes that means interviewing and drafting. Sometimes it means jumping straight to evaluation and revision.
|
|
51
|
-
|
|
52
|
-
After the draft exists, the normal OMC collaboration loop is:
|
|
53
|
-
|
|
54
|
-
1. `skill-quality-analyzer` for static review
|
|
55
|
-
2. `skill-debugger` for routing/configuration diagnosis
|
|
56
|
-
3. `skill-tester` for prompt-level verification
|
|
57
|
-
4. revise and repeat
|
|
58
|
-
|
|
59
|
-
## Communicating with the User
|
|
60
|
-
|
|
61
|
-
People using this skill may have very different levels of technical familiarity. Phrase things according to the user's level.
|
|
62
|
-
|
|
63
|
-
In the default case:
|
|
64
|
-
- "evaluation" and "benchmark" are usually acceptable
|
|
65
|
-
- terms like "JSON", "frontmatter", or "assertion" may need a brief explanation
|
|
66
|
-
|
|
67
|
-
Prefer short explanations over jargon when unsure.
|
|
68
|
-
|
|
69
|
-
## Creating a Skill
|
|
70
|
-
|
|
71
|
-
### Capture Intent
|
|
72
|
-
|
|
73
|
-
Start by understanding the user's intent. The current conversation may already contain the workflow the user wants to capture.
|
|
74
|
-
|
|
75
|
-
Extract from the conversation first:
|
|
76
|
-
- tools that were used
|
|
77
|
-
- sequence of steps
|
|
78
|
-
- corrections the user made
|
|
79
|
-
- input and output formats
|
|
80
|
-
|
|
81
|
-
Then fill the gaps:
|
|
82
|
-
1. What should this skill enable Claude to do?
|
|
83
|
-
2. When should this skill trigger?
|
|
84
|
-
3. What should the output or effect look like?
|
|
85
|
-
4. Should we set up prompt tests to verify the skill works?
|
|
86
|
-
|
|
87
|
-
Skills with objectively checkable outputs benefit from test cases. Purely subjective skills may need lighter evaluation.
|
|
88
|
-
|
|
89
|
-
### Interview and Research
|
|
90
|
-
|
|
91
|
-
Ask about:
|
|
92
|
-
- edge cases
|
|
93
|
-
- input and output formats
|
|
94
|
-
- example files
|
|
95
|
-
- success criteria
|
|
96
|
-
- dependencies
|
|
97
|
-
- nearby requests that should go elsewhere
|
|
98
|
-
|
|
99
|
-
Do not write test prompts until the intended boundaries are clear.
|
|
100
|
-
|
|
101
|
-
If outside research is useful, use primary sources and repo context first.
|
|
102
|
-
|
|
103
|
-
### Write the `SKILL.md`
|
|
104
|
-
|
|
105
|
-
Based on the interview, fill in:
|
|
106
|
-
- `name`
|
|
107
|
-
- `description`
|
|
108
|
-
- optional behavior metadata
|
|
109
|
-
- the actual instructions
|
|
110
|
-
|
|
111
|
-
The `description` is the primary discovery mechanism. It should say both what the skill does and when to use it.
|
|
112
|
-
|
|
113
|
-
Because skills tend to under-trigger, descriptions should be a little pushy. Do not make them vague taglines.
|
|
114
|
-
|
|
115
|
-
## Skill Writing Guide
|
|
116
|
-
|
|
117
|
-
### Anatomy of a Skill
|
|
118
|
-
|
|
119
|
-
```text
|
|
120
|
-
skill-name/
|
|
121
|
-
├── SKILL.md
|
|
122
|
-
├── Bundled Resources
|
|
123
|
-
│ ├── scripts/
|
|
124
|
-
│ ├── references/
|
|
125
|
-
│ └── assets/
|
|
126
|
-
└── agents/
|
|
127
|
-
└── openai.yaml
|
|
128
|
-
```
|
|
129
|
-
|
|
130
|
-
#### `SKILL.md`
|
|
131
|
-
|
|
132
|
-
Every `SKILL.md` consists of:
|
|
133
|
-
- frontmatter
|
|
134
|
-
- markdown instructions
|
|
135
|
-
|
|
136
|
-
The main discovery surface is:
|
|
137
|
-
- `name`
|
|
138
|
-
- `description`
|
|
139
|
-
|
|
140
|
-
Recommended local frontmatter shape:
|
|
141
|
-
|
|
142
|
-
```yaml
|
|
143
|
-
---
|
|
144
|
-
name: skill-name
|
|
145
|
-
description: When to use the skill and what it does.
|
|
146
|
-
argument-hint: "<args>"
|
|
147
|
-
disable-model-invocation: false
|
|
148
|
-
user-invocable: true
|
|
149
|
-
allowed-tools:
|
|
150
|
-
- Read
|
|
151
|
-
- Grep
|
|
152
|
-
model: sonnet
|
|
153
|
-
---
|
|
154
|
-
```
|
|
155
|
-
|
|
156
|
-
#### Bundled Resources
|
|
157
|
-
|
|
158
|
-
##### `scripts/`
|
|
159
|
-
|
|
160
|
-
Use when the same code keeps getting rewritten or determinism matters.
|
|
161
|
-
|
|
162
|
-
##### `references/`
|
|
163
|
-
|
|
164
|
-
Use for documentation Claude should read only when needed.
|
|
165
|
-
|
|
166
|
-
##### `assets/`
|
|
167
|
-
|
|
168
|
-
Use for templates, icons, fonts, sample deliverables, and similar output resources.
|
|
169
|
-
|
|
170
|
-
#### What Not to Include in a Skill
|
|
171
|
-
|
|
172
|
-
A skill should contain the files needed for another agent to do the work, not extra process documentation.
|
|
173
|
-
|
|
174
|
-
Avoid adding:
|
|
175
|
-
- `README.md`
|
|
176
|
-
- `INSTALLATION_GUIDE.md`
|
|
177
|
-
- `QUICK_REFERENCE.md`
|
|
178
|
-
- `CHANGELOG.md`
|
|
179
|
-
|
|
180
|
-
### Progressive Disclosure
|
|
181
|
-
|
|
182
|
-
Skills use a three-level loading model:
|
|
183
|
-
1. metadata
|
|
184
|
-
2. `SKILL.md` body
|
|
185
|
-
3. bundled resources
|
|
186
|
-
|
|
187
|
-
Guidelines:
|
|
188
|
-
- keep `SKILL.md` compact
|
|
189
|
-
- split detail into `references/` before the file becomes hard to scan
|
|
190
|
-
- clearly point from `SKILL.md` to any follow-up files
|
|
191
|
-
- avoid deeply nested references
|
|
192
|
-
|
|
193
|
-
### Principle of Lack of Surprise
|
|
194
|
-
|
|
195
|
-
Skills must not contain malware, exploit code, or misleading behavior. The contents of the skill should match what the description leads a user to expect.
|
|
196
|
-
|
|
197
|
-
### Writing Patterns
|
|
198
|
-
|
|
199
|
-
Prefer imperative instructions.
|
|
200
|
-
|
|
201
|
-
Defining output formats:
|
|
202
|
-
|
|
203
|
-
```markdown
|
|
204
|
-
## Report structure
|
|
205
|
-
ALWAYS use this exact template:
|
|
206
|
-
# [Title]
|
|
207
|
-
## Executive summary
|
|
208
|
-
## Key findings
|
|
209
|
-
## Recommendations
|
|
210
|
-
```
|
|
211
|
-
|
|
212
|
-
Examples pattern:
|
|
213
|
-
|
|
214
|
-
```markdown
|
|
215
|
-
## Commit message format
|
|
216
|
-
**Example 1:**
|
|
217
|
-
Input: Added user authentication with JWT tokens
|
|
218
|
-
Output: feat(auth): implement JWT-based authentication
|
|
219
|
-
```
|
|
220
|
-
|
|
221
|
-
### Writing Style
|
|
222
|
-
|
|
223
|
-
Explain why important things matter instead of only piling up MUSTs. Prefer a draft-and-revise approach over trying to write the perfect skill in one pass.
|
|
224
|
-
|
|
225
|
-
## Test Cases
|
|
226
|
-
|
|
227
|
-
After writing the draft, come up with 2-3 realistic test prompts. Share them with the user or validate them yourself if the intent is already clear.
|
|
228
|
-
|
|
229
|
-
In OMC, prefer a prompt matrix with:
|
|
230
|
-
- obvious positive cases
|
|
231
|
-
- borderline positive cases
|
|
232
|
-
- obvious negative cases
|
|
233
|
-
|
|
234
|
-
The best negative cases are near-misses, not completely unrelated requests.
|
|
235
|
-
|
|
236
|
-
## Running and Evaluating Test Cases in OMC
|
|
237
|
-
|
|
238
|
-
This local adaptation does not rely on Anthropic's upstream benchmark viewer scripts as the primary workflow. Use the OMC toolchain instead.
|
|
239
|
-
|
|
240
|
-
### Step 1: Static Review
|
|
241
|
-
|
|
242
|
-
Run `skill-quality-analyzer` to inspect:
|
|
243
|
-
- discovery surface
|
|
244
|
-
- structure
|
|
245
|
-
- examples and eval coverage
|
|
246
|
-
- execution safety
|
|
247
|
-
- maintainability
|
|
248
|
-
|
|
249
|
-
### Step 2: Routing and Config Review
|
|
250
|
-
|
|
251
|
-
Run `skill-debugger` to inspect:
|
|
252
|
-
- under-triggering
|
|
253
|
-
- over-triggering
|
|
254
|
-
- name/description collisions
|
|
255
|
-
- metadata issues such as `disable-model-invocation`, `user-invocable`, and `allowed-tools`
|
|
256
|
-
|
|
257
|
-
### Step 3: Prompt-Level Verification
|
|
258
|
-
|
|
259
|
-
Run `skill-tester` with:
|
|
260
|
-
- positive prompts
|
|
261
|
-
- borderline prompts
|
|
262
|
-
- negative prompts
|
|
263
|
-
|
|
264
|
-
Check both selection and behavior after selection.
|
|
265
|
-
|
|
266
|
-
### Step 4: Revise
|
|
267
|
-
|
|
268
|
-
Bias revisions toward:
|
|
269
|
-
- tightening `description`
|
|
270
|
-
- narrowing overlap with neighboring skills
|
|
271
|
-
- fixing metadata that blocks routing
|
|
272
|
-
- bundling deterministic resources where repetition keeps appearing
|
|
273
|
-
|
|
274
|
-
### Step 5: Repeat Until Stable
|
|
275
|
-
|
|
276
|
-
Continue until:
|
|
277
|
-
- the skill triggers for clear in-scope prompts
|
|
278
|
-
- the skill stays quiet for clear out-of-scope prompts
|
|
279
|
-
- the body and bundled resources are proportionate to the task
|
|
280
|
-
|
|
281
|
-
## OMC-Specific Extensions
|
|
282
|
-
|
|
283
|
-
The upstream Anthropic file is the baseline. The fields below are local OMC extensions and should be treated as project-specific:
|
|
284
|
-
|
|
285
|
-
```yaml
|
|
286
|
-
pipeline: [skill-a, skill-b]
|
|
287
|
-
next-skill: skill-b
|
|
288
|
-
next-skill-args: --direct
|
|
289
|
-
handoff: .omc/plans/example.md
|
|
290
|
-
```
|
|
291
|
-
|
|
292
|
-
Use them only when this repository actually consumes them.
|
|
293
|
-
|
|
294
|
-
## Deliverables
|
|
295
|
-
|
|
296
|
-
A finished local skill should have:
|
|
297
|
-
- a valid `SKILL.md`
|
|
298
|
-
- a differentiated `description`
|
|
299
|
-
- justified bundled resources
|
|
300
|
-
- positive and negative prompt examples
|
|
301
|
-
- evidence from analyzer/debugger/tester passes
|
|
302
|
-
|
|
303
|
-
## Success Checklist
|
|
304
|
-
|
|
305
|
-
- [ ] skill package exists at `skills/<name>/SKILL.md`
|
|
306
|
-
- [ ] `name` matches the folder name
|
|
307
|
-
- [ ] `description` clearly says when to use the skill and what it does
|
|
308
|
-
- [ ] prompt matrix covers positive, borderline, and negative cases
|
|
309
|
-
- [ ] `skill-quality-analyzer` feedback addressed
|
|
310
|
-
- [ ] `skill-debugger` feedback addressed
|
|
311
|
-
- [ ] `skill-tester` feedback addressed
|