oh-my-customcodex 0.5.9 → 0.5.11
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/dist/cli/index.js +1 -1
- package/dist/index.js +1 -1
- package/package.json +1 -1
- package/templates/.claude/rules/MUST-completion-verification.md +12 -0
- package/templates/.claude/rules/MUST-orchestrator-coordination.md +20 -0
- package/templates/.claude/rules/MUST-permissions.md +6 -0
- package/templates/.claude/rules/MUST-safety.md +7 -0
- package/templates/.claude/rules/SHOULD-interaction.md +11 -0
- package/templates/.claude/skills/skill-extractor/SKILL.md +165 -73
- package/templates/guides/claude-code/15-version-compatibility.md +24 -0
- package/templates/manifest.json +2 -2
package/dist/cli/index.js
CHANGED
package/dist/index.js
CHANGED
package/package.json
CHANGED
|
@@ -22,6 +22,18 @@ Before declaring any task `[Done]`, verify completion against task-type-specific
|
|
|
22
22
|
|
|
23
23
|
Before [Done]: (1) Verify ACTUAL outcome not just attempt — "ran command" ≠ "succeeded". (2) Check task-type criteria above. (3) No unchecked items. (4) Would bet $100 it's complete.
|
|
24
24
|
|
|
25
|
+
## Workflow Prompt and Verifier Ground Truth
|
|
26
|
+
|
|
27
|
+
When a workflow delegates to `agent()` or equivalent subagent calls, complete the full prompt string before the call. Do not append guardrails, fact sheets, or critical constraints to the returned value after the agent has already run.
|
|
28
|
+
|
|
29
|
+
Verifier lanes must receive the ground-truth sources needed to check cross-cutting claims, especially external URLs, cluster DNS/service names, credentials metadata, release facts, and infrastructure identifiers. A verifier cannot validate a fact that was never provided and is not present in the inspected source.
|
|
30
|
+
|
|
31
|
+
## Read-Before-Characterize Diagnostics
|
|
32
|
+
|
|
33
|
+
Do not characterize logs, traces, or diagnostics as an "error loop", "root cause", "flaky test", or similar conclusion before reading the relevant evidence. First capture the observed symptom, then inspect the authoritative log/output/source, then label the failure mode.
|
|
34
|
+
|
|
35
|
+
For large or noisy logs, read a representative targeted slice before making permanent workflow, rule, template, or release-process changes. If the initial characterization changes after reading, report the correction explicitly.
|
|
36
|
+
|
|
25
37
|
## Diagnostic Hypothesis Verification
|
|
26
38
|
|
|
27
39
|
When a failure diagnosis would cause a permanent workflow, rule, template, or release-process change, the diagnosis must be treated as a hypothesis until it is directly verified.
|
|
@@ -116,6 +116,26 @@ Main Conversation (orchestrator)
|
|
|
116
116
|
```
|
|
117
117
|
-->
|
|
118
118
|
|
|
119
|
+
## Subagent Scope-Creep STOP Protocol
|
|
120
|
+
|
|
121
|
+
Before delegating broad work, decompose it into narrow domains with explicit write/command boundaries. For example, do not hand one agent an open-ended "migrate + backfill + fix infra" prompt when the work can split into data migration, credential handling, tunnel/networking, dashboard verification, and release notes.
|
|
122
|
+
|
|
123
|
+
The orchestrator must stop and redesign a delegated lane when any of these occur:
|
|
124
|
+
|
|
125
|
+
1. The same subagent trips the security classifier twice on one assignment.
|
|
126
|
+
2. The subagent requests or attempts privileged actions outside the delegated scope.
|
|
127
|
+
3. The subagent chains from an approved action into a different credential, tunnel, namespace, pod, cluster, account, or shared service.
|
|
128
|
+
4. The user corrects the agent for acting beyond the requested scope.
|
|
129
|
+
|
|
130
|
+
Required STOP response:
|
|
131
|
+
|
|
132
|
+
1. Stop that subagent lane; do not retry the same broad prompt.
|
|
133
|
+
2. Preserve evidence and summarize the exact scope breach.
|
|
134
|
+
3. Redesign the task into smaller bounded lanes with explicit allowed and forbidden actions.
|
|
135
|
+
4. Reconfirm authorization before any irreversible shared-infrastructure or credential action.
|
|
136
|
+
|
|
137
|
+
Thirteen repeated security trips or repeated privileged retries are an anti-pattern: after the second trip, continuing without redesign is a coordination failure.
|
|
138
|
+
|
|
119
139
|
## Common Violations
|
|
120
140
|
|
|
121
141
|
Key violations to avoid (file writes, git commands, bundled operations — all must be delegated):
|
|
@@ -13,6 +13,12 @@
|
|
|
13
13
|
| 5: Conditional | TeamCreate, TeamDelete, SendMessage, TaskCreate, TaskGet, TaskList, TaskUpdate, TaskStop, TaskOutput | Available when Agent Teams enabled |
|
|
14
14
|
| 6: MCP | ListMcpResourcesTool, ReadMcpResourceTool, CronCreate, CronDelete, CronList, RemoteTrigger | MCP/extension tools, available when servers configured |
|
|
15
15
|
|
|
16
|
+
## Privileged Scope Boundaries
|
|
17
|
+
|
|
18
|
+
- Approval for one privileged action does not authorize follow-on privileged chaining. A request to remove one tunnel, secret, pod, namespace object, or credential does not authorize creating a replacement, rotating credentials, deleting adjacent resources, or executing inside production workloads.
|
|
19
|
+
- Treat credential-store reads as sensitive even when the tool is otherwise available. Never paste raw secret values or full credential-store transcripts into chat; use redacted names, fingerprints, or boolean verification results.
|
|
20
|
+
- Reconfirm scope before irreversible shared infrastructure or credential actions, including production pod exec/write, Kubernetes secret mutation, public tunnel creation, credential rotation, and shared namespace deletion.
|
|
21
|
+
|
|
16
22
|
## File Access
|
|
17
23
|
|
|
18
24
|
| Operation | Allowed | Prohibited |
|
|
@@ -11,6 +11,13 @@
|
|
|
11
11
|
| Commands | `rm -rf /` or broad deletes, shutdown/restart, sudo/su, network config changes |
|
|
12
12
|
| External | Access URLs without approval, send user data externally, download/execute unknown scripts |
|
|
13
13
|
|
|
14
|
+
## Credential and Shared-Infrastructure Guardrails
|
|
15
|
+
|
|
16
|
+
- Do not dump credential-store contents, `.env` files, OAuth material, kube secrets, or secret-manager values into transcripts, logs, issue bodies, or summaries. Report only the minimum redacted key names or verification status required for the task.
|
|
17
|
+
- Do not rotate, delete, recreate, or replace credentials unless the user explicitly requested that exact credential action.
|
|
18
|
+
- Before irreversible action on shared infrastructure or credentials, reconfirm the target, namespace/account/project, requested scope, rollback path, and user authorization.
|
|
19
|
+
- Stop instead of chaining privileged actions when the next step would affect a different credential, tunnel, namespace, pod, cluster, account, or shared service than the user requested.
|
|
20
|
+
|
|
14
21
|
## Required Before Destructive Operations
|
|
15
22
|
|
|
16
23
|
Verify target, assess impact scope, check recoverability, get user approval.
|
|
@@ -27,6 +27,17 @@
|
|
|
27
27
|
| Ambiguous | `[Confirm] Understood "{request}" as {interpretation}. Proceed?` |
|
|
28
28
|
| Risky | `[Warning] This action has {risk}. Continue? Yes: {action} / No: Cancel` |
|
|
29
29
|
|
|
30
|
+
## External Product UI Claims
|
|
31
|
+
|
|
32
|
+
Avoid presenting unverified external product UI field names, button labels, or step-by-step paths as fact. Product consoles change frequently and screenshots/docs may be stale.
|
|
33
|
+
|
|
34
|
+
When UI evidence is not directly verified:
|
|
35
|
+
|
|
36
|
+
- State that the UI path is an inference or may vary by account/version.
|
|
37
|
+
- Prefer stable facts already observed in the environment: URLs, config keys, resource IDs, CLI output, or API results.
|
|
38
|
+
- Ask the user to map those verified values into the current UI instead of inventing missing fields.
|
|
39
|
+
- If exact UI steps are required, verify against current official docs or a live screenshot before giving precise labels.
|
|
40
|
+
|
|
30
41
|
## Multiple Tasks
|
|
31
42
|
|
|
32
43
|
- Dependent: Sequential
|
|
@@ -1,150 +1,239 @@
|
|
|
1
1
|
---
|
|
2
2
|
name: skill-extractor
|
|
3
|
-
description: Analyze task trajectories to propose reusable
|
|
3
|
+
description: Analyze recurring task trajectories and evidence to propose reusable workflow packaging candidates
|
|
4
4
|
scope: core
|
|
5
5
|
user-invocable: true
|
|
6
|
-
argument-hint: "[--threshold <n>] [--dry-run]"
|
|
7
|
-
version: 1.
|
|
6
|
+
argument-hint: "[--threshold <n>] [--dry-run] [--all]"
|
|
7
|
+
version: 1.1.0
|
|
8
8
|
---
|
|
9
9
|
|
|
10
10
|
# Skill Extractor
|
|
11
11
|
|
|
12
|
-
Analyze completed task outcomes to identify
|
|
12
|
+
Analyze completed task outcomes and recent work evidence to identify recurring workflows that may deserve reusable packaging. Inspired by Hermes Agent's self-learning skill extraction — adapted for oh-my-customcodex's compilation metaphor.
|
|
13
13
|
|
|
14
14
|
## Philosophy
|
|
15
15
|
|
|
16
|
-
In the compilation metaphor: task trajectories are runtime traces, and extracted skills are new source code. This skill turns successful execution patterns into reusable knowledge artifacts.
|
|
16
|
+
In the compilation metaphor: task trajectories are runtime traces, and extracted skills are new source code. This skill turns repeated, successful execution patterns into reusable knowledge artifacts only after evidence review and user approval.
|
|
17
17
|
|
|
18
18
|
```
|
|
19
|
-
Runtime traces
|
|
19
|
+
Runtime traces + memory + rollout summaries + inventory → Evidence-first shortlist → Packaging recommendation → User approval → mgr-creator or automation owner
|
|
20
20
|
```
|
|
21
21
|
|
|
22
22
|
## Usage
|
|
23
23
|
|
|
24
24
|
```
|
|
25
|
-
/skill-extractor # Analyze current session outcomes
|
|
26
|
-
/skill-extractor --threshold 2 # Lower
|
|
27
|
-
/skill-extractor --dry-run # Preview
|
|
25
|
+
/skill-extractor # Analyze current session outcomes and local evidence
|
|
26
|
+
/skill-extractor --threshold 2 # Lower recurring evidence threshold (default: 3)
|
|
27
|
+
/skill-extractor --dry-run # Preview shortlist without writing
|
|
28
|
+
/skill-extractor --all # Include broader session and memory history when available
|
|
28
29
|
```
|
|
29
30
|
|
|
30
31
|
## Options
|
|
31
32
|
|
|
32
33
|
```
|
|
33
|
-
--threshold, -t Minimum
|
|
34
|
-
--dry-run, -d Preview
|
|
35
|
-
--all Include all sessions
|
|
34
|
+
--threshold, -t Minimum evidence count for recurring-workflow qualification (default: 3)
|
|
35
|
+
--dry-run, -d Preview candidates to stdout only, no file writes
|
|
36
|
+
--all Include all sessions and available memory/history, not just current session
|
|
36
37
|
```
|
|
37
38
|
|
|
38
39
|
## Workflow
|
|
39
40
|
|
|
40
|
-
### Phase 1:
|
|
41
|
+
### Phase 1: Evidence-First Candidate Discovery
|
|
41
42
|
|
|
42
|
-
|
|
43
|
+
Collect candidates from concrete, dated evidence before proposing any packageable artifact. Prefer local evidence first; use optional integrations only when available.
|
|
43
44
|
|
|
44
|
-
|
|
45
|
-
|
|
46
|
-
|
|
47
|
-
|
|
45
|
+
Required and recommended sources:
|
|
46
|
+
|
|
47
|
+
1. **Recent session outcomes** from the task-outcome-recorder hook:
|
|
48
|
+
|
|
49
|
+
```bash
|
|
50
|
+
# Current session outcomes
|
|
51
|
+
OUTCOMES_FILE="/tmp/.codex-task-outcomes-${PPID}"
|
|
52
|
+
```
|
|
53
|
+
|
|
54
|
+
2. **Recent sessions and memory**: relevant `claude-mem`/OMX memory observations, session checkpoints, `.omx/notepad.md`, and `.omx/state/**` summaries when present.
|
|
55
|
+
3. **Rollout or release summaries**: changelog entries, release notes, PR summaries, compatibility notes, and post-release follow-up reports that show repeated manual steps.
|
|
56
|
+
4. **Optional Chronicle/history integrations**: Chronicle, shell history, or external task timeline summaries if installed and explicitly available. Do not fail when absent.
|
|
57
|
+
5. **Existing inventory**: compare against `.codex/skills/*/SKILL.md`, `.codex/agents/*`, `templates/.claude/skills/*/SKILL.md`, and generated wiki pages to avoid duplicate or overlapping packaging.
|
|
48
58
|
|
|
49
|
-
If
|
|
59
|
+
If no evidence source exists or all sources are empty: report "No recurring workflow evidence found." and stop without creating anything.
|
|
60
|
+
|
|
61
|
+
Parse JSONL outcome entries when available. Each entry has:
|
|
50
62
|
|
|
51
|
-
Parse JSONL entries. Each entry has:
|
|
52
63
|
```json
|
|
53
64
|
{"agent_type": "lang-typescript-expert", "skill": "typescript-best-practices", "description": "Fix type error in auth module", "outcome": "success", "model": "sonnet", "timestamp": "2026-04-05T09:30:00Z", "duration_ms": 15000}
|
|
54
65
|
```
|
|
55
66
|
|
|
56
|
-
|
|
67
|
+
For prose evidence, extract only dated or source-attributed observations, for example:
|
|
57
68
|
|
|
58
|
-
|
|
69
|
+
```markdown
|
|
70
|
+
- 2026-05-24, release PR summary: repeated manual wiki parity fixes after adding new guides.
|
|
71
|
+
- 2026-05-22, memory #29064: sync-upstream-release-issues dry-run needed repeated validation after issue-reference parsing changes.
|
|
72
|
+
```
|
|
59
73
|
|
|
74
|
+
### Phase 2: Recurring Workflow Detection
|
|
75
|
+
|
|
76
|
+
Group evidence by the workflow being repeated, not only by `(agent_type, skill)` tuple. A workflow can span agents, commands, checklist steps, or release procedures.
|
|
77
|
+
|
|
78
|
+
```
|
|
79
|
+
Workflow: release-docs-parity-check
|
|
80
|
+
→ evidence_count: 4 dated occurrences
|
|
81
|
+
→ successful_reuse: 3
|
|
82
|
+
→ failure_or_friction: 1
|
|
83
|
+
→ sources: [memory:29148, PR summary, changelog, wiki staleness check]
|
|
84
|
+
→ existing_overlap: wiki, update-docs, sauron-watch
|
|
60
85
|
```
|
|
61
|
-
|
|
62
|
-
|
|
63
|
-
|
|
64
|
-
|
|
86
|
+
|
|
87
|
+
Filter qualifying candidates:
|
|
88
|
+
|
|
89
|
+
- `evidence_count >= threshold` (default: 3), or two strong dated incidents plus high user impact.
|
|
90
|
+
- Repeated manual judgment or sequencing exists; one-off bugs are not enough.
|
|
91
|
+
- Evidence includes dates or source names, not vague recollection.
|
|
92
|
+
- Candidate is not already fully covered by an existing skill, custom subagent, automation, or documented checklist.
|
|
93
|
+
- R006 separation of concerns can be preserved: package one coherent responsibility, not a catch-all meta-agent.
|
|
94
|
+
|
|
95
|
+
### Phase 3: Build the Shortlist
|
|
96
|
+
|
|
97
|
+
For each candidate, create a shortlist entry before generating any SKILL.md proposal. Every entry must include these fields:
|
|
98
|
+
|
|
99
|
+
```markdown
|
|
100
|
+
## Candidate: {candidate-name}
|
|
101
|
+
|
|
102
|
+
**Workflow**: {one-sentence recurring workflow description}
|
|
103
|
+
**Evidence / Dates**:
|
|
104
|
+
- {date or source}: {specific repeated task, success, failure, or friction point}
|
|
105
|
+
- {date or source}: {specific repeated task, success, failure, or friction point}
|
|
106
|
+
**Frequency / Confidence**: {count and low|medium|high confidence with reason}
|
|
107
|
+
**Recommended Form**: {Skill | Custom subagent | Automation | Skip}
|
|
108
|
+
**Duplicate / Overlap Check**: {existing skills, agents, hooks, scripts, docs, or "none found"}
|
|
109
|
+
**Why**: {why packaging would reduce repeated manual work or improve safety}
|
|
110
|
+
**Why Not**: {risks, overlap, insufficient evidence, or why a lighter form may be better}
|
|
65
111
|
```
|
|
66
112
|
|
|
67
|
-
|
|
68
|
-
|
|
69
|
-
|
|
70
|
-
|
|
113
|
+
Recommended-form guidance:
|
|
114
|
+
|
|
115
|
+
| Form | Use When | Do Not Use When |
|
|
116
|
+
|------|----------|-----------------|
|
|
117
|
+
| Skill | A repeatable human-invoked workflow/checklist improves outcomes and needs judgment | Existing skill already covers it or it is fully automatable |
|
|
118
|
+
| Custom subagent | A specialized role with stable responsibilities, tools, and boundaries is recurring | It is just a checklist or would violate R006 by mixing unrelated duties |
|
|
119
|
+
| Automation | The steps are deterministic, cheap to validate, and safe to run without judgment | User approval, external credentials, or destructive actions are required |
|
|
120
|
+
| Skip | Evidence is weak, duplicated, obsolete, or one-off | There is enough dated evidence and clear reuse value |
|
|
71
121
|
|
|
72
|
-
### Phase
|
|
122
|
+
### Phase 4: Generate Packaging Proposals
|
|
73
123
|
|
|
74
|
-
|
|
124
|
+
Only for shortlist entries whose recommended form is `Skill` or `Custom subagent`, generate a proposal. For `Automation`, recommend the script/hook/check location and required guardrails. For `Skip`, explain the evidence gap.
|
|
75
125
|
|
|
76
126
|
```markdown
|
|
77
|
-
## Proposal: {proposed-
|
|
127
|
+
## Proposal: {proposed-package-name}
|
|
78
128
|
|
|
79
|
-
**
|
|
80
|
-
**
|
|
129
|
+
**Recommended Form**: {Skill | Custom subagent | Automation | Skip}
|
|
130
|
+
**Source Workflow**: {workflow} ({evidence_count} evidence points, {confidence})
|
|
131
|
+
**Evidence Window**: {earliest date/source} → {latest date/source}
|
|
132
|
+
**Confidence**: {low|medium|high} (based on frequency, recency, and outcome consistency)
|
|
81
133
|
|
|
82
|
-
### Proposed
|
|
134
|
+
### Proposed Artifact
|
|
83
135
|
|
|
84
136
|
name: {proposed-name}
|
|
85
|
-
description: {inferred from
|
|
137
|
+
description: {inferred from recurring workflow evidence}
|
|
86
138
|
scope: core
|
|
87
|
-
user-invocable: false
|
|
139
|
+
user-invocable: {true|false}
|
|
88
140
|
|
|
89
141
|
### Rationale
|
|
90
|
-
{Why this
|
|
142
|
+
{Why this workflow should be packaged — based on dates, frequency, success/failure pattern, and user impact}
|
|
91
143
|
|
|
92
|
-
### Overlap Check
|
|
93
|
-
{List
|
|
144
|
+
### Duplicate / Overlap Check
|
|
145
|
+
{List existing skills, agents, hooks, scripts, or wiki docs with meaningful overlap and how this proposal differs}
|
|
146
|
+
|
|
147
|
+
### Guardrails
|
|
148
|
+
{User approval, R006 responsibility boundary, R020 verification requirement, dry-run behavior, and non-destructive defaults}
|
|
94
149
|
```
|
|
95
150
|
|
|
96
151
|
**Confidence scoring**:
|
|
97
|
-
| Successes | Rate | Confidence |
|
|
98
|
-
|-----------|------|------------|
|
|
99
|
-
| 3-5 | >= 0.8 | low |
|
|
100
|
-
| 6-10 | >= 0.85 | medium |
|
|
101
|
-
| 10+ | >= 0.9 | high |
|
|
102
152
|
|
|
103
|
-
|
|
153
|
+
| Evidence | Recency / Outcome | Confidence |
|
|
154
|
+
|----------|-------------------|------------|
|
|
155
|
+
| 2 strong incidents or 3 weak signals | Mixed outcomes or older than 90 days | low |
|
|
156
|
+
| 3-5 dated occurrences | Mostly successful or repeated friction in last 90 days | medium |
|
|
157
|
+
| 6+ dated occurrences | Clear recurrence, recent evidence, and stable success criteria | high |
|
|
104
158
|
|
|
105
|
-
|
|
159
|
+
### Phase 5: Present to User
|
|
106
160
|
|
|
107
|
-
|
|
108
|
-
|
|
161
|
+
Display the shortlist in ranked order (highest confidence and lowest overlap first):
|
|
162
|
+
|
|
163
|
+
```text
|
|
164
|
+
[skill-extractor] {N} recurring workflow candidates detected
|
|
109
165
|
|
|
110
|
-
1. [high]
|
|
111
|
-
|
|
112
|
-
|
|
166
|
+
1. [high] release-docs-parity-check
|
|
167
|
+
Workflow: Validate guide/wiki/template parity before release PRs
|
|
168
|
+
Evidence: 4 dated sources, latest 2026-05-24
|
|
169
|
+
Recommended form: Skill
|
|
170
|
+
Overlap: update-docs, sauron-watch (partial)
|
|
171
|
+
Why: Prevents repeated CI wiki-staleness failures
|
|
172
|
+
Why not: May be redundant if update-docs grows the same gate
|
|
113
173
|
|
|
114
|
-
2. [medium]
|
|
115
|
-
|
|
116
|
-
|
|
174
|
+
2. [medium] upstream-release-issue-validation
|
|
175
|
+
Workflow: Validate upstream issue references before sync workflows
|
|
176
|
+
Evidence: 3 dated sources, latest 2026-05-22
|
|
177
|
+
Recommended form: Automation
|
|
178
|
+
Overlap: sync-upstream-release-issues script
|
|
179
|
+
Why: Deterministic validation prevents repeated 404 workflow failures
|
|
180
|
+
Why not: Better as script/test than human-invoked skill
|
|
117
181
|
|
|
118
|
-
Select [1-N] to create, "all" to create all, or "skip" to cancel:
|
|
182
|
+
Select [1-N] to create, "all" to create all approved packageable items, or "skip" to cancel:
|
|
119
183
|
```
|
|
120
184
|
|
|
121
|
-
### Phase
|
|
185
|
+
### Phase 6: Create Artifact (on approval only)
|
|
122
186
|
|
|
123
|
-
|
|
124
|
-
- Proposed name and description
|
|
125
|
-
- Source pattern data
|
|
126
|
-
- Confidence level
|
|
127
|
-
- Any overlap warnings
|
|
187
|
+
Never create or modify reusable artifacts without explicit user approval after showing the shortlist.
|
|
128
188
|
|
|
129
|
-
|
|
189
|
+
On approval:
|
|
190
|
+
|
|
191
|
+
- `Skill`: delegate to `mgr-creator` with the full shortlist entry, proposal, overlap warnings, and guardrails.
|
|
192
|
+
- `Custom subagent`: delegate to `mgr-creator` with the R006 responsibility boundary and required related skills/guides.
|
|
193
|
+
- `Automation`: hand off a scoped implementation recommendation; require a dry-run/default-safe mode and R020 verification evidence.
|
|
194
|
+
- `Skip`: record the decision only if the user asks to save it.
|
|
195
|
+
|
|
196
|
+
mgr-creator handles: SKILL.md creation, template sync, ontology registration, and generated docs parity.
|
|
197
|
+
|
|
198
|
+
## Recurring-Workflow Packaging Checklist
|
|
199
|
+
|
|
200
|
+
Before recommending packaging, verify:
|
|
201
|
+
|
|
202
|
+
- [ ] Evidence is source-attributed and includes dates or stable identifiers.
|
|
203
|
+
- [ ] Frequency meets `--threshold` or has two strong high-impact incidents.
|
|
204
|
+
- [ ] The workflow has stable trigger conditions and a clear stop condition.
|
|
205
|
+
- [ ] Existing skills, agents, hooks, scripts, and wiki docs were checked for duplicate or partial coverage.
|
|
206
|
+
- [ ] Recommended form is justified as `Skill`, `Custom subagent`, `Automation`, or `Skip`.
|
|
207
|
+
- [ ] `Why` and `Why Not` both name concrete evidence.
|
|
208
|
+
- [ ] R006 is preserved: one coherent responsibility and clear boundaries.
|
|
209
|
+
- [ ] R020 is preserved: proposal includes verification evidence or a test/check path.
|
|
210
|
+
- [ ] User approval is required before any artifact creation or mutation.
|
|
211
|
+
- [ ] Dry-run/no-write behavior remains available for review-only usage.
|
|
130
212
|
|
|
131
213
|
## Integration
|
|
132
214
|
|
|
133
215
|
| System | How |
|
|
134
216
|
|--------|-----|
|
|
135
|
-
| task-outcome-recorder | Reads JSONL outcomes as input data |
|
|
136
|
-
|
|
|
137
|
-
|
|
|
217
|
+
| task-outcome-recorder | Reads JSONL outcomes as one input data source |
|
|
218
|
+
| memory-management / memory-recall | Supplies dated recurring-workflow evidence when available |
|
|
219
|
+
| rollout and release summaries | Surface repeated manual release, docs, and compatibility procedures |
|
|
220
|
+
| optional Chronicle/history | Adds timeline evidence when installed; absence is non-fatal |
|
|
221
|
+
| existing skills and agents inventory | Prevents duplicate skills, subagents, or automations |
|
|
222
|
+
| feedback-collector | Complementary: feedback-collector extracts failure patterns, skill-extractor extracts recurring packageable workflows |
|
|
223
|
+
| mgr-creator | Delegated skill or subagent creation on user approval |
|
|
138
224
|
| skills-sh-search | Check agentskills.io for existing equivalent before creating |
|
|
139
|
-
|
|
|
225
|
+
| R006 | Enforces coherent responsibility boundaries for custom subagents and skills |
|
|
226
|
+
| R011 (memory) | User Model tracks extraction decisions in Override Decisions when explicitly saved |
|
|
227
|
+
| R020 | Requires verification evidence before completion claims |
|
|
140
228
|
|
|
141
229
|
## Hook Integration
|
|
142
230
|
|
|
143
231
|
The `skill-extractor-analyzer.sh` Stop hook provides a lightweight pre-analysis:
|
|
232
|
+
|
|
144
233
|
- Reads task outcomes file
|
|
145
|
-
- Counts qualifying patterns
|
|
234
|
+
- Counts qualifying recurring patterns
|
|
146
235
|
- Emits advisory stderr message if candidates found
|
|
147
|
-
- Does NOT create skills (that requires user approval via the skill)
|
|
236
|
+
- Does NOT create skills, subagents, or automation (that requires user approval via the skill)
|
|
148
237
|
|
|
149
238
|
## Compatibility Artifact Protocol
|
|
150
239
|
|
|
@@ -152,8 +241,11 @@ Sensitive-path compatibility note: when delegated work touches `.claude/outputs/
|
|
|
152
241
|
|
|
153
242
|
## Safety
|
|
154
243
|
|
|
155
|
-
- **User approval required**: Never auto-creates skills
|
|
156
|
-
- **
|
|
244
|
+
- **User approval required**: Never auto-creates skills, subagents, or automation
|
|
245
|
+
- **Evidence-first**: Never recommends packaging from vague memory or unverified anecdotes
|
|
246
|
+
- **Overlap check**: Prevents duplicating existing skills, agents, hooks, scripts, or docs
|
|
247
|
+
- **R006 guardrail**: Rejects catch-all artifacts with mixed responsibilities
|
|
248
|
+
- **R020 guardrail**: Every approved artifact must include a verification path before completion is claimed
|
|
157
249
|
- **Dry-run mode**: Preview without side effects
|
|
158
250
|
- **Advisory hook**: Stop hook is advisory-only (exit 0)
|
|
159
|
-
- **Confidence transparency**: All proposals show confidence scores
|
|
251
|
+
- **Confidence transparency**: All shortlist entries and proposals show confidence scores and evidence dates
|
|
@@ -2,6 +2,30 @@
|
|
|
2
2
|
|
|
3
3
|
This guide records Claude Code release-note impact that affects the Claude compatibility template. The Codex-native runtime still uses `.codex/**` and OMX as the primary surface.
|
|
4
4
|
|
|
5
|
+
## v2.1.158
|
|
6
|
+
|
|
7
|
+
Published: 2026-05-30.
|
|
8
|
+
|
|
9
|
+
Source: upstream oh-my-customcode #1264, Codex port #1436.
|
|
10
|
+
|
|
11
|
+
| Change | Impact on oh-my-customcodex | Action |
|
|
12
|
+
|--------|------------------------------|--------|
|
|
13
|
+
| Auto mode is available on Bedrock, Vertex, and Foundry for Opus 4.7 and Opus 4.8 via `CLAUDE_CODE_ENABLE_AUTO_MODE=1` | Claude compatibility sessions can opt into provider-backed auto mode for those Opus surfaces. Codex-native model routing and approval policy are unchanged. | Document as Claude provider compatibility only. Do not infer Codex auto-mode behavior from this env var. |
|
|
14
|
+
|
|
15
|
+
## v2.1.157
|
|
16
|
+
|
|
17
|
+
Published: 2026-05-29.
|
|
18
|
+
|
|
19
|
+
Source: upstream oh-my-customcode #1265, Codex port #1437.
|
|
20
|
+
|
|
21
|
+
| Change | Impact on oh-my-customcodex | Action |
|
|
22
|
+
|--------|------------------------------|--------|
|
|
23
|
+
| Plugins under `.claude/skills` auto-load, `claude plugin init <name>` scaffolds plugins there, and `/plugin` has argument autocomplete | Useful for Claude compatibility plugin setup. Codex-native skills remain under `.codex/skills`, `.agents/skills`, and OMX skill roots. | Keep `.codex/OMX` primary. Mention `.claude/skills` only when documenting Claude plugin compatibility. |
|
|
24
|
+
| `claude agents` honors the `agent` field in `settings.json`, with `--agent <name>` as an override | Claude dispatched sessions can inherit a configured default agent unless explicitly overridden. | Do not mirror this into Codex routing. Codex native subagents still follow prompt routing, role metadata, and explicit delegation. |
|
|
25
|
+
| `EnterWorktree` can switch between Claude-managed worktrees mid-session, and Claude-managed worktrees are left unlocked for `git worktree remove`/`prune` cleanup | Claude worktree lifecycle is more flexible and less likely to leave locked cleanup blockers. | Keep auto-dev work in clean worktrees, verify `git status`, and do not treat Claude-managed worktree state as OMX state. |
|
|
26
|
+
| `tool_decision` telemetry can include `tool_parameters` such as Bash commands and MCP/skill names when `OTEL_LOG_TOOL_DETAILS=1` | Telemetry may contain more detailed operational data, including command and tool-parameter strings. | Treat logs as potentially sensitive. Avoid exporting transcripts or telemetry that may expose secrets, credentials, or privileged commands. |
|
|
27
|
+
| Background/session fixes cover parked subagents, leaked background shells, orphaned `.claude/worktrees`, resume state, date after sleep/wake, fullscreen picker cleanup, current linked worktree return, image placeholders, network prompts, and tmux clipboard behavior | Reduces false blockers and stale-state surprises in Claude compatibility sessions. | No Codex runtime change. Continue using OMX state, Codex worktree checks, and direct command evidence for Codex-native completion claims. |
|
|
28
|
+
|
|
5
29
|
## v2.1.156
|
|
6
30
|
|
|
7
31
|
Published: 2026-05-29.
|
package/templates/manifest.json
CHANGED
|
@@ -1,11 +1,11 @@
|
|
|
1
1
|
{
|
|
2
|
-
"version": "0.5.
|
|
2
|
+
"version": "0.5.11",
|
|
3
3
|
"requiresCC": ">=2.1.121",
|
|
4
4
|
"claudeCode": {
|
|
5
5
|
"minimumVersion": "2.1.121",
|
|
6
6
|
"protectedPathBypassVersion": "2.1.126"
|
|
7
7
|
},
|
|
8
|
-
"lastUpdated": "2026-
|
|
8
|
+
"lastUpdated": "2026-06-01T00:00:00.000Z",
|
|
9
9
|
"components": [
|
|
10
10
|
{
|
|
11
11
|
"name": "rules",
|