oh-my-customcodex 0.5.9 → 0.5.11

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/dist/cli/index.js CHANGED
@@ -3091,7 +3091,7 @@ var init_package = __esm(() => {
3091
3091
  workspaces: [
3092
3092
  "packages/*"
3093
3093
  ],
3094
- version: "0.5.9",
3094
+ version: "0.5.11",
3095
3095
  requiresCC: ">=2.1.121",
3096
3096
  claudeCode: {
3097
3097
  minimumVersion: "2.1.121",
package/dist/index.js CHANGED
@@ -2316,7 +2316,7 @@ var package_default = {
2316
2316
  workspaces: [
2317
2317
  "packages/*"
2318
2318
  ],
2319
- version: "0.5.9",
2319
+ version: "0.5.11",
2320
2320
  requiresCC: ">=2.1.121",
2321
2321
  claudeCode: {
2322
2322
  minimumVersion: "2.1.121",
package/package.json CHANGED
@@ -3,7 +3,7 @@
3
3
  "workspaces": [
4
4
  "packages/*"
5
5
  ],
6
- "version": "0.5.9",
6
+ "version": "0.5.11",
7
7
  "requiresCC": ">=2.1.121",
8
8
  "claudeCode": {
9
9
  "minimumVersion": "2.1.121",
@@ -22,6 +22,18 @@ Before declaring any task `[Done]`, verify completion against task-type-specific
22
22
 
23
23
  Before [Done]: (1) Verify ACTUAL outcome not just attempt — "ran command" ≠ "succeeded". (2) Check task-type criteria above. (3) No unchecked items. (4) Would bet $100 it's complete.
24
24
 
25
+ ## Workflow Prompt and Verifier Ground Truth
26
+
27
+ When a workflow delegates to `agent()` or equivalent subagent calls, complete the full prompt string before the call. Do not append guardrails, fact sheets, or critical constraints to the returned value after the agent has already run.
28
+
29
+ Verifier lanes must receive the ground-truth sources needed to check cross-cutting claims, especially external URLs, cluster DNS/service names, credentials metadata, release facts, and infrastructure identifiers. A verifier cannot validate a fact that was never provided and is not present in the inspected source.
30
+
31
+ ## Read-Before-Characterize Diagnostics
32
+
33
+ Do not characterize logs, traces, or diagnostics as an "error loop", "root cause", "flaky test", or similar conclusion before reading the relevant evidence. First capture the observed symptom, then inspect the authoritative log/output/source, then label the failure mode.
34
+
35
+ For large or noisy logs, read a representative targeted slice before making permanent workflow, rule, template, or release-process changes. If the initial characterization changes after reading, report the correction explicitly.
36
+
25
37
  ## Diagnostic Hypothesis Verification
26
38
 
27
39
  When a failure diagnosis would cause a permanent workflow, rule, template, or release-process change, the diagnosis must be treated as a hypothesis until it is directly verified.
@@ -116,6 +116,26 @@ Main Conversation (orchestrator)
116
116
  ```
117
117
  -->
118
118
 
119
+ ## Subagent Scope-Creep STOP Protocol
120
+
121
+ Before delegating broad work, decompose it into narrow domains with explicit write/command boundaries. For example, do not hand one agent an open-ended "migrate + backfill + fix infra" prompt when the work can split into data migration, credential handling, tunnel/networking, dashboard verification, and release notes.
122
+
123
+ The orchestrator must stop and redesign a delegated lane when any of these occur:
124
+
125
+ 1. The same subagent trips the security classifier twice on one assignment.
126
+ 2. The subagent requests or attempts privileged actions outside the delegated scope.
127
+ 3. The subagent chains from an approved action into a different credential, tunnel, namespace, pod, cluster, account, or shared service.
128
+ 4. The user corrects the agent for acting beyond the requested scope.
129
+
130
+ Required STOP response:
131
+
132
+ 1. Stop that subagent lane; do not retry the same broad prompt.
133
+ 2. Preserve evidence and summarize the exact scope breach.
134
+ 3. Redesign the task into smaller bounded lanes with explicit allowed and forbidden actions.
135
+ 4. Reconfirm authorization before any irreversible shared-infrastructure or credential action.
136
+
137
+ Thirteen repeated security trips or repeated privileged retries are an anti-pattern: after the second trip, continuing without redesign is a coordination failure.
138
+
119
139
  ## Common Violations
120
140
 
121
141
  Key violations to avoid (file writes, git commands, bundled operations — all must be delegated):
@@ -13,6 +13,12 @@
13
13
  | 5: Conditional | TeamCreate, TeamDelete, SendMessage, TaskCreate, TaskGet, TaskList, TaskUpdate, TaskStop, TaskOutput | Available when Agent Teams enabled |
14
14
  | 6: MCP | ListMcpResourcesTool, ReadMcpResourceTool, CronCreate, CronDelete, CronList, RemoteTrigger | MCP/extension tools, available when servers configured |
15
15
 
16
+ ## Privileged Scope Boundaries
17
+
18
+ - Approval for one privileged action does not authorize follow-on privileged chaining. A request to remove one tunnel, secret, pod, namespace object, or credential does not authorize creating a replacement, rotating credentials, deleting adjacent resources, or executing inside production workloads.
19
+ - Treat credential-store reads as sensitive even when the tool is otherwise available. Never paste raw secret values or full credential-store transcripts into chat; use redacted names, fingerprints, or boolean verification results.
20
+ - Reconfirm scope before irreversible shared infrastructure or credential actions, including production pod exec/write, Kubernetes secret mutation, public tunnel creation, credential rotation, and shared namespace deletion.
21
+
16
22
  ## File Access
17
23
 
18
24
  | Operation | Allowed | Prohibited |
@@ -11,6 +11,13 @@
11
11
  | Commands | `rm -rf /` or broad deletes, shutdown/restart, sudo/su, network config changes |
12
12
  | External | Access URLs without approval, send user data externally, download/execute unknown scripts |
13
13
 
14
+ ## Credential and Shared-Infrastructure Guardrails
15
+
16
+ - Do not dump credential-store contents, `.env` files, OAuth material, kube secrets, or secret-manager values into transcripts, logs, issue bodies, or summaries. Report only the minimum redacted key names or verification status required for the task.
17
+ - Do not rotate, delete, recreate, or replace credentials unless the user explicitly requested that exact credential action.
18
+ - Before irreversible action on shared infrastructure or credentials, reconfirm the target, namespace/account/project, requested scope, rollback path, and user authorization.
19
+ - Stop instead of chaining privileged actions when the next step would affect a different credential, tunnel, namespace, pod, cluster, account, or shared service than the user requested.
20
+
14
21
  ## Required Before Destructive Operations
15
22
 
16
23
  Verify target, assess impact scope, check recoverability, get user approval.
@@ -27,6 +27,17 @@
27
27
  | Ambiguous | `[Confirm] Understood "{request}" as {interpretation}. Proceed?` |
28
28
  | Risky | `[Warning] This action has {risk}. Continue? Yes: {action} / No: Cancel` |
29
29
 
30
+ ## External Product UI Claims
31
+
32
+ Avoid presenting unverified external product UI field names, button labels, or step-by-step paths as fact. Product consoles change frequently and screenshots/docs may be stale.
33
+
34
+ When UI evidence is not directly verified:
35
+
36
+ - State that the UI path is an inference or may vary by account/version.
37
+ - Prefer stable facts already observed in the environment: URLs, config keys, resource IDs, CLI output, or API results.
38
+ - Ask the user to map those verified values into the current UI instead of inventing missing fields.
39
+ - If exact UI steps are required, verify against current official docs or a live screenshot before giving precise labels.
40
+
30
41
  ## Multiple Tasks
31
42
 
32
43
  - Dependent: Sequential
@@ -1,150 +1,239 @@
1
1
  ---
2
2
  name: skill-extractor
3
- description: Analyze task trajectories to propose reusable SKILL.md candidates from successful patterns
3
+ description: Analyze recurring task trajectories and evidence to propose reusable workflow packaging candidates
4
4
  scope: core
5
5
  user-invocable: true
6
- argument-hint: "[--threshold <n>] [--dry-run]"
7
- version: 1.0.0
6
+ argument-hint: "[--threshold <n>] [--dry-run] [--all]"
7
+ version: 1.1.0
8
8
  ---
9
9
 
10
10
  # Skill Extractor
11
11
 
12
- Analyze completed task outcomes to identify reusable patterns and propose new SKILL.md candidates. Inspired by Hermes Agent's self-learning skill extraction — adapted for oh-my-customcodex's compilation metaphor.
12
+ Analyze completed task outcomes and recent work evidence to identify recurring workflows that may deserve reusable packaging. Inspired by Hermes Agent's self-learning skill extraction — adapted for oh-my-customcodex's compilation metaphor.
13
13
 
14
14
  ## Philosophy
15
15
 
16
- In the compilation metaphor: task trajectories are runtime traces, and extracted skills are new source code. This skill turns successful execution patterns into reusable knowledge artifacts.
16
+ In the compilation metaphor: task trajectories are runtime traces, and extracted skills are new source code. This skill turns repeated, successful execution patterns into reusable knowledge artifacts only after evidence review and user approval.
17
17
 
18
18
  ```
19
- Runtime traces (task outcomes)Pattern analysisSKILL.md proposal → User approval → mgr-creator
19
+ Runtime traces + memory + rollout summaries + inventory Evidence-first shortlistPackaging recommendation → User approval → mgr-creator or automation owner
20
20
  ```
21
21
 
22
22
  ## Usage
23
23
 
24
24
  ```
25
- /skill-extractor # Analyze current session outcomes
26
- /skill-extractor --threshold 2 # Lower success threshold (default: 3)
27
- /skill-extractor --dry-run # Preview proposals without writing
25
+ /skill-extractor # Analyze current session outcomes and local evidence
26
+ /skill-extractor --threshold 2 # Lower recurring evidence threshold (default: 3)
27
+ /skill-extractor --dry-run # Preview shortlist without writing
28
+ /skill-extractor --all # Include broader session and memory history when available
28
29
  ```
29
30
 
30
31
  ## Options
31
32
 
32
33
  ```
33
- --threshold, -t Minimum success count for pattern qualification (default: 3)
34
- --dry-run, -d Preview proposals to stdout only, no file writes
35
- --all Include all sessions (not just current, requires task outcome history)
34
+ --threshold, -t Minimum evidence count for recurring-workflow qualification (default: 3)
35
+ --dry-run, -d Preview candidates to stdout only, no file writes
36
+ --all Include all sessions and available memory/history, not just current session
36
37
  ```
37
38
 
38
39
  ## Workflow
39
40
 
40
- ### Phase 1: Collect Task Outcomes
41
+ ### Phase 1: Evidence-First Candidate Discovery
41
42
 
42
- Read task outcome data from the session:
43
+ Collect candidates from concrete, dated evidence before proposing any packageable artifact. Prefer local evidence first; use optional integrations only when available.
43
44
 
44
- ```bash
45
- # Current session outcomes (from task-outcome-recorder hook)
46
- OUTCOMES_FILE="/tmp/.codex-task-outcomes-${PPID}"
47
- ```
45
+ Required and recommended sources:
46
+
47
+ 1. **Recent session outcomes** from the task-outcome-recorder hook:
48
+
49
+ ```bash
50
+ # Current session outcomes
51
+ OUTCOMES_FILE="/tmp/.codex-task-outcomes-${PPID}"
52
+ ```
53
+
54
+ 2. **Recent sessions and memory**: relevant `claude-mem`/OMX memory observations, session checkpoints, `.omx/notepad.md`, and `.omx/state/**` summaries when present.
55
+ 3. **Rollout or release summaries**: changelog entries, release notes, PR summaries, compatibility notes, and post-release follow-up reports that show repeated manual steps.
56
+ 4. **Optional Chronicle/history integrations**: Chronicle, shell history, or external task timeline summaries if installed and explicitly available. Do not fail when absent.
57
+ 5. **Existing inventory**: compare against `.codex/skills/*/SKILL.md`, `.codex/agents/*`, `templates/.claude/skills/*/SKILL.md`, and generated wiki pages to avoid duplicate or overlapping packaging.
48
58
 
49
- If file doesn't exist or is empty: report "No task outcomes recorded in this session." and stop.
59
+ If no evidence source exists or all sources are empty: report "No recurring workflow evidence found." and stop without creating anything.
60
+
61
+ Parse JSONL outcome entries when available. Each entry has:
50
62
 
51
- Parse JSONL entries. Each entry has:
52
63
  ```json
53
64
  {"agent_type": "lang-typescript-expert", "skill": "typescript-best-practices", "description": "Fix type error in auth module", "outcome": "success", "model": "sonnet", "timestamp": "2026-04-05T09:30:00Z", "duration_ms": 15000}
54
65
  ```
55
66
 
56
- ### Phase 2: Pattern Detection
67
+ For prose evidence, extract only dated or source-attributed observations, for example:
57
68
 
58
- Group outcomes by `(agent_type, skill)` tuple:
69
+ ```markdown
70
+ - 2026-05-24, release PR summary: repeated manual wiki parity fixes after adding new guides.
71
+ - 2026-05-22, memory #29064: sync-upstream-release-issues dry-run needed repeated validation after issue-reference parsing changes.
72
+ ```
59
73
 
74
+ ### Phase 2: Recurring Workflow Detection
75
+
76
+ Group evidence by the workflow being repeated, not only by `(agent_type, skill)` tuple. A workflow can span agents, commands, checklist steps, or release procedures.
77
+
78
+ ```
79
+ Workflow: release-docs-parity-check
80
+ → evidence_count: 4 dated occurrences
81
+ → successful_reuse: 3
82
+ → failure_or_friction: 1
83
+ → sources: [memory:29148, PR summary, changelog, wiki staleness check]
84
+ → existing_overlap: wiki, update-docs, sauron-watch
60
85
  ```
61
- Pattern: (lang-typescript-expert, typescript-best-practices)
62
- success: 5, failure: 1, total: 6
63
- → success_rate: 0.83
64
- descriptions: ["Fix type error...", "Refactor module...", ...]
86
+
87
+ Filter qualifying candidates:
88
+
89
+ - `evidence_count >= threshold` (default: 3), or two strong dated incidents plus high user impact.
90
+ - Repeated manual judgment or sequencing exists; one-off bugs are not enough.
91
+ - Evidence includes dates or source names, not vague recollection.
92
+ - Candidate is not already fully covered by an existing skill, custom subagent, automation, or documented checklist.
93
+ - R006 separation of concerns can be preserved: package one coherent responsibility, not a catch-all meta-agent.
94
+
95
+ ### Phase 3: Build the Shortlist
96
+
97
+ For each candidate, create a shortlist entry before generating any SKILL.md proposal. Every entry must include these fields:
98
+
99
+ ```markdown
100
+ ## Candidate: {candidate-name}
101
+
102
+ **Workflow**: {one-sentence recurring workflow description}
103
+ **Evidence / Dates**:
104
+ - {date or source}: {specific repeated task, success, failure, or friction point}
105
+ - {date or source}: {specific repeated task, success, failure, or friction point}
106
+ **Frequency / Confidence**: {count and low|medium|high confidence with reason}
107
+ **Recommended Form**: {Skill | Custom subagent | Automation | Skip}
108
+ **Duplicate / Overlap Check**: {existing skills, agents, hooks, scripts, docs, or "none found"}
109
+ **Why**: {why packaging would reduce repeated manual work or improve safety}
110
+ **Why Not**: {risks, overlap, insufficient evidence, or why a lighter form may be better}
65
111
  ```
66
112
 
67
- Filter qualifying patterns:
68
- - `success_count >= threshold` (default: 3)
69
- - `success_rate >= 0.8`
70
- - Not already an existing skill (check `.codex/skills/*/SKILL.md`)
113
+ Recommended-form guidance:
114
+
115
+ | Form | Use When | Do Not Use When |
116
+ |------|----------|-----------------|
117
+ | Skill | A repeatable human-invoked workflow/checklist improves outcomes and needs judgment | Existing skill already covers it or it is fully automatable |
118
+ | Custom subagent | A specialized role with stable responsibilities, tools, and boundaries is recurring | It is just a checklist or would violate R006 by mixing unrelated duties |
119
+ | Automation | The steps are deterministic, cheap to validate, and safe to run without judgment | User approval, external credentials, or destructive actions are required |
120
+ | Skip | Evidence is weak, duplicated, obsolete, or one-off | There is enough dated evidence and clear reuse value |
71
121
 
72
- ### Phase 3: Generate Proposals
122
+ ### Phase 4: Generate Packaging Proposals
73
123
 
74
- For each qualifying pattern, generate a SKILL.md proposal:
124
+ Only for shortlist entries whose recommended form is `Skill` or `Custom subagent`, generate a proposal. For `Automation`, recommend the script/hook/check location and required guardrails. For `Skip`, explain the evidence gap.
75
125
 
76
126
  ```markdown
77
- ## Proposal: {proposed-skill-name}
127
+ ## Proposal: {proposed-package-name}
78
128
 
79
- **Source Pattern**: {agent_type} + {skill} ({success_count} successes, {success_rate}% rate)
80
- **Confidence**: {low|medium|high} (based on count and rate)
129
+ **Recommended Form**: {Skill | Custom subagent | Automation | Skip}
130
+ **Source Workflow**: {workflow} ({evidence_count} evidence points, {confidence})
131
+ **Evidence Window**: {earliest date/source} → {latest date/source}
132
+ **Confidence**: {low|medium|high} (based on frequency, recency, and outcome consistency)
81
133
 
82
- ### Proposed SKILL.md
134
+ ### Proposed Artifact
83
135
 
84
136
  name: {proposed-name}
85
- description: {inferred from common description patterns}
137
+ description: {inferred from recurring workflow evidence}
86
138
  scope: core
87
- user-invocable: false
139
+ user-invocable: {true|false}
88
140
 
89
141
  ### Rationale
90
- {Why this pattern should be extracted as a skill — based on frequency and success rate}
142
+ {Why this workflow should be packaged — based on dates, frequency, success/failure pattern, and user impact}
91
143
 
92
- ### Overlap Check
93
- {List any existing skills with >50% keyword overlap}
144
+ ### Duplicate / Overlap Check
145
+ {List existing skills, agents, hooks, scripts, or wiki docs with meaningful overlap and how this proposal differs}
146
+
147
+ ### Guardrails
148
+ {User approval, R006 responsibility boundary, R020 verification requirement, dry-run behavior, and non-destructive defaults}
94
149
  ```
95
150
 
96
151
  **Confidence scoring**:
97
- | Successes | Rate | Confidence |
98
- |-----------|------|------------|
99
- | 3-5 | >= 0.8 | low |
100
- | 6-10 | >= 0.85 | medium |
101
- | 10+ | >= 0.9 | high |
102
152
 
103
- ### Phase 4: Present to User
153
+ | Evidence | Recency / Outcome | Confidence |
154
+ |----------|-------------------|------------|
155
+ | 2 strong incidents or 3 weak signals | Mixed outcomes or older than 90 days | low |
156
+ | 3-5 dated occurrences | Mostly successful or repeated friction in last 90 days | medium |
157
+ | 6+ dated occurrences | Clear recurrence, recent evidence, and stable success criteria | high |
104
158
 
105
- Display proposals in ranked order (highest confidence first):
159
+ ### Phase 5: Present to User
106
160
 
107
- ```
108
- [skill-extractor] {N} skill candidates detected
161
+ Display the shortlist in ranked order (highest confidence and lowest overlap first):
162
+
163
+ ```text
164
+ [skill-extractor] {N} recurring workflow candidates detected
109
165
 
110
- 1. [high] proposed-skill-name
111
- Source: {agent_type} + {skill} (12 successes, 92%)
112
- Description: {inferred description}
166
+ 1. [high] release-docs-parity-check
167
+ Workflow: Validate guide/wiki/template parity before release PRs
168
+ Evidence: 4 dated sources, latest 2026-05-24
169
+ Recommended form: Skill
170
+ Overlap: update-docs, sauron-watch (partial)
171
+ Why: Prevents repeated CI wiki-staleness failures
172
+ Why not: May be redundant if update-docs grows the same gate
113
173
 
114
- 2. [medium] another-skill-name
115
- Source: {agent_type} + {skill} (7 successes, 86%)
116
- Description: {inferred description}
174
+ 2. [medium] upstream-release-issue-validation
175
+ Workflow: Validate upstream issue references before sync workflows
176
+ Evidence: 3 dated sources, latest 2026-05-22
177
+ Recommended form: Automation
178
+ Overlap: sync-upstream-release-issues script
179
+ Why: Deterministic validation prevents repeated 404 workflow failures
180
+ Why not: Better as script/test than human-invoked skill
117
181
 
118
- Select [1-N] to create, "all" to create all, or "skip" to cancel:
182
+ Select [1-N] to create, "all" to create all approved packageable items, or "skip" to cancel:
119
183
  ```
120
184
 
121
- ### Phase 5: Create Skill (on approval)
185
+ ### Phase 6: Create Artifact (on approval only)
122
186
 
123
- Delegate to mgr-creator with the proposal context:
124
- - Proposed name and description
125
- - Source pattern data
126
- - Confidence level
127
- - Any overlap warnings
187
+ Never create or modify reusable artifacts without explicit user approval after showing the shortlist.
128
188
 
129
- mgr-creator handles: SKILL.md creation, template sync, ontology registration.
189
+ On approval:
190
+
191
+ - `Skill`: delegate to `mgr-creator` with the full shortlist entry, proposal, overlap warnings, and guardrails.
192
+ - `Custom subagent`: delegate to `mgr-creator` with the R006 responsibility boundary and required related skills/guides.
193
+ - `Automation`: hand off a scoped implementation recommendation; require a dry-run/default-safe mode and R020 verification evidence.
194
+ - `Skip`: record the decision only if the user asks to save it.
195
+
196
+ mgr-creator handles: SKILL.md creation, template sync, ontology registration, and generated docs parity.
197
+
198
+ ## Recurring-Workflow Packaging Checklist
199
+
200
+ Before recommending packaging, verify:
201
+
202
+ - [ ] Evidence is source-attributed and includes dates or stable identifiers.
203
+ - [ ] Frequency meets `--threshold` or has two strong high-impact incidents.
204
+ - [ ] The workflow has stable trigger conditions and a clear stop condition.
205
+ - [ ] Existing skills, agents, hooks, scripts, and wiki docs were checked for duplicate or partial coverage.
206
+ - [ ] Recommended form is justified as `Skill`, `Custom subagent`, `Automation`, or `Skip`.
207
+ - [ ] `Why` and `Why Not` both name concrete evidence.
208
+ - [ ] R006 is preserved: one coherent responsibility and clear boundaries.
209
+ - [ ] R020 is preserved: proposal includes verification evidence or a test/check path.
210
+ - [ ] User approval is required before any artifact creation or mutation.
211
+ - [ ] Dry-run/no-write behavior remains available for review-only usage.
130
212
 
131
213
  ## Integration
132
214
 
133
215
  | System | How |
134
216
  |--------|-----|
135
- | task-outcome-recorder | Reads JSONL outcomes as input data |
136
- | feedback-collector | Complementary: feedback-collector extracts failure patterns, skill-extractor extracts success patterns |
137
- | mgr-creator | Delegated skill creation on user approval |
217
+ | task-outcome-recorder | Reads JSONL outcomes as one input data source |
218
+ | memory-management / memory-recall | Supplies dated recurring-workflow evidence when available |
219
+ | rollout and release summaries | Surface repeated manual release, docs, and compatibility procedures |
220
+ | optional Chronicle/history | Adds timeline evidence when installed; absence is non-fatal |
221
+ | existing skills and agents inventory | Prevents duplicate skills, subagents, or automations |
222
+ | feedback-collector | Complementary: feedback-collector extracts failure patterns, skill-extractor extracts recurring packageable workflows |
223
+ | mgr-creator | Delegated skill or subagent creation on user approval |
138
224
  | skills-sh-search | Check agentskills.io for existing equivalent before creating |
139
- | R011 (memory) | User Model tracks extraction decisions in Override Decisions |
225
+ | R006 | Enforces coherent responsibility boundaries for custom subagents and skills |
226
+ | R011 (memory) | User Model tracks extraction decisions in Override Decisions when explicitly saved |
227
+ | R020 | Requires verification evidence before completion claims |
140
228
 
141
229
  ## Hook Integration
142
230
 
143
231
  The `skill-extractor-analyzer.sh` Stop hook provides a lightweight pre-analysis:
232
+
144
233
  - Reads task outcomes file
145
- - Counts qualifying patterns
234
+ - Counts qualifying recurring patterns
146
235
  - Emits advisory stderr message if candidates found
147
- - Does NOT create skills (that requires user approval via the skill)
236
+ - Does NOT create skills, subagents, or automation (that requires user approval via the skill)
148
237
 
149
238
  ## Compatibility Artifact Protocol
150
239
 
@@ -152,8 +241,11 @@ Sensitive-path compatibility note: when delegated work touches `.claude/outputs/
152
241
 
153
242
  ## Safety
154
243
 
155
- - **User approval required**: Never auto-creates skills
156
- - **Overlap check**: Prevents duplicating existing skills
244
+ - **User approval required**: Never auto-creates skills, subagents, or automation
245
+ - **Evidence-first**: Never recommends packaging from vague memory or unverified anecdotes
246
+ - **Overlap check**: Prevents duplicating existing skills, agents, hooks, scripts, or docs
247
+ - **R006 guardrail**: Rejects catch-all artifacts with mixed responsibilities
248
+ - **R020 guardrail**: Every approved artifact must include a verification path before completion is claimed
157
249
  - **Dry-run mode**: Preview without side effects
158
250
  - **Advisory hook**: Stop hook is advisory-only (exit 0)
159
- - **Confidence transparency**: All proposals show confidence scores
251
+ - **Confidence transparency**: All shortlist entries and proposals show confidence scores and evidence dates
@@ -2,6 +2,30 @@
2
2
 
3
3
  This guide records Claude Code release-note impact that affects the Claude compatibility template. The Codex-native runtime still uses `.codex/**` and OMX as the primary surface.
4
4
 
5
+ ## v2.1.158
6
+
7
+ Published: 2026-05-30.
8
+
9
+ Source: upstream oh-my-customcode #1264, Codex port #1436.
10
+
11
+ | Change | Impact on oh-my-customcodex | Action |
12
+ |--------|------------------------------|--------|
13
+ | Auto mode is available on Bedrock, Vertex, and Foundry for Opus 4.7 and Opus 4.8 via `CLAUDE_CODE_ENABLE_AUTO_MODE=1` | Claude compatibility sessions can opt into provider-backed auto mode for those Opus surfaces. Codex-native model routing and approval policy are unchanged. | Document as Claude provider compatibility only. Do not infer Codex auto-mode behavior from this env var. |
14
+
15
+ ## v2.1.157
16
+
17
+ Published: 2026-05-29.
18
+
19
+ Source: upstream oh-my-customcode #1265, Codex port #1437.
20
+
21
+ | Change | Impact on oh-my-customcodex | Action |
22
+ |--------|------------------------------|--------|
23
+ | Plugins under `.claude/skills` auto-load, `claude plugin init <name>` scaffolds plugins there, and `/plugin` has argument autocomplete | Useful for Claude compatibility plugin setup. Codex-native skills remain under `.codex/skills`, `.agents/skills`, and OMX skill roots. | Keep `.codex/OMX` primary. Mention `.claude/skills` only when documenting Claude plugin compatibility. |
24
+ | `claude agents` honors the `agent` field in `settings.json`, with `--agent <name>` as an override | Claude dispatched sessions can inherit a configured default agent unless explicitly overridden. | Do not mirror this into Codex routing. Codex native subagents still follow prompt routing, role metadata, and explicit delegation. |
25
+ | `EnterWorktree` can switch between Claude-managed worktrees mid-session, and Claude-managed worktrees are left unlocked for `git worktree remove`/`prune` cleanup | Claude worktree lifecycle is more flexible and less likely to leave locked cleanup blockers. | Keep auto-dev work in clean worktrees, verify `git status`, and do not treat Claude-managed worktree state as OMX state. |
26
+ | `tool_decision` telemetry can include `tool_parameters` such as Bash commands and MCP/skill names when `OTEL_LOG_TOOL_DETAILS=1` | Telemetry may contain more detailed operational data, including command and tool-parameter strings. | Treat logs as potentially sensitive. Avoid exporting transcripts or telemetry that may expose secrets, credentials, or privileged commands. |
27
+ | Background/session fixes cover parked subagents, leaked background shells, orphaned `.claude/worktrees`, resume state, date after sleep/wake, fullscreen picker cleanup, current linked worktree return, image placeholders, network prompts, and tmux clipboard behavior | Reduces false blockers and stale-state surprises in Claude compatibility sessions. | No Codex runtime change. Continue using OMX state, Codex worktree checks, and direct command evidence for Codex-native completion claims. |
28
+
5
29
  ## v2.1.156
6
30
 
7
31
  Published: 2026-05-29.
@@ -1,11 +1,11 @@
1
1
  {
2
- "version": "0.5.9",
2
+ "version": "0.5.11",
3
3
  "requiresCC": ">=2.1.121",
4
4
  "claudeCode": {
5
5
  "minimumVersion": "2.1.121",
6
6
  "protectedPathBypassVersion": "2.1.126"
7
7
  },
8
- "lastUpdated": "2026-05-30T00:00:00.000Z",
8
+ "lastUpdated": "2026-06-01T00:00:00.000Z",
9
9
  "components": [
10
10
  {
11
11
  "name": "rules",