oh-my-customcodex 0.5.8 → 0.5.10

Files changed (33)
  1. package/README.md +4 -4
  2. package/dist/cli/index.js +1 -1
  3. package/dist/index.js +1 -1
  4. package/package.json +1 -1
  5. package/templates/.claude/hooks/scripts/agent-teams-advisor.sh +4 -1
  6. package/templates/.claude/rules/MUST-agent-teams.md +85 -246
  7. package/templates/.claude/rules/MUST-completion-verification.md +12 -0
  8. package/templates/.claude/rules/MUST-orchestrator-coordination.md +20 -0
  9. package/templates/.claude/rules/MUST-permissions.md +6 -0
  10. package/templates/.claude/rules/MUST-safety.md +7 -0
  11. package/templates/.claude/rules/SHOULD-interaction.md +11 -0
  12. package/templates/.claude/skills/de-lead-routing/SKILL.md +6 -13
  13. package/templates/.claude/skills/dev-lead-routing/SKILL.md +6 -13
  14. package/templates/.claude/skills/intent-detection/SKILL.md +7 -9
  15. package/templates/.claude/skills/research/SKILL.md +8 -23
  16. package/templates/.claude/skills/roundtable-debate/SKILL.md +3 -4
  17. package/templates/.claude/skills/skill-extractor/SKILL.md +165 -73
  18. package/templates/.claude/skills/structured-dev-cycle/SKILL.md +7 -10
  19. package/templates/AGENTS.md.en +1 -2
  20. package/templates/AGENTS.md.ko +1 -2
  21. package/templates/CLAUDE.md +2 -2
  22. package/templates/CLAUDE.md.en +1 -2
  23. package/templates/CLAUDE.md.ko +1 -2
  24. package/templates/README.md +2 -2
  25. package/templates/guides/claude-code/15-version-compatibility.md +24 -0
  26. package/templates/guides/multi-agent-debate-patterns/README.md +1 -1
  27. package/templates/guides/multi-provider-exec/README.md +9 -79
  28. package/templates/manifest.json +3 -3
  29. package/templates/.claude/skills/agora/SKILL.md +0 -209
  30. package/templates/.claude/skills/codex-exec/SKILL.md +0 -218
  31. package/templates/.claude/skills/codex-exec/scripts/codex-wrapper.cjs +0 -433
  32. package/templates/.claude/skills/gemini-exec/SKILL.md +0 -215
  33. package/templates/.claude/skills/gemini-exec/scripts/gemini-wrapper.cjs +0 -485
@@ -104,24 +104,17 @@ Check if Agent Teams is available (`OMCODEX_AGENT_TEAMS=1` or TeamCreate/SendMes
104
104
  | Cross-layer debugging (FE + BE + DB) | Agent Teams |
105
105
  | Simple file search/validation | Task Tool |
106
106
 
107
- ### Step 2: Codex-Exec Hybrid (Implementation Tasks)
107
+ ### Step 2: External Interop Guidance (Implementation Tasks)
108
108
  For **new file creation**, **boilerplate**, or **test code generation**:
109
109
 
110
- 1. Check `/tmp/.codex-env-status-*` for codex, gemini, and rtk availability
111
- 2. If codex available AND task involves new file creation → automatically delegate to `/codex-exec` for scaffolding:
112
- - Display: `[Codex Hybrid] Delegating to codex-exec...`
113
- - codex-exec generates initial code (strength: fast generation)
114
- - Selected Claude expert reviews and refines codex output (strength: reasoning, quality)
115
- 3. If codex unavailable but gemini available → delegate to `/gemini-exec` for scaffolding:
116
- - Display: `[Gemini Hybrid] Delegating to gemini-exec...`
117
- - gemini-exec generates initial code
118
- - Selected Claude expert reviews and refines output
119
- 4. If RTK available (`RTK=available` in env status) → optionally wrap Claude expert output through RTK to reduce token consumption by 60-90%:
110
+ 1. Use the selected expert agent as the default implementation path.
111
+ 2. If the native Claude Code plugin `openai/codex-plugin-cc` is explicitly installed and requested, it may provide Codex interop for scaffolding before expert review.
112
+ 3. If RTK is available (`RTK=available` in env status), optionally wrap expert output through `rtk-exec` to reduce token consumption by 60-90%:
120
113
  - Display: `[RTK Proxy] Token optimization active via rtk-exec`
121
114
  - RTK acts as a transparent proxy — no change to expert selection
122
- 5. If none available → display `[External CLI] Unavailable — proceeding with {expert} directly` and use Claude expert directly
115
+ 4. Otherwise display `[External CLI] Not requested — proceeding with {expert} directly` and use the expert directly.
123
116
 
124
- **Suitable**: New file creation, boilerplate, scaffolding, test code
117
+ **Suitable for optional plugin interop**: New file creation, boilerplate, scaffolding, test code
125
118
  **Unsuitable**: Existing code modification, architecture decisions, bug fixes
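
The revised Step 2 ordering can be sketched as a small planning function. This is an illustrative sketch only: the function name, the boolean parameters, and the step tags are hypothetical and do not appear in the package. It assumes the expert agent is always the default path and that plugin interop applies only when the plugin is both installed and explicitly requested.

```python
def plan_implementation(expert: str,
                        plugin_installed: bool = False,
                        plugin_requested: bool = False,
                        rtk_available: bool = False) -> list[str]:
    """Return the ordered steps for one implementation task (hypothetical tags)."""
    steps = []
    # Optional Codex interop requires BOTH conditions, per the revised guidance.
    if plugin_installed and plugin_requested:
        steps.append("codex-plugin-scaffold")
    # The selected expert agent is always on the path (default implementation).
    steps.append(f"expert:{expert}")
    # RTK acts as a transparent proxy and does not change expert selection.
    if rtk_available:
        steps.append("rtk-exec-proxy")
    return steps


if __name__ == "__main__":
    print(plan_implementation("lang-typescript-expert", plugin_installed=True))
```

Note that an installed but unrequested plugin leaves the plan identical to the expert-only path, which matches the "explicitly installed and requested" wording in the added lines.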
126
119
 
127
120
  ### Step 3: Expert Agent Selection
@@ -262,14 +262,12 @@ english:
262
262
  ```
263
263
  Research intent detected (confidence >= 70%)
264
264
 
265
- Check Codex CLI availability
266
- ├─ Available (codex binary + OPENAI_API_KEY)
267
- │ → Use codex-exec skill with --effort xhigh
268
- │ → Prompt: "Research and analyze: {user_request}"
269
- │ → Returns: structured findings for orchestrator
270
- └─ Unavailable
271
- → Fall back to Claude's WebFetch/WebSearch
272
- → Orchestrator handles directly or via general-purpose agent
265
+ Use the research workflow or researcher/expert agents
266
+ ├─ Native Claude Code plugin `openai/codex-plugin-cc` explicitly installed/requested
267
+ │ → Optional Codex interop may assist analysis
268
+ └─ Otherwise
269
+ → Use existing expert agents, researcher, WebFetch/WebSearch, or RTK paths
270
+ → Orchestrator handles directly or via specialist agent
273
271
  ```
274
272
 
275
273
  ### Confidence Scoring
@@ -288,6 +286,6 @@ Check Codex CLI availability
288
286
  ├── Input: "{user input}"
289
287
  ├── Workflow: research-workflow
290
288
  ├── Confidence: {percentage}%
291
- ├── Method: codex-exec (xhigh) | WebFetch fallback
289
+ ├── Method: research workflow | researcher/expert agent | optional Codex plugin when requested
292
290
  └── Reason: {explanation}
293
291
  ```
@@ -167,17 +167,8 @@ Batch 3: T9, T10 (Innovation)
167
167
 
168
168
  Before starting verification rounds, check codex availability:
169
169
 
170
- ```bash
171
- # Run this check once before Phase 2 begins
172
- which codex &>/dev/null && [ -n "$OPENAI_API_KEY" ]
173
- # Exit 0 → codex available: enable dual-model verification (opus + codex)
174
- # Exit 1 → codex unavailable: display notice and proceed with opus-only
175
- ```
176
-
177
- If unavailable, display: `[Phase 2] Codex unavailable — opus-only verification`
178
-
179
170
  ```
180
- Team findings ──→ opus 4.6 verification ──→ codex-exec xhigh verification (if available)
171
+ Team findings ──→ opus 4.6 verification ──→ optional plugin-assisted verification
181
172
  │ │
182
173
  └── Contradiction detected? ── YES ──→ Round N+1
183
174
  NO ──→ Consensus reached → Phase 3
@@ -185,8 +176,7 @@ Team findings ──→ opus 4.6 verification ──→ codex-exec xhigh verific
185
176
 
186
177
  Each round:
187
178
  1. **opus 4.6**: Deep reasoning verification — checks logical consistency, identifies gaps, challenges assumptions
188
- 2. **codex-exec xhigh** (when available): Independent code-level verification validates technical claims, tests feasibility
189
- - If unavailable: display `[Phase 2] Round {N}: Codex unavailable, proceeding with opus verification only`
179
+ 2. **Optional Codex interop**: Use native Claude Code plugin `openai/codex-plugin-cc` only when explicitly installed/requested; otherwise use researcher or RTK-backed local evidence gathering.
190
180
  3. **Contradiction resolution**: Reconcile divergent findings between teams and verifiers
191
181
  4. **Convergence check**: All major claims verified with no outstanding contradictions → proceed
192
182
 
@@ -319,16 +309,11 @@ Round N:
319
309
  - Internal consistency (breadth ↔ depth alignment)
320
310
  - Cross-domain consistency (security ↔ architecture)
321
311
  - Evidence quality (claims without backing)
322
- Step 2: codex-exec validates technical claims (when available):
323
- a. Invoke: /codex-exec with findings from all teams
324
- b. Prompt: "Validate technical claims: {findings}.
325
- Check code patterns, benchmark reproducibility,
326
- dependency resolution."
327
- c. Effort: --effort xhigh
328
- d. Parse: contradictions → merge with opus findings
329
- e. On timeout/error: log "[Phase 2] Round {N}: codex-exec error — {reason},
330
- continuing with opus results only"
331
- If unavailable: log "[Phase 2] Round {N}: Codex unavailable, proceeding with opus verification only"
312
+ Step 2: Optional plugin-assisted validation (only if `openai/codex-plugin-cc` is explicitly installed/requested):
313
+ a. Validate technical claims against code patterns, benchmark reproducibility,
314
+ and dependency resolution.
315
+ b. Parse contradictions → merge with opus findings.
316
+ c. On unavailable plugin or error: log "[Phase 2] Round {N}: optional Codex interop unavailable — continuing with opus/researcher results".
332
317
  Step 3: Compile contradiction list
333
318
  - 0 contradictions → CONVERGED
334
319
  - >0 contradictions → feedback to relevant teams → Round N+1
@@ -410,7 +395,7 @@ This advisory is informational only and does not block execution.
410
395
 
411
396
  | Scenario | Fallback |
412
397
  |----------|----------|
413
- | codex-exec unavailable | opus-only verification (still min 2 rounds) |
398
+ | Optional Codex plugin unavailable | opus/researcher verification (still min 2 rounds) |
414
399
  | Agent Teams unavailable | Standard Agent tool with R009 batching |
415
400
  | Partial team failure | Synthesize from available results, note gaps in report |
416
401
  | GitHub issue creation fails | Output report to conversation only |
@@ -12,7 +12,7 @@ version: 1.0.0
12
12
 
13
13
  ## Purpose
14
14
 
15
- Run a bounded debate when convergence would hide useful disagreement. Unlike `agora`, which drives toward consensus, this workflow preserves minority positions and requires explicit justification before dismissing them.
15
+ Run a bounded debate when convergence would hide useful disagreement. Use this workflow when debate should preserve minority positions and require explicit justification before dismissing them.
16
16
 
17
17
  ## When To Use
18
18
 
@@ -54,11 +54,10 @@ Hard cap: two debate rounds. If the decision still depends on missing facts, sto
54
54
  {adopt | defer | reject | gather-more-evidence}
55
55
  ```
56
56
 
57
- ## Relationship To Agora
57
+ ## Relationship To Other Review Workflows
58
58
 
59
59
  | Workflow | Goal | Best For |
60
60
  |----------|------|----------|
61
- | `agora` | adversarial consensus | release gates, spec approval |
62
61
  | `roundtable-debate` | dissent preservation | ambiguous strategy, architectural tradeoffs |
63
62
 
64
- Use `agora --anti-groupthink` when you need consensus plus explicit dissent handling.
63
+ Use `roundtable-debate` when current guidance needs adversarial review plus explicit dissent handling.
@@ -1,150 +1,239 @@
1
1
  ---
2
2
  name: skill-extractor
3
- description: Analyze task trajectories to propose reusable SKILL.md candidates from successful patterns
3
+ description: Analyze recurring task trajectories and evidence to propose reusable workflow packaging candidates
4
4
  scope: core
5
5
  user-invocable: true
6
- argument-hint: "[--threshold <n>] [--dry-run]"
7
- version: 1.0.0
6
+ argument-hint: "[--threshold <n>] [--dry-run] [--all]"
7
+ version: 1.1.0
8
8
  ---
9
9
 
10
10
  # Skill Extractor
11
11
 
12
- Analyze completed task outcomes to identify reusable patterns and propose new SKILL.md candidates. Inspired by Hermes Agent's self-learning skill extraction — adapted for oh-my-customcodex's compilation metaphor.
12
+ Analyze completed task outcomes and recent work evidence to identify recurring workflows that may deserve reusable packaging. Inspired by Hermes Agent's self-learning skill extraction — adapted for oh-my-customcodex's compilation metaphor.
13
13
 
14
14
  ## Philosophy
15
15
 
16
- In the compilation metaphor: task trajectories are runtime traces, and extracted skills are new source code. This skill turns successful execution patterns into reusable knowledge artifacts.
16
+ In the compilation metaphor: task trajectories are runtime traces, and extracted skills are new source code. This skill turns repeated, successful execution patterns into reusable knowledge artifacts only after evidence review and user approval.
17
17
 
18
18
  ```
19
- Runtime traces (task outcomes) → Pattern analysis → SKILL.md proposal → User approval → mgr-creator
19
+ Runtime traces + memory + rollout summaries + inventory → Evidence-first shortlist → Packaging recommendation → User approval → mgr-creator or automation owner
20
20
  ```
21
21
 
22
22
  ## Usage
23
23
 
24
24
  ```
25
- /skill-extractor # Analyze current session outcomes
26
- /skill-extractor --threshold 2 # Lower success threshold (default: 3)
27
- /skill-extractor --dry-run # Preview proposals without writing
25
+ /skill-extractor # Analyze current session outcomes and local evidence
26
+ /skill-extractor --threshold 2 # Lower recurring evidence threshold (default: 3)
27
+ /skill-extractor --dry-run # Preview shortlist without writing
28
+ /skill-extractor --all # Include broader session and memory history when available
28
29
  ```
29
30
 
30
31
  ## Options
31
32
 
32
33
  ```
33
- --threshold, -t Minimum success count for pattern qualification (default: 3)
34
- --dry-run, -d Preview proposals to stdout only, no file writes
35
- --all Include all sessions (not just current, requires task outcome history)
34
+ --threshold, -t Minimum evidence count for recurring-workflow qualification (default: 3)
35
+ --dry-run, -d Preview candidates to stdout only, no file writes
36
+ --all Include all sessions and available memory/history, not just current session
36
37
  ```
37
38
 
38
39
  ## Workflow
39
40
 
40
- ### Phase 1: Collect Task Outcomes
41
+ ### Phase 1: Evidence-First Candidate Discovery
41
42
 
42
- Read task outcome data from the session:
43
+ Collect candidates from concrete, dated evidence before proposing any packageable artifact. Prefer local evidence first; use optional integrations only when available.
43
44
 
44
- ```bash
45
- # Current session outcomes (from task-outcome-recorder hook)
46
- OUTCOMES_FILE="/tmp/.codex-task-outcomes-${PPID}"
47
- ```
45
+ Required and recommended sources:
46
+
47
+ 1. **Recent session outcomes** from the task-outcome-recorder hook:
48
+
49
+ ```bash
50
+ # Current session outcomes
51
+ OUTCOMES_FILE="/tmp/.codex-task-outcomes-${PPID}"
52
+ ```
53
+
54
+ 2. **Recent sessions and memory**: relevant `claude-mem`/OMX memory observations, session checkpoints, `.omx/notepad.md`, and `.omx/state/**` summaries when present.
55
+ 3. **Rollout or release summaries**: changelog entries, release notes, PR summaries, compatibility notes, and post-release follow-up reports that show repeated manual steps.
56
+ 4. **Optional Chronicle/history integrations**: Chronicle, shell history, or external task timeline summaries if installed and explicitly available. Do not fail when absent.
57
+ 5. **Existing inventory**: compare against `.codex/skills/*/SKILL.md`, `.codex/agents/*`, `templates/.claude/skills/*/SKILL.md`, and generated wiki pages to avoid duplicate or overlapping packaging.
48
58
 
49
- If file doesn't exist or is empty: report "No task outcomes recorded in this session." and stop.
59
+ If no evidence source exists or all sources are empty: report "No recurring workflow evidence found." and stop without creating anything.
60
+
61
+ Parse JSONL outcome entries when available. Each entry has:
50
62
 
51
- Parse JSONL entries. Each entry has:
52
63
  ```json
53
64
  {"agent_type": "lang-typescript-expert", "skill": "typescript-best-practices", "description": "Fix type error in auth module", "outcome": "success", "model": "sonnet", "timestamp": "2026-04-05T09:30:00Z", "duration_ms": 15000}
54
65
  ```
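
As a rough illustration of how JSONL outcome entries like the one above could be read, the following sketch groups entries by `(agent_type, skill)` and counts outcomes. The function name `summarize_outcomes` is hypothetical; only the field names come from the example entry. The choice to skip malformed lines rather than abort is an assumption, not documented behavior.

```python
import json
from collections import Counter, defaultdict


def summarize_outcomes(jsonl_text: str) -> dict:
    """Group outcome entries by (agent_type, skill) and count each outcome value."""
    groups = defaultdict(Counter)
    for line in jsonl_text.splitlines():
        line = line.strip()
        if not line:
            continue
        try:
            entry = json.loads(line)
        except json.JSONDecodeError:
            continue  # assumption: tolerate malformed lines instead of failing
        if not isinstance(entry, dict):
            continue
        key = (entry.get("agent_type"), entry.get("skill"))
        groups[key][entry.get("outcome", "unknown")] += 1
    return {key: dict(counts) for key, counts in groups.items()}


if __name__ == "__main__":
    sample = (
        '{"agent_type": "lang-typescript-expert", "skill": "typescript-best-practices", "outcome": "success"}\n'
        '{"agent_type": "lang-typescript-expert", "skill": "typescript-best-practices", "outcome": "failure"}\n'
    )
    print(summarize_outcomes(sample))
```

Tuple counts of this kind are only one input in the 0.5.10 text, which groups evidence by workflow rather than by tuple alone.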
55
66
 
56
- ### Phase 2: Pattern Detection
67
+ For prose evidence, extract only dated or source-attributed observations, for example:
57
68
 
58
- Group outcomes by `(agent_type, skill)` tuple:
69
+ ```markdown
70
+ - 2026-05-24, release PR summary: repeated manual wiki parity fixes after adding new guides.
71
+ - 2026-05-22, memory #29064: sync-upstream-release-issues dry-run needed repeated validation after issue-reference parsing changes.
72
+ ```
59
73
 
74
+ ### Phase 2: Recurring Workflow Detection
75
+
76
+ Group evidence by the workflow being repeated, not only by `(agent_type, skill)` tuple. A workflow can span agents, commands, checklist steps, or release procedures.
77
+
78
+ ```
79
+ Workflow: release-docs-parity-check
80
+ → evidence_count: 4 dated occurrences
81
+ → successful_reuse: 3
82
+ → failure_or_friction: 1
83
+ → sources: [memory:29148, PR summary, changelog, wiki staleness check]
84
+ → existing_overlap: wiki, update-docs, sauron-watch
60
85
  ```
61
- Pattern: (lang-typescript-expert, typescript-best-practices)
62
- → success: 5, failure: 1, total: 6
63
- → success_rate: 0.83
64
- → descriptions: ["Fix type error...", "Refactor module...", ...]
86
+
87
+ Filter qualifying candidates:
88
+
89
+ - `evidence_count >= threshold` (default: 3), or two strong dated incidents plus high user impact.
90
+ - Repeated manual judgment or sequencing exists; one-off bugs are not enough.
91
+ - Evidence includes dates or source names, not vague recollection.
92
+ - Candidate is not already fully covered by an existing skill, custom subagent, automation, or documented checklist.
93
+ - R006 separation of concerns can be preserved: package one coherent responsibility, not a catch-all meta-agent.
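
The qualification rules above could be expressed as a single predicate. This is a minimal sketch under stated assumptions: the function and parameter names are hypothetical, and the judgment-based criteria (repeated sequencing, R006 coherence) are reduced to booleans that a reviewer would have to supply.

```python
def qualifies(evidence_count: int,
              threshold: int = 3,
              strong_incidents: int = 0,
              high_impact: bool = False,
              has_dated_sources: bool = True,
              fully_covered: bool = False,
              single_responsibility: bool = True) -> bool:
    """Return True when a candidate workflow passes the shortlist filter."""
    # Hard exclusions: vague evidence, existing full coverage, or an R006 violation.
    if not has_dated_sources or fully_covered or not single_responsibility:
        return False
    # Primary path: enough evidence points to meet the threshold (default 3).
    if evidence_count >= threshold:
        return True
    # Alternative path: two strong dated incidents plus high user impact.
    return strong_incidents >= 2 and high_impact


if __name__ == "__main__":
    print(qualifies(4))
    print(qualifies(2, strong_incidents=2, high_impact=True))
```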
94
+
95
+ ### Phase 3: Build the Shortlist
96
+
97
+ For each candidate, create a shortlist entry before generating any SKILL.md proposal. Every entry must include these fields:
98
+
99
+ ```markdown
100
+ ## Candidate: {candidate-name}
101
+
102
+ **Workflow**: {one-sentence recurring workflow description}
103
+ **Evidence / Dates**:
104
+ - {date or source}: {specific repeated task, success, failure, or friction point}
105
+ - {date or source}: {specific repeated task, success, failure, or friction point}
106
+ **Frequency / Confidence**: {count and low|medium|high confidence with reason}
107
+ **Recommended Form**: {Skill | Custom subagent | Automation | Skip}
108
+ **Duplicate / Overlap Check**: {existing skills, agents, hooks, scripts, docs, or "none found"}
109
+ **Why**: {why packaging would reduce repeated manual work or improve safety}
110
+ **Why Not**: {risks, overlap, insufficient evidence, or why a lighter form may be better}
65
111
  ```
66
112
 
67
- Filter qualifying patterns:
68
- - `success_count >= threshold` (default: 3)
69
- - `success_rate >= 0.8`
70
- - Not already an existing skill (check `.codex/skills/*/SKILL.md`)
113
+ Recommended-form guidance:
114
+
115
+ | Form | Use When | Do Not Use When |
116
+ |------|----------|-----------------|
117
+ | Skill | A repeatable human-invoked workflow/checklist improves outcomes and needs judgment | Existing skill already covers it or it is fully automatable |
118
+ | Custom subagent | A specialized role with stable responsibilities, tools, and boundaries is recurring | It is just a checklist or would violate R006 by mixing unrelated duties |
119
+ | Automation | The steps are deterministic, cheap to validate, and safe to run without judgment | User approval, external credentials, or destructive actions are required |
120
+ | Skip | Evidence is weak, duplicated, obsolete, or one-off | There is enough dated evidence and clear reuse value |
71
121
 
72
- ### Phase 3: Generate Proposals
122
+ ### Phase 4: Generate Packaging Proposals
73
123
 
74
- For each qualifying pattern, generate a SKILL.md proposal:
124
+ Only for shortlist entries whose recommended form is `Skill` or `Custom subagent`, generate a proposal. For `Automation`, recommend the script/hook/check location and required guardrails. For `Skip`, explain the evidence gap.
75
125
 
76
126
  ```markdown
77
- ## Proposal: {proposed-skill-name}
127
+ ## Proposal: {proposed-package-name}
78
128
 
79
- **Source Pattern**: {agent_type} + {skill} ({success_count} successes, {success_rate}% rate)
80
- **Confidence**: {low|medium|high} (based on count and rate)
129
+ **Recommended Form**: {Skill | Custom subagent | Automation | Skip}
130
+ **Source Workflow**: {workflow} ({evidence_count} evidence points, {confidence})
131
+ **Evidence Window**: {earliest date/source} → {latest date/source}
132
+ **Confidence**: {low|medium|high} (based on frequency, recency, and outcome consistency)
81
133
 
82
- ### Proposed SKILL.md
134
+ ### Proposed Artifact
83
135
 
84
136
  name: {proposed-name}
85
- description: {inferred from common description patterns}
137
+ description: {inferred from recurring workflow evidence}
86
138
  scope: core
87
- user-invocable: false
139
+ user-invocable: {true|false}
88
140
 
89
141
  ### Rationale
90
- {Why this pattern should be extracted as a skill — based on frequency and success rate}
142
+ {Why this workflow should be packaged — based on dates, frequency, success/failure pattern, and user impact}
91
143
 
92
- ### Overlap Check
93
- {List any existing skills with >50% keyword overlap}
144
+ ### Duplicate / Overlap Check
145
+ {List existing skills, agents, hooks, scripts, or wiki docs with meaningful overlap and how this proposal differs}
146
+
147
+ ### Guardrails
148
+ {User approval, R006 responsibility boundary, R020 verification requirement, dry-run behavior, and non-destructive defaults}
94
149
  ```
95
150
 
96
151
  **Confidence scoring**:
97
- | Successes | Rate | Confidence |
98
- |-----------|------|------------|
99
- | 3-5 | >= 0.8 | low |
100
- | 6-10 | >= 0.85 | medium |
101
- | 10+ | >= 0.9 | high |
102
152
 
103
- ### Phase 4: Present to User
153
+ | Evidence | Recency / Outcome | Confidence |
154
+ |----------|-------------------|------------|
155
+ | 2 strong incidents or 3 weak signals | Mixed outcomes or older than 90 days | low |
156
+ | 3-5 dated occurrences | Mostly successful or repeated friction in last 90 days | medium |
157
+ | 6+ dated occurrences | Clear recurrence, recent evidence, and stable success criteria | high |
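
One possible reading of the confidence table is sketched below. It is a simplification, not the skill's actual scoring: the function name and parameters are hypothetical, "recent" is taken to mean within 90 days of `today`, and the qualitative columns (mixed outcomes, stable success criteria) are collapsed into one boolean.

```python
from datetime import date


def score_confidence(dated_occurrences: int,
                     latest_evidence: date,
                     today: date,
                     stable_criteria: bool = False) -> str:
    """Map evidence count and recency to low, medium, or high confidence."""
    recent = (today - latest_evidence).days <= 90
    # High: 6+ dated occurrences, recent evidence, and stable success criteria.
    if dated_occurrences >= 6 and recent and stable_criteria:
        return "high"
    # Medium: 3 or more dated occurrences within the last 90 days.
    if dated_occurrences >= 3 and recent:
        return "medium"
    # Low: anything weaker, including evidence older than 90 days.
    return "low"


if __name__ == "__main__":
    print(score_confidence(4, date(2026, 5, 24), date(2026, 6, 1)))
```

Under this reading, six or more occurrences without stable success criteria fall back to medium, which is a design choice of the sketch rather than something the table states.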
104
158
 
105
- Display proposals in ranked order (highest confidence first):
159
+ ### Phase 5: Present to User
106
160
 
107
- ```
108
- [skill-extractor] {N} skill candidates detected
161
+ Display the shortlist in ranked order (highest confidence and lowest overlap first):
162
+
163
+ ```text
164
+ [skill-extractor] {N} recurring workflow candidates detected
109
165
 
110
- 1. [high] proposed-skill-name
111
- Source: {agent_type} + {skill} (12 successes, 92%)
112
- Description: {inferred description}
166
+ 1. [high] release-docs-parity-check
167
+ Workflow: Validate guide/wiki/template parity before release PRs
168
+ Evidence: 4 dated sources, latest 2026-05-24
169
+ Recommended form: Skill
170
+ Overlap: update-docs, sauron-watch (partial)
171
+ Why: Prevents repeated CI wiki-staleness failures
172
+ Why not: May be redundant if update-docs grows the same gate
113
173
 
114
- 2. [medium] another-skill-name
115
- Source: {agent_type} + {skill} (7 successes, 86%)
116
- Description: {inferred description}
174
+ 2. [medium] upstream-release-issue-validation
175
+ Workflow: Validate upstream issue references before sync workflows
176
+ Evidence: 3 dated sources, latest 2026-05-22
177
+ Recommended form: Automation
178
+ Overlap: sync-upstream-release-issues script
179
+ Why: Deterministic validation prevents repeated 404 workflow failures
180
+ Why not: Better as script/test than human-invoked skill
117
181
 
118
- Select [1-N] to create, "all" to create all, or "skip" to cancel:
182
+ Select [1-N] to create, "all" to create all approved packageable items, or "skip" to cancel:
119
183
  ```
120
184
 
121
- ### Phase 5: Create Skill (on approval)
185
+ ### Phase 6: Create Artifact (on approval only)
122
186
 
123
- Delegate to mgr-creator with the proposal context:
124
- - Proposed name and description
125
- - Source pattern data
126
- - Confidence level
127
- - Any overlap warnings
187
+ Never create or modify reusable artifacts without explicit user approval after showing the shortlist.
128
188
 
129
- mgr-creator handles: SKILL.md creation, template sync, ontology registration.
189
+ On approval:
190
+
191
+ - `Skill`: delegate to `mgr-creator` with the full shortlist entry, proposal, overlap warnings, and guardrails.
192
+ - `Custom subagent`: delegate to `mgr-creator` with the R006 responsibility boundary and required related skills/guides.
193
+ - `Automation`: hand off a scoped implementation recommendation; require a dry-run/default-safe mode and R020 verification evidence.
194
+ - `Skip`: record the decision only if the user asks to save it.
195
+
196
+ mgr-creator handles: SKILL.md creation, template sync, ontology registration, and generated docs parity.
197
+
198
+ ## Recurring-Workflow Packaging Checklist
199
+
200
+ Before recommending packaging, verify:
201
+
202
+ - [ ] Evidence is source-attributed and includes dates or stable identifiers.
203
+ - [ ] Frequency meets `--threshold` or has two strong high-impact incidents.
204
+ - [ ] The workflow has stable trigger conditions and a clear stop condition.
205
+ - [ ] Existing skills, agents, hooks, scripts, and wiki docs were checked for duplicate or partial coverage.
206
+ - [ ] Recommended form is justified as `Skill`, `Custom subagent`, `Automation`, or `Skip`.
207
+ - [ ] `Why` and `Why Not` both name concrete evidence.
208
+ - [ ] R006 is preserved: one coherent responsibility and clear boundaries.
209
+ - [ ] R020 is preserved: proposal includes verification evidence or a test/check path.
210
+ - [ ] User approval is required before any artifact creation or mutation.
211
+ - [ ] Dry-run/no-write behavior remains available for review-only usage.
130
212
 
131
213
  ## Integration
132
214
 
133
215
  | System | How |
134
216
  |--------|-----|
135
- | task-outcome-recorder | Reads JSONL outcomes as input data |
136
- | feedback-collector | Complementary: feedback-collector extracts failure patterns, skill-extractor extracts success patterns |
137
- | mgr-creator | Delegated skill creation on user approval |
217
+ | task-outcome-recorder | Reads JSONL outcomes as one input data source |
218
+ | memory-management / memory-recall | Supplies dated recurring-workflow evidence when available |
219
+ | rollout and release summaries | Surface repeated manual release, docs, and compatibility procedures |
220
+ | optional Chronicle/history | Adds timeline evidence when installed; absence is non-fatal |
221
+ | existing skills and agents inventory | Prevents duplicate skills, subagents, or automations |
222
+ | feedback-collector | Complementary: feedback-collector extracts failure patterns, skill-extractor extracts recurring packageable workflows |
223
+ | mgr-creator | Delegated skill or subagent creation on user approval |
138
224
  | skills-sh-search | Check agentskills.io for existing equivalent before creating |
139
- | R011 (memory) | User Model tracks extraction decisions in Override Decisions |
225
+ | R006 | Enforces coherent responsibility boundaries for custom subagents and skills |
226
+ | R011 (memory) | User Model tracks extraction decisions in Override Decisions when explicitly saved |
227
+ | R020 | Requires verification evidence before completion claims |
140
228
 
141
229
  ## Hook Integration
142
230
 
143
231
  The `skill-extractor-analyzer.sh` Stop hook provides a lightweight pre-analysis:
232
+
144
233
  - Reads task outcomes file
145
- - Counts qualifying patterns
234
+ - Counts qualifying recurring patterns
146
235
  - Emits advisory stderr message if candidates found
147
- - Does NOT create skills (that requires user approval via the skill)
236
+ - Does NOT create skills, subagents, or automation (that requires user approval via the skill)
148
237
 
149
238
  ## Compatibility Artifact Protocol
150
239
 
@@ -152,8 +241,11 @@ Sensitive-path compatibility note: when delegated work touches `.claude/outputs/
152
241
 
153
242
  ## Safety
154
243
 
155
- - **User approval required**: Never auto-creates skills
156
- - **Overlap check**: Prevents duplicating existing skills
244
+ - **User approval required**: Never auto-creates skills, subagents, or automation
245
+ - **Evidence-first**: Never recommends packaging from vague memory or unverified anecdotes
246
+ - **Overlap check**: Prevents duplicating existing skills, agents, hooks, scripts, or docs
247
+ - **R006 guardrail**: Rejects catch-all artifacts with mixed responsibilities
248
+ - **R020 guardrail**: Every approved artifact must include a verification path before completion is claimed
157
249
  - **Dry-run mode**: Preview without side effects
158
250
  - **Advisory hook**: Stop hook is advisory-only (exit 0)
159
- - **Confidence transparency**: All proposals show confidence scores
251
+ - **Confidence transparency**: All shortlist entries and proposals show confidence scores and evidence dates
@@ -93,15 +93,12 @@ A PreToolUse hook in `.codex/hooks/hooks.json` checks this marker and blocks Wri
93
93
  └── Output: Implementation complete
94
94
  ```
95
95
 
96
- **Codex-Exec Hybrid Option**: When entering Stage 3:
97
- 1. Check `/tmp/.codex-env-status-*` for codex CLI availability
98
- 2. If available AND task involves new file creation → automatically delegate scaffolding to `/codex-exec`:
99
- - Display: `[Codex Hybrid] Delegating scaffolding to codex-exec...`
100
- - codex-exec generates initial code (strength: fast generation)
101
- - Claude expert reviews and refines codex output (strength: reasoning, quality)
102
- 3. If unavailable → display `[Codex] Unavailable — proceeding with Claude experts directly` and proceed with standard implementation via Claude experts
103
-
104
- Suitable for codex hybrid: new files, boilerplate, test stubs, scaffolding
96
+ **Optional Codex Plugin Interop**: When entering Stage 3:
97
+ 1. Use domain expert agents as the default implementation path.
98
+ 2. If the native Claude Code plugin `openai/codex-plugin-cc` is explicitly installed and requested, it may provide Codex interop for new-file scaffolding before expert review.
99
+ 3. Otherwise display `[Codex Plugin] Not requested proceeding with expert agents directly` and proceed with standard implementation.
100
+
101
+ Suitable for optional plugin interop: new files, boilerplate, test stubs, scaffolding
105
102
  Not suitable: modifying existing code, architecture-dependent changes
106
103
 
107
104
  **Exit criteria**: All planned files created/modified, tests written.
@@ -157,7 +154,7 @@ The stage marker file (`/tmp/.codex-dev-stage`) is read by a PreToolUse hook tha
157
154
  For complex tasks, Agent Teams is **preferred** when available (R018):
158
155
  - Plan: architect agent
159
156
  - Verify: reviewer agent(s) — multi-model-verification via Agent Teams
160
- - Implement: domain expert agent (+ codex-exec hybrid if available)
157
+ - Implement: domain expert agent (+ optional `openai/codex-plugin-cc` interop only when explicitly installed/requested)
161
158
  - Compound: QA agent
162
159
 
163
160
  When Agent Teams is enabled AND task involves 3+ agents or review→fix cycles, using Agent Teams is MANDATORY per R018.
@@ -110,7 +110,6 @@ NO EXCEPTIONS. NO EXCUSES.
110
110
  | `/omcustomcodex:npm-version` | Manage semantic versions |
111
111
  | `/omcustomcodex:npm-audit` | Audit dependencies |
112
112
  | `/omcustomcodex-release-notes` | Generate release notes from git history |
113
- | `/codex-exec` | Execute Codex CLI prompts |
114
113
  | `/optimize-analyze` | Analyze bundle and performance |
115
114
  | `/optimize-bundle` | Optimize bundle size |
116
115
  | `/optimize-report` | Generate optimization report |
@@ -134,7 +133,7 @@ project/
134
133
  | +-- hooks/ # Hook scripts (security, validation, HUD)
135
134
  | +-- contexts/ # Context files (ecomode)
136
135
  +-- .agents/
137
- | +-- skills/ # Installed skills (123 directories)
136
+ | +-- skills/ # Installed skills (120 directories)
138
137
  +-- guides/ # Reference docs (51 topics)
139
138
  ```
140
139
 
@@ -110,7 +110,6 @@ oh-my-customcodex로 구동됩니다.
110
110
  | `/omcustomcodex:npm-version` | 시맨틱 버전 관리 |
111
111
  | `/omcustomcodex:npm-audit` | 의존성 감사 |
112
112
  | `/omcustomcodex-release-notes` | 릴리즈 노트 생성 (git 히스토리 기반) |
113
- | `/codex-exec` | Codex CLI 프롬프트 실행 |
114
113
  | `/optimize-analyze` | 번들 및 성능 분석 |
115
114
  | `/optimize-bundle` | 번들 크기 최적화 |
116
115
  | `/optimize-report` | 최적화 리포트 생성 |
@@ -134,7 +133,7 @@ project/
134
133
  | +-- hooks/ # 훅 스크립트 (보안, 검증, HUD)
135
134
  | +-- contexts/ # 컨텍스트 파일 (ecomode)
136
135
  +-- .agents/
137
- | +-- skills/ # 설치된 스킬 (123 디렉토리)
136
+ | +-- skills/ # 설치된 스킬 (120 디렉토리)
138
137
  +-- guides/ # 레퍼런스 문서 (51 토픽)
139
138
  ```
140
139
 
@@ -102,7 +102,7 @@ oh-my-customcodex로 구동됩니다.
102
102
  | 개발 | `/omcustomcodex:goal`, `/dev-review`, `/dev-refactor`, `/sdd-dev`, `/structured-dev-cycle` | 목표 실행, 코드 리뷰, 리팩토링, SDD |
103
103
  | 검증 | `/omcustomcodex:sauron-watch`, `/deep-verify`, `/adversarial-review` | R017 검증, 릴리즈 품질, 보안 리뷰 |
104
104
  | 릴리즈 | `/pipeline auto-dev`, `/omcustomcodex-release-notes`, `/release-plan` | 자동 개발, 릴리즈 노트 |
105
- | 리서치 | `/research`, `/scout`, `/deep-plan`, `/omcustomcodex:agora` | 병렬 분석, URL 평가, 연구 계획 |
105
+ | 리서치 | `/research`, `/scout`, `/deep-plan`, `/roundtable-debate` | 병렬 분석, URL 평가, 토론형 검증 |
106
106
  | 메모리 | `/memory-save`, `/memory-recall` | 세션 메모리 관리 |
107
107
  | 시스템 | `/token-efficiency-audit`, `/pre-generation-arch-check`, `/omcustomcodex:lists`, `/omcustomcodex:status`, `/omcustomcodex:help` | 토큰 효율/아키텍처 감사, 전체 목록, 상태, 도움말 |
108
108
 
@@ -119,7 +119,7 @@ project/
119
119
  | +-- hooks/ # 훅 스크립트 (보안, 검증, HUD)
120
120
  | +-- contexts/ # 컨텍스트 파일 (ecomode)
121
121
  +-- .agents/
122
- | +-- skills/ # 스킬 (123 디렉토리)
122
+ | +-- skills/ # 스킬 (120 디렉토리)
123
123
  +-- guides/ # 레퍼런스 문서 (51 토픽)
124
124
  ```
125
125
 
@@ -112,7 +112,6 @@ NO EXCEPTIONS. NO EXCUSES.
112
112
  | `/omcustomcodex:npm-version` | Manage semantic versions |
113
113
  | `/omcustomcodex:npm-audit` | Audit dependencies |
114
114
  | `/omcustomcodex-release-notes` | Generate release notes from git history |
115
- | `/codex-exec` | Execute Codex CLI prompts |
116
115
  | `/optimize-analyze` | Analyze bundle and performance |
117
116
  | `/optimize-bundle` | Optimize bundle size |
118
117
  | `/optimize-report` | Generate optimization report |
@@ -133,7 +132,7 @@ project/
133
132
  +-- AGENTS.md # Entry point
134
133
  +-- .codex/
135
134
  | +-- agents/ # Subagent definitions (50 files)
136
- | +-- skills/ # Skills (123 directories)
135
+ | +-- skills/ # Skills (120 directories)
137
136
  | +-- rules/ # Global rules (22 files)
138
137
  | +-- hooks/ # Hook scripts (security, validation, HUD)
139
138
  | +-- contexts/ # Context files (4 files)
@@ -112,7 +112,6 @@ oh-my-customcodex로 구동됩니다.
112
112
  | `/omcustomcodex:npm-version` | 시맨틱 버전 관리 |
113
113
  | `/omcustomcodex:npm-audit` | 의존성 감사 |
114
114
  | `/omcustomcodex-release-notes` | 릴리즈 노트 생성 (git 히스토리 기반) |
115
- | `/codex-exec` | Codex CLI 프롬프트 실행 |
116
115
  | `/optimize-analyze` | 번들 및 성능 분석 |
117
116
  | `/optimize-bundle` | 번들 크기 최적화 |
118
117
  | `/optimize-report` | 최적화 리포트 생성 |
@@ -133,7 +132,7 @@ project/
133
132
  +-- AGENTS.md # 진입점
134
133
  +-- .codex/
135
134
  | +-- agents/ # 서브에이전트 정의 (50 파일)
136
- | +-- skills/ # 스킬 (123 디렉토리)
135
+ | +-- skills/ # 스킬 (120 디렉토리)
137
136
  | +-- rules/ # 전역 규칙 (22 파일)
138
137
  | +-- hooks/ # 훅 스크립트 (보안, 검증, HUD)
139
138
  | +-- contexts/ # 컨텍스트 파일 (4 파일)
@@ -46,7 +46,7 @@ templates/
46
46
  +-- workflows/ # project-level pipeline definitions
47
47
  +-- .claude/
48
48
  | +-- agents/ # agent definitions (50 files)
49
- | +-- skills/ # skill modules (123 SKILL.md files)
49
+ | +-- skills/ # skill modules (120 SKILL.md files)
50
50
  | +-- rules/ # global rules (22 files)
51
51
  | +-- hooks/ # hook registry and scripts (40 scripts)
52
52
  | +-- contexts/ # context files
@@ -65,7 +65,7 @@ The counts below should stay aligned with `templates/manifest.json`, README comp
65
65
 
66
66
  Flat agent definition files. During Codex installation these land under `.codex/agents/`.
67
67
 
68
- ### Skills (123)
68
+ ### Skills (120)
69
69
 
70
70
  `templates/.claude/skills/*/SKILL.md`
71
71