claudecode-omc 4.3.5 → 4.4.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -0,0 +1,158 @@
1
+ # Skill Ecosystem Optimization Design
2
+
3
+ **Date:** 2026-02-26
4
+ **Scope:** oh-my-claudecode — full skill quality overhaul (description + structure + quality pipeline)
5
+ **Inspiration:** oh-my-codex skill-development, skill-debugger, skill-quality-analyzer, skill-create patterns
6
+
7
+ ---
8
+
9
+ ## Problem Statement
10
+
11
+ The oh-my-claudecode skill ecosystem has three interconnected issues:
12
+
13
+ 1. **Weak trigger descriptions** — YAML `description` fields are one-liner jargon (e.g., "Full autonomous execution from idea to working code") with no specific trigger phrases. When Claude reads the system-reminder skill list, these descriptions don't tell it _when_ to invoke the skill.
14
+
15
+ 2. **Progressive disclosure violations** — The `skill` management skill is 838 lines; all content is loaded into context every time. The oh-my-codex standard requires ≤200 lines in SKILL.md, with detailed content in `references/`.
16
+
17
+ 3. **Codex-branded ported skills** — `skill-quality-analyzer` and `skill-tester` still reference "Codex CLI" and "Codex skill quality", which is confusing and slightly broken in the Claude Code context.
18
+
19
+ 4. **No writing standard for new skills** — No equivalent of oh-my-codex's `skill-development` guide exists for OMC. New skills get written ad hoc with inconsistent quality.
20
+
21
+ ---
22
+
23
+ ## Architecture
24
+
25
+ ### Trigger System in Claude Code (vs Codex)
26
+
27
+ Unlike oh-my-codex where descriptions drive auto-triggering, OMC's skills work in two modes:
28
+
29
+ - **Core workflow skills** (autopilot, ralph, plan, ultrawork, team…): Triggered by `CLAUDE.md` `<skills>` block keyword matching — descriptions are supplementary
30
+ - **User-created skills** (`~/.claude/skills/`, `.omc/skills/`): Triggered by description semantic matching — descriptions are critical
31
+
32
+ **System-reminder display format:**
33
+ ```
34
+ - skill-name: [description content shown verbatim to Claude]
35
+ ```
36
+
37
+ Therefore, descriptions should read like a natural "use when" statement, not a capability tagline.
38
+
39
+ ---
40
+
41
+ ## Three-Layer Optimization
42
+
43
+ ### Layer 1: Description Standardization (~30 files, frontmatter edits only)
44
+
45
+ **Standard format:**
46
+ ```yaml
47
+ description: "Use when user wants [specific scenario] ('[trigger phrase 1]', '[trigger phrase 2]'). [One-sentence capability summary]."
48
+ ```
49
+
50
+ **Transformation examples:**
51
+
52
+ | Skill | Before | After |
53
+ |-------|--------|-------|
54
+ | `autopilot` | "Full autonomous execution from idea to working code" | "Use when user wants full-auto development ('build me', 'autopilot', 'I want a'). Handles spec→plan→code→QA→validation end-to-end." |
55
+ | `plan` | "Strategic planning with optional interview workflow" | "Use when user wants to plan before coding ('plan this', 'plan the', 'let's plan'). Supports interview, direct, consensus, and review modes." |
56
+ | `ralph` | "Self-referential loop until task completion with architect verification" | "Use when task must complete guaranteed ('ralph', 'don't stop', 'must complete'). Persistence loop with architect verification." |
57
+ | `ultrawork` | "Parallel execution engine for high-throughput task completion" | "Use when multiple independent tasks need parallel execution ('ulw', 'ultrawork', 'run in parallel'). Routes tasks to tiered agents simultaneously." |
58
+ | `skill-quality-analyzer` | "Analyzes Codex skill quality with 6-dimension scoring…" | "Use when checking a Claude Code skill's quality ('analyze skill', 'score skill', 'skill quality'). 6-dimension scoring: clarity, structure, examples, triggers, practices, maintainability." |
59
+ | `skill-tester` | "Tests Codex skill functionality with TDD approach…" | "Use when verifying a Claude Code skill works correctly ('test skill', 'verify skill behavior'). TDD workflow with trigger tests and functional validation." |
60
+
61
+ **All skills to update:** autopilot, build-fix, deepinit, learner, note, skill, trace, ultraqa, conductor, skill-tester, skill-quality-analyzer, skill-debugger, analyze, cancel, ccg, code-review, configure-notifications, external-context, hud, learn-about-omc, mcp-setup, omc-doctor, omc-help, omc-setup, pipeline, plan, ralph, ralph-init, ralplan, release, review, sciomc, security-review, tdd, team, ultrapilot, ultrawork, ai-commenting, writing-skills (new)
62
+
63
+ ### Layer 2: Progressive Disclosure for `skill` Management Skill
64
+
65
+ **Current:** `skills/skill/SKILL.md` — 838 lines, all loaded at invocation
66
+
67
+ **Target structure:**
68
+ ```
69
+ skills/skill/
70
+ ├── SKILL.md (~150 lines — commands overview + core workflow)
71
+ ├── references/
72
+ │ ├── templates.md (4 skill templates: Error/Workflow/Pattern/Integration)
73
+ │ └── setup-guide.md (setup wizard detail + bash scan scripts)
74
+ ```
75
+
76
+ **What stays in SKILL.md:**
77
+ - Subcommand overview table (list/add/remove/search/edit/sync/setup/scan)
78
+ - Core behavior for each command (condensed, pointer to references/ for detail)
79
+ - Error handling format
80
+ - Skill quality guidelines (condensed to 4 bullets)
81
+
82
+ **What moves to references/:**
83
+ - All 4 full skill templates (Error Solution, Workflow, Code Pattern, Integration) → `references/templates.md`
84
+ - Full bash scripts for directory scanning → `references/setup-guide.md`
85
+ - Complete example session → `references/setup-guide.md`
86
+
87
+ ### Layer 3: `writing-skills` Skill (New)
88
+
89
+ **OMC's equivalent of oh-my-codex's `skill-development`.**
90
+
91
+ ```
92
+ skills/writing-skills/
93
+ ├── SKILL.md (~120 lines)
94
+ └── references/
95
+ ├── format-guide.md (XML tag structure, size targets, style rules)
96
+ └── description-patterns.md (Good/Bad description catalog)
97
+ ```
98
+
99
+ **SKILL.md content:**
100
+ - When to use (trigger: "write a skill", "create a skill", "new skill")
101
+ - OMC Hybrid Workflow: Hard Steps (file creation, frontmatter) + Soft Steps (description quality judgment)
102
+ - 6-step process: Understand → Plan → Create structure → Write SKILL.md → Quality-analyze → Test
103
+ - Integration: triggers skill-quality-analyzer, skill-debugger, skill-tester
104
+ - Validation checklist
105
+ - References to `references/format-guide.md` and `references/description-patterns.md`
106
+
107
+ **`references/format-guide.md` content:**
108
+ - XML tag structure: `<Purpose>`, `<Use_When>`, `<Do_Not_Use_When>`, `<Why_This_Exists>`, `<Execution_Policy>`, `<Steps>`, `<Tool_Usage>`, `<Examples>`, `<Escalation_And_Stop_Conditions>`, `<Final_Checklist>`, `<Advanced>`
109
+ - When to use XML vs markdown headers (workflow skills → XML; utility skills → markdown)
110
+ - Size targets: SKILL.md ≤200 lines, references/ files ≤300 lines each
111
+ - Description standard with trigger phrase format
112
+
113
+ **`references/description-patterns.md` content:**
114
+ - Side-by-side catalog of all 30+ skill descriptions (before/after with rationale)
115
+ - Pattern library: workflow triggers, delegation triggers, utility triggers
116
+ - Anti-patterns with explanations
117
+
118
+ ### Bonus: Adapt Ported Skills
119
+
120
+ **`skill-quality-analyzer/SKILL.md`:**
121
+ - Replace all "Codex" references with "Claude Code"
122
+ - Update Agent Workflow section: replace `python3 analyzer.py` with Glob/Read/Grep tool sequence as primary hard-metrics step (keep analyzer.py as optional enhancement)
123
+ - Update skill location paths: `~/.codex/skills/` → `~/.claude/skills/` and `.agents/skills/` → `.claude/skills/`
124
+
125
+ **`skill-tester/SKILL.md`:**
126
+ - Replace all "Codex" references with "Claude Code"
127
+ - Remove JSON-based test case format (more Codex-native) — replace with scenario-based format
128
+ - Simplify: this skill is mainly used for "does the skill trigger?" and "does it produce the right output?" — remove performance testing and coverage metrics (overkill for prompt-based skills)
129
+
130
+ ---
131
+
132
+ ## What We Are NOT Doing
133
+
134
+ - No changes to body content of autopilot/ralph/plan/ultrawork/team — XML structure is well-designed
135
+ - No rewrite of `analyzer.py` Python script — functional, keep as optional tool
136
+ - No `version` field added to frontmatter — no versioning system in OMC
137
+ - No changes to `project-session-manager/` shell scripts — not a skill content issue
138
+ - No changes to `CLAUDE.md` skill trigger keywords — those work correctly
139
+
140
+ ---
141
+
142
+ ## Acceptance Criteria
143
+
144
+ 1. All skill descriptions follow "Use when user [scenario] ('[trigger]'). [Summary]." format
145
+ 2. `skill/SKILL.md` is ≤200 lines; templates and scripts moved to `references/`
146
+ 3. `writing-skills/` skill exists with SKILL.md + 2 references files
147
+ 4. `skill-quality-analyzer/SKILL.md` and `skill-tester/SKILL.md` contain no "Codex" references
148
+ 5. Running `skill-quality-analyzer` on `writing-skills` returns score ≥80
149
+ 6. No existing skill functionality is removed — only descriptions and structure change
150
+
151
+ ---
152
+
153
+ ## Implementation Order
154
+
155
+ 1. Create `writing-skills/` skill (establishes the standard we'll apply)
156
+ 2. Update all descriptions (Layer 1) — mechanical, high volume
157
+ 3. Refactor `skill/` (Layer 2) — structural surgery on largest skill
158
+ 4. Update `skill-quality-analyzer` and `skill-tester` (ported skill adaptation)