oh-my-customcodex 0.4.11 → 0.4.13
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/dist/cli/index.js +6 -1
- package/dist/index.js +6 -1
- package/package.json +6 -1
- package/templates/.claude/agents/mgr-creator.md +1 -1
- package/templates/.claude/agents/mgr-gitnerd.md +1 -1
- package/templates/.claude/agents/sys-memory-keeper.md +1 -1
- package/templates/.claude/agents/tracker-checkpoint.md +1 -1
- package/templates/.claude/agents/wiki-curator.md +1 -1
- package/templates/.claude/rules/MUST-agent-design.md +21 -16
- package/templates/.claude/rules/MUST-orchestrator-coordination.md +13 -0
- package/templates/.claude/rules/MUST-permissions.md +15 -0
- package/templates/.claude/skills/action-validator/SKILL.md +41 -0
- package/templates/.claude/skills/adaptive-harness/SKILL.md +5 -0
- package/templates/.claude/skills/agora/SKILL.md +1 -1
- package/templates/.claude/skills/dag-orchestration/SKILL.md +1 -1
- package/templates/.claude/skills/de-lead-routing/SKILL.md +1 -1
- package/templates/.claude/skills/deep-plan/SKILL.md +38 -322
- package/templates/.claude/skills/deep-verify/SKILL.md +1 -1
- package/templates/.claude/skills/dev-lead-routing/SKILL.md +1 -1
- package/templates/.claude/skills/dev-review/SKILL.md +5 -0
- package/templates/.claude/skills/hada-scout/SKILL.md +1 -1
- package/templates/.claude/skills/harness-eval/SKILL.md +1 -1
- package/templates/.claude/skills/harness-synthesizer/SKILL.md +5 -0
- package/templates/.claude/skills/omcodex-improve-report/SKILL.md +1 -1
- package/templates/.claude/skills/omcodex-takeover/SKILL.md +1 -1
- package/templates/.claude/skills/optimize-analyze/SKILL.md +1 -1
- package/templates/.claude/skills/optimize-report/SKILL.md +1 -1
- package/templates/.claude/skills/post-release-followup/SKILL.md +10 -5
- package/templates/.claude/skills/professor-triage/SKILL.md +55 -288
- package/templates/.claude/skills/qa-lead-routing/SKILL.md +1 -1
- package/templates/.claude/skills/research/SKILL.md +6 -1
- package/templates/.claude/skills/result-aggregation/SKILL.md +4 -0
- package/templates/.claude/skills/roundtable-debate/SKILL.md +1 -1
- package/templates/.claude/skills/scout/SKILL.md +1 -1
- package/templates/.claude/skills/secretary-routing/SKILL.md +1 -1
- package/templates/.claude/skills/skill-extractor/SKILL.md +4 -0
- package/templates/.claude/skills/task-decomposition/SKILL.md +1 -1
- package/templates/.claude/skills/worker-reviewer-pipeline/SKILL.md +1 -1
- package/templates/manifest.json +6 -1
- package/templates/workflows/auto-dev.yaml +2 -2
|
@@ -1,9 +1,9 @@
|
|
|
1
1
|
---
|
|
2
2
|
name: deep-plan
|
|
3
|
-
description: Research-validated planning
|
|
3
|
+
description: Research-validated planning - research -> plan -> verify cycle for high-confidence implementation plans
|
|
4
4
|
scope: core
|
|
5
5
|
context: fork
|
|
6
|
-
version: 1.
|
|
6
|
+
version: 1.1.0
|
|
7
7
|
user-invocable: true
|
|
8
8
|
argument-hint: "<topic-or-issue>"
|
|
9
9
|
teams-compatible: true
|
|
@@ -11,353 +11,69 @@ teams-compatible: true
|
|
|
11
11
|
|
|
12
12
|
# Deep Plan Skill
|
|
13
13
|
|
|
14
|
-
Research-validated planning that eliminates the gap between research assumptions and actual code.
|
|
14
|
+
Research-validated planning that eliminates the gap between research assumptions and actual code. It runs Discovery Research, Reality-Check Planning, and Plan Verification before handing off implementation.
|
|
15
15
|
|
|
16
|
-
**
|
|
16
|
+
**Full phase detail**: `guides/deep-plan/phases.md`
|
|
17
17
|
|
|
18
18
|
## Usage
|
|
19
19
|
|
|
20
|
-
```
|
|
20
|
+
```bash
|
|
21
21
|
/deep-plan <topic-or-issue>
|
|
22
22
|
/deep-plan "implement caching layer for API responses"
|
|
23
23
|
/deep-plan #325 new authentication system
|
|
24
|
-
/deep-plan Rust async runtime migration
|
|
25
|
-
```
|
|
26
|
-
|
|
27
|
-
## Problem Solved
|
|
28
|
-
|
|
29
|
-
Research-only analysis (like `/research`) produces findings based on assumptions about the codebase. These assumptions often diverge from reality:
|
|
30
|
-
|
|
31
|
-
| Assumption | Reality | Impact |
|
|
32
|
-
|------------|---------|--------|
|
|
33
|
-
| "Feature X is missing" | Already implemented | Wasted effort on duplicate work |
|
|
34
|
-
| "Pattern Y is needed" | Partially exists | Over-engineering existing code |
|
|
35
|
-
| "Library Z is required" | Already a dependency | Unnecessary integration effort |
|
|
36
|
-
|
|
37
|
-
`/deep-plan` solves this by cross-referencing research findings against actual code before committing to a plan.
|
|
38
|
-
|
|
39
|
-
## Architecture — 3 Phases
|
|
40
|
-
|
|
41
|
-
### Phase 1: Discovery Research
|
|
42
|
-
|
|
43
|
-
Invoke the `/research` skill internally for comprehensive topic analysis.
|
|
44
|
-
|
|
45
|
-
```
|
|
46
|
-
Phase 1: Discovery Research
|
|
47
|
-
├── Skill(research, args="<topic>")
|
|
48
|
-
├── 10-team parallel analysis (3 batches × 4/4/2)
|
|
49
|
-
├── Cross-verification loop (opus + codex)
|
|
50
|
-
├── ADOPT / ADAPT / AVOID taxonomy
|
|
51
|
-
└── Output: research report (artifact)
|
|
52
|
-
```
|
|
53
|
-
|
|
54
|
-
**Execution**:
|
|
55
|
-
- **Orchestrator mode**: Delegates to `/research` skill via `Skill(research, args="<topic>")`.
|
|
56
|
-
- **Teams mode**: Executes the research workflow inline (see Teams Mode section). The member spawns research teams directly as sub-agents.
|
|
57
|
-
|
|
58
|
-
The executor waits for completion before proceeding to Phase 2.
|
|
59
|
-
|
|
60
|
-
**Output**: Full research report with ADOPT/ADAPT/AVOID taxonomy.
|
|
61
|
-
|
|
62
|
-
### Phase 2: Reality-Check Planning
|
|
63
|
-
|
|
64
|
-
Ground-truth the research findings against the actual codebase.
|
|
65
|
-
|
|
66
|
-
```
|
|
67
|
-
Phase 2: Reality-Check Planning
|
|
68
|
-
├── EnterPlanMode
|
|
69
|
-
├── Explore agents (up to 3 parallel)
|
|
70
|
-
│ ├── Explore 1: Verify ADOPT items exist/don't exist
|
|
71
|
-
│ ├── Explore 2: Check ADAPT items for current state
|
|
72
|
-
│ └── Explore 3: Validate AVOID alternatives
|
|
73
|
-
├── Gap analysis table
|
|
74
|
-
├── Refined plan (real gaps only)
|
|
75
|
-
└── ExitPlanMode → user approval
|
|
76
|
-
```
|
|
77
|
-
|
|
78
|
-
**Steps**:
|
|
79
|
-
|
|
80
|
-
1. **Enter Plan Mode**: `EnterPlanMode` to activate planning context
|
|
81
|
-
2. **Codebase Exploration**: Spawn up to 3 Explore agents in parallel to verify research assumptions:
|
|
82
|
-
- Each ADOPT item: Does it already exist? Partially implemented?
|
|
83
|
-
- Each ADAPT item: What is the current state to adapt from?
|
|
84
|
-
- Each AVOID item: Are the alternatives already available?
|
|
85
|
-
3. **Deliverable Dependency Verification**: After exploration, verify inter-deliverable dependencies:
|
|
86
|
-
- For each deliverable pair, check: do they share files, functions, or modules?
|
|
87
|
-
- Classify each pair: `independent` (parallel-safe), `sequential` (order required), `shared-state` (synchronization needed)
|
|
88
|
-
- **Default bias**: Assume `independent` unless exploration finds concrete shared state
|
|
89
|
-
- Build dependency matrix:
|
|
90
|
-
|
|
91
|
-
```
|
|
92
|
-
| Deliverable A | Deliverable B | Classification | Evidence |
|
|
93
|
-
|---------------|---------------|----------------|----------|
|
|
94
|
-
| D1: Auth | D2: API | independent | No shared files |
|
|
95
|
-
| D1: Auth | D3: Tests | sequential | D3 tests D1 output |
|
|
96
|
-
```
|
|
97
|
-
|
|
98
|
-
- **Orchestrator override**: The dependency classification is advisory. The orchestrator or user can reclassify pairs when the automated analysis is overly conservative.
|
|
99
|
-
|
|
100
|
-
4. **Gap Analysis**: Build a reconciliation table:
|
|
101
|
-
|
|
102
|
-
```
|
|
103
|
-
| Research Finding | Actual Code State | Gap Type | Action | Dependencies |
|
|
104
|
-
|-----------------|-------------------|----------|--------|-------------|
|
|
105
|
-
| "No caching" | Redis client exists | Overestimate | Remove from plan | — |
|
|
106
|
-
| "Need auth middleware" | No auth layer | Real gap | Keep in plan | D3 (sequential) |
|
|
107
|
-
| "Migrate to v3" | Already on v3.1 | Overestimate | Remove from plan | — |
|
|
108
|
-
| "Add rate limiting" | Basic limiter exists | Partial gap | Adapt existing | independent |
|
|
109
|
-
```
|
|
110
|
-
|
|
111
|
-
5. **Refined Plan**: Write implementation plan containing ONLY real gaps:
|
|
112
|
-
- Remove overestimates (already implemented)
|
|
113
|
-
- Adjust partial gaps (adapt, don't rebuild)
|
|
114
|
-
- Prioritize real gaps by impact
|
|
115
|
-
6. **User Approval**: `ExitPlanMode` presents the refined plan for user review
|
|
116
|
-
- Include dependency matrix in plan output
|
|
117
|
-
- Display override option: "Dependency classifications are advisory. Reply with reclassifications if needed."
|
|
118
|
-
|
|
119
|
-
### Phase 3: Plan Verification Research
|
|
120
|
-
|
|
121
|
-
Validate the refined plan with focused research before implementation begins.
|
|
122
|
-
|
|
123
24
|
```
|
|
124
|
-
Phase 3: Plan Verification Research
|
|
125
|
-
├── 3-team focused verification
|
|
126
|
-
│ ├── T1: Technical feasibility
|
|
127
|
-
│ ├── T2: Conflict/duplication check
|
|
128
|
-
│ └── T3: Test strategy & risk
|
|
129
|
-
├── Verdict: PASS or REVISE
|
|
130
|
-
├── PASS → implementation advisory
|
|
131
|
-
└── REVISE → return to Phase 2
|
|
132
|
-
```
|
|
133
|
-
|
|
134
|
-
**Teams** (3 parallel, NOT full 10-team):
|
|
135
|
-
|
|
136
|
-
| Team | Focus | Verifies |
|
|
137
|
-
|------|-------|----------|
|
|
138
|
-
| T1 | Technical feasibility | Can the plan be implemented with current stack/deps? |
|
|
139
|
-
| T2 | Conflict & duplication | Does the plan conflict with in-flight work or duplicate existing code? |
|
|
140
|
-
| T3 | Test strategy & risk | Is the plan testable? What are the failure modes? |
|
|
141
|
-
|
|
142
|
-
**Invocation**: Phase 3 teams are spawned directly as parallel agents (NOT via `Skill(research)`). The orchestrator creates 3 focused agents, each with a specific verification mandate derived from the Phase 2 plan.
|
|
143
|
-
|
|
144
|
-
**Model selection**: sonnet for teams, opus for synthesis.
|
|
145
|
-
|
|
146
|
-
**Verdict**:
|
|
147
|
-
- **PASS**: Plan is verified. Display implementation advisory.
|
|
148
|
-
- **REVISE**: Issues found. Return to Phase 2 with feedback for plan refinement.
|
|
149
|
-
- **REVISE limit**: After 2 REVISE cycles, escalate to user for manual judgment.
|
|
150
|
-
|
|
151
|
-
## Workflow Diagram
|
|
152
|
-
|
|
153
|
-
```
|
|
154
|
-
User: /deep-plan "topic"
|
|
155
|
-
│
|
|
156
|
-
├─ Phase 1: Discovery Research
|
|
157
|
-
│ ├─ Skill(research, args="topic")
|
|
158
|
-
│ ├─ 10-team analysis → ADOPT/ADAPT/AVOID
|
|
159
|
-
│ └─ Output: research artifact
|
|
160
|
-
│
|
|
161
|
-
├─ Phase 2: Reality-Check Planning
|
|
162
|
-
│ ├─ EnterPlanMode
|
|
163
|
-
│ ├─ Explore agents (up to 3 parallel)
|
|
164
|
-
│ ├─ Gap analysis: research vs actual code
|
|
165
|
-
│ ├─ Refined plan (real gaps only)
|
|
166
|
-
│ └─ ExitPlanMode → user approval
|
|
167
|
-
│
|
|
168
|
-
└─ Phase 3: Plan Verification
|
|
169
|
-
├─ 3-team focused research
|
|
170
|
-
├─ Verdict: PASS or REVISE
|
|
171
|
-
├─ PASS → implementation advisory
|
|
172
|
-
└─ REVISE → loop back to Phase 2 (max 2 cycles)
|
|
173
|
-
```
|
|
174
|
-
|
|
175
|
-
## Differentiation
|
|
176
25
|
|
|
177
|
-
|
|
178
|
-
|-------|-------|-------------------|--------|
|
|
179
|
-
| `/research` | Analysis only | None — assumption-based | 1 |
|
|
180
|
-
| Plan mode | Planning only | Yes — code exploration | 1 |
|
|
181
|
-
| `/structured-dev-cycle` | Full implementation | Yes — stage-by-stage | 6 |
|
|
182
|
-
| **`/deep-plan`** | **Analysis + Planning + Verification** | **3-pass cross-verification** | **3** |
|
|
26
|
+
## Workflow Contract
|
|
183
27
|
|
|
184
|
-
|
|
185
|
-
|
|
186
|
-
|
|
187
|
-
|
|
188
|
-
|
|
189
|
-
```
|
|
190
|
-
[Deep Plan] {topic}
|
|
191
|
-
├── Phase 1: Discovery Research (10 teams, 3 batches)
|
|
192
|
-
├── Phase 2: Reality-Check Planning (up to 3 Explore agents)
|
|
193
|
-
└── Phase 3: Plan Verification (3 focused teams)
|
|
194
|
-
|
|
195
|
-
Estimated phases: 3 | Models: sonnet → opus
|
|
196
|
-
Execute? [Y/n]
|
|
197
|
-
```
|
|
198
|
-
|
|
199
|
-
Phase transitions:
|
|
200
|
-
```
|
|
201
|
-
[Deep Plan] Phase 1/3 — Discovery Research
|
|
202
|
-
├── Research skill active...
|
|
203
|
-
└── Awaiting 10-team results
|
|
204
|
-
|
|
205
|
-
[Deep Plan] Phase 2/3 — Reality-Check Planning
|
|
206
|
-
├── Gap analysis: 6 ADOPT items → 2 real gaps, 4 overestimates
|
|
207
|
-
└── Refined plan: 5 action items (down from 12)
|
|
208
|
-
|
|
209
|
-
[Deep Plan] Phase 3/3 — Plan Verification
|
|
210
|
-
├── T1 (feasibility): ✓ PASS
|
|
211
|
-
├── T2 (conflicts): ✓ PASS
|
|
212
|
-
├── T3 (test/risk): ✓ PASS
|
|
213
|
-
└── Verdict: PASS — ready for implementation
|
|
214
|
-
```
|
|
215
|
-
|
|
216
|
-
## Post-Completion Advisory
|
|
217
|
-
|
|
218
|
-
After PASS verdict:
|
|
219
|
-
```
|
|
220
|
-
[Advisory] Verified plan ready for implementation.
|
|
221
|
-
├── For complex implementations (10+ files): /structured-dev-cycle
|
|
222
|
-
├── For parallel task execution: superpowers:subagent-driven-development
|
|
223
|
-
└── For simple tasks (< 3 files): proceed directly
|
|
224
|
-
```
|
|
28
|
+
| Phase | Name | Key Activity |
|
|
29
|
+
|-------|------|--------------|
|
|
30
|
+
| 1 | Discovery Research | Comprehensive research with ADOPT / ADAPT / AVOID taxonomy |
|
|
31
|
+
| 2 | Reality-Check Planning | Explore current code and keep only real gaps |
|
|
32
|
+
| 3 | Plan Verification | Feasibility, duplication, and test-risk verification |
|
|
225
33
|
|
|
226
34
|
## Execution Rules
|
|
227
35
|
|
|
228
36
|
| Rule | Detail |
|
|
229
37
|
|------|--------|
|
|
230
|
-
| Phase 1 |
|
|
231
|
-
| Phase 2 |
|
|
232
|
-
| Phase 3 |
|
|
233
|
-
| Orchestrator
|
|
234
|
-
| Intent display | Show phase plan before execution (R015) |
|
|
235
|
-
| Ecomode | Auto-activate for team result aggregation (R013) |
|
|
38
|
+
| Phase 1 | Invoke or inline the research workflow, depending on runtime context |
|
|
39
|
+
| Phase 2 | Use up to 3 parallel Explore agents for current-code verification |
|
|
40
|
+
| Phase 3 | Use up to 3 focused verification agents |
|
|
41
|
+
| Orchestrator owns synthesis | The main conversation manages phase transitions and verdicts |
|
|
236
42
|
| REVISE limit | Max 2 cycles before user escalation |
|
|
237
43
|
|
|
238
|
-
##
|
|
239
|
-
|
|
240
|
-
When Agent Teams is enabled, Phase 1 and Phase 3 parallel teams SHOULD use Agent Teams instead of individual Agent tool calls:
|
|
241
|
-
|
|
242
|
-
| Phase | Without Agent Teams | With Agent Teams |
|
|
243
|
-
|-------|--------------------|--------------------|
|
|
244
|
-
| Phase 1 | Delegates to `/research` (handles internally) | Delegates to `/research` (handles internally) |
|
|
245
|
-
| Phase 2 | Up to 3 Explore agents via Agent tool | Up to 3 Explore agents via Agent tool (below threshold) |
|
|
246
|
-
| Phase 3 | 3 agents via Agent tool | 3 agents — at threshold, prefer Agent Teams for coordination |
|
|
247
|
-
|
|
248
|
-
Phase 1 delegation to `/research` means Agent Teams decisions are handled by the research skill itself. Phase 3's 3-team verification is at the Agent Teams threshold (3+ agents) and benefits from peer messaging for cross-verification.
|
|
249
|
-
|
|
250
|
-
## Model Selection
|
|
251
|
-
|
|
252
|
-
| Phase | Component | Model | Rationale |
|
|
253
|
-
|-------|-----------|-------|-----------|
|
|
254
|
-
| Phase 1 | Research teams | sonnet | Delegated to /research skill |
|
|
255
|
-
| Phase 1 | Verification | opus | Delegated to /research skill |
|
|
256
|
-
| Phase 2 | Explore agents | haiku | Fast codebase search |
|
|
257
|
-
| Phase 2 | Gap analysis | opus | Complex reconciliation reasoning |
|
|
258
|
-
| Phase 3 | Verification teams | sonnet | Balanced analysis |
|
|
259
|
-
| Phase 3 | Synthesis/verdict | opus | Final judgment |
|
|
260
|
-
|
|
261
|
-
## Cost Estimate
|
|
262
|
-
|
|
263
|
-
| Phase | Approximate Cost | Driver |
|
|
264
|
-
|-------|-----------------|--------|
|
|
265
|
-
| Phase 1 | High | Full 10-team `/research` invocation |
|
|
266
|
-
| Phase 2 | Low-Medium | Up to 3 Explore agents (haiku) + 1 opus synthesis |
|
|
267
|
-
| Phase 3 | Medium | 3 sonnet verification teams + 1 opus synthesis |
|
|
268
|
-
| **Total** | **High** | Dominated by Phase 1 research cost |
|
|
269
|
-
|
|
270
|
-
`/deep-plan` is designed for high-stakes decisions where plan quality justifies the cost. For quick planning, use `EnterPlanMode` directly.
|
|
271
|
-
|
|
272
|
-
## Integration
|
|
273
|
-
|
|
274
|
-
| Component | Integration |
|
|
275
|
-
|-----------|-------------|
|
|
276
|
-
| `/research` | Phase 1 full invocation (via Skill tool or inline in Teams mode) + Phase 3 reduced invocation pattern |
|
|
277
|
-
| EnterPlanMode/ExitPlanMode | Phase 2 plan creation and user approval |
|
|
278
|
-
| Explore agents | Phase 2 codebase verification (up to 3 parallel) |
|
|
279
|
-
| R009 | Phase 1 (10 teams batched), Phase 2 (3 Explore), Phase 3 (3 teams) |
|
|
280
|
-
| R010 | Orchestrator manages all 3 phases; teams are subagents |
|
|
281
|
-
| R013 | Ecomode for team result aggregation |
|
|
282
|
-
| R015 | Phase transition intent display |
|
|
283
|
-
| result-aggregation | Phase 1 and 3 result formatting |
|
|
284
|
-
| superpowers:subagent-driven-development | Post-PASS implementation advisory (external plugin) |
|
|
285
|
-
|
|
286
|
-
## Fallback Behavior
|
|
287
|
-
|
|
288
|
-
| Scenario | Fallback |
|
|
289
|
-
|----------|----------|
|
|
290
|
-
| Phase 1 `/research` fails | Manual analysis, then proceed to Phase 2 |
|
|
291
|
-
| Phase 2 EnterPlanMode unavailable | Perform analysis without plan mode context |
|
|
292
|
-
| Phase 3 REVISE ≥ 2 times | Escalate to user for manual judgment |
|
|
293
|
-
| Explore agent failure | Reduce parallel count, retry with remaining |
|
|
294
|
-
| Partial team failure | Synthesize from available results, note gaps |
|
|
295
|
-
|
|
296
|
-
## Teams Mode
|
|
297
|
-
|
|
298
|
-
When running inside an Agent Teams member (not via Skill tool), the deep-plan workflow operates identically but with these adaptations:
|
|
299
|
-
|
|
300
|
-
### How It Works
|
|
301
|
-
|
|
302
|
-
The orchestrator reads this SKILL.md and includes the deep-plan instructions directly in the Teams member's prompt. The member then:
|
|
44
|
+
## Differentiation
|
|
303
45
|
|
|
304
|
-
|
|
305
|
-
|
|
306
|
-
|
|
307
|
-
|
|
46
|
+
| Skill | Scope | Code Verification |
|
|
47
|
+
|-------|-------|-------------------|
|
|
48
|
+
| `/research` | Analysis only | None by default |
|
|
49
|
+
| Plan mode | Planning only | Yes |
|
|
50
|
+
| `/structured-dev-cycle` | Implementation pipeline | Yes, stage by stage |
|
|
51
|
+
| `/deep-plan` | Analysis + planning + verification | Yes, 3-pass cross-verification |
|
|
308
52
|
|
|
309
|
-
|
|
53
|
+
## Sensitive-path compatibility note
|
|
310
54
|
|
|
311
|
-
|
|
312
|
-
# When spawning a Teams member for deep-plan:
|
|
313
|
-
Agent(
|
|
314
|
-
name: "planner-1",
|
|
315
|
-
team_name: "my-team",
|
|
316
|
-
prompt: """
|
|
317
|
-
You are a deep-plan agent. Follow the deep-plan skill workflow below:
|
|
318
|
-
{contents of deep-plan/SKILL.md}
|
|
55
|
+
Codex-native verification artifacts go under `.codex/outputs/sessions/YYYY-MM-DD/deep-plan-HHmmss.md` and may be written with the repo's normal file-write APIs.
|
|
319
56
|
|
|
320
|
-
|
|
321
|
-
{contents of research/SKILL.md}
|
|
57
|
+
If any research, planning, or verification delegate must touch `.claude/**`, `.claude/outputs/**`, or `templates/.claude/**`, include this block in that delegate prompt:
|
|
322
58
|
|
|
323
|
-
|
|
324
|
-
|
|
325
|
-
|
|
326
|
-
|
|
59
|
+
```text
|
|
60
|
+
Sensitive-path compatibility note:
|
|
61
|
+
1. Keep Codex-native .codex/** artifacts on the normal file-write path.
|
|
62
|
+
2. On Claude Code v2.1.121+ with bypassPermissions, direct writes to .claude/skills/, .claude/agents/, and .claude/commands/ are allowed.
|
|
63
|
+
3. On Claude Code v2.1.126+ with bypassPermissions, broader protected paths such as .claude/**, .git/**, and .vscode/** are also covered.
|
|
64
|
+
4. Use /tmp/deep-plan-<timestamp>.md only as a legacy fallback when the runtime is older or still prompts, then verify the resulting diff.
|
|
327
65
|
```
|
|
328
66
|
|
|
329
|
-
|
|
330
|
-
|
|
331
|
-
| Aspect | Orchestrator Mode | Teams Mode |
|
|
332
|
-
|--------|------------------|------------|
|
|
333
|
-
| Invocation | `Skill(deep-plan)` | Prompt embedding |
|
|
334
|
-
| Phase 1 research | `Skill(research)` | Inline execution |
|
|
335
|
-
| Result delivery | Return to main conversation | `SendMessage` to team lead |
|
|
336
|
-
| Plan approval | User via ExitPlanMode | Team lead via SendMessage |
|
|
337
|
-
| Context isolation | Previously used `context: fork` | Standard context (no fork) |
|
|
67
|
+
The protocol must be copied into delegate prompts. A parent-only mention is insufficient because forked contexts can lose the directive.
|
|
338
68
|
|
|
339
|
-
|
|
69
|
+
## Agent Teams
|
|
340
70
|
|
|
341
|
-
|
|
71
|
+
When Agent Teams are available and the plan has multiple independent verification lanes, the Phase 3 agents may run as a coordinated team. Otherwise, use bounded parallel subagents and aggregate results in the main conversation.
|
|
342
72
|
|
|
343
|
-
##
|
|
344
|
-
|
|
345
|
-
Phase 1 research artifact is persisted by the `/research` skill.
|
|
73
|
+
## Post-Completion Advisory
|
|
346
74
|
|
|
347
|
-
|
|
348
|
-
```
|
|
349
|
-
.codex/outputs/sessions/{YYYY-MM-DD}/deep-plan-{HHmmss}.md
|
|
350
|
-
```
|
|
75
|
+
After PASS, return an implementation-ready plan with file scopes, dependencies, test commands, release gates, rollback notes, and any unresolved risks.
|
|
351
76
|
|
|
352
|
-
|
|
77
|
+
## Permission Mode
|
|
353
78
|
|
|
354
|
-
|
|
355
|
-
```markdown
|
|
356
|
-
---
|
|
357
|
-
skill: deep-plan
|
|
358
|
-
date: {ISO-8601 with timezone}
|
|
359
|
-
query: "{original user query}"
|
|
360
|
-
phases_completed: 3
|
|
361
|
-
verdict: PASS|REVISE
|
|
362
|
-
---
|
|
363
|
-
```
|
|
79
|
+
When spawning agents, explicitly pass `mode: "bypassPermissions"` if the runtime supports it. Defaults may override agent frontmatter and reintroduce permission prompts during unattended execution.
|
|
@@ -73,7 +73,7 @@ Each agent receives the full diff and returns findings as structured JSON:
|
|
|
73
73
|
- Verify all changes align with project's compilation metaphor (Skills=source, Agents=artifacts, Rules=spec)
|
|
74
74
|
- Check separation of concerns: no agents containing skill logic, no skills with agent definitions
|
|
75
75
|
- Verify orchestrator rules: no new file writes from orchestrator context
|
|
76
|
-
- Verify sensitive-path
|
|
76
|
+
- Verify sensitive-path compatibility: prompts that touch `.claude/**`, `.claude/outputs/**`, or `templates/.claude/**` include the `Sensitive-path compatibility note`, keep `.codex/**` artifacts on the normal file-write path, and treat `/tmp/{skill}-{timestamp}.md` only as a legacy fallback for older Claude Code versions or sessions that still prompt
|
|
77
77
|
- Check advisory-first: no new hard-blocking hooks introduced
|
|
78
78
|
- Confirm no feature regressions: existing APIs preserved, test coverage maintained
|
|
79
79
|
- Performance sanity: no O(n^2) on large datasets, no missing indexes for new queries
|
|
@@ -10,7 +10,7 @@ context: fork
|
|
|
10
10
|
|
|
11
11
|
## Sensitive-Path Delegation
|
|
12
12
|
|
|
13
|
-
Sensitive-path
|
|
13
|
+
Sensitive-path compatibility note: if this skill delegates work that touches `.claude/**`, `.claude/outputs/**`, `templates/.claude/**`, or read-only measurements of those paths, keep `.codex/**` edits on the normal Codex path. On Claude Code v2.1.121+ with `bypassPermissions`, direct writes to `.claude/skills/`, `.claude/agents/`, and `.claude/commands/` are allowed; on v2.1.126+ that extends to broader protected paths. Only use `/tmp/{skill}-{timestamp}.md` as a legacy fallback when the target runtime is older or still prompts.
|
|
14
14
|
|
|
15
15
|
## Engineers
|
|
16
16
|
|
|
@@ -113,6 +113,11 @@ If only PASS/INFO: proceed automatically.
|
|
|
113
113
|
6. **Artifact persistence** (optional): Review agent saves findings to:
|
|
114
114
|
```
|
|
115
115
|
.codex/outputs/sessions/{YYYY-MM-DD}/dev-review-{HHmmss}.md
|
|
116
|
+
|
|
117
|
+
### Compatibility artifact protocol
|
|
118
|
+
|
|
119
|
+
Sensitive-path compatibility note: when delegated work touches `.claude/outputs/`, `.claude/**`, or `templates/.claude/**`, keep `.codex/**` artifacts on the normal file-write path. On Claude Code v2.1.121+ with `bypassPermissions`, direct compatibility writes are allowed for `.claude/skills/`, `.claude/agents/`, and `.claude/commands/`; on v2.1.126+ broader protected paths are covered. Use `/tmp/<skill>-<timestamp>.md` only as a legacy fallback when the runtime is older or still prompts.
|
|
120
|
+
|
|
116
121
|
```
|
|
117
122
|
With metadata header:
|
|
118
123
|
```markdown
|
|
@@ -15,7 +15,7 @@ high-scoring candidates.
|
|
|
15
15
|
|
|
16
16
|
## Sensitive-Path Delegation
|
|
17
17
|
|
|
18
|
-
Sensitive-path
|
|
18
|
+
Sensitive-path compatibility note: if this skill delegates work that touches `.claude/**`, `.claude/outputs/**`, `templates/.claude/**`, or read-only measurements of those paths, keep `.codex/**` edits on the normal Codex path. On Claude Code v2.1.121+ with `bypassPermissions`, direct writes to `.claude/skills/`, `.claude/agents/`, and `.claude/commands/` are allowed; on v2.1.126+ that extends to broader protected paths. Only use `/tmp/{skill}-{timestamp}.md` as a legacy fallback when the target runtime is older or still prompts.
|
|
19
19
|
|
|
20
20
|
## Purpose
|
|
21
21
|
|
|
@@ -16,7 +16,7 @@ Evaluate agent quality using 15 structured software engineering task definitions
|
|
|
16
16
|
|
|
17
17
|
## Sensitive-Path Delegation
|
|
18
18
|
|
|
19
|
-
Sensitive-path
|
|
19
|
+
Sensitive-path compatibility note: if this skill delegates work that touches `.claude/**`, `.claude/outputs/**`, `templates/.claude/**`, or read-only measurements of those paths, keep `.codex/**` edits on the normal Codex path. On Claude Code v2.1.121+ with `bypassPermissions`, direct writes to `.claude/skills/`, `.claude/agents/`, and `.claude/commands/` are allowed; on v2.1.126+ that extends to broader protected paths. Only use `/tmp/{skill}-{timestamp}.md` as a legacy fallback when the target runtime is older or still prompts.
|
|
20
20
|
|
|
21
21
|
## Usage
|
|
22
22
|
|
|
@@ -93,6 +93,11 @@ harness:
|
|
|
93
93
|
|
|
94
94
|
1. **Read target agent frontmatter** — extract `tools`, `domain`, `limitations` fields
|
|
95
95
|
2. **Analyze recent tool call patterns** — check `.codex/outputs/` for prior session logs (if available)
|
|
96
|
+
|
|
97
|
+
### Compatibility artifact protocol
|
|
98
|
+
|
|
99
|
+
Sensitive-path compatibility note: when delegated work touches `.claude/outputs/`, `.claude/**`, or `templates/.claude/**`, keep `.codex/**` artifacts on the normal file-write path. On Claude Code v2.1.121+ with `bypassPermissions`, direct compatibility writes are allowed for `.claude/skills/`, `.claude/agents/`, and `.claude/commands/`; on v2.1.126+ broader protected paths are covered. Use `/tmp/<skill>-<timestamp>.md` only as a legacy fallback when the runtime is older or still prompts.
|
|
100
|
+
|
|
96
101
|
3. **Synthesize validation harness** — generate YAML harness matching agent's declared capabilities
|
|
97
102
|
4. **Refine via evaluator-optimizer loop** — iterate harness against edge cases (3 rounds max)
|
|
98
103
|
5. **Save output** — write to `.codex/outputs/harnesses/{agent-name}-{mode}.yaml`
|
|
@@ -15,7 +15,7 @@ Surface actionable improvement suggestions gathered by the eval-core analysis en
|
|
|
15
15
|
|
|
16
16
|
## Sensitive-Path Delegation
|
|
17
17
|
|
|
18
|
-
Sensitive-path
|
|
18
|
+
Sensitive-path compatibility note: if this skill delegates work that touches `.claude/**`, `.claude/outputs/**`, `templates/.claude/**`, or read-only measurements of those paths, keep `.codex/**` edits on the normal Codex path. On Claude Code v2.1.121+ with `bypassPermissions`, direct writes to `.claude/skills/`, `.claude/agents/`, and `.claude/commands/` are allowed; on v2.1.126+ that extends to broader protected paths. Only use `/tmp/{skill}-{timestamp}.md` as a legacy fallback when the target runtime is older or still prompts.
|
|
19
19
|
|
|
20
20
|
## Usage
|
|
21
21
|
|
|
@@ -16,7 +16,7 @@ When an agent or skill has evolved organically without a formal spec, `omcodex:t
|
|
|
16
16
|
|
|
17
17
|
## Sensitive-Path Delegation
|
|
18
18
|
|
|
19
|
-
Sensitive-path
|
|
19
|
+
Sensitive-path compatibility note: if this skill delegates work that touches `.claude/**`, `.claude/outputs/**`, `templates/.claude/**`, or read-only measurements of those paths, keep `.codex/**` edits on the normal Codex path. On Claude Code v2.1.121+ with `bypassPermissions`, direct writes to `.claude/skills/`, `.claude/agents/`, and `.claude/commands/` are allowed; on v2.1.126+ that extends to broader protected paths. Only use `/tmp/{skill}-{timestamp}.md` as a legacy fallback when the target runtime is older or still prompts.
|
|
20
20
|
|
|
21
21
|
## Usage
|
|
22
22
|
|
|
@@ -24,7 +24,7 @@ target Build output path or project root (optional, auto-detects)
|
|
|
24
24
|
|
|
25
25
|
## Sensitive-Path Delegation
|
|
26
26
|
|
|
27
|
-
Sensitive-path
|
|
27
|
+
Sensitive-path compatibility note: if this skill delegates work that touches `.claude/**`, `.claude/outputs/**`, `templates/.claude/**`, or read-only measurements of those paths, keep `.codex/**` edits on the normal Codex path. On Claude Code v2.1.121+ with `bypassPermissions`, direct writes to `.claude/skills/`, `.claude/agents/`, and `.claude/commands/` are allowed; on v2.1.126+ that extends to broader protected paths. Only use `/tmp/{skill}-{timestamp}.md` as a legacy fallback when the target runtime is older or still prompts.
|
|
28
28
|
|
|
29
29
|
## Workflow
|
|
30
30
|
|
|
@@ -20,7 +20,7 @@ Generate comprehensive optimization report with analysis, metrics, and recommend
|
|
|
20
20
|
|
|
21
21
|
## Sensitive-Path Delegation
|
|
22
22
|
|
|
23
|
-
Sensitive-path
|
|
23
|
+
Sensitive-path compatibility note: if this skill delegates work that touches `.claude/**`, `.claude/outputs/**`, `templates/.claude/**`, or read-only measurements of those paths, keep `.codex/**` edits on the normal Codex path. On Claude Code v2.1.121+ with `bypassPermissions`, direct writes to `.claude/skills/`, `.claude/agents/`, and `.claude/commands/` are allowed; on v2.1.126+ that extends to broader protected paths. Only use `/tmp/{skill}-{timestamp}.md` as a legacy fallback when the target runtime is older or still prompts.
|
|
24
24
|
|
|
25
25
|
## Workflow
|
|
26
26
|
|
|
@@ -24,6 +24,11 @@ Gather unfinished work from multiple sources:
|
|
|
24
24
|
|
|
25
25
|
**Source B — Deep-verify findings**:
|
|
26
26
|
- Read the latest deep-verify output from `.codex/outputs/sessions/{today}/`
|
|
27
|
+
|
|
28
|
+
### Compatibility artifact protocol
|
|
29
|
+
|
|
30
|
+
Sensitive-path compatibility note: when delegated work touches `.claude/outputs/`, `.claude/**`, or `templates/.claude/**`, keep `.codex/**` artifacts on the normal file-write path. On Claude Code v2.1.121+ with `bypassPermissions`, direct compatibility writes are allowed for `.claude/skills/`, `.claude/agents/`, and `.claude/commands/`; on v2.1.126+ broader protected paths are covered. Use `/tmp/<skill>-<timestamp>.md` only as a legacy fallback when the runtime is older or still prompts.
|
|
31
|
+
|
|
27
32
|
- Extract any MEDIUM or LOW severity findings that were flagged but not fixed
|
|
28
33
|
|
|
29
34
|
**Source C — Triage deferred items**:
|
|
@@ -46,9 +51,9 @@ Remove duplicates (same issue referenced from multiple sources). Categorize:
|
|
|
46
51
|
|
|
47
52
|
| Category | Criteria | Default Action |
|
|
48
53
|
|----------|----------|----------------|
|
|
49
|
-
|
|
|
50
|
-
|
|
|
51
|
-
|
|
|
54
|
+
| **즉시 실행** | P1/P2 잔여 이슈, MEDIUM+ 검증 발견사항, Critical/High PR 리뷰 발견사항 | 즉시 실행 |
|
|
55
|
+
| **이슈 등록** | P3 이슈, LOW 검증 발견사항, 새 TODO, Medium PR 리뷰 발견사항 | 이슈로 등록 |
|
|
56
|
+
| **참고** | 이미 추적 중인 이슈, 외관 관련 메모 | 건너뛰기 |
|
|
52
57
|
|
|
53
58
|
### 3. Present to User
|
|
54
59
|
|
|
@@ -117,8 +122,8 @@ When creating follow-up issues:
|
|
|
117
122
|
|
|
118
123
|
```bash
|
|
119
124
|
gh issue create \
|
|
120
|
-
--title "{
|
|
121
|
-
--body "##
|
|
125
|
+
--title "{간결한 설명}" \
|
|
126
|
+
--body "## 출처\n\nv{version} 릴리즈 워크플로우에서 발견.\n\n## 컨텍스트\n\n{triage/verify에서의 상세 컨텍스트}\n\n## 권장 조치\n\n{권장 사항}" \
|
|
122
127
|
--label "professor"
|
|
123
128
|
```
|
|
124
129
|
|