codex-workflows 0.4.9 → 0.4.11

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -9,6 +9,8 @@ description: "Add integration/E2E tests to existing codebase using Design Docs."
9
9
  2. [LOAD IF NOT ACTIVE] `integration-e2e-testing` — integration and E2E test patterns
10
10
  3. [LOAD IF NOT ACTIVE] `documentation-criteria` — document creation rules and templates
11
11
 
12
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
13
+
12
14
  **Context**: Test addition workflow for existing implementations
13
15
 
14
16
  ## Orchestrator Definition
@@ -10,6 +10,8 @@ description: "Execute decomposed backend tasks in autonomous execution mode usin
10
10
  3. [LOAD IF NOT ACTIVE] `ai-development-guide` — AI development patterns
11
11
  4. [LOAD IF NOT ACTIVE] `subagents-orchestration-guide` — agent coordination and workflow flows
12
12
 
13
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
14
+
13
15
  ## Orchestrator Definition
14
16
 
15
17
  **Core Identity**: "I am not a worker. I am an orchestrator." (see subagents-orchestration-guide skill)
@@ -9,6 +9,8 @@ description: "Execute from requirement analysis to design document creation."
9
9
  2. [LOAD IF NOT ACTIVE] `implementation-approach` — implementation strategy
10
10
  3. [LOAD IF NOT ACTIVE] `subagents-orchestration-guide` — agent coordination and workflow flows
11
11
 
12
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
13
+
12
14
  **Context**: Dedicated to the design phase.
13
15
 
14
16
  ## Orchestrator Definition
@@ -8,6 +8,8 @@ description: "Investigate problem, verify findings, and derive solutions through
8
8
  1. [LOAD IF NOT ACTIVE] `ai-development-guide` — AI development patterns
9
9
  2. [LOAD IF NOT ACTIVE] `coding-rules` — coding standards
10
10
 
11
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
12
+
11
13
  **Context**: Diagnosis flow to identify concrete failure points and present solutions
12
14
 
13
15
  Target problem: $ARGUMENTS
@@ -10,6 +10,8 @@ description: "Execute frontend tasks in autonomous execution mode using task-exe
10
10
  3. [LOAD IF NOT ACTIVE] `ai-development-guide` -- AI development patterns
11
11
  4. [LOAD IF NOT ACTIVE] `subagents-orchestration-guide` -- agent coordination and workflow flows
12
12
 
13
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
14
+
13
15
  ## Orchestrator Definition
14
16
 
15
17
  **Core Identity**: "I am not a worker. I am an orchestrator." (see subagents-orchestration-guide skill)
@@ -11,6 +11,8 @@ description: "Execute from requirement analysis to frontend design document crea
11
11
  2. [LOAD IF NOT ACTIVE] `implementation-approach` -- implementation methodology
12
12
  3. [LOAD IF NOT ACTIVE] `subagents-orchestration-guide` -- agent coordination and workflow flows
13
13
 
14
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
15
+
14
16
  ## Orchestrator Definition
15
17
 
16
18
  **Core Identity**: "I am not a worker. I am an orchestrator."
@@ -11,6 +11,8 @@ description: "Create frontend work plan from design document with test skeleton
11
11
  2. [LOAD IF NOT ACTIVE] `implementation-approach` -- implementation methodology
12
12
  3. [LOAD IF NOT ACTIVE] `subagents-orchestration-guide` -- agent coordination and workflow flows
13
13
 
14
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
15
+
14
16
  ## Orchestrator Definition
15
17
 
16
18
  **Core Identity**: "I am not a worker. I am an orchestrator."
@@ -11,6 +11,8 @@ description: "Frontend Design Doc compliance and security validation with option
11
11
  2. [LOAD IF NOT ACTIVE] `testing` -- test strategy and quality gates
12
12
  3. [LOAD IF NOT ACTIVE] `ai-development-guide` -- AI development patterns
13
13
 
14
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
15
+
14
16
  ## Execution Method
15
17
 
16
18
  - Compliance validation -> performed by code-reviewer
@@ -10,6 +10,8 @@ description: "Execute decomposed fullstack tasks with layer-aware agent routing
10
10
  3. [LOAD IF NOT ACTIVE] `ai-development-guide` -- AI development patterns
11
11
  4. [LOAD IF NOT ACTIVE] `subagents-orchestration-guide` -- agent coordination and workflow flows
12
12
 
13
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
14
+
13
15
  ## Orchestrator Definition
14
16
 
15
17
  **Core Identity**: "I am not a worker. I am an orchestrator." (see subagents-orchestration-guide skill)
@@ -14,6 +14,8 @@ description: "Orchestrate full-cycle implementation across backend and frontend
14
14
  5. [LOAD IF NOT ACTIVE] `implementation-approach` -- implementation methodology
15
15
  6. [LOAD IF NOT ACTIVE] `subagents-orchestration-guide` -- agent coordination and workflow flows
16
16
 
17
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
18
+
17
19
  ## Orchestrator Definition
18
20
 
19
21
  **Core Identity**: "I am not a worker. I am an orchestrator." (see subagents-orchestration-guide skill)
@@ -8,6 +8,8 @@ description: "Orchestrate the complete implementation lifecycle from requirement
8
8
  1. [LOAD IF NOT ACTIVE] `subagents-orchestration-guide` — agent coordination and workflow flows
9
9
  2. [LOAD IF NOT ACTIVE] `documentation-criteria` — document creation rules and templates
10
10
 
11
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
12
+
11
13
  # Full-Cycle Implementation
12
14
 
13
15
  $ARGUMENTS
@@ -9,6 +9,8 @@ description: "Create work plan from design document with optional test skeleton
9
9
  2. [LOAD IF NOT ACTIVE] `implementation-approach` — implementation strategy
10
10
  3. [LOAD IF NOT ACTIVE] `subagents-orchestration-guide` — agent coordination and workflow flows
11
11
 
12
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
13
+
12
14
  **Context**: Dedicated to the planning phase.
13
15
 
14
16
  ## Orchestrator Definition
@@ -9,6 +9,8 @@ description: "Generate PRD and Design Docs from existing codebase through discov
9
9
  2. [LOAD IF NOT ACTIVE] `ai-development-guide` — AI development patterns
10
10
  3. [LOAD IF NOT ACTIVE] `subagents-orchestration-guide` — agent coordination and workflow flows
11
11
 
12
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
13
+
12
14
  **Context**: Reverse engineering workflow to create documentation from existing code
13
15
 
14
16
  Target: $ARGUMENTS
@@ -9,6 +9,8 @@ description: "Design Doc compliance and security validation with optional auto-f
9
9
  2. [LOAD IF NOT ACTIVE] `testing` — test strategy and quality gates
10
10
  3. [LOAD IF NOT ACTIVE] `ai-development-guide` — AI development patterns
11
11
 
12
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
13
+
12
14
  **Context**: Post-implementation quality assurance
13
15
 
14
16
  ## Orchestrator Definition
@@ -7,6 +7,8 @@ description: "Execute tasks with metacognitive analysis and appropriate rule sel
7
7
 
8
8
  1. [LOAD IF NOT ACTIVE] `task-analyzer` — task analysis and skill selection (rule-advisor handles remaining skill selection)
9
9
 
10
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
11
+
10
12
  # Task Execution with Metacognitive Analysis
11
13
 
12
14
  Task: $ARGUMENTS
@@ -8,6 +8,8 @@ description: "Update existing design documents (Design Doc / PRD / ADR) with rev
8
8
  1. [LOAD IF NOT ACTIVE] `documentation-criteria` — document creation rules and templates
9
9
  2. [LOAD IF NOT ACTIVE] `subagents-orchestration-guide` — agent coordination and workflow flows
10
10
 
11
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
12
+
11
13
  **Context**: Dedicated to updating existing design documents.
12
14
 
13
15
  ## Orchestrator Definition
@@ -5,6 +5,8 @@ description: "Guides subagent coordination through implementation workflows. Use
5
5
 
6
6
  # Subagents Orchestration Guide
7
7
 
8
+ **Spawn rule**: every `spawn_agent` call MUST pass `fork_turns="none"` or `fork_context=false` for context isolation.
9
+
8
10
  ## Role: The Orchestrator
9
11
 
10
12
  The orchestrator coordinates subagents. All investigation, analysis, and implementation work flows through specialized subagents.
@@ -96,28 +98,51 @@ Subagents CANNOT directly call other subagents — all coordination MUST flow th
96
98
 
97
99
  **ENFORCEMENT**: Direct subagent-to-subagent communication is PROHIBITED
98
100
 
101
+ ### Subagent Completion Discipline [MANDATORY]
102
+
103
+ The orchestrator owns subagent completion. Base waiting decisions on assigned responsibility and observed state, not on an expectation of quick completion. Multi-step search, review, verification, generation, implementation, and quality work can run for extended periods.
104
+
105
+ Use this contract:
106
+ - Wait for required subagent outputs with `wait_agent`
107
+ - Keep the current task assignment while the subagent remains `running`
108
+ - Treat missing intermediate output as a normal execution state while the subagent remains `running`
109
+ - Hold final artifact production until every required subagent output is available
110
+ - After repeated empty waits, run non-destructive diagnostics: re-check prompt, inputs, expected deliverable, and agent-task fit; send a focused follow-up when it would clarify the pending deliverable
111
+ - Resume waiting after diagnostics unless the user redirects the workflow or the orchestrator confirms a launch mistake
112
+
113
+ Treat the following as explicit contradictory evidence:
114
+ - The subagent returns a terminal status such as `approved`, `needs_revision`, `blocked`, `skipped`, `completed`, or `escalation_needed`
115
+ - The orchestrator verifies that it launched the wrong subagent or sent materially incorrect inputs
116
+ - A newer explicit user instruction changes or cancels the task
117
+
118
+ Close a running subagent only when the user redirects the workflow, the orchestrator corrects a launch mistake, or a newer user instruction supersedes the pending task.
119
+
120
+ **ENFORCEMENT**: Preserve subagent execution until completion, user redirection, or explicit correction of an orchestrator launch mistake. Speed-based early termination is a CRITICAL VIOLATION.
121
+
99
122
  ## How to Spawn Agents
100
123
 
101
- Spawn agents using natural language prompts. Provide clear context about what the agent should accomplish.
124
+ Spawn agents using natural language prompts. Provide clear context about what the agent should accomplish. Every `spawn_agent` call MUST include `fork_turns="none"` or `fork_context=false` (see Spawn rule at top of this skill).
102
125
 
103
126
  ### Spawn Examples
104
127
 
105
- **requirement-analyzer**:
128
+ Each example below is a spawn with `fork_turns="none"` or `fork_context=false` plus the quoted prompt.
129
+
130
+ **requirement-analyzer** (`fork_turns="none"` or `fork_context=false`):
106
131
  > "Analyze the following requirements and determine the work scale: [user requirements]. Perform requirement analysis and scale determination."
107
132
 
108
- **codebase-analyzer**:
133
+ **codebase-analyzer** (`fork_turns="none"` or `fork_context=false`):
109
134
  > "Analyze the existing codebase to provide evidence for Design Doc creation. Focus on existing implementations, data model elements, and constraints the design should respect. requirement_analysis: [JSON]. prd_path: [path if available]. requirements: [original user requirements]. layer: [target layer if applicable]. target_paths: [paths if narrowed]. Return codebase facts and focus areas."
110
135
 
111
- **task-executor**:
136
+ **task-executor** (`fork_turns="none"` or `fork_context=false`):
112
137
  > "Execute the implementation task defined in docs/plans/tasks/[filename].md. Complete the implementation following TDD Red-Green-Refactor."
113
138
 
114
- **quality-fixer**:
139
+ **quality-fixer** (`fork_turns="none"` or `fork_context=false`):
115
140
  > "Run quality checks on the codebase: static analysis, style check, all test execution. Fix any issues found and report when all checks pass."
116
141
 
117
- **document-reviewer**:
142
+ **document-reviewer** (`fork_turns="none"` or `fork_context=false`):
118
143
  > "Review the document at [path] for quality and rule compliance. Check against documentation-criteria standards."
119
144
 
120
- **design-sync**:
145
+ **design-sync** (`fork_turns="none"` or `fork_context=false`):
121
146
  > "Verify consistency between Design Docs in docs/design/. Use [path] as the source document for comparison."
122
147
 
123
148
  ## Explicit Stop Points [MANDATORY]
@@ -149,16 +174,9 @@ All agents MUST use this vocabulary consistently:
149
174
  | `blocked` | security-reviewer | Committed secrets or high-confidence exploitable risk | Halt workflow immediately, escalate to user (requires human intervention) |
150
175
  | `skipped` | All agents | Preconditions not met for this step | Report reason, proceed |
151
176
 
152
- **approved_with_conditions handling** (document agents):
153
- - Conditions MUST be listed explicitly in the agent's output
154
- - Orchestrator MUST append conditions to the document's "Undetermined Items" or "Open Items" section before proceeding
155
- - Orchestrator MUST pass conditions to the next phase's agent as context
156
- - Conditions do not block progression but MUST be resolved before implementation phase
157
-
158
- **approved_with_notes handling** (security-reviewer):
159
- - Notes are informational — they do NOT require resolution before proceeding
160
- - Orchestrator MUST include notes in the completion report for awareness
161
- - Do not apply approved_with_conditions handling (no resolution tracking)
177
+ Handling rules:
178
+ - `approved_with_conditions`: append the listed conditions to the document's open-items section, carry them into the next phase, and resolve them before implementation
179
+ - `approved_with_notes`: include the notes in the completion report for awareness
162
180
 
163
181
  **ENFORCEMENT**: Using any status value outside this vocabulary is a VIOLATION.
164
182
 
@@ -176,17 +194,20 @@ All agents MUST use this vocabulary consistently:
176
194
 
177
195
  ## Structured Response Specification
178
196
 
179
- Subagents respond in JSON format. The final response from each JSON-returning subagent must be the JSON payload itself, with no trailing prose. Key fields for orchestrator decisions:
180
- - **requirement-analyzer**: scale, confidence, affectedLayers, adrRequired, scopeDependencies, questions
181
- - **codebase-analyzer**: analysisScope, existingElements, dataModel, qualityAssurance, focusAreas, limitations
182
- - **task-executor**: status (escalation_needed/completed), escalation_type (design_compliance_violation/similar_function_found/similar_component_found/investigation_target_not_found/out_of_scope_file/test_environment_not_ready/dependency_version_uncertain), testsAdded, requiresTestReview
183
- - **quality-fixer**: Input: `task_file` (always pass the current task file path in orchestrated flows). Status (`stub_detected`/approved/blocked). `stub_detected` returns `stubFindings[]` and routes back to the task executor. For blocked responses, discriminate by `reason`: specification conflicts use `blockingIssues[]`; execution prerequisites use `missingPrerequisites[]`, and each item provides its own `resolutionSteps`
184
- - **document-reviewer**: verdict.decision (approved/approved_with_conditions/needs_revision/rejected)
185
- - **code-verifier**: summary.status, summary.consistencyScore, discrepancies, reverseCoverage
186
- - **design-sync**: sync_status (CONFLICTS_FOUND/NO_CONFLICTS) text format with [SUMMARY] block
187
- - **integration-test-reviewer**: status (approved/needs_revision/blocked), requiredFixes
188
- - **security-reviewer**: status (approved/approved_with_notes/needs_revision/blocked), findings, notes, requiredFixes
189
- - **acceptance-test-generator**: status, generatedFiles, `e2eAbsenceReason`
197
+ Subagents respond in JSON format. The final response from each JSON-returning subagent must be the JSON payload itself, with no trailing prose. Agent TOML files define the full schemas; the orchestrator only relies on these routing keys:
198
+
199
+ | Agent | Routing fields the orchestrator uses |
200
+ |-------|--------------------------------------|
201
+ | `requirement-analyzer` | `scale`, `confidence`, `affectedLayers`, `adrRequired`, `scopeDependencies`, `questions` |
202
+ | `codebase-analyzer` | `focusAreas`, `dataModel`, `qualityAssurance`, `dataTransformationPipelines`, `limitations` |
203
+ | `task-executor*` | `status`, `escalation_type`, `filesModified`, `requiresTestReview` |
204
+ | `quality-fixer*` | `status`, `reason`, `stubFindings`, `blockingIssues`, `missingPrerequisites` |
205
+ | `document-reviewer` | `verdict.decision`, `verdict.conditions` |
206
+ | `code-verifier` | `summary.status`, `discrepancies`, `reverseCoverage` |
207
+ | `design-sync` | `sync_status` |
208
+ | `integration-test-reviewer` | `status`, `requiredFixes` |
209
+ | `security-reviewer` | `status`, `findings`, `notes`, `requiredFixes` |
210
+ | `acceptance-test-generator` | `status`, `generatedFiles`, `e2eAbsenceReason` |
190
211
 
191
212
  ## Handling Requirement Changes
192
213
 
@@ -215,51 +236,21 @@ Document generation agents (work-planner, technical-designer, prd-creator) can u
215
236
 
216
237
  ## Basic Flow for Work Planning
217
238
 
218
- When receiving new features or change requests, start with requirement-analyzer.
219
-
220
- ### Large Scale (6+ Files) - 13 Steps (backend) / 15 Steps (frontend/fullstack)
221
-
222
- 1. requirement-analyzer: Requirement analysis + Check existing PRD **[Stop]**
223
- 2. prd-creator: PRD creation
224
- 3. document-reviewer: PRD review **[Stop: PRD Approval]**
225
- 4. **(frontend/fullstack only)** Ask user for prototype code; ui-spec-designer: UI Spec creation
226
- 5. **(frontend/fullstack only)** document-reviewer: UI Spec review **[Stop: UI Spec Approval]**
227
- 6. technical-designer: ADR creation (if architecture/technology/data flow changes)
228
- 7. document-reviewer: ADR review (if ADR created) **[Stop: ADR Approval]**
229
- 8. codebase-analyzer: Codebase analysis (pass requirement-analyzer output and PRD path when available)
230
- 9. technical-designer: Design Doc creation
231
- 10. code-verifier: Design Doc verification against code
232
- 11. document-reviewer: Design Doc review with code verification evidence
233
- 12. design-sync: Consistency verification **[Stop: Design Doc Approval]**
234
- 13. acceptance-test-generator: Test skeleton generation, pass to work-planner
235
- 14. work-planner: Work plan creation **[Stop: Batch approval]**
236
- 15. task-decomposer: Autonomous execution to Completion report
237
-
238
- ### Medium Scale (3-5 Files) - 9 Steps (backend) / 11 Steps (frontend/fullstack)
239
-
240
- 1. requirement-analyzer: Requirement analysis **[Stop]**
241
- 2. codebase-analyzer: Codebase analysis
242
- 3. **(frontend/fullstack only)** Ask user for prototype code; ui-spec-designer: UI Spec creation
243
- 4. **(frontend/fullstack only)** document-reviewer: UI Spec review **[Stop: UI Spec Approval]**
244
- 5. technical-designer: Design Doc creation
245
- 6. code-verifier: Design Doc verification against code
246
- 7. document-reviewer: Design Doc review with code verification evidence
247
- 8. design-sync: Consistency verification **[Stop: Design Doc Approval]**
248
- 9. acceptance-test-generator: Test skeleton generation, pass to work-planner
249
- 10. work-planner: Work plan creation **[Stop: Batch approval]**
250
- 11. task-decomposer: Autonomous execution to Completion report
251
-
252
- ### Design Flow Data Passing
253
-
254
- - Pass requirement-analyzer output and original requirements to codebase-analyzer
255
- - Pass codebase-analyzer JSON to technical-designer or technical-designer-frontend as `Codebase Analysis`, including `dataTransformationPipelines` and `qualityAssurance` when present
256
- - Pass Design Doc path to code-verifier
257
- - Pass code-verifier JSON to document-reviewer as `code_verification`
258
-
259
- ### Small Scale (1-2 Files) - 2 Steps
260
-
261
- 1. Create simplified plan **[Stop: Batch approval]**
262
- 2. Direct implementation to Completion report
239
+ Always start with `requirement-analyzer`, then follow the minimum flow required by scale and affected layers.
240
+
241
+ | Scale | Required flow |
242
+ |-------|---------------|
243
+ | Large | `requirement-analyzer` **[Stop]** -> `prd-creator` -> `document-reviewer` **[Stop]** -> optional `ui-spec-designer` + `document-reviewer` **[Stop]** -> optional ADR + `document-reviewer` **[Stop]** -> `codebase-analyzer` -> `technical-designer*` -> `code-verifier` -> `document-reviewer` -> `design-sync` **[Stop]** -> `acceptance-test-generator` -> `work-planner` **[Stop]** -> `task-decomposer` |
244
+ | Medium | `requirement-analyzer` **[Stop]** -> `codebase-analyzer` -> optional `ui-spec-designer` + `document-reviewer` **[Stop]** -> `technical-designer*` -> `code-verifier` -> `document-reviewer` -> `design-sync` **[Stop]** -> `acceptance-test-generator` -> `work-planner` **[Stop]** -> `task-decomposer` |
245
+ | Small | `requirement-analyzer` **[Stop]** -> simplified plan **[Stop: Batch approval]** -> direct implementation |
246
+
247
+ Flow rules:
248
+ - Frontend and fullstack flows add UI Spec before Design Doc creation
249
+ - Create ADR only when architecture, technology, or data-flow changes require it
250
+ - Pass requirement-analyzer output and original requirements to `codebase-analyzer`
251
+ - Pass `codebase-analyzer` output to the designer as `Codebase Analysis`
252
+ - Pass Design Doc path to `code-verifier`, then pass `code_verification` to `document-reviewer`
253
+ - Fullstack layer sequencing is defined in `references/monorepo-flow.md`
263
254
 
264
255
  ## Autonomous Execution Mode
265
256
 
@@ -316,6 +307,13 @@ Stop autonomous execution and escalate to user in the following cases:
316
307
  3. **Work-planner update restriction violated**: Requirement changes after task-decomposer starts require overall redesign
317
308
  4. **User explicitly stops**: Direct stop instruction or interruption
318
309
 
310
+ Continue autonomous execution in the following situations:
311
+ - A subagent takes longer than expected
312
+ - `wait_agent` returns without a completion payload while the subagent remains `running`
313
+ - The orchestrator has partial context but is still waiting on a required subagent output
314
+
315
+ If repeated waits return the same `running` state, apply the completion-diagnostics contract above.
316
+
319
317
  Use the task loop defined in the autonomous execution diagram above. The canonical per-task cycle is:
320
318
  1. task-executor implementation
321
319
  2. escalation or integration-test-reviewer decision
@@ -338,49 +336,27 @@ Maximum retry count is 1 verification fix cycle. If any failed verifier still fa
338
336
  2. **Information Bridging**: Data conversion and transmission between subagents
339
337
  - Convert each subagent's output to next subagent's input format
340
338
  - **Always pass deliverables from previous process to next agent**
341
- - Extract necessary information from structured responses
342
- - Compose commit messages from changeSummary
339
+ - Extract the routing fields listed above
343
340
  - Explicitly integrate initial and additional requirements when requirements change
344
341
  3. **Quality Assurance and Commit Execution**: Execute git commit per the 4-step task cycle
345
342
  4. **Autonomous Execution Mode Management**: Start/stop autonomous execution after approval, escalation decisions
346
343
  5. **ADR Status Management**: Update ADR status after user decision (Accepted/Rejected)
347
344
 
348
- ### acceptance-test-generator to work-planner Bridge
349
-
350
- **Pass to acceptance-test-generator**:
351
- - Design Doc: [path]
352
- - UI Spec: [path] (if exists)
353
-
354
- **Orchestrator verification items**:
355
- - Verify integration test file path retrieval and existence
356
- - Verify E2E test file path retrieval and existence when `generatedFiles.e2e` is not null
357
- - Verify `e2eAbsenceReason` is present when `generatedFiles.e2e` is null
358
-
359
- **Pass to work-planner**:
360
- - Integration test file: [path] (create and execute simultaneously with each phase implementation)
361
- - E2E test file: [path] or `null` (execute only in final phase when present)
362
- - E2E absence reason: [value when E2E test file is null]
363
-
364
- **On error**: Escalate to user only when required outputs are missing without a valid absence reason
365
-
366
- ### Design Doc to Work Plan Verification Handoff
367
-
368
- When a Design Doc contains a Verification Strategy section, the orchestrator must carry forward:
369
- - Design Doc path
370
- - Verification Strategy details:
371
- - Correctness definition
372
- - Target comparison
373
- - Verification method
374
- - Observable success indicator
375
- - Verification timing
376
- - Early verification point (first target, success criteria, failure response)
377
-
378
- The resulting work plan must include this summary in its header so the plan remains self-sufficient for downstream task generation and execution planning.
379
- When the Design Doc includes an `Output Comparison` section, carry forward the comparison input, expected output fields or format, diff method, and transformation pipeline coverage as part of that summary.
380
-
381
- In addition, the orchestrator must preserve implementation-relevant technical requirements from each Design Doc so work-planner can create a Design-to-Plan Traceability table. Use the category values and normalization rules defined in the plan template, including protected no-change boundaries from sections such as `No Ripple Effect`.
382
-
383
- Work-planner maps each extracted item to a covering task or phase. If no covering task exists, the row is marked `gap` with justification. Justified gaps require user confirmation before plan approval.
345
+ ### Required Handoffs
346
+
347
+ | From | To | Required pass-through |
348
+ |------|----|-----------------------|
349
+ | `requirement-analyzer` | `codebase-analyzer` | requirement analysis JSON, original requirements, PRD path when available |
350
+ | `codebase-analyzer` | `technical-designer*` | `Codebase Analysis`, including `focusAreas`, `dataModel`, `qualityAssurance`, `dataTransformationPipelines`, `limitations` |
351
+ | `technical-designer*` | `code-verifier` | Design Doc path |
352
+ | `code-verifier` | `document-reviewer` | `code_verification` JSON |
353
+ | `acceptance-test-generator` | `work-planner` | integration test path, E2E path or `null`, `e2eAbsenceReason` when E2E is absent |
354
+ | Design Doc | `work-planner` | Verification Strategy summary, Output Comparison details, implementation-relevant technical requirements, protected no-change boundaries |
355
+
356
+ Handoff rules:
357
+ - Verify generated integration and E2E file paths exist before passing them onward
358
+ - Escalate only when required outputs are missing without a valid absence reason
359
+ - Require work-planner to map every carried-forward technical requirement to a covering task or a justified `gap`
384
360
 
385
361
  ## Important Constraints [MANDATORY]
386
362
 
package/README.md CHANGED
@@ -363,9 +363,9 @@ A: `$recipe-implement` is the universal entry point. It runs requirement-analyze
363
363
 
364
364
  A: Yes. Codex skills and subagents work alongside [MCP](https://developers.openai.com/codex/mcp) — skills operate at the instruction layer while MCP operates at the tool transport layer. You can add MCP servers to any agent's TOML configuration.
365
365
 
366
- **Q: What if a subagent gets stuck?**
366
+ **Q: What if a subagent seems stuck?**
367
367
 
368
- A: Subagents escalate to the user when they encounter design deviations, ambiguous requirements, or specification conflicts. The framework stops autonomous execution and presents the issue with options.
368
+ A: Long waits can be normal in this workflow because many subagents perform substantial multi-step work. The orchestrator keeps ownership of the pending deliverable, continues waiting for the required output, and uses diagnostics only to confirm missing inputs or restate the pending deliverable. User direction remains the boundary for interrupting that work.
369
369
 
370
370
  ---
371
371
 
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "codex-workflows",
3
- "version": "0.4.9",
3
+ "version": "0.4.11",
4
4
  "description": "Task-oriented agentic coding framework for OpenAI Codex CLI — skills, recipes, and subagents for structured development workflows",
5
5
  "license": "MIT",
6
6
  "author": "Shinsuke Kagawa",