codex-workflows 0.5.0 → 0.6.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.agents/skills/coding-rules/SKILL.md +14 -2
- package/.agents/skills/documentation-criteria/references/design-template.md +47 -3
- package/.agents/skills/documentation-criteria/references/plan-template.md +3 -3
- package/.agents/skills/documentation-criteria/references/task-template.md +1 -1
- package/.agents/skills/documentation-criteria/references/ui-spec-template.md +8 -0
- package/.agents/skills/external-resource-context/SKILL.md +99 -0
- package/.agents/skills/external-resource-context/agents/openai.yaml +7 -0
- package/.agents/skills/external-resource-context/references/api.md +20 -0
- package/.agents/skills/external-resource-context/references/backend.md +21 -0
- package/.agents/skills/external-resource-context/references/frontend.md +21 -0
- package/.agents/skills/external-resource-context/references/infra.md +21 -0
- package/.agents/skills/external-resource-context/references/template.md +72 -0
- package/.agents/skills/recipe-front-adjust/SKILL.md +113 -0
- package/.agents/skills/recipe-front-adjust/agents/openai.yaml +7 -0
- package/.agents/skills/recipe-front-design/SKILL.md +28 -9
- package/.agents/skills/recipe-fullstack-implement/SKILL.md +4 -1
- package/.agents/skills/subagents-orchestration-guide/SKILL.md +2 -0
- package/.agents/skills/subagents-orchestration-guide/references/monorepo-flow.md +44 -47
- package/.agents/skills/task-analyzer/SKILL.md +3 -2
- package/.agents/skills/task-analyzer/references/skills-index.yaml +42 -3
- package/.codex/agents/code-reviewer.toml +7 -1
- package/.codex/agents/document-reviewer.toml +3 -0
- package/.codex/agents/quality-fixer-frontend.toml +5 -0
- package/.codex/agents/quality-fixer.toml +5 -0
- package/.codex/agents/task-executor-frontend.toml +11 -0
- package/.codex/agents/task-executor.toml +11 -0
- package/.codex/agents/technical-designer-frontend.toml +64 -17
- package/.codex/agents/technical-designer.toml +54 -17
- package/.codex/agents/ui-analyzer.toml +307 -0
- package/.codex/agents/ui-spec-designer.toml +15 -0
- package/README.md +42 -24
- package/package.json +1 -1
package/README.md
CHANGED
|
@@ -4,9 +4,11 @@
|
|
|
4
4
|
[](https://developers.openai.com/codex/skills/)
|
|
5
5
|
[](https://opensource.org/licenses/MIT)
|
|
6
6
|
|
|
7
|
-
**Structured
|
|
7
|
+
**Structured workflows for [OpenAI Codex CLI](https://developers.openai.com/codex/cli).**
|
|
8
8
|
|
|
9
|
-
|
|
9
|
+
They help when multi-step changes stop being easy to reason about, test, or review.
|
|
10
|
+
|
|
11
|
+
Built on the [Agent Skills specification](https://developers.openai.com/codex/skills/) and [Codex subagents](https://developers.openai.com/codex/subagents). This starts to matter when tasks get large: refactors, migrations, or anything that spans multiple files and needs to stay reviewable.
|
|
10
12
|
|
|
11
13
|
---
|
|
12
14
|
|
|
@@ -25,43 +27,49 @@ $recipe-implement Add user authentication with JWT
|
|
|
25
27
|
|
|
26
28
|
`$` is Codex CLI's syntax for invoking a skill explicitly. Type `$recipe-` to see all available recipes via tab completion.
|
|
27
29
|
|
|
28
|
-
Small changes stay lightweight. Larger tasks
|
|
30
|
+
Small changes stay lightweight. Larger tasks are broken into requirements, design, task decomposition, TDD implementation, and quality checks.
|
|
29
31
|
|
|
30
32
|
---
|
|
31
33
|
|
|
32
34
|
## Why codex-workflows?
|
|
33
35
|
|
|
34
|
-
Codex
|
|
36
|
+
Codex works well for short, focused tasks. The problems start when a change spans multiple files, needs design decisions to stay visible, or has to survive review, testing, and follow-up edits.
|
|
35
37
|
|
|
36
|
-
|
|
38
|
+
Many developers have seen the same pattern: things work at first, then drift. Context grows, assumptions accumulate, intermediate decisions disappear, and results become harder to trust.
|
|
37
39
|
|
|
38
|
-
codex-workflows
|
|
40
|
+
codex-workflows is built around those failure modes. Instead of asking Codex to "just implement it", it turns a request into a sequence of steps you can inspect and verify:
|
|
39
41
|
- Traceable artifacts: PRD → Design Doc → Task → Commit
|
|
40
|
-
- Built-in TDD and quality
|
|
42
|
+
- Built-in TDD and quality checks before code is ready to commit
|
|
41
43
|
- Agent context separation for large refactors, migrations, and PR-sized changes
|
|
42
44
|
- Diagnosis and reverse-engineering flows for bugs and legacy code
|
|
43
45
|
|
|
46
|
+
## Background
|
|
47
|
+
|
|
48
|
+
The recipes, subagents, and quality checks in this repo were not designed top-down. Each piece was added in response to a concrete failure mode encountered during delivery work.
|
|
49
|
+
|
|
50
|
+
That is why the workflow separates requirements, design, verification, implementation, and quality checks instead of treating them as one long session.
|
|
51
|
+
|
|
44
52
|
## Not Designed For
|
|
45
53
|
|
|
46
|
-
- One-shot
|
|
47
|
-
- Repositories
|
|
48
|
-
- Teams that
|
|
54
|
+
- One-shot scripts or exploratory sessions where speed matters more than traceability
|
|
55
|
+
- Repositories without tests, lint, builds, or reviewable commits
|
|
56
|
+
- Teams that would rather skip design docs and quality checks entirely
|
|
49
57
|
|
|
50
58
|
---
|
|
51
59
|
|
|
52
60
|
## What It Does
|
|
53
61
|
|
|
54
|
-
|
|
62
|
+
Instead of forcing a fixed workflow, the framework adjusts how much structure it adds based on scope:
|
|
55
63
|
|
|
56
64
|
| Scale | File Count | What Happens |
|
|
57
65
|
|-------|------------|-------------|
|
|
58
66
|
| Small | 1-2 | Simplified plan → direct implementation |
|
|
59
67
|
| Medium | 3-5 | Design Doc → work plan → task execution |
|
|
60
|
-
| Large | 6+ | PRD → ADR → Design Doc → test skeletons → work plan → autonomous execution |
|
|
68
|
+
| Large | 6+ | PRD → ADR → Design Doc → test skeletons → work plan → guided autonomous execution |
|
|
61
69
|
|
|
62
70
|
For larger work, the path usually looks like this: understand the problem, analyze the codebase, design the change, break it into atomic tasks, implement with tests, and run quality checks before commit.
|
|
63
71
|
|
|
64
|
-
Each step
|
|
72
|
+
Each step isolates one concern, so decisions can be checked before they carry into later stages. Specialized subagents run in their own contexts to reduce carry-over assumptions during changes that would otherwise require long sessions:
|
|
65
73
|
|
|
66
74
|
```
|
|
67
75
|
User Request
|
|
@@ -86,7 +94,7 @@ task-decomposer → Atomic tasks (1 task = 1 commit)
|
|
|
86
94
|
↓
|
|
87
95
|
task-executor → TDD implementation per task
|
|
88
96
|
↓
|
|
89
|
-
quality-fixer → Lint, test, build
|
|
97
|
+
quality-fixer → Lint, test, build; no failing checks
|
|
90
98
|
↓
|
|
91
99
|
Ready to commit
|
|
92
100
|
```
|
|
@@ -158,7 +166,7 @@ Invoke recipes with `$recipe-name` in Codex. Type `$recipe-` and use tab complet
|
|
|
158
166
|
| `$recipe-design` | Requirements → ADR/Design Doc | Architecture planning |
|
|
159
167
|
| `$recipe-plan` | Design Doc → test skeletons → work plan | Planning phase, including nullable E2E skeleton handling |
|
|
160
168
|
| `$recipe-prepare-implementation` | Verify work plan readiness and resolve prep gaps | Pre-build check that the plan is implementable |
|
|
161
|
-
| `$recipe-build` | Execute backend tasks
|
|
169
|
+
| `$recipe-build` | Execute backend tasks with validation between steps | Resume backend implementation |
|
|
162
170
|
| `$recipe-review` | Design Doc compliance and security validation with auto-fixes | Post-implementation check |
|
|
163
171
|
| `$recipe-diagnose` | Problem investigation → failure-point verification → solution | Bug investigation |
|
|
164
172
|
| `$recipe-reverse-engineer` | Generate PRD + Design Docs from existing code | Legacy system documentation |
|
|
@@ -170,6 +178,7 @@ Invoke recipes with `$recipe-name` in Codex. Type `$recipe-` and use tab complet
|
|
|
170
178
|
| Recipe | What it does | When to use |
|
|
171
179
|
|--------|-------------|-------------|
|
|
172
180
|
| `$recipe-front-design` | Requirements → UI Spec → frontend Design Doc | Frontend architecture planning |
|
|
181
|
+
| `$recipe-front-adjust` | Implemented UI adjustment with external context and verification | Focused UI changes after implementation |
|
|
173
182
|
| `$recipe-front-plan` | Frontend Design Doc → test skeletons → work plan | Frontend planning phase |
|
|
174
183
|
| `$recipe-front-build` | Execute frontend tasks with RTL + quality checks | Resume frontend implementation |
|
|
175
184
|
| `$recipe-front-review` | Frontend compliance and security validation with React-specific fixes | Frontend post-implementation check |
|
|
@@ -217,7 +226,7 @@ $recipe-reverse-engineer src/auth module
|
|
|
217
226
|
|
|
218
227
|
## Foundational Skills
|
|
219
228
|
|
|
220
|
-
These
|
|
229
|
+
These are applied automatically based on context. You rarely need to think about them directly.
|
|
221
230
|
|
|
222
231
|
| Skill | What it provides |
|
|
223
232
|
|-------|-----------------|
|
|
@@ -227,8 +236,9 @@ These load automatically when the conversation context matches — no explicit i
|
|
|
227
236
|
| `documentation-criteria` | Document creation rules and templates (PRD, ADR, Design Doc, Work Plan) |
|
|
228
237
|
| `implementation-approach` | Strategy selection: vertical / horizontal / hybrid slicing |
|
|
229
238
|
| `integration-e2e-testing` | Integration/E2E test design, value-based selection, review criteria |
|
|
239
|
+
| `external-resource-context` | Access methods for design sources, design systems, API schemas, and verification environments |
|
|
230
240
|
| `task-analyzer` | Task analysis, scale estimation, skill selection |
|
|
231
|
-
| `subagents-orchestration-guide` | Multi-agent coordination, workflow flows, autonomous execution |
|
|
241
|
+
| `subagents-orchestration-guide` | Multi-agent coordination, workflow flows, guided autonomous execution |
|
|
232
242
|
|
|
233
243
|
Language-specific references are included for TypeScript/React projects (`coding-rules/references/typescript.md`, `testing/references/typescript.md`).
|
|
234
244
|
|
|
@@ -236,7 +246,7 @@ Language-specific references are included for TypeScript/React projects (`coding
|
|
|
236
246
|
|
|
237
247
|
## Subagents
|
|
238
248
|
|
|
239
|
-
Codex spawns these as needed during recipe execution. Each agent runs in its own context with specialized instructions and skill configurations.
|
|
249
|
+
Codex spawns these as needed during recipe execution. You do not need to learn them first; recipes route work to the right agents automatically. Each agent runs in its own context with specialized instructions and skill configurations.
|
|
240
250
|
|
|
241
251
|
### Document Creation Agents
|
|
242
252
|
|
|
@@ -248,6 +258,7 @@ Codex spawns these as needed during recipe execution. Each agent runs in its own
|
|
|
248
258
|
| `technical-designer-frontend` | Frontend ADR and Design Doc creation (React) |
|
|
249
259
|
| `ui-spec-designer` | UI Specification from PRD and optional prototype code |
|
|
250
260
|
| `codebase-analyzer` | Existing codebase analysis before Design Doc creation |
|
|
261
|
+
| `ui-analyzer` | UI facts from external resources (design tools, design-system docs, deployed UI) and frontend code |
|
|
251
262
|
| `work-planner` | Work plan creation from Design Docs |
|
|
252
263
|
| `document-reviewer` | Document consistency and approval |
|
|
253
264
|
| `design-sync` | Cross-document consistency verification |
|
|
@@ -286,18 +297,18 @@ Codex spawns these as needed during recipe execution. Each agent runs in its own
|
|
|
286
297
|
|
|
287
298
|
## How It Works
|
|
288
299
|
|
|
289
|
-
### Autonomous Execution Mode
|
|
300
|
+
### Guided Autonomous Execution Mode
|
|
290
301
|
|
|
291
|
-
After work plan approval, the framework
|
|
302
|
+
After work plan approval, the framework executes task files with explicit validation points:
|
|
292
303
|
|
|
293
304
|
1. **task-executor** implements each task with TDD
|
|
294
305
|
2. **quality-fixer** first rejects incomplete task-scoped implementations, then runs lint, tests, and build before every commit
|
|
295
306
|
3. Escalation pauses execution when design deviation or ambiguity is detected
|
|
296
|
-
4. Each task produces one commit
|
|
307
|
+
4. Each task produces one commit for rollback-friendly granularity
|
|
297
308
|
|
|
298
309
|
### Context Separation
|
|
299
310
|
|
|
300
|
-
Each subagent runs in a fresh context. This
|
|
311
|
+
Each subagent runs in a fresh context. This pattern keeps multi-step coding tasks legible and reviewable:
|
|
301
312
|
- generation and verification happen in separate contexts, reducing author bias and carry-over assumptions
|
|
302
313
|
- **document-reviewer** reviews without the author's bias
|
|
303
314
|
- **investigator** collects evidence without confirmation bias
|
|
@@ -318,11 +329,13 @@ your-project/
|
|
|
318
329
|
│ ├── documentation-criteria/
|
|
319
330
|
│ ├── implementation-approach/
|
|
320
331
|
│ ├── integration-e2e-testing/
|
|
332
|
+
│ ├── external-resource-context/
|
|
321
333
|
│ ├── task-analyzer/
|
|
322
334
|
│ ├── subagents-orchestration-guide/
|
|
323
335
|
│ ├── recipe-implement/ # Recipes ($recipe-*)
|
|
324
336
|
│ ├── recipe-design/
|
|
325
337
|
│ ├── recipe-build/
|
|
338
|
+
│ ├── recipe-front-adjust/
|
|
326
339
|
│ ├── recipe-plan/
|
|
327
340
|
│ ├── recipe-prepare-implementation/
|
|
328
341
|
│ ├── recipe-review/
|
|
@@ -334,8 +347,9 @@ your-project/
|
|
|
334
347
|
├── .codex/agents/ # Subagent TOML definitions
|
|
335
348
|
│ ├── requirement-analyzer.toml
|
|
336
349
|
│ ├── technical-designer.toml
|
|
350
|
+
│ ├── ui-analyzer.toml
|
|
337
351
|
│ ├── task-executor.toml
|
|
338
|
-
│ └── ... (
|
|
352
|
+
│ └── ... (25 agents total)
|
|
339
353
|
└── docs/ # Created as you use the recipes
|
|
340
354
|
├── prd/
|
|
341
355
|
├── design/
|
|
@@ -371,7 +385,11 @@ A: `$recipe-implement` is the universal entry point. It runs requirement-analyze
|
|
|
371
385
|
|
|
372
386
|
**Q: Does this work with MCP servers?**
|
|
373
387
|
|
|
374
|
-
A: Yes. Codex skills and subagents work alongside [MCP](https://developers.openai.com/codex/mcp) — skills operate at the instruction layer while MCP operates at the tool transport layer.
|
|
388
|
+
A: Yes. Codex skills and subagents work alongside [MCP](https://developers.openai.com/codex/mcp) — skills operate at the instruction layer while MCP operates at the tool transport layer. Custom agents inherit parent `mcp_servers` when the agent TOML omits `mcp_servers`; add agent-local MCP config only for agent-specific servers or tool filtering.
|
|
389
|
+
|
|
390
|
+
**Q: How is this related to claude-code-workflows?**
|
|
391
|
+
|
|
392
|
+
A: [claude-code-workflows](https://github.com/shinpr/claude-code-workflows) is the Claude Code counterpart. The repositories share the same workflow philosophy, adapted to each tool's native extension points. They can coexist in the same project because Codex uses `.agents/skills/`, `.codex/agents/`, and `AGENTS.md`, while Claude Code uses its own `.claude/` files and `CLAUDE.md`.
|
|
375
393
|
|
|
376
394
|
**Q: What if a subagent seems stuck?**
|
|
377
395
|
|
package/package.json
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "codex-workflows",
|
|
3
|
-
"version": "0.
|
|
3
|
+
"version": "0.6.1",
|
|
4
4
|
"description": "Task-oriented agentic coding framework for OpenAI Codex CLI — skills, recipes, and subagents for structured development workflows",
|
|
5
5
|
"license": "MIT",
|
|
6
6
|
"author": "Shinsuke Kagawa",
|