@fro.bot/systematic 2.7.2 → 2.8.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (58) hide show
  1. package/agents/design/design-implementation-reviewer.md +0 -1
  2. package/agents/design/design-iterator.md +0 -1
  3. package/agents/design/figma-design-sync.md +0 -1
  4. package/agents/docs/ankane-readme-writer.md +0 -1
  5. package/agents/document-review/adversarial-document-reviewer.md +0 -1
  6. package/agents/document-review/coherence-reviewer.md +0 -1
  7. package/agents/document-review/design-lens-reviewer.md +0 -1
  8. package/agents/document-review/feasibility-reviewer.md +0 -1
  9. package/agents/document-review/product-lens-reviewer.md +0 -1
  10. package/agents/document-review/scope-guardian-reviewer.md +0 -1
  11. package/agents/document-review/security-lens-reviewer.md +0 -1
  12. package/agents/research/best-practices-researcher.md +0 -1
  13. package/agents/research/framework-docs-researcher.md +0 -1
  14. package/agents/research/git-history-analyzer.md +0 -1
  15. package/agents/research/issue-intelligence-analyst.md +0 -1
  16. package/agents/research/learnings-researcher.md +0 -1
  17. package/agents/research/repo-research-analyst.md +0 -1
  18. package/agents/research/slack-researcher.md +0 -1
  19. package/agents/review/adversarial-reviewer.md +0 -1
  20. package/agents/review/agent-native-reviewer.md +0 -1
  21. package/agents/review/api-contract-reviewer.md +0 -1
  22. package/agents/review/architecture-strategist.md +0 -1
  23. package/agents/review/cli-agent-readiness-reviewer.md +0 -1
  24. package/agents/review/cli-readiness-reviewer.md +0 -1
  25. package/agents/review/code-simplicity-reviewer.md +0 -1
  26. package/agents/review/correctness-reviewer.md +0 -1
  27. package/agents/review/data-integrity-guardian.md +0 -1
  28. package/agents/review/data-migration-expert.md +0 -1
  29. package/agents/review/data-migrations-reviewer.md +0 -1
  30. package/agents/review/deployment-verification-agent.md +0 -1
  31. package/agents/review/dhh-rails-reviewer.md +0 -1
  32. package/agents/review/julik-frontend-races-reviewer.md +0 -1
  33. package/agents/review/kieran-python-reviewer.md +0 -1
  34. package/agents/review/kieran-rails-reviewer.md +0 -1
  35. package/agents/review/kieran-typescript-reviewer.md +0 -1
  36. package/agents/review/maintainability-reviewer.md +0 -1
  37. package/agents/review/pattern-recognition-specialist.md +0 -1
  38. package/agents/review/performance-oracle.md +0 -1
  39. package/agents/review/performance-reviewer.md +0 -1
  40. package/agents/review/previous-comments-reviewer.md +0 -1
  41. package/agents/review/project-standards-reviewer.md +0 -1
  42. package/agents/review/reliability-reviewer.md +0 -1
  43. package/agents/review/schema-drift-detector.md +0 -1
  44. package/agents/review/security-reviewer.md +0 -1
  45. package/agents/review/security-sentinel.md +0 -1
  46. package/agents/review/testing-reviewer.md +0 -1
  47. package/agents/workflow/bug-reproduction-validator.md +0 -1
  48. package/agents/workflow/lint.md +0 -1
  49. package/agents/workflow/pr-comment-resolver.md +0 -1
  50. package/agents/workflow/spec-flow-analyzer.md +0 -1
  51. package/agents/workflow/systematic-implementer.md +39 -0
  52. package/package.json +1 -1
  53. package/skills/ce-work/SKILL.md +12 -1
  54. package/skills/writing-systematic-skills/SKILL.md +10 -6
  55. package/skills/writing-systematic-skills/references/foundation-conventions.md +11 -5
  56. package/skills/ce-work-beta/SKILL.md +0 -416
  57. package/skills/ce-work-beta/references/codex-delegation-workflow.md +0 -327
  58. package/skills/ce-work-beta/references/shipping-workflow.md +0 -129
@@ -1,416 +0,0 @@
1
- ---
2
- name: ce:work-beta
3
- description: "[BETA] Execute work with external delegate support. Same as ce:work but includes experimental Codex delegation mode for token-conserving code implementation."
4
- disable-model-invocation: true
5
- argument-hint: "[Plan doc path or description of work. Blank to auto use latest plan doc] [delegate:codex]"
6
- ---
7
-
8
- # Work Execution Command
9
-
10
- Execute work efficiently while maintaining quality and finishing features.
11
-
12
- ## Introduction
13
-
14
- This command takes a work document (plan, specification, or todo file) or a bare prompt describing the work, and executes it systematically. The focus is on **shipping complete features** by understanding requirements quickly, following existing patterns, and maintaining quality throughout.
15
-
16
- **Beta rollout note:** Invoke `ce:work-beta` manually when you want to trial Codex delegation. During the beta period, planning and workflow handoffs remain pointed at stable `ce:work` to avoid dual-path orchestration complexity.
17
-
18
- ## Input Document
19
-
20
- <input_document> #$ARGUMENTS </input_document>
21
-
22
- ## Argument Parsing
23
-
24
- Parse `$ARGUMENTS` for the following optional tokens. Strip each recognized token before interpreting the remainder as the plan file path or bare prompt.
25
-
26
- | Token | Example | Effect |
27
- |-------|---------|--------|
28
- | `delegate:codex` | `delegate:codex` | Activate Codex delegation mode for plan execution |
29
- | `delegate:local` | `delegate:local` | Deactivate delegation even if enabled in config |
30
-
31
- All tokens are optional. When absent, fall back to the resolution chain below.
32
-
33
- **Fuzzy activation:** Also recognize imperative delegation-intent phrases such as "use codex", "delegate to codex", "codex mode", or "delegate mode" as equivalent to `delegate:codex`. A bare mention of "codex" in a prompt (e.g., "fix codex converter bugs") must NOT activate delegation -- only clear delegation intent triggers it.
34
-
35
- **Fuzzy deactivation:** Also recognize phrases such as "no codex", "local mode", "standard mode" as equivalent to `delegate:local`.
36
-
37
- ### Settings Resolution Chain
38
-
39
- After extracting tokens from arguments, resolve the delegation state using this precedence chain:
40
-
41
- 1. **Argument flag** -- `delegate:codex` or `delegate:local` from the current invocation (highest priority)
42
- 2. **Config file** -- extract settings from the config block below. Value `codex` for `work_delegate` activates delegation; `false` deactivates.
43
- 3. **Hard default** -- `false` (delegation off)
44
-
45
- **Config (pre-resolved):**
46
- !`cat "$(git rev-parse --show-toplevel 2>/dev/null)/.systematic/config.local.yaml" 2>/dev/null || cat "$(dirname "$(git rev-parse --path-format=absolute --git-common-dir 2>/dev/null)")/.systematic/config.local.yaml" 2>/dev/null || echo '__NO_CONFIG__'`
47
-
48
- If the block above contains YAML key-value pairs, extract values for the keys listed below.
49
- If it shows `__NO_CONFIG__`, the file does not exist — all settings fall through to defaults.
50
- If it shows an unresolved command string, read `.systematic/config.local.yaml` from the repo root using the native file-read tool (e.g., Read in OpenCode, read_file in Codex). If the file does not exist, all settings fall through to defaults.
51
-
52
- If any setting has an unrecognized value, fall through to the hard default for that setting.
53
-
54
- Config keys:
55
- - `work_delegate` -- `codex` or default `false`
56
- - `work_delegate_consent` -- `true` or default `false`
57
- - `work_delegate_sandbox` -- `yolo` (default) or `full-auto`
58
- - `work_delegate_decision` -- `auto` (default) or `ask`
59
- - `work_delegate_model` -- Codex model to use (default `gpt-5.4`). Passthrough — any valid model name accepted.
60
- - `work_delegate_effort` -- `minimal`, `low`, `medium`, `high` (default), or `xhigh`
61
-
62
- Store the resolved state for downstream consumption:
63
- - `delegation_active` -- boolean, whether delegation mode is on
64
- - `delegation_source` -- `argument` or `config` or `default` -- how delegation was resolved (used by environment guard to decide notification verbosity)
65
- - `sandbox_mode` -- `yolo` or `full-auto` (from config or default `yolo`)
66
- - `consent_granted` -- boolean (from config `work_delegate_consent`)
67
- - `delegate_model` -- string (from config or default `gpt-5.4`)
68
- - `delegate_effort` -- string (from config or default `high`)
69
-
70
- ---
71
-
72
- ## Execution Workflow
73
-
74
- ### Phase 0: Input Triage
75
-
76
- Determine how to proceed based on what was provided in `<input_document>`.
77
-
78
- **Plan document** (input is a file path to an existing plan, specification, or todo file) → skip to Phase 1.
79
-
80
- **Bare prompt** (input is a description of work, not a file path):
81
-
82
- 1. **Scan the work area**
83
-
84
- - Identify files likely to change based on the prompt
85
- - Find existing test files for those areas (search for test/spec files that import, reference, or share names with the implementation files)
86
- - Note local patterns and conventions in the affected areas
87
-
88
- 2. **Assess complexity and route**
89
-
90
- | Complexity | Signals | Action |
91
- |-----------|---------|--------|
92
- | **Trivial** | 1-2 files, no behavioral change (typo, config, rename) | Proceed to Phase 1 step 2 (environment setup), then implement directly — no task list, no execution loop. Apply Test Discovery if the change touches behavior-bearing code |
93
- | **Small / Medium** | Clear scope, under ~10 files | Build a task list from discovery. Proceed to Phase 1 step 2 |
94
- | **Large** | Cross-cutting, architectural decisions, 10+ files, touches auth/payments/migrations | Inform the user this would benefit from `/ce:brainstorm` or `/ce:plan` to surface edge cases and scope boundaries. Honor their choice. If proceeding, build a task list and continue to Phase 1 step 2 |
95
-
96
- ---
97
-
98
- ### Phase 1: Quick Start
99
-
100
- 1. **Read Plan and Clarify** _(skip if arriving from Phase 0 with a bare prompt)_
101
-
102
- - Read the work document completely
103
- - Treat the plan as a decision artifact, not an execution script
104
- - If the plan includes sections such as `Implementation Units`, `Work Breakdown`, `Requirements Trace`, `Files`, `Test Scenarios`, or `Verification`, use those as the primary source material for execution
105
- - Check for `Execution note` on each implementation unit — these carry the plan's execution posture signal for that unit (for example, test-first or characterization-first). Note them when creating tasks.
106
- - Check for a `Deferred to Implementation` or `Implementation-Time Unknowns` section — these are questions the planner intentionally left for you to resolve during execution. Note them before starting so they inform your approach rather than surprising you mid-task
107
- - Check for a `Scope Boundaries` section — these are explicit non-goals. Refer back to them if implementation starts pulling you toward adjacent work
108
- - Review any references or links provided in the plan
109
- - If the user explicitly asks for TDD, test-first, or characterization-first execution in this session, honor that request even if the plan has no `Execution note`
110
- - If anything is unclear or ambiguous, ask clarifying questions now
111
- - Get user approval to proceed
112
- - **Do not skip this** - better to ask questions now than build the wrong thing
113
-
114
- 2. **Setup Environment**
115
-
116
- First, check the current branch:
117
-
118
- ```bash
119
- current_branch=$(git branch --show-current)
120
- default_branch=$(git symbolic-ref refs/remotes/origin/HEAD 2>/dev/null | sed 's@^refs/remotes/origin/@@')
121
-
122
- # Fallback if remote HEAD isn't set
123
- if [ -z "$default_branch" ]; then
124
- default_branch=$(git rev-parse --verify origin/main >/dev/null 2>&1 && echo "main" || echo "master")
125
- fi
126
- ```
127
-
128
- **If already on a feature branch** (not the default branch):
129
-
130
- First, check whether the branch name is **meaningful** — a name like `feat/crowd-sniff` or `fix/email-validation` tells future readers what the work is about. Auto-generated worktree names (e.g., `worktree-jolly-beaming-raven`) or other opaque names do not.
131
-
132
- If the branch name is meaningless or auto-generated, suggest renaming it before continuing:
133
- ```bash
134
- git branch -m <meaningful-name>
135
- ```
136
- Derive the new name from the plan title or work description (e.g., `feat/crowd-sniff`). Present the rename as a recommended option alongside continuing as-is.
137
-
138
- Then ask: "Continue working on `[current_branch]`, or create a new branch?"
139
- - If continuing (with or without rename), proceed to step 3
140
- - If creating new, follow Option A or B below
141
-
142
- **If on the default branch**, choose how to proceed:
143
-
144
- **Option A: Create a new branch**
145
- ```bash
146
- git pull origin [default_branch]
147
- git checkout -b feature-branch-name
148
- ```
149
- Use a meaningful name based on the work (e.g., `feat/user-authentication`, `fix/email-validation`).
150
-
151
- **Option B: Use a worktree (recommended for parallel development)**
152
- ```bash
153
- skill: git-worktree
154
- # The skill will create a new branch from the default branch in an isolated worktree
155
- ```
156
-
157
- **Option C: Continue on the default branch**
158
- - Requires explicit user confirmation
159
- - Only proceed after user explicitly says "yes, commit to [default_branch]"
160
- - Never commit directly to the default branch without explicit permission
161
-
162
- **Recommendation**: Use worktree if:
163
- - You want to work on multiple features simultaneously
164
- - You want to keep the default branch clean while experimenting
165
- - You plan to switch between branches frequently
166
-
167
- 3. **Create Todo List** _(skip if Phase 0 already built one, or if Phase 0 routed as Trivial)_
168
- - Use your available task tracking tool (e.g., todowrite, task lists) to break the plan into actionable tasks
169
- - Derive tasks from the plan's implementation units, dependencies, files, test targets, and verification criteria
170
- - Carry each unit's `Execution note` into the task when present
171
- - For each unit, read the `Patterns to follow` field before implementing — these point to specific files or conventions to mirror
172
- - Use each unit's `Verification` field as the primary "done" signal for that task
173
- - Do not expect the plan to contain implementation code, micro-step TDD instructions, or exact shell commands
174
- - Include dependencies between tasks
175
- - Prioritize based on what needs to be done first
176
- - Include testing and quality check tasks
177
- - Keep tasks specific and completable
178
-
179
- 4. **Choose Execution Strategy**
180
-
181
- **Delegation routing gate:** If `delegation_active` is true AND the input is a plan file (not a bare prompt), read `references/codex-delegation-workflow.md` and follow its Pre-Delegation Checks and Delegation Decision flow. If all checks pass and delegation proceeds, force **serial execution** and proceed directly to Phase 2 using the workflow's batched execution loop. If any check disables delegation, fall through to the standard strategy table below. If delegation is active but the input is a bare prompt (no plan file), set `delegation_active` to false with a brief note: "Codex delegation requires a plan file -- using standard mode." and continue with the standard strategy selection below.
182
-
183
- After creating the task list, decide how to execute based on the plan's size and dependency structure:
184
-
185
- | Strategy | When to use |
186
- |----------|-------------|
187
- | **Inline** | 1-2 small tasks, or tasks needing user interaction mid-flight. **Default for bare-prompt work** — bare prompts rarely produce enough structured context to justify subagent dispatch |
188
- | **Serial subagents** | 3+ tasks with dependencies between them. Each subagent gets a fresh context window focused on one unit — prevents context degradation across many tasks. Requires plan-unit metadata (Goal, Files, Approach, Test scenarios) |
189
- | **Parallel subagents** | 3+ tasks that pass the Parallel Safety Check (below). Dispatch independent units simultaneously, run dependent units after their prerequisites complete. Requires plan-unit metadata |
190
-
191
- **Parallel Safety Check** — required before choosing parallel dispatch:
192
-
193
- 1. Build a file-to-unit mapping from every candidate unit's `Files:` section (Create, Modify, and Test paths)
194
- 2. Check for intersection — any file path appearing in 2+ units means overlap
195
- 3. If any overlap is found, downgrade to serial subagents. Log the reason (e.g., "Units 2 and 4 share `config/routes.rb` — using serial dispatch"). Serial subagents still provide context-window isolation without shared-directory risks
196
-
197
- Even with no file overlap, parallel subagents sharing a working directory face git index contention (concurrent staging/committing corrupts the index) and test interference (concurrent test runs pick up each other's in-progress changes). The parallel subagent constraints below mitigate these.
198
-
199
- **Subagent dispatch** uses your available subagent or task spawning mechanism. For each unit, give the subagent:
200
- - The full plan file path (for overall context)
201
- - The specific unit's Goal, Files, Approach, Execution note, Patterns, Test scenarios, and Verification
202
- - Any resolved deferred questions relevant to that unit
203
- - Instruction to check whether the unit's test scenarios cover all applicable categories (happy paths, edge cases, error paths, integration) and supplement gaps before writing tests
204
-
205
- **Parallel subagent constraints** — when dispatching units in parallel (not serial or inline):
206
- - Instruct each subagent: "Do not stage files (`git add`), create commits, or run the project test suite. The orchestrator handles testing, staging, and committing after all parallel units complete."
207
- - These constraints prevent git index contention and test interference between concurrent subagents
208
-
209
- **Permission mode:** Omit the `mode` parameter when dispatching subagents so the user's configured permission settings apply. Do not pass `mode: "auto"` — it overrides user-level settings like `bypassPermissions`.
210
-
211
- **After each subagent completes (serial mode):**
212
- 1. Review the subagent's diff — verify changes match the unit's scope and `Files:` list
213
- 2. Run the relevant test suite to confirm the tree is healthy
214
- 3. If tests fail, diagnose and fix before proceeding — do not dispatch dependent units on a broken tree
215
- 4. Update the plan checkboxes and task list
216
- 5. Dispatch the next unit
217
-
218
- **After all parallel subagents in a batch complete:**
219
- 1. Wait for every subagent in the current parallel batch to finish before acting on any of their results
220
- 2. Cross-check for discovered file collisions: compare the actual files modified by all subagents in the batch (not just their declared `Files:` lists). Subagents may create or modify files not anticipated during planning — this is expected, since plans describe *what* not *how*. A collision only matters when 2+ subagents in the same batch modified the same file. In a shared working directory, only the last writer's version survives — the other unit's changes to that file are lost. If a collision is detected: commit all non-colliding files from all units first, then re-run the affected units serially for the shared file so each builds on the other's committed work
221
- 3. For each completed unit, in dependency order: review the diff, run the relevant test suite, stage only that unit's files, and commit with a conventional message derived from the unit's Goal
222
- 4. If tests fail after committing a unit's changes, diagnose and fix before committing the next unit
223
- 5. Update the plan checkboxes and task list
224
- 6. Dispatch the next batch of independent units, or the next dependent unit
225
-
226
- ### Phase 2: Execute
227
-
228
- 1. **Task Execution Loop**
229
-
230
- For each task in priority order:
231
-
232
- ```
233
- while (tasks remain):
234
- - Mark task as in-progress
235
- - Read any referenced files from the plan or discovered during Phase 0
236
- - Look for similar patterns in codebase
237
- - Find existing test files for implementation files being changed (Test Discovery — see below)
238
- - If delegation_active: branch to the Codex Delegation Execution Loop
239
- (see `references/codex-delegation-workflow.md`)
240
- - Otherwise: implement following existing conventions
241
- - Add, update, or remove tests to match implementation changes (see Test Discovery below)
242
- - Run System-Wide Test Check (see below)
243
- - Run tests after changes
244
- - Assess testing coverage: did this task change behavior? If yes, were tests written or updated? If no tests were added, is the justification deliberate (e.g., pure config, no behavioral change)?
245
- - Mark task as completed
246
- - Evaluate for incremental commit (see below)
247
- ```
248
-
249
- When a unit carries an `Execution note`, honor it. For test-first units, write the failing test before implementation for that unit. For characterization-first units, capture existing behavior before changing it. For units without an `Execution note`, proceed pragmatically.
250
-
251
- Guardrails for execution posture:
252
- - Do not write the test and implementation in the same step when working test-first
253
- - Do not skip verifying that a new test fails before implementing the fix or feature
254
- - Do not over-implement beyond the current behavior slice when working test-first
255
- - Skip test-first discipline for trivial renames, pure configuration, and pure styling work
256
-
257
- **Test Discovery** — Before implementing changes to a file, find its existing test files (search for test/spec files that import, reference, or share naming patterns with the implementation file). When a plan specifies test scenarios or test files, start there, then check for additional test coverage the plan may not have enumerated. Changes to implementation files should be accompanied by corresponding test updates — new tests for new behavior, modified tests for changed behavior, removed or updated tests for deleted behavior.
258
-
259
- **Test Scenario Completeness** — Before writing tests for a feature-bearing unit, check whether the plan's `Test scenarios` cover all categories that apply to this unit. If a category is missing or scenarios are vague (e.g., "validates correctly" without naming inputs and expected outcomes), supplement from the unit's own context before writing tests:
260
-
261
- | Category | When it applies | How to derive if missing |
262
- |----------|----------------|------------------------|
263
- | **Happy path** | Always for feature-bearing units | Read the unit's Goal and Approach for core input/output pairs |
264
- | **Edge cases** | When the unit has meaningful boundaries (inputs, state, concurrency) | Identify boundary values, empty/nil inputs, and concurrent access patterns |
265
- | **Error/failure paths** | When the unit has failure modes (validation, external calls, permissions) | Enumerate invalid inputs the unit should reject, permission/auth denials it should enforce, and downstream failures it should handle |
266
- | **Integration** | When the unit crosses layers (callbacks, middleware, multi-service) | Identify the cross-layer chain and write a scenario that exercises it without mocks |
267
-
268
- **System-Wide Test Check** — Before marking a task done, pause and ask:
269
-
270
- | Question | What to do |
271
- |----------|------------|
272
- | **What fires when this runs?** Callbacks, middleware, observers, event handlers — trace two levels out from your change. | Read the actual code (not docs) for callbacks on models you touch, middleware in the request chain, `after_*` hooks. |
273
- | **Do my tests exercise the real chain?** If every dependency is mocked, the test proves your logic works *in isolation* — it says nothing about the interaction. | Write at least one integration test that uses real objects through the full callback/middleware chain. No mocks for the layers that interact. |
274
- | **Can failure leave orphaned state?** If your code persists state (DB row, cache, file) before calling an external service, what happens when the service fails? Does retry create duplicates? | Trace the failure path with real objects. If state is created before the risky call, test that failure cleans up or that retry is idempotent. |
275
- | **What other interfaces expose this?** Mixins, DSLs, alternative entry points (Agent vs Chat vs ChatMethods). | Grep for the method/behavior in related classes. If parity is needed, add it now — not as a follow-up. |
276
- | **Do error strategies align across layers?** Retry middleware + application fallback + framework error handling — do they conflict or create double execution? | List the specific error classes at each layer. Verify your rescue list matches what the lower layer actually raises. |
277
-
278
- **When to skip:** Leaf-node changes with no callbacks, no state persistence, no parallel interfaces. If the change is purely additive (new helper method, new view partial), the check takes 10 seconds and the answer is "nothing fires, skip."
279
-
280
- **When this matters most:** Any change that touches models with callbacks, error handling with fallback/retry, or functionality exposed through multiple interfaces.
281
-
282
-
283
- 2. **Incremental Commits**
284
-
285
- After completing each task, evaluate whether to create an incremental commit:
286
-
287
- | Commit when... | Don't commit when... |
288
- |----------------|---------------------|
289
- | Logical unit complete (model, service, component) | Small part of a larger unit |
290
- | Tests pass + meaningful progress | Tests failing |
291
- | About to switch contexts (backend → frontend) | Purely scaffolding with no behavior |
292
- | About to attempt risky/uncertain changes | Would need a "WIP" commit message |
293
-
294
- **Heuristic:** "Can I write a commit message that describes a complete, valuable change? If yes, commit. If the message would be 'WIP' or 'partial X', wait."
295
-
296
- If the plan has Implementation Units, use them as a starting guide for commit boundaries — but adapt based on what you find during implementation. A unit might need multiple commits if it's larger than expected, or small related units might land together. Use each unit's Goal to inform the commit message.
297
-
298
- **Commit workflow:**
299
- ```bash
300
- # 1. Verify tests pass (use project's test command)
301
- # Examples: bin/rails test, npm test, pytest, go test, etc.
302
-
303
- # 2. Stage only files related to this logical unit (not `git add .`)
304
- git add <files related to this logical unit>
305
-
306
- # 3. Commit with conventional message
307
- git commit -m "feat(scope): description of this unit"
308
- ```
309
-
310
- **Handling merge conflicts:** If conflicts arise during rebasing or merging, resolve them immediately. Incremental commits make conflict resolution easier since each commit is small and focused.
311
-
312
- **Note:** Incremental commits use clean conventional messages without attribution footers. The final Phase 4 commit/PR includes the full attribution.
313
-
314
- **Parallel subagent mode:** When units run as parallel subagents, the subagents do not commit — the orchestrator handles staging and committing after the entire parallel batch completes (see Parallel subagent constraints in Phase 1 Step 4). The commit guidance in this section applies to inline and serial execution, and to the orchestrator's commit decisions after parallel batch completion.
315
-
316
- 3. **Follow Existing Patterns**
317
-
318
- - The plan should reference similar code - read those files first
319
- - Match naming conventions exactly
320
- - Reuse existing components where possible
321
- - Follow project coding standards (see AGENTS.md; use AGENTS.md only if the repo still keeps a compatibility shim)
322
- - When in doubt, grep for similar implementations
323
-
324
- 4. **Test Continuously**
325
-
326
- - Run relevant tests after each significant change
327
- - Don't wait until the end to test
328
- - Fix failures immediately
329
- - Add new tests for new behavior, update tests for changed behavior, remove tests for deleted behavior
330
- - **Unit tests with mocks prove logic in isolation. Integration tests with real objects prove the layers work together.** If your change touches callbacks, middleware, or error handling — you need both.
331
-
332
- 5. **Simplify as You Go**
333
-
334
- After completing a cluster of related implementation units (or every 2-3 units), review recently changed files for simplification opportunities — consolidate duplicated patterns, extract shared helpers, and improve code reuse and efficiency. This is especially valuable when using subagents, since each agent works with isolated context and can't see patterns emerging across units.
335
-
336
- Don't simplify after every single unit — early patterns may look duplicated but diverge intentionally in later units. Wait for a natural phase boundary or when you notice accumulated complexity.
337
-
338
- If a `/simplify` skill or equivalent is available, use it. Otherwise, review the changed files yourself for reuse and consolidation opportunities.
339
-
340
- 6. **Figma Design Sync** (if applicable)
341
-
342
- For UI work with Figma designs:
343
-
344
- - Implement components following design specs
345
- - Use figma-design-sync agent iteratively to compare
346
- - Fix visual differences identified
347
- - Repeat until implementation matches design
348
-
349
- 7. **Frontend Design Guidance** (if applicable)
350
-
351
- For UI tasks without a Figma design -- where the implementation touches view, template, component, layout, or page files, creates user-visible routes, or the plan contains explicit UI/frontend/design language:
352
-
353
- - Load the `frontend-design` skill before implementing
354
- - Follow its detection, guidance, and verification flow
355
- - If the skill produced a verification screenshot, it satisfies Phase 4's screenshot requirement -- no need to capture separately. If the skill fell back to mental review (no browser access), Phase 4's screenshot capture still applies
356
-
357
- 8. **Track Progress**
358
- - Keep the task list updated as you complete tasks
359
- - Note any blockers or unexpected discoveries
360
- - Create new tasks if scope expands
361
- - Keep user informed of major milestones
362
-
363
- ### Phase 3-4: Quality Check and Ship It
364
-
365
- When all Phase 2 tasks are complete and execution transitions to quality check, read `references/shipping-workflow.md` for the full shipping workflow: quality checks, code review, final validation, PR creation, and notification.
366
-
367
- ---
368
-
369
- ## Codex Delegation Mode
370
-
371
- When `delegation_active` is true after argument parsing, read `references/codex-delegation-workflow.md` for the complete delegation workflow: pre-checks, batching, prompt template, execution loop, and result classification.
372
-
373
- ---
374
-
375
- ## Key Principles
376
-
377
- ### Start Fast, Execute Faster
378
-
379
- - Get clarification once at the start, then execute
380
- - Don't wait for perfect understanding - ask questions and move
381
- - The goal is to **finish the feature**, not create perfect process
382
-
383
- ### The Plan is Your Guide
384
-
385
- - Work documents should reference similar code and patterns
386
- - Load those references and follow them
387
- - Don't reinvent - match what exists
388
-
389
- ### Test As You Go
390
-
391
- - Run tests after each change, not at the end
392
- - Fix failures immediately
393
- - Continuous testing prevents big surprises
394
-
395
- ### Quality is Built In
396
-
397
- - Follow existing patterns
398
- - Write tests for new code
399
- - Run linting before pushing
400
- - Review every change — inline for simple additive work, full review for everything else
401
-
402
- ### Ship Complete Features
403
-
404
- - Mark all tasks completed before moving on
405
- - Don't leave features 80% done
406
- - A finished feature that ships beats a perfect feature that doesn't
407
-
408
- ## Common Pitfalls to Avoid
409
-
410
- - **Analysis paralysis** - Don't overthink, read the plan and execute
411
- - **Skipping clarifying questions** - Ask now, not after building wrong thing
412
- - **Ignoring plan references** - The plan has links for a reason
413
- - **Testing at the end** - Test continuously or suffer later
414
- - **Forgetting to track progress** - Update task status as you go or lose track of what's done
415
- - **80% done syndrome** - Finish the feature, don't move on early
416
- - **Skipping review** - Every change gets reviewed; only the depth varies