@laitszkin/apollo-toolkit 2.4.0 → 2.4.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/CHANGELOG.md +14 -0
- package/codex-subagent-orchestration/SKILL.md +27 -9
- package/package.json +1 -1
package/CHANGELOG.md
CHANGED
|
@@ -4,6 +4,20 @@ All notable changes to this repository are documented in this file.
|
|
|
4
4
|
|
|
5
5
|
## [Unreleased]
|
|
6
6
|
|
|
7
|
+
## [v2.4.2] - 2026-03-19
|
|
8
|
+
|
|
9
|
+
### Changed
|
|
10
|
+
- Relax `codex-subagent-orchestration` so reusable custom agents no longer require repeated historical use before creation or persistence.
|
|
11
|
+
- Require agents to abstract task-specific delegation into the most general reusable role that still preserves clear ownership boundaries, such as `code_reviewer` before narrower one-off task agents.
|
|
12
|
+
- Clarify when domain-specific specialization such as `rust_reviewer` is warranted and when a generic reusable reviewer should be preferred.
|
|
13
|
+
|
|
14
|
+
## [v2.4.1] - 2026-03-19
|
|
15
|
+
|
|
16
|
+
### Changed
|
|
17
|
+
- Tighten `codex-subagent-orchestration` so non-trivial tasks must use actual subagent tool calls when delegation is allowed, instead of stopping at prose-only delegation guidance.
|
|
18
|
+
- Require `codex-subagent-orchestration` to default to a parallel subagents workflow whenever two or more independent workstreams can run safely in parallel.
|
|
19
|
+
- Clarify runtime handoff and orchestration boundaries for delegated agents, including tool-rule, sandbox, write-scope, and isolated-review expectations.
|
|
20
|
+
|
|
7
21
|
## [v2.4.0] - 2026-03-19
|
|
8
22
|
|
|
9
23
|
### Added
|
|
@@ -15,8 +15,8 @@ description: Use for almost every non-trivial Codex task. Inspect existing custo
|
|
|
15
15
|
## Standards
|
|
16
16
|
|
|
17
17
|
- Evidence: Inspect the current task shape and the existing custom-agent catalog before creating or updating any agent.
|
|
18
|
-
- Execution: Use this skill for nearly every non-trivial task
|
|
19
|
-
- Quality: Keep each custom agent narrow, opinionated, and non-overlapping; prefer read-only sandboxes for explorers and reviewers; avoid parallel write conflicts.
|
|
18
|
+
- Execution: Use this skill for nearly every non-trivial task. When current tool rules allow delegation, the agent must actually launch one or more subagents instead of only describing delegation in prose. Treat a parallel subagents workflow as the default whenever two or more independent workstreams can run safely in parallel, use a single helper agent only when there is exactly one bounded sidecar job, and keep shared planning, conflict resolution, and final synthesis in the main agent.
|
|
19
|
+
- Quality: Keep each custom agent narrow, opinionated, and non-overlapping; prefer read-only sandboxes for explorers and reviewers; keep review contexts isolated from the implementation thread; avoid parallel write conflicts.
|
|
20
20
|
- Output: State which agents were reused or created, what each owned, whether they waited in parallel or were staged, and what remained with the main agent.
|
|
21
21
|
|
|
22
22
|
## Overview
|
|
@@ -54,7 +54,7 @@ Prefer `gpt-5.3-codex` first for agentic coding roles. Escalate to `gpt-5.4` whe
|
|
|
54
54
|
|
|
55
55
|
## When To Use
|
|
56
56
|
|
|
57
|
-
Use this skill by default unless the task is genuinely trivial, such as:
|
|
57
|
+
Use this skill by default unless the task is genuinely trivial or current tool rules disallow delegation, such as:
|
|
58
58
|
|
|
59
59
|
- a one-shot factual answer with no decomposition value
|
|
60
60
|
- a single obvious command or one-line edit
|
|
@@ -68,16 +68,20 @@ Subagents are most valuable for:
|
|
|
68
68
|
- browser reproduction and debugging
|
|
69
69
|
- parallel edits across unrelated files or modules
|
|
70
70
|
|
|
71
|
-
Keep the main agent in charge when the work is highly continuous, tightly coupled, or depends on a single evolving mental model. In those cases, let subagents provide bounded context, not final ownership.
|
|
71
|
+
Keep the main agent in charge when the work is highly continuous, tightly coupled, or depends on a single evolving mental model. In those cases, let subagents provide bounded context, not final ownership, and do not force parallel writers.
|
|
72
|
+
|
|
73
|
+
This skill is not satisfied by merely writing that Codex should delegate later. When parallelizable sidecar work exists and delegation is allowed, the default compliant shape is a parallel subagents workflow.
|
|
72
74
|
|
|
73
75
|
## Workflow
|
|
74
76
|
|
|
75
77
|
### 1) Triage the task first
|
|
76
78
|
|
|
77
79
|
- Decide whether the task is trivial, serial-but-complex, or parallelizable.
|
|
80
|
+
- If the task is non-trivial and delegation is allowed, you must delegate at least one bounded subtask to a subagent.
|
|
81
|
+
- If the task has two or more independent read/review/exploration tracks, you must use a parallel subagents workflow rather than a single helper agent or a staged suggestion-only plan.
|
|
78
82
|
- Use subagents for most non-trivial tasks, but do not force them into tiny or tightly coupled work.
|
|
79
83
|
- Prefer one writer plus supporting read-only agents when ownership would otherwise overlap.
|
|
80
|
-
-
|
|
84
|
+
- If tool rules require explicit user intent before delegation, confirm that gate first; once satisfied, launch the chosen subagents and do not stay in suggestion-only mode.
|
|
81
85
|
|
|
82
86
|
### 2) Inspect the current agent catalog
|
|
83
87
|
|
|
@@ -100,11 +104,19 @@ Reuse an existing custom agent when all of the following are true:
|
|
|
100
104
|
- its tools, sandbox, and model profile are suitable
|
|
101
105
|
- using it will not create role overlap with another active agent
|
|
102
106
|
|
|
103
|
-
Create a new custom agent
|
|
107
|
+
Create a new custom agent whenever the current task exposes a stable reusable role and:
|
|
104
108
|
|
|
105
109
|
- no existing agent owns the job cleanly
|
|
106
|
-
- the job is likely to recur on similar tasks
|
|
107
110
|
- the responsibility can be described independently from the current one-off prompt
|
|
111
|
+
- the role can be named, bounded, and reused on future tasks even if this is its first appearance
|
|
112
|
+
|
|
113
|
+
Do not require repeated historical use before creating a reusable custom agent. Treat "reusable" as a property of role clarity and stable boundaries, not as proof that the exact same task has already repeated many times.
|
|
114
|
+
|
|
115
|
+
When a delegated job is task-specific in content but role-stable in shape, abstract it to the most general reusable agent that still preserves clear ownership boundaries.
|
|
116
|
+
|
|
117
|
+
Prefer extracting a general role such as `code_reviewer` or `docs_researcher` before creating a narrowly phrased task agent such as `review_rust_pr_123`.
|
|
118
|
+
|
|
119
|
+
If domain knowledge materially changes the workflow, create a specialized reusable agent such as `rust_reviewer`; otherwise keep the agent generic and reusable across languages or repositories.
|
|
108
120
|
|
|
109
121
|
Do not create near-duplicates. Tighten or extend an existing agent when the gap is small and the responsibility remains coherent.
|
|
110
122
|
|
|
@@ -115,6 +127,7 @@ Do not create near-duplicates. Tighten or extend an existing agent when the gap
|
|
|
115
127
|
- Match the filename to the `name` field unless there is a strong reason not to.
|
|
116
128
|
- Keep `description` human-facing and routing-oriented: it should explain when Codex should use the agent.
|
|
117
129
|
- Keep `developer_instructions` stable and role-specific; do not leak current task noise into reusable instructions.
|
|
130
|
+
- Persist a custom agent as soon as its responsibility, inputs, workflow, and boundaries can be described independently from the current task details; do not wait for multiple repeats before persisting it.
|
|
118
131
|
- Set `model` to either `gpt-5.3-codex` or `gpt-5.4`.
|
|
119
132
|
- Set `model_reasoning_effort` from actual task complexity, not from agent prestige or habit.
|
|
120
133
|
|
|
@@ -122,7 +135,7 @@ Naming rule for this skill:
|
|
|
122
135
|
|
|
123
136
|
- choose a short English noun phrase
|
|
124
137
|
- normalize it to snake_case
|
|
125
|
-
- examples: `code_mapper`, `docs_researcher`, `
|
|
138
|
+
- examples: `code_mapper`, `code_reviewer`, `docs_researcher`, `rust_reviewer`, `browser_debugger`
|
|
126
139
|
|
|
127
140
|
### 5) Use the fixed instruction format
|
|
128
141
|
|
|
@@ -147,6 +160,7 @@ Whenever you prompt a subagent, include:
|
|
|
147
160
|
- the expected summary or output format
|
|
148
161
|
- the file or module ownership boundary
|
|
149
162
|
- the stop condition if the agent hits uncertainty or overlap
|
|
163
|
+
- the instruction to stay within current tool-rule limits for delegation, sandbox, and write scope
|
|
150
164
|
|
|
151
165
|
### 6) Decompose ownership before spawning
|
|
152
166
|
|
|
@@ -163,9 +177,11 @@ Avoid combining exploration, review, and editing into one reusable agent when th
|
|
|
163
177
|
|
|
164
178
|
### 7) Orchestrate the run
|
|
165
179
|
|
|
166
|
-
-
|
|
180
|
+
- Use actual subagent tool calls when delegation is allowed; do not stop at writing that Codex should spawn agents later.
|
|
181
|
+
- State exactly how to split the work before each launch.
|
|
167
182
|
- Say whether to wait for all agents before continuing or to stage them in sequence.
|
|
168
183
|
- Ask for concise returned summaries, not raw logs.
|
|
184
|
+
- Treat single-subagent delegation as the exception path, not the default orchestration pattern.
|
|
169
185
|
|
|
170
186
|
Preferred patterns:
|
|
171
187
|
|
|
@@ -176,6 +192,7 @@ Preferred patterns:
|
|
|
176
192
|
Practical default:
|
|
177
193
|
|
|
178
194
|
- spawn 2-4 agents for a complex task
|
|
195
|
+
- spawn at least 2 agents when the task clearly contains parallelizable investigation or review tracks
|
|
179
196
|
- keep within the current `agents.max_threads`
|
|
180
197
|
- keep nesting shallow; many Codex setups leave `agents.max_depth` at 1 unless configured otherwise
|
|
181
198
|
|
|
@@ -194,6 +211,7 @@ If the task turns into one tightly coupled stream of work, stop delegating new e
|
|
|
194
211
|
### 9) Maintain the agent catalog after the task
|
|
195
212
|
|
|
196
213
|
- Persist any new reusable custom agent to `~/.codex/agents/`.
|
|
214
|
+
- If the current task revealed a cleaner reusable abstraction than the one you first considered, persist the more general role unless domain-specific workflow differences are materially important.
|
|
197
215
|
- If a newly created agent proved too broad, narrow its description and instructions before finishing.
|
|
198
216
|
- If two agents overlap heavily, keep one and tighten the other instead of letting both drift.
|
|
199
217
|
- Do not persist throwaway agents that are really just one-off prompts.
|