@laitszkin/apollo-toolkit 2.4.0 → 2.4.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/CHANGELOG.md CHANGED
@@ -4,6 +4,20 @@ All notable changes to this repository are documented in this file.
4
4
 
5
5
  ## [Unreleased]
6
6
 
7
+ ## [v2.4.2] - 2026-03-19
8
+
9
+ ### Changed
10
+ - Relax `codex-subagent-orchestration` so reusable custom agents no longer require repeated historical use before creation or persistence.
11
+ - Require agents to abstract task-specific delegation into the most general reusable role that still preserves clear ownership boundaries, such as `code_reviewer` before narrower one-off task agents.
12
+ - Clarify when domain-specific specialization such as `rust_reviewer` is warranted and when a generic reusable reviewer should be preferred.
13
+
14
+ ## [v2.4.1] - 2026-03-19
15
+
16
+ ### Changed
17
+ - Tighten `codex-subagent-orchestration` so non-trivial tasks must use actual subagent tool calls when delegation is allowed, instead of stopping at prose-only delegation guidance.
18
+ - Require `codex-subagent-orchestration` to default to a parallel subagents workflow whenever two or more independent workstreams can run safely in parallel.
19
+ - Clarify runtime handoff and orchestration boundaries for delegated agents, including tool-rule, sandbox, write-scope, and isolated-review expectations.
20
+
7
21
  ## [v2.4.0] - 2026-03-19
8
22
 
9
23
  ### Added
@@ -15,8 +15,8 @@ description: Use for almost every non-trivial Codex task. Inspect existing custo
15
15
  ## Standards
16
16
 
17
17
  - Evidence: Inspect the current task shape and the existing custom-agent catalog before creating or updating any agent.
18
- - Execution: Use this skill for nearly every non-trivial task; explicitly tell Codex to spawn one or more subagents unless the task is trivial or delegation is disallowed; delegate read-heavy exploration, review, verification, and unrelated module edits; keep shared planning, conflict resolution, and final synthesis in the main agent.
19
- - Quality: Keep each custom agent narrow, opinionated, and non-overlapping; prefer read-only sandboxes for explorers and reviewers; avoid parallel write conflicts.
18
+ - Execution: Use this skill for nearly every non-trivial task. When current tool rules allow delegation, the agent must actually launch one or more subagents instead of only describing delegation in prose. Treat a parallel subagents workflow as the default whenever two or more independent workstreams can run safely in parallel, use a single helper agent only when there is exactly one bounded sidecar job, and keep shared planning, conflict resolution, and final synthesis in the main agent.
19
+ - Quality: Keep each custom agent narrow, opinionated, and non-overlapping; prefer read-only sandboxes for explorers and reviewers; keep review contexts isolated from the implementation thread; avoid parallel write conflicts.
20
20
  - Output: State which agents were reused or created, what each owned, whether they waited in parallel or were staged, and what remained with the main agent.
21
21
 
22
22
  ## Overview
@@ -54,7 +54,7 @@ Prefer `gpt-5.3-codex` first for agentic coding roles. Escalate to `gpt-5.4` whe
54
54
 
55
55
  ## When To Use
56
56
 
57
- Use this skill by default unless the task is genuinely trivial, such as:
57
+ Use this skill by default unless the task is genuinely trivial or current tool rules disallow delegation, such as:
58
58
 
59
59
  - a one-shot factual answer with no decomposition value
60
60
  - a single obvious command or one-line edit
@@ -68,16 +68,20 @@ Subagents are most valuable for:
68
68
  - browser reproduction and debugging
69
69
  - parallel edits across unrelated files or modules
70
70
 
71
- Keep the main agent in charge when the work is highly continuous, tightly coupled, or depends on a single evolving mental model. In those cases, let subagents provide bounded context, not final ownership.
71
+ Keep the main agent in charge when the work is highly continuous, tightly coupled, or depends on a single evolving mental model. In those cases, let subagents provide bounded context, not final ownership, and do not force parallel writers.
72
+
73
+ This skill is not satisfied by merely writing that Codex should delegate later. When parallelizable sidecar work exists and delegation is allowed, the default compliant shape is a parallel subagents workflow.
72
74
 
73
75
  ## Workflow
74
76
 
75
77
  ### 1) Triage the task first
76
78
 
77
79
  - Decide whether the task is trivial, serial-but-complex, or parallelizable.
80
+ - If the task is non-trivial and delegation is allowed, you must delegate at least one bounded subtask to a subagent.
81
+ - If the task has two or more independent read/review/exploration tracks, you must use a parallel subagents workflow rather than a single helper agent or a staged suggestion-only plan.
78
82
  - Use subagents for most non-trivial tasks, but do not force them into tiny or tightly coupled work.
79
83
  - Prefer one writer plus supporting read-only agents when ownership would otherwise overlap.
80
- - For any non-trivial task, explicitly instruct Codex to spawn the chosen subagents unless delegation is blocked.
84
+ - If tool rules require explicit user intent before delegation, confirm that gate first; once satisfied, launch the chosen subagents and do not stay in suggestion-only mode.
81
85
 
82
86
  ### 2) Inspect the current agent catalog
83
87
 
@@ -100,11 +104,19 @@ Reuse an existing custom agent when all of the following are true:
100
104
  - its tools, sandbox, and model profile are suitable
101
105
  - using it will not create role overlap with another active agent
102
106
 
103
- Create a new custom agent only when:
107
+ Create a new custom agent whenever the current task exposes a stable reusable role and:
104
108
 
105
109
  - no existing agent owns the job cleanly
106
- - the job is likely to recur on similar tasks
107
110
  - the responsibility can be described independently from the current one-off prompt
111
+ - the role can be named, bounded, and reused on future tasks even if this is its first appearance
112
+
113
+ Do not require repeated historical use before creating a reusable custom agent. Treat "reusable" as a property of role clarity and stable boundaries, not as proof that the exact same task has already repeated many times.
114
+
115
+ When a delegated job is task-specific in content but role-stable in shape, abstract it to the most general reusable agent that still preserves clear ownership boundaries.
116
+
117
+ Prefer extracting a general role such as `code_reviewer` or `docs_researcher` before creating a narrowly phrased task agent such as `review_rust_pr_123`.
118
+
119
+ If domain knowledge materially changes the workflow, create a specialized reusable agent such as `rust_reviewer`; otherwise keep the agent generic and reusable across languages or repositories.
108
120
 
109
121
  Do not create near-duplicates. Tighten or extend an existing agent when the gap is small and the responsibility remains coherent.
110
122
 
@@ -115,6 +127,7 @@ Do not create near-duplicates. Tighten or extend an existing agent when the gap
115
127
  - Match the filename to the `name` field unless there is a strong reason not to.
116
128
  - Keep `description` human-facing and routing-oriented: it should explain when Codex should use the agent.
117
129
  - Keep `developer_instructions` stable and role-specific; do not leak current task noise into reusable instructions.
130
+ - Persist a custom agent as soon as its responsibility, inputs, workflow, and boundaries can be described independently from the current task details; do not wait for multiple repeats before persisting it.
118
131
  - Set `model` to either `gpt-5.3-codex` or `gpt-5.4`.
119
132
  - Set `model_reasoning_effort` from actual task complexity, not from agent prestige or habit.
120
133
 
@@ -122,7 +135,7 @@ Naming rule for this skill:
122
135
 
123
136
  - choose a short English noun phrase
124
137
  - normalize it to snake_case
125
- - examples: `code_mapper`, `docs_researcher`, `browser_debugger`, `payments_reviewer`
138
+ - examples: `code_mapper`, `code_reviewer`, `docs_researcher`, `rust_reviewer`, `browser_debugger`
126
139
 
127
140
  ### 5) Use the fixed instruction format
128
141
 
@@ -147,6 +160,7 @@ Whenever you prompt a subagent, include:
147
160
  - the expected summary or output format
148
161
  - the file or module ownership boundary
149
162
  - the stop condition if the agent hits uncertainty or overlap
163
+ - the instruction to stay within current tool-rule limits for delegation, sandbox, and write scope
150
164
 
151
165
  ### 6) Decompose ownership before spawning
152
166
 
@@ -163,9 +177,11 @@ Avoid combining exploration, review, and editing into one reusable agent when th
163
177
 
164
178
  ### 7) Orchestrate the run
165
179
 
166
- - Explicitly tell Codex to spawn the selected subagents and state exactly how to split the work.
180
+ - Use actual subagent tool calls when delegation is allowed; do not stop at writing that Codex should spawn agents later.
181
+ - State exactly how to split the work before each launch.
167
182
  - Say whether to wait for all agents before continuing or to stage them in sequence.
168
183
  - Ask for concise returned summaries, not raw logs.
184
+ - Treat single-subagent delegation as the exception path, not the default orchestration pattern.
169
185
 
170
186
  Preferred patterns:
171
187
 
@@ -176,6 +192,7 @@ Preferred patterns:
176
192
  Practical default:
177
193
 
178
194
  - spawn 2-4 agents for a complex task
195
+ - spawn at least 2 agents when the task clearly contains parallelizable investigation or review tracks
179
196
  - keep within the current `agents.max_threads`
180
197
  - keep nesting shallow; many Codex setups leave `agents.max_depth` at 1 unless configured otherwise
181
198
 
@@ -194,6 +211,7 @@ If the task turns into one tightly coupled stream of work, stop delegating new e
194
211
  ### 9) Maintain the agent catalog after the task
195
212
 
196
213
  - Persist any new reusable custom agent to `~/.codex/agents/`.
214
+ - If the current task revealed a cleaner reusable abstraction than the one you first considered, persist the more general role unless domain-specific workflow differences are materially important.
197
215
  - If a newly created agent proved too broad, narrow its description and instructions before finishing.
198
216
  - If two agents overlap heavily, keep one and tighten the other instead of letting both drift.
199
217
  - Do not persist throwaway agents that are really just one-off prompts.
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "@laitszkin/apollo-toolkit",
3
- "version": "2.4.0",
3
+ "version": "2.4.2",
4
4
  "description": "Apollo Toolkit npm installer for managed skill linking across Codex, OpenClaw, and Trae.",
5
5
  "license": "MIT",
6
6
  "author": "LaiTszKin",