@moon791017/neo-skills 1.1.9 → 1.1.11
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/package.json
CHANGED
|
@@ -2,11 +2,11 @@
|
|
|
2
2
|
|
|
3
3
|
Use this reference when designing loop architectures that automate agent-driven workflows beyond a single session.
|
|
4
4
|
|
|
5
|
-
##
|
|
5
|
+
## Relationship Between Loops and Harnesses
|
|
6
6
|
|
|
7
|
-
- Harness =
|
|
8
|
-
- Loop = harness
|
|
9
|
-
-
|
|
7
|
+
- Harness = the working environment for a single agent (guides + sensors + gates)
|
|
8
|
+
- Loop = the scheduling layer on top of the harness that lets the harness run itself
|
|
9
|
+
- Designing a loop does not replace prompts; it systematizes repetitive prompt actions
|
|
10
10
|
|
|
11
11
|
```text
|
|
12
12
|
Loop = Automations + Worktrees + Skills + Connectors + Sub-agents + State
|
|
@@ -14,137 +14,136 @@ Loop = Automations + Worktrees + Skills + Connectors + Sub-agents + State
|
|
|
14
14
|
running on top of the Harness
|
|
15
15
|
```
|
|
16
16
|
|
|
17
|
-
##
|
|
17
|
+
## Five Primitives + State
|
|
18
18
|
|
|
19
|
-
### 1. Automations
|
|
19
|
+
### 1. Automations (Heartbeat)
|
|
20
20
|
|
|
21
|
-
|
|
21
|
+
A loop without automations runs only once; automations make it repeat.
|
|
22
22
|
|
|
23
|
-
-
|
|
24
|
-
-
|
|
25
|
-
-
|
|
26
|
-
- `/loop`
|
|
23
|
+
- Schedule-driven triggers that periodically run exploration and classification.
|
|
24
|
+
- Findings go to the triage inbox; non-findings are auto-archived.
|
|
25
|
+
- Pair with skills to keep scheduled tasks maintainable—invoke `$skill-name` instead of pasting a wall of instructions.
|
|
26
|
+
- `/loop` repeats at a set frequency; `/goal` runs until a stop condition is met, with an independent model judging completion.
|
|
27
27
|
|
|
28
|
-
|
|
28
|
+
Tool mapping:
|
|
29
29
|
|
|
30
|
-
- Codex
|
|
31
|
-
- Claude Code
|
|
30
|
+
- Codex: Automations tab (select project, prompt, frequency, environment); results go to Triage inbox; `/goal` for run-until-done.
|
|
31
|
+
- Claude Code: `/loop`, `/goal`, hooks, cron, GitHub Actions.
|
|
32
32
|
|
|
33
|
-
### 2. Worktrees
|
|
33
|
+
### 2. Worktrees (Isolation)
|
|
34
34
|
|
|
35
|
-
|
|
35
|
+
Prevent file conflicts when multiple agents run in parallel.
|
|
36
36
|
|
|
37
|
-
-
|
|
38
|
-
-
|
|
39
|
-
-
|
|
37
|
+
- Each agent works in its own git worktree, sharing repo history.
|
|
38
|
+
- One agent's edits never touch another agent's checkout.
|
|
39
|
+
- Human review bandwidth is still the bottleneck—worktrees solve mechanical conflicts, but the number of agents you can run is limited by how many threads you can review simultaneously (orchestration tax).
|
|
40
40
|
|
|
41
|
-
|
|
41
|
+
Tool mapping:
|
|
42
42
|
|
|
43
|
-
- Codex
|
|
44
|
-
- Claude Code
|
|
43
|
+
- Codex: Built-in worktree per thread.
|
|
44
|
+
- Claude Code: `git worktree`, `--worktree` flag, subagent `isolation: worktree` setting.
|
|
45
45
|
|
|
46
|
-
### 3. Skills
|
|
46
|
+
### 3. Skills (Crystallized Knowledge)
|
|
47
47
|
|
|
48
|
-
|
|
48
|
+
Write repeatedly explained project context into a SKILL.md.
|
|
49
49
|
|
|
50
|
-
-
|
|
51
|
-
-
|
|
52
|
-
-
|
|
50
|
+
- Eliminate intent debt: on every cold start, an agent fills intent gaps with confident guesses. A skill externalizes intent so the agent reads it every time instead of reconstructing it.
|
|
51
|
+
- A loop without skills re-derives your entire project from scratch each cycle; a loop with skills carries forward knowledge from the last run.
|
|
52
|
+
- A skill is an authoring format; a plugin is a distribution format—package skills as plugins when sharing across repos.
|
|
53
53
|
|
|
54
|
-
|
|
54
|
+
Tool mapping:
|
|
55
55
|
|
|
56
|
-
- Codex
|
|
57
|
-
- Claude Code
|
|
56
|
+
- Codex: Agent Skills (`SKILL.md`), invoked via `$name` or `/skills`, or auto-triggered by description.
|
|
57
|
+
- Claude Code: Agent Skills (`SKILL.md`).
|
|
58
58
|
|
|
59
|
-
### 4. Plugins / Connectors
|
|
59
|
+
### 4. Plugins / Connectors (External Integration)
|
|
60
60
|
|
|
61
|
-
|
|
61
|
+
Connect external tools via MCP so the loop can act in real environments.
|
|
62
62
|
|
|
63
|
-
-
|
|
64
|
-
- Codex
|
|
65
|
-
- Plugins
|
|
63
|
+
- Can connect to issue trackers, databases, staging APIs, Slack.
|
|
64
|
+
- Both Codex and Claude Code use MCP; connectors are generally cross-tool portable.
|
|
65
|
+
- Plugins bundle connectors and skills together for one-step team installation.
|
|
66
66
|
|
|
67
|
-
|
|
67
|
+
A loop without connectors can only output suggestions; a loop with connectors can open PRs, link tickets, and ping channels directly.
|
|
68
68
|
|
|
69
|
-
### 5. Sub-agents
|
|
69
|
+
### 5. Sub-agents (Separating Generation from Verification)
|
|
70
70
|
|
|
71
|
-
|
|
71
|
+
The structural premise of a loop is separating maker from checker.
|
|
72
72
|
|
|
73
|
-
-
|
|
74
|
-
- `/goal`
|
|
75
|
-
-
|
|
76
|
-
- Sub-agents
|
|
73
|
+
- The model that writes the code grades its own work too leniently. A second agent with different instructions (sometimes a different model) catches issues the first agent convinced itself to accept.
|
|
74
|
+
- `/goal` also uses maker/checker separation under the hood—an independent small model judges whether the loop is done, rather than letting the working agent declare itself finished.
|
|
75
|
+
- Common division of labor: one explores, one implements, one verifies against spec.
|
|
76
|
+
- Sub-agents burn more tokens; spend them where a second opinion is worthwhile.
|
|
77
77
|
|
|
78
|
-
>
|
|
78
|
+
> **Responsibility boundary**: This section only covers the design rationale for why loops need maker/checker separation. For implementation details such as sub-agent definition format, instruction writing, and model selection, use the `neo-sub-agent` skill.
|
|
79
79
|
|
|
80
|
-
|
|
80
|
+
Tool mapping:
|
|
81
81
|
|
|
82
|
-
- Codex
|
|
83
|
-
- Claude Code
|
|
82
|
+
- Codex: TOML definition files under `.codex/agents/`, each with name, description, instructions, optional model, and reasoning effort.
|
|
83
|
+
- Claude Code: Subagent definitions under `.claude/agents/` + agent teams.
|
|
84
84
|
|
|
85
|
-
### 6. State
|
|
85
|
+
### 6. State (External Memory)
|
|
86
86
|
|
|
87
|
-
|
|
87
|
+
Models forget between conversations; progress must be written to the repo.
|
|
88
88
|
|
|
89
|
-
-
|
|
90
|
-
- State
|
|
91
|
-
- 看起來太簡單不重要,但每個 long-running agent 都依賴它:agent 會忘,repo 不會。
|
|
89
|
+
- Format: markdown files, Linear boards, or any persistent store outside the conversation.
|
|
90
|
+
- State tracks what was done, what passed, and what remains. Every long-running agent depends on it: agents forget, repos don't.
|
|
92
91
|
|
|
93
|
-
##
|
|
92
|
+
## Primitives Comparison Table
|
|
94
93
|
|
|
95
|
-
|
|
|
94
|
+
| Primitive | Role in Loop | Codex | Claude Code |
|
|
96
95
|
|:--|:--|:--|:--|
|
|
97
|
-
| Automations |
|
|
98
|
-
| Worktrees |
|
|
99
|
-
| Skills |
|
|
100
|
-
| Plugins / Connectors |
|
|
101
|
-
| Sub-agents |
|
|
102
|
-
| State |
|
|
96
|
+
| Automations | Scheduled exploration and classification | Automations tab, `/goal` | `/loop`, `/goal`, hooks, cron, GitHub Actions |
|
|
97
|
+
| Worktrees | Parallel isolation | Built-in worktree per thread | `git worktree`, `--worktree`, `isolation: worktree` |
|
|
98
|
+
| Skills | Crystallized project knowledge | Agent Skills (`SKILL.md`), `$name` | Agent Skills (`SKILL.md`) |
|
|
99
|
+
| Plugins / Connectors | External tool integration | Connectors (MCP) + Plugins | MCP servers + Plugins |
|
|
100
|
+
| Sub-agents | Separating generation from verification | `.codex/agents/` TOML | `.claude/agents/` + agent teams |
|
|
101
|
+
| State | Cross-conversation progress | Markdown / Linear connector | Markdown (`AGENTS.md`, progress files) / Linear MCP |
|
|
103
102
|
|
|
104
|
-
##
|
|
103
|
+
## Example: A Complete Loop Flow
|
|
105
104
|
|
|
106
|
-
1. **Automation**
|
|
107
|
-
2. Triage skill
|
|
108
|
-
3.
|
|
109
|
-
4.
|
|
110
|
-
5.
|
|
111
|
-
6.
|
|
112
|
-
7. **Connectors**
|
|
113
|
-
8.
|
|
114
|
-
9. **
|
|
115
|
-
10.
|
|
105
|
+
1. **Automation** runs on the repo every morning; prompt invokes the triage skill.
|
|
106
|
+
2. Triage skill reads yesterday's CI failures, open issues, and recent commits.
|
|
107
|
+
3. Noteworthy findings are written to a **state file** or Linear board.
|
|
108
|
+
4. For each finding, an isolated **worktree** is created.
|
|
109
|
+
5. A **sub-agent** (maker) is sent into the worktree to draft a fix.
|
|
110
|
+
6. A second **sub-agent** (checker) reviews the draft using project **skills** and existing tests.
|
|
111
|
+
7. **Connectors** open a PR, update the ticket, and ping the channel once CI passes.
|
|
112
|
+
8. Findings that cannot be handled are sent to the triage inbox for humans.
|
|
113
|
+
9. The **state file** records what was attempted, what passed, and what remains open.
|
|
114
|
+
10. Tomorrow morning's run picks up from state.
|
|
116
115
|
|
|
117
|
-
|
|
116
|
+
You design it once; after that, you never manually prompt any step.
|
|
118
117
|
|
|
119
|
-
## Loop
|
|
118
|
+
## Three Major Loop Risks
|
|
120
119
|
|
|
121
|
-
### 1.
|
|
120
|
+
### 1. Verification Is Still on You
|
|
122
121
|
|
|
123
|
-
|
|
122
|
+
An unattended loop also makes mistakes unattended. Maker/checker separation is necessary but not sufficient—"done" is a claim, not a proof. Your job is to ship code you have confirmed works.
|
|
124
123
|
|
|
125
|
-
### 2.
|
|
124
|
+
### 2. Comprehension Debt
|
|
126
125
|
|
|
127
|
-
|
|
126
|
+
The faster a loop produces code you didn't write, the larger your understanding gap grows. Unless you read what the loop produces, comprehension debt only accelerates.
|
|
128
127
|
|
|
129
|
-
### 3.
|
|
128
|
+
### 3. Cognitive Surrender
|
|
130
129
|
|
|
131
|
-
|
|
130
|
+
When a loop runs itself, people easily stop having opinions and accept everything at face value. The same loop design, used by someone with judgment, accelerates deeply understood work; used by someone without judgment, it becomes a way to avoid understanding the work itself—same action, opposite outcomes.
|
|
132
131
|
|
|
133
|
-
###
|
|
132
|
+
### Risk Mitigation Strategies
|
|
134
133
|
|
|
135
|
-
-
|
|
136
|
-
-
|
|
137
|
-
-
|
|
138
|
-
-
|
|
139
|
-
-
|
|
134
|
+
- Periodically spot-check loop output; don't rely solely on green CI.
|
|
135
|
+
- Set output volume caps on the loop to prevent review backlog from spiraling.
|
|
136
|
+
- Record the timestamp of the last human review in the state file.
|
|
137
|
+
- Force high-risk changes (security, compliance, product scope) to exit the loop and wait for a human.
|
|
138
|
+
- Regularly feed loop error patterns back to improve the harness (agentic flywheel).
|
|
140
139
|
|
|
141
|
-
##
|
|
140
|
+
## When to Introduce a Loop vs. Stay with the Harness
|
|
142
141
|
|
|
143
|
-
|
|
|
142
|
+
| Condition | Recommendation |
|
|
144
143
|
|:--|:--|
|
|
145
|
-
|
|
|
146
|
-
| CI
|
|
147
|
-
|
|
|
148
|
-
| Maturity Level < 3 |
|
|
149
|
-
|
|
|
150
|
-
|
|
|
144
|
+
| Project lacks reliable local verification commands | Build the harness first |
|
|
145
|
+
| CI is unstable or frequently red | Fix CI first |
|
|
146
|
+
| Team has no review process for agent output | Establish a review process first |
|
|
147
|
+
| Maturity Level < 3 | Upgrade the harness first |
|
|
148
|
+
| Highly repetitive, low-risk tasks (triage, format fixes, dependency updates) | Good fit for a loop |
|
|
149
|
+
| Changes involve product scope, security, or architecture trade-offs | Not suitable for a fully automated loop |
|