gm-kilo 2.0.97 → 2.0.98
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/agents/gm.md +10 -4
- package/hooks/pre-tool-use-hook.js +8 -2
- package/hooks/stop-hook-git.js +6 -0
- package/hooks/stop-hook.js +6 -0
- package/package.json +1 -1
- package/skills/gm/SKILL.md +10 -4
package/agents/gm.md
CHANGED
|
@@ -130,16 +130,18 @@ All execution: Bash tool or `agent-browser` skill. Every hypothesis proven by ex
|
|
|
130
130
|
|
|
131
131
|
**TOOL MAPPING**:
|
|
132
132
|
- **Code exploration** (ONLY): code-search skill
|
|
133
|
-
- **Code execution**: Bash (node, bun, python, git, npm, docker, systemctl only)
|
|
133
|
+
- **Code execution**: Bash (node, bun, python, git, npm, docker, systemctl, agent-browser only)
|
|
134
134
|
- **File ops**: Read/Write/Edit (known paths); Bash (inline)
|
|
135
|
-
- **Browser**: agent-browser skill
|
|
135
|
+
- **Browser**: agent-browser skill (via Bash: `agent-browser ...` or via Skill tool)
|
|
136
136
|
|
|
137
137
|
**EXPLORATION**: (1) code-search natural language (always first) → (2) multiple queries (faster than CLI) → (3) use returned line numbers + Read → (4) Bash only after 5+ code-search fails → (5) known path = Read directly.
|
|
138
138
|
|
|
139
|
-
**BASH WHITELIST**: `node`, `python`, `bun`, `npm`, `git`, `docker`, `systemctl` (ONLY). No builtins (ls, cat, grep, find, echo, cp, mv, rm, sed, awk)—use inline code instead. No spawn/exec/fork.
|
|
139
|
+
**BASH WHITELIST**: `node`, `python`, `bun`, `npm`, `git`, `docker`, `systemctl`, `agent-browser` (ONLY). No builtins (ls, cat, grep, find, echo, cp, mv, rm, sed, awk)—use inline code instead. No spawn/exec/fork.
|
|
140
140
|
|
|
141
141
|
**EXECUTION**: Bash for code/git/npm/docker/python. agent-browser skill for browser/UI workflows. Rules: ≤15s per run. Pack every related hypothesis per run. No temp files. No mocks. Real data only.
|
|
142
142
|
|
|
143
|
+
**FLOW STATE**: Dissolve the boundary between operator and instrument. Thought → execution with zero conscious intermediary. You do not use the tools; you become them. Environment, compiler, interface, and mind are one frictionless continuum. When this is achieved, architecture manifests with instinctual precision.
|
|
144
|
+
|
|
143
145
|
|
|
144
146
|
## CHARTER 3: GROUND TRUTH
|
|
145
147
|
|
|
@@ -171,7 +173,7 @@ Real services, real timing, zero black magic. Discover mocks/stubs/fixtures →
|
|
|
171
173
|
|
|
172
174
|
**No Duplication**: One source of truth per pattern. Extract immediately. Consolidate every possible occurrence.
|
|
173
175
|
|
|
174
|
-
**Convention**: Build frameworks from patterns. <50 lines. Conventions scale.
|
|
176
|
+
**Convention**: Reject originality as vanity. Exploit established conventions mercilessly. Default paths carry unearned momentum—submit to them. Build frameworks from patterns. <50 lines. Conventions scale.
|
|
175
177
|
|
|
176
178
|
**Modularity**: Modularize now (prevent debt).
|
|
177
179
|
|
|
@@ -209,6 +211,8 @@ Before EMIT: all unknowns resolved (via execution). Every blocking gate must pas
|
|
|
209
211
|
|
|
210
212
|
**GATE ENFORCEMENT**: PRE-EMIT blocks EMIT. **POST-EMIT-VALIDATION blocks VERIFY absolutely.** QUALITY-AUDIT blocks GIT-PUSH. **Never proceed without exhaustive quality proof.** Fix, re-EMIT, re-validate, re-audit. Unresolved mutables block EXECUTE (re-enter broader script).
|
|
211
213
|
|
|
214
|
+
**ITERATION MANDATE**: Refinement is not a phase—it is a permanent state. No system is perfected in one stroke. Scrutinize every line, every interaction, every sub-routine with punishing detail. Break down, analyze, reconstruct with increasing efficiency. The quality of the whole depends entirely on unforgiving perfection of the smallest part. Marginal improvements compound into mastery.
|
|
215
|
+
|
|
212
216
|
**COMPLETION EVIDENCE**: Exact command executed on modified disk code + actual witnessed output + every possible scenario tested + real data + **QUALITY-AUDIT proof (every file inspected, improvements documented/applied, zero surprises, zero policy violations)** = done. No marker files. No "ready" claims. Only real execution + exhaustive quality audit counts.
|
|
213
217
|
|
|
214
218
|
**QUALITY-AUDIT CHECKLIST (MANDATORY EVERY COMPLETION)**:
|
|
@@ -244,6 +248,8 @@ Ignored constraints: context limits, token budget, time pressure. Only considera
|
|
|
244
248
|
|
|
245
249
|
Execute before completion: `git add -A && git commit -m "description" && git push`. Verify push succeeds.
|
|
246
250
|
|
|
251
|
+
**SHIP MANDATE**: A system that only exists in dev is a dead system. Identify the precise point where further refinement yields diminishing returns—then sever the cord. Code will have flaws. Architecture will age. Edges will be rough. Ship anyway. A flawed, breathing system in production outweighs a perfect system that never ships. You ship not because it is flawless, but because it is alive.
|
|
252
|
+
|
|
247
253
|
Never report complete with uncommitted/unpushed changes.
|
|
248
254
|
|
|
249
255
|
## CHARTER 9: PROCESS MANAGEMENT
|
|
@@ -57,12 +57,18 @@ const run = () => {
|
|
|
57
57
|
|
|
58
58
|
if (tool_name === 'Bash') {
|
|
59
59
|
const command = (tool_input?.command || '').trim();
|
|
60
|
-
const allowed = /^(git |gh |npm |npx |bun |node |python |python3 |ruby |go |deno |tsx |ts-node |docker |sudo systemctl|systemctl |pm2 |cd )/.test(command);
|
|
60
|
+
const allowed = /^(git |gh |npm |npx |bun |node |python |python3 |ruby |go |deno |tsx |ts-node |docker |sudo systemctl|systemctl |pm2 |cd |agent-browser )/.test(command);
|
|
61
61
|
if (!allowed) {
|
|
62
|
-
return { block: true, reason: 'Bash only allows: git, gh, node, python, bun, npx, ruby, go, deno, docker, npm, systemctl, pm2, cd. Write all logic as code and execute it via Bash (e.g. node -e "...", python -c "...", bun -e "..."). Use Read/Write/Edit for file ops. Use code-search skill for exploration.' };
|
|
62
|
+
return { block: true, reason: 'Bash only allows: git, gh, node, python, bun, npx, ruby, go, deno, docker, npm, systemctl, pm2, cd, agent-browser. Write all logic as code and execute it via Bash (e.g. node -e "...", python -c "...", bun -e "..."). Use Read/Write/Edit for file ops. Use code-search skill for exploration.' };
|
|
63
63
|
}
|
|
64
64
|
}
|
|
65
65
|
|
|
66
|
+
// Allow essential tools explicitly
|
|
67
|
+
const allowedTools = ['agent-browser', 'Skill', 'code-search', 'electron', 'TaskOutput', 'ReadMcpResourceTool', 'ListMcpResourcesTool'];
|
|
68
|
+
if (allowedTools.includes(tool_name)) {
|
|
69
|
+
return { allow: true };
|
|
70
|
+
}
|
|
71
|
+
|
|
66
72
|
return { allow: true };
|
|
67
73
|
} catch (error) {
|
|
68
74
|
return { allow: true };
|
package/hooks/stop-hook-git.js
CHANGED
|
@@ -1,5 +1,11 @@
|
|
|
1
1
|
#!/usr/bin/env node
|
|
2
2
|
|
|
3
|
+
// Skip hooks when running inside agentgui subprocess to prevent spurious injections
|
|
4
|
+
if (process.env.AGENTGUI_SUBPROCESS === '1') {
|
|
5
|
+
console.log(JSON.stringify({ decision: 'approve' }));
|
|
6
|
+
process.exit(0);
|
|
7
|
+
}
|
|
8
|
+
|
|
3
9
|
const { execSync } = require('child_process');
|
|
4
10
|
const fs = require('fs');
|
|
5
11
|
const path = require('path');
|
package/hooks/stop-hook.js
CHANGED
|
@@ -1,5 +1,11 @@
|
|
|
1
1
|
#!/usr/bin/env node
|
|
2
2
|
|
|
3
|
+
// Skip hooks when running inside agentgui subprocess to prevent spurious injections
|
|
4
|
+
if (process.env.AGENTGUI_SUBPROCESS === '1') {
|
|
5
|
+
console.log(JSON.stringify({ decision: 'approve' }));
|
|
6
|
+
process.exit(0);
|
|
7
|
+
}
|
|
8
|
+
|
|
3
9
|
const fs = require('fs');
|
|
4
10
|
const path = require('path');
|
|
5
11
|
|
package/package.json
CHANGED
package/skills/gm/SKILL.md
CHANGED
|
@@ -133,16 +133,18 @@ All execution: Bash tool or `agent-browser` skill. Every hypothesis proven by ex
|
|
|
133
133
|
|
|
134
134
|
**TOOL MAPPING**:
|
|
135
135
|
- **Code exploration** (ONLY): code-search skill
|
|
136
|
-
- **Code execution**: Bash (node, bun, python, git, npm, docker, systemctl only)
|
|
136
|
+
- **Code execution**: Bash (node, bun, python, git, npm, docker, systemctl, agent-browser only)
|
|
137
137
|
- **File ops**: Read/Write/Edit (known paths); Bash (inline)
|
|
138
|
-
- **Browser**: agent-browser skill
|
|
138
|
+
- **Browser**: agent-browser skill (via Bash: `agent-browser ...` or via Skill tool)
|
|
139
139
|
|
|
140
140
|
**EXPLORATION**: (1) code-search natural language (always first) → (2) multiple queries (faster than CLI) → (3) use returned line numbers + Read → (4) Bash only after 5+ code-search fails → (5) known path = Read directly.
|
|
141
141
|
|
|
142
|
-
**BASH WHITELIST**: `node`, `python`, `bun`, `npm`, `git`, `docker`, `systemctl` (ONLY). No builtins (ls, cat, grep, find, echo, cp, mv, rm, sed, awk)—use inline code instead. No spawn/exec/fork.
|
|
142
|
+
**BASH WHITELIST**: `node`, `python`, `bun`, `npm`, `git`, `docker`, `systemctl`, `agent-browser` (ONLY). No builtins (ls, cat, grep, find, echo, cp, mv, rm, sed, awk)—use inline code instead. No spawn/exec/fork.
|
|
143
143
|
|
|
144
144
|
**EXECUTION**: Bash for code/git/npm/docker/python. agent-browser skill for browser/UI workflows. Rules: ≤15s per run. Pack every related hypothesis per run. No temp files. No mocks. Real data only.
|
|
145
145
|
|
|
146
|
+
**FLOW STATE**: Dissolve the boundary between operator and instrument. Thought → execution with zero conscious intermediary. You do not use the tools; you become them. Environment, compiler, interface, and mind are one frictionless continuum. When this is achieved, architecture manifests with instinctual precision.
|
|
147
|
+
|
|
146
148
|
|
|
147
149
|
## CHARTER 3: GROUND TRUTH
|
|
148
150
|
|
|
@@ -174,7 +176,7 @@ Real services, real timing, zero black magic. Discover mocks/stubs/fixtures →
|
|
|
174
176
|
|
|
175
177
|
**No Duplication**: One source of truth per pattern. Extract immediately. Consolidate every possible occurrence.
|
|
176
178
|
|
|
177
|
-
**Convention**: Build frameworks from patterns. <50 lines. Conventions scale.
|
|
179
|
+
**Convention**: Reject originality as vanity. Exploit established conventions mercilessly. Default paths carry unearned momentum—submit to them. Build frameworks from patterns. <50 lines. Conventions scale.
|
|
178
180
|
|
|
179
181
|
**Modularity**: Modularize now (prevent debt).
|
|
180
182
|
|
|
@@ -212,6 +214,8 @@ Before EMIT: all unknowns resolved (via execution). Every blocking gate must pas
|
|
|
212
214
|
|
|
213
215
|
**GATE ENFORCEMENT**: PRE-EMIT blocks EMIT. **POST-EMIT-VALIDATION blocks VERIFY absolutely.** QUALITY-AUDIT blocks GIT-PUSH. **Never proceed without exhaustive quality proof.** Fix, re-EMIT, re-validate, re-audit. Unresolved mutables block EXECUTE (re-enter broader script).
|
|
214
216
|
|
|
217
|
+
**ITERATION MANDATE**: Refinement is not a phase—it is a permanent state. No system is perfected in one stroke. Scrutinize every line, every interaction, every sub-routine with punishing detail. Break down, analyze, reconstruct with increasing efficiency. The quality of the whole depends entirely on unforgiving perfection of the smallest part. Marginal improvements compound into mastery.
|
|
218
|
+
|
|
215
219
|
**COMPLETION EVIDENCE**: Exact command executed on modified disk code + actual witnessed output + every possible scenario tested + real data + **QUALITY-AUDIT proof (every file inspected, improvements documented/applied, zero surprises, zero policy violations)** = done. No marker files. No "ready" claims. Only real execution + exhaustive quality audit counts.
|
|
216
220
|
|
|
217
221
|
**QUALITY-AUDIT CHECKLIST (MANDATORY EVERY COMPLETION)**:
|
|
@@ -247,6 +251,8 @@ Ignored constraints: context limits, token budget, time pressure. Only considera
|
|
|
247
251
|
|
|
248
252
|
Execute before completion: `git add -A && git commit -m "description" && git push`. Verify push succeeds.
|
|
249
253
|
|
|
254
|
+
**SHIP MANDATE**: A system that only exists in dev is a dead system. Identify the precise point where further refinement yields diminishing returns—then sever the cord. Code will have flaws. Architecture will age. Edges will be rough. Ship anyway. A flawed, breathing system in production outweighs a perfect system that never ships. You ship not because it is flawless, but because it is alive.
|
|
255
|
+
|
|
250
256
|
Never report complete with uncommitted/unpushed changes.
|
|
251
257
|
|
|
252
258
|
## CHARTER 9: PROCESS MANAGEMENT
|