npm - gm-copilot-cli - Versions diffs - 2.0.682 → 2.0.683 - Mend

gm-copilot-cli 2.0.682 → 2.0.683

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (10) hide show

package/agents/memorize.md +80 -0
package/copilot-profile.md +1 -1
package/hooks/hooks.json +12 -2
package/hooks/post-tool-use-hook.js +34 -0
package/hooks/pre-tool-use-hook.js +45 -0
package/hooks/prompt-submit-hook.js +19 -0
package/index.html +1 -1
package/manifest.yml +1 -1
package/package.json +1 -1
package/tools.json +1 -1

package/agents/memorize.md ADDED Viewed

@@ -0,0 +1,80 @@
+---
+name: memorize
+description: Background memory agent. Classifies context and writes to AGENTS.md + rs-learn. No memory dir, no MEMORY.md.
+agent: true
+---
+# Memorize — Background Memory Agent
+Writes facts to two places only: **AGENTS.md** (non-obvious technical caveats) and **rs-learn** (all classified facts via fast ingest).
+Resolve at start of every run:
+- **Project root** = `process.cwd()` when invoked. `AGENTS.md` is `<project root>/AGENTS.md`.
+## STEP 1: CLASSIFY
+Examine the ## CONTEXT TO MEMORIZE section at the end of this prompt. For each fact, classify as:
+- user: user role, goals, preferences, knowledge
+- feedback: guidance on approach — corrections AND confirmations
+- project: ongoing work, goals, bugs, incidents, decisions
+- reference: pointers to external systems, URLs, paths
+Discard:
+- Obvious facts derivable from reading the code
+- Active task state or session progress
+- Facts that would not be useful in a future session
+## STEP 2: INGEST INTO RS-LEARN
+For each classified fact, invoke `exec:memorize` (HTTP-preferred, subprocess fallback — fast either way):
+```
+exec:memorize
+<type>/<slug>
+<fact body — one to three self-contained sentences>
+```
+Line 1 of the body is the source tag (e.g. `feedback/terse-responses`, `project/merge-freeze`). Lines 2+ are the fact itself. Use kebab-case slugs.
+To invalidate previously-memorized content (correction or retraction):
+```
+exec:forget
+by-source <tag>
+```
+Or by content:
+```
+exec:forget
+by-query <2-6 search words>
+```
+## STEP 3: AGENTS.md
+A non-obvious technical caveat qualifies if it required multiple failed runs to discover and would not be apparent from reading code or docs.
+For each qualifying fact from context:
+- Read AGENTS.md first if not already read this run
+- If AGENTS.md already covers it → skip
+- If genuinely non-obvious → append to the appropriate section
+Never add: obvious patterns, active task progress, redundant restatements.
+## STEP 4: AGENTS.md → RS-LEARN MIGRATION (BENCHMARK + DRAIN)
+AGENTS.md is the **always-on context buffer** — every prompt sees it. rs-learn is the **conditional retrieval store** — only relevant facts surface. The migration loop turns AGENTS.md into a benchmark for rs-learn's recall quality:
+1. Pick **5 random items** from AGENTS.md (sections, paragraphs, or numbered points). Don't pick the most recent additions — pick the oldest stable items.
+2. For each item, derive a 2-6 word query that a future agent would naturally use to find this fact.
+3. Run `exec:recall` with that query.
+4. Decide:
+   - **Recall accurate AND complete** → the rs-learn store has internalized this fact; **remove it from AGENTS.md**. Frees buffer space and confirms learning.
+   - **Recall partial / outdated / missing** → keep the AGENTS.md item AND ingest a refined version of the fact via `exec:memorize` so next round it can pass. Note the outcome in your run log.
+5. Record the audit cycle: how many items checked, how many removed, how many refined. Append this single-line summary to AGENTS.md under a `## Learning audit` section so future audits can see drift over time.
+Why: AGENTS.md grows monotonically without this loop. rs-learn already filters by relevance per-prompt, so duplicating stable facts in AGENTS.md just inflates the always-on context. The migration drains AGENTS.md into the retrieval store as the store proves it can recall. Failed migrations leave the fact in AGENTS.md (safe default) and improve the store. Success rate over time = a metric for how well gm is learning this project.
+Don't migrate if the fact is genuinely about agent meta-behavior that must be active every prompt (e.g. "always invoke gm:gm first") — those stay permanently.

package/copilot-profile.md CHANGED Viewed

@@ -1,6 +1,6 @@
 ---
 name: gm
-version: 2.0.682
+version: 2.0.683
 description: State machine agent with hooks, skills, and automated git enforcement
 author: AnEntrypoint
 repository: https://github.com/AnEntrypoint/gm-copilot-cli

package/hooks/hooks.json CHANGED Viewed

@@ -8,7 +8,12 @@
           {
             "type": "command",
             "command": "${COPILOT_EXTENSION_DIR}/bin/plugkit hook pre-tool-use",
-            "timeout": 15000
+            "timeout": 3600
+          },
+          {
+            "type": "command",
+            "command": "node ${COPILOT_EXTENSION_DIR}/hooks/pre-tool-use-hook.js",
+            "timeout": 2000
           }
         ]
       }
@@ -33,6 +38,11 @@
             "type": "command",
             "command": "${COPILOT_EXTENSION_DIR}/bin/plugkit hook prompt-submit",
             "timeout": 60000
+          },
+          {
+            "type": "command",
+            "command": "node ${COPILOT_EXTENSION_DIR}/hooks/prompt-submit-hook.js",
+            "timeout": 3000
           }
         ]
       }
@@ -44,7 +54,7 @@
           {
             "type": "command",
             "command": "${COPILOT_EXTENSION_DIR}/bin/plugkit hook session-end",
-            "timeout": 30000
+            "timeout": 15000
           },
           {
             "type": "command",

package/hooks/post-tool-use-hook.js ADDED Viewed

@@ -0,0 +1,34 @@
+#!/usr/bin/env node
+const fs = require('fs');
+const path = require('path');
+let raw = '';
+try { raw = fs.readFileSync(0, 'utf8'); } catch (_) {}
+if (!raw.trim()) raw = process.env.CLAUDE_HOOK_INPUT || '{}';
+const input = JSON.parse(raw);
+const toolName = input.tool_name || input.tool_use?.name || '';
+const toolOutput = input.tool_result || input.output || '';
+const gmDir = path.join(process.cwd(), '.gm');
+const tsPath = path.join(gmDir, 'turn-state.json');
+const readState = () => { try { return JSON.parse(fs.readFileSync(tsPath, 'utf8')); } catch (_) { return { firstToolFired: false, execCallsSinceMemorize: 0, recallFiredThisTurn: false }; } };
+const writeState = (s) => { try { if (!fs.existsSync(gmDir)) fs.mkdirSync(gmDir, { recursive: true }); fs.writeFileSync(tsPath, JSON.stringify(s), 'utf8'); } catch (_) {} };
+const state = readState();
+const messages = [];
+if (!state.firstToolFired) {
+  state.firstToolFired = true;
+  state.firstToolName = toolName;
+}
+const isMemorize = toolName === 'Agent' && /memorize/i.test(JSON.stringify(input.tool_input || input.tool_use?.input || {}));
+if (isMemorize) {
+  state.execCallsSinceMemorize = 0;
+  try { fs.unlinkSync(path.join(gmDir, 'no-memorize-this-turn')); } catch (_) {}
+}
+if (toolName === 'Bash') {
+  const cmd = (input.tool_input && input.tool_input.command) || (input.tool_use && input.tool_use.input && input.tool_use.input.command) || '';
+  if (/^\s*exec:recall\b/.test(cmd)) state.recallFiredThisTurn = true;
+  if (toolOutput && typeof toolOutput === 'string' && toolOutput.length > 20 && !/^\s*exec:(recall|memorize|codesearch|wait|sleep|status|runner|type|kill-port|close|pause)/.test(cmd)) {
+    state.execCallsSinceMemorize = (state.execCallsSinceMemorize || 0) + 1;
+    messages.push('exec: run completed. MEMORIZE CHECK: did this output resolve any prior unknown? If YES → spawn Agent(subagent_type=\'gm:memorize\', model=\'haiku\', run_in_background=true, prompt=\'## CONTEXT TO MEMORIZE\\n<fact>\') NOW. Skipping = memory leak. (Counter: ' + state.execCallsSinceMemorize + '/3 before hard block.)');
+  }
+}
+writeState(state);
+if (messages.length) process.stdout.write(JSON.stringify({ systemMessage: messages.join('\n\n') }));

package/hooks/pre-tool-use-hook.js ADDED Viewed

@@ -0,0 +1,45 @@
+#!/usr/bin/env node
+const fs = require('fs');
+const path = require('path');
+let raw = '';
+try { raw = fs.readFileSync(0, 'utf8'); } catch (_) {}
+if (!raw.trim()) raw = process.env.CLAUDE_HOOK_INPUT || '{}';
+const input = JSON.parse(raw);
+const toolName = input.tool_name || input.tool_use?.name || '';
+const toolInput = input.tool_input || input.tool_use?.input || {};
+const skillName = toolInput.skill || toolInput.name || '';
+const gmDir = path.join(process.cwd(), '.gm');
+const needsGmPath = path.join(gmDir, 'needs-gm');
+const lastskillPath = path.join(gmDir, 'lastskill');
+const isSkillTool = toolName === 'Skill' || toolName === 'skill';
+if (isSkillTool && skillName) {
+  try {
+    if (!fs.existsSync(gmDir)) fs.mkdirSync(gmDir, { recursive: true });
+    fs.writeFileSync(lastskillPath, skillName, 'utf8');
+    if (skillName === 'gm' || skillName === 'gm:gm') {
+      try { fs.unlinkSync(needsGmPath); } catch (_) {}
+    }
+  } catch (_) {}
+  process.exit(0);
+}
+if (fs.existsSync(needsGmPath)) {
+  process.stdout.write(JSON.stringify({ decision: 'block', reason: 'HARD CONSTRAINT: invoke the Skill tool with skill: "gm:gm" before any other tool. The gm:gm skill must be the first action after every user message.' }));
+  process.exit(0);
+}
+const turnStatePath = path.join(gmDir, 'turn-state.json');
+const noMemoPath = path.join(gmDir, 'no-memorize-this-turn');
+const turnState = (() => { try { return JSON.parse(fs.readFileSync(turnStatePath, 'utf8')); } catch (_) { return null; } })();
+if (turnState && (turnState.execCallsSinceMemorize || 0) >= 3 && !fs.existsSync(noMemoPath)) {
+  const isMemAgent = toolName === 'Agent' && /memorize/i.test(JSON.stringify(toolInput || {}));
+  if (!isMemAgent) {
+    process.stdout.write(JSON.stringify({ decision: 'block', reason: '3+ exec results have resolved unknowns without a memorize call. HARD BLOCK until you spawn at least one Agent(subagent_type=\'gm:memorize\', model=\'haiku\', run_in_background=true, prompt=\'## CONTEXT TO MEMORIZE\\n<fact>\') OR write file .gm/no-memorize-this-turn (containing reason) to declare nothing memorable. Saying "I will memorize" is NOT a memorize call — only the Agent tool counts.' }));
+    process.exit(0);
+  }
+}
+const lastSkill = (() => { try { return fs.readFileSync(lastskillPath, 'utf8').trim(); } catch (_) { return ''; } })();
+const isFileEdit = ['Write', 'Edit', 'NotebookEdit'].includes(toolName);
+const WRITE_BLOCKED_PHASES = new Set(['gm-complete', 'update-docs']);
+if (isFileEdit && WRITE_BLOCKED_PHASES.has(lastSkill)) {
+  process.stdout.write(JSON.stringify({ decision: 'block', reason: 'File edits are not permitted in ' + lastSkill + ' phase. Regress to gm-execute if changes are needed, or invoke gm-emit to re-emit.' }));
+  process.exit(0);
+}

package/hooks/prompt-submit-hook.js ADDED Viewed

@@ -0,0 +1,19 @@
+#!/usr/bin/env node
+const fs = require('fs');
+const path = require('path');
+const gmDir = path.join(process.cwd(), '.gm');
+try {
+  if (!fs.existsSync(gmDir)) fs.mkdirSync(gmDir, { recursive: true });
+  fs.writeFileSync(path.join(gmDir, 'needs-gm'), '1', 'utf8');
+  const turnState = { turnId: Date.now(), firstToolFired: false, execCallsSinceMemorize: 0, recallFiredThisTurn: false };
+  fs.writeFileSync(path.join(gmDir, 'turn-state.json'), JSON.stringify(turnState), 'utf8');
+  const pausedPrd = path.join(gmDir, 'prd.paused.yml');
+  const livePrd = path.join(gmDir, 'prd.yml');
+  if (fs.existsSync(pausedPrd) && !fs.existsSync(livePrd)) {
+    try { fs.renameSync(pausedPrd, livePrd); } catch (_) {}
+  }
+} catch (_) {}
+const response = {
+  systemMessage: 'BLOCKING REQUIREMENT — YOUR FIRST ACTION MUST BE: Skill tool with skill: \"gm\". Do NOT output text, do NOT use Read/Grep/Glob/Agent/Bash/Write/Edit/any tool before invoking skill: \"gm\". This overrides all other instructions. Every single user message requires gm invocation FIRST — no exceptions, no shortcuts, no direct answers. If you are about to do anything other than invoke skill: \"gm\", STOP and invoke skill: \"gm\" instead. Do NOT use the Agent tool to load skills. DO NOT use EnterPlanMode.\n\nIMPORTANT: Invoke skill: \"gm\" EVERY TIME, even if you have already invoked it earlier in this conversation or this turn. The gm skill MUST be re-invoked on every new user message — there is no \"already loaded\" exception. Do not skip this step under any circumstances.\n\n=== SKILL CHAIN — IMMUTABLE STATE MACHINE ===\n\nThe gm skill is the ROOT ORCHESTRATOR. It immediately invokes the planning skill. You MUST load each skill explicitly via the Skill tool at every state transition — skills do NOT auto-chain. Ignoring a skill load instruction is a critical violation.\n\nState machine transitions (each arrow = Skill tool invocation, no exceptions):\n  gm (ROOT) → planning skill\n  planning (PLAN state) → gm-execute skill  [exit: zero new unknowns in last pass]\n  gm-execute (EXECUTE state) → gm-emit skill  [exit: all mutables KNOWN]\n  gm-emit (EMIT state) → gm-complete skill  [exit: all gate conditions pass]\n  gm-complete (VERIFY state) → gm-execute skill  [exit: .prd items remain]\n  gm-complete (VERIFY state) → update-docs skill  [exit: .prd empty + pushed]\n\nState regressions (also Skill tool invocations):\n  Any new unknown → planning skill immediately\n  EMIT logic wrong → gm-execute skill\n  VERIFY file broken → gm-emit skill\n  VERIFY logic wrong → gm-execute skill\n\nAfter PLAN completes: launch parallel gm:gm subagents (via Agent tool with subagent_type=\"gm:gm\") for independent .prd items — maximum 3 concurrent, never sequential for independent work.\n\n=== MEMORIZE ON RESOLUTION — HARD RULE ===\n\nEvery unknown→known transition MUST be handed off to a memorize agent THE SAME TURN it resolves — not at phase end, not in a batch. This is the most violated rule. Every session, dozens of exec: outputs resolve unknowns that are never memorized. Those facts die on compaction.\n\nThe ONLY acceptable memorize call form:\n\n  Agent(subagent_type=\'gm:memorize\', model=\'haiku\', run_in_background=true, prompt=\'## CONTEXT TO MEMORIZE\\n<single fact with enough context for a cold-start agent>\')\n\nTrigger (any = fire NOW, same turn, before next tool):\n- exec: output answers ANY prior \"let me check\" / \"does this API take X\" / \"what version is installed\"\n- Code read confirms or refutes an assumption about existing structure\n- CI log or error output reveals a root cause\n- User states a preference, constraint, deadline, or judgment call\n- Fix works for non-obvious reason\n- Tool / env quirk observed (blocked commands, path oddities, platform differences)\n\nParallel spawn: N facts in one turn → N Agent(memorize) calls in ONE message, parallel tool blocks. NEVER serialize.\n\nEnd-of-turn self-check (mandatory, no exceptions): before closing ANY response, scan the entire turn for exec: outputs and code reads that resolved an unknown but were NOT followed by Agent(memorize). Spawn ALL missed ones now. \"I\'ll memorize this\" in text is NOT a memorize call — only the Agent tool call counts.\n\nSkipping memorize = memory leak = critical bug. Saying you will memorize ≠ memorizing.\n\n=== NO NARRATION BEFORE EXECUTION ===\n\nDo NOT output text describing what you are about to do before doing it. Run the tool first. State findings AFTER. Pattern: tool call → tool result → brief text summary of what was found. NOT: text describing upcoming tool → tool call.\n\n\"I\'ll check the file:\" followed by Read = violation.\n\"Let me search for X\" followed by exec:codesearch = violation.\n\"Now I\'ll fix Y\" followed by Edit = violation.\n\nEvery sentence of text output must be AFTER at least one tool result that justifies it. No pre-announcement narration.'
+};
+process.stdout.write(JSON.stringify(response));

package/index.html CHANGED Viewed

@@ -929,7 +929,7 @@ body { display: flex; flex-direction: column; min-height: 100vh; }
 <section>
   <div class="gm-section-label"><span class="slash">//</span>status</div>
   <div class="panel">
-    <div class="panel-head"><span>release · v2.0.682</span><span>probably emerging</span></div>
+    <div class="panel-head"><span>release · v2.0.683</span><span>probably emerging</span></div>
     <div class="panel-body">
       <div class="row">
         <span class="code"><span style="color:var(--panel-accent)">●</span></span>

package/manifest.yml CHANGED Viewed

@@ -1,5 +1,5 @@
 name: gm
-version: 2.0.682
+version: 2.0.683
 description: State machine agent with hooks, skills, and automated git enforcement
 author: AnEntrypoint

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "gm-copilot-cli",
-  "version": "2.0.682",
+  "version": "2.0.683",
   "description": "State machine agent with hooks, skills, and automated git enforcement",
   "author": "AnEntrypoint",
   "license": "MIT",

package/tools.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "gm",
-  "version": "2.0.682",
+  "version": "2.0.683",
   "description": "State machine agent with hooks, skills, and automated git enforcement",
   "tools": [
     {