npm - gm-cc - Versions diffs - 2.0.496 → 2.0.497 - Mend

gm-cc 2.0.496 → 2.0.497

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (10) hide show

package/.claude-plugin/marketplace.json +1 -1
package/package.json +1 -1
package/plugin.json +1 -1
package/scripts/postinstall-kilo.js +1 -1
package/scripts/postinstall-oc.js +1 -1
package/scripts/postinstall.js +1 -1
package/skills/gm-complete/SKILL.md +22 -8
package/skills/gm-emit/SKILL.md +2 -2
package/skills/gm-execute/SKILL.md +3 -3
package/skills/planning/SKILL.md +19 -5

package/.claude-plugin/marketplace.json CHANGED Viewed

@@ -4,7 +4,7 @@
     "name": "AnEntrypoint"
   },
   "description": "State machine agent with hooks, skills, and automated git enforcement",
-  "version": "2.0.496",
+  "version": "2.0.497",
   "metadata": {
     "description": "State machine agent with hooks, skills, and automated git enforcement"
   },

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "gm-cc",
-  "version": "2.0.496",
+  "version": "2.0.497",
   "description": "State machine agent with hooks, skills, and automated git enforcement",
   "author": "AnEntrypoint",
   "license": "MIT",

package/plugin.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "gm",
-  "version": "2.0.496",
+  "version": "2.0.497",
   "description": "State machine agent with hooks, skills, and automated git enforcement",
   "author": {
     "name": "AnEntrypoint",

package/scripts/postinstall-kilo.js CHANGED Viewed

@@ -44,7 +44,7 @@ function safeCopyDirectory(src, dst) {
 function updateGitignore(projectRoot) {
   try {
     const gitignorePath = path.join(projectRoot, '.gitignore');
-    const entry = '.gm-stop-verified';
+    const entry = '.gm/';
     let content = fs.existsSync(gitignorePath) ? fs.readFileSync(gitignorePath, 'utf-8') : '';
     if (content.includes(entry)) return;
     if (content && !content.endsWith('\n')) content += '\n';

package/scripts/postinstall-oc.js CHANGED Viewed

@@ -46,7 +46,7 @@ function safeCopyDirectory(src, dst) {
 function updateGitignore(projectRoot) {
   try {
     const gitignorePath = path.join(projectRoot, '.gitignore');
-    const entry = '.gm-stop-verified';
+    const entry = '.gm/';
     let content = fs.existsSync(gitignorePath) ? fs.readFileSync(gitignorePath, 'utf-8') : '';
     if (content.includes(entry)) return;
     if (content && !content.endsWith('\n')) content += '\n';

package/scripts/postinstall.js CHANGED Viewed

@@ -46,7 +46,7 @@ function safeCopyDirectory(src, dst) {
 function updateGitignore(projectRoot) {
   try {
     const gitignorePath = path.join(projectRoot, '.gitignore');
-    const entry = '.gm-stop-verified';
+    const entry = '.gm/';
     let content = fs.existsSync(gitignorePath) ? fs.readFileSync(gitignorePath, 'utf-8') : '';
     if (content.includes(entry)) return;
     if (content && !content.endsWith('\n')) content += '\n';

package/skills/gm-complete/SKILL.md CHANGED Viewed

@@ -12,9 +12,9 @@ You are in the **VERIFY → COMPLETE** phase. Files are written. Prove the whole
 ## TRANSITIONS
-**EXIT — .prd items remain**: Verified items completed, .prd still has pending items → invoke `gm-execute` skill immediately (next wave). Do not stop.
+**EXIT — .gm/prd.yml items remain**: Verified items completed, .gm/prd.yml still has pending items → invoke `gm-execute` skill immediately (next wave). Do not stop.
-**EXIT — COMPLETE**: .prd empty + all work pushed + CI green → invoke `update-docs` skill.
+**EXIT — COMPLETE**: .gm/prd.yml empty + test.js passes + all work pushed + CI green → invoke `update-docs` skill.
 **STATE REGRESSIONS**:
 - Verification reveals broken file output → invoke `gm-emit` skill, reset to EMIT state, re-verify on return
@@ -32,7 +32,7 @@ You are in the **VERIFY → COMPLETE** phase. Files are written. Prove the whole
 - `git_clean=UNKNOWN` until `exec:bash\ngit status --porcelain` returns empty
 - `git_pushed=UNKNOWN` until `git log origin/main..HEAD --oneline` returns empty
 - `ci_passed=UNKNOWN` until all GitHub Actions runs triggered by the push reach `conclusion: success`
-- `prd_empty=UNKNOWN` until .prd file is deleted (not just empty — file must not exist)
+- `prd_empty=UNKNOWN` until `.gm/prd.yml` is deleted (not just empty — file must not exist)
 All five must resolve to KNOWN before COMPLETE. Any UNKNOWN = absolute barrier.
@@ -57,6 +57,20 @@ console.log(await fn(realInput));
 For browser/UI: invoke `browser` skill with real workflows. Server + client features require both exec:nodejs AND browser diagnostics. After every success: enumerate what remains — never stop at first green. First green is not COMPLETE.
+## INTEGRATION TEST GATE
+Before git enforcement, run the project's `test.js` if it exists:
+```
+exec:nodejs
+const { execSync } = require('child_process');
+try { execSync('node test.js', { stdio: 'inherit', timeout: 30000 }); console.log('test.js: PASS'); } catch (e) { console.error('test.js: FAIL'); process.exit(1); }
+```
+Failure = regression to `gm-execute`. Do not proceed to git enforcement with failing tests.
+If `test.js` does not exist and the project has testable surface, regress to `gm-execute` to create it.
 ## CODE EXECUTION
 **exec:<lang> is the only way to run code.** Bash tool body: `exec:<lang>\n<code>`
@@ -131,7 +145,7 @@ Before declaring complete, sweep the entire codebase for violations:
 1. **Files >200 lines** → split immediately
 2. **Comments in code** → remove all
-3. **Test files** (.test.js, .spec.js, __tests__/) → delete
+3. **Scattered test files** (.test.js, .spec.js, __tests__/, fixtures/, mocks/) → delete, consolidate coverage into root `test.js`
 4. **Mock/stub/simulation files** → delete
 5. **Unnecessary doc files** (not CHANGELOG/CLAUDE/README/TODO.md) → delete
 6. **Duplicate concern** (overlapping responsibility, similar logic, parallel implementations, consolidatable code) → snake to `planning` with restructuring instructions — do not patch locally
@@ -147,17 +161,17 @@ Any violation found = fix immediately before advancing.
 ## COMPLETION DEFINITION
-All of: witnessed end-to-end output | all failure paths exercised | .prd empty | git clean and pushed | all CI runs green | codebase hygiene sweep clean | TODO.md empty/deleted | CHANGELOG.md updated | `user_steps_remaining=0`
+All of: witnessed end-to-end output | all failure paths exercised | test.js passes | .gm/prd.yml empty | git clean and pushed | all CI runs green | codebase hygiene sweep clean | TODO.md empty/deleted | CHANGELOG.md updated | `user_steps_remaining=0`
 ## DO NOT STOP
-After end-to-end verification passes: read .prd from disk. If any items remain, immediately invoke `gm-execute` skill — do not respond to the user. Only respond when .prd is deleted AND git is clean AND all commits are pushed.
+After end-to-end verification passes: read `.gm/prd.yml` from disk. If any items remain, immediately invoke `gm-execute` skill — do not respond to the user. Only respond when `.gm/prd.yml` is deleted AND git is clean AND all commits are pushed.
 ## CONSTRAINTS
-**Never**: claim done without witnessed output | uncommitted changes | unpushed commits | failed CI runs | .prd items remaining | TODO.md with items remaining | stop at first green | absorb surprises silently | respond to user while .prd has items | skip hygiene sweep | leave comments/mocks/test files/fallbacks
+**Never**: claim done without witnessed output | uncommitted changes | unpushed commits | failed CI runs | .gm/prd.yml items remaining | TODO.md with items remaining | stop at first green | absorb surprises silently | respond to user while .gm/prd.yml has items | skip hygiene sweep | leave comments/mocks/scattered test files/fallbacks | skip test.js execution
-**Always**: triage failure before regressing | witness end-to-end | regress to planning on any new unknown | enumerate remaining after every success | check .prd after every verification pass | run hygiene sweep before declaring complete | deploy/publish if applicable | update CHANGELOG.md
+**Always**: triage failure before regressing | witness end-to-end | run test.js before git enforcement | regress to planning on any new unknown | enumerate remaining after every success | check .gm/prd.yml after every verification pass | run hygiene sweep before declaring complete | deploy/publish if applicable | update CHANGELOG.md
 ---

package/skills/gm-emit/SKILL.md CHANGED Viewed

@@ -8,7 +8,7 @@ description: EMIT phase. Pre-emit debug, write files, post-emit verify from disk
 You are in the **EMIT** phase. Every mutable is KNOWN. Prove the write is correct, write, confirm from disk. Any new unknown = snake to `planning`, restart chain.
 **GRAPH POSITION**: `PLAN → EXECUTE → [EMIT] → VERIFY → COMPLETE`
-- **Entry**: All .prd mutables resolved. Entered from `gm-execute` or via snake from VERIFY.
+- **Entry**: All .gm/prd.yml mutables resolved. Entered from `gm-execute` or via snake from VERIFY.
 ## TRANSITIONS
@@ -78,7 +78,7 @@ The post-emit verification is a differential diagnosis against the pre-emit base
 - Post-emit verification matches pre-emit exactly
 - Hot reloadable: state outside reloadable modules, handlers swap atomically
 - Errors throw with clear context — no fallbacks, demo modes, silent swallowing, `|| default`, `catch { return null }`
-- No mocks/fakes/stubs/simulations/test files anywhere — delete on discovery
+- No mocks/fakes/stubs/simulations/scattered test files anywhere — delete on discovery (only root test.js permitted)
 - Files ≤200 lines — split immediately if over, do not advance
 - No duplicate concern — after writing, run exec:codesearch for the primary concern. If ANY other code serves the same concern → do NOT advance, snake to `planning` with consolidation instructions
 - No comments — remove any found

package/skills/gm-execute/SKILL.md CHANGED Viewed

@@ -117,7 +117,7 @@ Invoke `browser` skill. Exhaust each level before advancing to next:
 ## GROUND TRUTH ENFORCEMENT
-Real services, real data, real timing. Mocks/fakes/stubs/simulations = diagnostic noise = delete immediately. No .test.js/.spec.js. Delete on discovery. No fallback/demo modes — errors must surface with full diagnostic context and fail loud.
+Real services, real data, real timing. Mocks/fakes/stubs/simulations = diagnostic noise = delete immediately. No scattered test files (.test.js, .spec.js, __tests__/) — delete on discovery. All test coverage belongs in the single root `test.js`. If `test.js` does not exist, create it. Every behavior change updates `test.js`. Every bug fix adds a regression case. No fallback/demo modes — errors must surface with full diagnostic context and fail loud.
 **SCAN BEFORE EDIT**: Before modifying or creating any file, search the codebase (exec:codesearch) for existing implementations of the same concern. "Duplicate" means overlapping responsibility, similar logic, or parallel implementations — not just identical files. If consolidation is possible, regress to `planning` with restructuring instructions instead of continuing.
@@ -139,9 +139,9 @@ Never respond to the user from this phase. When all mutables are KNOWN, immediat
 ## CONSTRAINTS
-**Never**: `Bash(node/npm/npx/bun)` | fake data | mock files | test files | fallback/demo modes | Glob/Grep/Read/Explore (hook-blocked — use exec:codesearch) | sequential independent items | absorb surprises silently | respond to user or pause for input | edit files before executing to understand current behavior | duplicate existing code | write explicit if/else chains when a dispatch table or native method suffices | write packed one-liners that obscure structure | reinvent what a library or native API already provides
+**Never**: `Bash(node/npm/npx/bun)` | fake data | mock files | scattered test files (only root test.js) | fallback/demo modes | Glob/Grep/Read/Explore (hook-blocked — use exec:codesearch) | sequential independent items | absorb surprises silently | respond to user or pause for input | edit files before executing to understand current behavior | duplicate existing code | write explicit if/else chains when a dispatch table or native method suffices | write packed one-liners that obscure structure | reinvent what a library or native API already provides
-**Always**: witness every hypothesis | import real modules | scan codebase before creating/editing files | regress to planning on any new unknown | fix immediately on discovery | delete mocks/stubs/comments/test files on discovery | invoke next skill immediately when done | ask "what native feature solves this?" before writing any new logic | prefer structures where wrong states are unrepresentable
+**Always**: witness every hypothesis | import real modules | scan codebase before creating/editing files | regress to planning on any new unknown | fix immediately on discovery | delete mocks/stubs/comments/scattered test files on discovery | consolidate test coverage into root test.js | add regression case to test.js for every bug fix | invoke next skill immediately when done | ask "what native feature solves this?" before writing any new logic | prefer structures where wrong states are unrepresentable
 ---

package/skills/planning/SKILL.md CHANGED Viewed

@@ -16,12 +16,12 @@ Root of all work. Runs `PLAN → EXECUTE → EMIT → VERIFY → UPDATE-DOCS →
 **REGRESSIONS**: new unknown at any state → re-invoke `planning` | EXECUTE unresolvable 2 passes → `planning` | EMIT logic error → `gm-execute` | EMIT new unknown → `planning` | VERIFY broken output → `gm-emit` | VERIFY logic wrong → `gm-execute` | VERIFY new unknown → `planning`
-**Runs until**: .prd empty AND git clean AND all pushes confirmed AND CI green.
+**Runs until**: .gm/prd.yml empty AND git clean AND all pushes confirmed AND CI green.
 ## ENFORCEMENT — COMPLETE EVERY TASK END-TO-END
 **Cannot respond or stop while**:
-- .prd file exists and has items
+- .gm/prd.yml exists and has items
 - git has uncommitted changes
 - git has unpushed commits
@@ -59,7 +59,7 @@ During every planning pass, enumerate every possible aspect of the app's runtime
 ## .PRD FORMAT
-Path: `./.prd`. YAML via `exec:nodejs` (use `fs.writeFileSync`). Delete when empty — never leave empty file.
+Path: `./.gm/prd.yml`. YAML via `exec:nodejs` (use `fs.writeFileSync`). Ensure `.gm/` dir exists before writing. Delete when empty — never leave empty file. Delete `.gm/` dir when completely empty.
 ```yaml
 - id: kebab-id
@@ -120,7 +120,7 @@ Invoke `browser` skill. Escalation: (1) `exec:browser <js>` → (2) browser skil
 ## MANDATORY DEV WORKFLOW
-No comments. No test files. 200-line limit — split before continuing. Fail loud. No duplication. Scan before every edit. Duplicate concern = regress to PLAN. Errors throw with context — no `|| default`, no `catch { return null }`. `window.__debug` exposes all client state. AGENTS.md via memorize only. CHANGELOG.md: append per commit.
+No comments. No scattered test files. 200-line limit — split before continuing. Fail loud. No duplication. Scan before every edit. Duplicate concern = regress to PLAN. Errors throw with context — no `|| default`, no `catch { return null }`. `window.__debug` exposes all client state. AGENTS.md via memorize only. CHANGELOG.md: append per commit.
 **Minimal code / maximal DX process**: Before writing any logic, run this process in order — stop at the first step that resolves the need:
 1. **Native first** — does the language or runtime already do this? Use it exactly as designed.
@@ -130,6 +130,20 @@ No comments. No test files. 200-line limit — split before continuing. Fail lou
 When structure eliminates a whole class of wrong states — name that pattern explicitly. Dispatch tables replacing switch chains, pipelines replacing loop-with-accumulator, maps replacing if/else forests — these are not just style preferences, they are correctness properties. Code that cannot be wrong because of how it is shaped is the goal. Readable top-to-bottom without mental simulation = done right. Requires decoding = not done.
+## SINGLE INTEGRATION TEST POLICY
+Every project maintains exactly one `test.js` at project root. 200-line max. No other test files anywhere — no `.test.js`, `.spec.js`, `__tests__/`, `fixtures/`, `mocks/`. Delete all scattered tests on discovery and consolidate coverage into `test.js`.
+**test.js replaces all unit tests.** It tests the real system end-to-end with real data. No mocks, no stubs, no test frameworks. Plain node assertions or process exit codes.
+**Creation**: if `test.js` does not exist, create it during EXECUTE phase covering all testable surface of current work.
+**Maintenance**: every code change that adds or modifies behavior must update `test.js` to cover it. Every bug fix must add a regression case that would have caught the bug.
+**Structure**: group by subsystem, each subsystem gets a section. When approaching 200 lines, compress older stable tests into tighter assertions to make room for new coverage.
+**Execution**: `gm-complete` runs `test.js` before allowing completion. Failure = regression to EXECUTE.
 ## RESPONSE POLICY
 Terse like smart caveman. Technical substance stays. Fluff dies. Default: **full**. Switch: `/caveman lite|full|ultra`.
@@ -147,6 +161,6 @@ Auto-Clarity: drop caveman for security warnings, irreversible confirmations, am
 **Tier 2**: no_duplication, no_hardcoded_values, modularity
 **Tier 3**: no_comments, convention_over_code
-**Never**: `Bash(node/npm/npx/bun)` | skip planning | partial execution | stop while .prd has items | stop while git dirty | sequential independent items | screenshot before JS exhausted | fallback/demo modes | silently swallow errors | duplicate concern | leave comments | create test files | write if/else chains where a map or pipeline suffices | write one-liners that require decoding | branch on enumerated cases when a dispatch table exists
+**Never**: `Bash(node/npm/npx/bun)` | skip planning | partial execution | stop while .gm/prd.yml has items | stop while git dirty | sequential independent items | screenshot before JS exhausted | fallback/demo modes | silently swallow errors | duplicate concern | leave comments | create scattered test files (only root test.js) | write if/else chains where a map or pipeline suffices | write one-liners that require decoding | branch on enumerated cases when a dispatch table exists
 **Always**: invoke named skill at every state transition | regress to planning on any new unknown | witnessed execution only | scan codebase before edits | enumerate every possible observability improvement every planning pass | follow skill chain completely end-to-end on every task without exception | prefer dispatch tables over switch/if chains | prefer pipelines over loop-with-accumulator | make wrong states structurally impossible | name patterns when structure eliminates a whole class of bugs