npm - gm-oc - Versions diffs - 2.0.402 → 2.0.403 - Mend

gm-oc 2.0.402 → 2.0.403

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (6) hide show

package/package.json +1 -1
package/skills/gm/SKILL.md +47 -4
package/skills/gm-complete/SKILL.md +22 -3
package/skills/gm-emit/SKILL.md +11 -4
package/skills/gm-execute/SKILL.md +7 -3
package/skills/planning/SKILL.md +2 -0

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "gm-oc",
-  "version": "2.0.402",
+  "version": "2.0.403",
   "description": "State machine agent with hooks, skills, and automated git enforcement",
   "author": "AnEntrypoint",
   "license": "MIT",

package/skills/gm/SKILL.md CHANGED Viewed

@@ -126,6 +126,51 @@ Invoke `browser` skill. Escalation — exhaust each before advancing:
 Completing a phase is NOT stopping. After every phase: read .prd, check git, invoke next skill. Only when .prd is deleted AND git is clean AND all commits are pushed may you return a final response to the user.
+## MANDATORY DEV WORKFLOW — ABSOLUTE RULES
+These rules apply to ALL phases. Violations trigger immediate snake to planning.
+**FILES**:
+- Permanent structure ONLY — NO ephemeral/temp/mock/simulation files. Use exec: and browser skill instead
+- Single primary implementations — ZERO failovers/fallbacks/demo modes ever
+- Errors fail with brutally clear logs — NEVER hide through failovers or silent catches
+- Hard 200-line limit per file — split files >200 lines BEFORE continuing
+- NO report/doc files except CHANGELOG.md, CLAUDE.md, README.md, TODO.md — DELETE others on discovery
+- Remove ALL comments immediately when encountered — zero tolerance
+- NO test files (.test.js, .spec.js, __tests__/) — manual testing only via exec: and browser skill
+- Clean ALL files not required for the program to function
+**CODE QUALITY**:
+- ALWAYS scan codebase (exec:codesearch) before editing to find existing implementations — resolve duplicates immediately, NEVER duplicate existing functionality
+- DRY/clean/generalized/forward-thinking architecture — immediately solve architectural issues, CONTINUOUSLY reorganize to be maximally concise/simple without losing functionality
+- Every extra symbol = technical debt — enforce clean short concise functional code
+- ALWAYS write dynamic/modular code using ground truth — ZERO hardcoded values
+- NO adjectives/descriptive language in code (variable/function names must be terse and functional)
+- No mocks/simulations/fallbacks/hardcoded/fake elements — delete on discovery
+- Set client-side debugging globals to make ALL client-side data accessible via simple REPL (window.__debug or similar)
+**ERROR HANDLING**:
+- Every error must throw and propagate with clear context
+- No `|| defaultValue`, no `catch { return null }`, no graceful degradation
+- The only acceptable error handling: catch → log the real error → re-throw or display to user
+**DEBUGGING**:
+- ALWAYS hypothesize/troubleshoot via execution BEFORE editing any files
+- Check git history (`git log`, `git diff`) for troubleshooting known regressions — never revert, use differential comparisons and edit new code manually
+- Keep execution logs concise (<4k chars ideal, 30k max)
+- Clear cache before playwright/browser debugging
+- Test locally when possible over live
+**DOCUMENTATION**:
+- CLAUDE.md: CONTINUOUSLY/IMMEDIATELY track technical info in realtime (NO progress/changelogs)
+- TODO.md: CONTINUOUSLY track persistent todos — MUST completely clear/empty/delete before stopping
+- CHANGELOG.md: CONTINUOUSLY append concise change summaries
+- Deploy if deployable, publish to npm if package is on npm
+**PROCESS**:
+- Only persistent background shells for long-running CLI processes
+- Manual testing ONLY — NO test files ever
 ## CONSTRAINTS
 **Tier 0**: no_crash, no_exit, ground_truth_only, real_execution, fail_loud
@@ -133,8 +178,6 @@ Completing a phase is NOT stopping. After every phase: read .prd, check git, inv
 **Tier 2**: no_duplication, no_hardcoded_values, modularity
 **Tier 3**: no_comments, convention_over_code
-**FAIL LOUD — NO FALLBACKS**: Never create fallback modes, demo modes, graceful degradation, silent error swallowing, or try/catch that returns a default value instead of propagating the error. Every error must throw and propagate. If something fails, the user must see exactly what failed and why. No `|| defaultValue`, no `catch { return null }`, no "demo mode", no "offline mode", no degraded functionality. The only acceptable error handling is: catch → log the real error → re-throw or display to user.
-**Never**: `Bash(node/npm/npx/bun)` | skip planning | sequential independent items | screenshot before JS exhausted | narrate past unresolved mutables | stop while .prd has items | ask the user what to do next while work remains | create fallback/demo modes | silently swallow errors
+**Never**: `Bash(node/npm/npx/bun)` | skip planning | sequential independent items | screenshot before JS exhausted | narrate past unresolved mutables | stop while .prd has items | ask the user what to do next while work remains | create fallback/demo modes | silently swallow errors | duplicate existing code | leave comments | create test files
-**Always**: invoke named skill at every transition | snake to planning on any new unknown | witnessed execution only | keep going until .prd deleted and git clean
+**Always**: invoke named skill at every transition | snake to planning on any new unknown | witnessed execution only | scan codebase before edits | keep going until .prd deleted and git clean

package/skills/gm-complete/SKILL.md CHANGED Viewed

@@ -118,9 +118,28 @@ gh run list --repo AnEntrypoint/<downstream-repo> --limit 3 --json databaseId,na
 ```
 Monitor any cascade runs the same way — poll, diagnose failures, fix if the cause is in this repo.
+## CODEBASE HYGIENE SWEEP
+Before declaring complete, sweep the entire codebase for violations:
+1. **Files >200 lines** → split immediately
+2. **Comments in code** → remove all
+3. **Test files** (.test.js, .spec.js, __tests__/) → delete
+4. **Mock/stub/simulation files** → delete
+5. **Unnecessary doc files** (not CHANGELOG/CLAUDE/README/TODO.md) → delete
+6. **Duplicate code** → consolidate
+7. **Hardcoded values** → extract to config/constants
+8. **Fallback/demo modes** → remove, fail loud instead
+9. **TODO.md** → must be empty/deleted before completion
+10. **CHANGELOG.md** → must have entries for this session's changes
+11. **CLAUDE.md** → must reflect current technical state
+12. **Deploy/publish** → if deployable, deploy. If npm package, publish.
+Any violation found = fix immediately before advancing.
 ## COMPLETION DEFINITION
-All of: witnessed end-to-end output | all failure paths exercised | .prd empty | git clean and pushed | all CI runs green | `user_steps_remaining=0`
+All of: witnessed end-to-end output | all failure paths exercised | .prd empty | git clean and pushed | all CI runs green | codebase hygiene sweep clean | TODO.md empty/deleted | CHANGELOG.md updated | `user_steps_remaining=0`
 ## DO NOT STOP
@@ -128,9 +147,9 @@ After end-to-end verification passes: read .prd from disk. If any items remain,
 ## CONSTRAINTS
-**Never**: claim done without witnessed output | uncommitted changes | unpushed commits | failed CI runs | .prd items remaining | stop at first green | absorb surprises silently | respond to user while .prd has items
+**Never**: claim done without witnessed output | uncommitted changes | unpushed commits | failed CI runs | .prd items remaining | TODO.md with items remaining | stop at first green | absorb surprises silently | respond to user while .prd has items | skip hygiene sweep | leave comments/mocks/test files/fallbacks
-**Always**: triage failure before snaking | witness end-to-end | snake to planning on any new unknown | enumerate remaining after every success | check .prd after every verification pass
+**Always**: triage failure before snaking | witness end-to-end | snake to planning on any new unknown | enumerate remaining after every success | check .prd after every verification pass | run hygiene sweep before declaring complete | deploy/publish if applicable | update CHANGELOG.md
 ---

package/skills/gm-emit/SKILL.md CHANGED Viewed

@@ -71,11 +71,18 @@ Pre-emit revealing unexpected behavior → new unknown → snake to `planning`.
 - Pre-emit debug passed with real inputs and error inputs
 - Post-emit verification matches pre-emit exactly
 - Hot reloadable: state outside reloadable modules, handlers swap atomically
-- Crash-proof: catch at every boundary, recovery hierarchy
-- No mocks/fakes/stubs anywhere
-- Files ≤200 lines, no duplicate code, no comments, no hardcoded values
-- No fallbacks, demo modes, or silent error swallowing — every error must throw and propagate
+- Errors throw with clear context — no fallbacks, demo modes, silent swallowing, `|| default`, `catch { return null }`
+- No mocks/fakes/stubs/simulations/test files anywhere — delete on discovery
+- Files ≤200 lines — split immediately if over, do not advance
+- No duplicate code — scan codebase for existing implementations before writing
+- No comments — remove any found
+- No hardcoded values — dynamic/modular code using ground truth only
+- No adjectives/descriptive language in variable/function names
+- No unnecessary files — clean anything not required for the program to function
+- Client-side code exposes debug globals (window.__debug or similar)
 - CLAUDE.md reflects actual behavior
+- CHANGELOG.md updated with changes
+- TODO.md cleared or deleted
 ## CODEBASE EXPLORATION

package/skills/gm-execute/SKILL.md CHANGED Viewed

@@ -114,7 +114,11 @@ Invoke `browser` skill. Escalation — exhaust each before advancing:
 ## GROUND TRUTH
-Real services, real data, real timing. Mocks/fakes/stubs = delete immediately. No .test.js/.spec.js. Delete on discovery.
+Real services, real data, real timing. Mocks/fakes/stubs/simulations = delete immediately. No .test.js/.spec.js. Delete on discovery. No fallback/demo modes — errors must fail loud with clear logs.
+**SCAN BEFORE EDIT**: Before modifying or creating any file, search the codebase (exec:codesearch) for existing implementations of the same functionality. Resolve duplicates immediately — NEVER duplicate existing code.
+**HYPOTHESIZE VIA EXECUTION**: Always troubleshoot and validate hypotheses through witnessed execution BEFORE editing files. Never edit based on assumptions — run the code first, observe the actual behavior, then edit with ground truth.
 ## DO NOT STOP
@@ -122,9 +126,9 @@ Never respond to the user from this phase. When all mutables are KNOWN, immediat
 ## CONSTRAINTS
-**Never**: `Bash(node/npm/npx/bun)` | fake data | mock files | Glob/Grep/Read/Explore (hook-blocked — use exec:codesearch) | sequential independent items | absorb surprises silently | respond to user or pause for input
+**Never**: `Bash(node/npm/npx/bun)` | fake data | mock files | test files | fallback/demo modes | Glob/Grep/Read/Explore (hook-blocked — use exec:codesearch) | sequential independent items | absorb surprises silently | respond to user or pause for input | edit files before executing to understand current behavior | duplicate existing code
-**Always**: witness every hypothesis | import real modules | snake to planning on any new unknown | fix immediately on discovery | invoke next skill immediately when done
+**Always**: witness every hypothesis | import real modules | scan codebase before creating/editing files | snake to planning on any new unknown | fix immediately on discovery | delete mocks/stubs/comments/test files on discovery | invoke next skill immediately when done
 ---

package/skills/planning/SKILL.md CHANGED Viewed

@@ -39,6 +39,8 @@ Planning = exhaustive mutable discovery. For every aspect of the task ask:
 Categories of unknowns to enumerate: file existence | API shape | data format | dependency versions | runtime behavior | environment differences | error conditions | concurrency | integration points | backwards compatibility | rollback paths | deployment steps | verification criteria
+**MANDATORY CODEBASE SCAN**: Before planning any new implementation, search (exec:codesearch) for existing code that already does what you need. Add `existingImpl=UNKNOWN` mutable for every new feature — resolve by searching. Duplicating existing functionality is a Tier 2 violation.
 ## .PRD FORMAT
 Path: exactly `./.prd` in current working directory. **JSON array** written via `exec:nodejs`.