theslopmachine 0.9.12 → 1.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (102) hide show
  1. package/MANUAL.md +1 -0
  2. package/README.md +2 -2
  3. package/assets/agents/developer.md +20 -14
  4. package/assets/agents/slopmachine-claude.md +112 -77
  5. package/assets/agents/slopmachine.md +97 -74
  6. package/assets/claude/agents/developer.md +11 -6
  7. package/assets/skills/beads-operations/SKILL.md +9 -0
  8. package/assets/skills/clarification-gate/SKILL.md +48 -14
  9. package/assets/skills/claude-worker-management/SKILL.md +128 -80
  10. package/assets/skills/developer-session-lifecycle/SKILL.md +13 -9
  11. package/assets/skills/development-guidance/SKILL.md +29 -12
  12. package/assets/skills/evaluation-triage/SKILL.md +28 -11
  13. package/assets/skills/final-evaluation-orchestration/SKILL.md +83 -32
  14. package/assets/skills/integrated-verification/SKILL.md +61 -49
  15. package/assets/skills/planning-gate/SKILL.md +66 -11
  16. package/assets/skills/planning-guidance/SKILL.md +51 -17
  17. package/assets/skills/report-output-discipline/SKILL.md +4 -0
  18. package/assets/skills/retrospective-analysis/SKILL.md +28 -1
  19. package/assets/skills/scaffold-guidance/SKILL.md +36 -26
  20. package/assets/skills/submission-packaging/SKILL.md +53 -28
  21. package/assets/skills/verification-gates/SKILL.md +61 -40
  22. package/assets/slopmachine/clarification-faithfulness-review-prompt.md +67 -0
  23. package/assets/slopmachine/clarifier-agent-prompt.md +108 -8
  24. package/assets/slopmachine/exact-readme-template.md +11 -1
  25. package/assets/slopmachine/owner-verification-checklist.md +38 -8
  26. package/assets/slopmachine/phase-1-design-prompt.md +10 -2
  27. package/assets/slopmachine/phase-1-design-template.md +27 -5
  28. package/assets/slopmachine/phase-2-execution-planning-prompt.md +86 -44
  29. package/assets/slopmachine/phase-2-plan-template.md +161 -27
  30. package/assets/slopmachine/scaffold-playbooks/selection-matrix.md +106 -67
  31. package/assets/slopmachine/scaffold-playbooks/shared-contract.md +207 -0
  32. package/assets/slopmachine/scaffold-playbooks/stack-android-room-offline.md +30 -0
  33. package/assets/slopmachine/scaffold-playbooks/stack-browser-only-offline-spa.md +29 -0
  34. package/assets/slopmachine/scaffold-playbooks/stack-generic.md +25 -0
  35. package/assets/slopmachine/scaffold-playbooks/stack-go-gin-templ-postgres.md +70 -0
  36. package/assets/slopmachine/scaffold-playbooks/stack-react-go-postgres.md +33 -0
  37. package/assets/slopmachine/scaffold-playbooks/stack-rust-fullstack-workspace.md +31 -0
  38. package/assets/slopmachine/scaffold-playbooks/stack-vue-koa-mysql.md +77 -0
  39. package/assets/slopmachine/scaffold-playbooks/stack-vue-laravel-mysql.md +32 -0
  40. package/assets/slopmachine/scaffold-playbooks/stack-winforms-localdb.md +30 -0
  41. package/assets/slopmachine/scaffold-playbooks/tech-backend-gin-templ.md +22 -0
  42. package/assets/slopmachine/scaffold-playbooks/tech-backend-go.md +23 -0
  43. package/assets/slopmachine/scaffold-playbooks/tech-backend-koa.md +22 -0
  44. package/assets/slopmachine/scaffold-playbooks/tech-backend-laravel.md +33 -0
  45. package/assets/slopmachine/scaffold-playbooks/tech-db-localdb.md +20 -0
  46. package/assets/slopmachine/scaffold-playbooks/tech-db-mysql.md +22 -0
  47. package/assets/slopmachine/scaffold-playbooks/tech-db-postgres.md +22 -0
  48. package/assets/slopmachine/scaffold-playbooks/tech-db-room.md +20 -0
  49. package/assets/slopmachine/scaffold-playbooks/tech-frontend-react.md +22 -0
  50. package/assets/slopmachine/scaffold-playbooks/tech-frontend-vue.md +23 -0
  51. package/assets/slopmachine/scaffold-playbooks/tech-rust-workspace.md +21 -0
  52. package/assets/slopmachine/scaffold-playbooks/type-api-service.md +52 -0
  53. package/assets/slopmachine/scaffold-playbooks/type-background-jobs.md +45 -0
  54. package/assets/slopmachine/scaffold-playbooks/type-database.md +81 -0
  55. package/assets/slopmachine/scaffold-playbooks/type-desktop.md +30 -0
  56. package/assets/slopmachine/scaffold-playbooks/type-mobile-android.md +32 -0
  57. package/assets/slopmachine/scaffold-playbooks/type-offline-local-first.md +31 -0
  58. package/assets/slopmachine/scaffold-playbooks/type-web-spa.md +63 -0
  59. package/assets/slopmachine/templates/AGENTS.md +24 -15
  60. package/assets/slopmachine/templates/CLAUDE.md +24 -15
  61. package/assets/slopmachine/templates/plan.md +152 -28
  62. package/assets/slopmachine/utils/analyze_claude_project_dir.mjs +197 -0
  63. package/assets/slopmachine/utils/claude_live_common.mjs +22 -6
  64. package/assets/slopmachine/utils/claude_live_launch.mjs +161 -11
  65. package/assets/slopmachine/utils/claude_live_status.mjs +53 -2
  66. package/assets/slopmachine/utils/claude_live_stop.mjs +1 -1
  67. package/assets/slopmachine/utils/claude_live_turn.mjs +23 -14
  68. package/assets/slopmachine/utils/claude_worker_common.mjs +6 -2
  69. package/assets/slopmachine/utils/package_claude_session.mjs +28 -101
  70. package/assets/slopmachine/utils/prepare_evaluation_send_packet.mjs +69 -0
  71. package/package.json +1 -1
  72. package/src/constants.js +31 -28
  73. package/src/init.js +3 -2
  74. package/src/install.js +28 -0
  75. package/assets/slopmachine/scaffold-playbooks/android-kotlin-compose.md +0 -73
  76. package/assets/slopmachine/scaffold-playbooks/android-kotlin-views.md +0 -138
  77. package/assets/slopmachine/scaffold-playbooks/android-native-java.md +0 -203
  78. package/assets/slopmachine/scaffold-playbooks/angular-default.md +0 -129
  79. package/assets/slopmachine/scaffold-playbooks/backend-baseline.md +0 -126
  80. package/assets/slopmachine/scaffold-playbooks/backend-family-matrix.md +0 -80
  81. package/assets/slopmachine/scaffold-playbooks/database-module-matrix.md +0 -80
  82. package/assets/slopmachine/scaffold-playbooks/django-default.md +0 -109
  83. package/assets/slopmachine/scaffold-playbooks/docker-baseline.md +0 -146
  84. package/assets/slopmachine/scaffold-playbooks/docker-shared-contract.md +0 -338
  85. package/assets/slopmachine/scaffold-playbooks/electron-vite-default.md +0 -124
  86. package/assets/slopmachine/scaffold-playbooks/expo-react-native-default.md +0 -73
  87. package/assets/slopmachine/scaffold-playbooks/fastapi-default.md +0 -97
  88. package/assets/slopmachine/scaffold-playbooks/frontend-baseline.md +0 -138
  89. package/assets/slopmachine/scaffold-playbooks/frontend-family-matrix.md +0 -134
  90. package/assets/slopmachine/scaffold-playbooks/generic-unknown-tech-guide.md +0 -136
  91. package/assets/slopmachine/scaffold-playbooks/go-chi-default.md +0 -103
  92. package/assets/slopmachine/scaffold-playbooks/ios-linux-portable.md +0 -93
  93. package/assets/slopmachine/scaffold-playbooks/ios-native-objective-c.md +0 -151
  94. package/assets/slopmachine/scaffold-playbooks/ios-native-swift.md +0 -188
  95. package/assets/slopmachine/scaffold-playbooks/laravel-default.md +0 -143
  96. package/assets/slopmachine/scaffold-playbooks/livewire-default.md +0 -172
  97. package/assets/slopmachine/scaffold-playbooks/overlay-module-matrix.md +0 -130
  98. package/assets/slopmachine/scaffold-playbooks/platform-family-matrix.md +0 -79
  99. package/assets/slopmachine/scaffold-playbooks/spring-boot-default.md +0 -100
  100. package/assets/slopmachine/scaffold-playbooks/tauri-default.md +0 -68
  101. package/assets/slopmachine/scaffold-playbooks/vue-vite-default.md +0 -140
  102. package/assets/slopmachine/scaffold-playbooks/web-default.md +0 -96
package/MANUAL.md CHANGED
@@ -58,6 +58,7 @@ slopmachine init -o
58
58
  - copies the packaged Claude repo rulebook into `repo/CLAUDE.md`
59
59
  - seeds `repo/README.md`, `repo/plan.md`, and `repo/.claude/settings.json`
60
60
  - seeds `.ai/startup-context.md` plus the parent-root planning docs under `docs/`
61
+ - later, when `P5` closes, the workflow preserves the final truthful execution record in `docs/plan.md` and removes `repo/plan.md` before evaluation begins
61
62
  - creates the initial git commit so the workspace starts with a clean tree
62
63
  - optionally opens `opencode` in `repo/`
63
64
  - parallel worktrees should stay under hidden parent-root `.ai/worktrees/` so the visible workspace root stays clean
package/README.md CHANGED
@@ -169,7 +169,7 @@ Important details:
169
169
  - Beads lives in the workspace root, not inside `repo/`
170
170
  - `repo/.claude/settings.json` seeds Claude Code to use the custom `developer` agent by default for that repo
171
171
  - planned parallel git worktrees should live under hidden parent-root `.ai/worktrees/` by default so root-level `repo-lane-*` folders do not clutter the workspace
172
- - final packaging moves `repo/plan.md` to parent-root `docs/plan.md` and removes repo-local `AGENTS.md`, `CLAUDE.md`, and `plan.md` from the delivered `repo/`
172
+ - when `P5` completes, the workflow moves `repo/plan.md` to parent-root `docs/plan.md`; packaging later validates that `repo/plan.md`, `repo/AGENTS.md`, and `repo/CLAUDE.md` are absent from the delivered `repo/`
173
173
  - after non-`-o` bootstrap, the command prints the exact `cd repo` next step so you can continue immediately
174
174
  - `--adopt` moves the current project files into `repo/`, preserves root workflow state in the parent workspace, and skips the automatic bootstrap commit
175
175
  - `--continue-from <PX>` is a smoother alias for existing-project bootstrap; it implies adoption mode and seeds the requested start phase in one step
@@ -177,7 +177,7 @@ Important details:
177
177
  - when a later start phase is seeded for adoption or recovery, the Beads workflow phases before that requested phase are created and immediately marked completed so tracker state matches the seeded entry point
178
178
  - in the `slopmachine-claude` path, if adopted or resumed later-phase work has no recoverable tracked Claude developer session yet, the owner must launch and orient the needed Claude lane first and only then continue the substantive work in that same session
179
179
  - `--phase <PX>` seeds the initial `current_phase` for adoption/recovery bootstrap; the owner should still fall back if the real repo evidence does not support that later phase
180
- - `repo/plan.md` is seeded at bootstrap and becomes the definitive repo-local execution checklist during planning
180
+ - `repo/plan.md` is seeded at bootstrap and becomes the definitive repo-local execution checklist through planning, development, and `P5`; after `P5`, the preserved reference copy is `docs/plan.md`
181
181
 
182
182
  ### `slopmachine set-token`
183
183
 
@@ -69,7 +69,7 @@ When accepted planning artifacts already exist, treat them as the primary execut
69
69
  - treat follow-up prompts mainly as narrow deltas, guardrails, or correction signals
70
70
  - if the current work is the scaffold step at the start of development, treat section 3 of `plan.md` as binding; do not re-choose the playbook, starter, or bootstrap path unless planning is explicitly reopened
71
71
  - if the scaffold-step instructions are still vague about the playbook or bootstrap command, raise that as a planning gap instead of improvising a new baseline contract
72
- - if `plan.md` includes a security execution contract, `Delivery Review Requirements`, `README Contract`, or test coverage execution contract, treat them as binding parts of the current workstream rather than optional follow-up polish
72
+ - if `plan.md` includes a security execution contract, `Core Semantic Path Proof`, `Prompt-Critical Rule Matrix`, `Role Surface Matrix`, `Runtime Lifecycle Checklist`, `Delivery Review Requirements`, `README Contract`, or test coverage execution contract, treat them as binding parts of the current workstream rather than optional follow-up polish
73
73
  - treat the execution file tree and owned-file map in `plan.md` as real execution boundaries, not decorative planning notes
74
74
  - for adopted projects, inspect the current repo tree first and use the accepted `plan.md` delta tree rather than assuming a greenfield layout
75
75
  - keep `plan.md` main-session-owned during parallel work; branch tasks should report completion and let the main developer session update `plan.md` after merge
@@ -100,9 +100,9 @@ When instructed to plan without coding yet:
100
100
  - when backend or fullstack API endpoints are added or changed, prefer real HTTP tests for the exact `METHOD + PATH` over controller or service bypasses when practical
101
101
  - if mocked HTTP tests or unit-only tests still exist for an API surface, do not overstate them as equivalent to true no-mock endpoint coverage
102
102
  - when closing a `plan.md` workstream or bounded follow-up, think briefly about what adjacent flows, runtime paths, or doc/spec claims it could have affected before claiming readiness
103
- - keep `README.md` as the primary documentation file inside the repo; `plan.md` is the explicit execution-plan exception
103
+ - keep `README.md` as the primary documentation file inside the repo; repo-local `plan.md` is the explicit execution-plan exception only during active implementation through `P5`
104
104
  - treat `README.md` and other shared integration-heavy files as main-session-owned by default during parallel work unless the accepted plan explicitly delegates them
105
- - keep the repo self-sufficient and statically reviewable through code plus `README.md`, with `plan.md` as the deliberate execution-plan exception; do not rely on runtime success alone to make the project understandable
105
+ - keep the repo self-sufficient and statically reviewable through code plus `README.md`, with repo-local `plan.md` as the deliberate execution-plan exception only during active implementation through `P5`; do not rely on runtime success alone to make the project understandable
106
106
  - keep the repo self-sufficient; do not make it depend on parent-directory docs or sibling artifacts for startup, build/preview, configuration, verification, or basic understanding
107
107
  - do not touch workflow or rulebook files such as `AGENTS.md` unless explicitly asked
108
108
  - if the work changes acceptance-critical docs or contracts, review those docs yourself before replying instead of assuming someone else will catch inconsistencies later
@@ -111,6 +111,9 @@ When instructed to plan without coding yet:
111
111
  - for backend, fullstack, and web projects, keep the canonical `docker compose up --build` contract in `README.md` and also include the exact legacy compatibility string `docker-compose up` somewhere in startup guidance
112
112
  - for Android, iOS, and desktop projects, keep the required Docker-contained final contract while also maintaining the project-type-specific host-side guidance sections expected by the strict README audit
113
113
  - before reporting development complete, remove local-only setup traces and host-only dependency assumptions from the delivered README and wrapper scripts
114
+ - before reporting development complete, run one deliberate main-session reread against the accepted `plan.md`, `../docs/design.md`, accepted `../docs/api-spec.md` when applicable, `README.md`, and the integrated repo so the owner is not first discovering obvious drift in `P5`
115
+ - before reporting development complete, close the common late-failure classes inside development: `README.md` drift, API-spec drift, missing auth/authorization/ownership enforcement, weak validation or normalized error handling, missing owned tests, startup/test wrapper dishonesty, and partial user-facing or admin-facing flow closure
116
+ - before reporting development complete, explicitly report proof status for the core semantic path, prompt-critical rules, role surface matrix if applicable, runtime lifecycle checklist if applicable, and any residual risks instead of relying only on general test success
114
117
 
115
118
  ## Parallel Execution Model
116
119
 
@@ -121,9 +124,11 @@ When instructed to plan without coding yet:
121
124
  - when the accepted plan already names safe parallel lanes, treat launching them as required unless a real blocker forces a documented revision
122
125
  - good parallel candidates include independent repo reading, verification passes, separate test additions, and implementation branches that touch different modules or well-separated files
123
126
  - do not parallelize tightly coupled work that still depends on unresolved contracts, shared abstractions being invented in real time, or overlapping edits to the same files
124
- - before fan-out, define the branch contract clearly: expected outcome, owned files, boundaries, important shared constraints, support check, and merge condition
127
+ - before fan-out, define the branch contract clearly: expected outcome, owned files, the exact `plan.md` section or checklist items the lane owns, boundaries, important shared constraints, support check, merge condition, and required verification
125
128
  - a branch that owns implementation for a surface should also own the matching tests and coverage work for that surface unless the accepted plan explicitly centralizes shared test harness work first
126
129
  - every planned parallel lane must have its own git worktree, and the assigned subagent should stay in that worktree until the lane is complete or explicitly rerouted
130
+ - before a branch or worktree reports completion, verify its owned implementation against the assigned `plan.md` scope, run the strongest relevant local tests or checks for those owned files, and include the exact commands and results in the handoff back to the main session
131
+ - do not let a branch or worktree report "done" merely because code compiles or the happy path appears present; its owned functionality must be real against the plan and its owned verification must have run
127
132
  - respect the owned-files map from the accepted plan and do not casually cross into another branch's files
128
133
  - after fan-in, reconcile the branches yourself, resolve any overlap cleanly, and run final targeted verification on the integrated result before reporting completion
129
134
  - prefer as many meaningful branches or worktrees as the directory tree safely allows; target at least 5 parallel lanes when the codebase exposes that many low-overlap modules or directories
@@ -152,14 +157,13 @@ During ordinary work, prefer:
152
157
 
153
158
  Broad commands you are not allowed to run during ordinary work:
154
159
 
155
- - never run `./run_tests.sh`
156
160
  - never run `docker compose up --build`
157
161
  - never run any other Docker runtime, Compose, or containerized broad-verification command that stands in for those documented final commands
158
162
  - never run browser E2E or Playwright during ordinary implementation work
159
- - never run full test suites during ordinary implementation work unless explicitly instructed to run that exact command
160
- - do not use those commands even if they are documented in the repo, requested by the owner, suggested by a playbook, implied by `plan.md`, or look convenient for debugging
161
- - if your work would normally call for one of those commands, stop at targeted local verification and report that the change is ready for broader verification
162
- - do not run Docker-based runtime/test commands under any circumstances during planning, development, `P5`, or `P7`; the owner handles the first broad Docker and `./run_tests.sh` verification in `P5` and may rerun it in `P9` for final confirmation
163
+ - do not run full local test suites during ordinary implementation work unless the current milestone or owner instruction actually calls for that exact verification
164
+ - do not use Docker commands even if they are documented in the repo, requested by the owner, suggested by a playbook, implied by `plan.md`, or look convenient for debugging
165
+ - if your work would normally call for Docker, stop at targeted local verification and report that the change is ready for broader verification
166
+ - do not run Docker-based runtime/test commands under any circumstances during planning, development, `P5`, or `P7`; use the prepared local test harness to verify your implementation, the owner reruns that harness in `P5`, and the first real Docker confirmation plus dockerized broad-test run is `P9`
163
167
 
164
168
  Your job is to make the broader verification likely to pass without running it yourself.
165
169
 
@@ -181,7 +185,7 @@ Selected-stack defaults:
181
185
  - do not hardcode database connection values or database bootstrap values anywhere in the repo
182
186
  - for Dockerized web projects, do not require manual `export ...` steps for `docker compose up --build`
183
187
  - for Dockerized web projects, prefer an automatically invoked dev-only runtime bootstrap script instead of checked-in `.env` files or hardcoded runtime values
184
- - for Dockerized web projects, do not introduce a separate pre-seeded secret path for `./run_tests.sh`; use the same runtime bootstrap model or an equivalent generated-value path
188
+ - for Dockerized web projects, do not introduce a separate pre-seeded secret path for `./run_tests.sh`; keep it aligned with the documented local setup model or an equivalent generated-value path
185
189
  - do not treat comments like `dev only`, `test only`, or `not production` as permission to commit secret literals into Compose files, config files, Dockerfiles, or startup scripts
186
190
  - if the project uses mock, stub, fake, or local-data behavior, disclose that scope accurately in `README.md` instead of implying real backend or production behavior
187
191
  - if mock or interception behavior is enabled by default, document that clearly
@@ -235,10 +239,12 @@ If asked to help shape test-coverage evidence, make it acceptance-grade on first
235
239
  Default reply shape for ordinary development follow-up, final release-readiness correction, and fix responses:
236
240
 
237
241
  1. short summary
238
- 2. exact changed files
239
- 3. exact verification commands and results
240
- 4. launched parallel lanes plus any skipped planned lanes with exact reasons when parallel fan-out was part of the work
241
- 5. real unresolved issues only
242
+ 2. closed `plan.md` sections or workstreams
243
+ 3. design and API-contract alignment notes when applicable
244
+ 4. exact changed files
245
+ 5. exact verification commands and results
246
+ 6. launched parallel lanes plus any skipped planned lanes with exact reasons when parallel fan-out was part of the work
247
+ 7. real unresolved issues only
242
248
 
243
249
  Keep the reply compact. Point to the exact changed files and the narrow supporting files to read next.
244
250