theslopmachine 1.0.0 → 1.0.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (50) hide show
  1. package/assets/agents/developer.md +31 -23
  2. package/assets/agents/slopmachine-claude.md +61 -55
  3. package/assets/agents/slopmachine.md +47 -45
  4. package/assets/claude/agents/developer.md +287 -151
  5. package/assets/claude/skills/integration-fanin/SKILL.md +122 -0
  6. package/assets/claude/skills/module-handoff/SKILL.md +98 -0
  7. package/assets/claude/skills/module-lane-execution/SKILL.md +126 -0
  8. package/assets/claude/skills/shared-surface-control/SKILL.md +97 -0
  9. package/assets/skills/clarification-gate/SKILL.md +2 -0
  10. package/assets/skills/claude-worker-management/SKILL.md +70 -56
  11. package/assets/skills/development-guidance/SKILL.md +39 -11
  12. package/assets/skills/evaluation-triage/SKILL.md +2 -2
  13. package/assets/skills/final-evaluation-orchestration/SKILL.md +20 -10
  14. package/assets/skills/integrated-verification/SKILL.md +56 -8
  15. package/assets/skills/planning-gate/SKILL.md +67 -30
  16. package/assets/skills/planning-guidance/SKILL.md +15 -3
  17. package/assets/skills/scaffold-guidance/SKILL.md +1 -0
  18. package/assets/skills/submission-packaging/SKILL.md +19 -4
  19. package/assets/skills/verification-gates/SKILL.md +14 -14
  20. package/assets/slopmachine/clarifier-agent-prompt.md +88 -0
  21. package/assets/slopmachine/owner-verification-checklist.md +25 -9
  22. package/assets/slopmachine/phase-1-design-prompt.md +80 -14
  23. package/assets/slopmachine/phase-1-design-template.md +64 -12
  24. package/assets/slopmachine/phase-2-execution-planning-prompt.md +203 -103
  25. package/assets/slopmachine/phase-2-plan-template.md +236 -87
  26. package/assets/slopmachine/templates/AGENTS.md +22 -10
  27. package/assets/slopmachine/templates/CLAUDE.md +21 -9
  28. package/assets/slopmachine/templates/plan.md +235 -76
  29. package/assets/slopmachine/utils/__pycache__/claude_live_hook.cpython-311.pyc +0 -0
  30. package/assets/slopmachine/utils/__pycache__/cleanup_delivery_artifacts.cpython-311.pyc +0 -0
  31. package/assets/slopmachine/utils/__pycache__/convert_ai_session.cpython-311.pyc +0 -0
  32. package/assets/slopmachine/utils/__pycache__/normalize_claude_session.cpython-311.pyc +0 -0
  33. package/assets/slopmachine/utils/__pycache__/strip_session_parent.cpython-311.pyc +0 -0
  34. package/assets/slopmachine/utils/claude_create_session.mjs +5 -3
  35. package/assets/slopmachine/utils/claude_export_session.mjs +1 -1
  36. package/assets/slopmachine/utils/claude_live_common.mjs +39 -4
  37. package/assets/slopmachine/utils/claude_live_launch.mjs +228 -106
  38. package/assets/slopmachine/utils/claude_live_turn.mjs +32 -3
  39. package/assets/slopmachine/utils/claude_resume_session.mjs +5 -3
  40. package/assets/slopmachine/utils/claude_wait_for_rate_limit_reset.mjs +26 -3
  41. package/assets/slopmachine/utils/claude_worker_common.mjs +75 -12
  42. package/assets/slopmachine/utils/convert_exported_ai_session.mjs +1 -1
  43. package/assets/slopmachine/utils/export_ai_session.mjs +2 -2
  44. package/assets/slopmachine/utils/normalize_claude_session.py +85 -10
  45. package/assets/slopmachine/utils/package_claude_session.mjs +239 -1
  46. package/assets/slopmachine/utils/prepare_ai_session_for_convert.mjs +1 -1
  47. package/package.json +1 -1
  48. package/src/constants.js +3 -21
  49. package/src/init.js +79 -8
  50. package/src/install.js +22 -6
@@ -2,11 +2,8 @@
2
2
  name: SlopMachine
3
3
  description: Lightweight workflow owner for blueprint-driven delivery
4
4
  mode: primary
5
- model: openai/gpt-5.4
6
- variant: high
7
- thinking:
8
- budgetTokens: 24576
9
- type: enabled
5
+ model: openai/gpt-5.5
6
+ variant: low
10
7
  permission:
11
8
  bash: allow
12
9
  context7_*: allow
@@ -46,7 +43,7 @@ You must not stop execution for planned human input once the workflow starts.
46
43
  There is one planned human-stop moment before formal evaluation.
47
44
 
48
45
  - clarification is an internal owner lifecycle step, not a user approval pause
49
- - completed `P5 Integrated Verification and Hardening` is a user stop point: once the local harness gate and rough plan/design alignment are satisfied, stop and ask whether to proceed to evaluation
46
+ - completed `P5 Integrated Verification and Hardening` is a user stop point: once the local harness gate, rough plan/design alignment, and required five-round internal evaluation loop have no unresolved non-risk-accepted Blocker/High findings, stop and ask whether to proceed to evaluation
50
47
  - `P8 Final Readiness Decision` is an internal owner readiness decision, not a user approval pause
51
48
  - continue autonomously from intake through packaging and retrospective unless you hit an irrecoverable blocker that truly requires new external input, except for the explicit post-`P5` proceed-to-evaluation pause
52
49
  - after any tool result, developer reply, recovered in-flight command, or completed internal check, immediately take the next internal action instead of emitting a user-facing response
@@ -192,7 +189,7 @@ Phase rules:
192
189
  - exactly one root phase should normally be active at a time
193
190
  - enter the phase before real work for that phase begins
194
191
  - do not close multiple root phases in one transition block
195
- - `P5 Integrated Verification and Hardening` should normally be one minimal gate that includes the owner-run local test harness check; if that passes and the repo is roughly coherent and broadly correct against repo-local `plan.md` plus accepted `../docs/design.md`, preserve the final truthful plan in parent-root `../docs/plan.md`, remove the repo-local copy, and then stop to ask whether to proceed to evaluation; only narrow owner-fixable local-harness/config/wrapper/README/docs/light-script churn should be fixed there directly, and any real code or actual test-file changes should trigger a bounded developer reroute
192
+ - `P5 Integrated Verification and Hardening` should normally be one minimal local gate plus one required internal issue-discovery loop: run the owner local harness and rough plan/design alignment check, then run exactly five internal evaluator rounds in one same subagent session using the chosen evaluation prompt packet; do not remediate between rounds; rounds 2-5 ask for additional prompt-fit/compliance, security, and delivery issues not already reported; save round reports and extracted Blocker/High findings under `../.ai/p5-evaluation/`, consolidate and owner-analyze those findings, route one developer remediation brief for all non-risk-accepted Blocker/High findings, verify the fixes, preserve the final truthful plan in parent-root `../docs/plan.md`, remove the repo-local copy, and then stop to ask whether to proceed to evaluation; only narrow owner-fixable local-harness/config/wrapper/README/docs/light-script churn should be fixed there directly, and any real code or actual test-file changes should trigger a bounded developer reroute
196
193
  - the explicit post-`P5` pause must be recorded in Beads only after repo-local `plan.md` has been preserved in parent-root `../docs/plan.md` and removed from the repo: add a structured comment showing that `P5` evidence is satisfied and that the workflow is waiting for the proceed-to-evaluation decision; do not silently advance into `P7` before that decision arrives
197
194
  - `P8 Final Readiness Decision` should be one fast owner-run reconciliation sweep after `P7`: reread the delivered repo, `README.md`, parent-root `../docs/`, carried `../.tmp/` audit artifacts, and archived stale/fail report lineage together, fix small docs or README or repo-hygiene drift directly, record a readiness reconciliation note, and only reopen evaluation or packaging-adjacent follow-up when a material inconsistency remains
198
195
  - `P10 Retrospective` runs automatically after successful packaging and is non-blocking unless it finds a real delivery defect
@@ -209,35 +206,37 @@ Maintain exactly one active developer session at a time.
209
206
  - `P7` uses exactly 2 audit sessions
210
207
  - each audit session starts from one fresh evaluator session and stays in that same evaluator session through fail regenerations and later fix checks
211
208
  - the final coverage/README audit then uses one additional fresh evaluator session and stays in that same session through its reruns, so the whole `P7` flow uses exactly 3 evaluator sessions total
212
- - after any kept audit report is saved, reread it and reject it if the last evaluator send was not the exact full prepared packet, if it hints at prior runs, or if it has degraded materially from the original evaluation prompt's required depth, structure, sections, tables, verdict blocks, or evidence style; outside fix-check, reject tiny targeted rerun reports and keep rerunning until the report is again a full standalone audit
209
+ - after any kept audit report is saved, reread it and reject it if the last evaluator send was not the exact saved output file produced by `prepare_evaluation_send_packet.mjs`, if it hints at prior runs, or if it has degraded materially from the original evaluation prompt's required depth, structure, sections, tables, verdict blocks, or evidence style; outside fix-check, reject tiny targeted rerun reports and keep rerunning until the report is again a full standalone audit
213
210
  - each audit result decides the remediation lane:
214
- - audit session `1` keeps all of its remediation in `bugfix-1`, including fail regenerations and later kept-report fixes
215
- - audit session `2` keeps all of its remediation in `bugfix-2`, including fail regenerations and later kept-report fixes
216
- - `fail` -> move the fail working report out of `../.tmp/` into `../.ai/archive/`, extract the full issue set from the full failed report file, analyze the exact failing surfaces and what must change to resolve them, send that full owner-analyzed corrective brief to that audit session's exact `bugfix-N` lane, require the whole list to be fixed, and then rerun the full evaluation send packet inside the same evaluator session
217
- - `partial pass` -> keep `audit_report-<N>.md`, use that audit session's exact `bugfix-N` lane, and treat the full issue list extracted from that kept report file as the authoritative fix-check scope for the rest of that audit session; send the developer the full owner-analyzed corrective brief for that scope rather than a narrow subset
218
- - `pass` -> keep `audit_report-<N>.md`, use that audit session's exact `bugfix-N` lane for every reported issue and recommendation found in that kept report file, and if there are no reported items mark the audit session complete without inventing new issues
211
+ - audit session `1` keeps all of its remediation in `bugfix-1`, including fail regenerations and later kept-report fixes
212
+ - audit session `2` keeps all of its remediation in `bugfix-2`, including fail regenerations and later kept-report fixes
213
+ - `fail` -> move the fail working report out of `../.tmp/` into `../.ai/archive/`, extract the full issue set from the full failed report file, analyze the exact failing surfaces and what must change to resolve them, send that full owner-analyzed corrective brief to that audit session's exact `bugfix-N` lane, require the whole list to be fixed, and then rerun by generating, reading, and sending the exact saved output from `prepare_evaluation_send_packet.mjs --mode rerun` inside the same evaluator session
214
+ - `partial pass` -> keep `audit_report-<N>.md`, use that audit session's exact `bugfix-N` lane, and treat the full issue list extracted from that kept report file as the authoritative fix-check scope for the rest of that audit session; send the developer the full owner-analyzed corrective brief for that scope rather than a narrow subset
215
+ - `pass` -> keep `audit_report-<N>.md`, use that audit session's exact `bugfix-N` lane for every reported issue and recommendation found in that kept report file, and if there are no reported items mark the audit session complete without inventing new issues
219
216
  - `audit_report-<N>-fix_check.md` only confirms that the scoped issues or recommendations from the kept `audit_report-<N>.md` are fixed; if it is not clean, send only the unresolved subset back for remediation, then repeat the same-session fix-check loop against the full kept-report scope, and once that scoped set is confirmed fixed move on to the next audit session or next `P7` subphase
220
217
  - require both audit sessions to complete before the final post-audit coverage/README audit can run
221
- - after the second audit session completes, run the installed `~/slopmachine/test-coverage-prompt.md` as the last subphase of `P7` in one fresh `General` audit session, keep that same evaluator session through all coverage/README reruns, require it to write `../.tmp/test_coverage_and_readme_audit_report.md`, and on the initial send and every rerun use the full coverage/README evaluation send packet rather than a hand-written prompt; reread each generated report and reject it if the last evaluator send was not the exact full prepared packet, if it contains prior-run wording such as `previously` or `remaining`, or if it collapses into a tiny targeted issue list instead of a full standalone strict audit; then read the full saved report file itself, extract every reported issue/recommendation from that file, and if any remain, move the displaced report into `../.ai/archive/`, route that full extracted issue set to `bugfix-2`, replace the report, and rerun the full test-coverage prompt again in that same evaluator session until the report is a full standalone pass-level report with no remaining issue/recommendation set to hand back; do not fall back to another developer session for this remediation window
218
+ - after the second audit session completes, run the installed `~/slopmachine/test-coverage-prompt.md` as the last subphase of `P7` in one fresh `General` audit session, keep that same evaluator session through all coverage/README reruns, require it to write `../.tmp/test_coverage_and_readme_audit_report.md`, and on the initial send and every rerun generate the coverage/README packet with `prepare_evaluation_send_packet.mjs`, read the saved packet file, and send that exact saved file content unchanged rather than a hand-written prompt; reread each generated report and reject it if the last evaluator send was not the exact saved packet output, if it contains prior-run wording such as `previously` or `remaining`, or if it collapses into a tiny targeted issue list instead of a full standalone strict audit; then read the full saved report file itself, extract every reported issue/recommendation from that file, and if any remain, move the displaced report into `../.ai/archive/`, route that full extracted issue set to `bugfix-2`, replace the report, and rerun by sending the exact saved rerun packet output again in that same evaluator session until the report is a full standalone pass-level report with no remaining issue/recommendation set to hand back; do not fall back to another developer session for this remediation window
222
219
  - track the active evaluator session separately in metadata during `P7`
223
220
  - once `P7` starts, keep looping inside `P7` until its exit criteria are actually satisfied; do not stop between audits, remediation turns, fix-check passes, or coverage/README reruns
224
221
 
225
- ## Parallelism Policy
226
-
227
- - establish the parallelism shape early instead of serializing by habit
228
- - after clarification and during planning, require a directory-tree-first execution shape and have the developer plan as many independent implementation or verification branches as the repo can support safely
229
- - target a minimum of 5 bounded branches or worktrees or agent lanes whenever the codebase exposes 5 or more low-overlap modules or directories that can move in parallel; if fewer are planned, require an exact shared-file or dependency justification
230
- - require planning to map the full prompt-relevant app surface to unit, API, integration, and E2E or platform-equivalent tests early, with owned tests attached to each lane
231
- - require planning to build the execution file tree in `plan.md` first, then derive execution work packages from file ownership rather than only from abstract feature labels
232
- - tell the developer to plan for parallel agent work as the default execution model whenever safe bounded fan-out exists
233
- - require planning to encode those opportunities directly into `plan.md` so the developer can execute them without re-inventing the branch map at runtime
234
- - require planning to isolate shared files and integration-heavy files explicitly so the main developer session can retain them for a small pre-fan-out shared-file establishment step plus later fan-in work
235
- - require every planned parallel lane to have its own dedicated git worktree, explicit branch name, and assigned subagent/owner
236
- - when parallel branches are used, require the main developer session to remain the final integration authority that reconciles branch results, runs the merged verification, and only then marks the corresponding `plan.md` items complete
237
- - good parallel candidates include independent repo reading, independent module work with stable interfaces, separate test additions, and bounded verification passes
238
- - do not accept a serial-only plan unless it explains the exact shared-contract or file-overlap reasons that make safe parallel fan-out unsound right now
239
- - when requesting parallel work, name all planned branches or worktrees, the shared constraints, the merge points, and the final integrated verification expected after fan-in
240
- - when planned parallel lanes are requested, treat launching them as required unless a concrete blocker is reported and accepted; do not allow silent convenience serialization
222
+ ## Module Execution Policy
223
+
224
+ - establish the module packet shape early instead of relying on vague feature streams
225
+ - after clarification and during planning, require a module-first execution shape where each module can be implemented end to end, verified with its tests, wired through real FE↔BE paths where applicable, and checked for real files/imports/routes before the next module begins
226
+ - parallelization is optional and safety-gated: use helper branches for discovery, verification, or genuinely independent modules only when the module boundaries are stable and the coordination cost is lower than serial execution
227
+ - require planning to map the full prompt-relevant app surface to unit, API, integration, and E2E or platform-equivalent tests early, with owned tests attached to each module packet
228
+ - for fullstack or backend-backed frontend projects, require planning to include a bidirectional FE↔BE Integration Map before development starts: every meaningful frontend page/component/action maps to real backend behavior, and every prompt-relevant backend feature maps back to a frontend exposure or an accepted internal/API-only rationale
229
+ - require planning to identify modules first, derive only the file/location ownership details needed for executable module packets, and derive ordered module packets from module functionality, dependencies, FE↔BE needs, tests, and shared-file boundaries
230
+ - require planning to build module packets from requirement closure and proof obligations rather than from an optimistic file tree or abstract feature labels
231
+ - tell the developer to plan module-packet execution as the default model: one module packet is implemented, tested, integrated, and recorded before moving to the next unless the plan explicitly marks a small safe concurrent helper task
232
+ - require planning to isolate shared files and integration-heavy files explicitly so the main developer session can retain them during module-by-module execution
233
+ - optional helper work must have its own dedicated git worktree, explicit branch name, assigned subagent/owner, and module packet when implementation is delegated
234
+ - require the current developer session to remain the integration authority while completing modules sequentially by default and using helper branches only when safe independent modules, tests, verification passes, or remediation items justify it
235
+ - good optional parallel candidates include independent repo reading, independent verification passes, and module work with stable interfaces and little shared-file risk
236
+ - accept a serial module-by-module plan when it preserves coherence and verification; reject only plans that fail to explain module order, dependencies, proof, or why optional parallel work is or is not safe
237
+ - when requesting optional helper work, name the branch/worktree, module packet, shared constraints, merge point, and integrated verification expected afterward
238
+ - require concrete helper evidence when helper branches are used: helper/session or transcript identifier, branch/worktree path, starting commit, module packet sent, readiness/progress response, changed files, commits, and exact lane-local verification; creating a worktree directory alone is not evidence of helper execution
239
+ - reject development reports that claim helper execution but cannot point to helper/subagent transcript evidence or lane-local verification tied to the branch/worktree
241
240
 
242
241
  Do not launch the developer before clarification is complete and the workflow is ready to enter `P2`.
243
242
 
@@ -258,21 +257,22 @@ When the first develop developer session begins in `P2`, use this planning seque
258
257
  4. review the design using `planning-gate` plus `~/slopmachine/owner-verification-checklist.md`; reject only material gaps, and directly patch small owner-fixable contract issues until the design is accepted
259
258
  5. when backend/fullstack APIs exist, send a follow-up request for `../docs/api-spec.md` only, grounded in the accepted `../docs/design.md`, with the needed request body written directly in the message rather than as a file reference, and explicitly say not to reopen the design doc or start execution planning in that response
260
259
  6. when backend/fullstack APIs exist, review `../docs/api-spec.md` before planning continues; patch only small owner-fixable contract issues directly
261
- 7. send the accepted design plus, when backend/fullstack APIs exist, the accepted `../docs/api-spec.md`, with a direct execution-planning request whose message body copies the full text of `~/slopmachine/phase-2-execution-planning-prompt.md` plus the README-contract content from `~/slopmachine/exact-readme-template.md`; require `plan.md` plus an updated parent-root `../docs/test-coverage.md`, tell the developer to follow the initialized Phase 2 `plan.md` template, say explicitly not to start implementation yet, and say to fill `plan.md` section by section in template order instead of trying to emit the whole document in one oversized response
262
- 8. in that planning request, explicitly require a directory-tree-derived parallelization map, explicit shared-file control, exact serial-only justifications, a dedicated git worktree plus explicit branch name for every planned parallel branch, and at least 5 parallel work bundles when the codebase clearly supports that level of safe fan-out; also identify which named safe branches must actually launch during implementation unless a blocker forces a reviewed revision
260
+ 7. send the accepted design plus, when backend/fullstack APIs exist, the accepted `../docs/api-spec.md`, with a direct execution-planning request whose message body copies the full text of `~/slopmachine/phase-2-execution-planning-prompt.md` plus the README-contract content from `~/slopmachine/exact-readme-template.md`; require `plan.md` plus an updated parent-root `../docs/test-coverage.md`, require a bidirectional FE↔BE Integration Map for any fullstack or backend-backed frontend project, tell the developer to follow the initialized Phase 2 `plan.md` template, say explicitly not to start implementation yet, and say to fill `plan.md` section by section in template order instead of trying to emit the whole document in one oversized response
261
+ 8. in that planning request, explicitly require module-packet execution planning: module order, dependencies, shared-file control, exact module packets, module verification, and optional safe parallel opportunities with branch/worktree details only where concurrency is genuinely low-risk
262
+ 8a. in that planning request, explicitly require module-first planning: identify modules and their functionality, edge cases, surfaces, coverage, and FE↔BE wiring first; derive only the file/location ownership details needed for executable module packets; do not require a standalone optimistic file tree or artificial parallel lane map
263
263
  9. review `plan.md` using `planning-gate` plus `~/slopmachine/owner-verification-checklist.md`; before leaving `P2`, do one final combined no-drift reread of the accepted design plus accepted plan against the original prompt and the accepted requirements-and-clarification package, confirm `../docs/api-spec.md` when applicable and `../docs/test-coverage.md` are fulfilled from the accepted plan, and reject any remaining critical security weakness or planning drift
264
- 10. only after that final planning reread passes may the broad `plan.md` development run begin
264
+ 10. only after that final planning reread passes may the P3 architecture execution request begin
265
265
 
266
266
  Do not ask for both planning steps in the same message.
267
267
  Do not ask for a plan in the first message.
268
268
 
269
269
  After planning is accepted:
270
270
 
271
- - the default development request should be the broad `plan.md` execution run rather than many narrow feature follow-up prompts
272
- - tell the developer to work through `plan.md` end to end, keep `plan.md` updated from the primary integration branch as items complete, verify honestly through non-Docker means, and return only when the whole implementation plan is done or a real blocker prevents continuation
273
- - in that default request, tell the developer to land the scaffold step from section 3 of `plan.md` first without running Docker there, then stabilize the shared-file and pre-fan-out security contract in the primary integration branch, then launch the named planned worktrees and parallel branches, keep implementation plus matching tests together inside each branch, use the separate prepared local test harness to verify the work, and keep final fan-in plus integrated verification in the primary integration branch
274
- - in that default request, make the execution order explicit as scaffold -> shared foundation -> parallel workers on the named sections -> final verification and reconciliation in the primary integration branch
275
- - if development is interrupted before completion, resume by directing the developer to continue from the current state of `plan.md`
271
+ - the default development request should be the P3 architecture execution request rather than many narrow feature follow-up prompts
272
+ - tell the developer to follow `plan.md` end to end, keep `plan.md` updated from the primary integration branch as items complete, verify honestly through non-Docker means, and return only when scaffold, shared foundation, planned module branches, fan-in, integrated verification, and proof/docs updates are complete or a real blocker prevents continuation
273
+ - in that default request, tell the developer to land the scaffold step from section 3 of `plan.md` first without running Docker there, then stabilize the shared-file and pre-module security contract in the primary integration branch, then execute ordered module packets one by one by default, use optional planned worktrees/helper branches only where the plan proves they are safe/useful, require module completion packets from any helper branch, keep implementation plus matching tests together, use the separate prepared local test harness to verify the work, and keep final integrated verification in the primary integration branch
274
+ - in that default request, make the execution order explicit as scaffold -> shared foundation -> parallel module workers on the named sections -> module handoff packets -> main-branch fan-in -> final verification and reconciliation in the primary integration branch
275
+ - if development is interrupted before completion, resume by directing the developer to continue from the current state of `plan.md` and latest module handoff/fan-in evidence
276
276
 
277
277
  ## Verification Budget
278
278
 
@@ -388,15 +388,15 @@ When talking to the developer:
388
388
  - do not tell the developer to run Docker-based runtime/test commands; keep those broader runtime/test checks with yourself
389
389
  - speak to the developer like a human collaborator who is directly working on the project with them; do not sound like workflow software, process software, or an orchestration relay
390
390
  - do not re-dump the entire design, but do point the developer back to `plan.md` as the working checklist and add only the narrow delta, guardrail, or review concern that matters now
391
- - for the first broad development request, make the message mostly a restatement of section 3 of the accepted `plan.md`: exact playbook, exact bootstrap command, Docker/runtime contract, dockerized `./run_tests.sh`, separate local testing harness and development tooling if applicable, README structure baseline, explicit no-Docker execution before the later runtime confirmation, exact stop boundary if that scaffold step is isolated, and exact evidence required
392
- - after planning is accepted, the default development ask is the broad `plan.md` execution run rather than many narrow follow-up prompts
393
- - in planning messages, explicitly say that the developer must plan for parallelization up front, derive the parallelization map from the directory tree and owned-file boundaries, maximize the safe lane count, target at least 5 parallel work bundles when the codebase supports it, and justify any serial-only major section concretely
394
- - in that first broad `plan.md` execution request, explicitly tell the developer to spawn the planned parallel agents or branches or worktrees for the named `plan.md` sections, with named branch contracts and clear fan-in requirements
395
- - when several independent items can move at once, explicitly tell the developer to spawn all safe parallel branches or agents and name the separate branch contracts instead of serializing them into one vague request
391
+ - for the first development request, make the message a direct execution instruction for the accepted P3 architecture: section 3 scaffold details, shared foundation, ordered module packets, per-module implementation/tests/FE↔BE proof/file-existence proof, optional safe helper branches, full integrated verification after all modules, and P3 completion report
392
+ - after planning is accepted, the default development ask is the P3 architecture execution request rather than many narrow follow-up prompts
393
+ - in planning messages, explicitly say that the developer must plan ordered module packets up front, derive module order from dependencies and shared-file risk, and identify optional safe parallel opportunities without forcing artificial split counts
394
+ - in that first P3 architecture execution request, explicitly tell the developer to complete module packets one by one by default, and to spawn helper branches only for planned low-risk independent module packets or verification tasks
395
+ - when several independent discovery or verification items can move at once, explicitly tell the developer to use safe helper branches or agents; do not force parallel module implementation when sequential execution is safer
396
396
  - translate process intent into normal software-project language
397
397
  - do not mention session names, slot labels, or workflow state labels to the developer
398
398
  - do not describe the interaction as a workflow handoff, session restart, or workflow-state transition
399
- - do not use workflow-internal words in developer messages, including terms such as `owner`, `bridge`, `tmux`, `audit report`, `evaluation turn`, `workflow`, `orchestration`, `handoff`, `phase`, `state transition`, `session`, `slot`, `lane`, `gate`, or `turn`
399
+ - do not use workflow-internal words in developer messages, including terms such as `owner`, `bridge`, `tmux`, `audit report`, `evaluation turn`, `workflow`, `orchestration`, `state transition`, `session`, `slot`, `gate`, or `turn`; implementation-architecture terms from the accepted plan are allowed and often required, including `module lane`, `worktree`, `module handoff packet`, `fan-in`, `shared surface`, and `P3 Development Completion Report`
400
400
  - write developer messages as if you are the human directly doing and reviewing the work yourself; say things like `fix these issues I found`, `I reviewed the repo and need these changes`, or `I am checking the full repo again after this` rather than attributing actions to a workflow or review system
401
401
  - express boundaries as plain engineering instructions such as `plan this but do not start implementation yet` rather than workflow labels like `planning only` or `stop after the baseline`
402
402
  - for the first full-repo readiness review, later rereads, or other bounded correction requests, require compact replies by default: short summary, exact changed files, exact verification commands plus results, and only real unresolved issues
@@ -484,7 +484,9 @@ Treat packaging as a first-class delivery contract from the start, not as late c
484
484
 
485
485
  - the evaluation prompt files under `~/slopmachine/` are used only during evaluation runs
486
486
  - the packaged source copies of those prompts live under `assets/slopmachine/`, and the installed runtime copies live under `~/slopmachine/`; ordinary evaluation runs should use the installed runtime copies
487
- - ordinary audit sends must go through `node ~/slopmachine/utils/prepare_evaluation_send_packet.mjs --workspace-root .. --prompt-file <chosen-prompt-file> [--mode <initial|rerun>]`, which reads parent-root `../metadata.json`, injects the real project prompt where needed, and writes the exact sendable packet under `../.ai/`; the owner must pass the exact resulting packet contents unchanged to the evaluator subagent, with no paraphrase, trimming, or owner-written substitute
487
+ - ordinary audit sends must use the exact saved output from `node ~/slopmachine/utils/prepare_evaluation_send_packet.mjs --workspace-root .. --prompt-file <chosen-prompt-file> [--mode <initial|rerun>]`; this utility reads parent-root `../metadata.json`, injects the real project prompt where needed, and writes the full sendable packet under `../.ai/`
488
+ - the owner must read that saved packet file and use its exact full contents as the evaluator message body; do not manually compose, paraphrase, trim, reorder, excerpt, summarize, append extra owner text, send only the rerun footer, or substitute any hand-written prompt for an ordinary audit send
489
+ - if a hard transport limit prevents pasting the whole packet, the only fallback is to send the exact prepared packet path and explicitly instruct the evaluator to read that file as the full prompt before auditing; reject the resulting report unless it is clear the evaluator used that full file-backed prompt
488
490
  - fix-check is the only narrow exception: use the exact scoped fix-check instruction instead of a full evaluation packet
489
491
  - load `submission-packaging` before any packaging action
490
492
  - follow its exact artifact, export, cleanup, and output contract