theslopmachine 1.0.2 → 1.0.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (39) hide show
  1. package/assets/agents/developer.md +38 -32
  2. package/assets/agents/slopmachine-claude.md +36 -25
  3. package/assets/agents/slopmachine.md +61 -45
  4. package/assets/claude/agents/developer.md +27 -10
  5. package/assets/skills/claude-worker-management/SKILL.md +4 -4
  6. package/assets/skills/developer-session-lifecycle/SKILL.md +13 -3
  7. package/assets/skills/development-guidance/SKILL.md +24 -5
  8. package/assets/skills/evaluation-triage/SKILL.md +4 -4
  9. package/assets/skills/final-evaluation-orchestration/SKILL.md +29 -3
  10. package/assets/skills/integrated-verification/SKILL.md +24 -23
  11. package/assets/skills/p8-readiness-reconciliation/SKILL.md +98 -0
  12. package/assets/skills/planning-gate/SKILL.md +2 -2
  13. package/assets/skills/planning-guidance/SKILL.md +7 -4
  14. package/assets/skills/scaffold-guidance/SKILL.md +2 -0
  15. package/assets/skills/submission-packaging/SKILL.md +30 -3
  16. package/assets/skills/verification-gates/SKILL.md +11 -7
  17. package/assets/slopmachine/clarification-faithfulness-review-prompt.md +69 -45
  18. package/assets/slopmachine/clarifier-agent-prompt.md +46 -40
  19. package/assets/slopmachine/exact-readme-template.md +38 -11
  20. package/assets/slopmachine/owner-verification-checklist.md +2 -2
  21. package/assets/slopmachine/phase-1-design-prompt.md +94 -17
  22. package/assets/slopmachine/phase-1-design-template.md +124 -21
  23. package/assets/slopmachine/phase-2-execution-planning-prompt.md +155 -87
  24. package/assets/slopmachine/phase-2-plan-template.md +169 -81
  25. package/assets/slopmachine/scaffold-playbooks/selection-matrix.md +8 -1
  26. package/assets/slopmachine/scaffold-playbooks/tech-frontend-vue.md +2 -0
  27. package/assets/slopmachine/scaffold-playbooks/type-web-spa.md +1 -0
  28. package/assets/slopmachine/templates/AGENTS.md +18 -17
  29. package/assets/slopmachine/templates/CLAUDE.md +18 -17
  30. package/assets/slopmachine/templates/plan.md +115 -36
  31. package/package.json +9 -2
  32. package/src/constants.js +1 -0
  33. package/src/init.js +8 -0
  34. package/src/install.js +130 -0
  35. package/assets/slopmachine/utils/__pycache__/claude_live_hook.cpython-311.pyc +0 -0
  36. package/assets/slopmachine/utils/__pycache__/cleanup_delivery_artifacts.cpython-311.pyc +0 -0
  37. package/assets/slopmachine/utils/__pycache__/convert_ai_session.cpython-311.pyc +0 -0
  38. package/assets/slopmachine/utils/__pycache__/normalize_claude_session.cpython-311.pyc +0 -0
  39. package/assets/slopmachine/utils/__pycache__/strip_session_parent.cpython-311.pyc +0 -0
@@ -3,6 +3,8 @@
3
3
  Use this template to produce the accepted execution plan.
4
4
  The output should normally be written to `plan.md`.
5
5
 
6
+ > Write this template section by section, appending each completed section to `plan.md`. Do not paste full section text in chat or dump the accumulated document at the end.
7
+
6
8
  This file is the authoritative execution checklist inside the repo.
7
9
 
8
10
  ---
@@ -30,12 +32,13 @@ This file is the authoritative execution checklist inside the repo.
30
32
  - No unauthorized narrowing
31
33
  - No `.env` or `.env.example` files anywhere
32
34
  - Tests travel with implementation
33
- - Shared files are main-lane controlled
35
+ - Shared files are controlled by the primary implementation sequence
34
36
  - Runtime/test must remain honest and portable
35
- - Final integrated verification and hardening is mandatory
37
+ - Final integrated local verification and hardening is mandatory
38
+ - External planning documents under `../docs/` are planning/reference inputs only. Delivery evidence must exist inside the repo through `README.md`, code structure, route/app/server registration, tests, scripts, config/manifests, Docker/runtime files, and seeded/bootstrap paths where applicable. Do not rely on parent-directory docs as the only evidence for any final delivery, runtime, README, API, frontend, security, or test obligation.
36
39
 
37
40
  ### 1.3 Handoff exceptions
38
- | Exception | Why it exists | Impact | Required owner decision |
41
+ | Exception | Why it exists | Impact | Required decision |
39
42
  |---|---|---|---|
40
43
  | | | | |
41
44
 
@@ -49,13 +52,22 @@ This file is the authoritative execution checklist inside the repo.
49
52
  - Shared-foundation-first rule:
50
53
  - Parallelization rule:
51
54
  - Merge discipline rule:
52
- - Final convergence rule: `P5` is one minimal gate that includes the owner-run local test harness. If that passes and the repo is roughly coherent with `plan.md`, stop and ask whether to proceed to evaluation. Fix only owner-side local-harness/config/wrapper/README/docs/light-script churn there, reroute any real code or actual test-file work back to the developer, and defer Docker/runtime plus dockerized `./run_tests.sh` to final packaging confirmation.
55
+ - Vertical implementation rule: build one complete user/operator flow end-to-end before starting the next; do not build broad placeholder coverage across modules
56
+ - Form-contract rule: every form must have matching frontend field names and backend handler field names, proven by a test that submits the rendered form through the real handler
57
+ - Runtime-wiring rule: every background job must be wired from the entrypoint and verified reachable before claiming complete
58
+ - Security-layer rule: every security control must be enforced at the correct layer (service, middleware, DB, template, runtime) before claiming complete
59
+ - No artifact-count completion rule: do not claim completion based on file counts, route counts, template counts, or test counts
60
+ - Final convergence rule: once implementation is complete, run the accepted local harness, reconcile the repo against `plan.md`, fix narrow docs/config/wrapper/light-script issues directly when assigned, route real code or test-file fixes through the active correction workstream, and leave Docker/runtime plus dockerized `./run_tests.sh` for the explicit final runtime confirmation step.
61
+
62
+ If your active instruction asks only for implementation readiness, report readiness after integrated local verification and a truthful handoff.
63
+
64
+ If implementation is explicitly scoped end to end in one autonomous execution run, it must not stop at rough local readiness. It must continue through all implementation, assigned tests, README/runtime/script reconciliation, integrated local verification, and final truthful handoff unless the active instruction explicitly says to stop earlier.
53
65
 
54
66
  ---
55
67
 
56
68
  ## 3. Scaffold Step at the Start of Development
57
69
 
58
- This section is planned during `P2` and then executed as the first step of development. It is not a separate lifecycle phase or owner gate.
70
+ This section is planned before implementation and then executed as the first step of development. It is not a separate product feature.
59
71
 
60
72
  This scaffold step should be strict and straightforward. Its job is only to establish the minimal honest delivery baseline before real feature development starts.
61
73
 
@@ -64,10 +76,12 @@ This scaffold step should be strict and straightforward. Its job is only to esta
64
76
  - Exact bootstrap command:
65
77
  - Expected directories/files immediately after bootstrap:
66
78
  - Language/framework bootstrap notes to preserve:
79
+ - Frontend defaults applied if prompt/repo are silent: Vue 3 + Vite + TypeScript, Tailwind CSS, and shadcn/ui when compatible:
67
80
  - Minimal bootstrap proof surface to land now (for example a hello-world route/page/app shell):
68
- - Docker/runtime setup required in this step so the owner can later run `docker compose up --build` during final packaging confirmation:
81
+ - Docker/runtime setup required in this step so `docker compose up --build` can be run during final runtime confirmation:
82
+ - Seeded quick-start data strategy required in this step, or `Not Applicable` with reason:
69
83
  - `./run_tests.sh` setup required in this step as the dockerized broad-test path reserved for final packaging confirmation:
70
- - Local test harness setup required in this step so development and owner-side `P5` can use it:
84
+ - Local test harness setup required in this step so implementation and later local readiness checks can use it:
71
85
  This should name the stack-native local suite explicitly, for example Vitest, Jest, PHPUnit, pytest, go test, cargo test, or another framework-native equivalent.
72
86
  - Reliability rules the local test harness and deferred Docker/runtime plus dockerized broad-test path must satisfy later:
73
87
  Include what must be installed/configured locally if the stack-native local suite is not already available on the machine.
@@ -75,29 +89,30 @@ Include what must be installed/configured locally if the stack-native local suit
75
89
  - Local testing harness/tooling to establish now:
76
90
  - Exact README template/section structure to land now:
77
91
  - Files that must exist before deeper feature work:
78
- - Stop boundary if the owner explicitly isolates only this step:
92
+ - Stop boundary if the active instruction explicitly isolates only this step:
79
93
  - Completion evidence for this step, including Docker/runtime files and dockerized `./run_tests.sh` being present plus the separate local test harness being prepared honestly:
80
94
 
81
95
  ### 3.1 Required scaffold work inside development
82
96
 
83
97
  - bootstrap the project for the selected language/framework/runtime with only a minimal hello-world-style proof surface
84
- - set up the real Docker/runtime path required by the project so the owner can run `docker compose up --build` later during final packaging confirmation
98
+ - set up the real Docker/runtime path required by the project so `docker compose up --build` can run later during final runtime confirmation
99
+ - set up deterministic, idempotent seeded quick-start data through the normal bootstrap/database/runtime path when the product is not useful from an empty state
85
100
  - set up repo-root `./run_tests.sh` as the real dockerized broad-test entrypoint for the later final packaging confirmation run
86
101
  - make the local test harness and deferred Docker/runtime plus dockerized broad-test path reviewably reliable: no hidden setup steps, no manual exports for Docker, no hidden shell state for local tests, deterministic execution, and useful failure output
87
102
  - do not run Docker or dockerized `./run_tests.sh` during the scaffold step; prepare the local harness for immediate development use
88
- - set up the local testing harness/tooling used for ordinary development and owner-side `P5`
103
+ - set up the local testing harness/tooling used for ordinary development and later local readiness checks
89
104
  - set up `README.md` to the exact required template/section shape
90
105
  - keep the scaffold free of prompt-specific business logic, deep domain implementation, or polish work beyond the baseline needed to prove startup and test wiring
91
106
  - require the scaffold to avoid local dependency setup beyond Docker and the documented host minimums
92
107
  - finish this step before broader module implementation starts
93
- - once this step is complete, continue directly into the remaining development work unless the owner explicitly says to stop
108
+ - once this step is complete, continue directly into the remaining development work unless the active instruction explicitly says to stop
94
109
 
95
110
  ---
96
111
 
97
112
  ## 4. Security Execution Contract
98
113
 
99
114
  ### 4.1 Security-sensitive surfaces
100
- | Surface | Required control | Must land before module implementation? | Owning module/lane | Required negative tests |
115
+ | Surface | Required control | Must land before module implementation? | Responsible module/work package | Required negative tests |
101
116
  |---|---|---|---|---|
102
117
  | | | | | |
103
118
 
@@ -106,16 +121,16 @@ Include what must be installed/configured locally if the stack-native local suit
106
121
  -
107
122
  -
108
123
 
109
- ### 4.3 Security lane plan if needed
110
- - Dedicated security lane needed? [yes/no]
124
+ ### 4.3 Security work package plan if needed
125
+ - Dedicated security work package needed? [yes/no]
111
126
  - If yes:
112
- - Lane label:
127
+ - Work package label:
113
128
  - Owned files:
114
- - Merge target:
129
+ - Integration target:
115
130
  - Blocking dependencies:
116
- - Merge gate:
131
+ - Integration check:
117
132
 
118
- ### 4.4 Security merge gates
133
+ ### 4.4 Security integration checks
119
134
  -
120
135
  -
121
136
  -
@@ -123,33 +138,36 @@ Include what must be installed/configured locally if the stack-native local suit
123
138
  ### 4.5 Role Surface Matrix
124
139
  Required for any role-sensitive project. If not applicable, state `Not Applicable` and why.
125
140
 
126
- | Route / page / API / action / data object / notification / export / admin function | Roles allowed | Read/write scope | Object ownership rule | Admin override rule | Viewer/read-only rule | Audit/logging requirement | Frontend route/nav expectation | Required implementation evidence | Required test evidence |
141
+ | Route / page / API / action / data object / notification / export / admin function | Roles allowed | Read/write scope | Object ownership rule | Admin override rule | Viewer/read-only rule | Logging/accountability requirement | Frontend route/nav expectation | Required implementation evidence | Required test evidence |
127
142
  |---|---|---|---|---|---|---|---|---|---|
128
143
  | | | | | | | | | | |
129
144
 
130
- Any contradiction between this matrix and `../docs/design.md`, `../docs/api-spec.md`, router/frontend behavior, backend checks, README, or tests is a planning defect before development and a release-alignment defect before evaluation.
145
+ Any contradiction between this matrix and `../docs/design.md`, `../docs/api-spec.md`, router/frontend behavior, backend checks, README, or tests is a planning defect before development and a release-alignment defect before handoff.
131
146
 
132
147
  ---
133
148
 
134
149
  ## 5. Test Coverage Execution Contract
135
150
 
136
151
  ### 5.1 Coverage target and floor
137
- - Coverage target: meaningful mapped tests for every evaluator-relevant endpoint, core flow, security boundary, and major failure path
152
+ - Coverage target: meaningful mapped tests for every delivery-relevant endpoint, core flow, security boundary, and major failure path
153
+ - Unit coverage floor: at least `90%` line/branch coverage for unit-testable product code where stack tooling can measure this honestly; otherwise name the exact measurement limitation and substitute a complete file/function/assertion ledger
154
+ - E2E/platform coverage floor: at least `90%` of planned prompt-critical E2E/platform flow rows complete before clean development completion; otherwise name the exact accepted exception and compensating proof
155
+ - API coverage floor when APIs exist: `100%` of documented prompt-relevant `METHOD + PATH` surfaces have true no-mock HTTP tests unless a per-endpoint narrow exception and compensating proof are recorded
138
156
  - Measurement path:
139
157
  - Confidence notes / expected weak spots if any:
140
158
 
141
159
  ### 5.2 Surface-to-test mapping
142
- | Planned surface / work package | Source requirement ID(s) | Required test layers | Planned test files / suites | Coverage evidence expected | Owning lane | Merge proof required |
160
+ | Planned surface / work package | Source requirement ID(s) | Required test layers | Planned test files / suites | Coverage evidence expected | Responsible module/work package | Integration proof required |
143
161
  |---|---|---|---|---|---|---|
144
162
  | | | | | | | |
145
163
 
146
164
  ### 5.3 Full prompt-relevant surface inventory
147
- | Surface / flow / module | Source requirement ID(s) | Unit coverage | API coverage | Integration coverage | E2E/platform coverage | Planned test files / suites | Owning lane |
165
+ | Surface / flow / module | Source requirement ID(s) | Unit coverage | API coverage | Integration coverage | E2E/platform coverage | Planned test files / suites | Responsible module/work package |
148
166
  |---|---|---|---|---|---|---|---|
149
167
  | | | | | | | | |
150
168
 
151
169
  ### 5.4 Important risk coverage
152
- | Risk point | Planned tests | Expected assertions | Owning lane |
170
+ | Risk point | Planned tests | Expected assertions | Responsible module/work package |
153
171
  |---|---|---|---|
154
172
  | 401 unauthenticated | | | |
155
173
  | 403 unauthorized | | | |
@@ -170,7 +188,7 @@ Any contradiction between this matrix and `../docs/design.md`, `../docs/api-spec
170
188
  ### 5.5 Frontend coverage obligations
171
189
  If applicable:
172
190
  - frontend unit tests required? [yes/no]
173
- - Frontend unit tests verdict for later audit: [PRESENT | MISSING]
191
+ - Frontend unit tests status for later review: [PRESENT | MISSING]
174
192
  - frontend unit-test file-level evidence plan:
175
193
  - important frontend components/modules that must not remain untested:
176
194
  - component/state tests:
@@ -196,7 +214,7 @@ Required for every fullstack or backend-backed frontend project. If not applicab
196
214
 
197
215
  Planning must prove both directions before development starts: every meaningful frontend surface must have real backend support, and every prompt-relevant backend capability must be surfaced in the frontend or explicitly justified as internal/API-only.
198
216
 
199
- | Frontend page/component/action | User intent / requirement ID | Backend `METHOD + PATH` / service / job used | Request payload / query / state input | Expected response / side effect | Frontend states to implement | Real integration proof planned | Owner/lane |
217
+ | Frontend page/component/action | User intent / requirement ID | Backend `METHOD + PATH` / service / job used | Request payload / query / state input | Expected response / side effect | Frontend states to implement | Real integration proof planned | Responsible module/work package |
200
218
  |---|---|---|---|---|---|---|---|
201
219
  | | | | | | loading / empty / submitting / disabled / success / error / duplicate-action | | |
202
220
 
@@ -233,6 +251,7 @@ If applicable:
233
251
  - accepted `../docs/api-spec.md` source of truth:
234
252
  - resolved endpoint inventory required? [yes/no]
235
253
  - target `100%` of documented `METHOD + PATH` surfaces through true no-mock HTTP tests unless a narrow exception is accepted:
254
+ - any endpoint below true no-mock HTTP coverage must have a row naming the exception reason, compensating proof, residual risk, and acceptance decision:
236
255
  - allowed exceptions:
237
256
  - exact planned API test files / suites by endpoint family:
238
257
  - endpoint-family mapping table or equivalent location in this plan:
@@ -294,9 +313,10 @@ The final `README.md` must include these sections exactly:
294
313
  8. verification method
295
314
  9. testing
296
315
  10. authentication and roles
297
- 11. workflow / operational notes if applicable
298
- 12. feature-flag/mock/debug/demo disclosure if applicable
299
- 13. important limitations or non-goals if material
316
+ 11. quick-start seeded data
317
+ 12. lifecycle / operational notes if applicable
318
+ 13. feature-flag/mock/debug/demo disclosure if applicable
319
+ 14. important limitations or non-goals if material
300
320
 
301
321
  ### 6.1 Required command strings
302
322
  - `docker compose up --build`
@@ -309,9 +329,16 @@ Choose one:
309
329
  - credentials table for every relevant role
310
330
  - exact statement `No authentication required`
311
331
 
332
+ ### 6.2.1 Quick-start seeded data rule
333
+ Choose one:
334
+ - seeded accounts/sample records/fixture data table with values, roles, IDs, URLs, and steps needed to exercise the main flows quickly
335
+ - exact statement `No seeded data required; the app is useful from an empty state` with a short reason
336
+
337
+ Seeded data must be deterministic and idempotent. It must be created by the normal runtime/bootstrap/database path, not by hidden manual setup. It may provide local demo/test fixtures, but it must not replace real product behavior with fake success paths or static demo-only UI.
338
+
312
339
  ### 6.3 Ownership rule
313
- - Main-lane owner:
314
- - Parallel lanes may propose updates through:
340
+ - Primary-sequence responsibility:
341
+ - Parallel work packages may propose updates through:
315
342
  - Final reconciliation point:
316
343
 
317
344
  ---
@@ -341,7 +368,7 @@ Choose one:
341
368
 
342
369
  ## 9. Module-First Product Decomposition
343
370
 
344
- Planning must identify modules before creating execution order or file/location ownership details. A full optimistic file tree is no longer required; file, route, directory, and test paths should be named where they make module packets executable and reviewable.
371
+ Planning must identify modules before creating execution order or file/location responsibility details. A full optimistic file tree is no longer required; file, route, directory, and test paths should be named where they make module packets executable and verifiable.
345
372
 
346
373
  ### 9.1 Module inventory
347
374
  | Module | Requirement IDs | Purpose | Actors served | Owns frontend surfaces? | Owns backend/API/jobs? | Owns data/storage? | Core success behavior | Depends on |
@@ -370,7 +397,7 @@ Every critical module behavior needs at least one assertion-level row. Generic p
370
397
  ### 9.3 Module coverage contract
371
398
  Every module must have concrete planned proof. Do not let a module rely only on broad smoke tests unless it is truly a trivial wrapper and the reason is stated.
372
399
 
373
- | Module | Unit tests | API tests | Integration tests | E2E/platform tests | Frontend component/state tests | Required negative/risk tests | Main-lane module verification before development exit | Coverage owner |
400
+ | Module | Unit tests | API tests | Integration tests | E2E/platform tests | Frontend component/state tests | Required negative/risk tests | Integrated module verification before development exit | Coverage responsibility |
374
401
  |---|---|---|---|---|---|---|---|---|
375
402
  | | | | | | | | | |
376
403
 
@@ -384,7 +411,7 @@ Every module must have concrete planned proof. Do not let a module rely only on
384
411
  - every frontend-bearing module must have real backend wiring when backend support exists
385
412
  - every backend feature required by the prompt must be exposed through the frontend when user-facing, or explicitly marked internal/API-only with accepted rationale
386
413
  - every module must own or explicitly inherit its unit/API/integration/E2E/frontend-state proof
387
- - every module must be rechecked by the main lane after fan-in before development is declared complete: files are real, integrated, covered by assigned tests, and not orphaned from the rest of the app
414
+ - every module must be rechecked after fan-in before development is declared complete: files are real, integrated, covered by assigned tests, and not orphaned from the rest of the app
388
415
  - lazy completion examples that must not pass: shell handlers, hardcoded success, uncalled services, static demo data presented as backend data, disconnected forms, TODO client methods, untested security checks, and orphaned backend capabilities
389
416
 
390
417
  ### 9.5.1 Module requirement closure checklist
@@ -395,12 +422,29 @@ Each module packet must carry this checklist into development. A module may not
395
422
  | | | | | | | planned |
396
423
 
397
424
  ### 9.6 Development exit module verification matrix
398
- Complete this before leaving development for owner-side integrated verification. Use the stack-native local suite and targeted module tests; Docker and dockerized `./run_tests.sh` remain deferred to final packaging confirmation.
425
+ Complete this before claiming development readiness. Use the stack-native local suite and targeted module tests; Docker and dockerized `./run_tests.sh` remain deferred to the explicit final runtime confirmation step.
399
426
 
400
- | Module | Files reviewed by main lane | Module tests run | FE↔BE wiring verified | API/route coverage verified | E2E/platform proof status | Integration issues found/fixed | Exit status |
427
+ | Module | Files reviewed after integration | Module tests run | FE↔BE wiring verified | API/route coverage verified | E2E/platform proof status | Integration issues found/fixed | Exit status |
401
428
  |---|---|---|---|---|---|---|---|
402
429
  | | | | | | | | pending |
403
430
 
431
+ ### 9.7 Plan-Row Execution Ledger
432
+ Every actionable plan row must appear here or in an equivalent section-specific checklist. This ledger is the source of truth for whether `plan.md` was actually executed, not merely referenced.
433
+
434
+ | Plan section / row | Required implementation evidence | Required verification evidence | Responsible module/work package | Status | Blocker / risk / exception |
435
+ |---|---|---|---|---|---|
436
+ | | | | | planned | |
437
+
438
+ Clean development completion requires every row to be `complete`, `not applicable`, or explicitly `risk accepted` with evidence. Rows left `planned`, `in progress`, `delegated without receiver`, or `unverified` block completion.
439
+
440
+ ### 9.8 Coverage Closure Ledger
441
+
442
+ | Coverage surface | Required floor | Measured / closed result | Evidence path | Exception / compensating proof | Status |
443
+ |---|---|---|---|---|---|
444
+ | API true no-mock HTTP coverage for documented prompt-relevant endpoints | 100% | | | | planned |
445
+ | Unit-testable product-code line/branch coverage | >=90% where measurable | | | | planned |
446
+ | Planned E2E/platform-critical flow coverage | >=90% closed rows | | | | planned |
447
+
404
448
  ---
405
449
 
406
450
  ## 9.7 In-Scope Domains / Modules Summary
@@ -469,12 +513,19 @@ Every accepted requirement, clarification, design no-orphan row, and API spec ro
469
513
  - Retention/reporting/scheduling rules if applicable:
470
514
 
471
515
  ### 13.1 Prompt-Critical Rule Matrix
472
- Every prompt-critical rule must have an owner and a proof path. Include dates, thresholds, limits, states, scheduling/timing, retries, security constraints, public/private boundaries, actor identity, and operator promises.
516
+ Every prompt-critical rule must have a responsible implementation surface and a proof path. Include dates, thresholds, limits, states, scheduling/timing, retries, security constraints, public/private boundaries, actor identity, and operator promises.
473
517
 
474
- | Source prompt phrase / accepted clarification | Normalized rule | Implementation owner | Test owner | README/doc implication | P5 proof expectation | Status |
518
+ | Source prompt phrase / accepted clarification | Normalized rule | Implementation responsibility | Test responsibility | README/doc implication | Local proof expectation | Status |
475
519
  |---|---|---|---|---|---|---|
476
520
  | | | | | | | planned |
477
521
 
522
+ ### 13.1.1 Prompt Verb Acceptance Matrix
523
+ Every action verb in the original prompt or accepted clarification that implies product behavior must have an acceptance row. Use this to prevent broad nouns such as `dashboard`, `lifecycle`, `approval`, `export`, or `sync` from hiding missing actor actions.
524
+
525
+ | Source phrase / verb | Actor | Starting state / fixture | Action performed | Required state change or side effect | Observable UI/API/artifact result | Negative or edge behavior | Proof path | Owning module |
526
+ |---|---|---|---|---|---|---|---|---|
527
+ | | | | | | | | | |
528
+
478
529
  ### 13.2 Scaffold Versus Product Completion Matrix
479
530
 
480
531
  Use this to prevent setup shells from being mistaken for delivered behavior. A row may move to `done` only when the product-facing path is wired to real behavior and tests.
@@ -491,7 +542,7 @@ Use this to prevent setup shells from being mistaken for delivered behavior. A r
491
542
 
492
543
  ## 14. State / Lifecycle Rules
493
544
 
494
- | Entity / workflow | States | Valid transitions | Invalid transitions | Required cleanup |
545
+ | Entity / lifecycle | States | Valid transitions | Invalid transitions | Required cleanup |
495
546
  |---|---|---|---|---|
496
547
  | | | | | |
497
548
 
@@ -563,12 +614,22 @@ Complete only when the product has background work, timed behavior, async jobs,
563
614
  - If yes, where it is called:
564
615
  - If no, why not applicable:
565
616
 
617
+ ### 18.4 Seeded quick-start data
618
+ - Seeded data applicable? [yes/no]
619
+ - If yes, seed entrypoint and when it runs:
620
+ - Idempotency rule:
621
+ - Seeded accounts / roles:
622
+ - Seeded sample records / IDs / URLs:
623
+ - Main flows the seeded data unlocks:
624
+ - README quick-start section responsibility:
625
+ - If no, why the app is useful from empty state:
626
+
566
627
  ---
567
628
 
568
629
  ## 19. Delivery Review Requirement Matrices
569
630
 
570
631
  ### 19.0 Core Semantic Path Proof
571
- This is the centerpiece path that proves the product is not only a shell. `P5` cannot close unless this path is explicitly proven or the user accepts a named residual risk.
632
+ This is the centerpiece path that proves the product is not only a shell. Do not claim implementation readiness unless this path is explicitly proven or the user accepts a named residual risk.
572
633
 
573
634
  | Field | Required value |
574
635
  |---|---|
@@ -580,17 +641,17 @@ This is the centerpiece path that proves the product is not only a shell. `P5` c
580
641
  | Expected failure-mode behavior | |
581
642
  | Test file(s) proving the path | |
582
643
  | Manual/runtime proof if automated proof is not feasible | |
583
- | P5 acceptance evidence expected | |
644
+ | Local readiness evidence expected | |
584
645
 
585
646
  ### 19.0.1 Prompt-critical proof ledger
586
- Use this ledger during development and `P5` to prevent planned-but-missing drift.
647
+ Use this ledger during development and readiness hardening to prevent planned-but-missing drift.
587
648
 
588
649
  | Planned claim | Required proof | Current proof location | Gap | Decision |
589
650
  |---|---|---|---|---|
590
651
  | | | | | |
591
652
 
592
653
  ### 19.1 Prompt-fit and end-to-end delivery obligations matrix
593
- | Obligation | Planned repo evidence | Planned verification evidence | Owning lane |
654
+ | Obligation | Planned repo evidence | Planned verification evidence | Responsible module/work package |
594
655
  |---|---|---|---|
595
656
  | core business goal preserved | | | |
596
657
  | main user flows complete | | | |
@@ -599,18 +660,21 @@ Use this ledger during development and `P5` to prevent planned-but-missing drift
599
660
  | explicit non-goals are safe | | | |
600
661
 
601
662
  ### 19.2 Static reviewability and repository traceability obligations matrix
602
- | Obligation | Planned repo evidence | Planned verification evidence | Owning lane |
663
+ | Obligation | Planned repo evidence | Planned verification evidence | Responsible module/work package |
603
664
  |---|---|---|---|
604
665
  | startup path traceable | | | |
605
666
  | broad test path traceable | | | |
606
667
  | entry points visible | | | |
607
668
  | route registration visible | | | |
608
669
  | config path centralized | | | |
670
+ | README/docs/scripts/routes/config/examples/manifests/env examples statically consistent | | | |
609
671
  | module boundaries visible | | | |
672
+ | no excessive single-file implementation or fragmented snippets | | | |
673
+ | redundant/unnecessary files removed or justified | | | |
610
674
  | mock/local-data boundary visible | | | |
611
675
 
612
676
  ### 19.3 Runtime and broad-test contract obligations matrix
613
- | Obligation | Planned repo evidence | Planned verification evidence | Owning lane |
677
+ | Obligation | Planned repo evidence | Planned verification evidence | Responsible module/work package |
614
678
  |---|---|---|---|
615
679
  | exact runtime command honored | | | |
616
680
  | exact broad test command honored | | | |
@@ -624,7 +688,7 @@ Use this ledger during development and `P5` to prevent planned-but-missing drift
624
688
  | Linux portability is preserved | | | |
625
689
 
626
690
  ### 19.4 Logging, validation, and error-handling obligations matrix
627
- | Obligation | Planned repo evidence | Planned verification evidence | Owning lane |
691
+ | Obligation | Planned repo evidence | Planned verification evidence | Responsible module/work package |
628
692
  |---|---|---|---|
629
693
  | logging categories explicit | | | |
630
694
  | redaction enforced | | | |
@@ -633,7 +697,7 @@ Use this ledger during development and `P5` to prevent planned-but-missing drift
633
697
  | operator-visible failures traceable | | | |
634
698
 
635
699
  ### 19.5 Backend and API obligations matrix
636
- | Obligation | Planned repo evidence | Planned verification evidence | Owning lane |
700
+ | Obligation | Planned repo evidence | Planned verification evidence | Responsible module/work package |
637
701
  |---|---|---|---|
638
702
  | endpoint inventory complete | | | |
639
703
  | endpoint coverage summary computable | | | |
@@ -643,24 +707,28 @@ Use this ledger during development and `P5` to prevent planned-but-missing drift
643
707
  | API surfaces mapped to tests | | | |
644
708
 
645
709
  ### 19.6 Frontend and UX obligations matrix
646
- | Obligation | Planned repo evidence | Planned verification evidence | Owning lane |
710
+ | Obligation | Planned repo evidence | Planned verification evidence | Responsible module/work package |
647
711
  |---|---|---|---|
648
712
  | pages/routes/screens complete | | | |
713
+ | pages/routes/app shell connected | | | |
649
714
  | core interactions complete | | | |
715
+ | state/data flow organized and traceable | | | |
716
+ | service/adaptor/mock/storage boundaries clear | | | |
650
717
  | task closure real | | | |
651
718
  | UI states complete | | | |
652
719
  | validation/disclosure explicit | | | |
653
720
  | frontend test obligations explicit | | | |
654
721
  | static visual/interaction quality evidence planned | | | |
722
+ | pure-frontend mock/local data disclosed without implying backend integration where applicable | | | |
655
723
 
656
724
  ### 19.7 End-to-end and platform verification obligations matrix
657
- | Obligation | Planned repo evidence | Planned verification evidence | Owning lane |
725
+ | Obligation | Planned repo evidence | Planned verification evidence | Responsible module/work package |
658
726
  |---|---|---|---|
659
727
  | major flow E2E/platform coverage exists | | | |
660
728
  | platform-proof boundary is honest | | | |
661
729
 
662
730
  ### 19.8 README and documentation obligations matrix
663
- | Obligation | Planned repo evidence | Planned verification evidence | Owning lane |
731
+ | Obligation | Planned repo evidence | Planned verification evidence | Responsible module/work package |
664
732
  |---|---|---|---|
665
733
  | README exists at repo root | `README.md` | static file check | |
666
734
  | project type near top is one of `backend`, `fullstack`, `web`, `android`, `ios`, `desktop` | | static README check | |
@@ -671,17 +739,29 @@ Use this ledger during development and `P5` to prevent planned-but-missing drift
671
739
  | access method documented | | | |
672
740
  | verification method documented | | | |
673
741
  | credentials/no-auth disclosure correct | | | |
742
+ | quick-start seeded data or empty-state rationale documented | | | |
743
+ | Configuration and Environment Model explains local config, runtime defaults, Docker/Compose defaults, seeded/bootstrap data, auth/no-auth, no committed `.env` requirement, no manual package/runtime/database setup beyond documented host prerequisites, and config-sensitive verification | | static README/config check | |
744
+
674
745
  | no manual install/runtime setup dependency (`npm install`, `pip install`, `apt-get`, manual DB setup, `.env`, `.env.example`) | | static README/config check | |
675
746
  | mock/debug/demo disclosure honest | | | |
676
747
 
748
+ If no configuration is required, the README should state: `No manual environment configuration is required for local Docker startup. The delivered local runtime uses repo-controlled Docker/Compose defaults and bootstrap paths.`
749
+
677
750
  ---
678
751
 
679
752
  ## 20. Shared-File Contract
680
753
 
681
- | Shared file / surface | Why shared | Main-sequence owner | Module edit rule | Notes |
754
+ | Shared file / surface | Why shared | Primary-sequence responsibility | Module edit rule | Notes |
682
755
  |---|---|---|---|---|
683
756
  | | | | | |
684
757
 
758
+ ### 20.0 Canonical Cross-Surface Contracts
759
+ Use this when data crosses boundaries between UI, API, jobs, imports, exports, notifications, files, reports, clipboard payloads, templates, or third-party adapters. If not applicable, state `Not Applicable`.
760
+
761
+ | Contract / payload | Surfaces using it | Canonical schema or fields | Validation / normalization rule | Producer | Consumer | Tests / proof | README or docs implication |
762
+ |---|---|---|---|---|---|---|---|
763
+ | | | | | | | | |
764
+
685
765
  ### 20.1 Shared foundation to land before module execution
686
766
  -
687
767
  -
@@ -689,7 +769,7 @@ Use this ledger during development and `P5` to prevent planned-but-missing drift
689
769
 
690
770
  ### 20.2 Execution order rule
691
771
  - scaffold step lands first
692
- - shared foundation lands second in the main lane
772
+ - shared foundation lands second in the primary implementation sequence
693
773
  - ordered module packets execute after the shared foundation is stable
694
774
  - helper work is used only for low-risk independent module packets or verification tasks where it is safer than normal sequential execution
695
775
  - full-suite, E2E/platform-equivalent, and cross-module verification run after all modules are built
@@ -698,7 +778,7 @@ Use this ledger during development and `P5` to prevent planned-but-missing drift
698
778
 
699
779
  ## 21. Ordered Module Packet Execution Map
700
780
 
701
- Default to executing modules one by one in the main developer lane. Use helper work only when a module packet or verification task is genuinely independent and lower risk than serial execution. Every module packet must be complete enough to execute without inventing scope, files, tests, or proof.
781
+ Default to executing modules one by one in the primary implementation sequence. Use helper work only when a module packet or verification task is genuinely independent and lower risk than serial execution. Every module packet must be complete enough to execute without inventing scope, files, tests, or proof.
702
782
 
703
783
  | Module order | Module ID | Purpose | Dependencies | Owned files/directories | Explicitly shared files | Owned tests | FE↔BE/API proof | File/import/route/service/data existence proof | Verification before next module | Standard completion checklist | Helper work? |
704
784
  |---|---|---|---|---|---|---|---|---|---|---|---|
@@ -709,7 +789,7 @@ For each module, the plan should name the exact actor-facing flows, routes/scree
709
789
  Module completion evidence must use the standard completion checklist and include owned file/test list, exact verification commands/results, FE↔BE proof where applicable, real file/import/route/service/data proof, and a self-check that owned files are not placeholders or orphan code.
710
790
 
711
791
  ### 21.1 Module packet sequence
712
- This section must be directly executable after scaffold and shared foundation land. The developer should be able to complete each module packet in order without inventing ownership, tests, FE↔BE proof, file-existence proof, or report shape.
792
+ This section must be directly executable after scaffold and shared foundation land. The developer should be able to complete each module packet in order without inventing file responsibility, tests, FE↔BE proof, file-existence proof, or report shape.
713
793
 
714
794
  Every module packet below must include: module ID, requirement IDs, owned files/tests, forbidden shared files, frontend/backend/data/security/failure-state scope, verification commands, and required standard completion checklist.
715
795
 
@@ -722,26 +802,33 @@ Every module packet below must include: module ID, requirement IDs, owned files/
722
802
  Each module must produce this packet before the next module starts:
723
803
 
724
804
  - Module ID and status: complete, partial, or blocked
725
- - Files changed and ownership status
805
+ - Files changed and responsibility status
726
806
  - Requirement IDs closed, partial, or blocked
727
807
  - Tests added or updated and proof intent
728
808
  - Exact verification commands run and results
729
809
  - FE↔BE proof where applicable
730
810
  - Security and failure-state proof
731
- - Shared integration required from main lane
811
+ - Shared integration required from the primary sequence
732
812
  - Known risks or `none`
733
813
 
814
+ ### 21.1.1.1 Plan Section Closure Evidence
815
+ Every major completion claim must cite the accepted `plan.md` sections it closes. A development-complete report is incomplete if it only summarizes work without mapping back to section-addressable plan evidence.
816
+
817
+ | Plan section / matrix row | Closure evidence | Test or verification result | Residual risk or blocker | Decision |
818
+ |---|---|---|---|---|
819
+ | | | | | |
820
+
734
821
  ### 21.1.2 Module Integration Consumption Rule
735
822
 
736
823
  - Inspect each module completion checklist before moving to the next module
737
824
  - Reject incomplete or misleading packets instead of silently marking the module complete
738
825
  - Integrate helper work one module at a time when helper work is used
739
- - Wire shared surfaces only in the main lane unless this plan explicitly delegates them
826
+ - Wire shared surfaces only in the primary sequence unless this plan explicitly delegates them
740
827
  - Rerun each module's targeted tests in the integrated checkout after the module is wired
741
828
  - Record integrated evidence in the development completion report
742
829
 
743
830
  ### 21.2 Optional Helper Opportunities
744
- Use only when safer than normal sequential execution. Do not add branch/worktree mechanics unless the owner explicitly asks.
831
+ Use only when safer than normal sequential execution. Do not add branch/worktree mechanics unless explicitly required for implementation.
745
832
 
746
833
  | Module/task | Why helper execution is safer | Owned files/tests | Shared-file restrictions | Verification commands | Integration rule |
747
834
  |---|---|---|---|---|---|
@@ -803,38 +890,39 @@ For `web` and `fullstack`, frontend unit/component/state tests are required unle
803
890
  - Android broad proof avoids emulator as default? [yes/no]
804
891
  - iOS Linux proof boundary stated honestly? [yes/no]
805
892
  - Platform honesty notes:
806
- - Rerun triggers after major merges:
893
+ - Rerun triggers after major integrations:
807
894
 
808
- This section is about overall delivery verification planning. It must not imply that such proof runs before evaluation or before the owner-side pre-submission step.
895
+ This section is about overall delivery verification planning. It must not imply that every broad proof runs during ordinary implementation unless the active instruction asks for it.
809
896
 
810
897
  ---
811
898
 
812
899
  ## 26. Integrated Verification and Hardening Plan
813
900
 
814
- ### 27.1 Broad final proof
815
- - Static/non-Docker checks to use before evaluation:
816
- - Proceed-to-evaluation rule when those checks pass:
901
+ ### 26.1 Broad final proof
902
+ - Static/non-Docker checks to use before readiness handoff:
903
+ - Readiness handoff rule when those checks pass:
817
904
  - Major flow proof required:
818
905
  - Security/auth proof required:
819
906
  - Runtime/README contract proof required:
820
907
  - Coverage proof required:
821
- - Required internal `P5` issue-discovery loop: one evaluator subagent session; send the exact prepared evaluation packet once; run four additional same-session scans for additional prompt-fit/compliance, security, delivery, test-coverage, README/static-verifiability, mock/demo/fake-success, and frontend state/interaction issues not already reported; do not remediate between rounds; save five reports and extracted Blocker/High findings under `../.ai/p5-evaluation/`; consolidate, owner-analyze, route one developer remediation brief, verify fixes, then continue to the proceed-to-evaluation pause
822
- - Development-completion audit before `P5`: reread original prompt, approved requirements breakdown, accepted clarifications, accepted design, accepted API spec when applicable, accepted `plan.md`, `../docs/test-coverage.md`, `README.md`, code, tests, and current proof in one sweep; every module row must be marked complete, blocked, or explicitly risk-accepted before handoff
823
- - P3 Development Completion Report must include: modules completed, requirements closed, files/routes/tests changed, unit/API/integration/E2E/frontend-state commands run, FE↔BE/API/data proof, edge/failure/security proof, remaining risks, and exact handoff status for `P5`
908
+ - Independent readiness review expectation: after implementation, the repo must withstand a strict prompt-fit, security, delivery, test-coverage, README/static-verifiability, mock/demo/fake-success, and frontend state/interaction review. Fix all concrete Blocker/High-equivalent issues before handoff unless the user explicitly accepts the risk.
909
+ - Development-completion review: reread original prompt, approved requirements breakdown, accepted clarifications, accepted design, accepted API spec when applicable, accepted `plan.md`, `../docs/test-coverage.md`, `README.md`, code, tests, and current proof in one sweep; every module row must be marked complete, blocked, or explicitly risk-accepted before handoff
910
+ - Development Completion Report must include: modules completed, requirements closed, files/routes/tests changed, unit/API/integration/E2E/frontend-state commands run, FE↔BE/API/data proof, edge/failure/security proof, remaining risks, and exact readiness handoff status
911
+ - Development Completion Report must include Plan Section Closure Evidence for every major plan section or matrix row that claims completion
824
912
 
825
- ### 27.2 Narrow rerun strategy after discovered failures
826
- - Owner pass scope rule: every `P5` owner pass must reread the accepted design, accepted `../docs/api-spec.md` when applicable, `plan.md`, `README.md`, repo state, and current evidence in one full sweep rather than reducing follow-up passes to targeted sections only
913
+ ### 26.2 Narrow rerun strategy after discovered failures
914
+ - Review scope rule: each integrated readiness pass must reread the accepted design, accepted `../docs/api-spec.md` when applicable, `plan.md`, `README.md`, repo state, and current evidence in one full sweep rather than reducing follow-up passes to targeted sections only
827
915
  - Known failure class isolation rule:
828
916
  - Narrow rerun examples:
829
917
  - When to stop broad reruns:
830
918
  - Issue-batching rule for integrated verification:
831
919
  - Owner-side fixes allowed directly before reroute:
832
- - Remediation expectation: default to ordered fixes through the main developer lane; use helper work only for genuinely independent fixes where that is safer, and require per-bundle verification plus integrated proof before accepting completion
833
- - Maximum owner sweep expectation from `P5` before evaluation: 3
834
- - Maximum developer reroute expectation from `P5` before evaluation:
920
+ - Remediation expectation: default to ordered fixes through the active correction workstream, use helper work only for genuinely independent fixes where that is safer, and require per-bundle verification plus integrated proof before accepting completion
921
+ - Maximum integrated readiness sweep expectation before handoff: 3
922
+ - Maximum correction reroute expectation before handoff:
835
923
 
836
- ### 27.3 Optional UI/platform sanity checks before evaluation
837
- - When extra UI/platform sanity is actually needed before evaluation:
924
+ ### 26.3 Optional UI/platform sanity checks before handoff
925
+ - When extra UI/platform sanity is actually needed before handoff:
838
926
  - Screenshot review requirements if used:
839
927
  - Web/fullstack Playwright sanity if used:
840
928
  - Desktop Playwright Electron/Xvfb or equivalent sanity if used:
@@ -843,15 +931,15 @@ This section is about overall delivery verification planning. It must not imply
843
931
  - Visual regression notes if used:
844
932
  - Artifact storage notes if used:
845
933
 
846
- ### 27.4 Doc tightening after verification
934
+ ### 26.4 Doc tightening after verification
847
935
  - `README.md`:
848
936
  - `../docs/design.md` if applicable:
849
937
  - `../docs/api-spec.md` if applicable:
850
938
  - `../docs/test-coverage.md` if applicable:
851
939
 
852
- ### 27.5 Final close conditions
853
- - No major prompt, product, security, or runtime breakage remains obvious enough to make evaluation premature
854
- - Development-completion audit is done and the handoff to `P5` names exact passed checks, skipped/deferred checks, and residual risks
940
+ ### 26.5 Final close conditions
941
+ - No major prompt, product, security, or runtime breakage remains obvious enough to make handoff premature
942
+ - Development-completion review is done and the handoff names exact passed checks, skipped/deferred checks, and residual risks
855
943
  - Docker execution remains deferred until final packaging confirmation
856
944
 
857
945
  ---
@@ -863,7 +951,7 @@ This section is about overall delivery verification planning. It must not imply
863
951
  - Confirm mock/debug/demo disclosure:
864
952
  - Confirm shared-file coherence:
865
953
  - Confirm coverage docs and README match reality:
866
- - Confirm Docker files and dockerized `./run_tests.sh` are ready for final packaging confirmation, and confirm the separate local test harness is ready for ordinary development plus owner-side `P5`:
954
+ - Confirm Docker files and dockerized `./run_tests.sh` are ready for explicit final runtime confirmation, and confirm the separate local test harness is ready for ordinary development plus later readiness checks:
867
955
 
868
956
  ---
869
957
 
@@ -877,11 +965,11 @@ The execution plan is ready only when:
877
965
  - any helper work is clearly safer than sequential execution and has owned files/tests, shared-file restrictions, verification commands, and an integration rule
878
966
  - each module packet names the exact behaviors it must finish rather than vague implementation buckets
879
967
  - tests are attached to each meaningful surface
880
- - the full prompt-relevant app surface is mapped to planned test ownership early
968
+ - the full prompt-relevant app surface is mapped to planned test responsibility early
881
969
  - runtime/test/README contracts are exact
882
970
  - collision-safe runtime assumptions are explicit where relevant
883
971
  - final integrated verification includes exact README-documented external-contract proof when documented
884
- - the final integrated verification and hardening phase is explicit
885
- - merge/fan-in proof is explicit
972
+ - final integrated verification and hardening is explicit
973
+ - integration/fan-in proof is explicit
886
974
  - shell/demo/placeholder completion paths are made unacceptable by the plan structure itself
887
975
  - the plan is detailed enough to execute without broad reinvention