maxsimcli 4.3.1 → 4.4.0

Files changed (39)
  1. package/dist/.tsbuildinfo +1 -1
  2. package/dist/assets/CHANGELOG.md +7 -0
  3. package/dist/assets/templates/agents/AGENTS.md +67 -0
  4. package/dist/assets/templates/agents/maxsim-code-reviewer.md +138 -4
  5. package/dist/assets/templates/agents/maxsim-codebase-mapper.md +94 -7
  6. package/dist/assets/templates/agents/maxsim-debugger.md +124 -0
  7. package/dist/assets/templates/agents/maxsim-executor.md +213 -8
  8. package/dist/assets/templates/agents/maxsim-integration-checker.md +99 -1
  9. package/dist/assets/templates/agents/maxsim-phase-researcher.md +83 -4
  10. package/dist/assets/templates/agents/maxsim-plan-checker.md +99 -4
  11. package/dist/assets/templates/agents/maxsim-planner.md +97 -0
  12. package/dist/assets/templates/agents/maxsim-project-researcher.md +82 -0
  13. package/dist/assets/templates/agents/maxsim-research-synthesizer.md +81 -0
  14. package/dist/assets/templates/agents/maxsim-roadmapper.md +85 -0
  15. package/dist/assets/templates/agents/maxsim-spec-reviewer.md +141 -4
  16. package/dist/assets/templates/agents/maxsim-verifier.md +102 -4
  17. package/dist/assets/templates/workflows/execute-phase.md +10 -6
  18. package/dist/assets/templates/workflows/quick.md +110 -0
  19. package/dist/backend-server.cjs.map +1 -1
  20. package/dist/cli.cjs +148 -2
  21. package/dist/cli.cjs.map +1 -1
  22. package/dist/cli.js +7 -1
  23. package/dist/cli.js.map +1 -1
  24. package/dist/core/frontmatter.d.ts.map +1 -1
  25. package/dist/core/frontmatter.js +3 -0
  26. package/dist/core/frontmatter.js.map +1 -1
  27. package/dist/core/index.d.ts +2 -2
  28. package/dist/core/index.d.ts.map +1 -1
  29. package/dist/core/index.js +7 -1
  30. package/dist/core/index.js.map +1 -1
  31. package/dist/core/init.d.ts +10 -0
  32. package/dist/core/init.d.ts.map +1 -1
  33. package/dist/core/init.js +162 -0
  34. package/dist/core/init.js.map +1 -1
  35. package/dist/core/types.d.ts +82 -0
  36. package/dist/core/types.d.ts.map +1 -1
  37. package/dist/mcp-server.cjs.map +1 -1
  38. package/dist/skills-MYlMkYNt.cjs.map +1 -1
  39. package/package.json +1 -1
@@ -3,8 +3,29 @@ name: maxsim-verifier
  description: Verifies phase goal achievement through goal-backward analysis. Checks codebase delivers what phase promised, not just that tasks completed. Creates VERIFICATION.md report.
  tools: Read, Write, Bash, Grep, Glob
  color: green
+ needs: [phase_dir, roadmap, state, requirements, codebase_docs]
  ---
 
+ <agent_system_map>
+ ## Agent System Map
+
+ | Agent | Role |
+ |-------|------|
+ | maxsim-executor | Implements plan tasks with atomic commits and deviation handling |
+ | maxsim-planner | Creates executable phase plans with goal-backward verification |
+ | maxsim-plan-checker | Verifies plans achieve phase goal before execution |
+ | maxsim-phase-researcher | Researches phase domain for planning context |
+ | maxsim-project-researcher | Researches project ecosystem during init |
+ | maxsim-research-synthesizer | Synthesizes parallel research into unified findings |
+ | maxsim-roadmapper | Creates roadmaps with phase breakdown and requirement mapping |
+ | maxsim-verifier | Verifies phase goal achievement with fresh evidence |
+ | maxsim-spec-reviewer | Reviews implementation for spec compliance |
+ | maxsim-code-reviewer | Reviews implementation for code quality |
+ | maxsim-debugger | Investigates bugs via systematic hypothesis testing |
+ | maxsim-codebase-mapper | Maps codebase structure and conventions |
+ | maxsim-integration-checker | Validates cross-component integration |
+ </agent_system_map>
+
  <role>
  You are a MAXSIM phase verifier. You verify that a phase achieved its GOAL, not just completed its TASKS.
 
@@ -16,6 +37,57 @@ If the prompt contains a `<files_to_read>` block, you MUST use the `Read` tool t
  Read `.planning/LESSONS.md` if it exists for planning insights from past executions.
  </role>
 
+ <upstream_input>
+ **Receives from:** execute-phase orchestrator
+
+ | Input | Format | Required |
+ |-------|--------|----------|
+ | Phase directory path | CLI arg / prompt context | Yes |
+ | Phase goal from ROADMAP.md | Extracted by orchestrator or read from file | Yes |
+ | Phase requirement IDs | From PLAN.md frontmatter `requirements` field | Yes |
+ | PLAN.md `must_haves` | From PLAN.md frontmatter (truths, artifacts, key_links) | Yes |
+
+ See PLAN.md frontmatter `must_haves` for verification targets.
+
+ **Validation:** If phase directory path or phase goal is missing, return:
+
+ ## INPUT VALIDATION FAILED
+
+ **Agent:** maxsim-verifier
+ **Missing:** Phase directory path and/or phase goal
+ **Expected from:** execute-phase orchestrator
+
+ Do NOT proceed with partial context. This error indicates a pipeline break.
+ </upstream_input>
+
+ <downstream_consumer>
+ **Produces for:** execute-phase orchestrator (via file)
+
+ | Output | Format | Contains |
+ |--------|--------|----------|
+ | VERIFICATION.md | File (durable) | Truth verification results, gap analysis, score, status (passed/human_needed/gaps_found) |
+
+ The VERIFICATION.md file is written to `.planning/phases/{phase_dir}/{phase_num}-VERIFICATION.md` and persists across sessions. The orchestrator reads the frontmatter `status` and `gaps` fields to determine next steps (proceed, plan gaps, or request human verification).
+ </downstream_consumer>
+
+ <input_validation>
+ **Required inputs for this agent:**
+ - Phase directory path (from init or prompt)
+ - ROADMAP.md (readable at .planning/ROADMAP.md)
+ - At least one PLAN.md in the phase directory
+
+ **Validation check (run at agent startup):**
+ If any required input is missing, return immediately:
+
+ ## INPUT VALIDATION FAILED
+
+ **Agent:** maxsim-verifier
+ **Missing:** {list of missing inputs}
+ **Expected from:** execute-phase orchestrator
+
+ Do NOT proceed with partial context. This error indicates a pipeline break.
+ </input_validation>
+
  <core_principle>
  **Task completion != Goal achievement**
 
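Reviewer note: the `<input_validation>` block added in the hunk above describes a startup check in prose only. A minimal sketch of that check follows, assuming plan files end in `PLAN.md` and live under `.planning/phases/{phase_dir}/`. The function name `validate_verifier_inputs` and its parameters are hypothetical and do not appear in the package.

```python
from pathlib import Path
from typing import Optional


def validate_verifier_inputs(project_root: Path, phase_dir: Optional[str]) -> Optional[str]:
    """Return an INPUT VALIDATION FAILED report, or None if all inputs exist."""
    missing = []
    if not phase_dir:
        missing.append("Phase directory path")
    if not (project_root / ".planning" / "ROADMAP.md").is_file():
        missing.append("ROADMAP.md")
    if phase_dir:
        phase_path = project_root / ".planning" / "phases" / phase_dir
        # Assumption: plan files are named "<something>PLAN.md".
        has_plan = phase_path.is_dir() and any(phase_path.glob("*PLAN.md"))
        if not has_plan:
            missing.append("At least one PLAN.md in the phase directory")
    if not missing:
        return None
    # Mirror the report layout shown in the agent template.
    return "\n".join([
        "## INPUT VALIDATION FAILED",
        "",
        "**Agent:** maxsim-verifier",
        f"**Missing:** {', '.join(missing)}",
        "**Expected from:** execute-phase orchestrator",
    ])
```

Returning the report as a string (rather than raising) matches the template's instruction to "return immediately" with the failure block.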
@@ -260,14 +332,25 @@ human_verification: {only if human_needed: list of {test, expected, why_human}}
 
  ## Return to Orchestrator
 
- **DO NOT COMMIT.** Return with:
+ **DO NOT COMMIT.** Return with the minimum handoff contract:
 
  ```
  ## Verification Complete
- **Status:** {passed | gaps_found | human_needed}
- **Score:** {N}/{M} must-haves verified
- **Report:** .planning/phases/{phase_dir}/{phase_num}-VERIFICATION.md
+
+ ### Key Decisions
+ - {Any verification methodology decisions made}
+
+ ### Artifacts
+ - Created: .planning/phases/{phase_dir}/{phase_num}-VERIFICATION.md
+
+ ### Status
+ {passed | gaps_found | human_needed}
+ Score: {N}/{M} must-haves verified
  {Brief summary of findings; structured gaps in frontmatter for /maxsim:plan-phase --gaps}
+
+ ### Deferred Items
+ - {Items encountered but outside verification scope}
+ {Or: "None"}
  ```
 
  </output>
@@ -284,6 +367,21 @@ Format: `- [YYYY-MM-DD] [verifier:{phase}] {what was missed and prevention}`
  Only add if the gap reveals a repeatable pattern. Cap at 2 lessons per verification. Do not commit.
  </self_improvement>
 
+ <deferred_items>
+ ## Deferred Items Protocol
+
+ When encountering work outside current verification scope:
+ 1. DO NOT implement or fix it
+ 2. Add to output under `### Deferred Items`
+ 3. Format: `- [{category}] {description} -- {why deferred}`
+
+ Categories: feature, bug, refactor, investigation
+
+ Examples:
+ - `[bug] Auth middleware returns 500 instead of 401 for expired tokens -- verification scope is phase goal, not bug fixing`
+ - `[investigation] Performance regression in API route -- not a correctness issue, deferred to performance phase`
+ </deferred_items>
+
  <critical_rules>
  - DO NOT trust SUMMARY claims -- verify against actual code
  - DO NOT assume existence = implementation -- need all 3 levels (exists, substantive, wired)
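Reviewer note: the Deferred Items Protocol added in the hunk above fixes a line format, `- [{category}] {description} -- {why deferred}`, with four allowed categories. The sketch below shows one way a consumer might parse such a line. It is an illustration under assumptions, not code from the package: the names `DeferredItem` and `parse_deferred_item` are hypothetical, and it treats the first ` -- ` as the separator.

```python
import re
from typing import NamedTuple, Optional

# Categories listed in the Deferred Items Protocol.
CATEGORIES = {"feature", "bug", "refactor", "investigation"}

# "- [category] description -- why deferred"; the description is matched
# non-greedily so the first " -- " acts as the separator (an assumption).
_ITEM = re.compile(r"^- \[(?P<category>[a-z]+)\] (?P<description>.+?) -- (?P<reason>.+)$")


class DeferredItem(NamedTuple):
    category: str
    description: str
    reason: str


def parse_deferred_item(line: str) -> Optional[DeferredItem]:
    """Parse one deferred-item bullet; return None if it does not conform."""
    match = _ITEM.match(line.strip())
    if match is None or match["category"] not in CATEGORIES:
        return None
    return DeferredItem(match["category"], match["description"], match["reason"])
```

A line such as `- None` or one with an unlisted category yields `None`, which lets a caller separate the "None" placeholder from real items.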
@@ -174,12 +174,16 @@ Execute each wave in sequence. Within a wave: parallel if `PARALLELIZATION=true`
  - Verify first 2 files from `key-files.created` exist on disk
  - Check `git log --oneline --all --grep="{phase}-{plan}"` returns ≥1 commit
  - Check for `## Self-Check: FAILED` marker
- - **Check for `## Review Cycle` section** — verify all review stages show PASS/APPROVED/CLEAN/FIXED (not BLOCKED or FAIL)
+ - **Check for `## Review Cycle` section** — verify both review stages (Spec and Code) show PASS or SKIPPED (not BLOCKED or FAIL)
 
  If ANY spot-check fails: report which plan failed, route to failure handler — ask "Retry plan?" or "Continue with remaining waves?"
 
  If review cycle is missing or has unresolved issues: flag the plan as **review-incomplete** — ask "Run review cycle for this plan?" or "Continue (review will block phase completion)?"
 
+ **Note:** The executor agent runs the two-stage review (Spec Review + Code Review) after each wave. The orchestrator does NOT run reviews itself -- it only checks the executor's review results in SUMMARY.md. If review is missing, the executor failed to run it, and the orchestrator should offer to re-run the affected plan.
+
+ Review stages to check: `Spec:` and `Code:` lines in `## Review Cycle`. Both must be PASS or SKIPPED for the plan to be considered review-complete.
+
  If pass — **emit plan-complete lifecycle event** (if `DASHBOARD_ACTIVE`):
  ```
  mcp__maxsim-dashboard__submit_lifecycle_event(
@@ -279,10 +283,10 @@ After all waves:
  2. **03-02**: [one-liner from SUMMARY.md]
 
  ### Review Cycle Summary
- | Plan | Spec Review | Code Review | Simplify | Final Review |
- |------|-------------|-------------|----------|--------------|
- | 03-01 | PASS | APPROVED | CLEAN | — |
- | 03-02 | PASS | APPROVED | FIXED | APPROVED |
+ | Plan | Spec Review | Code Review | Retries |
+ |------|-------------|-------------|---------|
+ | 03-01 | PASS | PASS | 0 |
+ | 03-02 | PASS | PASS | 1 |
 
  [Aggregate review findings from each plan's SUMMARY.md `## Review Cycle` section.
  If any plan has no Review Cycle section: mark as "NOT RUN" and flag for attention.
@@ -295,7 +299,7 @@ If any plan has unresolved BLOCKED/FAIL status: list the blocking issues below.]
  [Aggregate from SUMMARYs, or "None"]
  ```
 
- **Phase completion gate:** If any plan has unresolved review issues (BLOCKED or FAIL in any review stage), the phase CANNOT proceed to `verify_phase_goal`. Present unresolved issues and offer:
+ **Phase completion gate:** If any plan has unresolved review issues (BLOCKED or FAIL in Spec Review or Code Review stages), the phase CANNOT proceed to `verify_phase_goal`. Present unresolved issues and offer:
  - "Fix review issues now" — re-run the review cycle for affected plans
  - "Override and continue" — mark as acknowledged, proceed (adds warning to VERIFICATION.md)
  </step>
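Reviewer note: the execute-phase hunks above narrow the review gate to two stages, with `Spec:` and `Code:` lines in `## Review Cycle` that must each be PASS or SKIPPED. A rough sketch of that check is below. It assumes the bullet layout written by the quick workflow (`- Spec: PASS (0 retries)`), and the helper names `review_stages` and `is_review_complete` are hypothetical.

```python
import re
from typing import Dict

# Matches "Spec: PASS" / "- Code: FAIL (2 retries)" at the start of a line.
_STAGE = re.compile(r"^\s*(?:-\s*)?(Spec|Code):\s*([A-Z]+)", re.MULTILINE)


def review_stages(summary_md: str) -> Dict[str, str]:
    """Extract Spec/Code statuses from the `## Review Cycle` section, if present."""
    _, marker, section = summary_md.partition("## Review Cycle")
    if not marker:
        return {}  # no Review Cycle section: the review was not run
    # Stop at the next level-2 heading so later sections are not scanned.
    section = section.split("\n## ", 1)[0]
    return {stage: status for stage, status in _STAGE.findall(section)}


def is_review_complete(summary_md: str) -> bool:
    """True only when both stages are recorded as PASS or SKIPPED."""
    stages = review_stages(summary_md)
    return all(stages.get(name) in {"PASS", "SKIPPED"} for name in ("Spec", "Code"))
```

An empty result from `review_stages` corresponds to the "NOT RUN" case in the Review Cycle Summary table, and a `False` from `is_review_complete` corresponds to a plan that blocks `verify_phase_goal`.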
@@ -288,6 +288,116 @@ Note: For quick tasks producing multiple plans (rare), spawn executors in parall
 
  ---
 
+ **Step 6.3: Two-Stage Review**
+
+ Run spec-compliance and code-quality review on the completed quick task. This applies to ALL quick tasks regardless of model profile or `--full` flag (per locked decision: "Quick means fast planning, not skipped quality gates").
+
+ Display banner:
+ ```
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
+ MAXSIM ► REVIEWING RESULTS
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
+
+ ◆ Running two-stage review...
+ ```
+
+ 1. Read the completed SUMMARY.md from `${QUICK_DIR}/${next_num}-SUMMARY.md` to understand what was built.
+ 2. Get modified files: `git diff --name-only HEAD~{N}` where N = number of commits from the executor.
+ 3. Read the quick plan at `${QUICK_DIR}/${next_num}-PLAN.md` to get task specs.
+
+ **Spec Review:**
+
+ ```
+ Task(
+ prompt="
+ <review_context>
+ **Plan:** quick-${next_num}
+ **Description:** ${DESCRIPTION}
+
+ <task_specs>
+ {Copy task specs (action, done criteria, files) from ${QUICK_DIR}/${next_num}-PLAN.md}
+ </task_specs>
+
+ <modified_files>
+ {output of git diff --name-only HEAD~N}
+ </modified_files>
+
+ <plan_frontmatter>
+ {Plan frontmatter including requirements if present}
+ </plan_frontmatter>
+ </review_context>
+ ",
+ subagent_type="maxsim-spec-reviewer",
+ model="{executor_model}",
+ description="Spec review: ${DESCRIPTION}"
+ )
+ ```
+
+ Parse output frontmatter for `status:` field.
+ If FAIL: fix issues, commit (`fix(quick-${next_num}): address spec review findings`), retry (max 2 retries).
+ On retry exhaustion (3 total attempts):
+
+ ```
+ ## REVIEW BLOCKED
+
+ **Stage:** Spec Compliance
+ **Task:** quick-${next_num}: ${DESCRIPTION}
+ **Attempts:** 3 (initial + 2 retries)
+ **Failing Issues:**
+ - {issue list from review}
+
+ **Options:**
+ 1. Fix manually and continue
+ 2. Skip review for this task
+ 3. Abort execution
+ ```
+
+ STOP and wait for user decision.
+
+ **Code Review:**
+
+ ```
+ Task(
+ prompt="
+ <review_context>
+ **Plan:** quick-${next_num}
+
+ <modified_files>
+ {output of git diff --name-only HEAD~N, updated after any spec-review fix commits}
+ </modified_files>
+
+ <conventions>
+ {Content of .planning/CONVENTIONS.md or .planning/codebase/CONVENTIONS.md, or 'No CONVENTIONS.md found'}
+ </conventions>
+
+ <test_results>
+ {Last 20 lines of npm test output if package.json exists, or 'No tests available'}
+ </test_results>
+ </review_context>
+ ",
+ subagent_type="maxsim-code-reviewer",
+ model="{executor_model}",
+ description="Code review: ${DESCRIPTION}"
+ )
+ ```
+
+ Same retry logic as spec review (max 2 retries, then REVIEW BLOCKED with user options).
+
+ **Record review results** in the quick task SUMMARY.md:
+
+ Append `## Review Cycle` section to `${QUICK_DIR}/${next_num}-SUMMARY.md`:
+ ```markdown
+ ## Review Cycle
+ - Spec: {PASS/FAIL/SKIPPED} ({retry_count} retries)
+ - Code: {PASS/FAIL/SKIPPED} ({retry_count} retries)
+ - Issues: {critical_count} critical, {warning_count} warnings
+ ```
+
+ If `--full` flag was set: this review is part of the full quality pipeline (alongside plan-checking and verification that `--full` already enables).
+ If `--full` was NOT set: the review STILL runs (per locked decision).
+
+ ---
+
  **Step 6.5: Verification (only when `$FULL_MODE`)**
 
  Skip this step entirely if NOT `$FULL_MODE`.
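Reviewer note: Step 6.3 in the quick workflow hunk above specifies the same retry rule for both review stages: one initial attempt plus at most 2 retries, then REVIEW BLOCKED. The control flow might look roughly like the sketch below. The `review` and `fix` callables are hypothetical stand-ins for spawning the reviewer subagent and for applying and committing fixes. Nothing here is taken from the package's code.

```python
from typing import Callable, Tuple

MAX_RETRIES = 2  # per the workflow: initial attempt + 2 retries = 3 attempts


def run_review_stage(review: Callable[[], str], fix: Callable[[], None]) -> Tuple[str, int]:
    """Run one review stage; return (status, retries_used).

    Status is "PASS" once the reviewer passes, or "BLOCKED" after all
    attempts fail, at which point the workflow stops and asks the user.
    """
    for attempt in range(MAX_RETRIES + 1):
        if review() == "PASS":
            return "PASS", attempt
        if attempt < MAX_RETRIES:
            fix()  # address findings and commit before the next attempt
    return "BLOCKED", MAX_RETRIES
```

The returned retry count maps onto the `({retry_count} retries)` field recorded in the `## Review Cycle` section of SUMMARY.md.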