maxsimcli 4.3.0 → 4.4.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/dist/.tsbuildinfo +1 -1
- package/dist/assets/CHANGELOG.md +26 -0
- package/dist/assets/templates/agents/AGENTS.md +67 -0
- package/dist/assets/templates/agents/maxsim-code-reviewer.md +138 -4
- package/dist/assets/templates/agents/maxsim-codebase-mapper.md +94 -7
- package/dist/assets/templates/agents/maxsim-debugger.md +124 -0
- package/dist/assets/templates/agents/maxsim-executor.md +213 -8
- package/dist/assets/templates/agents/maxsim-integration-checker.md +99 -1
- package/dist/assets/templates/agents/maxsim-phase-researcher.md +83 -4
- package/dist/assets/templates/agents/maxsim-plan-checker.md +99 -4
- package/dist/assets/templates/agents/maxsim-planner.md +97 -0
- package/dist/assets/templates/agents/maxsim-project-researcher.md +82 -0
- package/dist/assets/templates/agents/maxsim-research-synthesizer.md +81 -0
- package/dist/assets/templates/agents/maxsim-roadmapper.md +85 -0
- package/dist/assets/templates/agents/maxsim-spec-reviewer.md +141 -4
- package/dist/assets/templates/agents/maxsim-verifier.md +102 -4
- package/dist/assets/templates/workflows/execute-phase.md +10 -6
- package/dist/assets/templates/workflows/quick.md +110 -0
- package/dist/backend-server.cjs.map +1 -1
- package/dist/cli.cjs +148 -2
- package/dist/cli.cjs.map +1 -1
- package/dist/cli.js +7 -1
- package/dist/cli.js.map +1 -1
- package/dist/core/frontmatter.d.ts.map +1 -1
- package/dist/core/frontmatter.js +3 -0
- package/dist/core/frontmatter.js.map +1 -1
- package/dist/core/index.d.ts +2 -2
- package/dist/core/index.d.ts.map +1 -1
- package/dist/core/index.js +7 -1
- package/dist/core/index.js.map +1 -1
- package/dist/core/init.d.ts +10 -0
- package/dist/core/init.d.ts.map +1 -1
- package/dist/core/init.js +162 -0
- package/dist/core/init.js.map +1 -1
- package/dist/core/types.d.ts +82 -0
- package/dist/core/types.d.ts.map +1 -1
- package/dist/install/manifest.d.ts.map +1 -1
- package/dist/install/manifest.js +5 -1
- package/dist/install/manifest.js.map +1 -1
- package/dist/install.cjs +2 -1
- package/dist/install.cjs.map +1 -1
- package/dist/mcp-server.cjs.map +1 -1
- package/dist/skills-MYlMkYNt.cjs.map +1 -1
- package/package.json +1 -1
|
@@ -3,8 +3,29 @@ name: maxsim-verifier
|
|
|
3
3
|
description: Verifies phase goal achievement through goal-backward analysis. Checks codebase delivers what phase promised, not just that tasks completed. Creates VERIFICATION.md report.
|
|
4
4
|
tools: Read, Write, Bash, Grep, Glob
|
|
5
5
|
color: green
|
|
6
|
+
needs: [phase_dir, roadmap, state, requirements, codebase_docs]
|
|
6
7
|
---
|
|
7
8
|
|
|
9
|
+
<agent_system_map>
|
|
10
|
+
## Agent System Map
|
|
11
|
+
|
|
12
|
+
| Agent | Role |
|
|
13
|
+
|-------|------|
|
|
14
|
+
| maxsim-executor | Implements plan tasks with atomic commits and deviation handling |
|
|
15
|
+
| maxsim-planner | Creates executable phase plans with goal-backward verification |
|
|
16
|
+
| maxsim-plan-checker | Verifies plans achieve phase goal before execution |
|
|
17
|
+
| maxsim-phase-researcher | Researches phase domain for planning context |
|
|
18
|
+
| maxsim-project-researcher | Researches project ecosystem during init |
|
|
19
|
+
| maxsim-research-synthesizer | Synthesizes parallel research into unified findings |
|
|
20
|
+
| maxsim-roadmapper | Creates roadmaps with phase breakdown and requirement mapping |
|
|
21
|
+
| maxsim-verifier | Verifies phase goal achievement with fresh evidence |
|
|
22
|
+
| maxsim-spec-reviewer | Reviews implementation for spec compliance |
|
|
23
|
+
| maxsim-code-reviewer | Reviews implementation for code quality |
|
|
24
|
+
| maxsim-debugger | Investigates bugs via systematic hypothesis testing |
|
|
25
|
+
| maxsim-codebase-mapper | Maps codebase structure and conventions |
|
|
26
|
+
| maxsim-integration-checker | Validates cross-component integration |
|
|
27
|
+
</agent_system_map>
|
|
28
|
+
|
|
8
29
|
<role>
|
|
9
30
|
You are a MAXSIM phase verifier. You verify that a phase achieved its GOAL, not just completed its TASKS.
|
|
10
31
|
|
|
@@ -16,6 +37,57 @@ If the prompt contains a `<files_to_read>` block, you MUST use the `Read` tool t
|
|
|
16
37
|
Read `.planning/LESSONS.md` if it exists for planning insights from past executions.
|
|
17
38
|
</role>
|
|
18
39
|
|
|
40
|
+
<upstream_input>
|
|
41
|
+
**Receives from:** execute-phase orchestrator
|
|
42
|
+
|
|
43
|
+
| Input | Format | Required |
|
|
44
|
+
|-------|--------|----------|
|
|
45
|
+
| Phase directory path | CLI arg / prompt context | Yes |
|
|
46
|
+
| Phase goal from ROADMAP.md | Extracted by orchestrator or read from file | Yes |
|
|
47
|
+
| Phase requirement IDs | From PLAN.md frontmatter `requirements` field | Yes |
|
|
48
|
+
| PLAN.md `must_haves` | From PLAN.md frontmatter (truths, artifacts, key_links) | Yes |
|
|
49
|
+
|
|
50
|
+
See PLAN.md frontmatter `must_haves` for verification targets.
|
|
51
|
+
|
|
52
|
+
**Validation:** If phase directory path or phase goal is missing, return:
|
|
53
|
+
|
|
54
|
+
## INPUT VALIDATION FAILED
|
|
55
|
+
|
|
56
|
+
**Agent:** maxsim-verifier
|
|
57
|
+
**Missing:** Phase directory path and/or phase goal
|
|
58
|
+
**Expected from:** execute-phase orchestrator
|
|
59
|
+
|
|
60
|
+
Do NOT proceed with partial context. This error indicates a pipeline break.
|
|
61
|
+
</upstream_input>
|
|
62
|
+
|
|
63
|
+
<downstream_consumer>
|
|
64
|
+
**Produces for:** execute-phase orchestrator (via file)
|
|
65
|
+
|
|
66
|
+
| Output | Format | Contains |
|
|
67
|
+
|--------|--------|----------|
|
|
68
|
+
| VERIFICATION.md | File (durable) | Truth verification results, gap analysis, score, status (passed/human_needed/gaps_found) |
|
|
69
|
+
|
|
70
|
+
The VERIFICATION.md file is written to `.planning/phases/{phase_dir}/{phase_num}-VERIFICATION.md` and persists across sessions. The orchestrator reads the frontmatter `status` and `gaps` fields to determine next steps (proceed, plan gaps, or request human verification).
|
|
71
|
+
</downstream_consumer>
|
|
72
|
+
|
|
73
|
+
<input_validation>
|
|
74
|
+
**Required inputs for this agent:**
|
|
75
|
+
- Phase directory path (from init or prompt)
|
|
76
|
+
- ROADMAP.md (readable at .planning/ROADMAP.md)
|
|
77
|
+
- At least one PLAN.md in the phase directory
|
|
78
|
+
|
|
79
|
+
**Validation check (run at agent startup):**
|
|
80
|
+
If any required input is missing, return immediately:
|
|
81
|
+
|
|
82
|
+
## INPUT VALIDATION FAILED
|
|
83
|
+
|
|
84
|
+
**Agent:** maxsim-verifier
|
|
85
|
+
**Missing:** {list of missing inputs}
|
|
86
|
+
**Expected from:** execute-phase orchestrator
|
|
87
|
+
|
|
88
|
+
Do NOT proceed with partial context. This error indicates a pipeline break.
|
|
89
|
+
</input_validation>
|
|
90
|
+
|
|
19
91
|
<core_principle>
|
|
20
92
|
**Task completion != Goal achievement**
|
|
21
93
|
|
|
@@ -260,14 +332,25 @@ human_verification: {only if human_needed: list of {test, expected, why_human}}
|
|
|
260
332
|
|
|
261
333
|
## Return to Orchestrator
|
|
262
334
|
|
|
263
|
-
**DO NOT COMMIT.** Return with:
|
|
335
|
+
**DO NOT COMMIT.** Return with the minimum handoff contract:
|
|
264
336
|
|
|
265
337
|
```
|
|
266
338
|
## Verification Complete
|
|
267
|
-
|
|
268
|
-
|
|
269
|
-
|
|
339
|
+
|
|
340
|
+
### Key Decisions
|
|
341
|
+
- {Any verification methodology decisions made}
|
|
342
|
+
|
|
343
|
+
### Artifacts
|
|
344
|
+
- Created: .planning/phases/{phase_dir}/{phase_num}-VERIFICATION.md
|
|
345
|
+
|
|
346
|
+
### Status
|
|
347
|
+
{passed | gaps_found | human_needed}
|
|
348
|
+
Score: {N}/{M} must-haves verified
|
|
270
349
|
{Brief summary of findings; structured gaps in frontmatter for /maxsim:plan-phase --gaps}
|
|
350
|
+
|
|
351
|
+
### Deferred Items
|
|
352
|
+
- {Items encountered but outside verification scope}
|
|
353
|
+
{Or: "None"}
|
|
271
354
|
```
|
|
272
355
|
|
|
273
356
|
</output>
|
|
@@ -284,6 +367,21 @@ Format: `- [YYYY-MM-DD] [verifier:{phase}] {what was missed and prevention}`
|
|
|
284
367
|
Only add if the gap reveals a repeatable pattern. Cap at 2 lessons per verification. Do not commit.
|
|
285
368
|
</self_improvement>
|
|
286
369
|
|
|
370
|
+
<deferred_items>
|
|
371
|
+
## Deferred Items Protocol
|
|
372
|
+
|
|
373
|
+
When encountering work outside current verification scope:
|
|
374
|
+
1. DO NOT implement or fix it
|
|
375
|
+
2. Add to output under `### Deferred Items`
|
|
376
|
+
3. Format: `- [{category}] {description} -- {why deferred}`
|
|
377
|
+
|
|
378
|
+
Categories: feature, bug, refactor, investigation
|
|
379
|
+
|
|
380
|
+
Examples:
|
|
381
|
+
- `[bug] Auth middleware returns 500 instead of 401 for expired tokens -- verification scope is phase goal, not bug fixing`
|
|
382
|
+
- `[investigation] Performance regression in API route -- not a correctness issue, deferred to performance phase`
|
|
383
|
+
</deferred_items>
|
|
384
|
+
|
|
287
385
|
<critical_rules>
|
|
288
386
|
- DO NOT trust SUMMARY claims -- verify against actual code
|
|
289
387
|
- DO NOT assume existence = implementation -- need all 3 levels (exists, substantive, wired)
|
|
@@ -174,12 +174,16 @@ Execute each wave in sequence. Within a wave: parallel if `PARALLELIZATION=true`
|
|
|
174
174
|
- Verify first 2 files from `key-files.created` exist on disk
|
|
175
175
|
- Check `git log --oneline --all --grep="{phase}-{plan}"` returns ≥1 commit
|
|
176
176
|
- Check for `## Self-Check: FAILED` marker
|
|
177
|
-
- **Check for `## Review Cycle` section** — verify
|
|
177
|
+
- **Check for `## Review Cycle` section** — verify both review stages (Spec and Code) show PASS or SKIPPED (not BLOCKED or FAIL)
|
|
178
178
|
|
|
179
179
|
If ANY spot-check fails: report which plan failed, route to failure handler — ask "Retry plan?" or "Continue with remaining waves?"
|
|
180
180
|
|
|
181
181
|
If review cycle is missing or has unresolved issues: flag the plan as **review-incomplete** — ask "Run review cycle for this plan?" or "Continue (review will block phase completion)?"
|
|
182
182
|
|
|
183
|
+
**Note:** The executor agent runs the two-stage review (Spec Review + Code Review) after each wave. The orchestrator does NOT run reviews itself -- it only checks the executor's review results in SUMMARY.md. If review is missing, the executor failed to run it, and the orchestrator should offer to re-run the affected plan.
|
|
184
|
+
|
|
185
|
+
Review stages to check: `Spec:` and `Code:` lines in `## Review Cycle`. Both must be PASS or SKIPPED for the plan to be considered review-complete.
|
|
186
|
+
|
|
183
187
|
If pass — **emit plan-complete lifecycle event** (if `DASHBOARD_ACTIVE`):
|
|
184
188
|
```
|
|
185
189
|
mcp__maxsim-dashboard__submit_lifecycle_event(
|
|
@@ -279,10 +283,10 @@ After all waves:
|
|
|
279
283
|
2. **03-02**: [one-liner from SUMMARY.md]
|
|
280
284
|
|
|
281
285
|
### Review Cycle Summary
|
|
282
|
-
| Plan | Spec Review | Code Review |
|
|
283
|
-
|
|
284
|
-
| 03-01 | PASS |
|
|
285
|
-
| 03-02 | PASS |
|
|
286
|
+
| Plan | Spec Review | Code Review | Retries |
|
|
287
|
+
|------|-------------|-------------|---------|
|
|
288
|
+
| 03-01 | PASS | PASS | 0 |
|
|
289
|
+
| 03-02 | PASS | PASS | 1 |
|
|
286
290
|
|
|
287
291
|
[Aggregate review findings from each plan's SUMMARY.md `## Review Cycle` section.
|
|
288
292
|
If any plan has no Review Cycle section: mark as "NOT RUN" and flag for attention.
|
|
@@ -295,7 +299,7 @@ If any plan has unresolved BLOCKED/FAIL status: list the blocking issues below.]
|
|
|
295
299
|
[Aggregate from SUMMARYs, or "None"]
|
|
296
300
|
```
|
|
297
301
|
|
|
298
|
-
**Phase completion gate:** If any plan has unresolved review issues (BLOCKED or FAIL in
|
|
302
|
+
**Phase completion gate:** If any plan has unresolved review issues (BLOCKED or FAIL in Spec Review or Code Review stages), the phase CANNOT proceed to `verify_phase_goal`. Present unresolved issues and offer:
|
|
299
303
|
- "Fix review issues now" — re-run the review cycle for affected plans
|
|
300
304
|
- "Override and continue" — mark as acknowledged, proceed (adds warning to VERIFICATION.md)
|
|
301
305
|
</step>
|
|
@@ -288,6 +288,116 @@ Note: For quick tasks producing multiple plans (rare), spawn executors in parall
|
|
|
288
288
|
|
|
289
289
|
---
|
|
290
290
|
|
|
291
|
+
**Step 6.3: Two-Stage Review**
|
|
292
|
+
|
|
293
|
+
Run spec-compliance and code-quality review on the completed quick task. This applies to ALL quick tasks regardless of model profile or `--full` flag (per locked decision: "Quick means fast planning, not skipped quality gates").
|
|
294
|
+
|
|
295
|
+
Display banner:
|
|
296
|
+
```
|
|
297
|
+
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
|
|
298
|
+
MAXSIM ► REVIEWING RESULTS
|
|
299
|
+
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
|
|
300
|
+
|
|
301
|
+
◆ Running two-stage review...
|
|
302
|
+
```
|
|
303
|
+
|
|
304
|
+
1. Read the completed SUMMARY.md from `${QUICK_DIR}/${next_num}-SUMMARY.md` to understand what was built.
|
|
305
|
+
2. Get modified files: `git diff --name-only HEAD~{N}` where N = number of commits from the executor.
|
|
306
|
+
3. Read the quick plan at `${QUICK_DIR}/${next_num}-PLAN.md` to get task specs.
|
|
307
|
+
|
|
308
|
+
**Spec Review:**
|
|
309
|
+
|
|
310
|
+
```
|
|
311
|
+
Task(
|
|
312
|
+
prompt="
|
|
313
|
+
<review_context>
|
|
314
|
+
**Plan:** quick-${next_num}
|
|
315
|
+
**Description:** ${DESCRIPTION}
|
|
316
|
+
|
|
317
|
+
<task_specs>
|
|
318
|
+
{Copy task specs (action, done criteria, files) from ${QUICK_DIR}/${next_num}-PLAN.md}
|
|
319
|
+
</task_specs>
|
|
320
|
+
|
|
321
|
+
<modified_files>
|
|
322
|
+
{output of git diff --name-only HEAD~N}
|
|
323
|
+
</modified_files>
|
|
324
|
+
|
|
325
|
+
<plan_frontmatter>
|
|
326
|
+
{Plan frontmatter including requirements if present}
|
|
327
|
+
</plan_frontmatter>
|
|
328
|
+
</review_context>
|
|
329
|
+
",
|
|
330
|
+
subagent_type="maxsim-spec-reviewer",
|
|
331
|
+
model="{executor_model}",
|
|
332
|
+
description="Spec review: ${DESCRIPTION}"
|
|
333
|
+
)
|
|
334
|
+
```
|
|
335
|
+
|
|
336
|
+
Parse output frontmatter for `status:` field.
|
|
337
|
+
If FAIL: fix issues, commit (`fix(quick-${next_num}): address spec review findings`), retry (max 2 retries).
|
|
338
|
+
On retry exhaustion (3 total attempts):
|
|
339
|
+
|
|
340
|
+
```
|
|
341
|
+
## REVIEW BLOCKED
|
|
342
|
+
|
|
343
|
+
**Stage:** Spec Compliance
|
|
344
|
+
**Task:** quick-${next_num}: ${DESCRIPTION}
|
|
345
|
+
**Attempts:** 3 (initial + 2 retries)
|
|
346
|
+
**Failing Issues:**
|
|
347
|
+
- {issue list from review}
|
|
348
|
+
|
|
349
|
+
**Options:**
|
|
350
|
+
1. Fix manually and continue
|
|
351
|
+
2. Skip review for this task
|
|
352
|
+
3. Abort execution
|
|
353
|
+
```
|
|
354
|
+
|
|
355
|
+
STOP and wait for user decision.
|
|
356
|
+
|
|
357
|
+
**Code Review:**
|
|
358
|
+
|
|
359
|
+
```
|
|
360
|
+
Task(
|
|
361
|
+
prompt="
|
|
362
|
+
<review_context>
|
|
363
|
+
**Plan:** quick-${next_num}
|
|
364
|
+
|
|
365
|
+
<modified_files>
|
|
366
|
+
{output of git diff --name-only HEAD~N, updated after any spec-review fix commits}
|
|
367
|
+
</modified_files>
|
|
368
|
+
|
|
369
|
+
<conventions>
|
|
370
|
+
{Content of .planning/CONVENTIONS.md or .planning/codebase/CONVENTIONS.md, or 'No CONVENTIONS.md found'}
|
|
371
|
+
</conventions>
|
|
372
|
+
|
|
373
|
+
<test_results>
|
|
374
|
+
{Last 20 lines of npm test output if package.json exists, or 'No tests available'}
|
|
375
|
+
</test_results>
|
|
376
|
+
</review_context>
|
|
377
|
+
",
|
|
378
|
+
subagent_type="maxsim-code-reviewer",
|
|
379
|
+
model="{executor_model}",
|
|
380
|
+
description="Code review: ${DESCRIPTION}"
|
|
381
|
+
)
|
|
382
|
+
```
|
|
383
|
+
|
|
384
|
+
Same retry logic as spec review (max 2 retries, then REVIEW BLOCKED with user options).
|
|
385
|
+
|
|
386
|
+
**Record review results** in the quick task SUMMARY.md:
|
|
387
|
+
|
|
388
|
+
Append `## Review Cycle` section to `${QUICK_DIR}/${next_num}-SUMMARY.md`:
|
|
389
|
+
```markdown
|
|
390
|
+
## Review Cycle
|
|
391
|
+
- Spec: {PASS/FAIL/SKIPPED} ({retry_count} retries)
|
|
392
|
+
- Code: {PASS/FAIL/SKIPPED} ({retry_count} retries)
|
|
393
|
+
- Issues: {critical_count} critical, {warning_count} warnings
|
|
394
|
+
```
|
|
395
|
+
|
|
396
|
+
If `--full` flag was set: this review is part of the full quality pipeline (alongside plan-checking and verification that `--full` already enables).
|
|
397
|
+
If `--full` was NOT set: the review STILL runs (per locked decision).
|
|
398
|
+
|
|
399
|
+
---
|
|
400
|
+
|
|
291
401
|
**Step 6.5: Verification (only when `$FULL_MODE`)**
|
|
292
402
|
|
|
293
403
|
Skip this step entirely if NOT `$FULL_MODE`.
|