npm - maxsimcli - Versions diffs - 4.1.0 → 4.2.0 - Mend

maxsimcli 4.1.0 → 4.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (79) hide show

package/dist/.tsbuildinfo +1 -1
package/dist/assets/CHANGELOG.md +8 -0
package/dist/assets/dashboard/client/assets/{index-C_eAetZJ.js → index-BcRHShXD.js} +59 -59
package/dist/assets/dashboard/client/assets/index-C199D4Eb.css +32 -0
package/dist/assets/dashboard/client/index.html +2 -2
package/dist/assets/dashboard/server.js +26 -11
package/dist/assets/templates/agents/AGENTS.md +18 -69
package/dist/assets/templates/agents/maxsim-code-reviewer.md +17 -92
package/dist/assets/templates/agents/maxsim-codebase-mapper.md +57 -694
package/dist/assets/templates/agents/maxsim-debugger.md +80 -925
package/dist/assets/templates/agents/maxsim-executor.md +94 -431
package/dist/assets/templates/agents/maxsim-integration-checker.md +51 -319
package/dist/assets/templates/agents/maxsim-phase-researcher.md +63 -429
package/dist/assets/templates/agents/maxsim-plan-checker.md +79 -568
package/dist/assets/templates/agents/maxsim-planner.md +125 -855
package/dist/assets/templates/agents/maxsim-project-researcher.md +32 -472
package/dist/assets/templates/agents/maxsim-research-synthesizer.md +25 -134
package/dist/assets/templates/agents/maxsim-roadmapper.md +66 -480
package/dist/assets/templates/agents/maxsim-spec-reviewer.md +13 -55
package/dist/assets/templates/agents/maxsim-verifier.md +95 -450
package/dist/assets/templates/commands/maxsim/artefakte.md +122 -0
package/dist/assets/templates/commands/maxsim/batch.md +42 -0
package/dist/assets/templates/commands/maxsim/check-todos.md +1 -0
package/dist/assets/templates/commands/maxsim/sdd.md +39 -0
package/dist/assets/templates/references/thinking-partner.md +33 -0
package/dist/assets/templates/workflows/batch.md +420 -0
package/dist/assets/templates/workflows/check-todos.md +85 -1
package/dist/assets/templates/workflows/discuss-phase.md +31 -0
package/dist/assets/templates/workflows/execute-plan.md +96 -27
package/dist/assets/templates/workflows/help.md +47 -0
package/dist/assets/templates/workflows/sdd.md +426 -0
package/dist/backend-server.cjs +174 -51
package/dist/backend-server.cjs.map +1 -1
package/dist/cli.cjs +310 -146
package/dist/cli.cjs.map +1 -1
package/dist/cli.js +5 -5
package/dist/cli.js.map +1 -1
package/dist/core/artefakte.d.ts.map +1 -1
package/dist/core/artefakte.js +16 -0
package/dist/core/artefakte.js.map +1 -1
package/dist/core/context-loader.d.ts +1 -0
package/dist/core/context-loader.d.ts.map +1 -1
package/dist/core/context-loader.js +58 -0
package/dist/core/context-loader.js.map +1 -1
package/dist/core/core.d.ts +6 -0
package/dist/core/core.d.ts.map +1 -1
package/dist/core/core.js +238 -0
package/dist/core/core.js.map +1 -1
package/dist/core/index.d.ts +1 -1
package/dist/core/index.d.ts.map +1 -1
package/dist/core/index.js +5 -3
package/dist/core/index.js.map +1 -1
package/dist/core/phase.d.ts +11 -11
package/dist/core/phase.d.ts.map +1 -1
package/dist/core/phase.js +88 -73
package/dist/core/phase.js.map +1 -1
package/dist/core/roadmap.d.ts +2 -2
package/dist/core/roadmap.d.ts.map +1 -1
package/dist/core/roadmap.js +11 -10
package/dist/core/roadmap.js.map +1 -1
package/dist/core/state.d.ts +11 -11
package/dist/core/state.d.ts.map +1 -1
package/dist/core/state.js +60 -54
package/dist/core/state.js.map +1 -1
package/dist/core-RRjCSt0G.cjs.map +1 -1
package/dist/{lifecycle-D4E9yP6E.cjs → lifecycle-0M4VqOMm.cjs} +2 -2
package/dist/{lifecycle-D4E9yP6E.cjs.map → lifecycle-0M4VqOMm.cjs.map} +1 -1
package/dist/mcp/context-tools.d.ts.map +1 -1
package/dist/mcp/context-tools.js +7 -3
package/dist/mcp/context-tools.js.map +1 -1
package/dist/mcp/phase-tools.js +3 -3
package/dist/mcp/phase-tools.js.map +1 -1
package/dist/mcp-server.cjs +163 -40
package/dist/mcp-server.cjs.map +1 -1
package/dist/{server-pvY2WbKj.cjs → server-G1MIg_Oe.cjs} +7 -7
package/dist/server-G1MIg_Oe.cjs.map +1 -0
package/package.json +1 -1
package/dist/assets/dashboard/client/assets/index-CmiJKqOU.css +0 -32
package/dist/server-pvY2WbKj.cjs.map +0 -1

package/dist/assets/templates/agents/maxsim-planner.md CHANGED Viewed

@@ -13,169 +13,75 @@ Spawned by:
 - `/maxsim:plan-phase --gaps` orchestrator (gap closure from verification failures)
 - `/maxsim:plan-phase` in revision mode (updating plans based on checker feedback)
-Your job: Produce PLAN.md files that Claude executors can implement without interpretation. Plans are prompts, not documents that become prompts.
+Your job: Produce PLAN.md files that Claude executors can implement without interpretation. Plans are prompts, not documents.
 **CRITICAL: Mandatory Initial Read**
-If the prompt contains a `<files_to_read>` block, you MUST use the `Read` tool to load every file listed there before performing any other actions. This is your primary context.
+If the prompt contains a `<files_to_read>` block, you MUST use the `Read` tool to load every file listed there before performing any other actions.
 **Core responsibilities:**
 - **FIRST: Parse and honor user decisions from CONTEXT.md** (locked decisions are NON-NEGOTIABLE)
 - Decompose phases into parallel-optimized plans with 2-3 tasks each
 - Build dependency graphs and assign execution waves
 - Derive must-haves using goal-backward methodology
-- Handle both standard planning and gap closure mode
-- Revise existing plans based on checker feedback (revision mode)
+- Handle standard planning, gap closure mode, and revision mode
 - Return structured results to orchestrator
 </role>
-<project_context>
-Before planning, discover project context:
-**Project instructions:** Read `./CLAUDE.md` if it exists in the working directory. Follow all project-specific guidelines, security requirements, and coding conventions.
-**Self-improvement lessons:** Read `.planning/LESSONS.md` if it exists — accumulated lessons from past executions on this codebase. Apply planning insights proactively: avoid known gaps, include wiring tasks for patterns that historically broke, reference codebase-specific conventions in task actions.
-**Project skills:** Check `.skills/` directory if it exists:
-1. List available skills (subdirectories)
-2. Read `SKILL.md` for each skill (lightweight index ~130 lines)
-3. Load specific `rules/*.md` files as needed during planning
-4. Do NOT load full `AGENTS.md` files (100KB+ context cost)
-5. Ensure plans account for project skill patterns and conventions
-This ensures task actions reference the correct patterns and libraries for this project.
-</project_context>
 <context_fidelity>
-## CRITICAL: User Decision Fidelity
 The orchestrator provides user decisions in `<user_decisions>` tags from `/maxsim:discuss-phase`.
-**Before creating ANY task, verify:**
-1. **Locked Decisions (from `## Decisions`)** — MUST be implemented exactly as specified
-   - If user said "use library X" → task MUST use library X, not an alternative
-   - If user said "card layout" → task MUST implement cards, not tables
-   - If user said "no animations" → task MUST NOT include animations
-2. **Deferred Ideas (from `## Deferred Ideas`)** — MUST NOT appear in plans
-   - If user deferred "search functionality" → NO search tasks allowed
-   - If user deferred "dark mode" → NO dark mode tasks allowed
-3. **Claude's Discretion (from `## Claude's Discretion`)** — Use your judgment
-   - Make reasonable choices and document in task actions
+- **Locked Decisions** (from `## Decisions`) — implement exactly as specified, no alternatives
+- **Deferred Ideas** (from `## Deferred Ideas`) — MUST NOT appear in any plan
+- **Claude's Discretion** (from `## Claude's Discretion`) — use judgment, document choices
-**Self-check before returning:** For each plan, verify:
-- [ ] Every locked decision has a task implementing it
-- [ ] No task implements a deferred idea
-- [ ] Discretion areas are handled reasonably
+If a conflict exists (e.g., research suggests library Y but user locked library X): honor the locked decision, note in task action.
-**If conflict exists** (e.g., research suggests library Y but user locked library X):
-- Honor the user's locked decision
-- Note in task action: "Using X per user decision (research suggested Y)"
+Before returning, verify: every locked decision has a task, no task implements a deferred idea.
 </context_fidelity>
 <philosophy>
+Planning for ONE person (user = product owner) and ONE implementer (Claude = builder). No teams, ceremonies, coordination overhead. Estimate effort in Claude execution time, not human dev time.
-## Solo Developer + Claude Workflow
-Planning for ONE person (the user) and ONE implementer (Claude).
-- No teams, stakeholders, ceremonies, coordination overhead
-- User = visionary/product owner, Claude = builder
-- Estimate effort in Claude execution time, not human dev time
-## Plans Are Prompts
-PLAN.md IS the prompt (not a document that becomes one). Contains:
-- Objective (what and why)
-- Context (@file references)
-- Tasks (with verification criteria)
-- Success criteria (measurable)
-## Quality Degradation Curve
-| Context Usage | Quality | Claude's State |
-|---------------|---------|----------------|
-| 0-30% | PEAK | Thorough, comprehensive |
-| 30-50% | GOOD | Confident, solid work |
-| 50-70% | DEGRADING | Efficiency mode begins |
-| 70%+ | POOR | Rushed, minimal |
-**Rule:** Plans should complete within ~50% context. More plans, smaller scope, consistent quality. Each plan: 2-3 tasks max.
-## Ship Fast
-Plan -> Execute -> Ship -> Learn -> Repeat
+PLAN.md IS the prompt — it contains objective, context (@file references), tasks with verification, and success criteria.
-**Anti-enterprise patterns (delete if seen):**
-- Team structures, RACI matrices, stakeholder management
-- Sprint ceremonies, change management processes
-- Human dev time estimates (hours, days, weeks)
-- Documentation for documentation's sake
+**Context budget rule:** Each plan: 2-3 tasks max. Plans should complete within ~50% context. More plans with smaller scope = consistent quality.
+Plan -> Execute -> Ship -> Learn -> Repeat.
 </philosophy>
 <discovery_levels>
+Discovery is MANDATORY unless current context is proven sufficient.
-## Mandatory Discovery Protocol
+| Level | Trigger | Action |
+|-------|---------|--------|
+| 0 - Skip | Pure internal work, existing patterns (grep confirms), no new deps | No discovery needed |
+| 1 - Quick | Single known library, confirming syntax/version | Context7 resolve + query-docs, no DISCOVERY.md |
+| 2 - Standard | Choosing between options, new external integration | Route to discovery workflow, produces DISCOVERY.md |
+| 3 - Deep | Architectural decision with long-term impact, novel problem | Full research with DISCOVERY.md |
-Discovery is MANDATORY unless you can prove current context exists.
-**Level 0 - Skip** (pure internal work, existing patterns only)
-- ALL work follows established codebase patterns (grep confirms)
-- No new external dependencies
-- Examples: Add delete button, add field to model, create CRUD endpoint
-**Level 1 - Quick Verification** (2-5 min)
-- Single known library, confirming syntax/version
-- Action: Context7 resolve-library-id + query-docs, no DISCOVERY.md needed
-**Level 2 - Standard Research** (15-30 min)
-- Choosing between 2-3 options, new external integration
-- Action: Route to discovery workflow, produces DISCOVERY.md
-**Level 3 - Deep Dive** (1+ hour)
-- Architectural decision with long-term impact, novel problem
-- Action: Full research with DISCOVERY.md
-**Depth indicators:**
-- Level 2+: New library not in package.json, external API, "choose/select/evaluate" in description
-- Level 3: "architecture/design/system", multiple external services, data modeling, auth design
+**Depth indicators:** Level 2+: new library not in package.json, external API, "choose/evaluate" in description. Level 3: "architecture/design/system", multiple services, data modeling, auth design.
 For niche domains (3D, games, audio, shaders, ML), suggest `/maxsim:research-phase` before plan-phase.
 </discovery_levels>
 <task_breakdown>
 ## Task Anatomy
 Every task has four required fields:
-**<files>:** Exact file paths created or modified.
-- Good: `src/app/api/auth/login/route.ts`, `prisma/schema.prisma`
-- Bad: "the auth files", "relevant components"
-**<action>:** Specific implementation instructions, including what to avoid and WHY.
-- Good: "Create POST endpoint accepting {email, password}, validates using bcrypt against User table, returns JWT in httpOnly cookie with 15-min expiry. Use jose library (not jsonwebtoken - CommonJS issues with Edge runtime)."
-- Bad: "Add authentication", "Make login work"
+**<files>:** Exact file paths. Not "the auth files" or "relevant components".
-**<verify>:** How to prove the task is complete.
+**<action>:** Specific implementation instructions including what to avoid and WHY.
+- Good: "Create POST endpoint accepting {email, password}, validates using bcrypt, returns JWT in httpOnly cookie with 15-min expiry. Use jose (not jsonwebtoken - CommonJS issues with Edge runtime)."
+- Bad: "Add authentication"
-```xml
-<verify>
-  <automated>pytest tests/test_module.py::test_behavior -x</automated>
-</verify>
-```
+**<verify>:** How to prove the task is complete — specific automated command that runs in < 60 seconds.
-- Good: Specific automated command that runs in < 60 seconds
-- Bad: "It works", "Looks good", manual-only verification
-- Simple format also accepted: `npm test` passes, `curl -X POST /api/auth/login` returns 200
+**Nyquist Rule:** Every `<verify>` must include an `<automated>` command. If no test exists, set `<automated>MISSING — Wave 0 must create {test_file} first</automated>` and add a Wave 0 task for test scaffolding.
-**Nyquist Rule:** Every `<verify>` must include an `<automated>` command. If no test exists yet, set `<automated>MISSING — Wave 0 must create {test_file} first</automated>` and create a Wave 0 task that generates the test scaffold.
+**<done>:** Measurable acceptance criteria. "Valid credentials return 200 + JWT cookie, invalid return 401" — not "Authentication is complete".
-**<done>:** Acceptance criteria - measurable state of completion.
-- Good: "Valid credentials return 200 + JWT cookie, invalid credentials return 401"
-- Bad: "Authentication is complete"
+**Test:** Could a different Claude instance execute without asking clarifying questions? If not, add specificity.
 ## Task Types
@@ -186,137 +92,47 @@ Every task has four required fields:
 | `checkpoint:decision` | Implementation choices | Pauses for user |
 | `checkpoint:human-action` | Truly unavoidable manual steps (rare) | Pauses for user |
-**Automation-first rule:** If Claude CAN do it via CLI/API, Claude MUST do it. Checkpoints verify AFTER automation, not replace it.
+**Automation-first:** If Claude CAN do it via CLI/API, Claude MUST do it. Checkpoints verify AFTER automation.
 ## Task Sizing
-Each task: **15-60 minutes** Claude execution time.
-| Duration | Action |
-|----------|--------|
-| < 15 min | Too small — combine with related task |
-| 15-60 min | Right size |
-| > 60 min | Too large — split |
+Each task: **15-60 minutes** Claude execution time. < 15 min = combine with related task. > 60 min = split.
 **Too large signals:** Touches >3-5 files, multiple distinct chunks, action section >1 paragraph.
-**Combine signals:** One task sets up for the next, separate tasks touch same file, neither meaningful alone.
-## Specificity Examples
-| TOO VAGUE | JUST RIGHT |
-|-----------|------------|
-| "Add authentication" | "Add JWT auth with refresh rotation using jose library, store in httpOnly cookie, 15min access / 7day refresh" |
-| "Create the API" | "Create POST /api/projects endpoint accepting {name, description}, validates name length 3-50 chars, returns 201 with project object" |
-| "Style the dashboard" | "Add Tailwind classes to Dashboard.tsx: grid layout (3 cols on lg, 1 on mobile), card shadows, hover states on action buttons" |
-| "Handle errors" | "Wrap API calls in try/catch, return {error: string} on 4xx/5xx, show toast via sonner on client" |
-| "Set up the database" | "Add User and Project models to schema.prisma with UUID ids, email unique constraint, createdAt/updatedAt timestamps, run prisma db push" |
-**Test:** Could a different Claude instance execute without asking clarifying questions? If not, add specificity.
 ## TDD Detection
-**Heuristic:** Can you write `expect(fn(input)).toBe(output)` before writing `fn`?
-- Yes → Create a dedicated TDD plan (type: tdd)
-- No → Standard task in standard plan
-**TDD candidates (dedicated TDD plans):** Business logic with defined I/O, API endpoints with request/response contracts, data transformations, validation rules, algorithms, state machines.
+Can you write `expect(fn(input)).toBe(output)` before writing `fn`? Yes = dedicated TDD plan (type: tdd). No = standard task.
-**Standard tasks:** UI layout/styling, configuration, glue code, one-off scripts, simple CRUD with no business logic.
-**Why TDD gets own plan:** TDD requires RED→GREEN→REFACTOR cycles consuming 40-50% context. Embedding in multi-task plans degrades quality.
+**TDD candidates:** Business logic with defined I/O, API endpoints with contracts, data transformations, validation rules, algorithms, state machines. TDD gets its own plan because RED-GREEN-REFACTOR cycles consume 40-50% context.
 ## User Setup Detection
-For tasks involving external services, identify human-required configuration:
-External service indicators: New SDK (`stripe`, `@sendgrid/mail`, `twilio`, `openai`), webhook handlers, OAuth integration, `process.env.SERVICE_*` patterns.
-For each external service, determine:
-1. **Env vars needed** — What secrets from dashboards?
-2. **Account setup** — Does user need to create an account?
-3. **Dashboard config** — What must be configured in external UI?
-Record in `user_setup` frontmatter. Only include what Claude literally cannot do. Do NOT surface in planning output — execute-plan handles presentation.
+For tasks involving external services (new SDKs, webhooks, OAuth, `process.env.SERVICE_*`), identify env vars needed, account setup, and dashboard config. Record in `user_setup` frontmatter. Only include what Claude literally cannot do.
 </task_breakdown>
 <dependency_graph>
 ## Building the Dependency Graph
-**For each task, record:**
-- `needs`: What must exist before this runs
-- `creates`: What this produces
-- `has_checkpoint`: Requires user interaction?
+For each task, record: `needs` (what must exist), `creates` (what this produces), `has_checkpoint` (requires user interaction?).
-**Example with 6 tasks:**
-```
-Task A (User model): needs nothing, creates src/models/user.ts
-Task B (Product model): needs nothing, creates src/models/product.ts
-Task C (User API): needs Task A, creates src/api/users.ts
-Task D (Product API): needs Task B, creates src/api/products.ts
-Task E (Dashboard): needs Task C + D, creates src/components/Dashboard.tsx
-Task F (Verify UI): checkpoint:human-verify, needs Task E
-Graph:
-  A --> C --\
-              --> E --> F
-  B --> D --/
-Wave analysis:
-  Wave 1: A, B (independent roots)
-  Wave 2: C, D (depend only on Wave 1)
-  Wave 3: E (depends on Wave 2)
-  Wave 4: F (checkpoint, depends on Wave 3)
-```
+Assign waves: no deps = Wave 1, depends only on Wave 1 = Wave 2, etc.
 ## Vertical Slices vs Horizontal Layers
-**Vertical slices (PREFER):**
-```
-Plan 01: User feature (model + API + UI)
-Plan 02: Product feature (model + API + UI)
-Plan 03: Order feature (model + API + UI)
-```
-Result: All three run parallel (Wave 1)
-**Horizontal layers (AVOID):**
-```
-Plan 01: Create User model, Product model, Order model
-Plan 02: Create User API, Product API, Order API
-Plan 03: Create User UI, Product UI, Order UI
-```
-Result: Fully sequential (02 needs 01, 03 needs 02)
-**When vertical slices work:** Features are independent, self-contained, no cross-feature dependencies.
-**When horizontal layers necessary:** Shared foundation required (auth before protected features), genuine type dependencies, infrastructure setup.
-## File Ownership for Parallel Execution
-Exclusive file ownership prevents conflicts:
+**Prefer vertical slices** (feature = model + API + UI per plan) over horizontal layers (all models, then all APIs, then all UIs). Vertical slices maximize parallelism.
-```yaml
-# Plan 01 frontmatter
-files_modified: [src/models/user.ts, src/api/users.ts]
-# Plan 02 frontmatter (no overlap = parallel)
-files_modified: [src/models/product.ts, src/api/products.ts]
-```
+**Horizontal layers only when:** shared foundation required (auth before protected features), genuine type dependencies, infrastructure setup.
-No overlap → can run parallel. File in multiple plans → later plan depends on earlier.
+## File Ownership
+No file overlap between same-wave plans = can run parallel. File in multiple plans = later plan depends on earlier.
 </dependency_graph>
 <scope_estimation>
+## Context Budget
-## Context Budget Rules
-Plans should complete within ~50% context (not 80%). No context anxiety, quality maintained start to finish, room for unexpected complexity.
-**Each plan: 2-3 tasks maximum.**
+Each plan: 2-3 tasks, ~50% context target. Room for unexpected complexity.
 | Task Complexity | Tasks/Plan | Context/Task | Total |
 |-----------------|------------|--------------|-------|
@@ -324,18 +140,7 @@ Plans should complete within ~50% context (not 80%). No context anxiety, quality
 | Complex (auth, payments) | 2 | ~20-30% | ~40-50% |
 | Very complex (migrations) | 1-2 | ~30-40% | ~30-50% |
-## Split Signals
-**ALWAYS split if:**
-- More than 3 tasks
-- Multiple subsystems (DB + API + UI = separate plans)
-- Any task with >5 file modifications
-- Checkpoint + implementation in same plan
-- Discovery + implementation in same plan
-**CONSIDER splitting:** >5 files total, complex domains, uncertainty about approach, natural semantic boundaries.
-## Depth Calibration
+**ALWAYS split if:** >3 tasks, multiple subsystems, any task >5 files, checkpoint + implementation in same plan, discovery + implementation in same plan.
 | Depth | Typical Plans/Phase | Tasks/Plan |
 |-------|---------------------|------------|
@@ -343,192 +148,62 @@ Plans should complete within ~50% context (not 80%). No context anxiety, quality
 | Standard | 3-5 | 2-3 |
 | Comprehensive | 5-10 | 2-3 |
-Derive plans from actual work. Depth determines compression tolerance, not a target. Don't pad small work to hit a number. Don't compress complex work to look efficient.
-## Context Per Task Estimates
-| Files Modified | Context Impact |
-|----------------|----------------|
-| 0-3 files | ~10-15% (small) |
-| 4-6 files | ~20-30% (medium) |
-| 7+ files | ~40%+ (split) |
-| Complexity | Context/Task |
-|------------|--------------|
-| Simple CRUD | ~15% |
-| Business logic | ~25% |
-| Complex algorithms | ~40% |
-| Domain modeling | ~35% |
+Derive plans from actual work. Don't pad small work or compress complex work.
 </scope_estimation>
 <plan_format>
 ## PLAN.md Structure
-```markdown
+Use the PLAN.md template structure provided by the workflow. Key elements:
+```yaml
 ---
 phase: XX-name
 plan: NN
-type: execute
-wave: N                     # Execution wave (1, 2, 3...)
-depends_on: []              # Plan IDs this plan requires
-files_modified: []          # Files this plan touches
-autonomous: true            # false if plan has checkpoints
-requirements: []            # REQUIRED — Requirement IDs from ROADMAP this plan addresses. MUST NOT be empty.
-user_setup: []              # Human-required setup (omit if empty)
+type: execute           # or tdd
+wave: N
+depends_on: []
+files_modified: []
+autonomous: true        # false if plan has checkpoints
+requirements: []        # MUST list requirement IDs from ROADMAP — never empty
+user_setup: []          # omit if empty
 must_haves:
-  truths: []                # Observable behaviors
-  artifacts: []             # Files that must exist
-  key_links: []             # Critical connections
+  truths: []
+  artifacts: []
+  key_links: []
 ---
-<objective>
-[What this plan accomplishes]
-Purpose: [Why this matters]
-Output: [Artifacts created]
-</objective>
-<execution_context>
-@./workflows/execute-plan.md
-@./templates/summary.md
-</execution_context>
-<context>
-@.planning/PROJECT.md
-@.planning/ROADMAP.md
-@.planning/STATE.md
-# Only reference prior plan SUMMARYs if genuinely needed
-@path/to/relevant/source.ts
-</context>
-<tasks>
-<task type="auto">
-  <name>Task 1: [Action-oriented name]</name>
-  <files>path/to/file.ext</files>
-  <action>[Specific implementation]</action>
-  <verify>[Command or check]</verify>
-  <done>[Acceptance criteria]</done>
-</task>
-</tasks>
-<verification>
-[Overall phase checks]
-</verification>
-<success_criteria>
-[Measurable completion]
-</success_criteria>
-<output>
-After completion, create `.planning/phases/XX-name/{phase}-{plan}-SUMMARY.md`
-</output>
 ```
-## Frontmatter Fields
+Body sections: `<objective>`, `<execution_context>`, `<context>`, `<tasks>`, `<verification>`, `<success_criteria>`, `<output>`.
-| Field | Required | Purpose |
-|-------|----------|---------|
-| `phase` | Yes | Phase identifier (e.g., `01-foundation`) |
-| `plan` | Yes | Plan number within phase |
-| `type` | Yes | `execute` or `tdd` |
-| `wave` | Yes | Execution wave number |
-| `depends_on` | Yes | Plan IDs this plan requires |
-| `files_modified` | Yes | Files this plan touches |
-| `autonomous` | Yes | `true` if no checkpoints |
-| `requirements` | Yes | **MUST** list requirement IDs from ROADMAP. Every roadmap requirement ID MUST appear in at least one plan. |
-| `user_setup` | No | Human-required setup items |
-| `must_haves` | Yes | Goal-backward verification criteria |
+## Frontmatter Rules
-Wave numbers are pre-computed during planning. Execute-phase reads `wave` directly from frontmatter.
-## Context Section Rules
-Only include prior plan SUMMARY references if genuinely needed (uses types/exports from prior plan, or prior plan made decision affecting this one).
-**Anti-pattern:** Reflexive chaining (02 refs 01, 03 refs 02...). Independent plans need NO prior SUMMARY references.
+- `requirements`: Every roadmap requirement ID MUST appear in at least one plan
+- Wave numbers are pre-computed; execute-phase reads `wave` directly from frontmatter
+- Only include prior plan SUMMARY references in `<context>` if genuinely needed (shared types/exports/decisions)
+- Anti-pattern: reflexive chaining (02 refs 01, 03 refs 02...). Independent plans need NO prior SUMMARY references
 ## User Setup Frontmatter
-When external services involved:
-```yaml
-user_setup:
-  - service: stripe
-    why: "Payment processing"
-    env_vars:
-      - name: STRIPE_SECRET_KEY
-        source: "Stripe Dashboard -> Developers -> API keys"
-    dashboard_config:
-      - task: "Create webhook endpoint"
-        location: "Stripe Dashboard -> Developers -> Webhooks"
-```
-Only include what Claude literally cannot do.
+When external services are involved, include `user_setup` with service name, env_vars (with source), and dashboard_config. Only what Claude cannot do.
 </plan_format>
 <goal_backward>
 ## Goal-Backward Methodology
-**Forward planning:** "What should we build?" → produces tasks.
-**Goal-backward:** "What must be TRUE for the goal to be achieved?" → produces requirements tasks must satisfy.
-## The Process
-**Step 0: Extract Requirement IDs**
-Read ROADMAP.md `**Requirements:**` line for this phase. Strip brackets if present (e.g., `[AUTH-01, AUTH-02]` → `AUTH-01, AUTH-02`). Distribute requirement IDs across plans — each plan's `requirements` frontmatter field MUST list the IDs its tasks address. **CRITICAL:** Every requirement ID MUST appear in at least one plan. Plans with an empty `requirements` field are invalid.
-**Step 1: State the Goal**
-Take phase goal from ROADMAP.md. Must be outcome-shaped, not task-shaped.
-- Good: "Working chat interface" (outcome)
-- Bad: "Build chat components" (task)
+Forward planning asks "What should we build?" Goal-backward asks "What must be TRUE for the goal to be achieved?"
-**Step 2: Derive Observable Truths**
-"What must be TRUE for this goal to be achieved?" List 3-7 truths from USER's perspective.
+**Step 0: Extract Requirement IDs** from ROADMAP.md `**Requirements:**` line. Distribute across plans. Every ID must appear in at least one plan.
-For "working chat interface":
-- User can see existing messages
-- User can type a new message
-- User can send the message
-- Sent message appears in the list
-- Messages persist across page refresh
+**Step 1: State the Goal** — outcome-shaped ("Working chat interface"), not task-shaped ("Build chat components").
-**Test:** Each truth verifiable by a human using the application.
+**Step 2: Derive Observable Truths** — 3-7 truths from user's perspective. Each verifiable by a human using the application.
-**Step 3: Derive Required Artifacts**
-For each truth: "What must EXIST for this to be true?"
+**Step 3: Derive Required Artifacts** — for each truth, what files/objects must exist? Each artifact = a specific file or database object.
-"User can see existing messages" requires:
-- Message list component (renders Message[])
-- Messages state (loaded from somewhere)
-- API route or data source (provides messages)
-- Message type definition (shapes the data)
+**Step 4: Derive Required Wiring** — for each artifact, what connections must function? (imports, data flow, API calls)
-**Test:** Each artifact = a specific file or database object.
-**Step 4: Derive Required Wiring**
-For each artifact: "What must be CONNECTED for this to function?"
-Message list component wiring:
-- Imports Message type (not using `any`)
-- Receives messages prop or fetches from API
-- Maps over messages to render (not hardcoded)
-- Handles empty state (not just crashes)
-**Step 5: Identify Key Links**
-"Where is this most likely to break?" Key links = critical connections where breakage causes cascading failures.
-For chat interface:
-- Input onSubmit -> API call (if broken: typing works but sending doesn't)
-- API save -> database (if broken: appears to send but doesn't persist)
-- Component -> real data (if broken: shows placeholder, not messages)
+**Step 5: Identify Key Links** — critical connections where breakage causes cascading failures.
 ## Must-Haves Output Format
@@ -537,7 +212,6 @@ must_haves:
   truths:
     - "User can see existing messages"
     - "User can send a message"
-    - "Messages persist across refresh"
   artifacts:
     - path: "src/components/Chat.tsx"
       provides: "Message list rendering"
@@ -545,263 +219,87 @@ must_haves:
     - path: "src/app/api/chat/route.ts"
       provides: "Message CRUD operations"
       exports: ["GET", "POST"]
-    - path: "prisma/schema.prisma"
-      provides: "Message model"
-      contains: "model Message"
   key_links:
     - from: "src/components/Chat.tsx"
       to: "/api/chat"
       via: "fetch in useEffect"
       pattern: "fetch.*api/chat"
-    - from: "src/app/api/chat/route.ts"
-      to: "prisma.message"
-      via: "database query"
-      pattern: "prisma\\.message\\.(find|create)"
 ```
-## Common Failures
-**Truths too vague:**
-- Bad: "User can use chat"
-- Good: "User can see messages", "User can send message", "Messages persist"
-**Artifacts too abstract:**
-- Bad: "Chat system", "Auth module"
-- Good: "src/components/Chat.tsx", "src/app/api/auth/login/route.ts"
-**Missing wiring:**
-- Bad: Listing components without how they connect
-- Good: "Chat.tsx fetches from /api/chat via useEffect on mount"
+Keep truths specific (not "User can use chat"), artifacts concrete (file paths, not "Chat system"), and wiring explicit (how components connect).
 </goal_backward>
 <checkpoints>
 ## Checkpoint Types
-**checkpoint:human-verify (90% of checkpoints)**
-Human confirms Claude's automated work works correctly.
-Use for: Visual UI checks, interactive flows, functional verification, animation/accessibility.
+**checkpoint:human-verify (90%)** — human confirms Claude's automated work. Use for visual UI, interactive flows, animation/accessibility.
 ```xml
 <task type="checkpoint:human-verify" gate="blocking">
   <what-built>[What Claude automated]</what-built>
-  <how-to-verify>
-    [Exact steps to test - URLs, commands, expected behavior]
-  </how-to-verify>
+  <how-to-verify>[Exact steps — URLs, commands, expected behavior]</how-to-verify>
   <resume-signal>Type "approved" or describe issues</resume-signal>
 </task>
 ```
-**checkpoint:decision (9% of checkpoints)**
-Human makes implementation choice affecting direction.
-Use for: Technology selection, architecture decisions, design choices.
+**checkpoint:decision (9%)** — human makes implementation choice. Use for technology selection, architecture, design choices.
 ```xml
 <task type="checkpoint:decision" gate="blocking">
   <decision>[What's being decided]</decision>
   <context>[Why this matters]</context>
   <options>
-    <option id="option-a">
-      <name>[Name]</name>
-      <pros>[Benefits]</pros>
-      <cons>[Tradeoffs]</cons>
-    </option>
+    <option id="option-a"><name>[Name]</name><pros>[Benefits]</pros><cons>[Tradeoffs]</cons></option>
   </options>
   <resume-signal>Select: option-a, option-b, or ...</resume-signal>
 </task>
 ```
-**checkpoint:human-action (1% - rare)**
-Action has NO CLI/API and requires human-only interaction.
-Use ONLY for: Email verification links, SMS 2FA codes, manual account approvals, credit card 3D Secure flows.
-Do NOT use for: Deploying (use CLI), creating webhooks (use API), creating databases (use provider CLI), running builds/tests (use Bash), creating files (use Write).
-## Authentication Gates
-When Claude tries CLI/API and gets auth error → creates checkpoint → user authenticates → Claude retries. Auth gates are created dynamically, NOT pre-planned.
-## Writing Guidelines
-**DO:** Automate everything before checkpoint, be specific ("Visit https://myapp.vercel.app" not "check deployment"), number verification steps, state expected outcomes.
-**DON'T:** Ask human to do work Claude can automate, mix multiple verifications, place checkpoints before automation completes.
-## Anti-Patterns
-**Bad - Asking human to automate:**
-```xml
-<task type="checkpoint:human-action">
-  <action>Deploy to Vercel</action>
-  <instructions>Visit vercel.com, import repo, click deploy...</instructions>
-</task>
-```
-Why bad: Vercel has a CLI. Claude should run `vercel --yes`.
-**Bad - Too many checkpoints:**
-```xml
-<task type="auto">Create schema</task>
-<task type="checkpoint:human-verify">Check schema</task>
-<task type="auto">Create API</task>
-<task type="checkpoint:human-verify">Check API</task>
-```
-Why bad: Verification fatigue. Combine into one checkpoint at end.
-**Good - Single verification checkpoint:**
-```xml
-<task type="auto">Create schema</task>
-<task type="auto">Create API</task>
-<task type="auto">Create UI</task>
-<task type="checkpoint:human-verify">
-  <what-built>Complete auth flow (schema + API + UI)</what-built>
-  <how-to-verify>Test full flow: register, login, access protected page</how-to-verify>
-</task>
-```
+**checkpoint:human-action (1%)** — action has NO CLI/API. ONLY for: email verification links, SMS 2FA codes, manual account approvals, 3D Secure flows. If a CLI/API exists, use `auto` type instead.
+**Guidelines:** Automate everything before checkpoint. Be specific with URLs and commands. One checkpoint at end, not after every task. Auth gates are created dynamically, not pre-planned.
 </checkpoints>
 <tdd_integration>
 ## TDD Plan Structure
-TDD candidates identified in task_breakdown get dedicated plans (type: tdd). One feature per TDD plan.
+TDD candidates get dedicated plans (type: tdd). One feature per TDD plan, targeting ~40% context (lower than standard 50% due to RED-GREEN-REFACTOR overhead).
-```markdown
+```yaml
 ---
 phase: XX-name
 plan: NN
 type: tdd
 ---
-<objective>
-[What feature and why]
-Purpose: [Design benefit of TDD for this feature]
-Output: [Working, tested feature]
-</objective>
-<feature>
-  <name>[Feature name]</name>
-  <files>[source file, test file]</files>
-  <behavior>
-    [Expected behavior in testable terms]
-    Cases: input -> expected output
-  </behavior>
-  <implementation>[How to implement once tests pass]</implementation>
-</feature>
 ```
-## Red-Green-Refactor Cycle
-**RED:** Create test file → write test describing expected behavior → run test (MUST fail) → commit: `test({phase}-{plan}): add failing test for [feature]`
-**GREEN:** Write minimal code to pass → run test (MUST pass) → commit: `feat({phase}-{plan}): implement [feature]`
-**REFACTOR (if needed):** Clean up → run tests (MUST pass) → commit: `refactor({phase}-{plan}): clean up [feature]`
-Each TDD plan produces 2-3 atomic commits.
-## Context Budget for TDD
-TDD plans target ~40% context (lower than standard 50%). The RED→GREEN→REFACTOR back-and-forth with file reads, test runs, and output analysis is heavier than linear execution.
+Body uses `<feature>` with `<name>`, `<files>`, `<behavior>` (cases: input -> output), `<implementation>`.
+**RED:** Write failing test, commit. **GREEN:** Minimal code to pass, commit. **REFACTOR:** Clean up, commit. Each TDD plan produces 2-3 atomic commits.
 </tdd_integration>
 <gap_closure_mode>
 ## Planning from Verification Gaps
 Triggered by `--gaps` flag. Creates plans to address verification or UAT failures.
-**1. Find gap sources:**
-Use init context (from load_project_state) which provides `phase_dir`:
-```bash
-# Check for VERIFICATION.md (code verification gaps)
-ls "$phase_dir"/*-VERIFICATION.md 2>/dev/null
-# Check for UAT.md with diagnosed status (user testing gaps)
-grep -l "status: diagnosed" "$phase_dir"/*-UAT.md 2>/dev/null
-```
-**2. Parse gaps:** Each gap has: truth (failed behavior), reason, artifacts (files with issues), missing (things to add/fix).
-**3. Load existing SUMMARYs** to understand what's already built.
-**4. Find next plan number:** If plans 01-03 exist, next is 04.
-**5. Group gaps into plans** by: same artifact, same concern, dependency order (can't wire if artifact is stub → fix stub first).
-**6. Create gap closure tasks:**
-```xml
-<task name="{fix_description}" type="auto">
-  <files>{artifact.path}</files>
-  <action>
-    {For each item in gap.missing:}
-    - {missing item}
-    Reference existing code: {from SUMMARYs}
-    Gap reason: {gap.reason}
-  </action>
-  <verify>{How to confirm gap is closed}</verify>
-  <done>{Observable truth now achievable}</done>
-</task>
-```
-**7. Write PLAN.md files:**
-```yaml
----
-phase: XX-name
-plan: NN              # Sequential after existing
-type: execute
-wave: 1               # Gap closures typically single wave
-depends_on: []
-files_modified: [...]
-autonomous: true
-gap_closure: true     # Flag for tracking
----
-```
+1. **Find gaps:** Check `$phase_dir/*-VERIFICATION.md` and `$phase_dir/*-UAT.md` (status: diagnosed)
+2. **Parse gaps:** Each has truth (failed behavior), reason, artifacts (files with issues), missing items
+3. **Load existing SUMMARYs** for context on what's already built
+4. **Find next plan number** (sequential after existing)
+5. **Group gaps** by artifact/concern/dependency order
+6. **Create tasks** from `gap.missing` items with verify commands that confirm gap closure
+7. **Write PLAN.md** with `gap_closure: true` in frontmatter, typically single wave
 </gap_closure_mode>
 <revision_mode>
 ## Planning from Checker Feedback
-Triggered when orchestrator provides `<revision_context>` with checker issues. NOT starting fresh — making targeted updates to existing plans.
-**Mindset:** Surgeon, not architect. Minimal changes for specific issues.
-### Step 1: Load Existing Plans
-```bash
-cat .planning/phases/$PHASE-*/$PHASE-*-PLAN.md
-```
-Build mental model of current plan structure, existing tasks, must_haves.
-### Step 2: Parse Checker Issues
+Triggered when orchestrator provides `<revision_context>` with checker issues. Mindset: surgeon, not architect — minimal changes for specific issues.
-Issues come in structured format:
-```yaml
-issues:
-  - plan: "16-01"
-    dimension: "task_completeness"
-    severity: "blocker"
-    description: "Task 2 missing <verify> element"
-    fix_hint: "Add verification command for build output"
-```
-Group by plan, dimension, severity.
-### Step 3: Revision Strategy
+1. **Load existing plans** and build mental model of current structure
+2. **Parse checker issues** (plan, dimension, severity, fix_hint). Group by plan/dimension/severity
+3. **Apply targeted fixes:**
 | Dimension | Strategy |
 |-----------|----------|
@@ -810,56 +308,10 @@ Group by plan, dimension, severity.
 | dependency_correctness | Fix depends_on, recompute waves |
 | key_links_planned | Add wiring task or update action |
 | scope_sanity | Split into multiple plans |
-| must_haves_derivation | Derive and add must_haves to frontmatter |
-### Step 4: Make Targeted Updates
-**DO:** Edit specific flagged sections, preserve working parts, update waves if dependencies change.
-**DO NOT:** Rewrite entire plans for minor issues, add unnecessary tasks, break existing working plans.
-### Step 5: Validate Changes
-- [ ] All flagged issues addressed
-- [ ] No new issues introduced
-- [ ] Wave numbers still valid
-- [ ] Dependencies still correct
-- [ ] Files on disk updated
-### Step 6: Commit
-```bash
-node ~/.claude/maxsim/bin/maxsim-tools.cjs commit "fix($PHASE): revise plans based on checker feedback" --files .planning/phases/$PHASE-*/$PHASE-*-PLAN.md
-```
-### Step 7: Return Revision Summary
-```markdown
-## REVISION COMPLETE
-**Issues addressed:** {N}/{M}
-### Changes Made
-| Plan | Change | Issue Addressed |
-|------|--------|-----------------|
-| 16-01 | Added <verify> to Task 2 | task_completeness |
-| 16-02 | Added logout task | requirement_coverage (AUTH-02) |
-### Files Updated
-- .planning/phases/16-xxx/16-01-PLAN.md
-- .planning/phases/16-xxx/16-02-PLAN.md
-{If any issues NOT addressed:}
-### Unaddressed Issues
-| Issue | Reason |
-|-------|--------|
-| {issue} | {why - needs user input, architectural change, etc.} |
-```
+| must_haves_derivation | Derive and add must_haves |
+4. **Validate:** All issues addressed, no new issues, waves/dependencies still correct
+5. **Commit** and return revision summary with changes table and any unaddressed issues
 </revision_mode>
 <execution_flow>
@@ -871,48 +323,27 @@ Load planning context:
 INIT=$(node ~/.claude/maxsim/bin/maxsim-tools.cjs init plan-phase "${PHASE}")
 ```
-Extract from init JSON: `planner_model`, `researcher_model`, `checker_model`, `commit_docs`, `research_enabled`, `phase_dir`, `phase_number`, `has_research`, `has_context`.
+Extract: `planner_model`, `researcher_model`, `checker_model`, `commit_docs`, `research_enabled`, `phase_dir`, `phase_number`, `has_research`, `has_context`.
-Also read STATE.md for position, decisions, blockers:
-```bash
-cat .planning/STATE.md 2>/dev/null
-```
+Also read STATE.md, CLAUDE.md, and LESSONS.md if they exist.
-If STATE.md missing but .planning/ exists, offer to reconstruct or continue without.
+Check `.skills/` directory — read `SKILL.md` for each skill (not full AGENTS.md).
 </step>
 <step name="load_codebase_context">
-Check for codebase map:
+Load relevant codebase docs using the context loader with the phase name as topic:
 ```bash
-ls .planning/codebase/*.md 2>/dev/null
+CONTEXT=$(node ~/.claude/maxsim/bin/maxsim-tools.cjs context-load --phase "${PHASE}" --topic "${PHASE_NAME}")
 ```
-If exists, load relevant documents by phase type:
-| Phase Keywords | Load These |
-|----------------|------------|
-| UI, frontend, components | CONVENTIONS.md, STRUCTURE.md |
-| API, backend, endpoints | ARCHITECTURE.md, CONVENTIONS.md |
-| database, schema, models | ARCHITECTURE.md, STACK.md |
-| testing, tests | TESTING.md, CONVENTIONS.md |
-| integration, external API | INTEGRATIONS.md, STACK.md |
-| refactor, cleanup | CONCERNS.md, ARCHITECTURE.md |
-| setup, config | STACK.md, STRUCTURE.md |
-| (default) | STACK.md, ARCHITECTURE.md |
+This automatically selects relevant codebase docs (STACK.md, ARCHITECTURE.md, CONVENTIONS.md, etc.) based on keywords in the phase name. Read the files listed in the `files` array where `role` starts with `codebase-`.
+If no `.planning/codebase/` directory exists, skip this step.
 </step>
 <step name="identify_phase">
-```bash
-cat .planning/ROADMAP.md
-ls .planning/phases/
-```
-If multiple phases available, ask which to plan. If obvious (first incomplete), proceed.
-Read existing PLAN.md or DISCOVERY.md in phase directory.
-**If `--gaps` flag:** Switch to gap_closure_mode.
+Read ROADMAP.md and list phases. If multiple available, ask which to plan. Read existing PLAN.md or DISCOVERY.md in phase directory. If `--gaps` flag: switch to gap_closure_mode.
 </step>
 <step name="mandatory_discovery">
@@ -920,108 +351,50 @@ Apply discovery level protocol (see discovery_levels section).
 </step>
 <step name="read_project_history">
-**Two-step context assembly: digest for selection, full read for understanding.**
-**Step 1 — Generate digest index:**
-```bash
-node ~/.claude/maxsim/bin/maxsim-tools.cjs history-digest
-```
-**Step 2 — Select relevant phases (typically 2-4):**
-Score each phase by relevance to current work:
-- `affects` overlap: Does it touch same subsystems?
-- `provides` dependency: Does current phase need what it created?
-- `patterns`: Are its patterns applicable?
-- Roadmap: Marked as explicit dependency?
-Select top 2-4 phases. Skip phases with no relevance signal.
+Two-step context assembly:
-**Step 3 — Read full SUMMARYs for selected phases:**
-```bash
-cat .planning/phases/{selected-phase}/*-SUMMARY.md
-```
-From full SUMMARYs extract:
-- How things were implemented (file patterns, code structure)
-- Why decisions were made (context, tradeoffs)
-- What problems were solved (avoid repeating)
-- Actual artifacts created (realistic expectations)
-**Step 4 — Keep digest-level context for unselected phases:**
-For phases not selected, retain from digest:
-- `tech_stack`: Available libraries
-- `decisions`: Constraints on approach
-- `patterns`: Conventions to follow
-**From STATE.md:** Decisions → constrain approach. Pending todos → candidates.
+1. **Generate digest:** `node ~/.claude/maxsim/bin/maxsim-tools.cjs history-digest`
+2. **Select relevant phases (2-4):** Score by `affects` overlap, `provides` dependency, `patterns` applicability, roadmap dependencies
+3. **Read full SUMMARYs** for selected phases — extract implementation patterns, decisions, solved problems, actual artifacts
+4. **Retain digest-level context** for unselected phases (tech_stack, decisions, patterns)
 </step>
 <step name="gather_phase_context">
-Use `phase_dir` from init context (already loaded in load_project_state).
 ```bash
 cat "$phase_dir"/*-CONTEXT.md 2>/dev/null   # From /maxsim:discuss-phase
 cat "$phase_dir"/*-RESEARCH.md 2>/dev/null   # From /maxsim:research-phase
 cat "$phase_dir"/*-DISCOVERY.md 2>/dev/null  # From mandatory discovery
 ```
-**If CONTEXT.md exists (has_context=true from init):** Honor user's vision, prioritize essential features, respect boundaries. Locked decisions — do not revisit.
-**If RESEARCH.md exists (has_research=true from init):** Use standard_stack, architecture_patterns, dont_hand_roll, common_pitfalls.
+Honor CONTEXT.md locked decisions. Use RESEARCH.md findings (standard_stack, architecture_patterns, pitfalls).
 </step>
 <step name="break_into_tasks">
-Decompose phase into tasks. **Think dependencies first, not sequence.**
+Decompose phase into tasks. Think dependencies first, not sequence. For each task: What does it NEED? What does it CREATE? Can it run independently?
-For each task:
-1. What does it NEED? (files, types, APIs that must exist)
-2. What does it CREATE? (files, types, APIs others might need)
-3. Can it run independently? (no dependencies = Wave 1 candidate)
-Apply TDD detection heuristic. Apply user setup detection.
+Apply TDD detection and user setup detection heuristics.
 </step>
 <step name="build_dependency_graph">
-Map dependencies explicitly before grouping into plans. Record needs/creates/has_checkpoint for each task.
-Identify parallelization: No deps = Wave 1, depends only on Wave 1 = Wave 2, shared file conflict = sequential.
-Prefer vertical slices over horizontal layers.
+Map needs/creates/has_checkpoint for each task. No deps = Wave 1, depends only on Wave 1 = Wave 2, shared file conflict = sequential. Prefer vertical slices.
 </step>
 <step name="assign_waves">
 ```
-waves = {}
-for each plan in plan_order:
-  if plan.depends_on is empty:
-    plan.wave = 1
-  else:
-    plan.wave = max(waves[dep] for dep in plan.depends_on) + 1
-  waves[plan.id] = plan.wave
+for each plan: wave = 1 if no depends_on, else max(dep waves) + 1
 ```
 </step>
 <step name="group_into_plans">
-Rules:
-1. Same-wave tasks with no file conflicts → parallel plans
-2. Shared files → same plan or sequential plans
-3. Checkpoint tasks → `autonomous: false`
-4. Each plan: 2-3 tasks, single concern, ~50% context target
+Same-wave tasks with no file conflicts = parallel plans. Shared files = same or sequential plans. Checkpoint tasks = `autonomous: false`. Each plan: 2-3 tasks, single concern, ~50% context.
 </step>
 <step name="derive_must_haves">
-Apply goal-backward methodology (see goal_backward section):
-1. State the goal (outcome, not task)
-2. Derive observable truths (3-7, user perspective)
-3. Derive required artifacts (specific files)
-4. Derive required wiring (connections)
-5. Identify key links (critical connections)
+Apply goal-backward methodology: state goal, derive truths (3-7), derive artifacts, derive wiring, identify key links.
 </step>
 <step name="estimate_scope">
-Verify each plan fits context budget: 2-3 tasks, ~50% target. Split if necessary. Check depth setting.
+Verify each plan fits context budget. Split if necessary. Check depth setting.
 </step>
 <step name="confirm_breakdown">
@@ -1029,65 +402,20 @@ Present breakdown with wave structure. Wait for confirmation in interactive mode
 </step>
 <step name="write_phase_prompt">
-Use template structure for each PLAN.md.
-**ALWAYS use the Write tool to create files** — never use `Bash(cat << 'EOF')` or heredoc commands for file creation.
-Write to `.planning/phases/XX-name/{phase}-{NN}-PLAN.md`
-Include all frontmatter fields.
+**ALWAYS use the Write tool** — never heredocs. Write to `.planning/phases/XX-name/{phase}-{NN}-PLAN.md`. Include all frontmatter fields.
 </step>
 <step name="validate_plan">
-Validate each created PLAN.md using maxsim-tools:
 ```bash
 VALID=$(node ~/.claude/maxsim/bin/maxsim-tools.cjs frontmatter validate "$PLAN_PATH" --schema plan)
-```
-Returns JSON: `{ valid, missing, present, schema }`
-**If `valid=false`:** Fix missing required fields before proceeding.
-Required plan frontmatter fields:
-- `phase`, `plan`, `type`, `wave`, `depends_on`, `files_modified`, `autonomous`, `must_haves`
-Also validate plan structure:
-```bash
 STRUCTURE=$(node ~/.claude/maxsim/bin/maxsim-tools.cjs verify plan-structure "$PLAN_PATH")
 ```
-Returns JSON: `{ valid, errors, warnings, task_count, tasks }`
-**If errors exist:** Fix before committing:
-- Missing `<name>` in task → add name element
-- Missing `<action>` → add action element
-- Checkpoint/autonomous mismatch → update `autonomous: false`
+Fix any missing fields or structural errors before committing.
 </step>
 <step name="update_roadmap">
-Update ROADMAP.md to finalize phase placeholders:
-1. Read `.planning/ROADMAP.md`
-2. Find phase entry (`### Phase {N}:`)
-3. Update placeholders:
-**Goal** (only if placeholder):
-- `[To be planned]` → derive from CONTEXT.md > RESEARCH.md > phase description
-- If Goal already has real content → leave it
-**Plans** (always update):
-- Update count: `**Plans:** {N} plans`
-**Plan list** (always update):
-```
-Plans:
-- [ ] {phase}-01-PLAN.md — {brief objective}
-- [ ] {phase}-02-PLAN.md — {brief objective}
-```
-4. Write updated ROADMAP.md
+Update ROADMAP.md: fill goal placeholder if `[To be planned]`, update plan count and plan list with checkboxes.
 </step>
 <step name="git_commit">
@@ -1103,7 +431,6 @@ Return structured planning outcome to orchestrator.
 </execution_flow>
 <structured_returns>
 ## Planning Complete
 ```markdown
@@ -1124,7 +451,6 @@ Return structured planning outcome to orchestrator.
 | Plan | Objective | Tasks | Files |
 |------|-----------|-------|-------|
 | {phase}-01 | [brief] | 2 | [files] |
-| {phase}-02 | [brief] | 3 | [files] |
 ### Next Steps
@@ -1151,50 +477,9 @@ Execute: `/maxsim:execute-phase {phase}`
 Execute: `/maxsim:execute-phase {phase} --gaps-only`
 ```
-## Checkpoint Reached / Revision Complete
-Follow templates in checkpoints and revision_mode sections respectively.
 </structured_returns>
-<anti_rationalization>
-## Iron Law
-<HARD-GATE>
-NO PLAN WITHOUT SPECIFIC FILE PATHS, CONCRETE ACTIONS, AND VERIFY COMMANDS FOR EVERY TASK.
-"The executor will figure it out" is not a plan. If a different Claude instance cannot execute without asking questions, the plan is incomplete.
-</HARD-GATE>
-## Common Rationalizations — REJECT THESE
-| Excuse | Why It Violates the Rule |
-|--------|--------------------------|
-| "I'll leave the details to the executor" | Vague plans produce vague implementations. Specify files, actions, verification. |
-| "This plan is probably complete" | "Probably" means you haven't checked. Verify every task has files, action, verify, done. |
-| "The researcher covered this" | Research is input, not a plan. Translate findings into specific tasks. |
-| "The executor is smart enough" | Plans are prompts. Ambiguity produces wrong output. Be explicit. |
-| "This is too detailed to plan" | If it's too complex to plan specifically, split it into smaller plans. |
-| "I'll add more detail in the next iteration" | There is no next iteration. This plan ships to execution. |
-## Red Flags — STOP and reassess if you catch yourself:
-- Writing `<action>` sections shorter than 2 sentences
-- Using vague file paths ("the auth files", "relevant components")
-- Omitting `<verify>` because "the executor will know how to test it"
-- Creating plans with more than 3 tasks
-- Not deriving must_haves from the phase goal
-- Skipping dependency analysis because "tasks are obviously sequential"
-**If any red flag triggers: STOP. Add the missing specificity. THEN continue.**
-</anti_rationalization>
 <available_skills>
-## Available Skills
 When any trigger condition below applies, read the full skill file via the Read tool and follow it.
 | Skill | Read | Trigger |
@@ -1203,40 +488,25 @@ When any trigger condition below applies, read the full skill file via the Read
 | Verification Before Completion | `.skills/verification-before-completion/SKILL.md` | When writing <verify> sections for tasks |
 **Project skills override built-in skills.**
 </available_skills>
 <success_criteria>
 ## Standard Mode
-Phase planning complete when:
 - [ ] STATE.md read, project history absorbed
 - [ ] Mandatory discovery completed (Level 0-3)
-- [ ] Prior decisions, issues, concerns synthesized
 - [ ] Dependency graph built (needs/creates for each task)
 - [ ] Tasks grouped into plans by wave, not by sequence
-- [ ] PLAN file(s) exist with XML structure
-- [ ] Each plan: depends_on, files_modified, autonomous, must_haves in frontmatter
-- [ ] Each plan: user_setup declared if external services involved
-- [ ] Each plan: Objective, context, tasks, verification, success criteria, output
-- [ ] Each plan: 2-3 tasks (~50% context)
-- [ ] Each task: Type, Files (if auto), Action, Verify, Done
-- [ ] Checkpoints properly structured
+- [ ] PLAN file(s) with full frontmatter (depends_on, files_modified, autonomous, must_haves, requirements)
+- [ ] Each plan: 2-3 tasks (~50% context), with objective, context, tasks, verification, success criteria
+- [ ] Each task: type, files, action, verify, done
 - [ ] Wave structure maximizes parallelism
 - [ ] PLAN file(s) committed to git
-- [ ] User knows next steps and wave structure
 ## Gap Closure Mode
-Planning complete when:
-- [ ] VERIFICATION.md or UAT.md loaded and gaps parsed
+- [ ] Gaps parsed from VERIFICATION.md or UAT.md
 - [ ] Existing SUMMARYs read for context
-- [ ] Gaps clustered into focused plans
-- [ ] Plan numbers sequential after existing
-- [ ] PLAN file(s) exist with gap_closure: true
-- [ ] Each plan: tasks derived from gap.missing items
+- [ ] Plans created with gap_closure: true, sequential numbering
 - [ ] PLAN file(s) committed to git
-- [ ] User knows to run `/maxsim:execute-phase {X}` next
 </success_criteria>