npm - @undeemed/get-shit-done-codex - Versions diffs - 1.6.12 → 1.20.3 - Mend

@undeemed/get-shit-done-codex 1.6.12 → 1.20.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (114) hide show

package/README.md +40 -7
package/agents/gsd-codebase-mapper.md +761 -0
package/agents/gsd-debugger.md +1198 -0
package/agents/gsd-executor.md +419 -0
package/agents/gsd-integration-checker.md +423 -0
package/agents/gsd-phase-researcher.md +469 -0
package/agents/gsd-plan-checker.md +622 -0
package/agents/gsd-planner.md +1159 -0
package/agents/gsd-project-researcher.md +618 -0
package/agents/gsd-research-synthesizer.md +236 -0
package/agents/gsd-roadmapper.md +639 -0
package/agents/gsd-verifier.md +541 -0
package/bin/install.js +108 -102
package/commands/gsd/add-phase.md +17 -185
package/commands/gsd/add-todo.md +23 -163
package/commands/gsd/audit-milestone.md +3 -219
package/commands/gsd/check-todos.md +20 -196
package/commands/gsd/cleanup.md +18 -0
package/commands/gsd/complete-milestone.md +2 -2
package/commands/gsd/debug.md +13 -0
package/commands/gsd/discuss-phase.md +13 -6
package/commands/gsd/execute-phase.md +4 -266
package/commands/gsd/health.md +22 -0
package/commands/gsd/help.md +8 -369
package/commands/gsd/insert-phase.md +9 -203
package/commands/gsd/join-discord.md +18 -0
package/commands/gsd/list-phase-assumptions.md +4 -4
package/commands/gsd/map-codebase.md +1 -1
package/commands/gsd/new-milestone.md +16 -682
package/commands/gsd/new-project.md +12 -866
package/commands/gsd/new-project.md.bak +1041 -0
package/commands/gsd/pause-work.md +17 -105
package/commands/gsd/plan-milestone-gaps.md +3 -247
package/commands/gsd/plan-phase.md +13 -444
package/commands/gsd/progress.md +5 -337
package/commands/gsd/quick.md +40 -0
package/commands/gsd/reapply-patches.md +110 -0
package/commands/gsd/remove-phase.md +9 -315
package/commands/gsd/research-phase.md +27 -20
package/commands/gsd/resume-work.md +2 -2
package/commands/gsd/set-profile.md +34 -0
package/commands/gsd/settings.md +36 -0
package/commands/gsd/update.md +25 -160
package/commands/gsd/verify-work.md +6 -186
package/get-shit-done/bin/gsd-tools.cjs +5243 -0
package/get-shit-done/bin/gsd-tools.test.cjs +2273 -0
package/get-shit-done/references/checkpoints.md +270 -283
package/get-shit-done/references/decimal-phase-calculation.md +65 -0
package/get-shit-done/references/git-integration.md +7 -13
package/get-shit-done/references/git-planning-commit.md +38 -0
package/get-shit-done/references/model-profile-resolution.md +34 -0
package/get-shit-done/references/model-profiles.md +92 -0
package/get-shit-done/references/phase-argument-parsing.md +61 -0
package/get-shit-done/references/planning-config.md +196 -0
package/get-shit-done/references/questioning.md +5 -1
package/get-shit-done/references/verification-patterns.md +17 -0
package/get-shit-done/templates/DEBUG.md +4 -4
package/get-shit-done/templates/UAT.md +1 -1
package/get-shit-done/templates/codebase/architecture.md +1 -1
package/get-shit-done/templates/codebase/concerns.md +1 -1
package/get-shit-done/templates/codebase/conventions.md +1 -1
package/get-shit-done/templates/codebase/structure.md +9 -9
package/get-shit-done/templates/config.json +10 -0
package/get-shit-done/templates/context.md +7 -15
package/get-shit-done/templates/continue-here.md +1 -1
package/get-shit-done/templates/phase-prompt.md +32 -41
package/get-shit-done/templates/planner-subagent-prompt.md +4 -4
package/get-shit-done/templates/project.md +1 -1
package/get-shit-done/templates/research-project/ARCHITECTURE.md +1 -1
package/get-shit-done/templates/research.md +27 -4
package/get-shit-done/templates/state.md +1 -31
package/get-shit-done/templates/summary-complex.md +59 -0
package/get-shit-done/templates/summary-minimal.md +41 -0
package/get-shit-done/templates/summary-standard.md +48 -0
package/get-shit-done/templates/summary.md +5 -28
package/get-shit-done/templates/user-setup.md +8 -20
package/get-shit-done/templates/verification-report.md +3 -3
package/get-shit-done/workflows/add-phase.md +111 -0
package/get-shit-done/workflows/add-todo.md +157 -0
package/get-shit-done/workflows/audit-milestone.md +242 -0
package/get-shit-done/workflows/check-todos.md +176 -0
package/get-shit-done/workflows/cleanup.md +152 -0
package/get-shit-done/workflows/complete-milestone.md +225 -301
package/get-shit-done/workflows/diagnose-issues.md +3 -17
package/get-shit-done/workflows/discovery-phase.md +11 -15
package/get-shit-done/workflows/discuss-phase.md +105 -42
package/get-shit-done/workflows/execute-phase.md +205 -349
package/get-shit-done/workflows/execute-plan.md +179 -1569
package/get-shit-done/workflows/health.md +156 -0
package/get-shit-done/workflows/help.md +486 -0
package/get-shit-done/workflows/insert-phase.md +129 -0
package/get-shit-done/workflows/list-phase-assumptions.md +9 -9
package/get-shit-done/workflows/map-codebase.md +56 -18
package/get-shit-done/workflows/new-milestone.md +373 -0
package/get-shit-done/workflows/new-project.md +1113 -0
package/get-shit-done/workflows/pause-work.md +122 -0
package/get-shit-done/workflows/plan-milestone-gaps.md +256 -0
package/get-shit-done/workflows/plan-phase.md +448 -0
package/get-shit-done/workflows/progress.md +393 -0
package/get-shit-done/workflows/quick.md +444 -0
package/get-shit-done/workflows/remove-phase.md +154 -0
package/get-shit-done/workflows/research-phase.md +74 -0
package/get-shit-done/workflows/resume-project.md +18 -23
package/get-shit-done/workflows/set-profile.md +80 -0
package/get-shit-done/workflows/settings.md +200 -0
package/get-shit-done/workflows/transition.md +78 -103
package/get-shit-done/workflows/update.md +214 -0
package/get-shit-done/workflows/verify-phase.md +109 -496
package/get-shit-done/workflows/verify-work.md +22 -15
package/hooks/dist/gsd-check-update.js +66 -0
package/hooks/dist/gsd-statusline.js +91 -0
package/package.json +19 -3
package/scripts/build-hooks.js +42 -0
package/commands/gsd/whats-new.md +0 -124

package/get-shit-done/workflows/execute-plan.md CHANGED Viewed

@@ -4,1230 +4,296 @@ Execute a phase prompt (PLAN.md) and create the outcome summary (SUMMARY.md).
 <required_reading>
 Read STATE.md before any operation to load project context.
+Read config.json for planning behavior settings.
+@~/.codex/get-shit-done/references/git-integration.md
 </required_reading>
 <process>
-<step name="load_project_state" priority="first">
-Before any operation, read project state:
+<step name="init_context" priority="first">
+Load execution context (uses `init execute-phase` for full context, including file contents):
 ```bash
-cat .planning/STATE.md 2>/dev/null
+INIT=$(node ~/.codex/get-shit-done/bin/gsd-tools.cjs init execute-phase "${PHASE}" --include state,config)
 ```
-**If file exists:** Parse and internalize:
-- Current position (phase, plan, status)
-- Accumulated decisions (constraints on this execution)
-- Blockers/concerns (things to watch for)
-- Brief alignment status
+Extract from init JSON: `executor_model`, `commit_docs`, `phase_dir`, `phase_number`, `plans`, `summaries`, `incomplete_plans`.
-**If file missing but .planning/ exists:**
-```
-STATE.md missing but planning artifacts exist.
-Options:
-1. Reconstruct from existing artifacts
-2. Continue without project state (may lose accumulated context)
+**File contents (from --include):** `state_content`, `config_content`. Access with:
+```bash
+STATE_CONTENT=$(echo "$INIT" | jq -r '.state_content // empty')
+CONFIG_CONTENT=$(echo "$INIT" | jq -r '.config_content // empty')
 ```
-**If .planning/ doesn't exist:** Error - project not initialized.
-This ensures every execution has full project context.
+If `.planning/` missing: error.
 </step>
 <step name="identify_plan">
-Find the next plan to execute:
-- Check roadmap for "In progress" phase
-- Find plans in that phase directory
-- Identify first plan without corresponding SUMMARY
 ```bash
-cat .planning/ROADMAP.md
-# Look for phase with "In progress" status
-# Then find plans in that phase
+# Use plans/summaries from INIT JSON, or list files
 ls .planning/phases/XX-name/*-PLAN.md 2>/dev/null | sort
 ls .planning/phases/XX-name/*-SUMMARY.md 2>/dev/null | sort
 ```
-**Logic:**
-- If `01-01-PLAN.md` exists but `01-01-SUMMARY.md` doesn't → execute 01-01
-- If `01-01-SUMMARY.md` exists but `01-02-SUMMARY.md` doesn't → execute 01-02
-- Pattern: Find first PLAN file without matching SUMMARY file
-**Decimal phase handling:**
-Phase directories can be integer or decimal format:
-- Integer: `.planning/phases/01-foundation/01-01-PLAN.md`
-- Decimal: `.planning/phases/01.1-hotfix/01.1-01-PLAN.md`
-Parse phase number from path (handles both formats):
+Find first PLAN without matching SUMMARY. Decimal phases supported (`01.1-hotfix/`):
 ```bash
-# Extract phase number (handles XX or XX.Y format)
 PHASE=$(echo "$PLAN_PATH" | grep -oE '[0-9]+(\.[0-9]+)?-[0-9]+')
+# config_content already loaded via --include config in init_context
 ```
-SUMMARY naming follows same pattern:
-- Integer: `01-01-SUMMARY.md`
-- Decimal: `01.1-01-SUMMARY.md`
-Confirm with user if ambiguous.
-<config-check>
-```bash
-cat .planning/config.json 2>/dev/null
-```
-</config-check>
 <if mode="yolo">
-```
-⚡ Auto-approved: Execute {phase}-{plan}-PLAN.md
-[Plan X of Y for Phase Z]
-Starting execution...
-```
-Proceed directly to parse_segments step.
+Auto-approve: `⚡ Execute {phase}-{plan}-PLAN.md [Plan X of Y for Phase Z]` → parse_segments.
 </if>
 <if mode="interactive" OR="custom with gates.execute_next_plan true">
-Present:
-```
-Found plan to execute: {phase}-{plan}-PLAN.md
-[Plan X of Y for Phase Z]
-Proceed with execution?
-```
-Wait for confirmation before proceeding.
+Present plan identification, wait for confirmation.
 </if>
 </step>
 <step name="record_start_time">
-Record execution start time for performance tracking:
 ```bash
 PLAN_START_TIME=$(date -u +"%Y-%m-%dT%H:%M:%SZ")
 PLAN_START_EPOCH=$(date +%s)
 ```
-Store in shell variables for duration calculation at completion.
 </step>
 <step name="parse_segments">
-**Intelligent segmentation: Parse plan into execution segments.**
-Plans are divided into segments by checkpoints. Each segment is routed to optimal execution context (subagent or main).
-**1. Check for checkpoints:**
 ```bash
-# Find all checkpoints and their types
 grep -n "type=\"checkpoint" .planning/phases/XX-name/{phase}-{plan}-PLAN.md
 ```
-**2. Analyze execution strategy:**
-**If NO checkpoints found:**
-- **Fully autonomous plan** - spawn single subagent for entire plan
-- Subagent gets fresh 200k context, executes all tasks, creates SUMMARY, commits
-- Main context: Just orchestration (~5% usage)
-**If checkpoints found, parse into segments:**
-Segment = tasks between checkpoints (or start→first checkpoint, or last checkpoint→end)
-**For each segment, determine routing:**
-```
-Segment routing rules:
-IF segment has no prior checkpoint:
-  → SUBAGENT (first segment, nothing to depend on)
-IF segment follows checkpoint:human-verify:
-  → SUBAGENT (verification is just confirmation, doesn't affect next work)
-IF segment follows checkpoint:decision OR checkpoint:human-action:
-  → MAIN CONTEXT (next tasks need the decision/result)
-```
-**3. Execution pattern:**
-**Pattern A: Fully autonomous (no checkpoints)**
+**Routing by checkpoint type:**
-```
-Spawn subagent → execute all tasks → SUMMARY → commit → report back
-```
+| Checkpoints | Pattern | Execution |
+|-------------|---------|-----------|
+| None | A (autonomous) | Single subagent: full plan + SUMMARY + commit |
+| Verify-only | B (segmented) | Segments between checkpoints. After none/human-verify → SUBAGENT. After decision/human-action → MAIN |
+| Decision | C (main) | Execute entirely in main context |
-**Pattern B: Segmented with verify-only checkpoints**
+**Pattern A:** init_agent_tracking → spawn Task(subagent_type="gsd-executor", model=executor_model) with prompt: execute plan at [path], autonomous, all tasks + SUMMARY + commit, follow deviation/auth rules, report: plan name, tasks, SUMMARY path, commit hash → track agent_id → wait → update tracking → report.
-```
-Segment 1 (tasks 1-3): Spawn subagent → execute → report back
-Checkpoint 4 (human-verify): Main context → you verify → continue
-Segment 2 (tasks 5-6): Spawn NEW subagent → execute → report back
-Checkpoint 7 (human-verify): Main context → you verify → continue
-Aggregate results → SUMMARY → commit
-```
-**Pattern C: Decision-dependent (must stay in main)**
-```
-Checkpoint 1 (decision): Main context → you decide → continue in main
-Tasks 2-5: Main context (need decision from checkpoint 1)
-No segmentation benefit - execute entirely in main
-```
-**4. Why this works:**
-**Segmentation benefits:**
-- Fresh context for each autonomous segment (0% start every time)
-- Main context only for checkpoints (~10-20% total)
-- Can handle 10+ task plans if properly segmented
-- Quality impossible to degrade in autonomous segments
-**When segmentation provides no benefit:**
-- Checkpoint is decision/human-action and following tasks depend on outcome
-- Better to execute sequentially in main than break flow
-**5. Implementation:**
-**For fully autonomous plans:**
-```
-1. Run init_agent_tracking step first (see step below)
+**Pattern B:** Execute segment-by-segment. Autonomous segments: spawn subagent for assigned tasks only (no SUMMARY/commit). Checkpoints: main context. After all segments: aggregate, create SUMMARY, commit. See segment_execution.
-2. Use Task tool with subagent_type="gsd-executor":
+**Pattern C:** Execute in main using standard flow (step name="execute").
-   Prompt: "Execute plan at .planning/phases/{phase}-{plan}-PLAN.md
-   This is an autonomous plan (no checkpoints). Execute all tasks, create SUMMARY.md in phase directory, commit with message following plan's commit guidance.
-   Follow all deviation rules and authentication gate protocols from the plan.
-   When complete, report: plan name, tasks completed, SUMMARY path, commit hash."
-3. After Task tool returns with agent_id:
-   a. Write agent_id to current-agent-id.txt:
-      echo "[agent_id]" > .planning/current-agent-id.txt
-   b. Append spawn entry to agent-history.json:
-      {
-        "agent_id": "[agent_id from Task response]",
-        "task_description": "Execute full plan {phase}-{plan} (autonomous)",
-        "phase": "{phase}",
-        "plan": "{plan}",
-        "segment": null,
-        "timestamp": "[ISO timestamp]",
-        "status": "spawned",
-        "completion_timestamp": null
-      }
-4. Wait for subagent to complete
-5. After subagent completes successfully:
-   a. Update agent-history.json entry:
-      - Find entry with matching agent_id
-      - Set status: "completed"
-      - Set completion_timestamp: "[ISO timestamp]"
-   b. Clear current-agent-id.txt:
-      rm .planning/current-agent-id.txt
-6. Report completion to user
-```
-**For segmented plans (has verify-only checkpoints):**
-```
-Execute segment-by-segment:
-For each autonomous segment:
-  Spawn subagent with prompt: "Execute tasks [X-Y] from plan at .planning/phases/{phase}-{plan}-PLAN.md. Read the plan for full context and deviation rules. Do NOT create SUMMARY or commit - just execute these tasks and report results."
-  Wait for subagent completion
-For each checkpoint:
-  Execute in main context
-  Wait for user interaction
-  Continue to next segment
-After all segments complete:
-  Aggregate all results
-  Create SUMMARY.md
-  Commit with all changes
-```
-**For decision-dependent plans:**
-```
-Execute in main context (standard flow below)
-No subagent routing
-Quality maintained through small scope (2-3 tasks per plan)
-```
-See step name="segment_execution" for detailed segment execution loop.
+Fresh context per subagent preserves peak quality. Main context stays lean.
 </step>
 <step name="init_agent_tracking">
-**Initialize agent tracking for subagent resume capability.**
-Before spawning any subagents, set up tracking infrastructure:
-**1. Create/verify tracking files:**
 ```bash
-# Create agent history file if doesn't exist
 if [ ! -f .planning/agent-history.json ]; then
   echo '{"version":"1.0","max_entries":50,"entries":[]}' > .planning/agent-history.json
 fi
-# Clear any stale current-agent-id (from interrupted sessions)
-# Will be populated when subagent spawns
 rm -f .planning/current-agent-id.txt
-```
-**2. Check for interrupted agents (resume detection):**
-```bash
-# Check if current-agent-id.txt exists from previous interrupted session
 if [ -f .planning/current-agent-id.txt ]; then
   INTERRUPTED_ID=$(cat .planning/current-agent-id.txt)
   echo "Found interrupted agent: $INTERRUPTED_ID"
 fi
 ```
-**If interrupted agent found:**
-- The agent ID file exists from a previous session that didn't complete
-- This agent can potentially be resumed using Task tool's `resume` parameter
-- Present to user: "Previous session was interrupted. Resume agent [ID] or start fresh?"
-- If resume: Use Task tool with `resume` parameter set to the interrupted ID
-- If fresh: Clear the file and proceed normally
+If interrupted: ask user to resume (Task `resume` parameter) or start fresh.
-**3. Prune old entries (housekeeping):**
+**Tracking protocol:** On spawn: write agent_id to `current-agent-id.txt`, append to agent-history.json: `{"agent_id":"[id]","task_description":"[desc]","phase":"[phase]","plan":"[plan]","segment":[num|null],"timestamp":"[ISO]","status":"spawned","completion_timestamp":null}`. On completion: status → "completed", set completion_timestamp, delete current-agent-id.txt. Prune: if entries > max_entries, remove oldest "completed" (never "spawned").
-If agent-history.json has more than `max_entries`:
-- Remove oldest entries with status "completed"
-- Never remove entries with status "spawned" (may need resume)
-- Keep file under size limit for fast reads
-**When to run this step:**
-- Pattern A (fully autonomous): Before spawning the single subagent
-- Pattern B (segmented): Before the segment execution loop
-- Pattern C (main context): Skip - no subagents spawned
+Run for Pattern A/B before spawning. Pattern C: skip.
 </step>
 <step name="segment_execution">
-**Detailed segment execution loop for segmented plans.**
-**This step applies ONLY to segmented plans (Pattern B: has checkpoints, but they're verify-only).**
-For Pattern A (fully autonomous) and Pattern C (decision-dependent), skip this step.
-**Execution flow:**
-````
-1. Parse plan to identify segments:
-   - Read plan file
-   - Find checkpoint locations: grep -n "type=\"checkpoint" PLAN.md
-   - Identify checkpoint types: grep "type=\"checkpoint" PLAN.md | grep -o 'checkpoint:[^"]*'
-   - Build segment map:
-     * Segment 1: Start → first checkpoint (tasks 1-X)
-     * Checkpoint 1: Type and location
-     * Segment 2: After checkpoint 1 → next checkpoint (tasks X+1 to Y)
-     * Checkpoint 2: Type and location
-     * ... continue for all segments
-2. For each segment in order:
-   A. Determine routing (apply rules from parse_segments):
-      - No prior checkpoint? → Subagent
-      - Prior checkpoint was human-verify? → Subagent
-      - Prior checkpoint was decision/human-action? → Main context
-   B. If routing = Subagent:
-      ```
-      Spawn Task tool with subagent_type="gsd-executor":
-      Prompt: "Execute tasks [task numbers/names] from plan at [plan path].
-      **Context:**
-      - Read the full plan for objective, context files, and deviation rules
-      - You are executing a SEGMENT of this plan (not the full plan)
-      - Other segments will be executed separately
+Pattern B only (verify-only checkpoints). Skip for A/C.
-      **Your responsibilities:**
-      - Execute only the tasks assigned to you
-      - Follow all deviation rules and authentication gate protocols
-      - Track deviations for later Summary
-      - DO NOT create SUMMARY.md (will be created after all segments complete)
-      - DO NOT commit (will be done after all segments complete)
+1. Parse segment map: checkpoint locations and types
+2. Per segment:
+   - Subagent route: spawn gsd-executor for assigned tasks only. Prompt: task range, plan path, read full plan for context, execute assigned tasks, track deviations, NO SUMMARY/commit. Track via agent protocol.
+   - Main route: execute tasks using standard flow (step name="execute")
+3. After ALL segments: aggregate files/deviations/decisions → create SUMMARY.md → commit → self-check:
+   - Verify key-files.created exist on disk with `[ -f ]`
+   - Check `git log --oneline --all --grep="{phase}-{plan}"` returns ≥1 commit
+   - Append `## Self-Check: PASSED` or `## Self-Check: FAILED` to SUMMARY
-      **Report back:**
-      - Tasks completed
-      - Files created/modified
-      - Deviations encountered
-      - Any issues or blockers"
+   **Known Codex CLI bug (classifyHandoffIfNeeded):** If any segment agent reports "failed" with `classifyHandoffIfNeeded is not defined`, this is a Codex CLI runtime bug — not a real failure. Run spot-checks; if they pass, treat as successful.
-      **After Task tool returns with agent_id:**
-      1. Write agent_id to current-agent-id.txt:
-         echo "[agent_id]" > .planning/current-agent-id.txt
-      2. Append spawn entry to agent-history.json:
-         {
-           "agent_id": "[agent_id from Task response]",
-           "task_description": "Execute tasks [X-Y] from plan {phase}-{plan}",
-           "phase": "{phase}",
-           "plan": "{plan}",
-           "segment": [segment_number],
-           "timestamp": "[ISO timestamp]",
-           "status": "spawned",
-           "completion_timestamp": null
-         }
-      Wait for subagent to complete
-      Capture results (files changed, deviations, etc.)
-      **After subagent completes successfully:**
-      1. Update agent-history.json entry:
-         - Find entry with matching agent_id
-         - Set status: "completed"
-         - Set completion_timestamp: "[ISO timestamp]"
-      2. Clear current-agent-id.txt:
-         rm .planning/current-agent-id.txt
-      ```
-   C. If routing = Main context:
-      Execute tasks in main using standard execution flow (step name="execute")
-      Track results locally
-   D. After segment completes (whether subagent or main):
-      Continue to next checkpoint/segment
-3. After ALL segments complete:
-   A. Aggregate results from all segments:
-      - Collect files created/modified from all segments
-      - Collect deviations from all segments
-      - Collect decisions from all checkpoints
-      - Merge into complete picture
-   B. Create SUMMARY.md:
-      - Use aggregated results
-      - Document all work from all segments
-      - Include deviations from all segments
-      - Note which segments were subagented
-   C. Commit:
-      - Stage all files from all segments
-      - Stage SUMMARY.md
-      - Commit with message following plan guidance
-      - Include note about segmented execution if relevant
-   D. Report completion
-**Example execution trace:**
-````
-Plan: 01-02-PLAN.md (8 tasks, 2 verify checkpoints)
-Parsing segments...
-- Segment 1: Tasks 1-3 (autonomous)
-- Checkpoint 4: human-verify
-- Segment 2: Tasks 5-6 (autonomous)
-- Checkpoint 7: human-verify
-- Segment 3: Task 8 (autonomous)
-Routing analysis:
-- Segment 1: No prior checkpoint → SUBAGENT ✓
-- Checkpoint 4: Verify only → MAIN (required)
-- Segment 2: After verify → SUBAGENT ✓
-- Checkpoint 7: Verify only → MAIN (required)
-- Segment 3: After verify → SUBAGENT ✓
-Execution:
-[1] Spawning subagent for tasks 1-3...
-→ Subagent completes: 3 files modified, 0 deviations
-[2] Executing checkpoint 4 (human-verify)...
-╔═══════════════════════════════════════════════════════╗
-║  CHECKPOINT: Verification Required                    ║
-╚═══════════════════════════════════════════════════════╝
-Progress: 3/8 tasks complete
-Task: Verify database schema
-Built: User and Session tables with relations
-How to verify:
-  1. Check src/db/schema.ts for correct types
-────────────────────────────────────────────────────────
-→ YOUR ACTION: Type "approved" or describe issues
-────────────────────────────────────────────────────────
-User: "approved"
-[3] Spawning subagent for tasks 5-6...
-→ Subagent completes: 2 files modified, 1 deviation (added error handling)
-[4] Executing checkpoint 7 (human-verify)...
-User: "approved"
-[5] Spawning subagent for task 8...
-→ Subagent completes: 1 file modified, 0 deviations
-Aggregating results...
-- Total files: 6 modified
-- Total deviations: 1
-- Segmented execution: 3 subagents, 2 checkpoints
-Creating SUMMARY.md...
-Committing...
-✓ Complete
-````
-**Benefits of this pattern:**
-- Main context usage: ~20% (just orchestration + checkpoints)
-- Subagent 1: Fresh 0-30% (tasks 1-3)
-- Subagent 2: Fresh 0-30% (tasks 5-6)
-- Subagent 3: Fresh 0-20% (task 8)
-- All autonomous work: Peak quality
-- Can handle large plans with many tasks if properly segmented
-**When NOT to use segmentation:**
-- Plan has decision/human-action checkpoints that affect following tasks
-- Following tasks depend on checkpoint outcome
-- Better to execute in main sequentially in those cases
 </step>
 <step name="load_prompt">
-Read the plan prompt:
 ```bash
 cat .planning/phases/XX-name/{phase}-{plan}-PLAN.md
-````
-This IS the execution instructions. Follow it exactly.
-**If plan references CONTEXT.md:**
-The CONTEXT.md file provides the user's vision for this phase — how they imagine it working, what's essential, and what's out of scope. Honor this context throughout execution.
+```
+This IS the execution instructions. Follow exactly. If plan references CONTEXT.md: honor user's vision throughout.
 </step>
 <step name="previous_phase_check">
-Before executing, check if previous phase had issues:
 ```bash
-# Find previous phase summary
-ls .planning/phases/*/SUMMARY.md 2>/dev/null | sort -r | head -2 | tail -1
+node ~/.codex/get-shit-done/bin/gsd-tools.cjs phases list --type summaries --raw
+# Extract the second-to-last summary from the JSON result
 ```
-If previous phase SUMMARY.md has "Issues Encountered" != "None" or "Next Phase Readiness" mentions blockers:
-Use AskUserQuestion:
-- header: "Previous Issues"
-- question: "Previous phase had unresolved items: [summary]. How to proceed?"
-- options:
-  - "Proceed anyway" - Issues won't block this phase
-  - "Address first" - Let's resolve before continuing
-  - "Review previous" - Show me the full summary
-    </step>
+If previous SUMMARY has unresolved "Issues Encountered" or "Next Phase Readiness" blockers: AskUserQuestion(header="Previous Issues", options: "Proceed anyway" | "Address first" | "Review previous").
+</step>
 <step name="execute">
-Execute each task in the prompt. **Deviations are normal** - handle them automatically using embedded rules below.
-1. Read the @context files listed in the prompt
-2. For each task:
-   **If `type="auto"`:**
-   **Before executing:** Check if task has `tdd="true"` attribute:
-   - If yes: Follow TDD execution flow (see `<tdd_execution>`) - RED → GREEN → REFACTOR cycle with atomic commits per stage
-   - If no: Standard implementation
-   - Work toward task completion
-   - **If CLI/API returns authentication error:** Handle as authentication gate (see below)
-   - **When you discover additional work not in plan:** Apply deviation rules (see below) automatically
-   - Continue implementing, applying rules as needed
-   - Run the verification
-   - Confirm done criteria met
-   - **Commit the task** (see `<task_commit>` below)
-   - Track task completion and commit hash for Summary documentation
-   - Continue to next task
-   **If `type="checkpoint:*"`:**
-   - STOP immediately (do not continue to next task)
-   - Execute checkpoint_protocol (see below)
-   - Wait for user response
-   - Verify if possible (check files, env vars, etc.)
-   - Only after user confirmation: continue to next task
-3. Run overall verification checks from `<verification>` section
-4. Confirm all success criteria from `<success_criteria>` section met
-5. Document all deviations in Summary (automatic - see deviation_documentation below)
-   </step>
+Deviations are normal — handle via rules below.
+1. Read @context files from prompt
+2. Per task:
+   - `type="auto"`: if `tdd="true"` → TDD execution. Implement with deviation rules + auth gates. Verify done criteria. Commit (see task_commit). Track hash for Summary.
+   - `type="checkpoint:*"`: STOP → checkpoint_protocol → wait for user → continue only after confirmation.
+3. Run `<verification>` checks
+4. Confirm `<success_criteria>` met
+5. Document deviations in Summary
+</step>
 <authentication_gates>
-## Handling Authentication Errors During Execution
-**When you encounter authentication errors during `type="auto"` task execution:**
-This is NOT a failure. Authentication gates are expected and normal. Handle them dynamically:
-**Authentication error indicators:**
-- CLI returns: "Error: Not authenticated", "Not logged in", "Unauthorized", "401", "403"
-- API returns: "Authentication required", "Invalid API key", "Missing credentials"
-- Command fails with: "Please run {tool} login" or "Set {ENV_VAR} environment variable"
-**Authentication gate protocol:**
-1. **Recognize it's an auth gate** - Not a bug, just needs credentials
-2. **STOP current task execution** - Don't retry repeatedly
-3. **Create dynamic checkpoint:human-action** - Present it to user immediately
-4. **Provide exact authentication steps** - CLI commands, where to get keys
-5. **Wait for user to authenticate** - Let them complete auth flow
-6. **Verify authentication works** - Test that credentials are valid
-7. **Retry the original task** - Resume automation where you left off
-8. **Continue normally** - Don't treat this as an error in Summary
-**Example: Vercel deployment hits auth error**
-```
-Task 3: Deploy to Vercel
-Running: vercel --yes
-Error: Not authenticated. Please run 'vercel login'
-[Create checkpoint dynamically]
-╔═══════════════════════════════════════════════════════╗
-║  CHECKPOINT: Action Required                          ║
-╚═══════════════════════════════════════════════════════╝
-Progress: 2/8 tasks complete
-Task: Authenticate Vercel CLI
-Attempted: vercel --yes
-Error: Not authenticated
-What you need to do:
-  1. Run: vercel login
-  2. Complete browser authentication
-I'll verify: vercel whoami returns your account
-────────────────────────────────────────────────────────
-→ YOUR ACTION: Type "done" when authenticated
-────────────────────────────────────────────────────────
-[Wait for user response]
-[User types "done"]
-Verifying authentication...
-Running: vercel whoami
-✓ Authenticated as: user@example.com
-Retrying deployment...
-Running: vercel --yes
-✓ Deployed to: https://myapp-abc123.vercel.app
-Task 3 complete. Continuing to task 4...
-```
-**In Summary documentation:**
-Document authentication gates as normal flow, not deviations:
-```markdown
 ## Authentication Gates
-During execution, I encountered authentication requirements:
-1. Task 3: Vercel CLI required authentication
-   - Paused for `vercel login`
-   - Resumed after authentication
-   - Deployed successfully
-These are normal gates, not errors.
-```
-**Key principles:**
-- Authentication gates are NOT failures or bugs
-- They're expected interaction points during first-time setup
-- Handle them gracefully and continue automation after unblocked
-- Don't mark tasks as "failed" or "incomplete" due to auth gates
-- Document them as normal flow, separate from deviations
-  </authentication_gates>
-<deviation_rules>
-## Automatic Deviation Handling
-**While executing tasks, you WILL discover work not in the plan.** This is normal.
-Apply these rules automatically. Track all deviations for Summary documentation.
----
-**RULE 1: Auto-fix bugs**
-**Trigger:** Code doesn't work as intended (broken behavior, incorrect output, errors)
-**Action:** Fix immediately, track for Summary
-**Examples:**
-- Wrong SQL query returning incorrect data
-- Logic errors (inverted condition, off-by-one, infinite loop)
-- Type errors, null pointer exceptions, undefined references
-- Broken validation (accepts invalid input, rejects valid input)
-- Security vulnerabilities (SQL injection, XSS, CSRF, insecure auth)
-- Race conditions, deadlocks
-- Memory leaks, resource leaks
-**Process:**
-1. Fix the bug inline
-2. Add/update tests to prevent regression
-3. Verify fix works
-4. Continue task
-5. Track in deviations list: `[Rule 1 - Bug] [description]`
-**No user permission needed.** Bugs must be fixed for correct operation.
----
-**RULE 2: Auto-add missing critical functionality**
-**Trigger:** Code is missing essential features for correctness, security, or basic operation
-**Action:** Add immediately, track for Summary
-**Examples:**
-- Missing error handling (no try/catch, unhandled promise rejections)
-- No input validation (accepts malicious data, type coercion issues)
-- Missing null/undefined checks (crashes on edge cases)
-- No authentication on protected routes
-- Missing authorization checks (users can access others' data)
-- No CSRF protection, missing CORS configuration
-- No rate limiting on public APIs
-- Missing required database indexes (causes timeouts)
-- No logging for errors (can't debug production)
-**Process:**
-1. Add the missing functionality inline
-2. Add tests for the new functionality
-3. Verify it works
-4. Continue task
-5. Track in deviations list: `[Rule 2 - Missing Critical] [description]`
-**Critical = required for correct/secure/performant operation**
-**No user permission needed.** These are not "features" - they're requirements for basic correctness.
----
-**RULE 3: Auto-fix blocking issues**
-**Trigger:** Something prevents you from completing current task
-**Action:** Fix immediately to unblock, track for Summary
-**Examples:**
-- Missing dependency (package not installed, import fails)
-- Wrong types blocking compilation
-- Broken import paths (file moved, wrong relative path)
-- Missing environment variable (app won't start)
-- Database connection config error
-- Build configuration error (webpack, tsconfig, etc.)
-- Missing file referenced in code
-- Circular dependency blocking module resolution
-**Process:**
+Auth errors during execution are NOT failures — they're expected interaction points.
-1. Fix the blocking issue
-2. Verify task can now proceed
-3. Continue task
-4. Track in deviations list: `[Rule 3 - Blocking] [description]`
+**Indicators:** "Not authenticated", "Unauthorized", 401/403, "Please run {tool} login", "Set {ENV_VAR}"
-**No user permission needed.** Can't complete task without fixing blocker.
+**Protocol:**
+1. Recognize auth gate (not a bug)
+2. STOP task execution
+3. Create dynamic checkpoint:human-action with exact auth steps
+4. Wait for user to authenticate
+5. Verify credentials work
+6. Retry original task
+7. Continue normally
----
+**Example:** `vercel --yes` → "Not authenticated" → checkpoint asking user to `vercel login` → verify with `vercel whoami` → retry deploy → continue
-**RULE 4: Ask about architectural changes**
+**In Summary:** Document as normal flow under "## Authentication Gates", not as deviations.
-**Trigger:** Fix/addition requires significant structural modification
+</authentication_gates>
-**Action:** STOP, present to user, wait for decision
-**Examples:**
+<deviation_rules>
-- Adding new database table (not just column)
-- Major schema changes (changing primary key, splitting tables)
-- Introducing new service layer or architectural pattern
-- Switching libraries/frameworks (React → Vue, REST → GraphQL)
-- Changing authentication approach (sessions → JWT)
-- Adding new infrastructure (message queue, cache layer, CDN)
-- Changing API contracts (breaking changes to endpoints)
-- Adding new deployment environment
+## Deviation Rules
-**Process:**
+You WILL discover unplanned work. Apply automatically, track all for Summary.
-1. STOP current task
-2. Present clearly:
+| Rule | Trigger | Action | Permission |
+|------|---------|--------|------------|
+| **1: Bug** | Broken behavior, errors, wrong queries, type errors, security vulns, race conditions, leaks | Fix → test → verify → track `[Rule 1 - Bug]` | Auto |
+| **2: Missing Critical** | Missing essentials: error handling, validation, auth, CSRF/CORS, rate limiting, indexes, logging | Add → test → verify → track `[Rule 2 - Missing Critical]` | Auto |
+| **3: Blocking** | Prevents completion: missing deps, wrong types, broken imports, missing env/config/files, circular deps | Fix blocker → verify proceeds → track `[Rule 3 - Blocking]` | Auto |
+| **4: Architectural** | Structural change: new DB table, schema change, new service, switching libs, breaking API, new infra | STOP → present decision (below) → track `[Rule 4 - Architectural]` | Ask user |
+**Rule 4 format:**
 ```
 ⚠️ Architectural Decision Needed
 Current task: [task name]
-Discovery: [what you found that prompted this]
-Proposed change: [architectural modification]
+Discovery: [what prompted this]
+Proposed change: [modification]
 Why needed: [rationale]
-Impact: [what this affects - APIs, deployment, dependencies, etc.]
-Alternatives: [other approaches, or "none apparent"]
+Impact: [what this affects]
+Alternatives: [other approaches]
 Proceed with proposed change? (yes / different approach / defer)
 ```
-3. WAIT for user response
-4. If approved: implement, track as `[Rule 4 - Architectural] [description]`
-5. If different approach: discuss and implement
-6. If deferred: note in Summary and continue without change
-**User decision required.** These changes affect system design.
----
-**RULE PRIORITY (when multiple could apply):**
-1. **If Rule 4 applies** → STOP and ask (architectural decision)
-2. **If Rules 1-3 apply** → Fix automatically, track for Summary
-3. **If genuinely unsure which rule** → Apply Rule 4 (ask user)
-**Edge case guidance:**
-- "This validation is missing" → Rule 2 (critical for security)
-- "This crashes on null" → Rule 1 (bug)
-- "Need to add table" → Rule 4 (architectural)
-- "Need to add column" → Rule 1 or 2 (depends: fixing bug or adding critical field)
-**When in doubt:** Ask yourself "Does this affect correctness, security, or ability to complete task?"
-- YES → Rules 1-3 (fix automatically)
-- MAYBE → Rule 4 (ask user)
+**Priority:** Rule 4 (STOP) > Rules 1-3 (auto) > unsure → Rule 4
+**Edge cases:** missing validation → R2 | null crash → R1 | new table → R4 | new column → R1/2
+**Heuristic:** Affects correctness/security/completion? → R1-3. Maybe? → R4.
 </deviation_rules>
 <deviation_documentation>
-## Documenting Deviations in Summary
-After all tasks complete, Summary MUST include deviations section.
-**If no deviations:**
-```markdown
-## Deviations from Plan
-None - plan executed exactly as written.
-```
-**If deviations occurred:**
+## Documenting Deviations
-```markdown
-## Deviations from Plan
+Summary MUST include deviations section. None? → `## Deviations from Plan\n\nNone - plan executed exactly as written.`
-### Auto-fixed Issues
+Per deviation: **[Rule N - Category] Title** — Found during: Task X | Issue | Fix | Files modified | Verification | Commit hash
-**1. [Rule 1 - Bug] Fixed case-sensitive email uniqueness constraint**
+End with: **Total deviations:** N auto-fixed (breakdown). **Impact:** assessment.
-- **Found during:** Task 4 (Follow/unfollow API implementation)
-- **Issue:** User.email unique constraint was case-sensitive - Test@example.com and test@example.com were both allowed, causing duplicate accounts
-- **Fix:** Changed to `CREATE UNIQUE INDEX users_email_unique ON users (LOWER(email))`
-- **Files modified:** src/models/User.ts, migrations/003_fix_email_unique.sql
-- **Verification:** Unique constraint test passes - duplicate emails properly rejected
-- **Commit:** abc123f
-**2. [Rule 2 - Missing Critical] Added JWT expiry validation to auth middleware**
-- **Found during:** Task 3 (Protected route implementation)
-- **Issue:** Auth middleware wasn't checking token expiry - expired tokens were being accepted
-- **Fix:** Added exp claim validation in middleware, reject with 401 if expired
-- **Files modified:** src/middleware/auth.ts, src/middleware/auth.test.ts
-- **Verification:** Expired token test passes - properly rejects with 401
-- **Commit:** def456g
----
+</deviation_documentation>
-**Total deviations:** 4 auto-fixed (1 bug, 1 missing critical, 1 blocking, 1 architectural with approval)
-**Impact on plan:** All auto-fixes necessary for correctness/security/performance. No scope creep.
-```
+<tdd_plan_execution>
+## TDD Execution
-**This provides complete transparency:**
+For `type: tdd` plans — RED-GREEN-REFACTOR:
-- Every deviation documented
-- Why it was needed
-- What rule applied
-- What was done
-- User can see exactly what happened beyond the plan
+1. **Infrastructure** (first TDD plan only): detect project, install framework, config, verify empty suite
+2. **RED:** Read `<behavior>` → failing test(s) → run (MUST fail) → commit: `test({phase}-{plan}): add failing test for [feature]`
+3. **GREEN:** Read `<implementation>` → minimal code → run (MUST pass) → commit: `feat({phase}-{plan}): implement [feature]`
+4. **REFACTOR:** Clean up → tests MUST pass → commit: `refactor({phase}-{plan}): clean up [feature]`
-</deviation_documentation>
+Errors: RED doesn't fail → investigate test/existing feature. GREEN doesn't pass → debug, iterate. REFACTOR breaks → undo.
-<tdd_plan_execution>
-## TDD Plan Execution
-When executing a plan with `type: tdd` in frontmatter, follow the RED-GREEN-REFACTOR cycle for the single feature defined in the plan.
-**1. Check test infrastructure (if first TDD plan):**
-If no test framework configured:
-- Detect project type from package.json/requirements.txt/etc.
-- Install minimal test framework (Jest, pytest, Go testing, etc.)
-- Create test config file
-- Verify: run empty test suite
-- This is part of the RED phase, not a separate task
-**2. RED - Write failing test:**
-- Read `<behavior>` element for test specification
-- Create test file if doesn't exist (follow project conventions)
-- Write test(s) that describe expected behavior
-- Run tests - MUST fail (if passes, test is wrong or feature exists)
-- Commit: `test({phase}-{plan}): add failing test for [feature]`
-**3. GREEN - Implement to pass:**
-- Read `<implementation>` element for guidance
-- Write minimal code to make test pass
-- Run tests - MUST pass
-- Commit: `feat({phase}-{plan}): implement [feature]`
-**4. REFACTOR (if needed):**
-- Clean up code if obvious improvements
-- Run tests - MUST still pass
-- Commit only if changes made: `refactor({phase}-{plan}): clean up [feature]`
-**Commit pattern for TDD plans:**
-Each TDD plan produces 2-3 atomic commits:
-1. `test({phase}-{plan}): add failing test for X`
-2. `feat({phase}-{plan}): implement X`
-3. `refactor({phase}-{plan}): clean up X` (optional)
-**Error handling:**
-- If test doesn't fail in RED phase: Test is wrong or feature already exists. Investigate before proceeding.
-- If test doesn't pass in GREEN phase: Debug implementation, keep iterating until green.
-- If tests fail in REFACTOR phase: Undo refactor, commit was premature.
-**Verification:**
-After TDD plan completion, ensure:
-- All tests pass
-- Test coverage for the new behavior exists
-- No unrelated tests broken
-**Why TDD uses dedicated plans:** TDD requires 2-3 execution cycles (RED → GREEN → REFACTOR), each with file reads, test runs, and potential debugging. This consumes 40-50% of context for a single feature. Dedicated plans ensure full quality throughout the cycle.
-**Comparison:**
-- Standard plans: Multiple tasks, 1 commit per task, 2-4 commits total
-- TDD plans: Single feature, 2-3 commits for RED/GREEN/REFACTOR cycle
-See `~/.claude/get-shit-done/references/tdd.md` for TDD plan structure.
+See `~/.codex/get-shit-done/references/tdd.md` for structure.
 </tdd_plan_execution>
 <task_commit>
 ## Task Commit Protocol
-After each task completes (verification passed, done criteria met), commit immediately:
+After each task (verification passed, done criteria met), commit immediately.
-**1. Identify modified files:**
-Track files changed during this specific task (not the entire plan):
+**1. Check:** `git status --short`
+**2. Stage individually** (NEVER `git add .` or `git add -A`):
 ```bash
-git status --short
-```
-**2. Stage only task-related files:**
-Stage each file individually (NEVER use `git add .` or `git add -A`):
-```bash
-# Example - adjust to actual files modified by this task
 git add src/api/auth.ts
 git add src/types/user.ts
 ```
-**3. Determine commit type:**
-| Type | When to Use | Example |
-|------|-------------|---------|
-| `feat` | New feature, endpoint, component, functionality | feat(08-02): create user registration endpoint |
-| `fix` | Bug fix, error correction | fix(08-02): correct email validation regex |
-| `test` | Test-only changes (TDD RED phase) | test(08-02): add failing test for password hashing |
-| `refactor` | Code cleanup, no behavior change (TDD REFACTOR phase) | refactor(08-02): extract validation to helper |
-| `perf` | Performance improvement | perf(08-02): add database index for user lookups |
-| `docs` | Documentation changes | docs(08-02): add API endpoint documentation |
-| `style` | Formatting, linting fixes | style(08-02): format auth module |
-| `chore` | Config, tooling, dependencies | chore(08-02): add bcrypt dependency |
+**3. Commit type:**
-**4. Craft commit message:**
+| Type | When | Example |
+|------|------|---------|
+| `feat` | New functionality | feat(08-02): create user registration endpoint |
+| `fix` | Bug fix | fix(08-02): correct email validation regex |
+| `test` | Test-only (TDD RED) | test(08-02): add failing test for password hashing |
+| `refactor` | No behavior change (TDD REFACTOR) | refactor(08-02): extract validation to helper |
+| `perf` | Performance | perf(08-02): add database index |
+| `docs` | Documentation | docs(08-02): add API docs |
+| `style` | Formatting | style(08-02): format auth module |
+| `chore` | Config/deps | chore(08-02): add bcrypt dependency |
-Format: `{type}({phase}-{plan}): {task-name-or-description}`
-```bash
-git commit -m "{type}({phase}-{plan}): {concise task description}
-- {key change 1}
-- {key change 2}
-- {key change 3}
-"
-```
-**Examples:**
-```bash
-# Standard plan task
-git commit -m "feat(08-02): create user registration endpoint
-- POST /auth/register validates email and password
-- Checks for duplicate users
-- Returns JWT token on success
-"
-# Another standard task
-git commit -m "fix(08-02): correct email validation regex
-- Fixed regex to accept plus-addressing
-- Added tests for edge cases
-"
-```
-**Note:** TDD plans have their own commit pattern (test/feat/refactor for RED/GREEN/REFACTOR phases). See `<tdd_plan_execution>` section above.
-**5. Record commit hash:**
-After committing, capture hash for SUMMARY.md:
+**4. Format:** `{type}({phase}-{plan}): {description}` with bullet points for key changes.
+**5. Record hash:**
 ```bash
 TASK_COMMIT=$(git rev-parse --short HEAD)
-echo "Task ${TASK_NUM} committed: ${TASK_COMMIT}"
-```
-Store in array or list for SUMMARY generation:
-```bash
 TASK_COMMITS+=("Task ${TASK_NUM}: ${TASK_COMMIT}")
 ```
-**Atomic commit benefits:**
-- Each task independently revertable
-- Git bisect finds exact failing task
-- Git blame traces line to specific task context
-- Clear history for Claude in future sessions
-- Better observability for AI-automated workflow
 </task_commit>
 <step name="checkpoint_protocol">
-When encountering `type="checkpoint:*"`:
-**Critical: Claude automates everything with CLI/API before checkpoints.** Checkpoints are for verification and decisions, not manual work.
-**Display checkpoint clearly:**
-```
-╔═══════════════════════════════════════════════════════╗
-║  CHECKPOINT: [Type]                                   ║
-╚═══════════════════════════════════════════════════════╝
-Progress: {X}/{Y} tasks complete
-Task: [task name]
+On `type="checkpoint:*"`: automate everything possible first. Checkpoints are for verification/decisions only.
-[Display task-specific content based on type]
+Display: `CHECKPOINT: [Type]` box → Progress {X}/{Y} → Task name → type-specific content → `YOUR ACTION: [signal]`
-────────────────────────────────────────────────────────
-→ YOUR ACTION: [Resume signal instruction]
-────────────────────────────────────────────────────────
-```
-**For checkpoint:human-verify (90% of checkpoints):**
-```
-Built: [what was automated - deployed, built, configured]
-How to verify:
-  1. [Step 1 - exact command/URL]
-  2. [Step 2 - what to check]
-  3. [Step 3 - expected behavior]
-────────────────────────────────────────────────────────
-→ YOUR ACTION: Type "approved" or describe issues
-────────────────────────────────────────────────────────
-```
+| Type | Content | Resume signal |
+|------|---------|---------------|
+| human-verify (90%) | What was built + verification steps (commands/URLs) | "approved" or describe issues |
+| decision (9%) | Decision needed + context + options with pros/cons | "Select: option-id" |
+| human-action (1%) | What was automated + ONE manual step + verification plan | "done" |
-**For checkpoint:decision (9% of checkpoints):**
+After response: verify if specified. Pass → continue. Fail → inform, wait. WAIT for user — do NOT hallucinate completion.
-```
-Decision needed: [decision]
-Context: [why this matters]
-Options:
-1. [option-id]: [name]
-   Pros: [pros]
-   Cons: [cons]
-2. [option-id]: [name]
-   Pros: [pros]
-   Cons: [cons]
-[Resume signal - e.g., "Select: option-id"]
-```
-**For checkpoint:human-action (1% - rare, only for truly unavoidable manual steps):**
-```
-I automated: [what Claude already did via CLI/API]
-Need your help with: [the ONE thing with no CLI/API - email link, 2FA code]
-Instructions:
-[Single unavoidable step]
-I'll verify after: [verification]
-[Resume signal - e.g., "Type 'done' when complete"]
-```
-**After displaying:** WAIT for user response. Do NOT hallucinate completion. Do NOT continue to next task.
-**After user responds:**
-- Run verification if specified (file exists, env var set, tests pass, etc.)
-- If verification passes or N/A: continue to next task
-- If verification fails: inform user, wait for resolution
-See ~/.claude/get-shit-done/references/checkpoints.md for complete checkpoint guidance.
+See ~/.codex/get-shit-done/references/checkpoints.md for details.
 </step>
 <step name="checkpoint_return_for_orchestrator">
-**When spawned by an orchestrator (execute-phase or execute-plan command):**
-If you were spawned via Task tool and hit a checkpoint, you cannot directly interact with the user. Instead, RETURN to the orchestrator with structured checkpoint state so it can present to the user and spawn a fresh continuation agent.
-**Return format for checkpoints:**
-**Required in your return:**
-1. **Completed Tasks table** - Tasks done so far with commit hashes and files created
-2. **Current Task** - Which task you're on and what's blocking it
-3. **Checkpoint Details** - User-facing content (verification steps, decision options, or action instructions)
-4. **Awaiting** - What you need from the user
-**Example return:**
-```
-## CHECKPOINT REACHED
-**Type:** human-action
-**Plan:** 01-01
-**Progress:** 1/3 tasks complete
-### Completed Tasks
-| Task | Name | Commit | Files |
-|------|------|--------|-------|
-| 1 | Initialize Next.js 15 project | d6fe73f | package.json, tsconfig.json, app/ |
-### Current Task
-**Task 2:** Initialize Convex backend
-**Status:** blocked
-**Blocked by:** Convex CLI authentication required
-### Checkpoint Details
+When spawned via Task and hitting checkpoint: return structured state (cannot interact with user directly).
-**Automation attempted:**
-Ran `npx convex dev` to initialize Convex backend
+**Required return:** 1) Completed Tasks table (hashes + files) 2) Current Task (what's blocking) 3) Checkpoint Details (user-facing content) 4) Awaiting (what's needed from user)
-**Error encountered:**
-"Error: Not authenticated. Run `npx convex login` first."
-**What you need to do:**
-1. Run: `npx convex login`
-2. Complete browser authentication
-3. Run: `npx convex dev`
-4. Create project when prompted
-**I'll verify after:**
-`cat .env.local | grep CONVEX` returns the Convex URL
-### Awaiting
-Type "done" when Convex is authenticated and project created.
-```
-**After you return:**
-The orchestrator will:
-1. Parse your structured return
-2. Present checkpoint details to the user
-3. Collect user's response
-4. Spawn a FRESH continuation agent with your completed tasks state
-You will NOT be resumed. A new agent continues from where you stopped, using your Completed Tasks table to know what's done.
-**How to know if you were spawned:**
-If you're reading this workflow because an orchestrator spawned you (vs running directly), the orchestrator's prompt will include checkpoint return instructions. Follow those instructions when you hit a checkpoint.
-**If running in main context (not spawned):**
-Use the standard checkpoint_protocol - display checkpoint and wait for direct user response.
+Orchestrator parses → presents to user → spawns fresh continuation with your completed tasks state. You will NOT be resumed. In main context: use checkpoint_protocol above.
 </step>
 <step name="verification_failure_gate">
-If any task verification fails:
-STOP. Do not continue to next task.
-Present inline:
-"Verification failed for Task [X]: [task name]
-Expected: [verification criteria]
-Actual: [what happened]
-How to proceed?
-1. Retry - Try the task again
-2. Skip - Mark as incomplete, continue
-3. Stop - Pause execution, investigate"
-Wait for user decision.
-If user chose "Skip", note it in SUMMARY.md under "Issues Encountered".
+If verification fails: STOP. Present: "Verification failed for Task [X]: [name]. Expected: [criteria]. Actual: [result]." Options: Retry | Skip (mark incomplete) | Stop (investigate). If skipped → SUMMARY "Issues Encountered".
 </step>
 <step name="record_completion_time">
-Record execution end time and calculate duration:
 ```bash
 PLAN_END_TIME=$(date -u +"%Y-%m-%dT%H:%M:%SZ")
 PLAN_END_EPOCH=$(date +%s)
@@ -1243,577 +309,121 @@ else
   DURATION="${DURATION_MIN} min"
 fi
 ```
-Pass timing data to SUMMARY.md creation.
 </step>
 <step name="generate_user_setup">
-**Generate USER-SETUP.md if plan has user_setup in frontmatter.**
-Check PLAN.md frontmatter for `user_setup` field:
 ```bash
 grep -A 50 "^user_setup:" .planning/phases/XX-name/{phase}-{plan}-PLAN.md | head -50
 ```
-**If user_setup exists and is not empty:**
-Create `.planning/phases/XX-name/{phase}-USER-SETUP.md` using template from `~/.claude/get-shit-done/templates/user-setup.md`.
-**Content generation:**
-1. Parse each service in `user_setup` array
-2. For each service, generate sections:
-   - Environment Variables table (from `env_vars`)
-   - Account Setup checklist (from `account_setup`, if present)
-   - Dashboard Configuration steps (from `dashboard_config`, if present)
-   - Local Development notes (from `local_dev`, if present)
-3. Add verification section with commands to confirm setup works
-4. Set status to "Incomplete"
-**Example output:**
-```markdown
-# Phase 10: User Setup Required
-**Generated:** 2025-01-14
-**Phase:** 10-monetization
-**Status:** Incomplete
-## Environment Variables
-| Status | Variable | Source | Add to |
-|--------|----------|--------|--------|
-| [ ] | `STRIPE_SECRET_KEY` | Stripe Dashboard → Developers → API keys → Secret key | `.env.local` |
-| [ ] | `STRIPE_WEBHOOK_SECRET` | Stripe Dashboard → Developers → Webhooks → Signing secret | `.env.local` |
-## Dashboard Configuration
-- [ ] **Create webhook endpoint**
-  - Location: Stripe Dashboard → Developers → Webhooks → Add endpoint
-  - Details: URL: https://[your-domain]/api/webhooks/stripe, Events: checkout.session.completed
-## Local Development
-For local testing:
-\`\`\`bash
-stripe listen --forward-to localhost:3000/api/webhooks/stripe
-\`\`\`
-## Verification
-[Verification commands based on service]
----
-**Once all items complete:** Mark status as "Complete"
-```
-**If user_setup is empty or missing:**
-Skip this step - no USER-SETUP.md needed.
-**Track for offer_next:**
-Set `USER_SETUP_CREATED=true` if file was generated, for use in completion messaging.
+If user_setup exists: create `{phase}-USER-SETUP.md` using template `~/.codex/get-shit-done/templates/user-setup.md`. Per service: env vars table, account setup checklist, dashboard config, local dev notes, verification commands. Status "Incomplete". Set `USER_SETUP_CREATED=true`. If empty/missing: skip.
 </step>
 <step name="create_summary">
-Create `{phase}-{plan}-SUMMARY.md` as specified in the prompt's `<output>` section.
-Use ~/.claude/get-shit-done/templates/summary.md for structure.
-**File location:** `.planning/phases/XX-name/{phase}-{plan}-SUMMARY.md`
-**Frontmatter population:**
-Before writing summary content, populate frontmatter fields from execution context:
+Create `{phase}-{plan}-SUMMARY.md` at `.planning/phases/XX-name/`. Use `~/.codex/get-shit-done/templates/summary.md`.
-1. **Basic identification:**
-   - phase: From PLAN.md frontmatter
-   - plan: From PLAN.md frontmatter
-   - subsystem: Categorize based on phase focus (auth, payments, ui, api, database, infra, testing, etc.)
-   - tags: Extract tech keywords (libraries, frameworks, tools used)
+**Frontmatter:** phase, plan, subsystem, tags | requires/provides/affects | tech-stack.added/patterns | key-files.created/modified | key-decisions | duration ($DURATION), completed ($PLAN_END_TIME date).
-2. **Dependency graph:**
-   - requires: List prior phases this built upon (check PLAN.md context section for referenced prior summaries)
-   - provides: Extract from accomplishments - what was delivered
-   - affects: Infer from phase description/goal what future phases might need this
+Title: `# Phase [X] Plan [Y]: [Name] Summary`
-3. **Tech tracking:**
-   - tech-stack.added: New libraries from package.json changes or requirements
-   - tech-stack.patterns: Architectural patterns established (from decisions/accomplishments)
+One-liner SUBSTANTIVE: "JWT auth with refresh rotation using jose library" not "Authentication implemented"
-4. **File tracking:**
-   - key-files.created: From "Files Created/Modified" section
-   - key-files.modified: From "Files Created/Modified" section
+Include: duration, start/end times, task count, file count.
-5. **Decisions:**
-   - key-decisions: Extract from "Decisions Made" section
-6. **Metrics:**
-   - duration: From $DURATION variable
-   - completed: From $PLAN_END_TIME (date only, format YYYY-MM-DD)
-Note: If subsystem/affects are unclear, use best judgment based on phase name and accomplishments. Can be refined later.
-**Title format:** `# Phase [X] Plan [Y]: [Name] Summary`
-The one-liner must be SUBSTANTIVE:
-- Good: "JWT auth with refresh rotation using jose library"
-- Bad: "Authentication implemented"
-**Include performance data:**
-- Duration: `$DURATION`
-- Started: `$PLAN_START_TIME`
-- Completed: `$PLAN_END_TIME`
-- Tasks completed: (count from execution)
-- Files modified: (count from execution)
-**Next Step section:**
-- If more plans exist in this phase: "Ready for {phase}-{next-plan}-PLAN.md"
-- If this is the last plan: "Phase complete, ready for transition"
-  </step>
+Next: more plans → "Ready for {next-plan}" | last → "Phase complete, ready for transition".
+</step>
 <step name="update_current_position">
-Update Current Position section in STATE.md to reflect plan completion.
-**Format:**
-```markdown
-Phase: [current] of [total] ([phase name])
-Plan: [just completed] of [total in phase]
-Status: [In progress / Phase complete]
-Last activity: [today] - Completed {phase}-{plan}-PLAN.md
-Progress: [progress bar]
-```
-**Calculate progress bar:**
+Update STATE.md using gsd-tools:
-- Count total plans across all phases (from ROADMAP.md or ROADMAP.md)
-- Count completed plans (count SUMMARY.md files that exist)
-- Progress = (completed / total) × 100%
-- Render: ░ for incomplete, █ for complete
-**Example - completing 02-01-PLAN.md (plan 5 of 10 total):**
-Before:
-```markdown
-## Current Position
-Phase: 2 of 4 (Authentication)
-Plan: Not started
-Status: Ready to execute
-Last activity: 2025-01-18 - Phase 1 complete
-Progress: ██████░░░░ 40%
-```
-After:
-```markdown
-## Current Position
+```bash
+# Advance plan counter (handles last-plan edge case)
+node ~/.codex/get-shit-done/bin/gsd-tools.cjs state advance-plan
-Phase: 2 of 4 (Authentication)
-Plan: 1 of 2 in current phase
-Status: In progress
-Last activity: 2025-01-19 - Completed 02-01-PLAN.md
+# Recalculate progress bar from disk state
+node ~/.codex/get-shit-done/bin/gsd-tools.cjs state update-progress
-Progress: ███████░░░ 50%
+# Record execution metrics
+node ~/.codex/get-shit-done/bin/gsd-tools.cjs state record-metric \
+  --phase "${PHASE}" --plan "${PLAN}" --duration "${DURATION}" \
+  --tasks "${TASK_COUNT}" --files "${FILE_COUNT}"
 ```
-**Step complete when:**
-- [ ] Phase number shows current phase (X of total)
-- [ ] Plan number shows plans complete in current phase (N of total-in-phase)
-- [ ] Status reflects current state (In progress / Phase complete)
-- [ ] Last activity shows today's date and the plan just completed
-- [ ] Progress bar calculated correctly from total completed plans
-      </step>
+</step>
 <step name="extract_decisions_and_issues">
-Extract decisions, issues, and concerns from SUMMARY.md into STATE.md accumulated context.
+From SUMMARY: Extract decisions and add to STATE.md:
-**Decisions Made:**
-- Read SUMMARY.md "## Decisions Made" section
-- If content exists (not "None"):
-  - Add each decision to STATE.md Decisions table
-  - Format: `| [phase number] | [decision summary] | [rationale] |`
-**Blockers/Concerns:**
+```bash
+# Add each decision from SUMMARY key-decisions
+node ~/.codex/get-shit-done/bin/gsd-tools.cjs state add-decision \
+  --phase "${PHASE}" --summary "${DECISION_TEXT}" --rationale "${RATIONALE}"
-- Read SUMMARY.md "## Next Phase Readiness" section
-- If contains blockers or concerns:
-  - Add to STATE.md "Blockers/Concerns Carried Forward"
-    </step>
+# Add blockers if any found
+node ~/.codex/get-shit-done/bin/gsd-tools.cjs state add-blocker "Blocker description"
+```
+</step>
 <step name="update_session_continuity">
-Update Session Continuity section in STATE.md to enable resumption in future sessions.
+Update session info using gsd-tools:
-**Format:**
-```markdown
-Last session: [current date and time]
-Stopped at: Completed {phase}-{plan}-PLAN.md
-Resume file: [path to .continue-here if exists, else "None"]
+```bash
+node ~/.codex/get-shit-done/bin/gsd-tools.cjs state record-session \
+  --stopped-at "Completed ${PHASE}-${PLAN}-PLAN.md" \
+  --resume-file "None"
 ```
-**Size constraint note:** Keep STATE.md under 150 lines total.
+Keep STATE.md under 150 lines.
 </step>
 <step name="issues_review_gate">
-Before proceeding, check SUMMARY.md content.
-If "Issues Encountered" is NOT "None":
-<if mode="yolo">
-```
-⚡ Auto-approved: Issues acknowledgment
-⚠️ Note: Issues were encountered during execution:
-- [Issue 1]
-- [Issue 2]
-(Logged - continuing in yolo mode)
-```
-Continue without waiting.
-</if>
-<if mode="interactive" OR="custom with gates.issues_review true">
-Present issues and wait for acknowledgment before proceeding.
-</if>
+If SUMMARY "Issues Encountered" ≠ "None": yolo → log and continue. Interactive → present issues, wait for acknowledgment.
 </step>
 <step name="update_roadmap">
-Update the roadmap file:
 ```bash
-ROADMAP_FILE=".planning/ROADMAP.md"
+node ~/.codex/get-shit-done/bin/gsd-tools.cjs roadmap update-plan-progress "${PHASE}"
 ```
-**If more plans remain in this phase:**
-- Update plan count: "2/3 plans complete"
-- Keep phase status as "In progress"
-**If this was the last plan in the phase:**
-- Mark phase complete: status → "Complete"
-- Add completion date
+Counts PLAN vs SUMMARY files on disk. Updates progress table row with correct count and status (`In Progress` or `Complete` with date).
 </step>
 <step name="git_commit_metadata">
-Commit execution metadata (SUMMARY + STATE + ROADMAP):
-**Note:** All task code has already been committed during execution (one commit per task).
-PLAN.md was already committed during plan-phase. This final commit captures execution results only.
-**1. Stage execution artifacts:**
-```bash
-git add .planning/phases/XX-name/{phase}-{plan}-SUMMARY.md
-git add .planning/STATE.md
-```
-**2. Stage roadmap:**
-```bash
-git add .planning/ROADMAP.md
-```
-**3. Verify staging:**
+Task code already committed per-task. Commit plan metadata:
 ```bash
-git status
-# Should show only execution artifacts (SUMMARY, STATE, ROADMAP), no code files
+node ~/.codex/get-shit-done/bin/gsd-tools.cjs commit "docs({phase}-{plan}): complete [plan-name] plan" --files .planning/phases/XX-name/{phase}-{plan}-SUMMARY.md .planning/STATE.md .planning/ROADMAP.md
 ```
-**4. Commit metadata:**
-```bash
-git commit -m "$(cat <<'EOF'
-docs({phase}-{plan}): complete [plan-name] plan
-Tasks completed: [N]/[N]
-- [Task 1 name]
-- [Task 2 name]
-- [Task 3 name]
-SUMMARY: .planning/phases/XX-name/{phase}-{plan}-SUMMARY.md
-EOF
-)"
-```
-**Example:**
-```bash
-git commit -m "$(cat <<'EOF'
-docs(08-02): complete user registration plan
-Tasks completed: 3/3
-- User registration endpoint
-- Password hashing with bcrypt
-- Email confirmation flow
-SUMMARY: .planning/phases/08-user-auth/08-02-registration-SUMMARY.md
-EOF
-)"
-```
-**Git log after plan execution:**
-```
-abc123f docs(08-02): complete user registration plan
-def456g feat(08-02): add email confirmation flow
-hij789k feat(08-02): implement password hashing with bcrypt
-lmn012o feat(08-02): create user registration endpoint
-```
-Each task has its own commit, followed by one metadata commit documenting plan completion.
-For commit message conventions, see ~/.claude/get-shit-done/references/git-integration.md
 </step>
 <step name="update_codebase_map">
-**If .planning/codebase/ exists:**
-Check what changed across all task commits in this plan:
+If .planning/codebase/ doesn't exist: skip.
 ```bash
-# Find first task commit (right after previous plan's docs commit)
 FIRST_TASK=$(git log --oneline --grep="feat({phase}-{plan}):" --grep="fix({phase}-{plan}):" --grep="test({phase}-{plan}):" --reverse | head -1 | cut -d' ' -f1)
-# Get all changes from first task through now
 git diff --name-only ${FIRST_TASK}^..HEAD 2>/dev/null
 ```
-**Update only if structural changes occurred:**
-| Change Detected | Update Action |
-|-----------------|---------------|
-| New directory in src/ | STRUCTURE.md: Add to directory layout |
-| package.json deps changed | STACK.md: Add/remove from dependencies list |
-| New file pattern (e.g., first .test.ts) | CONVENTIONS.md: Note new pattern |
-| New external API client | INTEGRATIONS.md: Add service entry with file path |
-| Config file added/changed | STACK.md: Update configuration section |
-| File renamed/moved | Update paths in relevant docs |
-**Skip update if only:**
-- Code changes within existing files
-- Bug fixes
-- Content changes (no structural impact)
-**Update format:**
-Make single targeted edits - add a bullet point, update a path, or remove a stale entry. Don't rewrite sections.
+Update only structural changes: new src/ dir → STRUCTURE.md | deps → STACK.md | file pattern → CONVENTIONS.md | API client → INTEGRATIONS.md | config → STACK.md | renamed → update paths. Skip code-only/bugfix/content changes.
 ```bash
-git add .planning/codebase/*.md
-git commit --amend --no-edit  # Include in metadata commit
+node ~/.codex/get-shit-done/bin/gsd-tools.cjs commit "" --files .planning/codebase/*.md --amend
 ```
-**If .planning/codebase/ doesn't exist:**
-Skip this step.
 </step>
 <step name="offer_next">
-**MANDATORY: Verify remaining work before presenting next steps.**
-Do NOT skip this verification. Do NOT assume phase or milestone completion without checking.
-**Step 0: Check for USER-SETUP.md**
-If `USER_SETUP_CREATED=true` (from generate_user_setup step), always include this warning block at the TOP of completion output:
-```
-⚠️ USER SETUP REQUIRED
-This phase introduced external services requiring manual configuration:
-📋 .planning/phases/{phase-dir}/{phase}-USER-SETUP.md
-Quick view:
-- [ ] {ENV_VAR_1}
-- [ ] {ENV_VAR_2}
-- [ ] {Dashboard config task}
-Complete this setup for the integration to function.
-Run `cat .planning/phases/{phase-dir}/{phase}-USER-SETUP.md` for full details.
----
-```
-This warning appears BEFORE "Plan complete" messaging. User sees setup requirements prominently.
-**Step 1: Count plans and summaries in current phase**
-List files in the phase directory:
+If `USER_SETUP_CREATED=true`: display `⚠️ USER SETUP REQUIRED` with path + env/config tasks at TOP.
 ```bash
 ls -1 .planning/phases/[current-phase-dir]/*-PLAN.md 2>/dev/null | wc -l
 ls -1 .planning/phases/[current-phase-dir]/*-SUMMARY.md 2>/dev/null | wc -l
 ```
-State the counts: "This phase has [X] plans and [Y] summaries."
-**Step 2: Route based on plan completion**
-Compare the counts from Step 1:
-| Condition | Meaning | Action |
-|-----------|---------|--------|
-| summaries < plans | More plans remain | Go to **Route A** |
-| summaries = plans | Phase complete | Go to Step 3 |
----
-**Route A: More plans remain in this phase**
-Identify the next unexecuted plan:
-- Find the first PLAN.md file that has no matching SUMMARY.md
-- Read its `<objective>` section
-<if mode="yolo">
-```
-Plan {phase}-{plan} complete.
-Summary: .planning/phases/{phase-dir}/{phase}-{plan}-SUMMARY.md
-{Y} of {X} plans complete for Phase {Z}.
-⚡ Auto-continuing: Execute next plan ({phase}-{next-plan})
-```
-Loop back to identify_plan step automatically.
-</if>
-<if mode="interactive" OR="custom with gates.execute_next_plan true">
-```
-Plan {phase}-{plan} complete.
-Summary: .planning/phases/{phase-dir}/{phase}-{plan}-SUMMARY.md
-{Y} of {X} plans complete for Phase {Z}.
----
-## ▶ Next Up
-**{phase}-{next-plan}: [Plan Name]** — [objective from next PLAN.md]
-`/gsd:execute-phase {phase}`
-<sub>`/clear` first → fresh context window</sub>
----
-**Also available:**
-- `/gsd:verify-work {phase}-{plan}` — manual acceptance testing before continuing
-- Review what was built before continuing
----
-```
-Wait for user to clear and run next command.
-</if>
-**STOP here if Route A applies. Do not continue to Step 3.**
----
-**Step 3: Check milestone status (only when all plans in phase are complete)**
-Read ROADMAP.md and extract:
-1. Current phase number (from the plan just completed)
-2. All phase numbers listed in the current milestone section
-To find phases in the current milestone, look for:
-- Phase headers: lines starting with `### Phase` or `#### Phase`
-- Phase list items: lines like `- [ ] **Phase X:` or `- [x] **Phase X:`
-Count total phases in the current milestone and identify the highest phase number.
-State: "Current phase is {X}. Milestone has {N} phases (highest: {Y})."
-**Step 4: Route based on milestone status**
-| Condition | Meaning | Action |
-|-----------|---------|--------|
-| current phase < highest phase | More phases remain | Go to **Route B** |
-| current phase = highest phase | Milestone complete | Go to **Route C** |
----
-**Route B: Phase complete, more phases remain in milestone**
-Read ROADMAP.md to get the next phase's name and goal.
-```
-Plan {phase}-{plan} complete.
-Summary: .planning/phases/{phase-dir}/{phase}-{plan}-SUMMARY.md
-## ✓ Phase {Z}: {Phase Name} Complete
-All {Y} plans finished.
----
-## ▶ Next Up
-**Phase {Z+1}: {Next Phase Name}** — {Goal from ROADMAP.md}
-`/gsd:plan-phase {Z+1}`
-<sub>`/clear` first → fresh context window</sub>
----
-**Also available:**
-- `/gsd:verify-work {Z}` — manual acceptance testing before continuing
-- `/gsd:discuss-phase {Z+1}` — gather context first
-- Review phase accomplishments before continuing
----
-```
----
-**Route C: Milestone complete (all phases done)**
-```
-🎉 MILESTONE COMPLETE!
-Plan {phase}-{plan} complete.
-Summary: .planning/phases/{phase-dir}/{phase}-{plan}-SUMMARY.md
-## ✓ Phase {Z}: {Phase Name} Complete
-All {Y} plans finished.
-╔═══════════════════════════════════════════════════════╗
-║  All {N} phases complete! Milestone is 100% done.     ║
-╚═══════════════════════════════════════════════════════╝
----
-## ▶ Next Up
-**Complete Milestone** — archive and prepare for next
-`/gsd:complete-milestone`
-<sub>`/clear` first → fresh context window</sub>
----
-**Also available:**
-- `/gsd:verify-work` — manual acceptance testing before completing milestone
-- `/gsd:add-phase <description>` — add another phase before completing
-- Review accomplishments before archiving
----
-```
+| Condition | Route | Action |
+|-----------|-------|--------|
+| summaries < plans | **A: More plans** | Find next PLAN without SUMMARY. Yolo: auto-continue. Interactive: show next plan, suggest `/gsd:execute-phase {phase}` + `/gsd:verify-work`. STOP here. |
+| summaries = plans, current < highest phase | **B: Phase done** | Show completion, suggest `/gsd:plan-phase {Z+1}` + `/gsd:verify-work {Z}` + `/gsd:discuss-phase {Z+1}` |
+| summaries = plans, current = highest phase | **C: Milestone done** | Show banner, suggest `/gsd:complete-milestone` + `/gsd:verify-work` + `/gsd:add-phase` |
+All routes: `/clear` first for fresh context.
 </step>
 </process>
@@ -1828,4 +438,4 @@ All {Y} plans finished.
 - ROADMAP.md updated
 - If codebase map exists: map updated with execution changes (or skipped if no significant changes)
 - If USER-SETUP.md created: prominently surfaced in completion output
-  </success_criteria>
+</success_criteria>