npm - gsd-opencode - Versions diffs - 1.22.1 → 1.33.0 - Mend

gsd-opencode 1.22.1 → 1.33.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (188) hide show

package/agents/gsd-advisor-researcher.md +112 -0
package/agents/gsd-assumptions-analyzer.md +110 -0
package/agents/gsd-codebase-mapper.md +0 -2
package/agents/gsd-debugger.md +117 -2
package/agents/gsd-doc-verifier.md +207 -0
package/agents/gsd-doc-writer.md +608 -0
package/agents/gsd-executor.md +45 -4
package/agents/gsd-integration-checker.md +0 -2
package/agents/gsd-nyquist-auditor.md +0 -2
package/agents/gsd-phase-researcher.md +191 -5
package/agents/gsd-plan-checker.md +152 -5
package/agents/gsd-planner.md +131 -157
package/agents/gsd-project-researcher.md +28 -3
package/agents/gsd-research-synthesizer.md +0 -2
package/agents/gsd-roadmapper.md +29 -2
package/agents/gsd-security-auditor.md +129 -0
package/agents/gsd-ui-auditor.md +485 -0
package/agents/gsd-ui-checker.md +305 -0
package/agents/gsd-ui-researcher.md +368 -0
package/agents/gsd-user-profiler.md +173 -0
package/agents/gsd-verifier.md +207 -22
package/commands/gsd/gsd-add-backlog.md +76 -0
package/commands/gsd/gsd-analyze-dependencies.md +34 -0
package/commands/gsd/gsd-audit-uat.md +24 -0
package/commands/gsd/gsd-autonomous.md +45 -0
package/commands/gsd/gsd-cleanup.md +5 -0
package/commands/gsd/gsd-debug.md +29 -21
package/commands/gsd/gsd-discuss-phase.md +15 -36
package/commands/gsd/gsd-do.md +30 -0
package/commands/gsd/gsd-docs-update.md +48 -0
package/commands/gsd/gsd-execute-phase.md +24 -2
package/commands/gsd/gsd-fast.md +30 -0
package/commands/gsd/gsd-forensics.md +56 -0
package/commands/gsd/gsd-help.md +2 -0
package/commands/gsd/gsd-join-discord.md +2 -1
package/commands/gsd/gsd-list-workspaces.md +19 -0
package/commands/gsd/gsd-manager.md +40 -0
package/commands/gsd/gsd-milestone-summary.md +51 -0
package/commands/gsd/gsd-new-project.md +4 -0
package/commands/gsd/gsd-new-workspace.md +44 -0
package/commands/gsd/gsd-next.md +24 -0
package/commands/gsd/gsd-note.md +34 -0
package/commands/gsd/gsd-plan-phase.md +8 -1
package/commands/gsd/gsd-plant-seed.md +28 -0
package/commands/gsd/gsd-pr-branch.md +25 -0
package/commands/gsd/gsd-profile-user.md +46 -0
package/commands/gsd/gsd-quick.md +7 -3
package/commands/gsd/gsd-reapply-patches.md +178 -45
package/commands/gsd/gsd-remove-workspace.md +26 -0
package/commands/gsd/gsd-research-phase.md +7 -12
package/commands/gsd/gsd-review-backlog.md +62 -0
package/commands/gsd/gsd-review.md +38 -0
package/commands/gsd/gsd-secure-phase.md +35 -0
package/commands/gsd/gsd-session-report.md +19 -0
package/commands/gsd/gsd-set-profile.md +24 -23
package/commands/gsd/gsd-ship.md +23 -0
package/commands/gsd/gsd-stats.md +18 -0
package/commands/gsd/gsd-thread.md +127 -0
package/commands/gsd/gsd-ui-phase.md +34 -0
package/commands/gsd/gsd-ui-review.md +32 -0
package/commands/gsd/gsd-workstreams.md +71 -0
package/get-shit-done/bin/gsd-tools.cjs +450 -90
package/get-shit-done/bin/lib/commands.cjs +489 -24
package/get-shit-done/bin/lib/config.cjs +329 -48
package/get-shit-done/bin/lib/core.cjs +1143 -102
package/get-shit-done/bin/lib/docs.cjs +267 -0
package/get-shit-done/bin/lib/frontmatter.cjs +125 -43
package/get-shit-done/bin/lib/init.cjs +918 -106
package/get-shit-done/bin/lib/milestone.cjs +65 -33
package/get-shit-done/bin/lib/model-profiles.cjs +70 -0
package/get-shit-done/bin/lib/phase.cjs +434 -404
package/get-shit-done/bin/lib/profile-output.cjs +1048 -0
package/get-shit-done/bin/lib/profile-pipeline.cjs +539 -0
package/get-shit-done/bin/lib/roadmap.cjs +156 -101
package/get-shit-done/bin/lib/schema-detect.cjs +238 -0
package/get-shit-done/bin/lib/security.cjs +384 -0
package/get-shit-done/bin/lib/state.cjs +711 -79
package/get-shit-done/bin/lib/template.cjs +2 -2
package/get-shit-done/bin/lib/uat.cjs +282 -0
package/get-shit-done/bin/lib/verify.cjs +254 -42
package/get-shit-done/bin/lib/workstream.cjs +495 -0
package/get-shit-done/references/agent-contracts.md +79 -0
package/get-shit-done/references/artifact-types.md +113 -0
package/get-shit-done/references/checkpoints.md +12 -10
package/get-shit-done/references/context-budget.md +49 -0
package/get-shit-done/references/continuation-format.md +15 -15
package/get-shit-done/references/decimal-phase-calculation.md +2 -3
package/get-shit-done/references/domain-probes.md +125 -0
package/get-shit-done/references/gate-prompts.md +100 -0
package/get-shit-done/references/git-integration.md +47 -0
package/get-shit-done/references/model-profile-resolution.md +2 -0
package/get-shit-done/references/model-profiles.md +62 -16
package/get-shit-done/references/phase-argument-parsing.md +2 -2
package/get-shit-done/references/planner-gap-closure.md +62 -0
package/get-shit-done/references/planner-reviews.md +39 -0
package/get-shit-done/references/planner-revision.md +87 -0
package/get-shit-done/references/planning-config.md +18 -1
package/get-shit-done/references/revision-loop.md +97 -0
package/get-shit-done/references/ui-brand.md +2 -2
package/get-shit-done/references/universal-anti-patterns.md +58 -0
package/get-shit-done/references/user-profiling.md +681 -0
package/get-shit-done/references/workstream-flag.md +111 -0
package/get-shit-done/templates/SECURITY.md +61 -0
package/get-shit-done/templates/UAT.md +21 -3
package/get-shit-done/templates/UI-SPEC.md +100 -0
package/get-shit-done/templates/VALIDATION.md +3 -3
package/get-shit-done/templates/claude-md.md +145 -0
package/get-shit-done/templates/config.json +14 -3
package/get-shit-done/templates/context.md +61 -6
package/get-shit-done/templates/debug-subagent-prompt.md +2 -6
package/get-shit-done/templates/dev-preferences.md +21 -0
package/get-shit-done/templates/discussion-log.md +63 -0
package/get-shit-done/templates/phase-prompt.md +46 -5
package/get-shit-done/templates/planner-subagent-prompt.md +2 -10
package/get-shit-done/templates/project.md +2 -0
package/get-shit-done/templates/state.md +2 -2
package/get-shit-done/templates/user-profile.md +146 -0
package/get-shit-done/workflows/add-phase.md +4 -4
package/get-shit-done/workflows/add-tests.md +4 -4
package/get-shit-done/workflows/add-todo.md +4 -4
package/get-shit-done/workflows/analyze-dependencies.md +96 -0
package/get-shit-done/workflows/audit-milestone.md +20 -16
package/get-shit-done/workflows/audit-uat.md +109 -0
package/get-shit-done/workflows/autonomous.md +1036 -0
package/get-shit-done/workflows/check-todos.md +4 -4
package/get-shit-done/workflows/cleanup.md +4 -4
package/get-shit-done/workflows/complete-milestone.md +22 -10
package/get-shit-done/workflows/diagnose-issues.md +21 -7
package/get-shit-done/workflows/discovery-phase.md +2 -2
package/get-shit-done/workflows/discuss-phase-assumptions.md +671 -0
package/get-shit-done/workflows/discuss-phase-power.md +291 -0
package/get-shit-done/workflows/discuss-phase.md +558 -47
package/get-shit-done/workflows/do.md +104 -0
package/get-shit-done/workflows/docs-update.md +1093 -0
package/get-shit-done/workflows/execute-phase.md +741 -58
package/get-shit-done/workflows/execute-plan.md +77 -12
package/get-shit-done/workflows/fast.md +105 -0
package/get-shit-done/workflows/forensics.md +265 -0
package/get-shit-done/workflows/health.md +28 -6
package/get-shit-done/workflows/help.md +127 -7
package/get-shit-done/workflows/insert-phase.md +4 -4
package/get-shit-done/workflows/list-phase-assumptions.md +2 -2
package/get-shit-done/workflows/list-workspaces.md +56 -0
package/get-shit-done/workflows/manager.md +363 -0
package/get-shit-done/workflows/map-codebase.md +83 -44
package/get-shit-done/workflows/milestone-summary.md +223 -0
package/get-shit-done/workflows/new-milestone.md +133 -25
package/get-shit-done/workflows/new-project.md +216 -54
package/get-shit-done/workflows/new-workspace.md +237 -0
package/get-shit-done/workflows/next.md +97 -0
package/get-shit-done/workflows/node-repair.md +92 -0
package/get-shit-done/workflows/note.md +156 -0
package/get-shit-done/workflows/pause-work.md +132 -15
package/get-shit-done/workflows/plan-milestone-gaps.md +6 -7
package/get-shit-done/workflows/plan-phase.md +513 -62
package/get-shit-done/workflows/plant-seed.md +169 -0
package/get-shit-done/workflows/pr-branch.md +129 -0
package/get-shit-done/workflows/profile-user.md +450 -0
package/get-shit-done/workflows/progress.md +154 -29
package/get-shit-done/workflows/quick.md +285 -111
package/get-shit-done/workflows/remove-phase.md +2 -2
package/get-shit-done/workflows/remove-workspace.md +90 -0
package/get-shit-done/workflows/research-phase.md +13 -9
package/get-shit-done/workflows/resume-project.md +37 -18
package/get-shit-done/workflows/review.md +281 -0
package/get-shit-done/workflows/secure-phase.md +154 -0
package/get-shit-done/workflows/session-report.md +146 -0
package/get-shit-done/workflows/set-profile.md +2 -2
package/get-shit-done/workflows/settings.md +91 -11
package/get-shit-done/workflows/ship.md +237 -0
package/get-shit-done/workflows/stats.md +60 -0
package/get-shit-done/workflows/transition.md +150 -23
package/get-shit-done/workflows/ui-phase.md +292 -0
package/get-shit-done/workflows/ui-review.md +183 -0
package/get-shit-done/workflows/update.md +262 -30
package/get-shit-done/workflows/validate-phase.md +14 -17
package/get-shit-done/workflows/verify-phase.md +143 -11
package/get-shit-done/workflows/verify-work.md +141 -39
package/package.json +1 -1
package/skills/gsd-audit-milestone/SKILL.md +29 -0
package/skills/gsd-cleanup/SKILL.md +19 -0
package/skills/gsd-complete-milestone/SKILL.md +131 -0
package/skills/gsd-discuss-phase/SKILL.md +54 -0
package/skills/gsd-execute-phase/SKILL.md +49 -0
package/skills/gsd-plan-phase/SKILL.md +37 -0
package/skills/gsd-ui-phase/SKILL.md +24 -0
package/skills/gsd-ui-review/SKILL.md +24 -0
package/skills/gsd-verify-work/SKILL.md +30 -0

package/get-shit-done/workflows/execute-phase.md CHANGED Viewed

@@ -1,26 +1,96 @@
-<purpose>
+<objective>
 Execute all plans in a phase using wave-based parallel execution. Orchestrator stays lean — delegates plan execution to subagents.
-</purpose>
+</objective>
 <core_principle>
 Orchestrator coordinates, not executes. Each subagent loads the full execute-plan context. Orchestrator: discover plans → analyze deps → group waves → spawn agents → handle checkpoints → collect results.
 </core_principle>
+<runtime_compatibility>
+**Subagent spawning is runtime-specific:**
+- **OpenCode:** Uses `task(subagent_type="gsd-executor", ...)` — blocks until complete, returns result
+- **Copilot:** Subagent spawning does not reliably return completion signals. **Default to
+  sequential inline execution**: read and follow execute-plan.md directly for each plan
+  instead of spawning parallel agents. Only attempt parallel spawning if the user
+  explicitly requests it — and in that case, rely on the spot-check fallback in step 3
+  to detect completion.
+- **Other runtimes:** If `task`/`task` tool is unavailable, use sequential inline execution as the
+  fallback. Check for tool availability at runtime rather than assuming based on runtime name.
+**Fallback rule:** If a spawned agent completes its work (commits visible, SUMMARY.md exists) but
+the orchestrator never receives the completion signal, treat it as successful based on spot-checks
+and continue to the next wave/plan. Never block indefinitely waiting for a signal — always verify
+via filesystem and git state.
+</runtime_compatibility>
 <required_reading>
 read STATE.md before any operation to load project context.
+@$HOME/.config/opencode/get-shit-done/references/agent-contracts.md
+@$HOME/.config/opencode/get-shit-done/references/context-budget.md
 </required_reading>
+<available_agent_types>
+These are the valid GSD subagent types registered in .OpenCode/agents/ (or equivalent for your runtime).
+Always use the exact name from this list — do not fall back to 'general' or other built-in types:
+- gsd-executor — Executes plan tasks, commits, creates SUMMARY.md
+- gsd-verifier — Verifies phase completion, checks quality gates
+- gsd-planner — Creates detailed plans from phase scope
+- gsd-phase-researcher — Researches technical approaches for a phase
+- gsd-plan-checker — Reviews plan quality before execution
+- gsd-debugger — Diagnoses and fixes issues
+- gsd-codebase-mapper — Maps project structure and dependencies
+- gsd-integration-checker — Checks cross-phase integration
+- gsd-nyquist-auditor — Validates verification coverage
+- gsd-ui-researcher — Researches UI/UX approaches
+- gsd-ui-checker — Reviews UI implementation quality
+- gsd-ui-auditor — Audits UI against design requirements
+</available_agent_types>
 <process>
+<step name="parse_args" priority="first">
+Parse `$ARGUMENTS` before loading any context:
+- First positional token → `PHASE_ARG`
+- Optional `--wave N` → `WAVE_FILTER`
+- Optional `--gaps-only` keeps its current meaning
+If `--wave` is absent, preserve the current behavior of executing all incomplete waves in the phase.
+</step>
 <step name="initialize" priority="first">
 Load all context in one call:
 ```bash
 INIT=$(node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" init execute-phase "${PHASE_ARG}")
 if [[ "$INIT" == @file:* ]]; then INIT=$(cat "${INIT#@file:}"); fi
+AGENT_SKILLS=$(node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" agent-skills gsd-executor 2>/dev/null)
 ```
-Parse JSON for: `executor_model`, `verifier_model`, `commit_docs`, `parallelization`, `branching_strategy`, `branch_name`, `phase_found`, `phase_dir`, `phase_number`, `phase_name`, `phase_slug`, `plans`, `incomplete_plans`, `plan_count`, `incomplete_count`, `state_exists`, `roadmap_exists`, `phase_req_ids`.
+Parse JSON for: `executor_model`, `verifier_model`, `commit_docs`, `parallelization`, `branching_strategy`, `branch_name`, `phase_found`, `phase_dir`, `phase_number`, `phase_name`, `phase_slug`, `plans`, `incomplete_plans`, `plan_count`, `incomplete_count`, `state_exists`, `roadmap_exists`, `phase_req_ids`, `response_language`.
+**If `response_language` is set:** Include `response_language: {value}` in all spawned subagent prompts so any user-facing output stays in the configured language.
+read worktree config:
+```bash
+USE_WORKTREES=$(node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" config-get workflow.use_worktrees 2>/dev/null || echo "true")
+```
+When `USE_WORKTREES` is `false`, all executor agents run without `isolation="worktree"` — they execute sequentially on the main working tree instead of in parallel worktrees.
+read context window size for adaptive prompt enrichment:
+```bash
+CONTEXT_WINDOW=$(node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" config-get context_window 2>/dev/null || echo "200000")
+```
+When `CONTEXT_WINDOW >= 500000` (1M-class models), subagent prompts include richer context:
+- Executor agents receive prior wave SUMMARY.md files and the phase CONTEXT.md/RESEARCH.md
+- Verifier agents receive all PLAN.md, SUMMARY.md, CONTEXT.md files plus REQUIREMENTS.md
+- This enables cross-phase awareness and history-aware verification
 **If `phase_found` is false:** Error — phase directory not found.
 **If `plan_count` is 0:** Error — no plans found in phase.
@@ -28,14 +98,95 @@ Parse JSON for: `executor_model`, `verifier_model`, `commit_docs`, `parallelizat
 When `parallelization` is false, plans within a wave execute sequentially.
-**Sync chain flag with intent** — if user invoked manually (no `--auto`), clear the ephemeral chain flag from any previous interrupted `--auto` chain. This does NOT touch `workflow.auto_advance` (the user's persistent settings preference). Must happen before any config reads (checkpoint handling also reads auto-advance flags):
+**Runtime detection for Copilot:**
+Check if the current runtime is Copilot by testing for the `@gsd-executor` agent pattern
+or absence of the `task()` subagent API. If running under Copilot, force sequential inline
+execution regardless of the `parallelization` setting — Copilot's subagent completion
+signals are unreliable (see `<runtime_compatibility>`). Set `COPILOT_SEQUENTIAL=true`
+internally and skip the `execute_waves` step in favor of `check_interactive_mode`'s
+inline path for each plan.
+**REQUIRED — Sync chain flag with intent.** If user invoked manually (no `--auto`), clear the ephemeral chain flag from any previous interrupted `--auto` chain. This prevents stale `_auto_chain_active: true` from causing unwanted auto-advance. This does NOT touch `workflow.auto_advance` (the user's persistent settings preference). You MUST execute this bash block before any config reads:
 ```bash
+# REQUIRED: prevents stale auto-chain from previous --auto runs
 if [[ ! "$ARGUMENTS" =~ --auto ]]; then
   node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" config-set workflow._auto_chain_active false 2>/dev/null
 fi
 ```
 </step>
+<step name="check_blocking_antipatterns" priority="first">
+**MANDATORY — Check for blocking anti-patterns before any other work.**
+Look for a `.continue-here.md` in the current phase directory:
+```bash
+ls ${phase_dir}/.continue-here.md 2>/dev/null || true
+```
+If `.continue-here.md` exists, parse its "Critical Anti-Patterns" table for rows with `severity` = `blocking`.
+**If one or more `blocking` anti-patterns are found:**
+This step cannot be skipped. Before proceeding to `check_interactive_mode` or any other step, the agent must demonstrate understanding of each blocking anti-pattern by answering all three questions for each one:
+1. **What is this anti-pattern?** — Describe it in your own words, not by quoting the handoff.
+2. **How did it manifest?** — Explain the specific failure that caused it to be recorded.
+3. **What structural mechanism (not acknowledgment) prevents it?** — Name the concrete step, checklist item, or enforcement mechanism that stops recurrence.
+write these answers inline before continuing. If a blocking anti-pattern cannot be answered from the context in `.continue-here.md`, stop and ask the user for clarification.
+**If no `.continue-here.md` exists, or no `blocking` rows are found:** Proceed directly to `check_interactive_mode`.
+</step>
+<step name="check_interactive_mode">
+**Parse `--interactive` flag from $ARGUMENTS.**
+**If `--interactive` flag present:** Switch to interactive execution mode.
+Interactive mode executes plans sequentially **inline** (no subagent spawning) with user
+checkpoints between tasks. The user can review, modify, or redirect work at any point.
+**Interactive execution flow:**
+1. Load plan inventory as normal (discover_and_group_plans)
+2. For each plan (sequentially, ignoring wave grouping):
+   a. **Present the plan to the user:**
+      ```
+      ## Plan {plan_id}: {plan_name}
+      Objective: {from plan file}
+      Tasks: {task_count}
+      Options:
+      - Execute (proceed with all tasks)
+      - Review first (show task breakdown before starting)
+      - Skip (move to next plan)
+      - Stop (end execution, save progress)
+      ```
+   b. **If "Review first":** read and display the full plan file. Ask again: Execute, Modify, Skip.
+   c. **If "Execute":** read and follow `$HOME/.config/opencode/get-shit-done/workflows/execute-plan.md` **inline**
+      (do NOT spawn a subagent). Execute tasks one at a time.
+   d. **After each task:** Pause briefly. If the user intervenes (types anything), stop and address
+      their feedback before continuing. Otherwise proceed to next task.
+   e. **After plan complete:** Show results, commit, create SUMMARY.md, then present next plan.
+3. After all plans: proceed to verification (same as normal mode).
+**Benefits of interactive mode:**
+- No subagent overhead — dramatically lower token usage
+- User catches mistakes early — saves costly verification cycles
+- Maintains GSD's planning/tracking structure
+- Best for: small phases, bug fixes, verification gaps, learning GSD
+**Skip to handle_branching step** (interactive plans execute inline after grouping).
+</step>
 <step name="handle_branching">
 Check `branching_strategy` from init:
@@ -53,6 +204,12 @@ All subsequent commits go to this branch. User handles merging.
 From init JSON: `phase_dir`, `plan_count`, `incomplete_count`.
 Report: "Found {plan_count} plans in {phase_dir} ({incomplete_count} incomplete)"
+**Update STATE.md for phase start:**
+```bash
+node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" state begin-phase --phase "${PHASE_NUMBER}" --name "${PHASE_NAME}" --plans "${PLAN_COUNT}"
+```
+This updates Status, Last Activity, Current focus, Current Position, and plan counts in STATE.md so frontmatter and body text reflect the active phase immediately.
 </step>
 <step name="discover_and_group_plans">
@@ -64,13 +221,19 @@ PLAN_INDEX=$(node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" phase
 Parse JSON for: `phase`, `plans[]` (each with `id`, `wave`, `autonomous`, `objective`, `files_modified`, `task_count`, `has_summary`), `waves` (map of wave number → plan IDs), `incomplete`, `has_checkpoints`.
-**Filtering:** Skip plans where `has_summary: true`. If `--gaps-only`: also skip non-gap_closure plans. If all filtered: "No matching incomplete plans" → exit.
+**Filtering:** Skip plans where `has_summary: true`. If `--gaps-only`: also skip non-gap_closure plans. If `WAVE_FILTER` is set: also skip plans whose `wave` does not equal `WAVE_FILTER`.
+**Wave safety check:** If `WAVE_FILTER` is set and there are still incomplete plans in any lower wave that match the current execution mode, STOP and tell the user to finish earlier waves first. Do not let Wave 2+ execute while prerequisite earlier-wave plans remain incomplete.
+If all filtered: "No matching incomplete plans" → exit.
 Report:
 ```
 ## Execution Plan
-**Phase {X}: {Name}** — {total_plans} plans across {wave_count} waves
+**Phase {X}: {Name}** — {total_plans} matching plans across {wave_count} wave(s)
+{If WAVE_FILTER is set: `Wave filter active: executing only Wave {WAVE_FILTER}`.}
 | Wave | Plans | What it builds |
 |------|-------|----------------|
@@ -80,11 +243,45 @@ Report:
 </step>
 <step name="execute_waves">
-Execute each wave in sequence. Within a wave: parallel if `PARALLELIZATION=true`, sequential if `false`.
+Execute each selected wave in sequence. Within a wave: parallel if `PARALLELIZATION=true`, sequential if `false`.
 **For each wave:**
-1. **Describe what's being built (BEFORE spawning):**
+1. **Intra-wave files_modified overlap check (BEFORE spawning):**
+   Before spawning any agents for this wave, inspect the `files_modified` list of all plans
+   in the wave. Check every pair of plans in the wave — if any two plans share even one file
+   in their `files_modified` lists, those plans have an implicit dependency and MUST NOT run
+   in parallel.
+   **Detection algorithm (pseudocode):**
+   ```
+   seen_files = {}
+   overlapping_plans = []
+   for each plan in wave_plans:
+     for each file in plan.files_modified:
+       if file in seen_files:
+         overlapping_plans.add(plan, seen_files[file])  # both plans overlap on this file
+       else:
+         seen_files[file] = plan
+   ```
+   **If overlap is detected:**
+   - Warn the user:
+     ```
+     ⚠ Intra-wave files_modified overlap detected in Wave {N}:
+       Plan {A} and Plan {B} both modify {file}
+       Running these plans sequentially to avoid parallel worktree conflicts.
+     ```
+   - Override `PARALLELIZATION` to `false` for this wave only — run all plans in the wave
+     sequentially regardless of the global parallelization setting.
+   - This is a safety net for plans that were incorrectly assigned to the same wave.
+     The planner should have caught this; flag it as a planning defect so the user can
+     replan the phase if desired.
+   **If no overlap:** proceed normally (parallel if `PARALLELIZATION=true`).
+2. **Describe what's being built (BEFORE spawning):**
    read each plan's `<objective>`. Extract what's being built and why.
@@ -102,37 +299,127 @@ Execute each wave in sequence. Within a wave: parallel if `PARALLELIZATION=true`
    - Bad: "Executing terrain generation plan"
    - Good: "Procedural terrain generator using Perlin noise — creates height maps, biome zones, and collision meshes. Required before vehicle physics can interact with ground."
-2. **Spawn executor agents:**
+3. **Spawn executor agents:**
+   Pass paths only — executors read files themselves with their fresh context window.
+   For 200k models, this keeps orchestrator context lean (~10-15%).
+   For 1M+ models (Opus 4.6, Sonnet 4.6), richer context can be passed directly.
+   **Worktree mode** (`USE_WORKTREES` is not `false`):
-   Pass paths only — executors read files themselves with their fresh 200k context.
-   This keeps orchestrator context lean (~10-15%).
+   Before spawning, capture the current HEAD:
+   ```bash
+   EXPECTED_BASE=$(git rev-parse HEAD)
+   ```
+   **Sequential dispatch for parallel execution (waves with 2+ agents):**
+   When spawning multiple agents in a wave, dispatch each `task()` call **one at a time
+   with `run_in_background: true`** — do NOT send all task calls in a single message.
+   `git worktree add` acquires an exclusive lock on `.git/config.lock`, so simultaneous
+   calls race for this lock and fail. Sequential dispatch ensures each worktree finishes
+   creation before the next begins (the round-trip latency of each tool call provides
+   natural spacing), while all agents still **run in parallel** once created.
+   ```
+   # CORRECT: dispatch one task() per message, each with run_in_background: true
+   # → worktrees created sequentially, agents execute in parallel
+   #
+   # WRONG: multiple task() calls in a single message
+   # → simultaneous git worktree add → .git/config.lock contention → failures
    ```
-   task(
-     subagent_type="gsd-executor",
-     model="{executor_model}",
-     prompt="
-       <objective>
-       Execute plan {plan_number} of phase {phase_number}-{phase_name}.
-       Commit each task atomically. Create SUMMARY.md. Update STATE.md and ROADMAP.md.
-       </objective>
-       <execution_context>
-       @$HOME/.config/opencode/get-shit-done/workflows/execute-plan.md
-       @$HOME/.config/opencode/get-shit-done/templates/summary.md
-       @$HOME/.config/opencode/get-shit-done/references/checkpoints.md
-       @$HOME/.config/opencode/get-shit-done/references/tdd.md
-       </execution_context>
-       <files_to_read>
-       read these files at execution start using the read tool:
-       - {phase_dir}/{plan_file} (Plan)
-       - .planning/STATE.md (State)
-       - .planning/config.json (Config, if exists)
-       - ./AGENTS.md (Project instructions, if exists — follow project-specific guidelines and coding conventions)
-       - .OpenCode/skills/ or .agents/skills/ (Project skills, if either exists — list skills, read SKILL.md for each, follow relevant rules during implementation)
-       </files_to_read>
+    ```
+    @gsd-executor "
+        <objective>
+        Execute plan {plan_number} of phase {phase_number}-{phase_name}.
+        Commit each task atomically. Create SUMMARY.md.
+        Do NOT update STATE.md or ROADMAP.md — the orchestrator owns those writes after all worktree agents in the wave complete.
+        </objective>
+        <worktree_branch_check>
+        FIRST ACTION before any other work: verify this worktree's branch is based on the correct commit.
+        Run:
+        ```bash
+        ACTUAL_BASE=$(git merge-base HEAD {EXPECTED_BASE})
+        CURRENT_HEAD=$(git rev-parse HEAD)
+        ```
+        If `ACTUAL_BASE` != `{EXPECTED_BASE}` (i.e. the worktree branch was created from an older
+        base such as `main` instead of the feature branch HEAD), rebase onto the correct base:
+        ```bash
+        git rebase --onto {EXPECTED_BASE} $(git rev-parse --abbrev-ref HEAD~1 2>/dev/null || git rev-parse HEAD^) HEAD 2>/dev/null || true
+        # If rebase fails or is a no-op, reset the branch to start from the correct base:
+        git reset --soft {EXPECTED_BASE}
+        ```
+        If `ACTUAL_BASE` == `{EXPECTED_BASE}`: the branch base is correct, proceed immediately.
+        This check fixes a known issue on Windows where `EnterWorktree` creates branches from
+        `main` instead of the current feature branch HEAD.
+        </worktree_branch_check>
+        <parallel_execution>
+        You are running as a PARALLEL executor agent. Use --no-verify on all git
+        commits to avoid pre-commit hook contention with other agents. The
+        orchestrator validates hooks once after all agents complete.
+        For gsd-tools commits: add --no-verify flag.
+        For direct git commits: use git commit --no-verify -m "..."
+        </parallel_execution>
+        <execution_context>
+        @$HOME/.config/opencode/get-shit-done/workflows/execute-plan.md
+        @$HOME/.config/opencode/get-shit-done/templates/summary.md
+        @$HOME/.config/opencode/get-shit-done/references/checkpoints.md
+        @$HOME/.config/opencode/get-shit-done/references/tdd.md
+        </execution_context>
+        <files_to_read>
+        read these files at execution start using the read tool:
+        - {phase_dir}/{plan_file} (Plan)
+        - .planning/PROJECT.md (Project context — core value, requirements, evolution rules)
+        - .planning/STATE.md (State)
+        - .planning/config.json (Config, if exists)
+        ${CONTEXT_WINDOW >= 500000 ? `
+        - ${phase_dir}/*-CONTEXT.md (User decisions from discuss-phase — honors locked choices)
+        - ${phase_dir}/*-RESEARCH.md (Technical research — pitfalls and patterns to follow)
+        - ${prior_wave_summaries} (SUMMARY.md files from earlier waves in this phase — what was already built)
+        ` : ''}
+        - ./AGENTS.md (Project instructions, if exists — follow project-specific guidelines and coding conventions)
+        - .OpenCode/skills/ or .agents/skills/ (Project skills, if either exists — list skills, read SKILL.md for each, follow relevant rules during implementation)
+        </files_to_read>
+        ${AGENT_SKILLS}
+        <mcp_tools>
+        If AGENTS.md or project instructions reference MCP tools (e.g. jCodeMunch, context7,
+        or other MCP servers), prefer those tools over grep/glob for code navigation when available.
+        MCP tools often save significant tokens by providing structured code indexes.
+        Check tool availability first — if MCP tools are not accessible, fall back to grep/glob.
+        </mcp_tools>
+        <success_criteria>
+        - [ ] All tasks executed
+        - [ ] Each task committed individually
+        - [ ] SUMMARY.md created in plan directory
+        </success_criteria>
+      "
+    ```
+   **Sequential mode** (`USE_WORKTREES` is `false`):
+   Omit `isolation="worktree"` from the task call. Replace the `<parallel_execution>` block with:
+   ```
+       <sequential_execution>
+       You are running as a SEQUENTIAL executor agent on the main working tree.
+       Use normal git commits (with hooks). Do NOT use --no-verify.
+       </sequential_execution>
+   ```
+   The sequential mode task prompt uses the same structure as worktree mode but with these differences in success_criteria — since there is only one agent writing at a time, there are no shared-file conflicts:
+   ```
        <success_criteria>
        - [ ] All tasks executed
        - [ ] Each task committed individually
@@ -140,13 +427,94 @@ Execute each wave in sequence. Within a wave: parallel if `PARALLELIZATION=true`
        - [ ] STATE.md updated with position and decisions
        - [ ] ROADMAP.md updated with plan progress (via `roadmap update-plan-progress`)
        </success_criteria>
-     "
-   )
    ```
-3. **Wait for all agents in wave to complete.**
+   When worktrees are disabled, execute plans **one at a time within each wave** (sequential) regardless of the `PARALLELIZATION` setting — multiple agents writing to the same working tree concurrently would cause conflicts.
+4. **Wait for all agents in wave to complete.**
+   **Completion signal fallback (Copilot and runtimes where task() may not return):**
+   If a spawned agent does not return a completion signal but appears to have finished
+   its work, do NOT block indefinitely. Instead, verify completion via spot-checks:
+   ```bash
+   # For each plan in this wave, check if the executor finished:
+   SUMMARY_EXISTS=$(test -f "{phase_dir}/{plan_number}-{plan_padded}-SUMMARY.md" && echo "true" || echo "false")
+   COMMITS_FOUND=$(git log --oneline --all --grep="{phase_number}-{plan_padded}" --since="1 hour ago" | head -1)
+   ```
+   **If SUMMARY.md exists AND commits are found:** The agent completed successfully —
+   treat as done and proceed to step 5. Log: `"✓ {Plan ID} completed (verified via spot-check — completion signal not received)"`
+   **If SUMMARY.md does NOT exist after a reasonable wait:** The agent may still be
+   running or may have failed silently. Check `git log --oneline -5` for recent
+   activity. If commits are still appearing, wait longer. If no activity, report
+   the plan as failed and route to the failure handler in step 6.
+   **This fallback applies automatically to all runtimes.** OpenCode's task() normally
+   returns synchronously, but the fallback ensures resilience if it doesn't.
+5. **Post-wave hook validation (parallel mode only):**
+   When agents committed with `--no-verify`, run pre-commit hooks once after the wave:
+   ```bash
+   # Run project's pre-commit hooks on the current state
+   git diff --cached --quiet || git stash  # stash any unstaged changes
+   git hook run pre-commit 2>&1 || echo "⚠ Pre-commit hooks failed — review before continuing"
+   ```
+   If hooks fail: report the failure and ask "Fix hook issues now?" or "Continue to next wave?"
+5.5. **Worktree cleanup (when `isolation="worktree"` was used):**
+   When executor agents ran in worktree isolation, their commits land on temporary branches in separate working trees. After the wave completes, merge these changes back and clean up:
+   ```bash
+   # List worktrees created by this wave's agents
+   WORKTREES=$(git worktree list --porcelain | grep "^worktree " | grep -v "$(pwd)$" | sed 's/^worktree //')
+   for WT in $WORKTREES; do
+     # Get the branch name for this worktree
+     WT_BRANCH=$(git -C "$WT" rev-parse --abbrev-ref HEAD 2>/dev/null)
+     if [ -n "$WT_BRANCH" ] && [ "$WT_BRANCH" != "HEAD" ]; then
+       CURRENT_BRANCH=$(git rev-parse --abbrev-ref HEAD)
+       # Merge the worktree branch into the current branch
+       git merge "$WT_BRANCH" --no-edit -m "chore: merge executor worktree ($WT_BRANCH)" 2>&1 || {
+         echo "⚠ Merge conflict from worktree $WT_BRANCH — resolve manually"
+         continue
+       }
+       # Remove the worktree
+       git worktree remove "$WT" --force 2>/dev/null || true
+       # Delete the temporary branch
+       git branch -D "$WT_BRANCH" 2>/dev/null || true
+     fi
+   done
+   ```
+   **If `workflow.use_worktrees` is `false`:** Agents ran on the main working tree — skip this step entirely.
-4. **Report completion — spot-check claims first:**
+   **If no worktrees found:** Skip silently — agents may have been spawned without worktree isolation.
+5.6. **Post-wave shared artifact update (worktree mode only):**
+   When executor agents ran with `isolation="worktree"`, they skipped STATE.md and ROADMAP.md updates to avoid last-merge-wins overwrites. The orchestrator is the single writer for these files. After worktrees are merged back, update shared artifacts once:
+   ```bash
+   # Update ROADMAP.md for each completed plan in this wave
+   for PLAN_ID in ${WAVE_PLAN_IDS}; do
+     node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" roadmap update-plan-progress "${PHASE_NUMBER}" "${PLAN_ID}" completed
+   done
+   ```
+   Where `WAVE_PLAN_IDS` is the space-separated list of plan IDs that completed in this wave.
+   **If `workflow.use_worktrees` is `false`:** Sequential agents already updated STATE.md and ROADMAP.md themselves — skip this step.
+6. **Report completion — spot-check claims first:**
    For each SUMMARY.md:
    - Verify first 2 files from `key-files.created` exist on disk
@@ -171,15 +539,36 @@ Execute each wave in sequence. Within a wave: parallel if `PARALLELIZATION=true`
    - Bad: "Wave 2 complete. Proceeding to Wave 3."
    - Good: "Terrain system complete — 3 biome types, height-based texturing, physics collision meshes. Vehicle physics (Wave 3) can now reference ground surfaces."
-5. **Handle failures:**
+7. **Handle failures:**
-   **Known OpenCode bug (classifyHandoffIfNeeded):** If an agent reports "failed" with error containing `classifyHandoffIfNeeded is not defined`, this is a OpenCode runtime bug — not a GSD or agent issue. The error fires in the completion handler AFTER all tool calls finish. In this case: run the same spot-checks as step 4 (SUMMARY.md exists, git commits present, no Self-Check: FAILED). If spot-checks PASS → treat as **successful**. If spot-checks FAIL → treat as real failure below.
+   **Known OpenCode bug (classifyHandoffIfNeeded):** If an agent reports "failed" with error containing `classifyHandoffIfNeeded is not defined`, this is a OpenCode runtime bug — not a GSD or agent issue. The error fires in the completion handler AFTER all tool calls finish. In this case: run the same spot-checks as step 5 (SUMMARY.md exists, git commits present, no Self-Check: FAILED). If spot-checks PASS → treat as **successful**. If spot-checks FAIL → treat as real failure below.
    For real failures: report which plan failed → ask "Continue?" or "Stop?" → if continue, dependent plans may also fail. If stop, partial completion report.
-6. **Execute checkpoint plans between waves** — see `<checkpoint_handling>`.
+7b. **Pre-wave dependency check (waves 2+ only):**
+    Before spawning wave N+1, for each plan in the upcoming wave:
+    ```bash
+    node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" verify key-links {phase_dir}/{plan}-PLAN.md
+    ```
+    If any key-link from a PRIOR wave's artifact fails verification:
-7. **Proceed to next wave.**
+    ## Cross-Plan Wiring Gap
+    | Plan | Link | From | Expected Pattern | Status |
+    |------|------|------|-----------------|--------|
+    | {plan} | {via} | {from} | {pattern} | NOT FOUND |
+    Wave {N} artifacts may not be properly wired. Options:
+    1. Investigate and fix before continuing
+    2. Continue (may cause cascading failures in wave {N+1})
+    Key-links referencing files in the CURRENT (upcoming) wave are skipped.
+8. **Execute checkpoint plans between waves** — see `<checkpoint_handling>`.
+9. **Proceed to next wave.**
 </step>
 <step name="checkpoint_handling">
@@ -248,6 +637,58 @@ After all waves:
 ### Issues Encountered
 [Aggregate from SUMMARYs, or "None"]
 ```
+**Security gate check:**
+```bash
+SECURITY_CFG=$(node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" config-get workflow.security_enforcement --raw 2>/dev/null || echo "true")
+SECURITY_FILE=$(ls "${PHASE_DIR}"/*-SECURITY.md 2>/dev/null | head -1)
+```
+If `SECURITY_CFG` is `false`: skip.
+If `SECURITY_CFG` is `true` AND `SECURITY_FILE` is empty (no SECURITY.md yet):
+Include in the next-steps routing output:
+```
+⚠ Security enforcement enabled — run before advancing:
+  /gsd-secure-phase {PHASE} ${GSD_WS}
+```
+If `SECURITY_CFG` is `true` AND SECURITY.md exists: check frontmatter `threats_open`. If > 0:
+```
+⚠ Security gate: {threats_open} threats open
+  /gsd-secure-phase {PHASE} — resolve before advancing
+```
+</step>
+<step name="handle_partial_wave_execution">
+If `WAVE_FILTER` was used, re-run plan discovery after execution:
+```bash
+POST_PLAN_INDEX=$(node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" phase-plan-index "${PHASE_NUMBER}")
+```
+Apply the same "incomplete" filtering rules as earlier:
+- ignore plans with `has_summary: true`
+- if `--gaps-only`, only consider `gap_closure: true` plans
+**If incomplete plans still remain anywhere in the phase:**
+- STOP here
+- Do NOT run phase verification
+- Do NOT mark the phase complete in ROADMAP/STATE
+- Present:
+```markdown
+## Wave {WAVE_FILTER} Complete
+Selected wave finished successfully. This phase still has incomplete plans, so phase-level verification and completion were intentionally skipped.
+/gsd-execute-phase {phase} ${GSD_WS}                # Continue remaining waves
+/gsd-execute-phase {phase} --wave {next} ${GSD_WS}  # Run the next wave explicitly
+```
+**If no incomplete plans remain after the selected wave finishes:**
+- continue with the normal phase-level verification and completion flow below
+- this means the selected wave happened to be the last remaining work in the phase
 </step>
 <step name="close_parent_artifacts">
@@ -300,21 +741,161 @@ node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" commit "docs(phase
 ```
 </step>
+<step name="regression_gate">
+Run prior phases' test suites to catch cross-phase regressions BEFORE verification.
+**Skip if:** This is the first phase (no prior phases), or no prior VERIFICATION.md files exist.
+**Step 1: Discover prior phases' test files**
+```bash
+# Find all VERIFICATION.md files from prior phases in current milestone
+PRIOR_VERIFICATIONS=$(find .planning/phases/ -name "*-VERIFICATION.md" ! -path "*${PHASE_NUMBER}*" 2>/dev/null)
+```
+**Step 2: Extract test file lists from prior verifications**
+For each VERIFICATION.md found, look for test file references:
+- Lines containing `test`, `spec`, or `__tests__` paths
+- The "Test Suite" or "Automated Checks" section
+- File patterns from `key-files.created` in corresponding SUMMARY.md files that match `*.test.*` or `*.spec.*`
+Collect all unique test file paths into `REGRESSION_FILES`.
+**Step 3: Run regression tests (if any found)**
+```bash
+# Detect test runner and run prior phase tests
+if [ -f "package.json" ]; then
+  # Node.js — use project's test runner
+  npx jest ${REGRESSION_FILES} --passWithNoTests --no-coverage -q 2>&1 || npx vitest run ${REGRESSION_FILES} 2>&1
+elif [ -f "Cargo.toml" ]; then
+  cargo test 2>&1
+elif [ -f "requirements.txt" ] || [ -f "pyproject.toml" ]; then
+  python -m pytest ${REGRESSION_FILES} -q --tb=short 2>&1
+fi
+```
+**Step 4: Report results**
+If all tests pass:
+```
+✓ Regression gate: {N} prior-phase test files passed — no regressions detected
+```
+→ Proceed to verify_phase_goal
+If any tests fail:
+```
+## ⚠ Cross-Phase Regression Detected
+Phase {X} execution may have broken functionality from prior phases.
+| Test File | Phase | Status | Detail |
+|-----------|-------|--------|--------|
+| {file} | {origin_phase} | FAILED | {first_failure_line} |
+Options:
+1. Fix regressions before verification (recommended)
+2. Continue to verification anyway (regressions will compound)
+3. Abort phase — roll back and re-plan
+```
+Use question to present the options.
+</step>
+<step name="schema_drift_gate">
+Post-execution schema drift detection. Catches false-positive verification where
+build/types pass because TypeScript types come from config, not the live database.
+**Run after execution completes but BEFORE verification marks success.**
+```bash
+SCHEMA_DRIFT=$(node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" verify schema-drift "${PHASE_NUMBER}" 2>/dev/null)
+```
+Parse JSON result for: `drift_detected`, `blocking`, `schema_files`, `orms`, `unpushed_orms`, `message`.
+**If `drift_detected` is false:** Skip to verify_phase_goal.
+**If `drift_detected` is true AND `blocking` is true:**
+Check for override:
+```bash
+SKIP_SCHEMA=$(echo "${GSD_SKIP_SCHEMA_CHECK:-false}")
+```
+**If `SKIP_SCHEMA` is `true`:**
+Display:
+```
+⚠ Schema drift detected but GSD_SKIP_SCHEMA_CHECK=true — bypassing gate.
+Schema files changed: {schema_files}
+ORMs requiring push: {unpushed_orms}
+Proceeding to verification (database may be out of sync).
+```
+→ Continue to verify_phase_goal.
+**If `SKIP_SCHEMA` is not `true`:**
+BLOCK verification. Display:
+```
+## BLOCKED: Schema Drift Detected
+Schema-relevant files changed during this phase but no database push command
+was executed. Build and type checks pass because TypeScript types come from
+config, not the live database — verification would produce a false positive.
+Schema files changed: {schema_files}
+ORMs requiring push: {unpushed_orms}
+Required push commands:
+{For each unpushed ORM, show the push command from the message}
+Options:
+1. Run push command now (recommended) — execute the push, then re-verify
+2. Skip schema check (GSD_SKIP_SCHEMA_CHECK=true) — bypass this gate
+3. Abort — stop execution and investigate
+```
+If `TEXT_MODE` is true, present as a plain-text numbered list. Otherwise use question.
+**If user selects option 1:** Present the specific push command(s) to run. After user confirms execution, re-run the schema drift check. If it passes, continue to verify_phase_goal.
+**If user selects option 2:** Set override and continue to verify_phase_goal.
+**If user selects option 3:** Stop execution. Report partial completion.
+</step>
 <step name="verify_phase_goal">
 Verify phase achieved its GOAL, not just completed tasks.
+```bash
+VERIFIER_SKILLS=$(node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" agent-skills gsd-verifier 2>/dev/null)
+```
 ```
-task(
-  prompt="Verify phase {phase_number} goal achievement.
+@gsd-verifier "Verify phase {phase_number} goal achievement.
 Phase directory: {phase_dir}
 Phase goal: {goal from ROADMAP.md}
 Phase requirement IDs: {phase_req_ids}
 Check must_haves against actual codebase.
 Cross-reference requirement IDs from PLAN frontmatter against REQUIREMENTS.md — every ID MUST be accounted for.
-Create VERIFICATION.md.",
-  subagent_type="gsd-verifier",
-  model="{verifier_model}"
-)
+Create VERIFICATION.md.
+<files_to_read>
+read these files before verification:
+- {phase_dir}/*-PLAN.md (All plans — understand intent, check must_haves)
+- {phase_dir}/*-SUMMARY.md (All summaries — cross-reference claimed vs actual)
+- .planning/REQUIREMENTS.md (Requirement traceability)
+${CONTEXT_WINDOW >= 500000 ? `- {phase_dir}/*-CONTEXT.md (User decisions — verify they were honored)
+- {phase_dir}/*-RESEARCH.md (Known pitfalls — check for traps)
+- Prior VERIFICATION.md files from earlier phases (regression check)
+` : ''}
+</files_to_read>
+${VERIFIER_SKILLS}"
 ```
 read status:
@@ -326,9 +907,54 @@ grep "^status:" "$PHASE_DIR"/*-VERIFICATION.md | cut -d: -f2 | tr -d ' '
 |--------|--------|
 | `passed` | → update_roadmap |
 | `human_needed` | Present items for human testing, get approval or feedback |
-| `gaps_found` | Present gap summary, offer `/gsd-plan-phase {phase} --gaps` |
+| `gaps_found` | Present gap summary, offer `/gsd-plan-phase {phase} --gaps ${GSD_WS}` |
 **If human_needed:**
+**Step A: Persist human verification items as UAT file.**
+Create `{phase_dir}/{phase_num}-HUMAN-UAT.md` using UAT template format:
+```markdown
+---
+status: partial
+phase: {phase_num}-{phase_name}
+source: [{phase_num}-VERIFICATION.md]
+started: [now ISO]
+updated: [now ISO]
+---
+## Current Test
+[awaiting human testing]
+## Tests
+{For each human_verification item from VERIFICATION.md:}
+### {N}. {item description}
+expected: {expected behavior from VERIFICATION.md}
+result: [pending]
+## Summary
+total: {count}
+passed: 0
+issues: 0
+pending: {count}
+skipped: 0
+blocked: 0
+## Gaps
+```
+Commit the file:
+```bash
+node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" commit "test({phase_num}): persist human verification items as UAT" --files "{phase_dir}/{phase_num}-HUMAN-UAT.md"
+```
+**Step B: Present to user:**
 ```
 ## ✓ Phase {X}: {Name} — Human Verification Required
@@ -336,9 +962,15 @@ All automated checks passed. {N} items need human testing:
 {From VERIFICATION.md human_verification section}
+Items saved to `{phase_num}-HUMAN-UAT.md` — they will appear in `/gsd-progress` and `/gsd-audit-uat`.
 "approved" → continue | Report issues → gap closure
 ```
+**If user says "approved":** Proceed to `update_roadmap`. The HUMAN-UAT.md file persists with `status: partial` and will surface in future progress checks until the user runs `/gsd-verify-work` on it.
+**If user reports issues:** Proceed to gap closure as currently implemented.
 **If gaps_found:**
 ```
 ## ⚠ Phase {X}: {Name} — Gaps Found
@@ -352,15 +984,15 @@ All automated checks passed. {N} items need human testing:
 ---
 ## ▶ Next Up
-`/gsd-plan-phase {X} --gaps`
+`/new` then:
-*`/new` first → fresh context window*
+`/gsd-plan-phase {X} --gaps ${GSD_WS}`
 Also: `cat {phase_dir}/{phase_num}-VERIFICATION.md` — full report
-Also: `/gsd-verify-work {X}` — manual testing first
+Also: `/gsd-verify-work {X} ${GSD_WS}` — manual testing first
 ```
-Gap closure cycle: `/gsd-plan-phase {X} --gaps` reads VERIFICATION.md → creates gap plans with `gap_closure: true` → user runs `/gsd-execute-phase {X} --gaps-only` → verifier re-runs.
+Gap closure cycle: `/gsd-plan-phase {X} --gaps ${GSD_WS}` reads VERIFICATION.md → creates gap plans with `gap_closure: true` → user runs `/gsd-execute-phase {X} --gaps-only ${GSD_WS}` → verifier re-runs.
 </step>
 <step name="update_roadmap">
@@ -376,14 +1008,46 @@ The CLI handles:
 - Updating plan count to final
 - Advancing STATE.md to next phase
 - Updating REQUIREMENTS.md traceability
+- Scanning for verification debt (returns `warnings` array)
+Extract from result: `next_phase`, `next_phase_name`, `is_last_phase`, `warnings`, `has_warnings`.
+**If has_warnings is true:**
+```
+## Phase {X} marked complete with {N} warnings:
-Extract from result: `next_phase`, `next_phase_name`, `is_last_phase`.
+{list each warning}
+These items are tracked and will appear in `/gsd-progress` and `/gsd-audit-uat`.
+```
 ```bash
 node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" commit "docs(phase-{X}): complete phase execution" --files .planning/ROADMAP.md .planning/STATE.md .planning/REQUIREMENTS.md {phase_dir}/*-VERIFICATION.md
 ```
 </step>
+<step name="update_project_md">
+**Evolve PROJECT.md to reflect phase completion (prevents planning document drift — #956):**
+PROJECT.md tracks validated requirements, decisions, and current state. Without this step,
+PROJECT.md falls behind silently over multiple phases.
+1. read `.planning/PROJECT.md`
+2. If the file exists and has a `## Validated Requirements` or `## Requirements` section:
+   - Move any requirements validated by this phase from Active → Validated
+   - Add a brief note: `Validated in Phase {X}: {Name}`
+3. If the file has a `## Current State` or similar section:
+   - Update it to reflect this phase's completion (e.g., "Phase {X} complete — {one-liner}")
+4. Update the `Last updated:` footer to today's date
+5. Commit the change:
+```bash
+node "$HOME/.config/opencode/get-shit-done/bin/gsd-tools.cjs" commit "docs(phase-{X}): evolve PROJECT.md after phase completion" --files .planning/PROJECT.md
+```
+**Skip this step if** `.planning/PROJECT.md` does not exist.
+</step>
 <step name="offer_next">
 **Exception:** If `gaps_found`, the `verify_phase_goal` step already presents the gap-closure path (`/gsd-plan-phase {X} --gaps`). No additional routing needed — skip auto-advance.
@@ -433,15 +1097,34 @@ Execute the transition workflow inline (do NOT use task — orchestrator context
 read and follow `$HOME/.config/opencode/get-shit-done/workflows/transition.md`, passing through the `--auto` flag so it propagates to the next phase invocation.
-**If neither `--auto` nor `AUTO_CFG` is true:**
+**If none of `--auto`, `AUTO_CHAIN`, or `AUTO_CFG` is true:**
-The workflow ends. The user runs `/gsd-progress` or invokes the transition workflow manually.
+**STOP. Do not auto-advance. Do not execute transition. Do not plan next phase. Present options to the user and wait.**
+**IMPORTANT: There is NO `/gsd-transition` command. Never suggest it. The transition workflow is internal only.**
+```
+## ✓ Phase {X}: {Name} Complete
+/gsd-progress ${GSD_WS} — see updated roadmap
+/gsd-discuss-phase {next} ${GSD_WS} — discuss next phase before planning
+/gsd-plan-phase {next} ${GSD_WS} — plan next phase
+/gsd-execute-phase {next} ${GSD_WS} — execute next phase
+```
+Only suggest the commands listed above. Do not invent or hallucinate command names.
 </step>
 </process>
 <context_efficiency>
-Orchestrator: ~10-15% context. Subagents: fresh 200k each. No polling (task blocks). No context bleed.
+Orchestrator: ~10-15% context for 200k windows, can use more for 1M+ windows.
+Subagents: fresh context each (200k-1M depending on model). No polling (task blocks). No context bleed.
+For 1M+ context models, consider:
+- Passing richer context (code snippets, dependency outputs) directly to executors instead of just file paths
+- Running small phases (≤3 plans, no dependencies) inline without subagent spawning overhead
+- Relaxing /new recommendations — context rot onset is much further out with 5x window
 </context_efficiency>
 <failure_handling>