RubyGems - ariadna - Versions diffs - 1.3.1 → 2.0.0 - Mend

ariadna 1.3.1 → 2.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (148) hide show

checksums.yaml +4 -4
data/ariadna.gemspec +0 -1
data/data/agents/ariadna-codebase-mapper.md +34 -722
data/data/agents/ariadna-debugger.md +44 -1139
data/data/agents/ariadna-executor.md +75 -396
data/data/agents/ariadna-planner.md +78 -1215
data/data/agents/ariadna-roadmapper.md +55 -582
data/data/agents/ariadna-verifier.md +60 -702
data/data/ariadna/templates/config.json +8 -33
data/data/ariadna/workflows/debug.md +28 -0
data/data/ariadna/workflows/execute-phase.md +31 -513
data/data/ariadna/workflows/map-codebase.md +20 -319
data/data/ariadna/workflows/new-milestone.md +20 -365
data/data/ariadna/workflows/new-project.md +19 -880
data/data/ariadna/workflows/plan-phase.md +24 -443
data/data/ariadna/workflows/progress.md +20 -376
data/data/ariadna/workflows/quick.md +19 -221
data/data/ariadna/workflows/roadmap-ops.md +28 -0
data/data/ariadna/workflows/verify-work.md +23 -560
data/data/commands/ariadna/add-phase.md +11 -22
data/data/commands/ariadna/debug.md +11 -143
data/data/commands/ariadna/execute-phase.md +12 -30
data/data/commands/ariadna/insert-phase.md +7 -14
data/data/commands/ariadna/map-codebase.md +16 -49
data/data/commands/ariadna/new-milestone.md +12 -25
data/data/commands/ariadna/new-project.md +22 -26
data/data/commands/ariadna/plan-phase.md +13 -22
data/data/commands/ariadna/progress.md +16 -6
data/data/commands/ariadna/quick.md +9 -11
data/data/commands/ariadna/remove-phase.md +9 -12
data/data/commands/ariadna/verify-work.md +14 -19
data/data/skills/rails-backend/API.md +138 -0
data/data/skills/rails-backend/CONTROLLERS.md +154 -0
data/data/skills/rails-backend/JOBS.md +132 -0
data/data/skills/rails-backend/MODELS.md +213 -0
data/data/skills/rails-backend/SKILL.md +169 -0
data/data/skills/rails-frontend/ASSETS.md +154 -0
data/data/skills/rails-frontend/COMPONENTS.md +253 -0
data/data/skills/rails-frontend/SKILL.md +187 -0
data/data/skills/rails-frontend/VIEWS.md +168 -0
data/data/skills/rails-performance/PROFILING.md +106 -0
data/data/skills/rails-performance/SKILL.md +217 -0
data/data/skills/rails-security/AUDIT.md +118 -0
data/data/skills/rails-security/SKILL.md +422 -0
data/data/skills/rails-testing/FIXTURES.md +78 -0
data/data/skills/rails-testing/SKILL.md +160 -0
data/data/skills/rails-testing/SYSTEM-TESTS.md +73 -0
data/lib/ariadna/installer.rb +11 -15
data/lib/ariadna/tools/cli.rb +0 -12
data/lib/ariadna/tools/config_manager.rb +10 -72
data/lib/ariadna/tools/frontmatter.rb +23 -1
data/lib/ariadna/tools/init.rb +201 -401
data/lib/ariadna/tools/model_profiles.rb +6 -14
data/lib/ariadna/tools/phase_manager.rb +1 -10
data/lib/ariadna/tools/state_manager.rb +170 -451
data/lib/ariadna/tools/template_filler.rb +4 -12
data/lib/ariadna/tools/verification.rb +21 -399
data/lib/ariadna/uninstaller.rb +9 -0
data/lib/ariadna/version.rb +1 -1
metadata +20 -91
data/data/agents/ariadna-backend-executor.md +0 -261
data/data/agents/ariadna-frontend-executor.md +0 -259
data/data/agents/ariadna-integration-checker.md +0 -418
data/data/agents/ariadna-phase-researcher.md +0 -469
data/data/agents/ariadna-plan-checker.md +0 -622
data/data/agents/ariadna-project-researcher.md +0 -618
data/data/agents/ariadna-research-synthesizer.md +0 -236
data/data/agents/ariadna-test-executor.md +0 -266
data/data/ariadna/references/checkpoints.md +0 -772
data/data/ariadna/references/continuation-format.md +0 -249
data/data/ariadna/references/decimal-phase-calculation.md +0 -65
data/data/ariadna/references/git-integration.md +0 -248
data/data/ariadna/references/git-planning-commit.md +0 -38
data/data/ariadna/references/model-profile-resolution.md +0 -32
data/data/ariadna/references/model-profiles.md +0 -73
data/data/ariadna/references/phase-argument-parsing.md +0 -61
data/data/ariadna/references/planning-config.md +0 -194
data/data/ariadna/references/questioning.md +0 -153
data/data/ariadna/references/rails-conventions.md +0 -416
data/data/ariadna/references/tdd.md +0 -267
data/data/ariadna/references/ui-brand.md +0 -160
data/data/ariadna/references/verification-patterns.md +0 -853
data/data/ariadna/templates/codebase/architecture.md +0 -481
data/data/ariadna/templates/codebase/concerns.md +0 -380
data/data/ariadna/templates/codebase/conventions.md +0 -434
data/data/ariadna/templates/codebase/integrations.md +0 -328
data/data/ariadna/templates/codebase/stack.md +0 -189
data/data/ariadna/templates/codebase/structure.md +0 -418
data/data/ariadna/templates/codebase/testing.md +0 -606
data/data/ariadna/templates/context.md +0 -283
data/data/ariadna/templates/continue-here.md +0 -78
data/data/ariadna/templates/debug-subagent-prompt.md +0 -91
data/data/ariadna/templates/phase-prompt.md +0 -609
data/data/ariadna/templates/planner-subagent-prompt.md +0 -117
data/data/ariadna/templates/research-project/ARCHITECTURE.md +0 -439
data/data/ariadna/templates/research-project/FEATURES.md +0 -168
data/data/ariadna/templates/research-project/PITFALLS.md +0 -406
data/data/ariadna/templates/research-project/STACK.md +0 -251
data/data/ariadna/templates/research-project/SUMMARY.md +0 -247
data/data/ariadna/templates/state.md +0 -176
data/data/ariadna/templates/summary-complex.md +0 -59
data/data/ariadna/templates/summary-minimal.md +0 -41
data/data/ariadna/templates/summary-standard.md +0 -48
data/data/ariadna/templates/user-setup.md +0 -310
data/data/ariadna/workflows/add-phase.md +0 -111
data/data/ariadna/workflows/add-todo.md +0 -157
data/data/ariadna/workflows/audit-milestone.md +0 -241
data/data/ariadna/workflows/check-todos.md +0 -176
data/data/ariadna/workflows/complete-milestone.md +0 -644
data/data/ariadna/workflows/diagnose-issues.md +0 -219
data/data/ariadna/workflows/discovery-phase.md +0 -289
data/data/ariadna/workflows/discuss-phase.md +0 -408
data/data/ariadna/workflows/execute-plan.md +0 -448
data/data/ariadna/workflows/help.md +0 -470
data/data/ariadna/workflows/insert-phase.md +0 -129
data/data/ariadna/workflows/list-phase-assumptions.md +0 -178
data/data/ariadna/workflows/pause-work.md +0 -122
data/data/ariadna/workflows/plan-milestone-gaps.md +0 -256
data/data/ariadna/workflows/remove-phase.md +0 -154
data/data/ariadna/workflows/research-phase.md +0 -74
data/data/ariadna/workflows/resume-project.md +0 -306
data/data/ariadna/workflows/set-profile.md +0 -80
data/data/ariadna/workflows/settings.md +0 -145
data/data/ariadna/workflows/transition.md +0 -493
data/data/ariadna/workflows/update.md +0 -212
data/data/ariadna/workflows/verify-phase.md +0 -226
data/data/commands/ariadna/add-todo.md +0 -42
data/data/commands/ariadna/audit-milestone.md +0 -42
data/data/commands/ariadna/check-todos.md +0 -41
data/data/commands/ariadna/complete-milestone.md +0 -136
data/data/commands/ariadna/discuss-phase.md +0 -86
data/data/commands/ariadna/help.md +0 -22
data/data/commands/ariadna/list-phase-assumptions.md +0 -50
data/data/commands/ariadna/pause-work.md +0 -35
data/data/commands/ariadna/plan-milestone-gaps.md +0 -40
data/data/commands/ariadna/reapply-patches.md +0 -110
data/data/commands/ariadna/research-phase.md +0 -187
data/data/commands/ariadna/resume-work.md +0 -40
data/data/commands/ariadna/set-profile.md +0 -34
data/data/commands/ariadna/settings.md +0 -36
data/data/commands/ariadna/update.md +0 -37
data/data/guides/backend.md +0 -3069
data/data/guides/frontend.md +0 -1479
data/data/guides/performance.md +0 -1193
data/data/guides/security.md +0 -1522
data/data/guides/style-guide.md +0 -1091
data/data/guides/testing.md +0 -504
data/data/templates.md +0 -94

data/data/ariadna/templates/config.json CHANGED Viewed

@@ -1,35 +1,10 @@
 {
-  "mode": "interactive",
-  "depth": "standard",
-  "workflow": {
-    "research": true,
-    "plan_check": true,
-    "verifier": true
-  },
-  "planning": {
-    "commit_docs": true,
-    "search_gitignored": false
-  },
-  "parallelization": {
-    "enabled": true,
-    "plan_level": true,
-    "task_level": false,
-    "skip_checkpoints": true,
-    "max_concurrent_agents": 3,
-    "min_plans_for_parallel": 2
-  },
-  "gates": {
-    "confirm_project": true,
-    "confirm_phases": true,
-    "confirm_roadmap": true,
-    "confirm_breakdown": true,
-    "confirm_plan": true,
-    "execute_next_plan": true,
-    "issues_review": true,
-    "confirm_transition": true
-  },
-  "safety": {
-    "always_confirm_destructive": true,
-    "always_confirm_external_services": true
-  }
+  "model_profile": "balanced",
+  "verifier": true,
+  "branching_strategy": "none",
+  "phase_branch_template": "ariadna/phase-{phase}-{slug}",
+  "milestone_branch_template": "ariadna/{milestone}-{slug}",
+  "commit_docs": true,
+  "search_gitignored": false,
+  "parallelization": true
 }

data/data/ariadna/workflows/debug.md ADDED Viewed

@@ -0,0 +1,28 @@
+---
+name: debug
+description: Systematically diagnose UAT gaps by spawning parallel debug agents — one per gap — to find root causes before planning fixes.
+---
+## Goal
+Turn UAT symptoms into diagnosed root causes. Parse gaps from a phase's UAT.md, spawn one debug agent per gap in parallel, collect root causes with evidence, then update UAT.md with `root_cause`, `artifacts`, `missing`, and `debug_session` fields so `plan-phase --gaps` can create targeted fixes.
+## Context Loading
+```bash
+INIT=$(ariadna-tools init debug "$PHASE")
+```
+Read the phase UAT.md from `.ariadna_planning/phases/{phase-dir}/{phase}-UAT.md`. Extract the "Gaps" section (YAML) — each gap has `truth`, `severity`, `test`, `reason`. Also read the matching "Tests" entries for full context. If no UAT.md or no failed gaps exist, error and exit.
+## Constraints
+- Diagnose only — do NOT apply fixes; that is `plan-phase --gaps`'s job
+- Spawn all debug agents in a single message (true parallel execution via `run_in_background=true`)
+- Each agent writes its own `DEBUG-{slug}.md` to `.ariadna_planning/debug/`; orchestrator only receives root cause + file paths
+- If an agent returns `## INVESTIGATION INCONCLUSIVE`, mark gap as "needs manual review" and continue with remaining gaps
+- After all agents complete, update UAT.md gaps in place and commit with `docs({phase}): add root causes from diagnosis`
+## Success Criteria
+- Every failed gap in UAT.md has `root_cause`, `artifacts`, and `missing` fields populated
+- UAT.md frontmatter `status` updated to `diagnosed` and committed
+- Debug sessions saved to `.ariadna_planning/debug/` for reference
+## On Completion
+Display a root-cause table (gap truth → root cause → files involved). Return to the verify-work orchestrator automatically — do NOT offer manual next steps; verify-work routes to `plan-phase --gaps`.

data/data/ariadna/workflows/execute-phase.md CHANGED Viewed

@@ -1,518 +1,36 @@
-<purpose>
-Execute all plans in a phase using wave-based parallel execution. Orchestrator stays lean — delegates plan execution to subagents.
-</purpose>
-<core_principle>
-Orchestrator coordinates, not executes. Each subagent loads the full execute-plan context. Orchestrator: discover plans → analyze deps → group waves → spawn agents → handle checkpoints → collect results.
-</core_principle>
-<required_reading>
-Read STATE.md before any operation to load project context.
-</required_reading>
-<process>
-<step name="initialize" priority="first">
-Load all context in one call:
-```bash
-INIT=$(ariadna-tools init execute-phase "${PHASE_ARG}")
-```
-Parse JSON for: `executor_model`, `verifier_model`, `commit_docs`, `parallelization`, `branching_strategy`, `branch_name`, `phase_found`, `phase_dir`, `phase_number`, `phase_name`, `phase_slug`, `plans`, `incomplete_plans`, `plan_count`, `incomplete_count`, `team_execution`, `execution_mode`, `backend_executor_model`, `frontend_executor_model`, `test_executor_model`, `state_exists`, `roadmap_exists`.
-**If `phase_found` is false:** Error — phase directory not found.
-**If `plan_count` is 0:** Error — no plans found in phase.
-**If `state_exists` is false but `.ariadna_planning/` exists:** Offer reconstruct or continue.
-When `parallelization` is false, plans within a wave execute sequentially.
-</step>
-<step name="handle_branching">
-Check `branching_strategy` from init:
-**"none":** Skip, continue on current branch.
-**"phase" or "milestone":** Use pre-computed `branch_name` from init:
-```bash
-git checkout -b "$BRANCH_NAME" 2>/dev/null || git checkout "$BRANCH_NAME"
-```
-All subsequent commits go to this branch. User handles merging.
-</step>
-<step name="validate_phase">
-From init JSON: `phase_dir`, `plan_count`, `incomplete_count`.
-Report: "Found {plan_count} plans in {phase_dir} ({incomplete_count} incomplete)"
-</step>
-<step name="discover_and_group_plans">
-Load plan inventory with wave grouping in one call:
-```bash
-PLAN_INDEX=$(ariadna-tools phase-plan-index "${PHASE_NUMBER}")
-```
-Parse JSON for: `plans[]` (each with `file`, `phase`, `plan`, `wave`, `type`, `completed`, `domain`, `depends_on`, `files_modified`, `autonomous`, `objective`, `task_count`), `count`, `domains`, `domain_count`, `multi_domain`, `recommend_team`.
-**Filtering:** Skip plans where `has_summary: true`. If `--gaps-only`: also skip non-gap_closure plans. If all filtered: "No matching incomplete plans" → exit.
-Report:
-```
-## Execution Plan
-**Phase {X}: {Name}** — {total_plans} plans across {wave_count} waves
-| Wave | Plans | What it builds |
-|------|-------|----------------|
-| 1 | 01-01, 01-02 | {from plan objectives, 3-8 words} |
-| 2 | 01-03 | ... |
-```
-</step>
-<step name="decide_execution_mode">
-**Determine execution mode:**
-1. If `--team` flag → team execution
-2. If `--no-team` flag → wave execution
-3. If `team_execution` from init is `true` → team execution
-4. If `team_execution` from init is `false` → wave execution
-5. If `team_execution` is `"auto"` → check plan index:
-   - Parse `multi_domain` and `recommend_team` from PLAN_INDEX
-   - If `recommend_team` is true (3+ plans, 2+ non-general domains) → team execution
-   - Otherwise → wave execution
-Report:
-```
-**Execution mode:** {Team | Wave}
-{If auto: "Auto-detected: {plan_count} plans across {domains}"}
-```
-If team mode: proceed to `team_execution` step.
-If wave mode: proceed to `execute_waves` step.
-</step>
-<step name="execute_waves">
-Execute each wave in sequence. Within a wave: parallel if `PARALLELIZATION=true`, sequential if `false`.
-**For each wave:**
-1. **Describe what's being built (BEFORE spawning):**
-   Read each plan's `<objective>`. Extract what's being built and why.
-   ```
-   ---
-   ## Wave {N}
-   **{Plan ID}: {Plan Name}**
-   {2-3 sentences: what this builds, technical approach, why it matters}
-   Spawning {count} agent(s)...
-   ---
-   ```
-   - Bad: "Executing terrain generation plan"
-   - Good: "Procedural terrain generator using Perlin noise — creates height maps, biome zones, and collision meshes. Required before vehicle physics can interact with ground."
-2. **Spawn executor agents:**
-   Pass paths only — executors read files themselves with their fresh 200k context.
-   This keeps orchestrator context lean (~10-15%).
-   **Domain routing:** Read `domain` from each plan's frontmatter to determine which executor to spawn:
-   ```bash
-   DOMAIN=$(ariadna-tools frontmatter get "{phase_dir}/{plan_file}" --field domain)
-   DOMAIN_GUIDE=$(ariadna-tools frontmatter get "{phase_dir}/{plan_file}" --field domain_guide)
-   ```
-   | Domain | Executor Agent | Guide |
-   |--------|---------------|-------|
-   | `backend` | `ariadna-backend-executor` | `@~/.claude/guides/backend.md` |
-   | `frontend` | `ariadna-frontend-executor` | `@~/.claude/guides/frontend.md` |
-   | `testing` | `ariadna-test-executor` | `@~/.claude/guides/testing.md` |
-   | `general` or unset | `ariadna-executor` | (none) |
-   ```
-   Task(
-     subagent_type="{executor_agent}",
-     model="{executor_model}",
-     prompt="
-       <objective>
-       Execute plan {plan_number} of phase {phase_number}-{phase_name}.
-       Commit each task atomically. Create SUMMARY.md. Update STATE.md.
-       </objective>
-       <execution_context>
-       @~/.claude/ariadna/workflows/execute-plan.md
-       @~/.claude/ariadna/templates/summary.md
-       @~/.claude/ariadna/references/checkpoints.md
-       @~/.claude/ariadna/references/tdd.md
-       {If domain_guide is set:}
-       @~/.claude/guides/{domain_guide}
-       </execution_context>
-       <files_to_read>
-       Read these files at execution start using the Read tool:
-       - Plan: {phase_dir}/{plan_file}
-       - State: .ariadna_planning/STATE.md
-       - Config: .ariadna_planning/config.json (if exists)
-       </files_to_read>
-       <success_criteria>
-       - [ ] All tasks executed
-       - [ ] Each task committed individually
-       - [ ] SUMMARY.md created in plan directory
-       - [ ] STATE.md updated with position and decisions
-       </success_criteria>
-     "
-   )
-   ```
-3. **Wait for all agents in wave to complete.**
-4. **Report completion — spot-check claims first:**
-   For each SUMMARY.md:
-   - Verify first 2 files from `key-files.created` exist on disk
-   - Check `git log --oneline --all --grep="{phase}-{plan}"` returns ≥1 commit
-   - Check for `## Self-Check: FAILED` marker
-   If ANY spot-check fails: report which plan failed, route to failure handler — ask "Retry plan?" or "Continue with remaining waves?"
-   If pass:
-   ```
-   ---
-   ## Wave {N} Complete
-   **{Plan ID}: {Plan Name}**
-   {What was built — from SUMMARY.md}
-   {Notable deviations, if any}
-   {If more waves: what this enables for next wave}
-   ---
-   ```
-   - Bad: "Wave 2 complete. Proceeding to Wave 3."
-   - Good: "Terrain system complete — 3 biome types, height-based texturing, physics collision meshes. Vehicle physics (Wave 3) can now reference ground surfaces."
-5. **Handle failures:**
-   **Known Claude Code bug (classifyHandoffIfNeeded):** If an agent reports "failed" with error containing `classifyHandoffIfNeeded is not defined`, this is a Claude Code runtime bug — not an Ariadna or agent issue. The error fires in the completion handler AFTER all tool calls finish. In this case: run the same spot-checks as step 4 (SUMMARY.md exists, git commits present, no Self-Check: FAILED). If spot-checks PASS → treat as **successful**. If spot-checks FAIL → treat as real failure below.
-   For real failures: report which plan failed → ask "Continue?" or "Stop?" → if continue, dependent plans may also fail. If stop, partial completion report.
-6. **Execute checkpoint plans between waves** — see `<checkpoint_handling>`.
-7. **Proceed to next wave.**
-</step>
-<step name="team_execution">
-**Alternative to `execute_waves`.** Used when `team_execution` config is `true` OR `--team` flag is passed.
-**Decision gate:** Check config and flags:
-```bash
-TEAM_MODE=$(ariadna-tools config get execution.team 2>/dev/null || echo "false")
-```
-If `TEAM_MODE` is `true` OR `--team` flag present → use team execution. Otherwise → use wave-based `execute_waves` (default).
-**Team execution flow:**
-1. **Create team:**
-   ```
-   TeamCreate(team_name="phase-{N}-execution", description="Executing phase {N}")
-   ```
-2. **Create tasks from plans:** One `TaskCreate` per incomplete plan:
-   ```
-   TaskCreate(
-     subject="Execute plan {plan_id}: {objective}",
-     description="Execute {phase_dir}/{plan_file}. Domain: {domain}. Files: {files_modified}.",
-     activeForm="Executing plan {plan_id}"
-   )
-   ```
-   Set up dependencies using `addBlockedBy` matching plan `depends_on` fields.
-3. **Spawn domain executor agents:** One agent per unique domain in the plans:
-   ```
-   Task(
-     team_name="phase-{N}-execution",
-     name="{domain}-executor",
-     subagent_type="ariadna-{domain}-executor",
-     model="{executor_model}",
-     prompt="
-       You are a {domain} executor on team phase-{N}-execution.
-       <protocol>
-       1. Check TaskList for tasks assigned to you
-       2. Claim unblocked tasks via TaskUpdate(status='in_progress')
-       3. Read the plan file, execute all tasks, create SUMMARY.md
-       4. Mark task completed via TaskUpdate(status='completed')
-       5. Check TaskList for next available task
-       6. When no tasks remain, send message to team lead
-       </protocol>
-       <execution_context>
-       @~/.claude/ariadna/workflows/execute-plan.md
-       @~/.claude/ariadna/templates/summary.md
-       @~/.claude/guides/{domain_guide}
-       </execution_context>
-     "
-   )
-   ```
-   For `general` domain, use `ariadna-executor` as the subagent_type.
-4. **Assign tasks:** `TaskUpdate(owner="{domain}-executor")` for each task based on plan domain.
-5. **Monitor progress:** Orchestrator monitors via `TaskList`. When agents complete tasks:
-   - Newly unblocked tasks become available for assignment
-   - Assign unblocked tasks to idle agents of the matching domain
-   - Cross-domain handoffs: if a frontend task depends on a backend task, the frontend executor can read the backend SUMMARY.md for context
-   **Progress reporting (on each task completion message from an agent):**
-   Check `TaskList` and display:
-   ```
-   ## Team Progress
-   | Agent             | Status  | Current Task | Completed |
-   |-------------------|---------|-------------|-----------|
-   | backend-executor  | working | Plan 03-01  | 1/3       |
-   | frontend-executor | idle    | waiting     | 0/2       |
-   | test-executor     | working | Plan 03-03  | 0/1       |
-   ```
-6. **Handle checkpoints:** Same as wave-based — agent sends message to orchestrator, orchestrator presents checkpoint to user, spawns continuation agent.
-7. **Shutdown team:** When all tasks are complete:
-   ```
-   SendMessage(type="shutdown_request", recipient="{domain}-executor")
-   ```
-   for each spawned agent. After all agents shut down:
-   ```
-   TeamDelete()
-   ```
-**Conflict prevention:** File ownership is enforced by `files_modified` frontmatter — the planner ensures no overlap between concurrent plans assigned to different agents.
-**STATE.md serialization:** Agents do NOT update STATE.md in team mode.
-The orchestrator reads each SUMMARY.md after all tasks complete and
-updates STATE.md sequentially. This prevents concurrent write corruption.
-</step>
-<step name="checkpoint_handling">
-Plans with `autonomous: false` require user interaction.
-**Flow:**
-1. Spawn agent for checkpoint plan
-2. Agent runs until checkpoint task or auth gate → returns structured state
-3. Agent return includes: completed tasks table, current task + blocker, checkpoint type/details, what's awaited
-4. **Present to user:**
-   ```
-   ## Checkpoint: [Type]
-   **Plan:** 03-03 Dashboard Layout
-   **Progress:** 2/3 tasks complete
-   [Checkpoint Details from agent return]
-   [Awaiting section from agent return]
-   ```
-5. User responds: "approved"/"done" | issue description | decision selection
-6. **Spawn continuation agent (NOT resume)** using continuation-prompt.md template:
-   - `{completed_tasks_table}`: From checkpoint return
-   - `{resume_task_number}` + `{resume_task_name}`: Current task
-   - `{user_response}`: What user provided
-   - `{resume_instructions}`: Based on checkpoint type
-7. Continuation agent verifies previous commits, continues from resume point
-8. Repeat until plan completes or user stops
-**Why fresh agent, not resume:** Resume relies on internal serialization that breaks with parallel tool calls. Fresh agents with explicit state are more reliable.
-**Checkpoints in parallel waves:** Agent pauses and returns while other parallel agents may complete. Present checkpoint, spawn continuation, wait for all before next wave.
-</step>
-<step name="aggregate_results">
-After all waves (or after all team tasks complete):
-**Team mode state aggregation (if team execution was used):**
-For each completed task's SUMMARY.md, in plan order:
-```bash
-ariadna-tools state advance-plan
-ariadna-tools state record-metric \
-  --phase "${PHASE}" --plan "${PLAN}" --duration "${DURATION}" \
-  --tasks "${TASK_COUNT}" --files "${FILE_COUNT}"
-```
-This runs sequentially from the orchestrator to prevent concurrent writes.
-**Aggregate requirements coverage:** Parse `requirements_covered` from all SUMMARY.md frontmatter in this phase. Cross-check against `requirements_content` (from INIT or re-read REQUIREMENTS.md) for requirements mapped to this phase. Flag uncovered requirements. Show coverage count.
-If covered requirements exist, update REQUIREMENTS.md traceability table: set status to "Complete" for each covered REQ-ID, with evidence from the SUMMARY frontmatter.
-```markdown
-## Phase {X}: {Name} Execution Complete
-**Waves:** {N} | **Plans:** {M}/{total} complete
-**Requirements:** {covered}/{total_for_phase} covered
-| Wave | Plans | Status |
-|------|-------|--------|
-| 1 | plan-01, plan-02 | ✓ Complete |
-| CP | plan-03 | ✓ Verified |
-| 2 | plan-04 | ✓ Complete |
-### Plan Details
-1. **03-01**: [one-liner from SUMMARY.md]
-2. **03-02**: [one-liner from SUMMARY.md]
-### Requirements Coverage
-| REQ-ID | Requirement | Evidence |
-|--------|-------------|----------|
-| {id} | {description} | {evidence from SUMMARY} |
-| {id} | {description} | ⚠ Not covered |
-[Omit section if no REQUIREMENTS.md or no requirements for this phase]
-### Issues Encountered
-[Aggregate from SUMMARYs, or "None"]
-```
-</step>
-<step name="verify_phase_goal">
-Verify phase achieved its GOAL, not just completed tasks.
-```
-Task(
-  prompt="Verify phase {phase_number} goal achievement.
-Phase directory: {phase_dir}
-Phase goal: {goal from ROADMAP.md}
-Check must_haves against actual codebase. Create VERIFICATION.md.",
-  subagent_type="ariadna-verifier",
-  model="{verifier_model}"
-)
-```
-Read status:
-```bash
-grep "^status:" "$PHASE_DIR"/*-VERIFICATION.md | cut -d: -f2 | tr -d ' '
-```
-| Status | Action |
-|--------|--------|
-| `passed` | → user_acceptance |
-| `human_needed` | Present items for human testing, get approval or feedback → user_acceptance |
-| `gaps_found` | Present gap summary, offer `/ariadna:plan-phase {phase} --gaps` |
-**If human_needed:**
-```
-## ✓ Phase {X}: {Name} — Human Verification Required
-All automated checks passed. {N} items need human testing:
-{From VERIFICATION.md human_verification section}
-"approved" → continue | Report issues → gap closure
-```
-**If gaps_found:**
-```
-## ⚠ Phase {X}: {Name} — Gaps Found
-**Score:** {N}/{M} must-haves verified
-**Report:** {phase_dir}/{phase}-VERIFICATION.md
-### What's Missing
-{Gap summaries from VERIFICATION.md}
 ---
-## ▶ Next Up
-`/ariadna:plan-phase {X} --gaps`
-<sub>`/clear` first → fresh context window</sub>
-Also: `cat {phase_dir}/{phase}-VERIFICATION.md` — full report
-Also: `/ariadna:verify-work {X}` — manual testing first
-```
-Gap closure cycle: `/ariadna:plan-phase {X} --gaps` reads VERIFICATION.md → creates gap plans with `gap_closure: true` → user runs `/ariadna:execute-phase {X} --gaps-only` → verifier re-runs.
-</step>
-<step name="user_acceptance">
-**Trigger:** After verifier returns `passed` (or `human_needed` items are approved by user).
-**Skip if:** `--no-review` flag, or ALL plans in this phase have `domain: backend` or `domain: testing` only (no user-facing deliverables).
-**Otherwise:** Show a lightweight acceptance gate. Gather one-liners from each SUMMARY.md + key decisions Claude made during execution:
-```
-questions: [
-  {
-    header: "Acceptance",
-    question: "Phase {X}: {Name} — verified and passing.\n\nWhat was built:\n- {one-liner from SUMMARY 1}\n- {one-liner from SUMMARY 2}\n\nKey decisions made:\n- {decision 1 from SUMMARY}\n- {decision 2 from SUMMARY}\n\nDoes the direction look right?",
-    multiSelect: false,
-    options: [
-      { label: "Looks good", description: "Mark phase complete and continue" },
-      { label: "Test first", description: "Run /ariadna:verify-work before marking complete" },
-      { label: "Issues", description: "Record blocker and suggest gap closure" }
-    ]
-  }
-]
-```
-- **"Looks good":** Proceed to `update_roadmap`.
-- **"Test first":** Display: `Run /ariadna:verify-work {X} to test, then re-run /ariadna:execute-phase {X} to continue.` Exit without marking phase complete.
-- **"Issues":** Ask user to describe the issue. Record as blocker in STATE.md via `ariadna-tools state add-blocker "{issue}"`. Display: `Blocker recorded. Run /ariadna:plan-phase {X} --gaps to create fix plans.` Exit without marking phase complete.
-</step>
+name: execute-phase
+description: Execute all plans in a phase using wave-based ordering, producing SUMMARY.md and atomic commits per task
+---
-<step name="update_roadmap">
-Mark phase complete in ROADMAP.md (date, status).
+## Goal
+Execute every plan in a phase by spawning executor subagents, respecting wave dependencies, and producing committed deliverables. Orchestrator coordinates; subagents do the work.
+## Context Loading
 ```bash
-ariadna-tools commit "docs(phase-{X}): complete phase execution" --files .ariadna_planning/ROADMAP.md .ariadna_planning/STATE.md .ariadna_planning/phases/{phase_dir}/*-VERIFICATION.md .ariadna_planning/REQUIREMENTS.md
-```
-</step>
-<step name="offer_next">
-**If more phases:**
-```
-## Next Up
-**Phase {X+1}: {Name}** — {Goal}
-`/ariadna:plan-phase {X+1}`
-<sub>`/clear` first for fresh context</sub>
-```
-**If milestone complete:**
-```
-MILESTONE COMPLETE!
-All {N} phases executed.
-`/ariadna:complete-milestone`
+INIT=$(ariadna-tools init execute-phase "${PHASE_ARG}")
 ```
-</step>
-</process>
-<context_efficiency>
-Orchestrator: ~10-15% context. Subagents: fresh 200k each. No polling (Task blocks). No context bleed.
-</context_efficiency>
-<failure_handling>
-- **classifyHandoffIfNeeded false failure:** Agent reports "failed" but error is `classifyHandoffIfNeeded is not defined` → Claude Code bug, not Ariadna. Spot-check (SUMMARY exists, commits present) → if pass, treat as success
-- **Agent fails mid-plan:** Missing SUMMARY.md → report, ask user how to proceed
-- **Dependency chain breaks:** Wave 1 fails → Wave 2 dependents likely fail → user chooses attempt or skip
-- **All agents in wave fail:** Systemic issue → stop, report for investigation
-- **Checkpoint unresolvable:** "Skip this plan?" or "Abort phase execution?" → record partial progress in STATE.md
-</failure_handling>
-<resumption>
-Re-run `/ariadna:execute-phase {phase}` → discover_plans finds completed SUMMARYs → skips them → resumes from first incomplete plan → continues wave execution.
-STATE.md tracks: last completed plan, current wave, pending checkpoints.
-</resumption>
+Returns: `phase_dir`, `phase_number`, `phase_name`, `plans[]` (each with `wave`, `domain`, `depends_on`, `has_summary`), `executor_model`, `verifier_model`, `parallelization`, `branching_strategy`, `branch_name`.
+Also read: `.ariadna_planning/STATE.md` for current project position.
+## Constraints
+- Orchestrator stays lean (~10-15% context) — pass paths only, subagents read files themselves
+- Wave ordering is strict: all wave N plans complete before wave N+1 begins
+- Skip plans where `has_summary: true` (already done); resumption is automatic
+- Route executor agent by `domain` frontmatter: `backend` → `ariadna-backend-executor`, `frontend` → `ariadna-frontend-executor`, `testing` → `ariadna-test-executor`, `general`/unset → `ariadna-executor`
+- Load the matching Rails Skills (`@~/.claude/skills/rails-{domain}/SKILL.md`) in each executor prompt
+- Each task must be committed atomically; executor creates SUMMARY.md in plan directory
+- Plans with `autonomous: false` require a checkpoint pause before continuing
+- If agent returns `classifyHandoffIfNeeded` error: spot-check SUMMARY.md + git commits — if present, treat as success
+## Success Criteria
+- Every plan in the phase has a SUMMARY.md
+- `git log` shows at least one commit per plan
+- No `## Self-Check: FAILED` markers in any SUMMARY.md
+## On Completion
+- Spawn `ariadna-verifier` to check phase goal achievement (not just task completion)
+- Update `memory/progress.md` with phase status and any decisions made
+- Record session summary: mark phase complete in ROADMAP.md, commit STATE.md
+- If verifier finds gaps: offer `/ariadna:plan-phase {N} --gaps` for closure