npm - @wazir-dev/cli - Versions diffs - 1.0.0 → 1.2.0 - Mend

@wazir-dev/cli 1.0.0 → 1.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (163) hide show

package/CHANGELOG.md +100 -2
package/README.md +6 -6
package/docs/concepts/architecture.md +1 -1
package/docs/concepts/roles-and-workflows.md +2 -0
package/docs/concepts/why-wazir.md +59 -0
package/docs/decisions/2026-03-19-deferred-items.md +564 -0
package/docs/decisions/2026-03-19-enhancement-decisions.md +300 -0
package/docs/plans/2026-03-15-cli-pipeline-integration-plan.md +1 -1
package/docs/readmes/INDEX.md +21 -5
package/docs/readmes/features/expertise/README.md +2 -2
package/docs/readmes/features/exports/README.md +2 -2
package/docs/readmes/features/schemas/README.md +3 -0
package/docs/readmes/features/skills/README.md +17 -0
package/docs/readmes/features/skills/clarifier.md +5 -0
package/docs/readmes/features/skills/claude-cli.md +5 -0
package/docs/readmes/features/skills/codex-cli.md +5 -0
package/docs/readmes/features/skills/dispatching-parallel-agents.md +5 -0
package/docs/readmes/features/skills/executing-plans.md +5 -0
package/docs/readmes/features/skills/executor.md +5 -0
package/docs/readmes/features/skills/finishing-a-development-branch.md +5 -0
package/docs/readmes/features/skills/gemini-cli.md +5 -0
package/docs/readmes/features/skills/humanize.md +5 -0
package/docs/readmes/features/skills/init-pipeline.md +5 -0
package/docs/readmes/features/skills/receiving-code-review.md +5 -0
package/docs/readmes/features/skills/requesting-code-review.md +5 -0
package/docs/readmes/features/skills/reviewer.md +5 -0
package/docs/readmes/features/skills/subagent-driven-development.md +5 -0
package/docs/readmes/features/skills/using-git-worktrees.md +5 -0
package/docs/readmes/features/skills/wazir.md +5 -0
package/docs/readmes/features/skills/writing-skills.md +5 -0
package/docs/readmes/features/workflows/prepare-next.md +1 -1
package/docs/reference/configuration-reference.md +47 -6
package/docs/reference/launch-checklist.md +4 -4
package/docs/reference/review-loop-pattern.md +538 -0
package/docs/reference/roles-reference.md +1 -0
package/docs/reference/skill-tiers.md +147 -0
package/docs/reference/tooling-cli.md +5 -1
package/docs/truth-claims.yaml +18 -0
package/expertise/antipatterns/process/ai-coding-antipatterns.md +97 -1
package/exports/hosts/claude/.claude/agents/clarifier.md +3 -0
package/exports/hosts/claude/.claude/agents/designer.md +3 -0
package/exports/hosts/claude/.claude/agents/executor.md +2 -0
package/exports/hosts/claude/.claude/agents/planner.md +3 -0
package/exports/hosts/claude/.claude/agents/researcher.md +2 -0
package/exports/hosts/claude/.claude/agents/reviewer.md +5 -1
package/exports/hosts/claude/.claude/agents/specifier.md +3 -0
package/exports/hosts/claude/.claude/commands/clarify.md +4 -0
package/exports/hosts/claude/.claude/commands/design-review.md +4 -0
package/exports/hosts/claude/.claude/commands/design.md +4 -0
package/exports/hosts/claude/.claude/commands/discover.md +4 -0
package/exports/hosts/claude/.claude/commands/execute.md +4 -0
package/exports/hosts/claude/.claude/commands/plan-review.md +4 -0
package/exports/hosts/claude/.claude/commands/plan.md +4 -0
package/exports/hosts/claude/.claude/commands/spec-challenge.md +4 -0
package/exports/hosts/claude/.claude/commands/specify.md +4 -0
package/exports/hosts/claude/.claude/commands/verify.md +4 -0
package/exports/hosts/claude/.claude/settings.json +9 -0
package/exports/hosts/claude/CLAUDE.md +1 -1
package/exports/hosts/claude/export.manifest.json +22 -20
package/exports/hosts/claude/host-package.json +3 -1
package/exports/hosts/codex/AGENTS.md +1 -1
package/exports/hosts/codex/export.manifest.json +22 -20
package/exports/hosts/codex/host-package.json +3 -1
package/exports/hosts/cursor/.cursor/hooks.json +4 -0
package/exports/hosts/cursor/.cursor/rules/wazir-core.mdc +1 -1
package/exports/hosts/cursor/export.manifest.json +22 -20
package/exports/hosts/cursor/host-package.json +3 -1
package/exports/hosts/gemini/GEMINI.md +1 -1
package/exports/hosts/gemini/export.manifest.json +22 -20
package/exports/hosts/gemini/host-package.json +3 -1
package/hooks/context-mode-router +191 -0
package/hooks/definitions/context_mode_router.yaml +19 -0
package/hooks/definitions/loop_cap_guard.yaml +1 -1
package/hooks/hooks.json +43 -0
package/hooks/protected-path-write-guard +8 -0
package/hooks/routing-matrix.json +45 -0
package/hooks/session-start +62 -1
package/llms-full.txt +905 -132
package/package.json +3 -3
package/roles/clarifier.md +3 -0
package/roles/designer.md +3 -0
package/roles/executor.md +2 -0
package/roles/planner.md +3 -0
package/roles/researcher.md +2 -0
package/roles/reviewer.md +5 -1
package/roles/specifier.md +3 -0
package/schemas/hook.schema.json +2 -1
package/schemas/phase-report.schema.json +80 -0
package/schemas/usage.schema.json +25 -1
package/schemas/wazir-manifest.schema.json +19 -0
package/skills/brainstorming/SKILL.md +20 -56
package/skills/clarifier/SKILL.md +243 -0
package/skills/claude-cli/SKILL.md +320 -0
package/skills/codex-cli/SKILL.md +260 -0
package/skills/debugging/SKILL.md +24 -1
package/skills/design/SKILL.md +13 -0
package/skills/dispatching-parallel-agents/SKILL.md +13 -0
package/skills/executing-plans/SKILL.md +28 -2
package/skills/executor/SKILL.md +129 -0
package/skills/finishing-a-development-branch/SKILL.md +13 -0
package/skills/gemini-cli/SKILL.md +260 -0
package/skills/humanize/SKILL.md +13 -0
package/skills/init-pipeline/SKILL.md +76 -78
package/skills/prepare-next/SKILL.md +81 -10
package/skills/receiving-code-review/SKILL.md +21 -0
package/skills/requesting-code-review/SKILL.md +38 -5
package/skills/reviewer/SKILL.md +423 -0
package/skills/run-audit/SKILL.md +13 -0
package/skills/scan-project/SKILL.md +13 -0
package/skills/self-audit/SKILL.md +197 -16
package/skills/subagent-driven-development/SKILL.md +38 -2
package/skills/subagent-driven-development/code-quality-reviewer-prompt.md +2 -0
package/skills/subagent-driven-development/implementer-prompt.md +8 -0
package/skills/subagent-driven-development/spec-reviewer-prompt.md +7 -0
package/skills/tdd/SKILL.md +21 -0
package/skills/using-git-worktrees/SKILL.md +13 -0
package/skills/using-skills/SKILL.md +13 -0
package/skills/verification/SKILL.md +13 -0
package/skills/wazir/SKILL.md +286 -262
package/skills/writing-plans/SKILL.md +44 -4
package/skills/writing-skills/SKILL.md +13 -0
package/templates/artifacts/implementation-plan.md +3 -0
package/templates/artifacts/tasks-template.md +133 -0
package/templates/examples/phase-report.example.json +48 -0
package/templates/examples/wazir-manifest.example.yaml +1 -1
package/tooling/src/adapters/composition-engine.js +256 -0
package/tooling/src/adapters/model-router.js +84 -0
package/tooling/src/capture/command.js +111 -2
package/tooling/src/capture/run-config.js +23 -0
package/tooling/src/capture/store.js +24 -0
package/tooling/src/capture/usage.js +106 -0
package/tooling/src/checks/ac-matrix.js +256 -0
package/tooling/src/checks/brand-truth.js +3 -6
package/tooling/src/checks/command-registry.js +13 -0
package/tooling/src/checks/docs-truth.js +1 -1
package/tooling/src/checks/runtime-surface.js +3 -7
package/tooling/src/checks/skills.js +111 -0
package/tooling/src/cli.js +17 -3
package/tooling/src/commands/stats.js +161 -0
package/tooling/src/commands/validate.js +5 -1
package/tooling/src/export/compiler.js +33 -37
package/tooling/src/gating/agent.js +145 -0
package/tooling/src/guards/phase-prerequisite-guard.js +127 -0
package/tooling/src/hooks/routing-logic.js +69 -0
package/tooling/src/init/auto-detect.js +260 -0
package/tooling/src/init/command.js +161 -0
package/tooling/src/input/scanner.js +46 -0
package/tooling/src/reports/command.js +103 -0
package/tooling/src/reports/phase-report.js +323 -0
package/tooling/src/state/command.js +160 -0
package/tooling/src/state/db.js +287 -0
package/tooling/src/status/command.js +53 -1
package/wazir.manifest.yaml +26 -17
package/workflows/clarify.md +4 -0
package/workflows/design-review.md +4 -0
package/workflows/design.md +4 -0
package/workflows/discover.md +4 -0
package/workflows/execute.md +4 -0
package/workflows/plan-review.md +4 -0
package/workflows/plan.md +4 -0
package/workflows/spec-challenge.md +4 -0
package/workflows/specify.md +4 -0
package/workflows/verify.md +4 -0

package/skills/wazir/SKILL.md CHANGED Viewed

@@ -9,6 +9,19 @@ The user typed `/wazir <their request>`. Run the entire pipeline end-to-end, han
 All questions use **numbered interactive options** — one question at a time, defaults marked "(Recommended)", wait for user response before proceeding.
+## Command Routing
+Follow the Canonical Command Matrix in `hooks/routing-matrix.json`.
+- Large commands (test runners, builds, diffs, dependency trees, linting) → context-mode tools
+- Small commands (git status, ls, pwd, wazir CLI) → native Bash
+- If context-mode unavailable, fall back to native Bash with warning
+## Codebase Exploration
+1. Query `wazir index search-symbols <query>` first
+2. Use `wazir recall file <path> --tier L1` for targeted reads
+3. Fall back to direct file reads ONLY for files identified by index queries
+4. Maximum 10 direct file reads without a justifying index query
+5. If no index exists: `wazir index build && wazir index summarize --tier all`
 ## Subcommand Detection
 Before anything else, check if the request starts with a known subcommand:
@@ -18,11 +31,24 @@ Before anything else, check if the request starts with a known subcommand:
 | `/wazir audit ...` | Jump to **Audit Mode** (see below) |
 | `/wazir prd [run-id]` | Jump to **PRD Mode** (see below) |
 | `/wazir init` | Invoke the `init-pipeline` skill directly, then stop |
-| Anything else | Continue to Step 1 (normal pipeline) |
+| Anything else | Continue to Phase 1 (Init) |
+---
+# 4-Phase Pipeline
+The pipeline has 4 phases. Each phase groups related workflows. Individual workflows within a phase can be enabled/disabled via `workflow_policy` in run-config.
+| Phase | Contains | Owner Skill | Key Output |
+|-------|----------|-------------|------------|
+| **Init** | Setup, prereqs, run directory, input scan | `wz:wazir` (inline) | `run-config.yaml` |
+| **Clarifier** | Research, clarify, specify, brainstorm, plan | `wz:clarifier` | Approved spec + design + plan |
+| **Executor** | Implement, verify | `wz:executor` | Code + verification proof |
+| **Final Review** | Review vs original input, learn, prepare next | `wz:reviewer` | Verdict + learnings + handoff |
 ---
-# Normal Pipeline Mode
+# Phase 1: Init
 ## Step 1: Capture the Request
@@ -37,13 +63,22 @@ If the user provided no text after `/wazir`, ask:
 Save their answer as the briefing, then continue.
+### Scan Input Directory
+Scan both `input/` (project-level) and `.wazir/input/` (state-level) for existing briefing materials. If files exist beyond `briefing.md`, list them:
+> **Found input files:**
+> - `input/2026-03-19-deferred-items.md`
+> - `.wazir/input/briefing.md`
+>
+> Using all found input as context for clarification.
 ### Inline Modifiers
-Parse the request for inline modifiers before the main text. These skip the corresponding interview question:
+Parse the request for inline modifiers before the main text:
 - `/wazir quick fix the login redirect` → depth = quick, intent = bugfix
 - `/wazir deep design a new onboarding flow` → depth = deep, intent = feature
-- `/wazir feature add CSV export` → intent = feature, depth = standard (default)
 Recognized modifiers:
 - **Depth:** `quick`, `deep` (standard is default when omitted)
@@ -61,233 +96,310 @@ Run `which wazir` to check if the CLI is installed.
 >
 > **How would you like to install it?**
 >
-> 1. **npm** (Recommended) — `npm install -g wazir`
+> 1. **npm** (Recommended) — `npm install -g @wazir-dev/cli`
 > 2. **Local link** — `npm link` from the Wazir project root
-> 3. **Skip** — Continue without the CLI (some features will be unavailable)
-If the user picks 1, run `npm install -g wazir` and verify with `wazir --version`.
-If the user picks 2, run `npm link` from the project root and verify.
-If the user picks 3, warn that `wazir capture`, `wazir validate`, and `wazir index` commands will not work, then continue.
+The CLI is **required** — the pipeline uses `wazir capture`, `wazir validate`, `wazir index`, and `wazir doctor` throughout execution.
+**If installed**, run `wazir doctor --json` to verify repo health. Stop if unhealthy.
+### Branch Check
+Run `wazir validate branches` to check the current git branch.
+- If on `main` or `develop`:
+  > You're on **[branch]**. The pipeline requires a feature branch.
+  >
+  > 1. **Create feat/<slug>** (Recommended) — branch from current
+  > 2. **Continue on [branch]** — not recommended for feature/refactor work
+### Index Check
-**If installed**, run `wazir doctor --json` to verify repo health.
+```bash
+INDEX_STATS=$(wazir index stats --json 2>/dev/null)
+FILE_COUNT=$(echo "$INDEX_STATS" | jq -r '.file_count // 0')
+if [ "$FILE_COUNT" -eq 0 ]; then
+  wazir index build && wazir index summarize --tier all
+else
+  wazir index refresh
+fi
+```
 ### Pipeline Init Check
 Check if `.wazir/state/config.json` exists.
-- **If missing** — invoke the `init-pipeline` skill. This will ask the user interactive questions to set up the config.
-- **If exists** — continue to Step 2.5.
+- **If missing** — invoke the `init-pipeline` skill.
+- **If exists** — continue.
-## Step 2.5: Create Run Directory
+## Step 3: Create Run Directory
 Generate a run ID using the current timestamp: `run-YYYYMMDD-HHMMSS`
 ```bash
-mkdir -p .wazir/runs/run-YYYYMMDD-HHMMSS/{sources,tasks,artifacts,reviews}
+mkdir -p .wazir/runs/run-YYYYMMDD-HHMMSS/{sources,tasks,artifacts,reviews,clarified}
 ln -sfn run-YYYYMMDD-HHMMSS .wazir/runs/latest
 ```
-If a previous completed run exists (check for a `completed_at` field in the previous `latest` run's `run-config.yaml`), record its `run_id` as `parent_run_id` in the new run's config.
+Initialize event capture:
+```bash
+wazir capture init --run <run-id> --phase init --status starting
+```
-## Step 3: Pre-Flight Configuration
+### Resume Detection
-Build the run configuration. Skip questions that were answered via inline modifiers.
+Check if a previous incomplete run exists (via `latest` symlink pointing to a run without `completed_at`).
-### Question 1: Depth (if not set via modifier)
+**If previous incomplete run found**, present:
-> **How thorough should this run be?**
+> **A previous incomplete run was detected:** `<previous-run-id>`
 >
-> 1. **Quick** — Minimal research, single-pass review, fast execution. Good for small fixes and config changes.
-> 2. **Standard** (Recommended) — Balanced research, multi-pass hardening, full review. Good for most features.
-> 3. **Deep** — Extended research, thorough hardening, strict review thresholds. Good for complex or security-critical work.
+> 1. **Resume** (Recommended) — continue from the last completed phase
+> 2. **Start fresh** — create a new empty run
-### Question 2: Intent (if not set via modifier and not obvious from the request)
+**If Resume:**
+- Copy `clarified/` from previous run into new run, EXCEPT `user-feedback.md`.
+- Detect last completed phase by checking which artifacts exist.
+- **Staleness check:** If input files are newer than copied artifacts, warn and offer to re-run clarification.
-Only ask this if the request is ambiguous. If the intent is clear from the text (e.g., "fix the bug" → bugfix), infer it and skip.
+## Step 4: Build Run Config
-> **What kind of work is this?**
->
-> 1. **Feature** (Recommended) — New functionality or enhancement
-> 2. **Bugfix** — Fix broken behavior
-> 3. **Refactor** — Restructure without changing behavior
-> 4. **Docs** — Documentation only
-> 5. **Spike** — Research and exploration, no production code
+**No questions asked.** Depth, intent, and mode are all inferred or defaulted.
-### Question 3: Agent Teams (conditional)
+### Intent Inference
-Only ask this if ALL of these are true:
-- The host is Claude Code (not Codex/Gemini/Cursor)
-- Depth is `standard` or `deep`
-- Intent is `feature` or `refactor` (not bugfix/docs/spike)
+Infer intent from the request text using keyword matching:
-> **Would you like to use Agent Teams for parallel execution?**
->
-> 1. **No** (Recommended) — Tasks run sequentially. Predictable, lower cost.
-> 2. **Yes** — Spawns parallel teammates for independent tasks. Potentially faster and richer output.
->
-> *Agent Teams is experimental from Claude's side. Requires Opus model. Higher token consumption.*
+| Keywords in request | Inferred Intent |
+|-------------------|-----------------|
+| fix, bug, broken, crash, error, issue, wrong | `bugfix` |
+| refactor, clean, restructure, reorganize, rename, simplify | `refactor` |
+| doc, document, readme, guide, explain | `docs` |
+| research, spike, explore, investigate, prototype | `spike` |
+| (anything else) | `feature` |
+Depth defaults to `standard`. Override only via inline modifiers (`/wazir quick ...`, `/wazir deep ...`).
 ### Write Run Config
-Save all decisions to `.wazir/runs/<run-id>/run-config.yaml`:
+Save to `.wazir/runs/<run-id>/run-config.yaml`:
 ```yaml
-# Identity
-run_id: run-20260317-143000
-parent_run_id: null                    # set if resuming/continuing from a prior run
-continuation_reason: null              # e.g. "review found minor fixes"
+run_id: run-YYYYMMDD-HHMMSS
+parent_run_id: null
+continuation_reason: null
-# User request
 request: "the original user request"
-request_summary: "short summary of intent"
-parsed_intent: feature                 # feature | bugfix | refactor | docs | spike
-entry_point: "/wazir"                  # how the user entered the pipeline
-# Configuration
-depth: standard                        # quick | standard | deep
-team_mode: sequential                  # sequential | parallel
-parallel_backend: none                 # none | claude_teams (future: subagents, worktrees)
-# Phase policy (system-decided, not user-facing)
-phase_policy:
-  discover: { enabled: true, reason: "feature intent requires research" }
-  design: { enabled: true, reason: "new UI component" }
-  spec-challenge: { enabled: true, passes: 2, reason: "standard depth" }
-  author: { enabled: false, reason: "no i18n or seed data needed" }
-  plan-review: { enabled: true, passes: 1 }
-# Research
-research_topics: []                    # populated by researcher phase
-# Timestamps
-created_at: 2026-03-17T14:30:00Z
+request_summary: "short summary"
+parsed_intent: feature
+entry_point: "/wazir"
+depth: standard
+team_mode: sequential
+parallel_backend: none
+# Workflow policy — individual workflows within each phase
+workflow_policy:
+  # Clarifier phase workflows
+  discover:       { enabled: true, loop_cap: 10 }
+  clarify:        { enabled: true, loop_cap: 10 }
+  specify:        { enabled: true, loop_cap: 10 }
+  spec-challenge: { enabled: true, loop_cap: 10 }
+  author:         { enabled: false, loop_cap: 10 }
+  design:         { enabled: true, loop_cap: 10 }
+  design-review:  { enabled: true, loop_cap: 10 }
+  plan:           { enabled: true, loop_cap: 10 }
+  plan-review:    { enabled: true, loop_cap: 10 }
+  # Executor phase workflows
+  execute:        { enabled: true, loop_cap: 10 }
+  verify:         { enabled: true, loop_cap: 5 }
+  # Final Review phase workflows
+  review:         { enabled: true, loop_cap: 10 }
+  learn:          { enabled: true, loop_cap: 5 }
+  prepare_next:   { enabled: true, loop_cap: 5 }
+  run_audit:      { enabled: false, loop_cap: 10 }
+research_topics: []
+created_at: "YYYY-MM-DDTHH:MM:SSZ"
 completed_at: null
 ```
-Mutable execution state (current phase, task progress, error counts) lives in `.wazir/runs/<run-id>/status.json`, NOT in `run-config.yaml`. The run config captures setup decisions only.
-### Phase Policy
+### Workflow Skip Rules
-Map intent + depth to applicable phases. The system decides — the user does NOT pick phases.
+Map intent + depth to applicable workflows. The system decides — the user does NOT pick.
-**Phase classes:**
-| Class | Phases | Rules |
-|-------|--------|-------|
-| **Core** (always run) | `clarify`, `verify`, `review` | Never skipped |
+| Class | Workflows | Rules |
+|-------|-----------|-------|
+| **Core** (always run) | `clarify`, `execute`, `verify`, `review` | Never skipped |
 | **Adaptive** (run when evidence says so) | `discover`, `design`, `author`, `specify` | Skipped for bugfix/docs/spike at quick depth |
-| **Scale** (intensity varies) | `spec-challenge`, `plan-review`, `design-review` | Single-pass at quick, multi-pass at deep |
-Log skip decisions to the run's `run-config.yaml` with reasons:
+| **Scale** (intensity varies) | `spec-challenge`, `plan-review`, `design-review` | Loop cap controls iteration depth |
+| **Post-run** (always run) | `learn`, `prepare_next` | Part of Final Review phase |
-```yaml
-phase_policy:
-  discover: { enabled: true }
-  design: { enabled: false, reason: "bugfix intent — no design needed" }
-  spec-challenge: { enabled: true, passes: 1, reason: "quick depth" }
-```
+Log skip decisions with reasons in `workflow_policy`.
 ### Confidence Gate
-After building the run config, evaluate confidence:
+After building run config:
-- **High confidence** (clear intent, depth set, no ambiguity) — show a one-line summary and proceed:
-  > **Running: standard depth, feature, sequential. 11 of 14 phases. Proceeding...**
+- **High confidence** — one-line summary and proceed:
+  > **Running: standard depth, feature, sequential. Proceeding...**
-- **Low confidence** (ambiguous intent, unclear scope) — show the full plan and ask:
-  > **Here's the run plan:**
-  > - Depth: standard
-  > - Intent: feature
-  > - Phases: [list enabled phases]
-  > - Skipped: [list skipped with reasons]
-  >
+- **Low confidence** — show plan and ask:
   > **Does this look right?**
   > 1. **Yes, proceed** (Recommended)
   > 2. **No, let me adjust**
-## Step 4: Run Clarifier
-### Source Capture
-Before invoking the clarifier, instruct the researcher to capture all referenced sources locally:
-- Fetch all URLs referenced in `.wazir/input/` briefing files
-- Save fetched content to `.wazir/runs/<run-id>/sources/`
-- Name files as `src-NNN-<slug>.md` (fetched content) or `src-NNN-fetch-failed.json` (failures)
-- Create `.wazir/runs/<run-id>/sources/manifest.json` indexing all captures:
-```json
-[
-  {
-    "id": "src-001",
-    "origin_url": "https://...",
-    "fetch_time": "2026-03-17T14:30:00Z",
-    "content_hash": "sha256:abc...",
-    "status": "captured",
-    "local_path": "src-001-github-readme.md"
-  },
-  {
-    "id": "src-002",
-    "origin_url": "https://...",
-    "status": "failed",
-    "error": "403 Forbidden",
-    "fetch_time": "2026-03-17T14:30:01Z"
-  }
-]
+```bash
+wazir capture event --run <run-id> --event phase_exit --phase init --status completed
 ```
-Research briefs produced by the researcher must reference local paths (`sources/src-001-...`) instead of live URLs. The original URL is preserved in the manifest for provenance. Failures are recorded explicitly — never silently skipped.
+---
+# Phase 2: Clarifier
+```bash
+wazir capture event --run <run-id> --event phase_enter --phase clarifier --status in_progress
+```
+Invoke the `wz:clarifier` skill. It handles all sub-workflows internally:
+1. **Source Capture** — fetch URLs from input
+2. **Research** (discover workflow) — codebase + external research
+3. **Clarify** (clarify workflow) — scope, constraints, assumptions
+4. **Spec Harden** (specify + spec-challenge workflows) — measurable spec
+5. **Brainstorm** (design + design-review workflows) — design approaches
+6. **Plan** (plan + plan-review workflows) — execution plan
+Each sub-workflow has its own review loop. User checkpoints between major steps.
+### Scope Invariant
+**Hard rule:** `items_in_plan >= items_in_input` unless the user explicitly approves scope reduction. The clarifier MUST NOT autonomously tier, defer, or drop items from the user's input. It can suggest prioritization, but the decision belongs to the user.
+Output: approved spec + design + execution plan in `.wazir/runs/latest/clarified/`.
+```bash
+wazir capture event --run <run-id> --event phase_exit --phase clarifier --status completed
+```
+---
-### Clarifier Invocation
+# Phase 3: Executor
-Invoke the `clarifier` skill.
+## Phase Gate (Hard Gate)
-This runs the full Phase 0 + Phase 1 pipeline:
-- Phase 0: Research (autonomous — skipped if depth=quick and intent=bugfix)
-- Phase 1A: Clarify (autonomous)
-- Phase 1A+: Spec Harden (passes determined by depth)
-- Phase 1B: Brainstorm (interactive — **will pause for user approval**. If `team_mode: parallel`, uses structured dialogue with Free Thinker + Grounder + Synthesizer agents)
-- Phase 1C: Plan (task generation)
+Before entering the Executor phase, verify ALL clarifier artifacts exist:
-**Resume detection:** If `.wazir/runs/latest/clarified/` already has task specs and an execution plan, ask:
+- [ ] `.wazir/runs/latest/clarified/clarification.md`
+- [ ] `.wazir/runs/latest/clarified/spec-hardened.md`
+- [ ] `.wazir/runs/latest/clarified/design.md`
+- [ ] `.wazir/runs/latest/clarified/execution-plan.md`
-> **Clarification was already completed. What would you like to do?**
+If ANY file is missing, **STOP**:
+> **Cannot enter Executor phase: missing prerequisite artifacts from Clarifier.**
+>
+> Missing: [list missing files]
 >
-> 1. **Skip to execution** (Recommended) — Use existing task specs
-> 2. **Re-run clarifier** — Start fresh
+> The Clarifier phase must complete before execution can begin. Run `/wazir:clarifier` first.
+**Do NOT skip this check. Do NOT rationalize that the input is "clear enough" to bypass clarification. Every pipeline run must produce these artifacts.**
+```bash
+wazir capture event --run <run-id> --event phase_enter --phase executor --status in_progress
+```
+**Pre-execution gate:**
+```bash
+wazir validate manifest && wazir validate hooks
+# Hard gate — stop if either fails.
+```
+Invoke the `wz:executor` skill. It handles:
+1. **Execute** (execute workflow) — per-task TDD cycle with review before each commit
+2. **Verify** (verify workflow) — deterministic verification of all claims
+Per-task review: `--mode task-review`, 5 task-execution dimensions.
+Tasks always run sequentially.
+Output: code changes + verification proof in `.wazir/runs/latest/artifacts/`.
+```bash
+wazir capture event --run <run-id> --event phase_exit --phase executor --status completed
+```
+---
-## Step 5: Run Executor
+# Phase 4: Final Review
-Invoke the `executor` skill.
+## Phase Gate (Hard Gate)
-This runs Phase 2: autonomous execution with the composition engine, TDD, and quality gates.
+Before entering the Final Review phase, verify the Executor produced its proof:
-If `team_mode: parallel` in run-config, the executor spawns Agent Teams for independent tasks. Otherwise, tasks run sequentially.
+- [ ] `.wazir/runs/latest/artifacts/verification-proof.md`
-**Resume detection:** If `.wazir/runs/latest/artifacts/` has completed artifacts, ask:
+If missing, **STOP**:
-> **Some tasks are already completed. What would you like to do?**
+> **Cannot enter Final Review: missing verification proof from Executor.**
 >
-> 1. **Resume** (Recommended) — Continue from where it left off
-> 2. **Start fresh** — Re-run all tasks from scratch
+> The Executor phase must complete and produce `verification-proof.md` before final review. Run `/wazir:executor` first.
+```bash
+wazir capture event --run <run-id> --event phase_enter --phase final_review --status in_progress
+```
+This phase validates the implementation against the **ORIGINAL INPUT** (not the task specs — the executor's per-task reviewer already covered that).
+### 4a: Review (reviewer role in final mode)
+Invoke `wz:reviewer --mode final`.
+7-dimension scored review comparing implementation against the original user input.
+Score 0-70. Verdicts: PASS (56+), NEEDS MINOR FIXES (42-55), NEEDS REWORK (28-41), FAIL (0-27).
+### 4b: Learn (learner role)
+Extract durable learnings from the completed run:
+- Scan all review findings (internal + Codex)
+- Propose learnings to `memory/learnings/proposed/`
+- Findings that recur across 2+ runs → auto-proposed as learnings
+- Learnings require explicit scope tags (roles, stacks, concerns)
+### 4c: Prepare Next (planner role)
+Prepare context and handoff for the next run:
+- Write handoff document
+- Compress/archive unneeded files
+- Record what's left to do
+```bash
+wazir capture event --run <run-id> --event phase_exit --phase final_review --status completed
+```
+---
+## Step 5: CHANGELOG + Gitflow Validation (Hard Gates)
-## Step 6: Run Reviewer
+Before presenting results:
-Invoke the `reviewer` skill.
+```bash
+wazir validate changelog --require-entries --base main
+wazir validate commits --base main
+```
-This runs Phase 3: final scoring across 7 dimensions, produces a verdict.
+Both must pass before PR. These are not warnings.
-## Step 7: Present Results
+## Step 6: Present Results
-After the reviewer completes, present the verdict and offer next steps with numbered options:
+After the reviewer completes, present verdict with numbered options:
 ### If PASS (score 56+):
 > **Result: PASS (score/70)**
 >
-> [score breakdown]
->
-> **What would you like to do?**
 > 1. **Create a PR** (Recommended)
 > 2. **Merge directly**
 > 3. **Review the changes first**
@@ -296,9 +408,6 @@ After the reviewer completes, present the verdict and offer next steps with numb
 > **Result: NEEDS MINOR FIXES (score/70)**
 >
-> [findings list]
->
-> **What would you like to do?**
 > 1. **Auto-fix and re-review** (Recommended)
 > 2. **Fix manually**
 > 3. **Accept as-is**
@@ -307,9 +416,6 @@ After the reviewer completes, present the verdict and offer next steps with numb
 > **Result: NEEDS REWORK (score/70)**
 >
-> [findings list with affected tasks]
->
-> **What would you like to do?**
 > 1. **Re-run affected tasks** (Recommended)
 > 2. **Review findings in detail**
 > 3. **Abandon this run**
@@ -318,53 +424,42 @@ After the reviewer completes, present the verdict and offer next steps with numb
 > **Result: FAIL (score/70)**
 >
-> [full findings]
->
-> Something fundamental went wrong. Review the findings above and decide how to proceed.
+> Something fundamental went wrong. Review the findings above.
-## Error Handling
+### Run Summary
+```bash
+wazir capture summary --run <run-id>
+wazir status --run <run-id> --json
+```
-If any phase fails or the user cancels:
+## Error Handling
-1. Report which phase failed and why
-2. Present recovery options:
+If any phase fails:
 > **Phase [name] failed: [reason]**
 >
-> **What would you like to do?**
 > 1. **Retry this phase** (Recommended)
-> 2. **Skip and continue** (only if phase is adaptive, not core)
+> 2. **Skip and continue** (only if workflows within phase are adaptive)
 > 3. **Abort the run**
-The run config persists, so running `/wazir` again will detect the partial state and offer to resume.
 ---
 # Audit Mode
 Triggered by `/wazir audit` or `/wazir audit <focus>`.
-Runs a structured codebase audit. Invokes the `run-audit` skill with the interactive question flow.
+Runs a structured codebase audit. Invokes the `run-audit` skill.
-## Inline Audit Modifiers
+Parse inline audit types: `/wazir audit security` → skip Question 1.
-Parse for known audit types after `audit`:
+After audit:
-- `/wazir audit security` → audit type = security, skip Question 1
-- `/wazir audit deps` → audit type = dependencies, skip Question 1
-- `/wazir audit` → ask Question 1
-Then let the `run-audit` skill handle the rest (scope, output mode). All its questions already follow the interactive numbered pattern.
-After the audit completes:
-> **Audit complete. What would you like to do?**
->
 > 1. **Review the findings** (Recommended)
-> 2. **Generate a fix plan** — turn findings into implementation tasks
-> 3. **Run the pipeline on the fix plan** — generate plan, then execute and review fixes
+> 2. **Generate a fix plan**
+> 3. **Run the pipeline on the fix plan**
-If the user picks option 3, save the findings as the briefing and run the normal pipeline (Steps 3-7) with intent = `bugfix`.
+If option 3, save findings as briefing and run pipeline with intent = `bugfix`.
 ---
@@ -372,79 +467,10 @@ If the user picks option 3, save the findings as the briefing and run the normal
 Triggered by `/wazir prd` or `/wazir prd <run-id>`.
-Generates a Product Requirements Document from a completed pipeline run.
-## Pre-Flight
-1. If a `<run-id>` was provided, use that run's directory. Otherwise, use `.wazir/runs/latest`.
-2. Verify the run has completed artifacts:
-   - Design doc in the run's tasks or in `docs/plans/`
-   - Task specs in the run's `clarified/`
-   - Review results in the run's `reviews/` (if available)
-3. If the run is incomplete or has no artifacts:
-> **No completed run found. Run `/wazir <your request>` first to create a pipeline run, then use `/wazir prd` to generate the PRD.**
-## Inputs (read-only)
-Read these artifacts from the completed run:
-- Approved design document
-- Task specs (all `spec.md` files in `clarified/`)
-- Execution plan
-- Review results and verification proofs (if available)
-- Run config (for context on depth, intent, decisions)
-## Output
-Generate a PRD and save to `docs/prd/YYYY-MM-DD-<topic>-prd.md`.
-### PRD Template
-```markdown
-# Product Requirements Document — <Topic>
-**Generated from run:** `<run-id>`
-**Date:** YYYY-MM-DD
-## Vision & Core Thesis
-[1-2 paragraphs synthesized from the design document's core approach]
+Generates a PRD from a completed run. Reads approved design, task specs, execution plan, review results. Saves to `docs/prd/YYYY-MM-DD-<topic>-prd.md`.
-## What We're Building
+After generation:
-### Feature Area 1: <name>
-**What:** [description from task specs]
-**Why:** [rationale from design doc]
-**Requirements:**
-- [ ] [from task spec acceptance criteria]
-- [ ] ...
-### Feature Area 2: <name>
-...
-## Success Criteria
-[From review results and verification proofs — what was tested and confirmed]
-## Technical Constraints
-[From architecture decisions, run config, and design trade-offs]
-## What's NOT in Scope
-[From design doc's rejected alternatives and explicit exclusions]
-## Open Questions
-[From design doc's open questions and review findings]
-```
-## After Generation
-> **PRD generated at `docs/prd/YYYY-MM-DD-<topic>-prd.md`.**
->
-> **What would you like to do?**
 > 1. **Review the PRD** (Recommended)
 > 2. **Commit it**
 > 3. **Edit before committing**
@@ -453,8 +479,6 @@ Generate a PRD and save to `docs/prd/YYYY-MM-DD-<topic>-prd.md`.
 ## Interaction Rules
-These rules apply to ALL questions in the pipeline, including those asked by sub-skills (clarifier, executor, reviewer) and audit modes:
 - **One question at a time** — never combine multiple questions
 - **Numbered options** — always present choices as numbered lists
 - **Mark defaults** — always show "(Recommended)" on the suggested option