npm - claude-raid - Versions diffs - 0.2.4 → 0.2.6 - Mend

claude-raid 0.2.4 → 0.2.6

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (2) hide show

package/README.md +280 -252
package/package.json +1 -1

package/README.md CHANGED Viewed

@@ -1,160 +1,246 @@
-# claude-raid
-```ansi
-[33m  ⚔ ═══════════════════════════════════════════════════════ ⚔[0m
-[1;33m      ██████╗██╗      █████╗ ██╗   ██╗██████╗ ███████╗[0m
-[1;33m     ██╔════╝██║     ██╔══██╗██║   ██║██╔══██╗██╔════╝[0m
-[33m     ██║     ██║     ███████║██║   ██║██║  ██║█████╗  [0m
-[33m     ██║     ██║     ██╔══██║██║   ██║██║  ██║██╔══╝  [0m
-[33m     ╚██████╗███████╗██║  ██║╚██████╔╝██████╔╝███████╗[0m
-[1;31m      ╚═════╝╚══════╝╚═╝  ╚═╝ ╚═════╝ ╚═════╝ ╚══════╝[0m
-[1;31m              ██████╗  █████╗ ██╗██████╗ [0m
-[1;31m              ██╔══██╗██╔══██╗██║██╔══██╗[0m
-[31m              ██████╔╝███████║██║██║  ██║[0m
-[31m              ██╔══██╗██╔══██║██║██║  ██║[0m
-[31m              ██║  ██║██║  ██║██║██████╔╝[0m
-[2;31m              ╚═╝  ╚═╝╚═╝  ╚═╝╚═╝╚═════╝ [0m
-[90m      Adversarial multi-agent development for Claude Code[0m
-[33m  ⚔ ═══════════════════════════════════════════════════════ ⚔[0m
+<div align="center">
+```
+   ⚔ ══════════════════════════════════════════════════════════ ⚔
+       ██████╗██╗      █████╗ ██╗   ██╗██████╗ ███████╗
+      ██╔════╝██║     ██╔══██╗██║   ██║██╔══██╗██╔════╝
+      ██║     ██║     ███████║██║   ██║██║  ██║█████╗
+      ██║     ██║     ██╔══██║██║   ██║██║  ██║██╔══╝
+      ╚██████╗███████╗██║  ██║╚██████╔╝██████╔╝███████╗
+       ╚═════╝╚══════╝╚═╝  ╚═╝ ╚═════╝ ╚═════╝ ╚══════╝
+               ██████╗  █████╗ ██╗██████╗
+               ██╔══██╗██╔══██╗██║██╔══██╗
+               ██████╔╝███████║██║██║  ██║
+               ██╔══██╗██╔══██║██║██║  ██║
+               ██║  ██║██║  ██║██║██████╔╝
+               ╚═╝  ╚═╝╚═╝  ╚═╝╚═╝╚═════╝
+      Round-based Adversarial Intelligence Dungeon
+                   for Claude Code
+   ⚔ ══════════════════════════════════════════════════════════ ⚔
 ```
-Four AI agents work through a strict 4-phase workflow where every design decision, implementation choice, and code review is stress-tested by competing perspectives before it ships.
+**Four AI agents. One shared dungeon. Every decision stress-tested before it ships.**
+[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)
+[![Zero Dependencies](https://img.shields.io/badge/dependencies-0-brightgreen.svg)](#)
+[![Node.js 18+](https://img.shields.io/badge/node-18%2B-blue.svg)](#prerequisites)
+[![294 Tests](https://img.shields.io/badge/tests-294_passing-brightgreen.svg)](#)
+[Quick Start](#quick-start) · [The Canonical Quest](#the-canonical-quest) · [The Party](#the-party) · [The Dungeon](#the-dungeon) · [Configuration](#configuration) · [CLI Reference](#cli-reference)
+</div>
+---
+## What is Claude Raid?
+Claude Raid turns a single Claude Code session into a **4-agent adversarial team** that designs, plans, builds, reviews, and ships code through structured phases. Instead of one AI guessing at a solution, four agents challenge each other's work until only battle-tested decisions survive.
-Built for [Claude Code](https://claude.ai/claude-code). Zero dependencies. One command to install.
+The **Wizard** orchestrates. The **Warrior**, **Archer**, and **Rogue** each bring a different lens — stress tolerance, pattern coherence, and assumption destruction. They work in rounds, pin verified findings to a shared **Dungeon**, and no phase closes until the Wizard rules.
-Adapted from [obra/superpowers](https://github.com/obra/superpowers) by Jesse Vincent.
+One command installs the system. One command starts a quest.
 ---
 ## Quick Start
 ```bash
-# Install into any project
 npx claude-raid summon
+```
+That's it. One command installs agents, hooks, skills, and config into your project's `.claude/` directory.
-# Preview what gets installed (no changes made)
+```bash
+# Preview what gets installed (no changes)
 npx claude-raid summon --dry-run
-# Start a tmux session, then start the Raid
-tmux new-session -s raid
-claude --agent wizard
+# Start a quest
+claude-raid start
 ```
-Each agent gets its own tmux pane. You can click into any pane to talk directly to that agent. Describe your task and the Wizard takes over.
+The Wizard greets you, you describe your task, and the quest begins.
 ### Prerequisites
-| Requirement | Why | Auto-configured? |
-|---|---|---|
-| **Node.js** 18+ | Runs the installer | No |
-| **Claude Code** v2.1.32+ | Agent teams support | No |
-| **tmux** | Multi-pane agent display | No — `brew install tmux` |
-| **jq** | Hook config parsing | Pre-installed on macOS |
-| **teammateMode** | Set to `tmux` in `~/.claude.json` | Yes (wizard prompts) |
+| Requirement | Why |
+|:--|:--|
+| **Node.js** 18+ | Runs the CLI |
+| **Claude Code** v2.1.32+ | Agent teams support |
+| **tmux** | Multi-pane agent display (`brew install tmux`) |
+| **jq** | Config parsing (pre-installed on macOS) |
-**tmux is required for the multi-pane experience.** Each agent runs in its own pane so you can observe and interact with them independently. Without tmux, agents run in-process (single pane, cycle with Shift+Down).
+> **tmux** gives each agent its own pane — click into any pane to observe or talk to that agent directly. Without tmux, agents run in-process and you cycle between them with `Shift+Down`.
-The setup wizard checks all of these during `summon` and offers to fix what it can.
+The setup wizard checks all prerequisites during `summon` and offers to fix what it can.
 ---
-## How It Works
-You describe a task. The Wizard assesses complexity, recommends a mode, and opens the Dungeon — a shared knowledge artifact where agents pin verified findings throughout each phase.
+## The Canonical Quest
+The Canonical Quest is a 6-phase development cycle. Every feature, refactor, or system built through the Raid follows this sequence.
+```mermaid
+flowchart LR
+    PRD["Phase 1\nPRD"]
+    Design["Phase 2\nDesign"]
+    Plan["Phase 3\nPlan"]
+    Impl["Phase 4\nImplementation"]
+    Review["Phase 5\nReview"]
+    Wrap["Phase 6\nWrap Up"]
+    PRD -->|"optional"| Design
+    Design --> Plan
+    Plan --> Impl
+    Impl --> Review
+    Review -->|"optional"| Wrap
+    Impl -->|"skip review"| Wrap
+    style PRD fill:#6c5ce7,color:#fff
+    style Design fill:#6c5ce7,color:#fff
+    style Plan fill:#6c5ce7,color:#fff
+    style Impl fill:#e17055,color:#fff
+    style Review fill:#00b894,color:#fff
+    style Wrap fill:#fdcb6e,color:#000
 ```
-Phase 1: DESIGN          Agents explore the problem from competing angles.
-                          Challenge each other's findings. Pin what survives.
-                          Wizard closes when the design is battle-tested.
-Phase 2: PLAN             Decompose the design into testable tasks.
-                          Fight over naming, ordering, coverage, compliance.
-                          Pin the agreed task list.
+### Phase 1 — PRD *(optional)*
-Phase 3: IMPLEMENTATION   One agent builds each task (TDD enforced).
-                          The others attack the implementation directly.
-                          Every task earns approval before moving on.
+Agents research the problem space and produce a complete Product Requirements Document. No code. The Wizard mediates questions between agents and the human.
-Phase 4: REVIEW           Independent reviews against design and plan.
-                          Agents fight over findings AND missing findings.
-                          Critical and Important issues must be fixed.
+**Output:** `phase-1-prd.md`
-         FINISHING        Agents debate completeness. Wizard presents
-                          merge options: merge, PR, keep branch, or discard.
-```
+### Phase 2 — Design
-No phase is skipped. No work passes unchallenged.
+Agents explore design approaches from competing angles. Each brings their lens — the Warrior stress-tests architecture choices, the Archer traces ripple effects, the Rogue attacks assumptions. The design survives only if it withstands all three.
----
+**Output:** `phase-2-design.md` with mermaid diagrams and battle-tested decisions
-## The Team
+### Phase 3 — Plan
-### Wizard — Dungeon Master
+Agents decompose the approved design into discrete, testable tasks. They fight over ordering, scope boundaries, naming, and test coverage until the plan earns consensus.
-Thinks 5 times before speaking. Opens phases, dispatches the team with precise angles, observes silently (90% of the time), and closes each phase with a binding ruling citing Dungeon evidence. The Wizard doesn't write code — it ensures the process produces the best possible outcome.
+**Output:** `phase-3-plan.md` + individual task files (`phase-3-plan-task-01.md`, etc.)
-### Warrior — Stress Tester
+### Phase 4 — Implementation
-Rips things apart. Race conditions, null input, scale, memory pressure — nothing passes unchecked. Demands evidence, proposes counter-examples, and pushes until things break. Concedes instantly when proven wrong.
+The Wizard assigns tasks in batches. One agent builds each task using strict **TDD** (RED-GREEN-REFACTOR). The others cross-test the implementation — reading code, running tests, and challenging decisions. Every task earns approval before the next batch starts.
-### Archer — Pattern Seeker
+**Output:** `phase-4-implementation.md` + committed, tested code
-Finds what brute force misses. Naming mismatches, violated conventions, design drift. Traces ripple effects — changing X in module A silently breaks Y in module C through an implicit contract in Z. Every finding includes the exact location and the exact consequence.
+### Phase 5 — Review *(optional)*
-### Rogue — Assumption Destroyer
+Two sub-phases: **Pinning** (find issues) and **Fixing** (resolve them). Agents review independently, then fight over findings *and* missing findings. Critical and Important issues must be fixed. The **Black Card** system handles breaking architectural concerns.
-Thinks like a malicious user, a failing network, a corrupted database, a race condition at 3 AM. Constructs the exact sequence of events that turns a minor oversight into a critical failure. Every finding is a concrete attack scenario, not a theoretical concern.
+**Output:** `phase-5-review.md`
----
+### Phase 6 — Wrap Up
-## Modes
+The Wizard generates a quest storyboard summarizing what was built and why. Creates the PR, archives the dungeon to the vault, and dismisses the party.
-Not every task needs all four agents. The Wizard recommends a mode based on complexity, or you can override directly.
+**Output:** `phase-6-wrap-up.md` + PR + vault archive
-| | Full Raid | Skirmish | Scout |
-|---|---|---|---|
-| **Agents** | 3 + Wizard | 2 + Wizard | 1 + Wizard |
-| **Design** | Full adversarial | Lightweight | Inline |
-| **Planning** | Full adversarial | Merged with design | Inline |
-| **Implementation** | 1 builds, 2 attack | 1 builds, 1 attacks | 1 builds, Wizard reviews |
-| **Review** | 3 independent reviews | 1 review + Wizard | Wizard only |
-| **TDD** | Enforced | Enforced | Enforced |
+---
+## The Party
+Four agents, each with a distinct lens. They collaborate through rigor, not agreement.
+<table>
+<tr>
+<td width="25%" align="center"><h3>Wizard</h3><em>Dungeon Master</em></td>
+<td width="25%" align="center"><h3>Warrior</h3><em>Stress Tester</em></td>
+<td width="25%" align="center"><h3>Archer</h3><em>Pattern Seeker</em></td>
+<td width="25%" align="center"><h3>Rogue</h3><em>Assumption Destroyer</em></td>
+</tr>
+<tr>
+<td valign="top">Thinks 5 times before speaking. Opens phases, dispatches the team, observes, and closes with binding rulings. The bridge between agents, dungeon, and human. <strong>Never writes code.</strong></td>
+<td valign="top">Does this hold under pressure? Tests boundaries, load, edge cases, and failure modes. Brings the exact scenario that breaks it — not just "this is wrong."</td>
+<td valign="top">Does this fit? Traces how changes ripple through the system. Catches naming drift, contract violations, and implicit dependencies that break silently.</td>
+<td valign="top">What did everyone assume that isn't guaranteed? Thinks like a failing system, a malicious input, a race condition. Constructs the attack sequence that turns oversight into failure.</td>
+</tr>
+</table>
+### How They Work Together
+```mermaid
+sequenceDiagram
+    participant W as Wizard
+    participant Party as Warrior / Archer / Rogue
+    W->>Party: Dispatch tasks + angles
+    Note over Party: Parallel independent work
+    Party->>W: ROUND_COMPLETE
+    W->>Party: Cross-test each other's work
+    Note over Party: Challenge, verify, concede or escalate
+    Party->>W: Findings + evidence
+    W->>W: Pin verified findings to Dungeon
+    Note over W: Next round or phase close
+```
-**When to use each:**
+**Round-based, not real-time.** Agents work independently, flag completion, then cross-test. No mid-thinking interruptions. Each message carries evidence and a conclusion. Converge in 2-3 exchanges per finding — escalate to the Wizard after 3.
-- **Full Raid** — Architecture decisions, security-critical code, major refactors, cross-layer changes
-- **Skirmish** — Medium features, non-trivial bugfixes, multi-file changes
-- **Scout** — Config changes, documentation, single-file fixes
+### Seven Pillars
-Override the Wizard's recommendation: *"Full Raid this"*, *"Skirmish this bugfix"*, *"Scout this"*.
+Every agent, every phase, every interaction:
-The Wizard can escalate mid-task (Scout to Skirmish, Skirmish to Full Raid) with your approval. It cannot de-escalate without asking.
+1. **Intellectual Honesty** — Every claim backed by evidence gathered this turn. No guessing.
+2. **Zero Ego** — Concede instantly when proven wrong. A teammate catching your mistake is a gift.
+3. **Discipline** — Every interaction carries work forward. If you're not adding information, stop talking.
+4. **Round-Based Interaction** — Turn-based work, flag completion, cross-test on dispatch.
+5. **Question Chain** — Agents never ask the human directly. All questions flow through the Wizard.
+6. **Phase Spoils** — Every phase produces a detailed markdown artifact. No exceptions.
+7. **Black Cards** — Architecture-breaking findings that cannot be fixed within the current design. Escalated to the human with rollback options.
 ---
 ## The Dungeon
-The Dungeon (`.claude/raid-dungeon.md`) is the team's shared knowledge artifact — a curated board where agents pin verified findings during each phase.
+The Dungeon is the team's shared knowledge artifact — a curated board where agents pin verified findings during each phase.
+| Concept | What It Means |
+|:--|:--|
+| **Quest** | A complete session — from greeting to PR |
+| **Dungeon** | The quest's artifact directory: phase files, task files, findings |
+| **Vault** | Archive of completed quests for institutional memory |
+| **Phase Spoils** | Mandatory output of each phase: a detailed markdown report |
+| **Black Card** | A finding that fundamentally breaks the architecture — requires human decision |
+| **Pin** | A verified finding that survived challenge from 2+ agents |
-**What goes in:** Findings that survived challenge from 2+ agents, active unresolved battles, key decisions, escalation points.
+### Dungeon Filesystem
-**What stays in conversation:** The back-and-forth of challenges and roasts, exploratory thinking, concessions. The conversation is the sparring ring. The Dungeon is the scoreboard.
+```
+.claude/dungeon/{quest-slug}/          # Active quest artifacts
+├── phase-1-prd.md                     # PRD (optional)
+├── phase-2-design.md                  # Battle-tested design
+├── phase-3-plan.md                    # Task index
+├── phase-3-plan-task-01.md            # Individual task specs
+├── phase-4-implementation.md          # Implementation log
+├── phase-5-review.md                  # Review board (optional)
+└── phase-6-wrap-up.md                 # Quest storyboard
+.claude/vault/{quest-slug}/            # Archived completed quests
+.claude/raid-session                   # Active session state
+```
-**Lifecycle:** Wizard creates it when opening a phase, agents pin findings with `DUNGEON:` prefix, Wizard archives it when closing (`raid-dungeon-phase-N.md`), and all Dungeon files are cleaned up when the session ends.
+**What goes in the Dungeon:** Findings that survived challenge from 2+ agents, key decisions, escalation points.
-Agents interact directly — `@Name` mentions, building on each other's discoveries, roasting weak analysis. The Wizard observes silently, intervening only on destructive loops, drift, deadlock, or misinformation.
+**What stays in conversation:** Back-and-forth challenges, exploratory thinking, concessions. The conversation is the sparring ring. The Dungeon is the scoreboard.
 ---
 ## What Gets Installed
+<details>
+<summary><strong>Full file tree</strong></summary>
 ```
 .claude/
 ├── raid.json                        # Project config (auto-detected, editable)
-├── raid-rules.md                    # 17 team rules across 3 pillars (editable)
+├── party-rules.md                   # Party agent rules (editable)
+├── dungeon-master-rules.md          # Wizard rules (editable)
 ├── settings.json                    # Hooks merged with existing (backup created)
 ├── agents/
 │   ├── wizard.md                    # Dungeon master
@@ -163,127 +249,59 @@ Agents interact directly — `@Name` mentions, building on each other's discover
 │   └── rogue.md                     # Assumption destroyer
 ├── hooks/
 │   ├── raid-lib.sh                  # Shared config and session state
-│   ├── raid-session-start.sh        # Session activation
-│   ├── raid-session-end.sh          # Archive and cleanup
-│   ├── raid-stop.sh                 # Phase transition backup
-│   ├── raid-pre-compact.sh          # Pre-compaction backup
+│   ├── raid-session-start.sh        # Session activation + quest directory
+│   ├── raid-session-end.sh          # Archive to vault + cleanup
+│   ├── raid-pre-compact.sh          # Pre-compaction dungeon backup
 │   ├── raid-task-created.sh         # Task subject validation
-│   ├── raid-task-completed.sh       # Test evidence gate
-│   ├── raid-teammate-idle.sh        # Idle agent nudge
-│   ├── validate-commit.sh           # Conventional commits + test gate
-│   ├── validate-write-gate.sh       # Design doc before implementation
+│   ├── validate-commit.sh           # Conventional commits
+│   ├── validate-write-gate.sh       # Phase-based write protection
 │   ├── validate-file-naming.sh      # Naming convention enforcement
 │   ├── validate-no-placeholders.sh  # No TBD/TODO in specs
 │   ├── validate-dungeon.sh          # Multi-agent verification on pins
 │   ├── validate-browser-tests-exist.sh  # Playwright test detection
 │   └── validate-browser-cleanup.sh  # Browser process cleanup
 └── skills/
-    ├── raid-protocol/               # Session lifecycle and team rules
-    ├── raid-design/                 # Phase 1: adversarial exploration
-    ├── raid-implementation-plan/    # Phase 2: task decomposition
-    ├── raid-implementation/         # Phase 3: TDD with direct challenge
-    ├── raid-review/                 # Phase 4: independent review + fighting
-    ├── raid-finishing/              # Completeness debate + merge options
+    ├── raid-init/                   # Quest selection and session setup
+    ├── raid-canonical-protocol/     # Canonical Quest rules and signals
+    ├── raid-canonical-prd/          # Phase 1: PRD creation
+    ├── raid-canonical-design/       # Phase 2: Adversarial design
+    ├── raid-canonical-implementation-plan/  # Phase 3: Task decomposition
+    ├── raid-canonical-implementation/       # Phase 4: TDD + cross-testing
+    ├── raid-canonical-review/       # Phase 5: Pinning + fixing
+    ├── raid-wrap-up/                # Phase 6: Storyboard + PR + vault
     ├── raid-tdd/                    # RED-GREEN-REFACTOR enforcement
-    ├── raid-debugging/              # Root-cause investigation
     ├── raid-verification/           # Evidence-before-claims gate
-    ├── raid-git-worktrees/          # Isolated workspace creation
-    ├── raid-browser/                # Browser startup discovery
-    ├── raid-browser-playwright/     # Playwright test authoring
-    └── raid-browser-chrome/         # Live browser inspection
+    ├── raid-debugging/              # Root-cause investigation
+    ├── raid-browser/                # Browser orchestration
+    └── raid-browser-chrome/         # Live Chrome inspection
 ```
-Dungeon files (`raid-dungeon.md`, `raid-dungeon-phase-*.md`) and session state (`raid-session`) are created at runtime during Raid sessions and cleaned up automatically.
----
-## Hooks
-Hooks enforce workflow discipline automatically. They split into two categories:
-### Lifecycle Hooks
-Manage session state without manual intervention. These activate and deactivate as sessions begin and end.
-| Hook | Trigger | Purpose |
-|---|---|---|
-| **raid-session-start** | SessionStart | Activates Raid workflow, checks Vault for past quests |
-| **raid-session-end** | SessionEnd | Archives Dungeon, drafts Vault entry, removes session |
-| **raid-stop** | Stop | Backs up Dungeon on phase transitions |
-| **raid-pre-compact** | PreCompact | Backs up Dungeon before message compaction |
-| **raid-task-created** | TaskCreated | Validates task subjects are meaningful |
-| **raid-task-completed** | TaskCompleted | Blocks task completion without test evidence |
-| **raid-teammate-idle** | TeammateIdle | Nudges idle agents to participate |
+</details>
-### Quality Gates
+### Agents (4)
-Enforce code standards and workflow compliance. Quality gates only activate during Raid sessions — they won't interfere with normal coding.
+The Wizard orchestrates. Warrior, Archer, and Rogue each bring a specialized lens. All agents run on Claude Opus 4.6.
-| Hook | Trigger | Purpose |
-|---|---|---|
-| **validate-commit** | PreToolUse (Bash) | Conventional commit format + test gate before commits |
-| **validate-write-gate** | PreToolUse (Write) | Blocks implementation files before design doc exists |
-| **validate-file-naming** | PostToolUse (Write) | Enforces naming convention (kebab-case, snake_case, etc.) |
-| **validate-no-placeholders** | PostToolUse (Write) | Blocks TBD/TODO/FIXME in specs and plans |
-| **validate-dungeon** | PostToolUse (Write) | Requires 2+ agents verified on Dungeon pins |
-| **validate-browser-tests-exist** | PreToolUse (Bash) | Checks Playwright tests exist before commits |
-| **validate-browser-cleanup** | PostToolUse (Bash) | Verifies browser processes cleaned up properly |
+### Hooks (12)
-All hooks are POSIX-compatible, read configuration from `raid.json`, and use `#claude-raid` markers to avoid collisions with your existing hooks.
+Hooks enforce workflow discipline automatically and **only activate during Raid sessions** — they never interfere with normal coding.
-**Exit codes:** `0` = pass, `2` = block with message (agent sees the error and must fix it).
+**Lifecycle hooks** manage session start/end, quest directory creation, vault archival, and context compaction backup.
----
-## Skills
-13 specialized skills guide agent behavior at each stage of the workflow.
-### Phase Skills
-| Skill | Purpose |
-|---|---|
-| **raid-protocol** | Master orchestration. Session lifecycle, team composition, modes, rules, reference tables. Loaded at session start. |
-| **raid-design** | Phase 1. Agents independently explore from assigned angles, challenge findings, pin discoveries. Produces a battle-tested design spec. |
-| **raid-implementation-plan** | Phase 2. Decompose design into testable tasks. Agents debate naming, ordering, coverage, compliance. |
-| **raid-implementation** | Phase 3. One implements (TDD), others attack. Rotate implementer per task. No task passes without all issues resolved. |
-| **raid-review** | Phase 4. Independent reviews against design and plan. Issues classified as Critical (must fix), Important (must fix), or Minor (note). |
-| **raid-finishing** | Completeness debate. Agents argue whether work is done. Four merge options: merge, PR, keep branch, discard. |
+**Quality gate hooks** enforce conventional commits, phase-based write protection, naming conventions, placeholder blocking, multi-agent verification on dungeon pins, and browser test detection.
-### Discipline Skills
+All hooks are POSIX-compatible and use `#claude-raid` markers to coexist safely with your existing hooks.
-| Skill | Purpose |
-|---|---|
-| **raid-tdd** | Strict RED-GREEN-REFACTOR. No production code before a failing test. Challengers attack test quality. |
-| **raid-debugging** | Competing hypotheses in parallel. No fixes without confirmed root cause. 3+ failed fixes triggers architecture rethink. |
-| **raid-verification** | Evidence before assertions. Fresh test run required. Forbidden phrases without evidence: "done", "working", "fixed". |
-| **raid-git-worktrees** | Isolated workspace creation with safety verification and clean test baseline. |
+### Skills (13)
-### Browser Skills
+Skills guide agent behavior across the workflow. Three categories:
-| Skill | Purpose |
-|---|---|
-| **raid-browser** | Browser startup discovery. Detects dev server, auth requirements, startup steps. |
-| **raid-browser-playwright** | Playwright MCP test authoring with network and console assertions. |
-| **raid-browser-chrome** | Live browser inspection via Claude-in-Chrome MCP. Angle-driven investigation. |
----
-## Team Rules
-17 non-negotiable rules organized into 3 pillars. Stored in `.claude/raid-rules.md` (editable).
-### Pillar 1: Intellectual Honesty
-Every claim has evidence you gathered yourself. If you haven't read the code or run the command this turn, you don't know what it says. Never fabricate evidence, certainty, or findings.
-### Pillar 2: Zero Ego Collaboration
-When proven wrong, concede instantly. Defend with evidence, never with authority. A teammate catching your mistake is a gift. Build on each other's work — the best findings come from combining perspectives.
-### Pillar 3: Discipline and Efficiency
-Maximum effort on every task. Every interaction carries work forward. Agents talk directly to each other — the Wizard is not a relay. Escalate to the Wizard only after you've tried to resolve it yourself.
+| Category | Skills | Purpose |
+|:--|:--|:--|
+| **Core** | `raid-init` | Quest selection, greeting, session bootstrap |
+| **Canonical Quest** | 7 phase skills | One skill per phase, chained in order |
+| **Discipline** | `raid-tdd`, `raid-verification`, `raid-debugging` | Quest-agnostic enforcement — invoked within any phase |
+| **Browser** | `raid-browser`, `raid-browser-chrome` | Browser orchestration and live inspection |
 ---
@@ -323,19 +341,20 @@ Maximum effort on every task. Every interaction carries work forward. Agents tal
 ### Auto-Detection
 | Marker File | Language | Test Command | Lint Command |
-|---|---|---|---|
-| `package.json` | JavaScript | `npm test` | `npm run lint` |
+|:--|:--|:--|:--|
+| `package.json` | JavaScript/TypeScript | `npm test` | `npm run lint` |
 | `Cargo.toml` | Rust | `cargo test` | `cargo clippy` |
 | `pyproject.toml` | Python | `pytest` / `poetry run pytest` | `ruff check .` |
 | `requirements.txt` | Python | `pytest` | `ruff check .` |
 | `go.mod` | Go | `go test ./...` | `go vet ./...` |
-Commands are extracted from your project files (e.g., `scripts.test` in `package.json`). Package manager is auto-detected (npm, pnpm, yarn, bun, uv, poetry). Edit `raid.json` to override any value.
+Package manager is auto-detected (npm, pnpm, yarn, bun, uv, poetry). Commands are extracted from your project files where possible. Edit `raid.json` to override any value.
-### Configuration Reference
+<details>
+<summary><strong>Full configuration reference</strong></summary>
 | Key | Default | Description |
-|---|---|---|
+|:--|:--|:--|
 | `project.testCommand` | auto-detected | Command to run tests |
 | `project.lintCommand` | auto-detected | Command to run linting |
 | `project.buildCommand` | auto-detected | Command to build |
@@ -346,9 +365,10 @@ Commands are extracted from your project files (e.g., `scripts.test` in `package
 | `conventions.commits` | `conventional` | Commit message format |
 | `conventions.commitMinLength` | `15` | Minimum commit message length |
 | `conventions.maxDepth` | `8` | Maximum file nesting depth |
-| `raid.defaultMode` | `full` | Default mode: `full`, `skirmish`, `scout` |
 | `raid.lifecycle.testWindowMinutes` | `10` | Max age (minutes) of test run for verification |
+</details>
 ### Browser Testing
 When a browser framework is detected (Next.js, Vite, Angular, etc.), a `browser` section is added to `raid.json`:
@@ -366,25 +386,31 @@ When a browser framework is detected (Next.js, Vite, Angular, etc.), a `browser`
 }
 ```
-This enables browser-specific hooks and skills — Playwright test detection, browser process cleanup, and live browser inspection during reviews.
+This enables browser-specific hooks and skills — Playwright test detection, browser process cleanup, and live Chrome inspection during reviews.
 ---
-## CLI Commands
+## CLI Reference
+| Command | Alias | Purpose |
+|:--|:--|:--|
+| `claude-raid start` | — | Launch the Wizard and begin a quest |
+| `claude-raid summon` | `init` | Install Raid into your project |
+| `claude-raid update` | — | Upgrade hooks, skills, and rules to latest |
+| `claude-raid dismantle` | `remove` | Remove all Raid files, restore original settings |
+| `claude-raid heal` | `doctor` | Check environment health |
+| `claude-raid sync` | — | Git pull + re-summon |
-| Command | Purpose |
-|---|---|
-| `claude-raid summon` | Install Raid into your project |
-| `claude-raid summon --dry-run` | Preview what would be installed |
-| `claude-raid update` | Upgrade hooks, skills, and rules to latest version |
-| `claude-raid dismantle` | Remove all Raid files, restore original settings |
-| `claude-raid heal` | Check environment health, show reference card |
+<details>
+<summary><strong>Command details</strong></summary>
-Old names (`init`, `remove`, `doctor`) still work as aliases.
+### `start`
+Launches `claude --agent wizard` with full permissions. The Wizard loads its rules, greets you, and begins quest selection. This is the primary entry point for running a quest.
 ### `summon`
-Installs the full Raid system into your project. Auto-detects project type, copies agents/hooks/skills, generates `raid.json`, merges settings (with backup), and runs the setup wizard.
+Installs the full Raid system. Auto-detects project type, copies agents/hooks/skills, generates `raid.json`, merges settings (with backup), and runs the setup wizard.
 - Never overwrites existing files — customized agents are preserved
 - Idempotent — safe to run multiple times
@@ -392,15 +418,21 @@ Installs the full Raid system into your project. Auto-detects project type, copi
 ### `update`
-Upgrades hooks, skills, and `raid-rules.md` to the latest version. Skips customized agents (warns which ones were preserved). Does not touch `raid.json` — your project config stays intact.
+Upgrades hooks, skills, and rules to the latest version. Skips customized agents and warns which ones were preserved. Does not touch `raid.json`.
 ### `dismantle`
-Removes all Raid agents, hooks, skills, and config files. Restores `settings.json` from the backup created during install. Preserves non-Raid files in `.claude/`.
+Removes all Raid agents, hooks, skills, and config files. Restores `settings.json` from the backup created during install.
 ### `heal`
-Checks Node.js, Claude Code, jq, teammateMode, and split-pane support. Offers to fix missing configuration interactively. Shows the "How It Works" reference card with modes, phases, hook enforcement, and controls.
+Checks Node.js, Claude Code, jq, tmux, and teammateMode. Offers to fix missing configuration interactively.
+### `sync`
+Pulls latest from remote and re-runs summon to pick up any template changes. Useful after CI bumps the version.
+</details>
 ---
@@ -409,63 +441,59 @@ Checks Node.js, Claude Code, jq, teammateMode, and split-pane support. Offers to
 **tmux pane navigation (recommended):**
 | Action | How |
-|---|---|
-| **Switch to agent pane** | Click the pane, or `Ctrl+B` then arrow key |
-| **Talk to an agent** | Click their pane and type |
-| **View all agents** | All panes visible simultaneously |
+|:--|:--|
+| Switch to agent pane | Click the pane, or `Ctrl+B` then arrow key |
+| Talk to an agent | Click their pane and type |
+| View all agents | All panes visible simultaneously |
 **In-process mode (no tmux):**
 | Shortcut | Action |
-|---|---|
-| **Shift+Down** | Cycle through teammates |
-| **Enter** | View a teammate's session |
-| **Escape** | Interrupt a teammate's turn |
-| **Ctrl+T** | Toggle the shared task list |
-### Starting a Raid session
-```bash
-# Always start tmux first for multi-pane
-tmux new-session -s raid
-# Then start the Wizard inside tmux
-claude --agent wizard
-```
-The Wizard creates a team and spawns agents — each gets its own tmux pane automatically. If you're not inside tmux, agents fall back to in-process mode (single pane).
+|:--|:--|
+| `Shift+Down` | Cycle through teammates |
+| `Enter` | View a teammate's session |
+| `Escape` | Interrupt a teammate's turn |
+| `Ctrl+T` | Toggle the shared task list |
 ---
-## Non-Invasive Design
+## Design Principles
-The Raid is a tool in your toolkit, not your project's operating system.
-- **Never touches your `CLAUDE.md`** — your project instructions stay yours
-- **Merges settings** alongside your existing config, with automatic backup
-- **Won't overwrite** existing agents, hooks, or skills that share a name
-- **Session-scoped hooks** — workflow hooks only activate during Raid sessions, never during normal coding
-- **Clean removal** restores your original `settings.json` from backup
-- **Zero npm dependencies** — pure Node.js stdlib, fast `npx` cold-start
+- **Non-invasive** — Never touches your `CLAUDE.md`. Merges settings alongside your existing config with automatic backup. Clean removal restores originals.
+- **Session-scoped** — Quality gate hooks only activate during Raid sessions. Normal coding is never affected.
+- **Zero dependencies** — Pure Node.js stdlib. Fast `npx` cold-start.
+- **Safe by default** — Never overwrites existing files. Customized agents and rules are always preserved.
+- **Round-based discipline** — Agents work in parallel, flag completion, then cross-test. No mid-thinking interruptions.
+- **Question chain** — Agents never ask the human directly. All questions flow through the Wizard.
+- **Wizard never implements** — Dispatches, observes, digests, rules. The party writes code.
+- **Phase commits** — The Wizard commits at every phase transition with the quest name, phase, and summary.
 ---
-## Inherited from Superpowers
+## Heritage
-The Raid inherits and adapts the [Superpowers](https://github.com/obra/superpowers) behavioral harness:
+Adapted from [obra/superpowers](https://github.com/obra/superpowers) by Jesse Vincent. The Raid inherits five core enforcement principles:
-| Concept | How the Raid Uses It |
-|---|---|
-| HARD-GATEs | No code before design approval. No implementation before plan approval. |
-| TDD Iron Law | No production code without a failing test first. Enforced in all modes. |
-| Verification Iron Law | No completion claims without fresh test evidence. |
-| No Placeholders | Specs and plans must contain complete content, not "TBD" or "implement later". |
-| Conventional Commits | Enforced via hook: `type(scope): description`. |
+| Principle | How the Raid Enforces It |
+|:--|:--|
+| **HARD-GATEs** | No code before design approval. No implementation before plan approval. |
+| **TDD Iron Law** | No production code without a failing test first. Enforced in every phase. |
+| **Verification Iron Law** | No completion claims without fresh test evidence. |
+| **No Placeholders** | Specs and plans must contain complete content — no "TBD" or "implement later". |
+| **Conventional Commits** | Enforced via hook: `type(scope): description`. |
-**What's different:** Superpowers uses a single agent with subagent delegation. The Raid uses 4 agents in an agent team with adversarial cross-testing, direct interaction, and collaborative learning. Agents talk to each other (not through the Wizard), pin verified findings to a shared Dungeon, and self-organize within phases. Every decision is stress-tested from multiple angles before it passes.
+**What's different:** Superpowers uses a single agent with subagent delegation. The Raid uses 4 persistent agents with adversarial cross-testing. Agents challenge each other directly, pin verified findings to a shared Dungeon, and self-organize within phases. Every decision is stress-tested from three angles before it passes.
 ---
 ## License
 MIT
+---
+<div align="center">
+Built for [Claude Code](https://claude.ai/claude-code).
+</div>

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "claude-raid",
-  "version": "0.2.4",
+  "version": "0.2.6",
   "type": "commonjs",
   "description": "Adversarial multi-agent development system for Claude Code",
   "author": "Pedro Picardi",