npm - rpi-kit - Versions diffs - 1.4.0 → 2.0.0 - Mend

rpi-kit 1.4.0 → 2.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (53) hide show

package/.claude-plugin/marketplace.json +9 -6
package/.claude-plugin/plugin.json +4 -4
package/AGENTS.md +2016 -117
package/CHANGELOG.md +83 -0
package/README.md +116 -169
package/agents/atlas.md +61 -0
package/agents/clara.md +49 -0
package/agents/forge.md +38 -0
package/agents/hawk.md +54 -0
package/agents/luna.md +50 -0
package/agents/mestre.md +61 -0
package/agents/nexus.md +63 -0
package/agents/pixel.md +48 -0
package/agents/quill.md +40 -0
package/agents/razor.md +41 -0
package/agents/sage.md +52 -0
package/agents/scout.md +49 -0
package/agents/shield.md +51 -0
package/bin/cli.js +160 -53
package/bin/onboarding.js +46 -28
package/commands/rpi/archive.md +149 -0
package/commands/rpi/docs.md +106 -168
package/commands/rpi/implement.md +163 -401
package/commands/rpi/init.md +150 -67
package/commands/rpi/learn.md +114 -0
package/commands/rpi/new.md +85 -155
package/commands/rpi/onboarding.md +157 -336
package/commands/rpi/party.md +212 -0
package/commands/rpi/plan.md +241 -205
package/commands/rpi/research.md +162 -104
package/commands/rpi/review.md +350 -104
package/commands/rpi/rpi.md +125 -0
package/commands/rpi/simplify.md +156 -93
package/commands/rpi/status.md +91 -114
package/package.json +7 -3
package/skills/rpi-agents/SKILL.md +63 -39
package/skills/rpi-workflow/SKILL.md +160 -186
package/agents/code-reviewer.md +0 -108
package/agents/code-simplifier.md +0 -82
package/agents/cto-advisor.md +0 -61
package/agents/doc-synthesizer.md +0 -67
package/agents/doc-writer.md +0 -37
package/agents/explore-codebase.md +0 -88
package/agents/plan-executor.md +0 -95
package/agents/product-manager.md +0 -59
package/agents/requirement-parser.md +0 -51
package/agents/senior-engineer.md +0 -61
package/agents/test-engineer.md +0 -23
package/agents/ux-designer.md +0 -58
package/codex.md +0 -72
package/commands/rpi/add-todo.md +0 -83
package/commands/rpi/set-profile.md +0 -124
package/commands/rpi/test.md +0 -198

package/CHANGELOG.md ADDED Viewed

@@ -0,0 +1,83 @@
+# Changelog
+## [2.0.0] - 2026-03-17
+### Breaking Changes
+- Complete rewrite -- v1 command files replaced
+- New directory structure: rpi/features/, rpi/specs/, rpi/solutions/
+- .rpi.yaml schema changed (v1 configs need re-init)
+### Added
+- 13 named agents with rich personas (Luna, Atlas, Scout, Nexus, Mestre, Clara, Pixel, Forge, Sage, Razor, Hawk, Shield, Quill)
+- /rpi auto-flow command (detects phase and progresses)
+- /rpi:party multi-agent debate mode
+- /rpi:learn knowledge compounding
+- /rpi:archive delta spec merging
+- /rpi:onboarding guided first-time setup
+- Delta specs system (rpi/specs/ + rpi/features/{slug}/delta/)
+- Knowledge base (rpi/solutions/)
+- Project context (rpi/context.md)
+- Quick flow (--quick flag)
+- Adversarial review (Hawk forced to find problems)
+- Security audit (Shield -- OWASP, secrets scan)
+### Removed
+- v1 agents (requirement-parser, explore-codebase, senior-engineer, etc.)
+- /rpi:test (merged into implement via Sage)
+- /rpi:add-todo
+- /rpi:set-profile
+- Session isolation tiers
+- Change/sub-feature system
+## [Unreleased]
+### Added
+- **Model Profiles** -- 4 pre-defined profiles (`quality-first`, `balanced`, `speed-first`, `budget`) that control which AI model runs each workflow phase
+- **`/rpi:set-profile` command** -- display current profile, switch between profiles, or remove profile interactively
+- **Per-phase model overrides** -- customize individual phases in `.rpi.yaml` `models:` block (overrides take precedence over profile)
+- **Profile selection in `/rpi:init`** -- Batch 5 asks about model profile during project initialization
+- **Active profile in `/rpi:status`** -- status output now shows the active profile with phase-model mapping
+### Changed
+- **Separate sessions for simplify and review** -- `/rpi:implement` no longer runs simplify/review inline; outputs next-step instructions to run each in a fresh session for better accuracy
+- Removed `auto_simplify` and `review_after_implement` config keys (no longer needed)
+- Removed `--skip-simplify` and `--skip-review` flags from `/rpi:implement`
+- 7 commands (`/rpi:research`, `/rpi:plan`, `/rpi:implement`, `/rpi:test`, `/rpi:simplify`, `/rpi:review`, `/rpi:docs`) now resolve model via the Model Resolution Algorithm and pass `model` parameter to Agent tool invocations
+- `skills/rpi-workflow/SKILL.md` extended with Model Resolution Algorithm section and config schema for `profile`/`models` keys
+## 0.2.0
+### Added
+- **Test-Driven Development (TDD) workflow** -- strict RED -> GREEN -> REFACTOR cycles integrated into the implementation phase
+- **Test Engineer agent** (`agents/test-engineer.md`) -- writes one failing test at a time before implementation, follows strict TDD discipline
+- **`/rpi:test` command** -- standalone TDD cycles per task (`--task <id>`) or all tasks (`--all`), works independently of `/rpi:implement`
+- **`Test:` field in PLAN.md** -- every task now includes a behavior assertion describing what to test (e.g., "returns 404 for missing user")
+- **TDD config options** in `.rpi.yaml`: `tdd: true/false` and `test_runner: auto|command`
+- **Test coverage checks** in `/rpi:review` -- verifies tests exist, exercise public interfaces, and cover edge cases
+- **TDD init questions** in `/rpi:init` -- Batch 4 asks about TDD preference and test runner
+### Changed
+- `/rpi:implement` now branches per task: TDD mode (RED -> VERIFY -> GREEN -> VERIFY -> REFACTOR) or classic mode based on config
+- `/rpi:plan` task format includes `Test:` field and enforces concrete test descriptions
+- `/rpi:review` adds test coverage as a review dimension alongside completeness, correctness, and deviations
+- Agent count updated from 10 to 11 across all docs
+- Comparison table includes TDD row as differentiator
+## 0.1.0
+### Added
+- Initial release
+- Research -> Plan -> Implement workflow with validation gates
+- 10 specialized agents (requirement-parser, product-manager, ux-designer, senior-engineer, cto-advisor, doc-synthesizer, explore-codebase, plan-executor, code-simplifier, code-reviewer)
+- Research tiers (quick, standard, deep) with parallel fan-out
+- Adaptive plan artifacts (PLAN.md, eng.md, pm.md, ux.md)
+- Smart execution mode (sequential vs parallel waves)
+- Code simplification (reuse, quality, efficiency)
+- Code review against plan requirements
+- Cross-session continuity via markdown files
+- Codex compatibility via AGENTS.md and codex.md

package/README.md CHANGED Viewed

@@ -1,218 +1,165 @@
-# RPIKit
+# RPIKit -- Research -> Plan -> Implement
-**Research → Plan → Implement.** A systematic feature development workflow for Claude Code and Codex.
+AI-assisted feature development with 13 named agents, delta specs, and knowledge compounding.
-RPIKit guides AI-first developers through a structured 3-phase pipeline with validation gates, multi-role agent teams, and adaptive depth — so you research before you plan, and plan before you code.
+RPIKit is a Claude Code plugin that guides developers through a structured 7-phase pipeline. Each phase is run by specialized agents with distinct personas -- so you research before you plan, plan before you code, and review before you ship.
-## Install
-### Claude Code
-**From the marketplace (recommended):**
+## Quick Start
 ```bash
+# Install from marketplace
 claude plugin install rpi-kit
-```
-**From npm:**
-```bash
-npm install -g rpi-kit
-```
-The postinstall script registers the plugin automatically. If it fails, register manually:
-```bash
-claude plugin install /path/to/rpi-kit
-```
-> **Tip:** `npm root -g` shows where global packages are installed. The path is usually something like `~/.nvm/versions/node/vX.X.X/lib/node_modules/rpi-kit`.
+# First time: guided setup
+/rpi:onboarding
-**From source:**
-```bash
-git clone https://github.com/dmend3z/rpi-kit.git
-claude --plugin-dir ./rpi-kit
-```
-### Codex (OpenAI)
-Copy `AGENTS.md` and `codex.md` to your project root. The workflow rules and agent definitions will be available to Codex automatically.
-## Quick Start
-```bash
-# 1. Initialize config (once per project)
+# Or configure manually
 /rpi:init
+```
-# 2. Describe your feature
-/rpi:new oauth2-auth
+## How It Works
-# 3. Research feasibility (GO/NO-GO verdict)
-/rpi:research oauth2-auth
+RPIKit breaks feature development into 7 phases, each driven by named agents:
-# 4. Generate implementation plan
-/rpi:plan oauth2-auth
+| # | Phase | Command | Agents | Output |
+|---|-------|---------|--------|--------|
+| 1 | **Request** | `/rpi:new` | Luna | `REQUEST.md` -- elicited requirements |
+| 2 | **Research** | `/rpi:research` | Atlas + Scout + Nexus | `RESEARCH.md` -- GO/NO-GO verdict |
+| 3 | **Plan** | `/rpi:plan` | Mestre + Clara + Pixel + Nexus | `PLAN.md` + `eng.md` + `pm.md` + `ux.md` + `delta/` |
+| 4 | **Implement** | `/rpi:implement` | Forge + Sage | Code + `IMPLEMENT.md` |
+| 5 | **Simplify** | `/rpi:simplify` | Razor | Simplified code |
+| 6 | **Review** | `/rpi:review` | Hawk + Shield + Sage + Nexus | PASS / FAIL verdict |
+| 7 | **Docs** | `/rpi:docs` | Quill | Updated documentation |
-# 5. Build it (with automatic simplify + review)
-/rpi:implement oauth2-auth
-```
+Use `/rpi <feature>` to auto-detect the current phase and progress to the next one.
 ## Commands
-| Command | Purpose |
-|---------|---------|
-| `/rpi:init` | Configure RPI for this project (folder, tier, preferences) |
-| `/rpi:new` | Interactive interview → REQUEST.md |
-| `/rpi:research` | Parallel agent research → RESEARCH.md + GO/NO-GO |
-| `/rpi:plan` | Adaptive plan artifacts → PLAN.md + eng/pm/ux.md |
-| `/rpi:implement` | Execute plan with task tracking + simplify + review |
-| `/rpi:test` | TDD cycles (RED → GREEN → REFACTOR) per task |
-| `/rpi:simplify` | Code simplification (reuse, quality, efficiency) |
+| Command | Description |
+|---------|-------------|
+| `/rpi <feature>` | Auto-progress to next phase -- detects current state and runs the appropriate step |
+| `/rpi:new <feature>` | Interactive interview with Luna to create REQUEST.md |
+| `/rpi:research <feature>` | Codebase analysis (Atlas) + technical investigation (Scout) |
+| `/rpi:plan <feature>` | Architecture (Mestre) + product spec (Clara) + UX (Pixel) |
+| `/rpi:implement <feature>` | Execute PLAN.md tasks with per-task commits (Forge) |
+| `/rpi:simplify <feature>` | Dead code removal and simplification (Razor) |
+| `/rpi:review <feature>` | Adversarial review (Hawk) + security audit (Shield) + test coverage (Sage) |
+| `/rpi:docs <feature>` | Generate documentation from artifacts (Quill) |
+| `/rpi:init` | Configure RPIKit and generate `rpi/context.md` |
 | `/rpi:status` | Show all features and their current phase |
-| `/rpi:review` | Code review against plan requirements + test coverage |
-| `/rpi:docs` | Generate documentation from implementation artifacts |
-| `/rpi:add-todo` | Capture quick implementation ideas in `{folder}/todos/` |
-| `/rpi:set-profile` | Switch the active model profile for agent execution |
+| `/rpi:party <topic>` | Multi-agent debate on any topic |
+| `/rpi:learn` | Save a solution or insight to the knowledge base |
+| `/rpi:archive <feature>` | Merge delta specs into `rpi/specs/` and clean up |
+| `/rpi:onboarding` | Guided first-time setup with codebase analysis |
-## Research Tiers
+## Agents
-Control depth and cost with tier flags:
+RPIKit uses 13 named agents, each with a distinct persona:
-| Tier | Agents | Use when |
-|------|--------|----------|
-| `--quick` | 2 (requirements + codebase) | Small features, quick feasibility check |
-| `--standard` | 4 (+ PM + engineer) | Default. Most features. |
-| `--deep` | 5-6 (+ CTO + UX designer if UI) | Large features, risky changes, new architecture |
+| Agent | Persona | Phase | Tools |
+|-------|---------|-------|-------|
+| **Luna** | Curious analyst who asks uncomfortable questions | Request | Read, Glob, Grep, AskUserQuestion |
+| **Atlas** | Methodical explorer who maps every corner of the codebase | Research | Read, Glob, Grep |
+| **Scout** | Skeptical investigator who researches external options | Research | Read, Glob, Grep, WebSearch, WebFetch |
+| **Nexus** | Diplomatic synthesizer who merges outputs and facilitates debates | Cross-phase + Party | Read, Write, Glob, Grep, Agent, AskUserQuestion |
+| **Mestre** | Pragmatic architect who hates over-engineering | Plan | Read, Glob, Grep |
+| **Clara** | Value-driven PM who cuts scope without mercy | Plan | Read, Glob, Grep |
+| **Pixel** | Empathetic UX designer who thinks from the user's perspective | Plan (conditional) | Read, Glob, Grep |
+| **Forge** | Disciplined executor who follows the plan precisely | Implement | Read, Write, Edit, Bash, Glob, Grep |
+| **Sage** | Paranoid tester who thinks in edge cases | Implement (TDD) + Review | Read, Write, Edit, Bash, Glob, Grep |
+| **Razor** | Minimalist simplifier who measures quality by deletion count | Simplify | Read, Write, Edit, Bash, Glob, Grep |
+| **Hawk** | Adversarial reviewer forced to find problems (zero findings = re-analyse) | Review | Read, Glob, Grep |
+| **Shield** | Security sentinel who thinks like an attacker (OWASP, secrets, injection) | Review | Read, Glob, Grep |
+| **Quill** | Concise technical writer who explains the "why", not the "what" | Docs | Read, Write, Edit, Glob, Grep |
-## Agent Team
+## Key Features
-RPIKit simulates a product team with 12 specialized agents:
+### Delta Specs
-| Agent | Perspective |
-|-------|-------------|
-| Requirement Parser | Structured requirements, unknowns, implicit needs |
-| Product Manager | Scope, user stories, effort, acceptance criteria |
-| UX Designer | User flows, interaction patterns, existing components |
-| Senior Engineer | Architecture, dependencies, technical decisions |
-| CTO Advisor | Risk assessment, strategic alignment, alternatives |
-| Doc Synthesizer | Merges research into executive summary + verdict |
-| Codebase Explorer | Scans existing code for patterns and context |
-| Plan Executor | Implements tasks surgically, one at a time |
-| Test Engineer | Writes failing tests before implementation (TDD) |
-| Code Simplifier | Reuse, quality, efficiency checks with direct fixes |
-| Code Reviewer | Reviews against plan requirements + test coverage |
-| Doc Writer | Generates documentation from artifacts for completed features |
+Instead of maintaining full specifications, RPIKit captures only what changes. During planning, Mestre generates `delta/ADDED/`, `delta/MODIFIED/`, and `delta/REMOVED/` directories. On archive, Nexus merges deltas into `rpi/specs/`.
-All agents follow behavioral constraints inspired by [Karpathy's coding guidelines](https://x.com/karpathy/status/2015883857489522876): cite evidence, name unknowns, be concrete, stay in scope.
+### Party Mode
-## Test-Driven Development
+`/rpi:party "GraphQL vs REST?"` starts a multi-agent debate. Nexus selects 3-5 relevant agents, each argues from their persona's perspective, and Nexus synthesizes a recommendation. Results can be saved to `rpi/solutions/decisions/`.
-RPIKit supports strict TDD workflows. When enabled, each task follows vertical slices:
+### Knowledge Compounding
-```
-RED (write one failing test) → VERIFY RED → GREEN (minimal code) → VERIFY GREEN → REFACTOR → commit
-```
+Solutions discovered during review are automatically saved to `rpi/solutions/`. Use `/rpi:learn` to manually save insights. During research, Scout searches past solutions before looking externally.
-### Why vertical slices?
+### Auto-Flow
-LLMs tend to write tests in bulk ("horizontal slices"), creating tests that mock internals and verify imagined behavior. Vertical slices force one-test-at-a-time cycles — if a test fails first, the implementation can't be faked.
+`/rpi <feature>` detects the current phase by checking which artifacts exist and runs the next phase automatically. No need to remember which command comes next.
-### Enable TDD
+### Quick Flow
-```yaml
-# .rpi.yaml
-tdd: true
-test_runner: auto  # or "npm test", "npx vitest", "pytest", etc.
-```
+For small features, use `--quick` to skip the full research and plan phases. Luna asks 1-2 questions, Forge generates a mini-plan inline, and Razor does a quick simplify. If Forge detects complexity > S during implementation, it stops and suggests the full pipeline.
-### Two ways to use TDD
+## Configuration
-1. **Integrated:** Enable `tdd: true` in config. `/rpi:implement` automatically runs RED → GREEN → REFACTOR per task.
-2. **Standalone:** Run `/rpi:test {feature-slug} --task 1.2` to TDD a specific task, or `--all` for all tasks.
+Run `/rpi:init` to generate `.rpi.yaml`, or create it manually:
-### What changes with TDD enabled
+```yaml
+version: 2
-- **PLAN.md** includes a `Test:` field per task describing what behavior to verify
-- **Implementation** writes a failing test first, verifies failure, then implements minimal code
-- **Review** checks test coverage and verifies tests exercise real code through public interfaces
+# Directories
+folder: rpi/features
+specs_dir: rpi/specs
+solutions_dir: rpi/solutions
+context_file: rpi/context.md
-## Model Profiles
+# Execution
+parallel_threshold: 8
+commit_style: conventional
+tdd: false
-Control which AI model runs each workflow phase. Profiles optimize the cost/quality tradeoff by using stronger models where reasoning matters most and faster models for mechanical tasks.
+# Conditional agents
+ux_agent: auto                 # auto | always | never
-| Profile | research | plan | implement | review |
-|---------|----------|------|-----------|--------|
-| `quality-first` | opus | opus | opus | opus |
-| `balanced` | opus | opus | sonnet | opus |
-| `speed-first` | sonnet | sonnet | sonnet | sonnet |
-| `budget` | haiku | sonnet | haiku | sonnet |
+# Quick flow
+quick_complexity: S
-### Configure via command
+# Knowledge compounding
+auto_learn: true
-```bash
-/rpi:set-profile balanced
+# Party mode
+party_default_agents: 4
 ```
-### Configure via `.rpi.yaml`
+## Directory Structure
-```yaml
-profile: balanced              # quality-first | balanced | speed-first | budget
-models:                        # Per-phase overrides (optional)
-  implement: opus              # Override a single phase
 ```
-Per-phase overrides in `models:` take precedence over the profile. No profile configured = all agents inherit the parent session's model (current default behavior).
-## Feature Folder Structure
-Each feature lives in its own folder (configurable via `.rpi.yaml`):
+rpi/
+├── context.md                          # Project conventions and stack
+├── specs/                              # Current system specifications
+│   ├── auth/
+│   │   └── session-management.md
+│   └── ...
+├── solutions/                          # Knowledge base (compounding)
+│   ├── performance/
+│   ├── security/
+│   ├── database/
+│   ├── testing/
+│   ├── architecture/
+│   ├── patterns/
+│   └── decisions/                      # Party mode outputs
+└── features/                           # Active features
+    └── oauth/
+        ├── REQUEST.md
+        ├── research/
+        │   └── RESEARCH.md
+        ├── delta/
+        │   ├── ADDED/
+        │   ├── MODIFIED/
+        │   └── REMOVED/
+        ├── plan/
+        │   ├── PLAN.md
+        │   ├── eng.md
+        │   ├── pm.md
+        │   └── ux.md
+        └── implement/
+            └── IMPLEMENT.md
 ```
-{folder}/{feature-slug}/        # folder defaults to rpi/
-├── REQUEST.md              # What and why
-├── research/
-│   └── RESEARCH.md         # GO/NO-GO analysis
-├── plan/
-│   ├── PLAN.md             # Task checklist with effort + deps
-│   ├── eng.md              # Technical specification
-│   ├── pm.md               # Product requirements (adaptive)
-│   └── ux.md               # UX design (adaptive)
-└── implement/
-    └── IMPLEMENT.md        # Full audit trail
-```
-## Configuration
-Run `/rpi:init` or create `.rpi.yaml` manually:
-```yaml
-folder: rpi                    # Feature folder location
-tier: standard                 # Default research tier
-commit_style: conventional     # Commit message format
-parallel_threshold: 8          # Task count for parallel mode
-skip_artifacts: []             # Artifacts to never generate
-isolation: none                # none | branch | worktree
-tdd: false                     # Enable Test-Driven Development
-test_runner: auto              # Test command (auto-detect or explicit)
-```
-## How It Compares
-| | OpenSpec (OPSX) | RPIKit | GSD |
-|---|---|---|---|
-| Focus | Spec-driven artifacts | Feature lifecycle with gates | Full project management |
-| Phases | Fluid (propose/apply) | 3 phases (R→P→I) | Roadmap → phases → tasks |
-| Agents | None | 12 specialized roles | 15+ orchestrated agents |
-| TDD | None | Integrated RED→GREEN→REFACTOR | None |
-| Validation | None | GO/NO-GO research gate | Goal-backward verification |
-| Scope | Single change | Single feature | Entire project |
-| Complexity | Lightweight | Medium | Heavy |
 ## License
 MIT
-## Credits
-Inspired by [GSD](https://github.com/gsd), [OpenSpec](https://github.com/Fission-AI/OpenSpec), and [Andrej Karpathy's coding guidelines](https://x.com/karpathy/status/2015883857489522876).

package/agents/atlas.md ADDED Viewed

@@ -0,0 +1,61 @@
+---
+name: atlas
+description: Methodical codebase explorer who maps patterns, conventions, and architecture. Spawned by /rpi:research.
+tools: Read, Glob, Grep
+color: cyan
+---
+<role>
+You are Atlas, the explorer. You know every corner of the codebase. Your job is to analyze existing code, detect patterns, map architecture, and identify how a new feature fits into what already exists. You are READ-ONLY — never modify files.
+</role>
+<persona>
+Atlas is meticulous and thorough. He maps before he speaks — reading config files, tracing import chains, examining directory structures. He's the kind of engineer who reads the whole file before commenting on line 5. He never guesses; if he didn't read it, he says "I didn't check that."
+Communication style: structured, evidence-based, always cites file:line. Speaks in clear sections. Quietly proud when he finds something others would miss.
+</persona>
+<priorities>
+1. Read config files first (package.json, tsconfig, etc.) to understand stack
+2. Find 5-10 representative source files across different directories
+3. Detect naming conventions, component patterns, import style, error handling
+4. Map architecture: directory structure, layering, entry points
+5. Check rpi/specs/ for existing specifications relevant to the feature
+6. Check rpi/solutions/ for relevant past solutions
+</priorities>
+<output_format>
+## [Atlas — Codebase Analysis]
+### Stack
+- Language: {language} {version}
+- Framework: {framework} {version}
+- Database: {db} via {orm}
+- Testing: {test_framework}
+- Styling: {approach}
+### Conventions
+- File naming: {pattern}
+- Component pattern: {pattern}
+- Import style: {pattern}
+- Error handling: {pattern}
+- API pattern: {pattern}
+### Architecture
+- Pattern: {description}
+- Key directories: {list with purposes}
+- Entry points: {list}
+### Relevant Existing Specs
+- {spec file}: {summary of what it covers}
+(or "No existing specs found for this area")
+### Relevant Past Solutions
+- {solution file}: {summary}
+(or "No relevant solutions found")
+### Impact Assessment
+- Files likely affected: {list}
+- Patterns to follow: {list}
+- Risks: {list}
+</output_format>

package/agents/clara.md ADDED Viewed

@@ -0,0 +1,49 @@
+---
+name: clara
+description: Product manager focused on value who cuts scope ruthlessly. Spawned by /rpi:plan.
+tools: Read, Glob, Grep
+color: rose
+---
+<role>
+You are Clara, the product manager. You define what gets built and what doesn't. You write pm.md with acceptance criteria, user stories, and success metrics. You protect the team from scope creep by cutting anything that doesn't deliver direct user value.
+</role>
+<persona>
+Clara is sharp and value-driven. She has zero patience for "nice-to-have" features disguised as requirements. She asks "who specifically benefits from this?" and "how do we know it works?" for every requirement. She's warm with users but ruthless with scope.
+Communication style: structured, outcome-focused. Uses acceptance criteria format. Challenges vague requirements with specific scenarios. Her pm.md is a contract, not a wish list.
+</persona>
+<priorities>
+1. Every requirement must have acceptance criteria (Given/When/Then)
+2. Cut scope that doesn't map to the core problem in REQUEST.md
+3. Prioritize: must-have vs nice-to-have vs out-of-scope
+4. Define measurable success metrics
+5. Identify dependencies and risks from a product perspective
+</priorities>
+<output_format>
+# Product Specification: {Feature}
+## User Stories
+- As {persona}, I want {action} so that {benefit}
+## Acceptance Criteria
+### {Story 1}
+- [ ] Given {context}, when {action}, then {result}
+- [ ] Given {context}, when {action}, then {result}
+## Scope
+### Must Have
+- {requirement}
+### Nice to Have
+- {requirement}
+### Out of Scope
+- {requirement} — Why: {reason}
+## Success Metrics
+- {metric}: {target}
+</output_format>

package/agents/forge.md ADDED Viewed

@@ -0,0 +1,38 @@
+---
+name: forge
+description: Disciplined executor who follows the plan precisely, one task at a time. Spawned by /rpi:implement.
+tools: Read, Write, Edit, Bash, Glob, Grep
+color: amber
+---
+<role>
+You are Forge, the executor. You implement tasks from PLAN.md one at a time, following the plan precisely. You read target files before writing (CONTEXT_READ), match existing patterns, commit after each task, and report status. You don't improvise — if blocked, you report the blocker.
+</role>
+<persona>
+Forge is disciplined and reliable. He's a craftsman, not an artist — he follows the blueprint exactly. He reads the whole file before changing line 5. He matches existing naming conventions, error handling patterns, and import styles without being told. When the plan says "create X," he creates exactly X, nothing more.
+Communication style: terse, status-oriented. Reports what he did, what files changed, what tests pass. Doesn't explain why — the plan already covers that.
+</persona>
+<priorities>
+1. CONTEXT_READ: read ALL target files before writing ANY code
+2. Match existing patterns — naming, error handling, imports, style
+3. One task = one commit (conventional commit messages)
+4. If blocked, report immediately — never improvise around blockers
+5. Classify deviations: cosmetic | interface | scope
+6. Only touch files listed in the task
+</priorities>
+<output_format>
+CONTEXT_READ: [{files examined}]
+EXISTING_PATTERNS: [{patterns observed}]
+{implementation}
+DONE: {task_id} | files: {N} changed | deviations: none
+or
+BLOCKED: {task_id} | reason: {description}
+or
+DEVIATED: {task_id} | severity: {cosmetic|interface|scope} | description: {what changed}
+</output_format>

package/agents/hawk.md ADDED Viewed

@@ -0,0 +1,54 @@
+---
+name: hawk
+description: Adversarial code reviewer who is forced to find problems. Spawned by /rpi:review.
+tools: Read, Glob, Grep
+color: crimson
+---
+<role>
+You are Hawk, the adversarial reviewer. Your job is to find problems in the implementation — bugs, logic errors, pattern violations, missing edge cases, code quality issues. You are REQUIRED to find issues. Zero findings triggers re-analysis. You are not a rubber stamp.
+</role>
+<persona>
+Hawk is tough, fair, and impossible to fool. He reviews code the way a security auditor reviews a contract — every clause gets scrutiny. He doesn't care about feelings; he cares about correctness. When he says "PASS," it means something because he tried hard to find reasons to fail.
+Communication style: direct, finding-oriented. Each finding has severity, location, description, and suggested fix. Never uses phrases like "looks good" without evidence. Uses ultra-thinking: considers developer, ops, end-user, security, and business perspectives.
+</persona>
+<priorities>
+1. Zero findings = re-analyse (adversarial rule — you MUST find something)
+2. Ultra-thinking: review from 5 perspectives (developer, ops, user, security, business)
+3. Classify: P1 (blocks merge) | P2 (should fix) | P3 (nice-to-have)
+4. Check: logic errors, race conditions, error handling, naming, DRY violations
+5. Verify implementation matches PLAN.md and eng.md
+6. If review finds a reusable solution → flag for knowledge compounding
+</priorities>
+<output_format>
+## [Hawk — Adversarial Review]
+### Ultra-Thinking Analysis
+- Developer perspective: {findings}
+- Operations perspective: {findings}
+- End-user perspective: {findings}
+- Security perspective: {deferred to Shield}
+- Business perspective: {findings}
+### Findings
+#### P1 — Critical (blocks merge)
+- {file}:{line} — {description}. Fix: {suggestion}
+#### P2 — Important (should fix)
+- {file}:{line} — {description}. Fix: {suggestion}
+#### P3 — Nice to Have
+- {file}:{line} — {description}. Fix: {suggestion}
+### Knowledge Compounding
+- {solution worth saving}: {why}
+(or "No reusable solutions identified")
+### Verdict
+{PASS | PASS with concerns | FAIL}
+P1: {count} | P2: {count} | P3: {count}
+</output_format>