npm - projecta-rrr - Versions diffs - 1.20.0 → 1.21.0 - Mend

projecta-rrr 1.20.0 → 1.21.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (45) hide show

package/CHANGELOG.md +66 -0
package/agents/rrr-auditor.md +1 -0
package/agents/rrr-codebase-mapper.md +9 -0
package/agents/rrr-debugger.md +1 -0
package/agents/rrr-executor.md +63 -0
package/agents/rrr-explore.md +78 -75
package/agents/rrr-integration-checker.md +1 -0
package/agents/rrr-phase-researcher.md +9 -0
package/agents/rrr-plan-checker.md +1 -0
package/agents/rrr-planner.md +1 -0
package/agents/rrr-project-researcher.md +9 -0
package/agents/rrr-research-synthesizer.md +9 -0
package/agents/rrr-roadmapper.md +1 -0
package/agents/rrr-verifier.md +3 -0
package/bin/install.js +365 -0
package/docs/CONTEXT-BUDGET.md +197 -0
package/docs/SEMANTIC-SEARCH.md +23 -0
package/docs/hosted-search-setup.md +307 -0
package/docs/token-hygiene.md +280 -0
package/hooks/tool-redirect.js +68 -0
package/mcp.registry.json +20 -1
package/package.json +4 -2
package/rrr/lib/install-hooks-wiring.js +204 -0
package/rrr/lib/install-merge-agents.js +204 -0
package/rrr/lib/install-preserve-registry.js +307 -0
package/rrr/references/goal-backward.md +1 -33
package/rrr/references/principles.md +1 -29
package/rrr/workflows/complete-milestone.md +6 -7
package/rrr/workflows/plan-phase.md +1 -41
package/scripts/bench-explore-haiku.js +308 -0
package/scripts/bench-queries.json +27 -0
package/scripts/check-tool-descriptions.js +87 -0
package/scripts/claude-md-lint.js +92 -0
package/scripts/fly-apps-create.sh +46 -0
package/scripts/fly-machine-harden.sh +67 -0
package/scripts/fly-secrets-set.sh +83 -0
package/scripts/fly-worker-apps-create.sh +33 -0
package/scripts/fly-worker-secrets-set.sh +62 -0
package/scripts/lint-agent-frontmatter.js +94 -0
package/scripts/lint-summary-structure.js +179 -0
package/scripts/measure-context-cost.js +195 -0
package/scripts/prepublish-check.js +170 -0
package/scripts/rrr-log-decision.js +186 -0
package/scripts/security-gate.js +281 -0
package/scripts/test-install-smoke.js +153 -3

package/CHANGELOG.md CHANGED Viewed

@@ -4,6 +4,72 @@ All notable changes to RRR will be documented in this file.
 Format follows [Keep a Changelog](https://keepachangelog.com/en/1.1.0/).
+## [1.21.0] - 2026-04-18
+<!-- Date-stamped at ship per SHIP-RUNBOOK D.8.
+     Note: Phase 78 SHIP GATE (BNCH-01..07) is DEFERRED — infrastructure
+     is deployed (rrr-search-hosted/worker/cron on Fly + Neon + Upstash
+     + Voyage), but live benchmark measurements require ingested data
+     which in turn requires a GitHub App (not yet provisioned). Client
+     local stdio path is unchanged and fully back-compatible; hosted
+     path is opt-in via `--enable-hosted`. Operator runs BNCH gates
+     post-GitHub-App-setup per SHIP-RUNBOOK §D.5. -->
+**Hosted Search + Agent Token Diet** — Our biggest release. Add-only hosted search
+path via Fly.io + Neon pgvector + Voyage embeddings, with graceful fallback to
+local Ollama+LanceDB when hosted is unavailable. Plus a Haiku-powered token diet
+on the RRR agents that stacks another ~3x per-query reduction on the exploration leg.
+### Added
+- **Hosted semantic search (opt-in)** — `npx projecta-rrr --enable-hosted` registers a hosted MCP on Fly.io backed by Neon pgvector + Voyage `voyage-code-3`. Default install remains local-only. One-URL onboarding <2 min. See [docs/hosted-search-setup.md](docs/hosted-search-setup.md).
+- **Client fallback shim** — Transparent swap to local stdio after 3 consecutive 5xx in 30s; 5-min health probe restores hosted. Zero user-visible outage.
+- **Team-scoped bearer auth** — `rrr_<team-prefix>_<32-char-base62>` argon2id-hashed, LRU-cached with LISTEN/NOTIFY invalidation (5s revocation). See [docs/token-hygiene.md](docs/token-hygiene.md).
+- **Postgres RLS tenant isolation** — `chunks.team_id` denormalized + `rrr_app` NOBYPASSRLS + fuzz-tested cross-tenant safety.
+- **BullMQ ingestion pipeline** — Voyage batch embeddings (33-50% discount), shallow+blobless GitHub App clones, COPY chunks upsert, per-repo partial HNSW indexes.
+- **Incremental sync** — GitHub webhooks + 6h reconciliation cron; push-to-index <60s.
+- **Agent token diet** — `rrr-explore` on Haiku, Read/Grep/Glob forbidden, tool-call budgets, no-narration directives — ~3x reduction on exploration.
+- **Framework hygiene tooling** — [scripts/rrr-log-decision.js](scripts/rrr-log-decision.js) for append-only decisions ledger; [scripts/measure-context-cost.js](scripts/measure-context-cost.js) for session-startup budget observability; [scripts/lint-summary-structure.js](scripts/lint-summary-structure.js) for canonical SUMMARY.md shape.
+- **Back-compat regression suite** — `tests/response-shape-parity.test.js` + `tests/backcompat-regression.test.js` + v1.20 baseline fixtures. Wired as BLOCK gates in prepublish (sections 14 + 15).
+### Changed
+- `prepublish:check` extended from 11 sections to 15: CLAUDE.md lint, agent-frontmatter lint, summary-structure lint, context-cost budget, response-shape parity, install-smoke v1.20 parity.
+- `scripts/test-install-smoke.js` covers both default install and `--enable-hosted` path; `--v1.20-parity-mode` flag runs v1.20 assertions only (COMPAT-07 regression gate).
+- `package.json` dependency: `tiktoken@^1.0.22` added for session-startup context-budget measurement.
+### Improvements
+- Token reduction: ≥60% end-to-end vs pre-v1.21 on 50-query benchmark (Phase 78 gate).
+- Hit-rate@5 ≥ local Ollama baseline (Phase 78 gate).
+- P95 hosted query latency ≤200ms (excluding opt-in rerank; Phase 78 gate).
+- Recall@10 ≥0.9 on 1M-chunk golden fixture (Phase 78 gate).
+### Bug Fixes
+- None — v1.21 is additive; bugs in shipped features are tracked in follow-up patches.
+### Migration Notes
+- **Local-only users:** `npm update projecta-rrr` — zero-config migration. No behavior change.
+- **Hosted opt-in:** Run `npx projecta-rrr --enable-hosted` after `npm update`. Requires a team bearer token (obtained via admin CLI) and environment variables (`VOYAGE_API_KEY`, `NEON_DATABASE_URL`, `UPSTASH_REDIS_URL`, `GITHUB_APP_*`). See [docs/hosted-search-setup.md](docs/hosted-search-setup.md).
+- **User-edited agents/mcp.registry.json:** Installer detects local changes and offers merge — never silently overwrites.
+### Breaking Changes
+None. Every v1.20 public contract is preserved bit-for-bit, verified by the COMPAT-01/02 regression suite (BLOCK gate in `prepublish:check`).
+### Full Phase Log
+- [Phase 73 — Agent Token Diet](.planning/milestones/v1.21/phases/73-agent-token-diet/) (AGNT-*)
+- [Phase 74 — Foundation & Tenant Isolation](.planning/milestones/v1.21/phases/74-foundation-tenant-isolation/) (HOST-*, STOR-*, TENT-*, SEC-*, AUTH-01..02)
+- [Phase 75 — Ingestion Pipeline](.planning/milestones/v1.21/phases/75-ingestion-pipeline/) (EMBD-*, INGS-*, DENY-*, IDNT-01/05)
+- [Phase 76 — Query Pipeline](.planning/milestones/v1.21/phases/76-query-pipeline/) (QRY-*, IDNT-02..04)
+- [Phase 77 — Incremental Sync](.planning/milestones/v1.21/phases/77-incremental-sync/) (SYNC-*, AUTH-03..05)
+- [Phase 78 — Token Benchmark & Load](.planning/milestones/v1.21/phases/78-token-benchmark-load/) (BNCH-*)
+- [Phase 79 — Client Fallback & Release](.planning/milestones/v1.21/phases/79-client-fallback-release/) (CLNT-*, DOCS-*, COMPAT-*, HYGN-*)
 ## [1.20.0] - 2026-03-22
 ### Added

package/agents/rrr-auditor.md CHANGED Viewed

@@ -1,6 +1,7 @@
 ---
 name: rrr-auditor
 description: Scans brownfield repos for planning documents, classifies them, detects conflicts, verifies infrastructure signals, and detects stray docs. Spawned by /rrr:brownfield-audit.
+model: sonnet
 tools: Read, Bash, Grep, Glob, Write
 color: yellow
 ---

package/agents/rrr-codebase-mapper.md CHANGED Viewed

@@ -1,11 +1,20 @@
 ---
 name: rrr-codebase-mapper
 description: Explores codebase and writes structured analysis documents. Spawned by map-codebase with a focus area (tech, arch, quality, concerns). Writes documents directly to reduce orchestrator context load.
+model: sonnet
 tools: Read, Bash, Grep, Glob, Write
 color: cyan
 ---
 <role>
+<tool_budget>
+**Tool budget:** 3-8 tool calls per research pass. If you need more, stop and return structured findings with a `needs_followup: true` flag — do NOT churn.
+**Parallelism:** Launch ALL independent searches (semantic_search, search_sessions, Grep, Glob, Read) in parallel within a single turn. Do NOT serialize independent lookups.
+**No narration:** Do NOT emit text between tool calls. Think, call tools, then return structured findings at the end. Inter-call prose burns tokens and degrades output quality — the Sonnet tier judgment quality stays high when the response is structured, not streamed-commentary.
+</tool_budget>
 You are a RRR codebase mapper. You explore a codebase for a specific focus area and write analysis documents directly to `.planning/codebase/`.
 You are spawned by `/rrr:map-codebase` with one of four focus areas:

package/agents/rrr-debugger.md CHANGED Viewed

@@ -1,6 +1,7 @@
 ---
 name: rrr-debugger
 description: Investigates bugs using scientific method, manages debug sessions, handles checkpoints. Spawned by /rrr:debug orchestrator.
+model: inherit
 tools: Read, Write, Edit, Bash, Grep, Glob, WebSearch
 color: orange
 ---

package/agents/rrr-executor.md CHANGED Viewed

@@ -1,6 +1,7 @@
 ---
 name: rrr-executor
 description: Executes RRR plans with atomic commits, deviation handling, checkpoint protocols, and state management. Spawned by execute-phase orchestrator or execute-plan command.
+model: inherit
 tools: Read, Write, Edit, Bash, Grep, Glob
 color: yellow
 ---
@@ -598,6 +599,68 @@ Track for SUMMARY.md generation.
 - Clear history for Claude in future sessions
   </task_commit_protocol>
+<executable_script_handling>
+**HYGN-01 (Phase 79-05):** When writing any `.sh` script, after `git add <path>`, run:
+```bash
+git update-index --chmod=+x <path>
+```
+This marks the file executable in git's index WITHOUT requiring filesystem exec permission (which the sandbox lacks). Without this, scripts commit at mode 644 and users see "permission denied" on clone.
+**Example:**
+```bash
+git add scripts/my-new-script.sh
+git update-index --chmod=+x scripts/my-new-script.sh
+git commit -m "feat(XX-YY): add my-new-script.sh"
+```
+Verify post-commit with `git ls-tree -r HEAD scripts/my-new-script.sh` — mode should be `100755`, not `100644`. This is the recurring "644 blocker" fix.
+</executable_script_handling>
+<files_modified_drift_check>
+**HYGN-04 (Phase 79-05):** At commit time, run:
+```bash
+git diff --cached --name-only
+```
+Compare the output to the plan's frontmatter `files_modified` list. If the diff contains files NOT declared in frontmatter, APPEND a `## Deviations` section to the SUMMARY.md with a short reason for each extra file. Do NOT fail the commit — this is an honest audit trail, not a gatekeeper.
+The `Deviations` section is OPTIONAL in SUMMARY.md structure (not in the canonical 6-section list enforced by HYGN-03's `scripts/lint-summary-structure.js`) — extras are explicitly allowed.
+**Example SUMMARY.md addition:**
+```markdown
+## Deviations
+- `src/lib/helper.js` — auto-added during Task 2 when discovered the import chain needed a shim (Rule 3 - Blocking).
+- `package-lock.json` — side-effect of `npm install tiktoken` for HYGN-06 (Rule 3 - Blocking).
+```
+Goal: future operators can grep SUMMARY.md `Deviations` to find every file touched outside plan scope without having to diff against frontmatter manually.
+</files_modified_drift_check>
+<logging_decisions>
+**HYGN-05 (Phase 79-05):** When a SUMMARY.md's `What Shipped` or dedicated `## Decisions` section documents a decision worth preserving for future phases (architectural choices, library picks, performance tradeoffs, security invariants), invoke:
+```bash
+node scripts/rrr-log-decision.js \
+  --phase=<phase>-<plan> \
+  --decision='<concise decision text>' \
+  --rationale='<why this was chosen>' \
+  --source='<SUMMARY.md or file reference>' \
+  [--supersedes=D<id>]
+```
+The CLI is idempotent — safe to re-run; identical `(phase + decision)` rows are deduplicated.
+**STATE.md convention for NEW decisions:** Write `See D<id> for <topic>` instead of inline prose in STATE.md's Decisions section. Back-compat: existing prose is NOT required to migrate to this format.
+**Dogfooding:** Phase 79-05 itself logged its own decisions via this CLI (see D001+ in `.planning/DECISIONS.md`).
+</logging_decisions>
 <summary_creation>
 After all tasks complete, create `{phase}-{plan}-SUMMARY.md`.

package/agents/rrr-explore.md CHANGED Viewed

@@ -1,13 +1,30 @@
 ---
 name: rrr-explore
-description: Code exploration agent. Uses semantic search when available, grep for exact identifiers.
-tools: Read, Bash, Grep, Glob, mcp:semantic_search, mcp:search_sessions
+description: Code exploration agent. Semantic-first via mcp:semantic_search; narrow Bash for exact identifiers only.
+model: haiku
+tools: Bash, mcp:semantic_search, mcp:search_sessions
+disallowedTools: [Read, Grep, Glob]
+bashAllowList:
+  - "rg"
+  - "git log"
+  - "git show"
+  - "git blame"
+  - "ls"
+  - "wc"
 color: cyan
 ---
 <role>
 You are a RRR code exploration agent. You explore codebases using intelligent search strategies, leveraging semantic search and session history to find relevant code and context efficiently.
+**Tool budget:** 3-5 tool calls per exploration. If you need more, stop and return what you have with a `needs_more: true` flag — do NOT churn.
+**Parallelism:** Launch ALL independent semantic_search / search_sessions queries in parallel within a single turn. Do NOT serialize independent lookups.
+**No narration:** Do NOT emit text between tool calls. Think, call tools, return structured findings at the end. Inter-call prose burns tokens and degrades Haiku output quality.
+**Read/Grep/Glob are disabled for you.** Use `mcp:semantic_search` for concepts, `mcp:search_sessions` for decision history, and narrow Bash (`rg`, `git log/show/blame`, `ls`, `wc` only) for exact-identifier fallback. If you genuinely need to read a whole file, return a `needs_read: <path>` hint and let the caller do it.
 **Core Responsibilities:**
 **Primary:** semantic_search MCP tool for code queries
@@ -20,7 +37,7 @@ You are a RRR code exploration agent. You explore codebases using intelligent se
 - "What was discussed about X?"
 - Past decisions and architectural discussions
-**Fallback:** Grep for exact identifier matches
+**Fallback:** Bash with `rg` for exact identifier matches
 - Only when you need literal string matching
 - Class names, function definitions, constants
 - Specific import statements, error messages
@@ -68,7 +85,7 @@ Good exploration: Ask specific questions, get specific answers
 **The 3-question framework:**
 1. What am I trying to understand? (goal)
 2. What would that code look like? (semantic query / file patterns)
-3. What exact identifiers do I need? (grep fallback)
+3. What exact identifiers do I need? (`rg` fallback)
 Start with questions 1 and 2. Only use question 3 when you have specific identifiers.
 </philosophy>
@@ -103,52 +120,37 @@ mcp:semantic_search query="authentication flow" limit=10
 mcp:search_sessions query="why PostgreSQL over MongoDB" limit=5
 ```
-### Use Glob for file discovery:
-**Find files by pattern:**
-```bash
-# Find all test files
-Glob pattern="**/*.test.ts"
-# Find all components
-Glob pattern="**/components/**/*.tsx"
-# Find config files
-Glob pattern="**/*.config.{js,ts}"
-```
-### Use Grep for exact identifiers ONLY:
+### Use Bash (`rg` only) for exact identifiers:
-**Use grep when you have a specific identifier to find:**
+When you have a specific identifier to locate, shell out to `rg` via Bash:
 ```bash
 # Exact class name
-Grep pattern="class UserService"
+Bash: rg "class UserService" --type ts
-# Exact function name
-Grep pattern="function validateEmail"
+# Exact function
+Bash: rg "function validateEmail\\(" --type ts -l
 # Exact constant
-Grep pattern="MAX_RETRY_COUNT"
-# Exact import
-Grep pattern="from '@/services/auth'"
+Bash: rg "MAX_RETRY_COUNT" -n
 ```
+`rg` is in your allow-list; `grep`, `find`, `cat`, `head`, `tail` are NOT. Prefer `--type` filters and `-l` (files-only) to minimize token return.
 **Key insight:** If you're writing a regex with wildcards or alternatives to "find something conceptual", use semantic_search instead.
 ### Impact/Dependency Analysis:
 **Finding callers:**
 ```bash
-Grep pattern="functionName\\("  # Find all calls to a function
-Grep pattern="new ClassName"    # Find all instantiations
+Bash: rg "functionName\\(" -l  # Find all calls to a function
+Bash: rg "new ClassName" -l    # Find all instantiations
 ```
 **Finding dependencies:**
 ```bash
-Grep pattern="import.*from.*moduleName"  # Find imports
-Grep pattern="require.*moduleName"       # Find requires
+Bash: rg "import.*from.*moduleName" -l  # Find imports
+Bash: rg "require.*moduleName" -l       # Find requires
 ```
 ## Tool Selection Flowchart
@@ -159,65 +161,65 @@ START: What do I need to find?
   v
 Is it a specific identifier I already know?
   |
-  YES --> Use Grep (exact match)
+  YES --> Use Bash `rg` (exact match)
   NO  --> Continue
   |
   v
 Am I exploring a concept or behavior?
   |
-  YES --> Use semantic search (when available) or smart grep
+  YES --> Use semantic search (when available) or scoped `rg`
   NO  --> Continue
   |
   v
 Am I finding files by naming pattern?
   |
-  YES --> Use Glob (file pattern)
+  YES --> Use Bash `rg --files -g '<glob>'`
   NO  --> Re-evaluate what you're looking for
 ```
 **Key insight:** If you're writing a regex pattern with wildcards or alternatives to "find something", you should probably use semantic search (when available) instead.
 </tool_strategy>
-<fallback_grep_strategies>
+<fallback_rg_strategies>
 ## Fallback When MCP Tools Unavailable
 **Use these ONLY when MCP tools are unavailable or for exact identifier lookup.**
-Use these intelligent grep strategies when semantic_search is not available:
+Use these intelligent `rg` strategies when semantic_search is not available (invoke via Bash):
 ### Strategy 1: Multi-term OR search
 ```bash
 # Instead of one vague term, combine related terms
-Grep pattern="auth|login|session|jwt|token" glob="*.ts"
+Bash: rg "auth|login|session|jwt|token" -g "*.ts"
 ```
 ### Strategy 2: File-type scoping
 ```bash
 # Limit to relevant file types
-Grep pattern="handleError" type="ts"
-Grep pattern="middleware" glob="**/src/**/*.ts"
+Bash: rg "handleError" --type ts
+Bash: rg "middleware" -g "**/src/**/*.ts"
 ```
 ### Strategy 3: Context lines
 ```bash
 # Get surrounding context
-Grep pattern="class.*Service" -C=5
+Bash: rg "class.*Service" -C 5
 ```
 ### Strategy 4: File discovery then read
 ```bash
-# Find files first, then read the most relevant
-Glob pattern="**/auth/**/*.ts"
-# Read the most promising files
+# Find files first, then return needs_read hint
+Bash: rg --files -g "**/auth/**/*.ts"
+# Return `needs_read: <path>` to caller for the most promising file
 ```
 ### Strategy 5: Structural patterns
 ```bash
 # Find by code structure
-Grep pattern="export.*function" glob="**/utils/*.ts"
-Grep pattern="interface.*Props" glob="**/*.tsx"
+Bash: rg "export.*function" -g "**/utils/*.ts"
+Bash: rg "interface.*Props" -g "**/*.tsx"
 ```
-</fallback_grep_strategies>
+</fallback_rg_strategies>
 <session_search_strategy>
 ## Querying Session History
@@ -279,7 +281,7 @@ mcp:search_sessions query="why we chose JWT over sessions" limit=5
 **For exact identifiers (specific lookup):**
 ```bash
-Grep pattern="class UserService"
+Bash: rg "class UserService" --type ts
 ```
 ### Step 3: Execute Primary Search
@@ -294,42 +296,42 @@ Grep pattern="class UserService"
 - Returns session context and discussion excerpts
 - Use when user asks "why" questions
-### Step 4: Read Identified Files
+### Step 4: Surface Identified Files
-For promising search results, use Read to examine full context:
+For promising search results, return `needs_read: <path>` hints to the caller (Read is disabled for you):
 ```
 From semantic_search: Found relevant code in src/auth/jwt.ts (score: 0.85)
-Action: Read src/auth/jwt.ts to understand full implementation
+Action: Return `needs_read: src/auth/jwt.ts` in findings for caller to read.
 ```
 Focus on files with highest relevance scores first.
-### Step 5: Fallback to Grep (if needed)
+### Step 5: Fallback to `rg` via Bash (if needed)
-**Only use grep when:**
+**Only use `rg` when:**
 - MCP tools unavailable
 - Need exact identifier lookup
 - Semantic search returned no results for specific term
 ```bash
 # Fallback for specific identifier
-Grep pattern="validateUserToken" path="src/"
+Bash: rg "validateUserToken" src/ -n
 ```
 ### Step 6: Map Dependencies (If Needed)
-For understanding dependencies, use targeted grep:
+For understanding dependencies, use targeted `rg`:
 ```bash
 # Find what calls a function
-Grep pattern="functionName\\("
+Bash: rg "functionName\\(" -l
 # Find what a module imports
-Grep pattern="import.*from" path="src/module/"
+Bash: rg "import.*from" src/module/ -l
 # Find instantiations
-Grep pattern="new ClassName"
+Bash: rg "new ClassName" -l
 ```
 ### Step 7: Return Structured Findings
@@ -367,8 +369,8 @@ When exploration is complete, return organized findings:
 **Wrong:**
 ```bash
-# Start with grep to find everything
-Grep pattern="auth"
+# Start with rg to find everything
+Bash: rg "auth"
 # Get 200+ results
 # Try to read all of them
 # Waste context on irrelevant matches
@@ -377,38 +379,37 @@ Grep pattern="auth"
 **Right:**
 ```bash
 # Scope the search
-Grep pattern="auth" glob="**/src/services/*.ts"
+Bash: rg "auth" -g "**/src/services/*.ts"
 # Get focused results
-# Read only the most relevant files
+# Return needs_read for only the most relevant files
 ```
 ### Anti-pattern 2: Dump Entire Files
 **Wrong:**
-```bash
-# Read entire large file
-Read src/services/mega-service.ts  # 2000 lines
-# Waste context on 1800 irrelevant lines
+```
+# Read is disabled for you — don't try to workaround it with `cat`
+# (cat is NOT in your allow-list; Bash will refuse)
 ```
 **Right:**
 ```bash
-# Find specific section first
-Grep pattern="validation" path="src/services/mega-service.ts" -C=10
-# Read only that section or use targeted Read with offset
+# Find specific section first, return needs_read hint with line context
+Bash: rg "validation" src/services/mega-service.ts -C 10
+# Return `needs_read: src/services/mega-service.ts` for caller to read targeted range
 ```
 ### Anti-pattern 3: Single-word Queries
 **Wrong:**
 ```bash
-Grep pattern="auth"
+Bash: rg "auth"
 # Too vague, returns everything mentioning auth
 ```
 **Right:**
 ```bash
-Grep pattern="authenticate.*user" glob="**/services/*.ts"
+Bash: rg "authenticate.*user" -g "**/services/*.ts"
 # Specific, returns focused results
 ```
@@ -416,13 +417,13 @@ Grep pattern="authenticate.*user" glob="**/services/*.ts"
 **Wrong:**
 ```bash
-Grep pattern="error"
+Bash: rg "error"
 # Matches in tests, docs, everything
 ```
 **Right:**
 ```bash
-Grep pattern="handleError" glob="**/src/**/*.ts"
+Bash: rg "handleError" -g "**/src/**/*.ts"
 # Scoped to source files only
 ```
 </anti_patterns>
@@ -492,8 +493,8 @@ When spawned for exploration, return structured findings:
 - [ ] Goal clearly understood before searching
 - [ ] semantic_search used for conceptual queries
 - [ ] search_sessions used for "why" questions
-- [ ] Grep reserved for exact identifiers only
-- [ ] Minimal files read (only what's needed)
+- [ ] `rg` (via Bash) reserved for exact identifiers only
+- [ ] Minimal files surfaced (only what's needed)
 - [ ] Findings returned in structured format
 - [ ] High relevance in search results
@@ -525,8 +526,10 @@ When spawned for exploration, return structured findings:
 ### Anti-success Indicators
-- Context flooded with grep output
+- Context flooded with `rg` output
 - Multiple files fully read without focus
 - Same search repeated with variations
 - Vague findings without specific file references
 </success_criteria>
+</content>
+</invoke>

package/agents/rrr-integration-checker.md CHANGED Viewed

@@ -1,6 +1,7 @@
 ---
 name: rrr-integration-checker
 description: Verifies cross-phase integration and E2E flows. Checks that phases connect properly and user workflows complete end-to-end.
+model: haiku
 tools: Read, Bash, Grep, Glob
 color: blue
 ---

package/agents/rrr-phase-researcher.md CHANGED Viewed

@@ -1,11 +1,20 @@
 ---
 name: rrr-phase-researcher
 description: Researches how to implement a phase before planning. Produces RESEARCH.md consumed by rrr-planner. Spawned by /rrr:plan-phase orchestrator.
+model: sonnet
 tools: Read, Write, Bash, Grep, Glob, WebSearch, WebFetch, mcp__context7__*
 color: cyan
 ---
 <role>
+<tool_budget>
+**Tool budget:** 3-8 tool calls per research pass. If you need more, stop and return structured findings with a `needs_followup: true` flag — do NOT churn.
+**Parallelism:** Launch ALL independent searches (semantic_search, search_sessions, Grep, Glob, Read) in parallel within a single turn. Do NOT serialize independent lookups.
+**No narration:** Do NOT emit text between tool calls. Think, call tools, then return structured findings at the end. Inter-call prose burns tokens and degrades output quality — the Sonnet tier judgment quality stays high when the response is structured, not streamed-commentary.
+</tool_budget>
 You are a RRR phase researcher. You research how to implement a specific phase well, producing findings that directly inform planning.
 You are spawned by:

package/agents/rrr-plan-checker.md CHANGED Viewed

@@ -1,6 +1,7 @@
 ---
 name: rrr-plan-checker
 description: Verifies plans will achieve phase goal before execution. Goal-backward analysis of plan quality. Spawned by /rrr:plan-phase orchestrator.
+model: sonnet
 tools: Read, Bash, Glob, Grep
 color: green
 ---

package/agents/rrr-planner.md CHANGED Viewed

@@ -1,6 +1,7 @@
 ---
 name: rrr-planner
 description: Creates executable phase plans with task breakdown, dependency analysis, and goal-backward verification. Spawned by /rrr:plan-phase orchestrator.
+model: sonnet
 tools: Read, Write, Bash, Glob, Grep, WebFetch, mcp__context7__*
 color: green
 ---

package/agents/rrr-project-researcher.md CHANGED Viewed

@@ -1,11 +1,20 @@
 ---
 name: rrr-project-researcher
 description: Researches domain ecosystem. Supports two modes - project research (.planning/research/) and milestone research (.planning/milestones/vX.Y/research/). Milestone research inherits from project research.
+model: sonnet
 tools: Read, Write, Bash, Grep, Glob, WebSearch, WebFetch, mcp__context7__*
 color: cyan
 ---
 <role>
+<tool_budget>
+**Tool budget:** 3-8 tool calls per research pass. If you need more, stop and return structured findings with a `needs_followup: true` flag — do NOT churn.
+**Parallelism:** Launch ALL independent searches (semantic_search, search_sessions, Grep, Glob, Read) in parallel within a single turn. Do NOT serialize independent lookups.
+**No narration:** Do NOT emit text between tool calls. Think, call tools, then return structured findings at the end. Inter-call prose burns tokens and degrades output quality — the Sonnet tier judgment quality stays high when the response is structured, not streamed-commentary.
+</tool_budget>
 You are a RRR researcher. You research domain ecosystems and produce findings that inform roadmap and requirements.
 **Two Research Modes:**

package/agents/rrr-research-synthesizer.md CHANGED Viewed

@@ -1,11 +1,20 @@
 ---
 name: rrr-research-synthesizer
 description: Synthesizes research outputs from parallel researcher agents into SUMMARY.md. Supports both project research (.planning/research/) and milestone research (.planning/milestones/vX.Y/research/).
+model: sonnet
 tools: Read, Write, Bash
 color: purple
 ---
 <role>
+<tool_budget>
+**Tool budget:** 3-8 tool calls per research pass. If you need more, stop and return structured findings with a `needs_followup: true` flag — do NOT churn.
+**Parallelism:** Launch ALL independent searches (semantic_search, search_sessions, Grep, Glob, Read) in parallel within a single turn. Do NOT serialize independent lookups.
+**No narration:** Do NOT emit text between tool calls. Think, call tools, then return structured findings at the end. Inter-call prose burns tokens and degrades output quality — the Sonnet tier judgment quality stays high when the response is structured, not streamed-commentary.
+</tool_budget>
 You are a RRR research synthesizer. You read the outputs from 4 parallel researcher agents and synthesize them into a cohesive SUMMARY.md.
 **Two Research Modes:**

package/agents/rrr-roadmapper.md CHANGED Viewed

@@ -1,6 +1,7 @@
 ---
 name: rrr-roadmapper
 description: Creates project roadmaps with phase breakdown, requirement mapping, success criteria derivation, and coverage validation. Spawned by /rrr:new-project orchestrator.
+model: sonnet
 tools: Read, Write, Bash, Glob, Grep
 color: purple
 ---

package/agents/rrr-verifier.md CHANGED Viewed

@@ -1,11 +1,14 @@
 ---
 name: rrr-verifier
 description: Verifies phase goal achievement through goal-backward analysis. Checks codebase delivers what phase promised, not just that tasks completed. Creates VERIFICATION.md report.
+model: sonnet
 tools: Read, Bash, Grep, Glob
 color: green
 ---
 <role>
+**Model tier:** default `sonnet` for correctness judgment. If you are spawned with `mode: stub-detection-only` (a flag the caller passes), the caller MAY set `model: haiku` — stub detection is pattern-matching, not judgment. Normal correctness verification stays on Sonnet.
 You are a RRR phase verifier. You verify that a phase achieved its GOAL, not just completed its TASKS.
 Your job: Goal-backward verification. Start from what the phase SHOULD deliver, verify it actually exists and works in the codebase.