npm - claude-code-pilot - Versions diffs - 3.1.0 → 3.2.0 - Mend

claude-code-pilot 3.1.0 → 3.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (110) hide show

package/README.md +11 -11
package/bin/install.js +20 -2
package/manifest.json +5 -1
package/package.json +18 -6
package/src/agents/a11y-architect.md +141 -0
package/src/agents/code-architect.md +71 -0
package/src/agents/code-explorer.md +69 -0
package/src/agents/code-simplifier.md +47 -0
package/src/agents/comment-analyzer.md +45 -0
package/src/agents/csharp-reviewer.md +101 -0
package/src/agents/dart-build-resolver.md +201 -0
package/src/agents/pr-test-analyzer.md +45 -0
package/src/agents/silent-failure-hunter.md +50 -0
package/src/agents/type-design-analyzer.md +41 -0
package/src/available-rules/README.md +3 -1
package/src/available-rules/dart/coding-style.md +159 -0
package/src/available-rules/dart/hooks.md +66 -0
package/src/available-rules/dart/patterns.md +261 -0
package/src/available-rules/dart/security.md +135 -0
package/src/available-rules/dart/testing.md +215 -0
package/src/available-rules/web/coding-style.md +105 -0
package/src/available-rules/web/design-quality.md +72 -0
package/src/available-rules/web/hooks.md +129 -0
package/src/available-rules/web/patterns.md +88 -0
package/src/available-rules/web/performance.md +73 -0
package/src/available-rules/web/security.md +66 -0
package/src/available-rules/web/testing.md +64 -0
package/src/commands/ccp/ai-integration-phase.md +36 -0
package/src/commands/ccp/audit-fix.md +33 -0
package/src/commands/ccp/code-review-fix.md +52 -0
package/src/commands/ccp/eval-review.md +32 -0
package/src/commands/ccp/extract_learnings.md +22 -0
package/src/commands/ccp/import.md +37 -0
package/src/commands/ccp/ingest-docs.md +42 -0
package/src/commands/ccp/intel.md +179 -0
package/src/commands/ccp/plan-review-convergence.md +58 -0
package/src/commands/ccp/scan.md +26 -0
package/src/commands/ccp/sketch-wrap-up.md +31 -0
package/src/commands/ccp/sketch.md +54 -0
package/src/commands/ccp/spec-phase.md +62 -0
package/src/commands/ccp/spike-wrap-up.md +31 -0
package/src/commands/ccp/spike.md +51 -0
package/src/commands/ccp/ultraplan-phase.md +33 -0
package/src/hooks/ccp-read-injection-scanner.js +152 -0
package/src/hooks/kit-check-update.js +59 -7
package/src/hooks/run-with-flags-shell.sh +1 -0
package/src/hooks/run-with-flags.js +48 -1
package/src/hooks/session-end.js +88 -1
package/src/lib/hook-flags.js +14 -0
package/src/pilot/references/agent-contracts.md +79 -0
package/src/pilot/references/ai-evals.md +156 -0
package/src/pilot/references/ai-frameworks.md +186 -0
package/src/pilot/references/doc-conflict-engine.md +91 -0
package/src/pilot/references/gate-prompts.md +100 -0
package/src/pilot/references/gates.md +70 -0
package/src/pilot/references/mandatory-initial-read.md +2 -0
package/src/pilot/references/project-skills-discovery.md +19 -0
package/src/pilot/references/revision-loop.md +97 -0
package/src/pilot/references/sketch-interactivity.md +41 -0
package/src/pilot/references/sketch-theme-system.md +94 -0
package/src/pilot/references/sketch-tooling.md +45 -0
package/src/pilot/references/sketch-variant-patterns.md +81 -0
package/src/pilot/references/thinking-models-debug.md +44 -0
package/src/pilot/references/thinking-models-execution.md +50 -0
package/src/pilot/references/thinking-models-planning.md +62 -0
package/src/pilot/references/thinking-models-research.md +50 -0
package/src/pilot/references/thinking-models-verification.md +55 -0
package/src/pilot/templates/AI-SPEC.md +246 -0
package/src/pilot/templates/spec.md +307 -0
package/src/pilot/workflows/ai-integration-phase.md +284 -0
package/src/pilot/workflows/audit-fix.md +175 -0
package/src/pilot/workflows/code-review-fix.md +497 -0
package/src/pilot/workflows/eval-review.md +155 -0
package/src/pilot/workflows/extract_learnings.md +242 -0
package/src/pilot/workflows/import.md +246 -0
package/src/pilot/workflows/ingest-docs.md +328 -0
package/src/pilot/workflows/plan-review-convergence.md +329 -0
package/src/pilot/workflows/scan.md +102 -0
package/src/pilot/workflows/sketch-wrap-up.md +285 -0
package/src/pilot/workflows/sketch.md +360 -0
package/src/pilot/workflows/spec-phase.md +262 -0
package/src/pilot/workflows/spike-wrap-up.md +306 -0
package/src/pilot/workflows/spike.md +452 -0
package/src/pilot/workflows/ultraplan-phase.md +189 -0
package/src/skills/accessibility/SKILL.md +146 -0
package/src/skills/agent-eval/SKILL.md +145 -0
package/src/skills/agent-introspection-debugging/SKILL.md +153 -0
package/src/skills/android-clean-architecture/SKILL.md +339 -0
package/src/skills/api-connector-builder/SKILL.md +120 -0
package/src/skills/code-tour/SKILL.md +236 -0
package/src/skills/compose-multiplatform-patterns/SKILL.md +299 -0
package/src/skills/csharp-testing/SKILL.md +321 -0
package/src/skills/dart-flutter-patterns/SKILL.md +563 -0
package/src/skills/dashboard-builder/SKILL.md +108 -0
package/src/skills/dotnet-patterns/SKILL.md +321 -0
package/src/skills/frontend-design/SKILL.md +145 -0
package/src/skills/frontend-slides/SKILL.md +184 -0
package/src/skills/frontend-slides/STYLE_PRESETS.md +330 -0
package/src/skills/gateguard/SKILL.md +121 -0
package/src/skills/github-ops/SKILL.md +144 -0
package/src/skills/hookify-rules/SKILL.md +128 -0
package/src/skills/knowledge-ops/SKILL.md +154 -0
package/src/skills/liquid-glass-design/SKILL.md +279 -0
package/src/skills/nestjs-patterns/SKILL.md +230 -0
package/src/skills/security-bounty-hunter/SKILL.md +99 -0
package/src/skills/swift-actor-persistence/SKILL.md +143 -0
package/src/skills/swift-protocol-di-testing/SKILL.md +190 -0
package/src/skills/swiftui-patterns/SKILL.md +259 -0
package/src/skills/terminal-ops/SKILL.md +109 -0
package/src/skills/ui-demo/SKILL.md +465 -0

package/src/pilot/templates/AI-SPEC.md ADDED Viewed

@@ -0,0 +1,246 @@
+# AI-SPEC — Phase {N}: {phase_name}
+> AI design contract generated by `/ccp:ai-integration-phase`. Consumed by `gsd-planner` and `gsd-eval-auditor`.
+> Locks framework selection, implementation guidance, and evaluation strategy before planning begins.
+---
+## 1. System Classification
+**System Type:** <!-- RAG | Multi-Agent | Conversational | Extraction | Autonomous Agent | Content Generation | Code Automation | Hybrid -->
+**Description:**
+<!-- One-paragraph description of what this AI system does, who uses it, and what "good" looks like -->
+**Critical Failure Modes:**
+<!-- The 3-5 behaviors that absolutely cannot go wrong in this system -->
+1.
+2.
+3.
+---
+## 1b. Domain Context
+> Researched by `gsd-domain-researcher`. Grounds the evaluation strategy in domain expert knowledge.
+**Industry Vertical:** <!-- healthcare | legal | finance | customer service | education | developer tooling | e-commerce | etc. -->
+**User Population:** <!-- who uses this system and in what context -->
+**Stakes Level:** <!-- Low | Medium | High | Critical -->
+**Output Consequence:** <!-- what happens downstream when the AI output is acted on -->
+### What Domain Experts Evaluate Against
+<!-- Domain-specific rubric ingredients — in practitioner language, not AI jargon -->
+<!-- Format: Dimension / Good (expert accepts) / Bad (expert flags) / Stakes / Source -->
+### Known Failure Modes in This Domain
+<!-- Domain-specific failure modes from research — not generic hallucination, but how it manifests here -->
+### Regulatory / Compliance Context
+<!-- Relevant regulations or constraints — or "None identified" if genuinely none apply -->
+### Domain Expert Roles for Evaluation
+| Role | Responsibility |
+|------|---------------|
+| <!-- e.g., Senior practitioner --> | <!-- Dataset labeling / rubric calibration / production sampling --> |
+---
+## 2. Framework Decision
+**Selected Framework:** <!-- e.g., LlamaIndex v0.10.x -->
+**Version:** <!-- Pin the version -->
+**Rationale:**
+<!-- Why this framework fits this system type, team context, and production requirements -->
+**Alternatives Considered:**
+| Framework | Ruled Out Because |
+|-----------|------------------|
+| | |
+**Vendor Lock-In Accepted:** <!-- Yes / No / Partial — document the trade-off consciously -->
+---
+## 3. Framework Quick Reference
+> Fetched from official docs by `gsd-ai-researcher`. Distilled for this specific use case.
+### Installation
+```bash
+# Install command(s)
+```
+### Core Imports
+```python
+# Key imports for this use case
+```
+### Entry Point Pattern
+```python
+# Minimal working example for this system type
+```
+### Key Abstractions
+<!-- Framework-specific concepts the developer must understand before coding -->
+| Concept | What It Is | When You Use It |
+|---------|-----------|-----------------|
+| | | |
+### Common Pitfalls
+<!-- Gotchas specific to this framework and system type — from docs, issues, and community reports -->
+1.
+2.
+3.
+### Recommended Project Structure
+```
+project/
+├── # Framework-specific folder layout
+```
+---
+## 4. Implementation Guidance
+**Model Configuration:**
+<!-- Which model(s), temperature, max tokens, and other key parameters -->
+**Core Pattern:**
+<!-- The primary implementation pattern for this system type in this framework -->
+**Tool Use:**
+<!-- Tools/integrations needed and how to configure them -->
+**State Management:**
+<!-- How state is persisted, retrieved, and updated -->
+**Context Window Strategy:**
+<!-- How to manage context limits for this system type -->
+---
+## 4b. AI Systems Best Practices
+> Written by `gsd-ai-researcher`. Cross-cutting patterns every developer building AI systems needs — independent of framework choice.
+### Structured Outputs with Pydantic
+<!-- Framework-specific Pydantic integration pattern for this use case -->
+<!-- Include: output model definition, how the framework uses it, retry logic on validation failure -->
+```python
+# Pydantic output model for this system type
+```
+### Async-First Design
+<!-- How async is handled in this framework, the one common mistake, and when to stream vs. await -->
+### Prompt Engineering Discipline
+<!-- System vs. user prompt separation, few-shot guidance, token budget strategy -->
+### Context Window Management
+<!-- Strategy specific to this system type: RAG chunking / conversation summarisation / agent compaction -->
+### Cost and Latency Budget
+<!-- Per-call cost estimate, caching strategy, sub-task model routing -->
+---
+## 5. Evaluation Strategy
+### Dimensions
+| Dimension | Rubric (Pass/Fail or 1-5) | Measurement Approach | Priority |
+|-----------|--------------------------|---------------------|----------|
+| | | Code / LLM Judge / Human | Critical / High / Medium |
+### Eval Tooling
+**Primary Tool:** <!-- e.g., RAGAS + Langfuse -->
+**Setup:**
+```bash
+# Install and configure
+```
+**CI/CD Integration:**
+```bash
+# Command to run evals in CI/CD pipeline
+```
+### Reference Dataset
+**Size:** <!-- e.g., 20 examples to start -->
+**Composition:**
+<!-- What scenario types the dataset covers: critical paths, edge cases, failure modes -->
+**Labeling:**
+<!-- Who labels examples and how (domain expert, LLM judge with calibration, etc.) -->
+---
+## 6. Guardrails
+### Online (Real-Time)
+| Guardrail | Trigger | Intervention |
+|-----------|---------|--------------|
+| | | Block / Escalate / Flag |
+### Offline (Flywheel)
+| Metric | Sampling Strategy | Action on Degradation |
+|--------|------------------|----------------------|
+| | | |
+---
+## 7. Production Monitoring
+**Tracing Tool:** <!-- e.g., Langfuse self-hosted -->
+**Key Metrics to Track:**
+<!-- 3-5 metrics that will be monitored in production -->
+**Alert Thresholds:**
+<!-- When to page/alert -->
+**Smart Sampling Strategy:**
+<!-- How to select interactions for human review — signal-based filters -->
+---
+## Checklist
+- [ ] System type classified
+- [ ] Critical failure modes identified (≥ 3)
+- [ ] Domain context researched (Section 1b: vertical, stakes, expert criteria, failure modes)
+- [ ] Regulatory/compliance context identified or explicitly noted as none
+- [ ] Domain expert roles defined for evaluation involvement
+- [ ] Framework selected with rationale documented
+- [ ] Alternatives considered and ruled out
+- [ ] Framework quick reference written (install, imports, pattern, pitfalls)
+- [ ] AI systems best practices written (Section 4b: Pydantic, async, prompt discipline, context)
+- [ ] Evaluation dimensions grounded in domain rubric ingredients
+- [ ] Each eval dimension has a concrete rubric (Good/Bad in domain language)
+- [ ] Eval tooling selected — Arize Phoenix default confirmed or override noted
+- [ ] Reference dataset spec written (size ≥ 10, composition + labeling defined)
+- [ ] CI/CD eval integration specified
+- [ ] Online guardrails defined
+- [ ] Production monitoring configured (tracing tool + sampling strategy)

package/src/pilot/templates/spec.md ADDED Viewed

@@ -0,0 +1,307 @@
+# Phase Spec Template
+Template for `.planning/phases/XX-name/{phase_num}-SPEC.md` — locks requirements before discuss-phase.
+**Purpose:** Capture WHAT a phase delivers and WHY, with enough precision that requirements are falsifiable. discuss-phase reads this file and focuses on HOW to implement (skipping "what/why" questions already answered here).
+**Key principle:** Every requirement must be falsifiable — you can write a test or check that proves it was met or not. Vague requirements like "improve performance" are not allowed.
+**Downstream consumers:**
+- `discuss-phase` — reads SPEC.md at startup; treats Requirements, Boundaries, and Acceptance Criteria as locked; skips "what/why" questions
+- `gsd-planner` — reads locked requirements to constrain plan scope
+- `gsd-verifier` — uses acceptance criteria as explicit pass/fail checks
+---
+## File Template
+```markdown
+# Phase [X]: [Name] — Specification
+**Created:** [date]
+**Ambiguity score:** [score] (gate: ≤ 0.20)
+**Requirements:** [N] locked
+## Goal
+[One precise sentence — specific and measurable. NOT "improve X" — instead "X changes from A to B".]
+## Background
+[Current state from codebase — what exists today, what's broken or missing, what triggers this work. Grounded in code reality, not abstract description.]
+## Requirements
+1. **[Short label]**: [Specific, testable statement.]
+   - Current: [what exists or does NOT exist today]
+   - Target: [what it should become after this phase]
+   - Acceptance: [concrete pass/fail check — how a verifier confirms this was met]
+2. **[Short label]**: [Specific, testable statement.]
+   - Current: [what exists or does NOT exist today]
+   - Target: [what it should become after this phase]
+   - Acceptance: [concrete pass/fail check]
+[Continue for all requirements. Each must have Current/Target/Acceptance.]
+## Boundaries
+**In scope:**
+- [Explicit list of what this phase produces]
+- [Each item is a concrete deliverable or behavior]
+**Out of scope:**
+- [Explicit list of what this phase does NOT do] — [brief reason why it's excluded]
+- [Adjacent problems excluded from this phase] — [brief reason]
+## Constraints
+[Performance, compatibility, data volume, dependency, or platform constraints.
+If none: "No additional constraints beyond standard project conventions."]
+## Acceptance Criteria
+- [ ] [Pass/fail criterion — unambiguous, verifiable]
+- [ ] [Pass/fail criterion]
+- [ ] [Pass/fail criterion]
+[Every acceptance criterion must be a checkbox that resolves to PASS or FAIL.
+No "should feel good", "looks reasonable", or "generally works" — those are not checkboxes.]
+## Ambiguity Report
+| Dimension          | Score | Min  | Status | Notes                              |
+|--------------------|-------|------|--------|------------------------------------|
+| Goal Clarity       |       | 0.75 |        |                                    |
+| Boundary Clarity   |       | 0.70 |        |                                    |
+| Constraint Clarity |       | 0.65 |        |                                    |
+| Acceptance Criteria|       | 0.70 |        |                                    |
+| **Ambiguity**      |       | ≤0.20|        |                                    |
+Status: ✓ = met minimum, ⚠ = below minimum (planner treats as assumption)
+## Interview Log
+[Key decisions made during the Socratic interview. Format: round → question → answer → decision locked.]
+| Round | Perspective    | Question summary         | Decision locked                    |
+|-------|----------------|-------------------------|------------------------------------|
+| 1     | Researcher     | [what was asked]        | [what was decided]                 |
+| 2     | Simplifier     | [what was asked]        | [what was decided]                 |
+| 3     | Boundary Keeper| [what was asked]        | [what was decided]                 |
+[If --auto mode: note "auto-selected" decisions with the reasoning Claude used.]
+---
+*Phase: [XX-name]*
+*Spec created: [date]*
+*Next step: /ccp:discuss-phase [X] — implementation decisions (how to build what's specified above)*
+```
+<good_examples>
+**Example 1: Feature addition (Post Feed)**
+```markdown
+# Phase 3: Post Feed — Specification
+**Created:** 2025-01-20
+**Ambiguity score:** 0.12
+**Requirements:** 4 locked
+## Goal
+Users can scroll through posts from accounts they follow, with new posts available after pull-to-refresh.
+## Background
+The database has a `posts` table and `follows` table. No feed query or feed UI exists today. The home screen shows a placeholder "Your feed will appear here." This phase builds the feed query, API endpoint, and the feed list component.
+## Requirements
+1. **Feed query**: Returns posts from followed accounts ordered by creation time, descending.
+   - Current: No feed query exists — `posts` table is queried directly only from profile pages
+   - Target: `GET /api/feed` returns paginated posts from followed accounts, newest first, max 20 per page
+   - Acceptance: Query returns correct posts for a user who follows 3 accounts with known post counts; cursor-based pagination advances correctly
+2. **Feed display**: Posts display in a scrollable card list.
+   - Current: Home screen shows static placeholder text
+   - Target: Home screen renders feed cards with author, timestamp, post content, and reaction count
+   - Acceptance: Feed renders without error for 0 posts (empty state shown), 1 post, and 20+ posts
+3. **Pull-to-refresh**: User can refresh the feed manually.
+   - Current: No refresh mechanism exists
+   - Target: Pull-down gesture triggers refetch; new posts appear at top of list
+   - Acceptance: After a new post is created in test, pull-to-refresh shows the new post without full app restart
+4. **New posts indicator**: When new posts arrive, a banner appears instead of auto-scrolling.
+   - Current: No such mechanism
+   - Target: "3 new posts" banner appears when refetch returns posts newer than the oldest visible post; tapping banner scrolls to top and shows new posts
+   - Acceptance: Banner appears for ≥1 new post, does not appear when no new posts, tap navigates to top
+## Boundaries
+**In scope:**
+- Feed query (backend) — posts from followed accounts, paginated
+- Feed list UI (frontend) — post cards with author, timestamp, content, reaction counts
+- Pull-to-refresh gesture
+- New posts indicator banner
+- Empty state when user follows no one or no posts exist
+**Out of scope:**
+- Creating posts — that is Phase 4
+- Reacting to posts — that is Phase 5
+- Following/unfollowing accounts — that is Phase 2 (already done)
+- Push notifications for new posts — separate backlog item
+## Constraints
+- Feed query must use cursor-based pagination (not offset) — the database has 500K+ posts and offset pagination is unacceptably slow beyond page 3
+- The feed card component must reuse the existing `<AvatarImage>` component from Phase 2
+## Acceptance Criteria
+- [ ] `GET /api/feed` returns posts only from followed accounts (not all posts)
+- [ ] `GET /api/feed` supports `cursor` parameter for pagination
+- [ ] Feed renders correctly at 0, 1, and 20+ posts
+- [ ] Pull-to-refresh triggers refetch
+- [ ] New posts indicator appears when posts newer than current view exist
+- [ ] Empty state renders when user follows no one
+## Ambiguity Report
+| Dimension          | Score | Min  | Status | Notes                            |
+|--------------------|-------|------|--------|----------------------------------|
+| Goal Clarity       | 0.92  | 0.75 | ✓      |                                  |
+| Boundary Clarity   | 0.95  | 0.70 | ✓      | Explicit out-of-scope list       |
+| Constraint Clarity | 0.80  | 0.65 | ✓      | Cursor pagination required       |
+| Acceptance Criteria| 0.85  | 0.70 | ✓      | 6 pass/fail criteria             |
+| **Ambiguity**      | 0.12  | ≤0.20| ✓      |                                  |
+## Interview Log
+| Round | Perspective     | Question summary              | Decision locked                         |
+|-------|-----------------|------------------------------|-----------------------------------------|
+| 1     | Researcher      | What exists in posts today?  | posts + follows tables exist, no feed  |
+| 2     | Simplifier      | Minimum viable feed?         | Cards + pull-refresh, no auto-scroll   |
+| 3     | Boundary Keeper | What's NOT this phase?       | Creating posts, reactions out of scope |
+| 3     | Boundary Keeper | What does done look like?    | Scrollable feed with 4 card fields     |
+---
+*Phase: 03-post-feed*
+*Spec created: 2025-01-20*
+*Next step: /ccp:discuss-phase 3 — implementation decisions (card layout, loading skeleton, etc.)*
+```
+**Example 2: CLI tool (Database backup)**
+```markdown
+# Phase 2: Backup Command — Specification
+**Created:** 2025-01-20
+**Ambiguity score:** 0.15
+**Requirements:** 3 locked
+## Goal
+A `gsd backup` CLI command creates a reproducible database snapshot that can be restored by `gsd restore` (a separate phase).
+## Background
+No backup tooling exists. The project uses PostgreSQL. Developers currently use `pg_dump` manually — there is no standardized process, no output naming convention, and no CI integration. Three incidents in the last quarter involved restoring from wrong or corrupt dumps.
+## Requirements
+1. **Backup creation**: CLI command executes a full database backup.
+   - Current: No `backup` subcommand exists in the CLI
+   - Target: `gsd backup` connects to the database (via `DATABASE_URL` env or `--db` flag), runs pg_dump, writes output to `./backups/YYYY-MM-DD_HH-MM-SS.dump`
+   - Acceptance: Running `gsd backup` on a test database creates a `.dump` file; running `pg_restore` on that file recreates the database without error
+2. **Network retry**: Transient network failures are retried automatically.
+   - Current: pg_dump fails immediately on network error
+   - Target: Backup retries up to 3 times with 5-second delay; 4th failure exits with code 1 and a message to stderr
+   - Acceptance: Simulating 2 sequential network failures causes 2 retries then success; simulating 4 failures causes exit code 1 and stderr message
+3. **Partial cleanup**: Failed backups do not leave corrupt files.
+   - Current: Manual pg_dump leaves partial files on failure
+   - Target: If backup fails after starting, the partial `.dump` file is deleted before exit
+   - Acceptance: After a simulated failure mid-dump, no `.dump` file exists in `./backups/`
+## Boundaries
+**In scope:**
+- `gsd backup` subcommand (full dump only)
+- Output to `./backups/` directory (created if missing)
+- Network retry (3 attempts)
+- Partial file cleanup on failure
+**Out of scope:**
+- `gsd restore` — that is Phase 3
+- Incremental backups — separate backlog item (full dump only for now)
+- S3 or remote storage — separate backlog item
+- Encryption — separate backlog item
+- Scheduled/cron backups — separate backlog item
+## Constraints
+- Must use `pg_dump` (not a custom query) — ensures compatibility with standard `pg_restore`
+- `--no-retry` flag must be available for CI use (fail fast, no retries)
+## Acceptance Criteria
+- [ ] `gsd backup` creates a `.dump` file in `./backups/YYYY-MM-DD_HH-MM-SS.dump` format
+- [ ] `gsd backup` uses `DATABASE_URL` env var or `--db` flag for connection
+- [ ] 3 retries on network failure, then exit code 1 with stderr message
+- [ ] `--no-retry` flag skips retries and fails immediately on first error
+- [ ] No partial `.dump` file left after a failed backup
+## Ambiguity Report
+| Dimension          | Score | Min  | Status | Notes                          |
+|--------------------|-------|------|--------|--------------------------------|
+| Goal Clarity       | 0.90  | 0.75 | ✓      |                                |
+| Boundary Clarity   | 0.95  | 0.70 | ✓      | Explicit out-of-scope list     |
+| Constraint Clarity | 0.75  | 0.65 | ✓      | pg_dump required               |
+| Acceptance Criteria| 0.80  | 0.70 | ✓      | 5 pass/fail criteria           |
+| **Ambiguity**      | 0.15  | ≤0.20| ✓      |                                |
+## Interview Log
+| Round | Perspective     | Question summary              | Decision locked                         |
+|-------|-----------------|------------------------------|-----------------------------------------|
+| 1     | Researcher      | What backup tooling exists?  | None — pg_dump manual only             |
+| 2     | Simplifier      | Minimum viable backup?       | Full dump only, local only             |
+| 3     | Boundary Keeper | What's NOT this phase?       | Restore, S3, encryption excluded       |
+| 4     | Failure Analyst | What goes wrong on failure?  | Partial files, CI fail-fast needed     |
+---
+*Phase: 02-backup-command*
+*Spec created: 2025-01-20*
+*Next step: /ccp:discuss-phase 2 — implementation decisions (progress reporting, flag design, etc.)*
+```
+</good_examples>
+<guidelines>
+**Every requirement needs all three fields:**
+- Current: grounds the requirement in reality — what exists today?
+- Target: the concrete change — not "improve X" but "X becomes Y"
+- Acceptance: the falsifiable check — how does a verifier confirm this?
+**Ambiguity Report must reflect the actual interview.** If a dimension is below minimum, mark it ⚠ — the planner knows to treat it as an assumption rather than a locked requirement.
+**Interview Log is evidence of rigor.** Don't skip it. It shows that requirements came from discovery, not assumption.
+**Boundaries protect the phase from scope creep.** The out-of-scope list with reasoning is as important as the in-scope list. Future phases that touch adjacent areas can point to this SPEC.md to understand what was intentionally excluded.
+**SPEC.md is a one-way door for requirements.** discuss-phase will treat these as locked. If requirements change after SPEC.md is written, the user should update SPEC.md first, then re-run discuss-phase.
+**SPEC.md does NOT replace CONTEXT.md.** They serve different purposes:
+- SPEC.md: what the phase delivers (requirements, boundaries, acceptance criteria)
+- CONTEXT.md: how the phase will be implemented (decisions, patterns, tradeoffs)
+discuss-phase generates CONTEXT.md after reading SPEC.md.
+</guidelines>