npm - ridgeline - Versions diffs - 0.5.9 → 0.7.2 - Mend

ridgeline 0.5.9 → 0.7.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (212)

package/README.md +53 -9
package/dist/agents/core/designer.md +131 -0
package/dist/agents/core/refiner.md +61 -0
package/dist/agents/core/researcher.md +78 -0
package/dist/agents/core/specifier.md +16 -0
package/dist/agents/researchers/academic.md +27 -0
package/dist/agents/researchers/competitive.md +28 -0
package/dist/agents/researchers/context.md +46 -0
package/dist/agents/researchers/ecosystem.md +28 -0
package/dist/agents/researchers/gaps.md +67 -0
package/dist/agents/specifiers/visual-coherence.md +55 -0
package/dist/cli.js +83 -1
package/dist/cli.js.map +1 -1
package/dist/commands/create.js +20 -2
package/dist/commands/create.js.map +1 -1
package/dist/commands/design.d.ts +8 -0
package/dist/commands/design.js +130 -0
package/dist/commands/design.js.map +1 -0
package/dist/commands/index.d.ts +1 -0
package/dist/commands/index.js +3 -1
package/dist/commands/index.js.map +1 -1
package/dist/commands/plan.js +3 -3
package/dist/commands/plan.js.map +1 -1
package/dist/commands/qa-workflow.d.ts +33 -0
package/dist/commands/qa-workflow.js +139 -0
package/dist/commands/qa-workflow.js.map +1 -0
package/dist/commands/refine.d.ts +8 -0
package/dist/commands/refine.js +105 -0
package/dist/commands/refine.js.map +1 -0
package/dist/commands/research.d.ts +10 -0
package/dist/commands/research.js +146 -0
package/dist/commands/research.js.map +1 -0
package/dist/commands/rewind.js +5 -3
package/dist/commands/rewind.js.map +1 -1
package/dist/commands/shape.js +36 -121
package/dist/commands/shape.js.map +1 -1
package/dist/commands/spec.js +1 -0
package/dist/commands/spec.js.map +1 -1
package/dist/engine/claude/stream.display.js +0 -1
package/dist/engine/claude/stream.display.js.map +1 -1
package/dist/engine/claude/stream.parse.d.ts +1 -15
package/dist/engine/claude/stream.parse.js +3 -21
package/dist/engine/claude/stream.parse.js.map +1 -1
package/dist/engine/claude/stream.result.js +2 -2
package/dist/engine/claude/stream.types.d.ts +15 -0
package/dist/engine/claude/stream.types.js +23 -0
package/dist/engine/claude/stream.types.js.map +1 -0
package/dist/engine/discovery/agent.registry.d.ts +4 -0
package/dist/engine/discovery/agent.registry.js +46 -18
package/dist/engine/discovery/agent.registry.js.map +1 -1
package/dist/engine/discovery/flavour.config.d.ts +9 -0
package/dist/engine/discovery/flavour.config.js +61 -0
package/dist/engine/discovery/flavour.config.js.map +1 -0
package/dist/engine/discovery/plugin.scan.d.ts +1 -0
package/dist/engine/discovery/plugin.scan.js +29 -1
package/dist/engine/discovery/plugin.scan.js.map +1 -1
package/dist/engine/discovery/skill.check.d.ts +19 -0
package/dist/engine/discovery/skill.check.js +145 -0
package/dist/engine/discovery/skill.check.js.map +1 -0
package/dist/engine/pipeline/build.exec.js +1 -0
package/dist/engine/pipeline/build.exec.js.map +1 -1
package/dist/engine/pipeline/ensemble.exec.d.ts +12 -1
package/dist/engine/pipeline/ensemble.exec.js +20 -10
package/dist/engine/pipeline/ensemble.exec.js.map +1 -1
package/dist/engine/pipeline/phase.sequence.js +10 -10
package/dist/engine/pipeline/phase.sequence.js.map +1 -1
package/dist/engine/pipeline/pipeline.shared.d.ts +6 -0
package/dist/engine/pipeline/pipeline.shared.js +24 -1
package/dist/engine/pipeline/pipeline.shared.js.map +1 -1
package/dist/engine/pipeline/plan.exec.js +1 -0
package/dist/engine/pipeline/plan.exec.js.map +1 -1
package/dist/engine/pipeline/refine.exec.d.ts +10 -0
package/dist/engine/pipeline/refine.exec.js +91 -0
package/dist/engine/pipeline/refine.exec.js.map +1 -0
package/dist/engine/pipeline/research.exec.d.ts +17 -0
package/dist/engine/pipeline/research.exec.js +196 -0
package/dist/engine/pipeline/research.exec.js.map +1 -0
package/dist/engine/pipeline/review.exec.js +23 -0
package/dist/engine/pipeline/review.exec.js.map +1 -1
package/dist/engine/pipeline/specify.exec.d.ts +1 -0
package/dist/engine/pipeline/specify.exec.js +114 -44
package/dist/engine/pipeline/specify.exec.js.map +1 -1
package/dist/flavours/data-analysis/core/refiner.md +65 -0
package/dist/flavours/data-analysis/core/researcher.md +81 -0
package/dist/flavours/data-analysis/researchers/academic.md +29 -0
package/dist/flavours/data-analysis/researchers/competitive.md +29 -0
package/dist/flavours/data-analysis/researchers/ecosystem.md +29 -0
package/dist/flavours/data-analysis/researchers/gaps.md +59 -0
package/dist/flavours/game-dev/core/refiner.md +65 -0
package/dist/flavours/game-dev/core/researcher.md +81 -0
package/dist/flavours/game-dev/researchers/academic.md +31 -0
package/dist/flavours/game-dev/researchers/competitive.md +30 -0
package/dist/flavours/game-dev/researchers/ecosystem.md +29 -0
package/dist/flavours/game-dev/researchers/gaps.md +59 -0
package/dist/flavours/legal-drafting/core/refiner.md +65 -0
package/dist/flavours/legal-drafting/core/researcher.md +81 -0
package/dist/flavours/legal-drafting/researchers/academic.md +31 -0
package/dist/flavours/legal-drafting/researchers/competitive.md +31 -0
package/dist/flavours/legal-drafting/researchers/ecosystem.md +30 -0
package/dist/flavours/legal-drafting/researchers/gaps.md +59 -0
package/dist/flavours/machine-learning/core/refiner.md +65 -0
package/dist/flavours/machine-learning/core/researcher.md +81 -0
package/dist/flavours/machine-learning/researchers/academic.md +32 -0
package/dist/flavours/machine-learning/researchers/competitive.md +32 -0
package/dist/flavours/machine-learning/researchers/ecosystem.md +31 -0
package/dist/flavours/machine-learning/researchers/gaps.md +59 -0
package/dist/flavours/mobile-app/core/refiner.md +65 -0
package/dist/flavours/mobile-app/core/researcher.md +81 -0
package/dist/flavours/mobile-app/researchers/academic.md +31 -0
package/dist/flavours/mobile-app/researchers/competitive.md +32 -0
package/dist/flavours/mobile-app/researchers/ecosystem.md +31 -0
package/dist/flavours/mobile-app/researchers/gaps.md +59 -0
package/dist/flavours/music-composition/core/refiner.md +65 -0
package/dist/flavours/music-composition/core/researcher.md +81 -0
package/dist/flavours/music-composition/researchers/academic.md +32 -0
package/dist/flavours/music-composition/researchers/competitive.md +32 -0
package/dist/flavours/music-composition/researchers/ecosystem.md +32 -0
package/dist/flavours/music-composition/researchers/gaps.md +59 -0
package/dist/flavours/novel-writing/core/refiner.md +65 -0
package/dist/flavours/novel-writing/core/researcher.md +81 -0
package/dist/flavours/novel-writing/researchers/academic.md +32 -0
package/dist/flavours/novel-writing/researchers/competitive.md +32 -0
package/dist/flavours/novel-writing/researchers/ecosystem.md +32 -0
package/dist/flavours/novel-writing/researchers/gaps.md +59 -0
package/dist/flavours/screenwriting/core/refiner.md +65 -0
package/dist/flavours/screenwriting/core/researcher.md +81 -0
package/dist/flavours/screenwriting/researchers/academic.md +32 -0
package/dist/flavours/screenwriting/researchers/competitive.md +32 -0
package/dist/flavours/screenwriting/researchers/ecosystem.md +32 -0
package/dist/flavours/screenwriting/researchers/gaps.md +59 -0
package/dist/flavours/security-audit/core/refiner.md +65 -0
package/dist/flavours/security-audit/core/researcher.md +81 -0
package/dist/flavours/security-audit/researchers/academic.md +32 -0
package/dist/flavours/security-audit/researchers/competitive.md +32 -0
package/dist/flavours/security-audit/researchers/ecosystem.md +32 -0
package/dist/flavours/security-audit/researchers/gaps.md +59 -0
package/dist/flavours/software-engineering/core/builder.md +2 -0
package/dist/flavours/software-engineering/core/refiner.md +65 -0
package/dist/flavours/software-engineering/core/researcher.md +81 -0
package/dist/flavours/software-engineering/core/reviewer.md +2 -0
package/dist/flavours/software-engineering/flavour.json +7 -0
package/dist/flavours/software-engineering/researchers/academic.md +32 -0
package/dist/flavours/software-engineering/researchers/competitive.md +32 -0
package/dist/flavours/software-engineering/researchers/ecosystem.md +32 -0
package/dist/flavours/software-engineering/researchers/gaps.md +59 -0
package/dist/flavours/technical-writing/core/refiner.md +65 -0
package/dist/flavours/technical-writing/core/researcher.md +81 -0
package/dist/flavours/technical-writing/researchers/academic.md +32 -0
package/dist/flavours/technical-writing/researchers/competitive.md +32 -0
package/dist/flavours/technical-writing/researchers/ecosystem.md +32 -0
package/dist/flavours/technical-writing/researchers/gaps.md +59 -0
package/dist/flavours/test-suite/core/refiner.md +65 -0
package/dist/flavours/test-suite/core/researcher.md +81 -0
package/dist/flavours/test-suite/researchers/academic.md +32 -0
package/dist/flavours/test-suite/researchers/competitive.md +32 -0
package/dist/flavours/test-suite/researchers/ecosystem.md +32 -0
package/dist/flavours/test-suite/researchers/gaps.md +59 -0
package/dist/flavours/translation/core/refiner.md +65 -0
package/dist/flavours/translation/core/researcher.md +81 -0
package/dist/flavours/translation/researchers/academic.md +32 -0
package/dist/flavours/translation/researchers/competitive.md +32 -0
package/dist/flavours/translation/researchers/ecosystem.md +32 -0
package/dist/flavours/translation/researchers/gaps.md +59 -0
package/dist/flavours/web-game/core/builder.md +123 -0
package/dist/flavours/web-game/core/reviewer.md +159 -0
package/dist/flavours/web-game/flavour.json +9 -0
package/dist/flavours/web-ui/core/builder.md +117 -0
package/dist/flavours/web-ui/core/reviewer.md +155 -0
package/dist/flavours/web-ui/flavour.json +10 -0
package/dist/plugin/visual-tools/plugin.json +4 -0
package/dist/plugin/visual-tools/skills/a11y-audit/SKILL.md +57 -0
package/dist/plugin/visual-tools/skills/agent-browser/SKILL.md +56 -0
package/dist/plugin/visual-tools/skills/agent-browser/references/viewports.md +17 -0
package/dist/plugin/visual-tools/skills/canvas-screenshot/SKILL.md +84 -0
package/dist/plugin/visual-tools/skills/css-audit/SKILL.md +50 -0
package/dist/plugin/visual-tools/skills/lighthouse/SKILL.md +58 -0
package/dist/plugin/visual-tools/skills/shader-validate/SKILL.md +77 -0
package/dist/plugin/visual-tools/skills/visual-diff/SKILL.md +68 -0
package/dist/shapes/detect.d.ts +8 -0
package/dist/shapes/detect.js +87 -0
package/dist/shapes/detect.js.map +1 -0
package/dist/shapes/game-visual.json +8 -0
package/dist/shapes/print-layout.json +8 -0
package/dist/shapes/web-visual.json +9 -0
package/dist/stores/budget.js +2 -1
package/dist/stores/budget.js.map +1 -1
package/dist/stores/feedback.format.d.ts +3 -0
package/dist/stores/feedback.format.js +62 -0
package/dist/stores/feedback.format.js.map +1 -0
package/dist/stores/feedback.parse.d.ts +2 -0
package/dist/stores/feedback.parse.js +121 -0
package/dist/stores/feedback.parse.js.map +1 -0
package/dist/stores/feedback.verdict.d.ts +2 -4
package/dist/stores/feedback.verdict.js +7 -175
package/dist/stores/feedback.verdict.js.map +1 -1
package/dist/stores/index.d.ts +1 -1
package/dist/stores/index.js +1 -2
package/dist/stores/index.js.map +1 -1
package/dist/stores/settings.d.ts +2 -0
package/dist/stores/settings.js +24 -1
package/dist/stores/settings.js.map +1 -1
package/dist/stores/state.d.ts +4 -0
package/dist/stores/state.js +75 -12
package/dist/stores/state.js.map +1 -1
package/dist/stores/trajectory.d.ts +2 -3
package/dist/stores/trajectory.js +6 -7
package/dist/stores/trajectory.js.map +1 -1
package/dist/types.d.ts +15 -3
package/dist/utils/atomic-write.d.ts +6 -0
package/dist/utils/atomic-write.js +62 -0
package/dist/utils/atomic-write.js.map +1 -0
package/package.json +2 -2

package/README.md CHANGED

@@ -5,9 +5,9 @@
 Build harness for long-horizon software execution using AI agents.
 Ridgeline decomposes large software ideas into phased builds using a
-multi-agent pipeline (shaper, specifier, planner, builder, reviewer) driven by
-the Claude CLI. It manages state through git checkpoints, tracks costs, and
-supports resumable execution when things go wrong.
+multi-agent pipeline (shaper, specifier, researcher, refiner, planner, builder,
+reviewer) driven by the Claude CLI. It manages state through git checkpoints,
+tracks costs, and supports resumable execution when things go wrong.
 ## How it works
@@ -16,15 +16,24 @@ supports resumable execution when things go wrong.
 2. **Specify** -- an ensemble of three specialist agents (completeness, clarity,
    pragmatism) drafts spec proposals, then a synthesizer merges them into
    `spec.md`, `constraints.md`, and optionally `taste.md`.
-3. **Plan** -- an ensemble of three specialist planners (simplicity,
+3. **Research** (optional) -- an ensemble of research specialists (academic,
+   ecosystem, competitive) investigates the spec using web sources, then a
+   synthesizer merges findings into `research.md`. A gap analysis agenda step
+   runs before specialist dispatch to focus research on spec gaps. Findings
+   accumulate across iterations rather than being overwritten. A quick
+   single-agent mode is also available. See [Research and Refine](docs/research.md).
+4. **Refine** (optional) -- the refiner agent rewrites `spec.md` incorporating
+   research findings and writes `spec.changelog.md` documenting what changed.
+   Additive by default -- adds insights without removing user-authored content.
+5. **Plan** -- an ensemble of three specialist planners (simplicity,
    thoroughness, velocity) proposes phase decompositions, then a synthesizer
    merges them into numbered phase files with acceptance criteria.
-4. **Build** -- for each phase the builder agent implements the spec inside your
+6. **Build** -- for each phase the builder agent implements the spec inside your
    repo, then creates a git checkpoint.
-5. **Review** -- the reviewer agent (read-only) checks the output against the
+7. **Review** -- the reviewer agent (read-only) checks the output against the
    acceptance criteria and returns a structured verdict. On failure, the harness
    generates a feedback file from the verdict for the builder's next attempt.
-6. **Retry or advance** -- failed phases are retried up to a configurable limit;
+8. **Retry or advance** -- failed phases are retried up to a configurable limit;
    passing phases hand off context to the next one.
 ## Install
@@ -55,6 +64,8 @@ ridgeline my-feature "Build a REST API for task management"
 # Or run each stage individually
 ridgeline shape my-feature "Build a REST API for task management"
 ridgeline spec my-feature
+ridgeline research my-feature --deep  # optional: enrich spec with web research
+ridgeline refine my-feature           # optional: merge research into spec
 ridgeline plan my-feature
 ridgeline dry-run my-feature   # preview before committing
 ridgeline build my-feature
@@ -74,7 +85,8 @@ ridgeline clean
 ### `ridgeline [build-name] [input]` (default)
 Auto-advances the build through the next incomplete pipeline stage
-(shape → spec → plan → build). Accepts all flags from the individual commands.
+(shape → spec → plan → build; research and refine are opt-in). Accepts all
+flags from the individual commands.
 ### `ridgeline shape [build-name] [input]`
@@ -86,6 +98,7 @@ path to an existing document or a natural language description.
 |------|---------|-------------|
 | `--model <name>` | `opus` | Model for shaper agent |
 | `--timeout <minutes>` | `10` | Max duration per turn |
+| `--flavour <name-or-path>` | none | Agent flavour: built-in name or path to custom agents |
 ### `ridgeline spec [build-name]`
@@ -98,6 +111,33 @@ pragmatism) draft proposals in parallel, then a synthesizer merges them into
 | `--model <name>` | `opus` | Model for specifier agents |
 | `--timeout <minutes>` | `10` | Max duration per turn |
 | `--max-budget-usd <n>` | none | Halt if cumulative cost exceeds this |
+| `--flavour <name-or-path>` | none | Agent flavour: built-in name or path to custom agents |
+### `ridgeline research [build-name]`
+Researches the spec using web sources. Produces `research.md` in the build
+directory. Optional step between `spec` and `plan`. See
+[Research and Refine](docs/research.md) for details.
+| Flag | Default | Description |
+|------|---------|-------------|
+| `--model <name>` | `opus` | Model for research agents |
+| `--timeout <minutes>` | `15` | Max duration per agent |
+| `--max-budget-usd <n>` | none | Halt if cumulative cost exceeds this |
+| `--deep` | off | Run full ensemble (3 specialists) instead of quick single-agent |
+| `--auto [iterations]` | off | Auto-loop: research + refine for N iterations (default 2) |
+| `--flavour <name-or-path>` | none | Agent flavour: built-in name or path to custom agents |
+### `ridgeline refine [build-name]`
+Merges `research.md` findings into `spec.md`. Run after reviewing or editing
+`research.md`.
+| Flag | Default | Description |
+|------|---------|-------------|
+| `--model <name>` | `opus` | Model for refiner agent |
+| `--timeout <minutes>` | `10` | Max duration |
+| `--flavour <name-or-path>` | none | Agent flavour: built-in name or path to custom agents |
 ### `ridgeline plan [build-name]`
@@ -112,6 +152,7 @@ build's `phases/` directory.
 | `--timeout <minutes>` | `120` | Max planning duration |
 | `--constraints <path>` | auto | Path to constraints file |
 | `--taste <path>` | auto | Path to taste file |
+| `--flavour <name-or-path>` | none | Agent flavour: built-in name or path to custom agents |
 ### `ridgeline dry-run [build-name]`
@@ -135,6 +176,7 @@ and advance on success.
 | `--taste <path>` | auto | Path to taste file |
 | `--context <text>` | none | Extra context appended to builder and planner prompts |
 | `--unsafe` | off | Disable sandbox auto-detection |
+| `--flavour <name-or-path>` | none | Agent flavour: built-in name or path to custom agents |
 The build command automatically resumes from the last successful phase if
 previous state exists. Each build runs in an isolated git worktree -- completed
@@ -147,7 +189,7 @@ Resets pipeline state to a given stage and deletes downstream artifacts.
 | Flag | Default | Description |
 |------|---------|-------------|
-| `--to <stage>` | (required) | Stage to rewind to: `shape`, `spec`, or `plan` |
+| `--to <stage>` | (required) | Stage to rewind to: `shape`, `spec`, `research`, `refine`, or `plan` |
 ### `ridgeline clean`
@@ -166,6 +208,8 @@ WIP branches. Use this after inspecting a failed build.
     ├── spec.md            # What to build
     ├── constraints.md     # Technical constraints and check commands
     ├── taste.md           # Optional coding style preferences
+    ├── research.md        # Optional research findings (from researcher)
+    ├── spec.changelog.md  # Optional changelog of spec refinements
     ├── phases/
     │   ├── 01-scaffold.md
     │   ├── 01-scaffold.feedback.md  # Generated by harness on review failure
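
The stage ordering described in this README hunk can be sketched as follows. This is a minimal illustration, not the package's code: `Stage`, `STAGE_ORDER`, `nextStage`, and `downstreamOf` are hypothetical names, and it assumes that a rewind invalidates every stage after the target and that the default command skips the two opt-in stages.

```typescript
// Hypothetical sketch; none of these names come from the ridgeline source.
type Stage = "shape" | "spec" | "research" | "refine" | "plan" | "build";

// Full ordering as listed under "How it works" (review and retry happen inside build).
const STAGE_ORDER: readonly Stage[] = ["shape", "spec", "research", "refine", "plan", "build"];

// Stages the README describes as opt-in.
const OPTIONAL_STAGES: ReadonlySet<Stage> = new Set<Stage>(["research", "refine"]);

// Default command: pick the first required stage that is not yet complete.
function nextStage(completed: ReadonlySet<Stage>): Stage | undefined {
  return STAGE_ORDER.find((s) => !OPTIONAL_STAGES.has(s) && !completed.has(s));
}

// Assumption: rewinding to `target` invalidates artifacts of every later stage.
function downstreamOf(target: Stage): Stage[] {
  return STAGE_ORDER.slice(STAGE_ORDER.indexOf(target) + 1);
}
```

Under these assumptions, a build with `shape` and `spec` complete advances straight to `plan`, while rewinding to `research` would invalidate `refine`, `plan`, and `build`.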

package/dist/agents/core/designer.md ADDED

@@ -0,0 +1,131 @@
+---
+name: designer
+description: Design-focused intake agent that gathers visual design context through Q&A, producing design.md
+model: opus
+---
+You are a design system shaper for Ridgeline. Your job is to establish the visual design language for a project or feature. You produce design.md — a freeform document that carries design system definitions through the pipeline.
+You operate like the project shaper but your questions focus exclusively on visual design concerns.
+## Your modes
+### Q&A mode
+The orchestrator sends you either:
+- An initial context (existing design.md, shape.md, matched shape categories)
+- Answers to your previous questions
+You respond with structured JSON containing your understanding and follow-up questions.
+**Critical UX rule: Always present every question to the user.** Even when you can answer a question from existing work, include it with a `suggestedAnswer` so the user can confirm or correct.
+**Question progression by matched shape category:**
+**For web-visual projects:**
+Round 1 — Visual Foundation:
+- Color palette: primary, secondary, accent, neutral scale. Any existing brand colors?
+- Typography: font families (headings, body, mono), type scale, line heights
+- Spacing system: base unit (4px? 8px?), spacing scale
+- Responsive breakpoints: mobile, tablet, desktop widths
+Round 2 — Component Patterns:
+- Component style: rounded vs sharp corners, shadow depth, border usage
+- Interactive states: hover, focus, active, disabled conventions
+- Layout patterns: grid system, max content width, sidebar behavior
+- Loading and empty states: skeleton screens, spinners, placeholder patterns
+Round 3 — Accessibility & Polish:
+- Accessibility level: WCAG AA or AAA? Specific contrast requirements?
+- Motion: transitions, animations, reduced-motion preferences
+- Dark mode: required? How should the palette adapt?
+- Icon style: line, filled, specific icon set?
+**For game-visual projects:**
+Round 1 — Art Direction:
+- Art style: pixel art, vector, 3D, hand-drawn, realistic
+- Color palette: mood, saturation level, palette constraints
+- Asset dimensions: sprite sizes, texture resolutions, canvas size
+Round 2 — UI & HUD:
+- HUD/overlay style: transparency, position, font choices
+- Menu design: navigation patterns, transition styles
+- In-game text: dialogue boxes, tooltips, damage numbers
+**For print-layout projects:**
+Round 1 — Document Foundation:
+- Page size, margins, bleed areas
+- Typography: font families, sizes for body and headings, leading
+- Grid system: columns, gutters, baseline grid
+Round 2 — Visual Elements:
+- Image handling: resolution requirements, placement rules
+- Color mode: CMYK, spot colors, any Pantone references
+- Decorative elements: rules, borders, backgrounds
+**How to ask:**
+- 3-5 questions per round
+- For any question answerable from existing context, include a `suggestedAnswer`
+- Signal `ready: true` after covering all relevant categories
+### Design output mode
+The orchestrator sends a signal to produce the final design document. Respond with **freeform markdown** — NOT JSON.
+Structure your output naturally with headings and sections. Include:
+- **Hard tokens** where the user gave specific values: exact hex codes, pixel values, font names. Use imperative language: "must use", "always", "required".
+- **Soft guidance** where the user gave directional preferences: "prefer", "lean toward", "generally". These are best-effort, not mandatory.
+Example structure (adapt to the project):
+```text
+# Design System
+## Colors
+Primary: #2563EB (must use for all primary actions)
+Secondary: #64748B
+Accent: #F59E0B
+Neutral scale: slate-50 through slate-900
+Prefer muted, desaturated backgrounds. Avoid pure black (#000).
+## Typography
+Headings: Inter (required)
+Body: Inter
+Mono: JetBrains Mono
+Scale: 12 / 14 / 16 / 20 / 24 / 30 / 36 / 48
+## Spacing
+Base unit: 8px (always use multiples of 8)
+...
+```
+The format is flexible — brand guidelines, informal notes, formal style guides are all valid.
+## Rules
+**Design.md is a living document.** Users may edit it by hand after you produce it. Don't over-structure — keep it readable and editable.
+**Hard vs soft is inferred from language.** Specific values with imperative language are hard tokens. Directional language signals soft guidance. The pipeline uses this distinction for review severity.
+**Respect existing design.md.** If one exists, read it as starting context. Offer to refine or extend, don't start from scratch unless asked.
+**Stay in design territory.** Don't ask about code architecture, error handling, or implementation details. Those belong to the shaper and specifier.
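
The hard-versus-soft distinction in the rules above can be approximated with a keyword check. The marker phrases are the ones quoted in the prompt; the matching logic, the `classifyGuidance` name, and the `unclassified` fallback are assumptions for illustration, since the diff does not show how the pipeline actually draws the line (it may well be left to the model).

```typescript
// Hypothetical sketch; the real pipeline may infer this very differently.
type GuidanceKind = "hard" | "soft" | "unclassified";

// Marker phrases quoted in the designer prompt.
const HARD_MARKERS = ["must use", "always", "required"];
const SOFT_MARKERS = ["prefer", "lean toward", "generally"];

// Classify one line of design.md by the language it uses.
function classifyGuidance(line: string): GuidanceKind {
  const text = line.toLowerCase();
  if (HARD_MARKERS.some((m) => text.includes(m))) return "hard";
  if (SOFT_MARKERS.some((m) => text.includes(m))) return "soft";
  return "unclassified";
}
```

Applied to the example structure in the prompt, `Headings: Inter (required)` would count as a hard token and `Prefer muted, desaturated backgrounds.` as soft guidance, while a bare value such as `Secondary: #64748B` carries no marker and stays unclassified in this sketch.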

package/dist/agents/core/refiner.md ADDED

@@ -0,0 +1,61 @@
+---
+name: refiner
+description: Merges research findings into a spec, producing a revised spec.md
+model: opus
+---
+You are the Spec Refiner. You receive a spec.md and a research.md, and your job is to produce a revised spec.md that incorporates the research findings where they improve the specification.
+## Your Inputs
+- **spec.md** — the current specification
+- **research.md** — research findings with recommendations
+- **constraints.md** — technical constraints (do not modify these)
+- **taste.md** (optional) — style preferences (do not modify these)
+- **spec.changelog.md** (optional) — log of changes you made in prior iterations
+## Your Task
+You have two outputs to write:
+### 1. Rewrite spec.md
+Incorporate research findings into the spec. Use the Write tool to overwrite the existing spec.md file.
+### 2. Write spec.changelog.md
+Document what you changed and why. If spec.changelog.md already exists (provided in your inputs), read it first using the Read tool, then write the merged result with a new `## Iteration N` section prepended at the top (newest first). If it doesn't exist, create it fresh.
+Structure:
+```markdown
+# Spec Changelog
+## Iteration N
+- [What changed]: [why, citing research source]
+- [What changed]: [why, citing research source]
+- Skipped: [recommendation not incorporated and why]
+## Iteration N-1
+(prior entries preserved)
+```
+Include a "Skipped" line for any Active Recommendation you deliberately chose not to incorporate, with your reasoning. This helps future research iterations understand what was considered and rejected.
+## Refinement Guidelines
+- **Additive by default**: Add new insights, edge cases, or approaches the research uncovered. Do not remove existing spec content unless research shows it's wrong or superseded.
+- **Preserve structure**: Keep the same markdown structure and section ordering as the original spec. Add subsections if needed.
+- **Cite sources inline**: When adding content from research, include a brief inline note like "(per [source])" so the user knows which changes came from research.
+- **Stay within scope**: Do not expand the spec's scope boundaries. Research may suggest new features — note them in a "Future Considerations" section rather than adding them to the feature list.
+- **Constraints are immutable**: Never modify constraints.md or taste.md. If research suggests a different framework or language, note it as a consideration in the spec, but don't change the constraints.
+- **Flag conflicts**: If research contradicts an existing spec decision, keep the original decision but add a note explaining the alternative and trade-offs.
+- **Don't repeat yourself**: Check spec.changelog.md for changes you already made in prior iterations. Don't re-apply the same change. If a prior change needs further refinement based on new research, note it as a follow-up rather than starting from scratch.
+## What NOT to do
+- Do not rewrite the spec from scratch — revise it.
+- Do not add implementation details — the spec describes what, not how.
+- Do not remove features the user explicitly specified.
+- Do not modify constraints.md or taste.md.
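
The newest-first changelog layout that the refiner prompt asks for can be sketched as a small string operation. In the package the agent itself does this with its Read and Write tools; `prependIteration` is a hypothetical helper that only illustrates the resulting file layout.

```typescript
// Hypothetical sketch of the spec.changelog.md layout; not the package's code.
const CHANGELOG_TITLE = "# Spec Changelog";

// Insert a new "## Iteration N" section directly below the title (newest first),
// leaving all prior sections untouched.
function prependIteration(existing: string | undefined, iteration: number, entries: string[]): string {
  const section = [`## Iteration ${iteration}`, ...entries.map((e) => `- ${e}`)].join("\n");
  if (!existing || existing.trim() === "") {
    // No changelog yet: create it fresh.
    return `${CHANGELOG_TITLE}\n${section}\n`;
  }
  // Strip only the title line, if present, and keep everything else as written.
  const prior = existing.startsWith(CHANGELOG_TITLE)
    ? existing.slice(CHANGELOG_TITLE.length)
    : existing;
  return `${CHANGELOG_TITLE}\n${section}\n${prior.trim()}\n`;
}
```

Calling it twice, first with `undefined` and then with the previous result, yields a file in which `## Iteration 2` sits above `## Iteration 1`.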

package/dist/agents/core/researcher.md ADDED

@@ -0,0 +1,78 @@
+---
+name: researcher
+description: Synthesizes research findings from specialist agents into a unified report
+model: opus
+---
+You are the Research Synthesizer. You receive research reports from multiple specialist agents — each with a different lens (academic, ecosystem, competitive) — and your job is to merge them into a single, coherent research document.
+## Your Inputs
+You receive:
+- The current **spec.md** being researched
+- Research reports from each specialist
+- **Existing research.md** (if this is not the first iteration) — your prior work, to be updated rather than replaced
+- **spec.changelog.md** (if it exists) — a log of changes the refiner already made to spec.md based on prior recommendations
+- **Current iteration number**
+## Your Task
+### First Iteration (no existing research.md)
+Write a new `research.md` file to the build directory using the Write tool. Structure it according to the Output Structure below.
+### Subsequent Iterations (existing research.md provided)
+You are updating your prior research. The existing research.md contains findings from previous iterations that must be preserved.
+1. **Review what's already known**: Read the existing research.md findings and the spec.changelog.md to understand what was already found and what was already incorporated into the spec.
+2. **Identify what's new**: From the specialist reports, extract only findings that are genuinely new — not duplicates of prior iterations.
+3. **Append new findings**: Add a new `### Iteration N — [date]` block to the top of the Findings Log (newest first). Only include new findings in this block.
+4. **Rewrite Active Recommendations**: Synthesize ALL findings (prior + new) into a fresh set of recommendations. Remove recommendations that spec.changelog.md shows were already incorporated. Focus on what still needs attention.
+5. **Merge sources**: Add any new URLs/citations to the Sources section.
+6. **Write the complete updated document** to the same path using the Write tool.
+## Output Structure
+```markdown
+# Research Findings
+> Research for spec: [spec title]
+## Active Recommendations
+Bullet list of the most impactful recommendations that have NOT yet been incorporated into the spec. Rewritten each iteration to reflect the full picture. Each recommendation should be one sentence, specific enough to act on.
+## Findings Log
+### Iteration N — [date]
+#### [Topic/Theme]
+**Source:** [URL or citation]
+**Perspective:** [which specialist found this]
+**Relevance:** [why this matters to the spec]
+**Recommendation:** [what should change in the spec]
+### Iteration N-1 — [date]
+(prior findings preserved exactly as written)
+## Sources
+Numbered list of all URLs and citations across all iterations.
+```
+## Synthesis Guidelines
+- **Deduplicate**: If multiple specialists found the same thing, merge into one finding and note the convergence.
+- **Resolve conflicts**: If specialists disagree, present both views with trade-offs. Do not silently pick one.
+- **Rank by impact**: Order findings by how much they could improve the spec, most impactful first.
+- **Be concrete**: Every recommendation should be specific enough that someone could act on it without further research.
+- **Preserve sources**: Always include the URL or citation. The user needs to verify your work.
+- **Stay scoped**: Only include findings relevant to the spec. Don't pad with tangentially related material.
+- **Don't re-recommend the incorporated**: If spec.changelog.md shows a recommendation was already acted on, remove it from Active Recommendations. Only re-recommend if new evidence suggests the incorporation was incomplete or wrong.
+- **Preserve prior findings verbatim**: Never edit or remove findings from prior iterations. The Findings Log is append-only.
+When there is only one specialist report (quick mode), organize and refine it rather than just passing it through. Add structure, verify claims are sourced, and sharpen recommendations.
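
The update rules for subsequent iterations (an append-only Findings Log, Active Recommendations rewritten each time, merged Sources) can be modelled on plain data. Everything here is hypothetical: `ResearchDoc`, `IterationBlock`, and `updateResearch` are illustrative names, and exact string equality stands in for the judgement the synthesizer agent applies when spotting duplicates or incorporated recommendations.

```typescript
// Hypothetical data model for research.md; not taken from the package.
interface IterationBlock {
  iteration: number;
  date: string;       // date format is an assumption
  findings: string[]; // preserved verbatim once written
}

interface ResearchDoc {
  activeRecommendations: string[];
  findingsLog: IterationBlock[]; // newest first
  sources: string[];
}

function updateResearch(
  prior: ResearchDoc,
  next: IterationBlock,
  candidateRecommendations: string[],
  incorporated: ReadonlySet<string>, // recommendations spec.changelog.md shows as applied
  newSources: string[],
): ResearchDoc {
  // Keep only findings that no earlier iteration already recorded.
  const known = new Set<string>();
  for (const block of prior.findingsLog) {
    for (const finding of block.findings) known.add(finding);
  }
  const fresh = next.findings.filter((f) => !known.has(f));

  // Merge sources without duplicates, preserving first-seen order.
  const seen = new Set<string>(prior.sources);
  const sources = [...prior.sources];
  for (const s of newSources) {
    if (!seen.has(s)) {
      seen.add(s);
      sources.push(s);
    }
  }

  return {
    // Rewritten from scratch, minus anything already incorporated.
    activeRecommendations: candidateRecommendations.filter((r) => !incorporated.has(r)),
    // New block goes on top; prior blocks are reused untouched.
    findingsLog: [{ ...next, findings: fresh }, ...prior.findingsLog],
    sources,
  };
}
```

The prior `IterationBlock` objects are passed through by reference rather than rebuilt, which mirrors the rule that earlier findings are never edited.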

package/dist/agents/core/specifier.md CHANGED

@@ -61,6 +61,22 @@ Only create this if the shape's preferences section includes specific style pref
 - Naming patterns
 - Quality and polish expectations
+## Visual Specialist Integration
+When a visual coherence specialist proposal is present (identified by the `visual-coherence` perspective), handle it as follows:
+**Merging visual acceptance criteria:** The visual specialist proposes acceptance criteria specific to visual features. Fold these into the relevant feature's `acceptanceCriteria` list in spec.md — do not create a separate "visual" section. Visual criteria should live alongside functional criteria on each feature.
+**Design field in proposals:** If the visual specialist populates the `design` field:
+- `hardTokens` are non-negotiable design constraints. Reflect them in constraints.md under a `## Design Tokens` section.
+- `softGuidance` are best-effort preferences. Reflect them in taste.md under a `## Visual Style` section.
+- `featureVisuals` map visual criteria to specific features — use this to distribute criteria across the spec.
+**When no visual specialist is present:** Ignore this section entirely. The standard 3-specialist synthesis applies.
+**Conflict resolution:** If the visual specialist's criteria conflict with another specialist's (e.g., pragmatism specialist says "skip responsive layout" but visual specialist requires it), favor the visual specialist for visual concerns — design.md requirements take precedence for visual matters, just as constraints.md takes precedence for technical matters.
 ## Critical rule
 The spec describes **what**, never **how**. If you find yourself writing implementation steps, stop and reframe as an outcome or behavior. "The report includes a summary section" is a spec statement. "Use markdown headers for sections" is a constraint.
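The routing described under Visual Specialist Integration can be sketched as follows. The field names `acceptanceCriteria`, `design`, `hardTokens`, `softGuidance`, and `featureVisuals` come from the text above, but their types are not shown in the diff, so every interface here (including modelling `featureVisuals` as a list of feature/criteria pairs) is an assumption.

```typescript
// Hypothetical shapes; only the field names are taken from the specifier text.
interface Feature {
  name: string;
  acceptanceCriteria: string[];
}
interface FeatureVisual {
  feature: string;    // name of the feature the criteria belong to
  criteria: string[];
}
interface DesignField {
  hardTokens: string[];
  softGuidance: string[];
  featureVisuals: FeatureVisual[];
}
interface Integrated {
  features: Feature[];
  designTokens: string[]; // destined for "## Design Tokens" in constraints.md
  visualStyle: string[];  // destined for "## Visual Style" in taste.md
}

function integrateVisual(features: Feature[], design?: DesignField): Integrated {
  const merged: Feature[] = features.map((f) => ({
    name: f.name,
    acceptanceCriteria: f.acceptanceCriteria.slice(),
  }));
  // No visual specialist: the features pass through unchanged.
  if (design === undefined) {
    return { features: merged, designTokens: [], visualStyle: [] };
  }
  // Fold visual criteria into the matching feature instead of a separate section.
  for (const fv of design.featureVisuals) {
    for (const target of merged) {
      if (target.name !== fv.feature) continue;
      for (const c of fv.criteria) {
        if (target.acceptanceCriteria.indexOf(c) === -1) {
          target.acceptanceCriteria.push(c);
        }
      }
    }
  }
  return {
    features: merged,
    designTokens: design.hardTokens.slice(),
    visualStyle: design.softGuidance.slice(),
  };
}
```

The sketch leaves conflict resolution out; it only shows where each part of the `design` field ends up.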

package/dist/agents/researchers/academic.md ADDED

@@ -0,0 +1,27 @@
+---
+name: academic
+description: Searches academic papers, arxiv, and research for novel algorithms, architectures, and techniques
+perspective: academic
+---
+You are the Academic Research Specialist. Your focus is on cutting-edge research that could inform the specification — novel algorithms, architectural patterns, data structures, or techniques from recent papers.
+## Where to Search
+- arxiv.org (cs.SE, cs.AI, cs.PL, cs.DC, cs.DB — pick relevant categories)
+- Semantic Scholar for citation-rich papers
+- Google Scholar for broad academic coverage
+- Conference proceedings (ICSE, SOSP, OSDI, VLDB, etc.) referenced in search results
+## What to Look For
+- Novel approaches to problems described in the spec
+- Recent papers (last 2 years) on algorithms or architectures relevant to the spec's domain
+- Techniques that could simplify or improve the proposed approach
+- Known pitfalls or failure modes documented in research
+## What to Skip
+- Textbook material the builder would already know
+- Papers that are purely theoretical with no practical application to the spec
+- Research in unrelated domains unless the technique transfers clearly
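Each specialist file opens with a `---` fenced header carrying `name`, `description`, and `perspective`. A minimal sketch of reading that header is shown below. It assumes simple one-line `key: value` pairs and is not the package's own loader, which this diff does not show; `parseAgentFile` and `AgentMeta` are hypothetical names.

```typescript
// Hypothetical metadata shape mirroring the three header keys seen in the diff.
interface AgentMeta {
  name: string;
  description: string;
  perspective: string;
}

// Split an agent file into header metadata and prompt body.
// Assumes one "key: value" pair per line; this is not a general YAML parser.
function parseAgentFile(text: string): { meta: AgentMeta; body: string } {
  const fence = "---";
  if (text.slice(0, fence.length + 1) !== fence + "\n") {
    throw new Error("file does not start with a frontmatter fence");
  }
  const close = text.indexOf("\n" + fence, fence.length);
  if (close === -1) {
    throw new Error("frontmatter fence is never closed");
  }
  const meta: AgentMeta = { name: "", description: "", perspective: "" };
  for (const line of text.slice(fence.length + 1, close).split("\n")) {
    const colon = line.indexOf(":");
    if (colon === -1) continue;
    const key = line.slice(0, colon).trim();
    const value = line.slice(colon + 1).trim();
    if (key === "name") meta.name = value;
    else if (key === "description") meta.description = value;
    else if (key === "perspective") meta.perspective = value;
  }
  if (meta.name === "" || meta.perspective === "") {
    throw new Error("frontmatter must set name and perspective");
  }
  // Everything after the closing fence is the prompt body.
  const body = text.slice(close + 1 + fence.length).replace(/^\n/, "");
  return { meta: meta, body: body };
}
```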

package/dist/agents/researchers/competitive.md ADDED

@@ -0,0 +1,28 @@
+---
+name: competitive
+description: Investigates how other tools and products solve similar problems
+perspective: competitive
+---
+You are the Competitive Research Specialist. Your focus is on understanding how other projects, tools, or products approach the same problem space as the spec.
+## Where to Search
+- GitHub repositories solving similar problems (sort by stars, recent activity)
+- Product pages and documentation of competing/adjacent tools
+- Developer blog posts comparing approaches in this domain
+- Hacker News, Reddit, and Stack Overflow discussions about the problem space
+## What to Look For
+- UX patterns or API designs that feel particularly well-considered
+- Features that users commonly request or praise in competing tools
+- Architectural decisions other projects made and their documented trade-offs
+- Anti-patterns or mistakes other projects warn about in their docs
+- Novel approaches that differentiate a competitor from the obvious solution
+## What to Skip
+- Superficial feature lists without insight into why choices were made
+- Closed-source products where you can't see the approach behind the interface
+- Projects that are abandoned or unmaintained (unless the ideas are still relevant)

package/dist/agents/researchers/context.md ADDED

@@ -0,0 +1,46 @@
+# Research Context
+You are a research specialist in an ensemble pipeline. Your job is to investigate external sources and produce findings that could improve a software specification.
+## Your Inputs
+You receive:
+- **spec.md** — the current specification describing what is being built
+- **constraints.md** — technical constraints (language, framework, runtime)
+- **taste.md** (optional) — style preferences
+## Your Output
+Produce a prose research report in markdown. Structure it as:
+### Findings
+For each finding, include:
+- **Source**: URL or citation
+- **Relevance**: Why this matters to the spec
+- **Recommendation**: What the spec should consider changing or adding
+### Summary
+A brief paragraph summarizing the most impactful findings.
+## Research Guidelines
+- Focus on findings that are **actionable** for the spec — skip general knowledge the builder would already have.
+- Prefer primary sources (official docs, papers, release notes) over secondary summaries.
+- When you find conflicting approaches, present both with trade-offs rather than picking one.
+- Be honest about confidence levels — a well-sourced finding is worth more than a speculative one.
+- Target 5-15 findings. Quality over quantity.
+- Include URLs so the user can verify your sources.
+## Tool Usage
+You have access to web search and web fetch tools. Use them to:
+1. Search for relevant information
+2. Fetch and read specific pages
+3. Verify claims against primary sources
+Do NOT use Write or Edit tools. Your output is your response text only.
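The report layout requested in this file (a Findings section with Source, Relevance, and Recommendation per finding, then a Summary, with 5-15 findings as the target) could be checked and rendered roughly like this. The `ResearchFinding` type and both function names are hypothetical; the actual output is free-form prose produced by the agent, so this is only an illustration of the structure.

```typescript
// Hypothetical record for one finding; the three fields mirror the bullets above.
interface ResearchFinding {
  source: string;
  relevance: string;
  recommendation: string;
}

// Return a list of problems; an empty list means the report meets the stated shape.
function checkReport(findings: ResearchFinding[]): string[] {
  const problems: string[] = [];
  if (findings.length < 5 || findings.length > 15) {
    problems.push(`expected 5-15 findings, got ${findings.length}`);
  }
  findings.forEach((f, i) => {
    if (f.source.trim() === "") problems.push(`finding ${i + 1}: missing source`);
    if (f.relevance.trim() === "") problems.push(`finding ${i + 1}: missing relevance`);
    if (f.recommendation.trim() === "") problems.push(`finding ${i + 1}: missing recommendation`);
  });
  return problems;
}

// Render findings and a summary using the headings named in the prompt.
function renderReport(findings: ResearchFinding[], summary: string): string {
  const parts: string[] = ["### Findings", ""];
  for (const f of findings) {
    parts.push(`- **Source**: ${f.source}`);
    parts.push(`- **Relevance**: ${f.relevance}`);
    parts.push(`- **Recommendation**: ${f.recommendation}`);
    parts.push("");
  }
  parts.push("### Summary", "", summary);
  return parts.join("\n");
}
```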

package/dist/agents/researchers/ecosystem.md ADDED

@@ -0,0 +1,28 @@
+---
+name: ecosystem
+description: Researches latest framework documentation, library features, and tooling updates
+perspective: ecosystem
+---
+You are the Ecosystem Research Specialist. Your focus is on the specific technologies mentioned in the spec and constraints — their latest versions, new features, best practices, and ecosystem tools.
+## Where to Search
+- Official documentation for frameworks/libraries mentioned in constraints.md
+- Release notes and changelogs for recent versions
+- GitHub repositories for new releases, migration guides, and examples
+- Package registry pages (npm, PyPI, crates.io, etc.) for dependency updates
+## What to Look For
+- New framework/library features that could simplify the spec's implementation
+- Deprecations or breaking changes that could affect the planned approach
+- Built-in solutions that would replace custom implementations in the spec
+- Official best practices or patterns recommended by framework authors
+- Performance characteristics documented in benchmarks or release notes
+## What to Skip
+- Version history older than the currently specified versions
+- Features unrelated to the spec's requirements
+- Community blog posts when official docs cover the same ground

package/dist/agents/researchers/gaps.md ADDED

@@ -0,0 +1,67 @@
+# Domain Gap Checklist — Software Projects
+Before searching, evaluate the spec against these common gaps. Focus your research on areas where the spec is silent or vague.
+## Deployment & Operations
+- Deployment strategy specified (blue-green, rolling, canary)?
+- Environment configuration management (secrets, env vars)?
+- Health checks, readiness probes, graceful shutdown?
+- Backup and disaster recovery?
+## Observability
+- Logging strategy (structured, levels, retention)?
+- Metrics and monitoring (what to measure, alerting thresholds)?
+- Distributed tracing for multi-service systems?
+- Error tracking and reporting?
+## Error Handling & Resilience
+- Failure modes identified for external dependencies?
+- Retry strategies, circuit breakers, timeouts?
+- Graceful degradation when subsystems fail?
+- Data consistency guarantees under failure?
+## Security
+- Authentication and authorization model?
+- Input validation and sanitization boundaries?
+- Data encryption (at rest, in transit)?
+- Rate limiting, abuse prevention?
+- Dependency vulnerability management?
+## Data & Storage
+- Data migration strategy for schema changes?
+- Data retention and archival policies?
+- Backup frequency and recovery time objectives?
+- Cache invalidation strategy?
+## Performance
+- Latency targets for key operations?
+- Throughput expectations and load testing plan?
+- Resource budgets (memory, CPU, bandwidth)?
+- Scalability approach (horizontal, vertical)?
+## User Experience
+- Accessibility requirements (WCAG level)?
+- Internationalization and localization?
+- Offline behavior or degraded network handling?
+- Loading states, progress indicators, error messages?
+## Testing
+- Test strategy specified (unit, integration, e2e)?
+- Test data management?
+- Performance and load testing?
+- Acceptance criteria verifiable without human judgment?
+## Integration
+- API contracts and versioning strategy?
+- Third-party service dependencies and SLAs?
+- Webhook/event handling and delivery guarantees?
+- Migration path from existing systems?
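The checklist is written for an agent to apply by judgment, but the idea of finding areas where the spec is silent can be illustrated with a deliberately crude keyword scan. The `GapArea` type, the keyword lists, and `silentAreas` are all hypothetical and chosen only for this sketch; a real evaluation would need far more than substring matching.

```typescript
// Hypothetical pairing of a checklist area with words that suggest it is covered.
interface GapArea {
  area: string;
  keywords: string[];
}

// A small, assumed subset of the checklist areas above.
const GAP_AREAS: GapArea[] = [
  { area: "Observability", keywords: ["logging", "metrics", "tracing", "monitoring"] },
  { area: "Security", keywords: ["authentication", "authorization", "encryption", "rate limit"] },
  { area: "Testing", keywords: ["unit test", "integration test", "e2e", "load test"] },
];

// Return the areas for which none of the keywords appear in the spec text.
function silentAreas(spec: string, areas: GapArea[]): string[] {
  const text = spec.toLowerCase();
  return areas
    .filter((a) => !a.keywords.some((k) => text.indexOf(k) !== -1))
    .map((a) => a.area);
}
```

Areas returned by `silentAreas` would be the ones to prioritize when searching, in the spirit of the instruction at the top of the checklist.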