npm - valent-pipeline - Versions diffs - 0.1.1 → 0.1.2 - Mend

valent-pipeline 0.1.1 → 0.1.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (108) hide show

package/package.json +1 -1
package/pipeline/agents-manifest.yaml +170 -0
package/pipeline/docker-compose.chromadb.yml +15 -0
package/pipeline/docs/agent-reference.md +72 -0
package/pipeline/docs/communication-standard.md +452 -0
package/pipeline/docs/knowledge-system.md +237 -0
package/pipeline/docs/lead-lifecycle.md +262 -0
package/pipeline/docs/lean-spawn-human-tasks.md +207 -0
package/pipeline/docs/npx-implementation-plan.md +171 -0
package/pipeline/docs/npx-packaging.md +85 -0
package/pipeline/docs/pipeline-overview.md +174 -0
package/pipeline/docs/pipeline-state-schema.md +103 -0
package/pipeline/docs/task-graph.md +184 -0
package/pipeline/docs/template-skeleton.md +281 -0
package/pipeline/prompts/bend.md +111 -0
package/pipeline/prompts/critic.md +105 -0
package/pipeline/prompts/embed.md +80 -0
package/pipeline/prompts/fend.md +136 -0
package/pipeline/prompts/help.md +28 -0
package/pipeline/prompts/judge-g1.md +121 -0
package/pipeline/prompts/judge-g2.md +112 -0
package/pipeline/prompts/knowledge.md +129 -0
package/pipeline/prompts/lead.md +682 -0
package/pipeline/prompts/pmcp.md +77 -0
package/pipeline/prompts/qa-a.md +149 -0
package/pipeline/prompts/qa-b.md +132 -0
package/pipeline/prompts/reqs.md +105 -0
package/pipeline/prompts/retrospective.md +82 -0
package/pipeline/prompts/uxa.md +143 -0
package/pipeline/scripts/embed-sqlite.ts +282 -0
package/pipeline/scripts/embed.ts +425 -0
package/pipeline/spawn-templates/agent-spawn.template.md +16 -0
package/pipeline/spawn-templates/knowledge-spawn.template.md +17 -0
package/pipeline/spawn-templates/pipeline-context.template.md +46 -0
package/pipeline/steps/bend/handoff.md +9 -0
package/pipeline/steps/bend/implement.md +13 -0
package/pipeline/steps/bend/read-inputs.md +13 -0
package/pipeline/steps/bend/write-tests.md +15 -0
package/pipeline/steps/common/distilled-handoff-format.md +49 -0
package/pipeline/steps/common/no-api-passthrough.md +18 -0
package/pipeline/steps/common/no-ui-passthrough.md +18 -0
package/pipeline/steps/critic/acceptance-audit.md +24 -0
package/pipeline/steps/critic/blind-hunt.md +18 -0
package/pipeline/steps/critic/edge-case-hunt.md +22 -0
package/pipeline/steps/critic/test-review.md +19 -0
package/pipeline/steps/critic/triage-depth.md +17 -0
package/pipeline/steps/critic/triage.md +12 -0
package/pipeline/steps/critic/write-verdict.md +31 -0
package/pipeline/steps/fend/handoff.md +9 -0
package/pipeline/steps/fend/implement.md +16 -0
package/pipeline/steps/fend/read-inputs.md +10 -0
package/pipeline/steps/fend/write-tests.md +12 -0
package/pipeline/steps/judge-g1/pass1-review.md +117 -0
package/pipeline/steps/judge-g1/pass2-review.md +51 -0
package/pipeline/steps/judge-g2/evidence-review.md +105 -0
package/pipeline/steps/judge-g2/ship-decision.md +43 -0
package/pipeline/steps/orchestration/adopt-lead-and-create-team.md +91 -0
package/pipeline/steps/orchestration/load-agents-manifest.md +9 -0
package/pipeline/steps/orchestration/load-pipeline-config.md +33 -0
package/pipeline/steps/orchestration/resolve-next-work-item.md +32 -0
package/pipeline/steps/orchestration/resolve-story-path.md +12 -0
package/pipeline/steps/orchestration/update-backlog-status.md +28 -0
package/pipeline/steps/orchestration/validate-story-inputs.md +43 -0
package/pipeline/steps/qa-a/api.md +31 -0
package/pipeline/steps/qa-a/read-inputs.md +34 -0
package/pipeline/steps/qa-a/write-spec.md +144 -0
package/pipeline/steps/qa-b/api.md +52 -0
package/pipeline/steps/qa-b/execute-tests.md +90 -0
package/pipeline/steps/qa-b/file-bugs.md +41 -0
package/pipeline/steps/qa-b/write-report.md +55 -0
package/pipeline/steps/reqs/analyze.md +41 -0
package/pipeline/steps/reqs/draft-brief.md +29 -0
package/pipeline/steps/reqs/pre-mortem.md +27 -0
package/pipeline/steps/reqs/read-inputs.md +25 -0
package/pipeline/steps/reqs/self-review.md +22 -0
package/pipeline/steps/reqs/write-output.md +14 -0
package/pipeline/steps/retrospective/aggregate-review.md +51 -0
package/pipeline/steps/retrospective/analyze.md +35 -0
package/pipeline/steps/retrospective/directives.md +60 -0
package/pipeline/steps/retrospective/embed-instructions.md +39 -0
package/pipeline/steps/retrospective/report.md +34 -0
package/pipeline/steps/uxa/read-inputs.md +22 -0
package/pipeline/steps/uxa/translate-spec.md +124 -0
package/pipeline/steps/uxa/write-output.md +15 -0
package/pipeline/task-graphs/backend-api.yaml +139 -0
package/pipeline/task-graphs/data-pipeline.yaml +139 -0
package/pipeline/task-graphs/document-generation.yaml +139 -0
package/pipeline/task-graphs/frontend-only.yaml +178 -0
package/pipeline/task-graphs/fullstack-web.yaml +186 -0
package/pipeline/task-graphs/library.yaml +139 -0
package/pipeline/task-graphs/mcp-server.yaml +139 -0
package/pipeline/templates/bend-handoff.template.md +83 -0
package/pipeline/templates/bugs.template.md +111 -0
package/pipeline/templates/critic-review.template.md +101 -0
package/pipeline/templates/decisions.template.md +29 -0
package/pipeline/templates/embed-instructions.template.md +46 -0
package/pipeline/templates/execution-report.template.md +119 -0
package/pipeline/templates/fend-handoff.template.md +85 -0
package/pipeline/templates/judge-g1-review.template.md +155 -0
package/pipeline/templates/judge-g2-decision.template.md +64 -0
package/pipeline/templates/pmcp-evidence.template.md +49 -0
package/pipeline/templates/qa-test-spec.template.md +153 -0
package/pipeline/templates/reqs-brief.template.md +119 -0
package/pipeline/templates/retrospective.template.md +108 -0
package/pipeline/templates/story-report.template.md +89 -0
package/pipeline/templates/traceability-matrix.template.md +90 -0
package/pipeline/templates/uxa-spec.template.md +169 -0
package/pipeline/templates/visual-validation-checklist.template.md +71 -0

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "valent-pipeline",
-  "version": "0.1.1",
+  "version": "0.1.2",
   "description": "v3 multi-agent AI pipeline for software development lifecycle",
   "type": "module",
   "bin": {

package/pipeline/agents-manifest.yaml ADDED Viewed

@@ -0,0 +1,170 @@
+# =============================================================================
+# V3 Agent Manifest — Lead Agent's Primary Reference for Team Composition
+# =============================================================================
+#
+# This manifest defines every agent the lead can spawn, their models, roles,
+# dependencies, and outputs. The lead reads this file at pipeline start to:
+#
+#   1. Phase 1 kick-off: Filter by lifecycle: per-story, check degraded_without
+#      to determine which optional inputs are available, and spawn teammates.
+#   2. Task dependency graph: Built from reads_from / writes_to — if agent B
+#      reads from what agent A writes, B is blockedBy A.
+#   3. Communication: All teammates can message any other living teammate via
+#      inbox. No routing restrictions.
+#   4. Model selection: Centralized here, not scattered across prompt templates.
+#      Changing an agent's model is a one-line YAML edit.
+#   5. Ephemeral spawning: The ephemeral_agents section tells the lead what it
+#      can spawn on-demand and what triggers the spawn.
+#
+# IMPORTANT — Communication & Access:
+#   - All teammates can message any other living teammate. There are NO
+#     communication restrictions.
+#   - reads_from indicates PRIMARY DEPENDENCIES for task sequencing, not access
+#     restrictions. All agents can read any file in the story folder.
+# =============================================================================
+agents:
+  lead:
+    name: Lead
+    model: opus
+    lifecycle: persistent
+    role: "Pipeline orchestrator — spawns team, monitors execution, manages story lifecycle"
+    prompt_template: v3/prompts/lead.md
+    reads_from: [story-input, agents-manifest.yaml, pipeline-config.yaml, pipeline-state.json]
+    writes_to: [pipeline-state.json]
+  reqs:
+    name: REQS
+    model: sonnet
+    lifecycle: per-story
+    role: "Requirements analyst — translates ACs into implementation brief"
+    prompt_template: v3/prompts/reqs.md
+    reads_from: [story-input]  # all story folder contents: ACs, trigger-map, architecture-decisions, UX spec when available
+    writes_to: [reqs-brief.md]
+  uxa:
+    name: UXA
+    model: sonnet
+    lifecycle: per-story
+    role: "UX specification agent — translates UX spec into component specs"
+    prompt_template: v3/prompts/uxa.md
+    reads_from: [reqs-brief.md, ux-spec, trigger-map, scenarios]
+    writes_to: [uxa-spec.md]
+    project_types: [fullstack-web, frontend-only]
+    degraded_without: [trigger-map, scenarios]  # runs translation-only without these
+  qa_a:
+    name: QA-A
+    model: sonnet
+    lifecycle: per-story
+    role: "QA spec writer — produces behavioral test specifications"
+    prompt_template: v3/prompts/qa-a.md
+    reads_from: [reqs-brief.md, uxa-spec.md]
+    writes_to: [qa-test-spec.md, visual-validation-checklist.md]
+  judge_g1:
+    name: JUDGE-G1
+    model: sonnet
+    lifecycle: per-story
+    role: "Quality gate — validates reqs, UXA spec, test specs (Pass 1) and bug priorities (Pass 2)"
+    prompt_template: v3/prompts/judge-g1.md
+    passes:
+      pass1_review_order: [reqs-validation, uxa-validation, qa-spec-validation]  # sequential, stop on first failure
+      pass2: bug-review
+    reads_from: [reqs-brief.md, uxa-spec.md, qa-test-spec.md, bugs.md, execution-report.md]
+    writes_to: [judge-g1-review.md]
+  bend:
+    name: BEND
+    model: sonnet
+    lifecycle: per-story
+    role: "Backend developer — implements production code and tests"
+    prompt_template: v3/prompts/bend.md
+    reads_from: [reqs-brief.md, qa-test-spec.md]
+    writes_to: [bend-handoff.md]
+  fend:
+    name: FEND
+    model: sonnet
+    lifecycle: per-story
+    role: "Frontend developer — implements UI components and tests"
+    prompt_template: v3/prompts/fend.md
+    reads_from: [reqs-brief.md, uxa-spec.md, qa-test-spec.md]
+    writes_to: [fend-handoff.md]
+    project_types: [fullstack-web, frontend-only]
+  critic:
+    name: CRITIC
+    model: opus
+    lifecycle: per-story
+    role: "Code reviewer — 3-pass adversarial review of production and test code"
+    prompt_template: v3/prompts/critic.md
+    review_passes: [blind-hunt, edge-case-hunt, acceptance-audit, triage]
+    reads_from: [git-diff, reqs-brief.md, qa-test-spec.md]
+    writes_to: [critic-review.md]
+  qa_b:
+    name: QA-B
+    model: sonnet
+    lifecycle: per-story
+    role: "Test executor — runs tests, validates spec alignment, files bugs"
+    prompt_template: v3/prompts/qa-b.md
+    reads_from: [qa-test-spec.md, critic-review.md, reqs-brief.md]
+    writes_to: [execution-report.md, bugs.md, traceability-matrix.md]
+    can_request_spawn: [pmcp]  # asks lead to spawn PMCP
+  judge_g2:
+    name: JUDGE-G2
+    model: sonnet
+    lifecycle: per-story
+    role: "Final ship gate — evidence-based approval or rejection"
+    prompt_template: v3/prompts/judge-g2.md
+    reads_from: [execution-report.md, traceability-matrix.md, pmcp-evidence.md, bugs.md, judge-g1-review.md, qa-test-spec.md]  # critic-review.md intentionally excluded — G2 validates test/execution evidence, not code review; qa-test-spec.md used as reference for assertion cross-check
+    writes_to: [judge-g2-decision.md, story-report.md]
+  knowledge:
+    name: Knowledge
+    model: haiku
+    lifecycle: per-story
+    role: "Knowledge retrieval — answers queries from persistent data sources"
+    prompt_template: v3/prompts/knowledge.md
+    data_sources: [chromadb, curated-knowledge-files, correction-directives]
+    context_variables: [knowledge_mode, chromadb_host, chromadb_collection_prefix, curated_files_path, correction_directives]
+    # No writes_to — Knowledge Agent responds via inbox only, no file output
+ephemeral_agents:
+  pmcp:
+    name: PMCP
+    model: sonnet
+    role: "Visual validation — executes browser automation MCP checklist, captures screenshots"
+    prompt_template: v3/prompts/pmcp.md
+    reads_from: [visual-validation-checklist.md]
+    writes_to: [pmcp-evidence.md]
+    spawned_by: lead
+    spawn_trigger: qa_a_checklist  # Lead spawns idle when QA-A writes visual-validation-checklist.md
+    execution_trigger: qa_b        # QA-B sends [PMCP-TRIGGER] with dev server URL to start execution
+  embed:
+    name: Embed
+    model: haiku
+    role: "Knowledge indexer — indexes curated patterns into knowledge base"
+    prompt_template: v3/prompts/embed.md
+    spawned_by: lead
+    triggered_by: retrospective  # only runs after Retrospective Agent curates what to index
+  retrospective:
+    name: Retrospective
+    model: sonnet
+    role: "Batch reviewer — analyzes last N stories for recurring patterns"
+    prompt_template: v3/prompts/retrospective.md
+    spawned_by: lead
+    triggered_by: every-n-stories
+  help:
+    name: Help
+    model: haiku
+    role: "Pipeline help — explains any piece of the pipeline from documentation"
+    prompt_template: v3/prompts/help.md
+    reads_from: [v3/docs/]
+    spawned_by: lead
+    triggered_by: user-request

package/pipeline/docker-compose.chromadb.yml ADDED Viewed

@@ -0,0 +1,15 @@
+services:
+  chromadb:
+    image: chromadb/chroma:latest
+    ports:
+      - "8000:8000"
+    volumes:
+      - ./chromadb-data:/chroma/chroma
+    environment:
+      IS_PERSISTENT: "TRUE"
+      ANONYMIZED_TELEMETRY: "FALSE"
+    healthcheck:
+      test: ["CMD-SHELL", "curl -f http://localhost:8000/api/v1/heartbeat || exit 1"]
+      interval: 10s
+      timeout: 5s
+      retries: 5

package/pipeline/docs/agent-reference.md ADDED Viewed

@@ -0,0 +1,72 @@
+# V3 Agent Reference
+> Quick reference for all 15 agents in the v3 pipeline.
+> Definitive source: `v3/agents-manifest.yaml`
+---
+## Agent Roster
+### Per-Story Agents (10)
+Spawned fresh for each story and torn down after the story ships or is cancelled.
+| Agent | Model | Role | Reads | Writes | Key Behavior |
+|-------|-------|------|-------|--------|--------------|
+| REQS | Sonnet | Requirements analyst -- translates ACs into implementation brief | story-input (ACs, trigger-map, architecture-decisions, UX spec) | `reqs-brief.md` | Brainstorms ambiguity resolutions; escalates only when options have genuinely competing tradeoffs |
+| UXA | Sonnet | UX specification -- translates UX spec into component specs | `reqs-brief.md`, ux-spec, trigger-map, scenarios | `uxa-spec.md` | Runs translation-only mode without trigger-map or scenarios; skipped for backend-only projects |
+| QA-A | Sonnet | QA spec writer -- produces behavioral test specifications | `reqs-brief.md`, `uxa-spec.md` | `qa-test-spec.md`, `visual-validation-checklist.md` | Writes test specs before code exists; tests are specified, not implemented |
+| JUDGE-G1 | Sonnet | Quality gate -- validates specs (Pass 1) and bug priorities (Pass 2) | `reqs-brief.md`, `uxa-spec.md`, `qa-test-spec.md`, `bugs.md`, `execution-report.md` | `judge-g1-review.md` | Sequential review: stops on first failure in Pass 1 |
+| BEND | Opus | Backend developer -- implements production code and tests | `reqs-brief.md`, `qa-test-spec.md` | `bend-handoff.md` | Implements to QA-A test spec; coordinates with FEND via inbox for shared files |
+| FEND | Opus | Frontend developer -- implements UI components and tests | `reqs-brief.md`, `uxa-spec.md`, `qa-test-spec.md` | `fend-handoff.md` | Implements to UXA component spec; skipped for backend-only projects |
+| CRITIC | Opus | Code reviewer -- 3-pass adversarial review | git-diff, `reqs-brief.md`, `qa-test-spec.md` | `critic-review.md` | 3-pass sequential review (blind hunt, edge-case hunt, acceptance audit) + triage |
+| QA-B | Sonnet | Test executor -- runs tests, validates spec alignment, files bugs | `qa-test-spec.md`, `critic-review.md`, `reqs-brief.md` | `execution-report.md`, `bugs.md`, `traceability-matrix.md` | Runs tests against real infrastructure; can request PMCP spawn for visual validation |
+| JUDGE-G2 | Sonnet | Final ship gate -- evidence-based approval or rejection | `execution-report.md`, `traceability-matrix.md`, `pmcp-evidence.md`, `bugs.md`, `judge-g1-review.md` | `judge-g2-decision.md` | Evidence over assertion -- independently verifies every upstream claim |
+| Knowledge | Haiku | Knowledge retrieval -- answers queries from persistent data sources | chromadb, curated-knowledge-files, correction-directives | _(none -- inbox only)_ | Responds via inbox only; no file output |
+### Persistent Agent (1)
+Lives across stories. Manages the backlog and orchestrates each story team.
+| Agent | Model | Role | Reads | Writes | Key Behavior |
+|-------|-------|------|-------|--------|--------------|
+| Lead | Opus | Pipeline orchestrator -- spawns team, monitors execution, manages story lifecycle | story-input, `agents-manifest.yaml`, `pipeline-config.yaml`, `pipeline-state.json` | `story-report.md`, `pipeline-state.json` | Builds task graph from manifest; enforces circuit breaker on rejection loops; escalates to user as last resort |
+### Ephemeral Agents (4)
+Spawned on-demand by the Lead when triggered by specific events.
+| Agent | Model | Role | Reads | Writes | Trigger |
+|-------|-------|------|-------|--------|---------|
+| PMCP | Sonnet | Visual validation -- browser automation MCP, captures screenshots | `visual-validation-checklist.md` | `pmcp-evidence.md` | Requested by QA-B, BEND, or FEND |
+| Embed | Haiku | Knowledge indexer -- indexes curated patterns into knowledge base | _(retrospective output)_ | _(indexing instructions)_ | After Retrospective agent curates what to index |
+| Retrospective | Sonnet | Batch reviewer -- analyzes last N stories for recurring patterns | _(story reports)_ | _(retrospective report)_ | Every N stories (configurable) |
+| Help | Haiku | Pipeline help -- explains any piece of the pipeline from documentation | `v3/docs/` | _(inbox only)_ | User request |
+---
+## Project-Type Agent Selection
+Not all agents run for every project type. The Lead reads `project_type` from `pipeline-config.yaml` and skips agents that don't apply.
+| Project Type | Agents Skipped |
+|-------------|----------------|
+| fullstack-web | _(none -- all agents active)_ |
+| backend-api | UXA, FEND, PMCP |
+| frontend-only | BEND |
+| data-pipeline | UXA, FEND, PMCP |
+| mcp-server | UXA, FEND, PMCP |
+| document-generation | UXA, FEND, PMCP |
+| library | UXA, FEND, PMCP |
+---
+## Model Tier Summary
+| Tier | Agents | Use Case | Cost |
+|------|--------|----------|------|
+| Opus | Lead, BEND, FEND, CRITIC | Complex code generation, orchestration, nuanced multi-pass review | Highest |
+| Sonnet | REQS, UXA, QA-A, QA-B, JUDGE-G1, JUDGE-G2, PMCP, Retrospective | Analysis, spec writing, test execution, judgment, coordination | Balanced |
+| Haiku | Knowledge, Embed, Help | Mechanical retrieval, indexing instructions, documentation lookups | Lowest |
+Model assignments are configurable in `pipeline-config.yaml` under the `models` section. Move agents between tiers to adjust the quality/cost tradeoff for your project.