npm - buildanything - Versions diffs - 2.0.0 → 2.1.2 - Mend

buildanything 2.0.0 → 2.1.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (115) hide show

package/.claude-plugin/marketplace.json +1 -1
package/.claude-plugin/plugin.json +9 -1
package/README.md +57 -61
package/agents/a11y-architect.md +2 -0
package/agents/briefing-officer.md +172 -0
package/agents/business-model.md +14 -12
package/agents/code-architect.md +6 -1
package/agents/code-reviewer.md +3 -2
package/agents/code-simplifier.md +12 -4
package/agents/design-brand-guardian.md +19 -0
package/agents/design-critic.md +16 -11
package/agents/design-inclusive-visuals-specialist.md +2 -0
package/agents/design-ui-designer.md +17 -0
package/agents/design-ux-architect.md +15 -0
package/agents/design-ux-researcher.md +102 -7
package/agents/engineering-ai-engineer.md +2 -0
package/agents/engineering-backend-architect.md +2 -0
package/agents/engineering-data-engineer.md +2 -0
package/agents/engineering-devops-automator.md +2 -0
package/agents/engineering-frontend-developer.md +13 -0
package/agents/engineering-mobile-app-builder.md +2 -0
package/agents/engineering-rapid-prototyper.md +15 -2
package/agents/engineering-security-engineer.md +2 -0
package/agents/engineering-senior-developer.md +13 -0
package/agents/engineering-sre.md +2 -0
package/agents/engineering-technical-writer.md +2 -0
package/agents/feature-intel.md +8 -7
package/agents/ios-app-review-guardian.md +2 -0
package/agents/ios-foundation-models-specialist.md +2 -0
package/agents/ios-product-reality-auditor.md +292 -0
package/agents/ios-storekit-specialist.md +2 -0
package/agents/ios-swift-architect.md +1 -0
package/agents/ios-swift-search.md +1 -0
package/agents/ios-swift-ui-design.md +7 -4
package/agents/marketing-app-store-optimizer.md +2 -0
package/agents/planner.md +6 -1
package/agents/pr-test-analyzer.md +3 -2
package/agents/product-feedback-synthesizer.md +62 -0
package/agents/product-owner.md +163 -0
package/agents/product-reality-auditor.md +216 -0
package/agents/product-spec-writer.md +176 -0
package/agents/refactor-cleaner.md +9 -1
package/agents/security-reviewer.md +2 -1
package/agents/silent-failure-hunter.md +2 -1
package/agents/swift-build-resolver.md +2 -0
package/agents/swift-reviewer.md +2 -1
package/agents/tech-feasibility.md +5 -3
package/agents/testing-api-tester.md +2 -0
package/agents/testing-evidence-collector.md +24 -0
package/agents/testing-performance-benchmarker.md +2 -0
package/agents/testing-reality-checker.md +2 -1
package/agents/visual-research.md +7 -5
package/bin/adapters/scribe-tool.ts +4 -2
package/bin/adapters/write-lease-tool.ts +1 -1
package/bin/buildanything-runtime.ts +20 -107
package/bin/graph-index.js +24 -0
package/bin/graph-index.ts +340 -0
package/bin/mcp-servers/graph-mcp.js +26 -0
package/bin/mcp-servers/graph-mcp.ts +481 -0
package/bin/mcp-servers/orchestrator-mcp.js +26 -0
package/bin/mcp-servers/orchestrator-mcp.ts +361 -0
package/bin/setup.js +272 -111
package/commands/build.md +424 -177
package/commands/idea-sweep.md +2 -2
package/commands/setup.md +15 -4
package/commands/ux-review.md +3 -3
package/commands/verify.md +3 -0
package/docs/migration/phase-graph.yaml +573 -157
package/hooks/design-md-lint +4 -0
package/hooks/design-md-lint.ts +295 -0
package/hooks/pre-tool-use.ts +37 -6
package/hooks/record-mode-transitions.ts +63 -6
package/hooks/subagent-start.ts +3 -2
package/package.json +3 -1
package/protocols/agent-prompt-authoring.md +165 -0
package/protocols/architecture-schema.md +10 -3
package/protocols/cleanup.md +4 -0
package/protocols/decision-log.md +8 -4
package/protocols/design-md-authoring.md +520 -0
package/protocols/design-md-spec.md +362 -0
package/protocols/fake-data-detector.md +1 -1
package/protocols/ios-fake-data-detector.md +65 -0
package/protocols/ios-phase-branches.md +128 -43
package/protocols/launch-readiness.md +9 -5
package/protocols/metric-loop.md +1 -1
package/protocols/page-spec-schema.md +234 -0
package/protocols/product-spec-schema.md +354 -0
package/protocols/sprint-tasks-schema.md +53 -0
package/protocols/state-schema.json +38 -3
package/protocols/state-schema.md +32 -2
package/protocols/verify.md +29 -1
package/protocols/web-phase-branches.md +246 -76
package/skills/ios/ios-bootstrap/SKILL.md +1 -1
package/src/graph/ids.ts +86 -0
package/src/graph/index.ts +32 -0
package/src/graph/parser/architecture.ts +603 -0
package/src/graph/parser/component-manifest.ts +268 -0
package/src/graph/parser/decisions-jsonl.ts +407 -0
package/src/graph/parser/design-md-pass2.ts +253 -0
package/src/graph/parser/design-md.ts +477 -0
package/src/graph/parser/page-spec.ts +496 -0
package/src/graph/parser/product-spec.ts +930 -0
package/src/graph/parser/screenshot.ts +342 -0
package/src/graph/parser/sprint-tasks.ts +317 -0
package/src/graph/storage/index.ts +1154 -0
package/src/graph/types.ts +432 -0
package/src/graph/util/dhash.ts +84 -0
package/src/lrr/aggregator.ts +105 -10
package/src/orchestrator/hooks/context-header.ts +34 -10
package/src/orchestrator/hooks/token-accounting.ts +25 -14
package/src/orchestrator/mcp/cycle-counter.ts +2 -1
package/src/orchestrator/mcp/scribe.ts +27 -16
package/src/orchestrator/mcp/write-lease.ts +30 -13
package/src/orchestrator/phase4-shared-context.ts +20 -4
package/protocols/visual-dna.md +0 -185

package/protocols/agent-prompt-authoring.md ADDED Viewed

@@ -0,0 +1,165 @@
+# Agent Prompt Authoring Standard
+You are producing text that will become another agent's prompt. Task briefs, finding reports, routing rows, fix specs, critic verdicts — when your output is read by a downstream agent and shapes its behavior, the quality of your text is the quality of that next dispatch's input.
+This protocol defines how to write that text well.
+## Who this applies to
+- `briefing-officer` — writes per-task briefs read by implementers
+- `product-feedback-synthesizer` — writes routing rows read by Phase 5.5 fix dispatches
+- `product-reality-auditor` — writes findings.json read by the synthesizer
+- `design-critic` — writes findings that drive metric-loop fix prompts
+- `product-owner` — writes feature-delegation plans read by briefing officers
+- `code-architect` — writes implementation blueprints read by sprint planner + implementers
+- LRR chapter judges — write verdicts read by the aggregator for backward routing
+If your output is consumed by a human, this protocol does not apply — write for humans normally.
+## The single rule
+The receiving agent has not seen this conversation. It cannot ask follow-up questions. It will read your output and act. If a smart colleague reading the same text would be confused, the receiving agent will be too.
+## Standards
+### 1. Ground in quotes, not paraphrase
+When your output references a source artifact (product-spec, page-spec, decisions.jsonl, findings.json, architecture.md), quote the source verbatim and carry the source location as a trailing reference. Paraphrase reintroduces the drift the graph layer was built to prevent.
+- Weak: `User has 3 attempts to enter a code per the spec`
+- Good: `User has 3 verification attempts before lockout (from product-spec.md L142)`
+If the graph returns the field directly, copy it character-for-character. ID-to-label resolution and list-filtering are the only allowed transformations.
+When the source field is negative by design (DNA `Don'ts`, banned phrases, system invariants), quote it verbatim — do not rewrite to positive framing. Standard 1 (verbatim) wins over Standard 2 (positive) when the source itself is intentionally negative.
+### 2. Tell the receiver what to do, not what not to do
+Positive prescriptions are unambiguous. Negative prescriptions force the receiver to enumerate the negative space and pick something from it — that's where drift happens.
+- Weak: `Don't use generic colors`
+- Good: `Use locked DNA palette: primary #0F172A, accent #F59E0B`
+- Weak: `Don't skip error handling`
+- Good: `On 4xx response, render the error toast component with the API body's message field`
+Negative framing is justified for actual invariants the receiver would otherwise rationalize around — see Standard 5 for when to escalate.
+### 3. Add the why for non-obvious constraints
+Without motivation, the receiver cannot judge edge cases. A stated reason lets it reason about whether the rule applies in a borderline situation.
+- Weak: `Use the manifest entry`
+- Good: `Use the manifest entry — this slot is HARD-GATE; rebuilding it from scratch will fail Phase 5 brand audit`
+- Weak: `Cap retries at 3`
+- Good: `Cap retries at 3 — backend rate-limits at 5 req/sec per user (from architecture.md L88)`
+### 4. Wrap structured inputs in XML tags
+When you slot a complex artifact into your output (wireframe, acceptance criteria, persona constraints, API contract), wrap it. The receiver parses the boundary unambiguously.
+```
+<wireframe>
+[verbatim ASCII from page-specs/checkout.md L20-45]
+</wireframe>
+<acceptance_criteria>
+- Form renders all required fields per layout
+- Submitting valid input calls POST /api/checkout once
+</acceptance_criteria>
+```
+This matters most when your output contains long-form quoted content (>10 lines). Short inline values do not need tags.
+### 5. Reserve HARD-GATE language for actual invariants
+`MUST`, `STOP`, `do NOT` lose force when overused. Reserve them for rules where violation breaks the pipeline. Soft preferences dressed up as hard gates dilute the genuine ones.
+Use HARD-GATE language for:
+- Pipeline-breaking failures (graph-call failure → STOP, no silent fallback)
+- Race-condition triggers (writer-owner violations corrupt files)
+- Audit-blocking decisions (manifest `hard_gate: true` slots)
+- Drift-introducing transformations on graph fields (verbatim slotting)
+Use normal prescriptive language for everything else: "Use", "Include", "Quote", "Reference". A rule stated once with rationale is stronger than the same rule stated three times in escalating caps.
+If you find yourself repeating a rule, the rule is fragile — fix it once at the source instead.
+## Worked example — task brief block
+### Weak brief
+```
+### Task 4.1: Build checkout
+- Agent: engineering-frontend-developer
+- Context:
+  - Build the checkout flow per the spec
+  - Make sure errors are handled
+  - Use good design
+- Acceptance: works
+```
+Violations: pointer-not-content (Standard 1), vague verb "handled" (Standard 2), uninstantiated "good design" (Standard 2), no testable criteria.
+### Improved brief
+```
+### Task 4.1: Build checkout payment-method form
+- Agent: engineering-frontend-developer
+- Skills: react-best-practices, shadcn-composition
+- Context:
+  <wireframe>
+  [verbatim ASCII from page-specs/checkout.md L20-45]
+  </wireframe>
+  - Components: shadcn Form.field, shadcn Input.text, shadcn Button.primary
+  - Tokens: colors.primary, spacing.lg
+  - API: POST /api/checkout, body {payment_method_id: string, amount_cents: number}, returns 200 {order_id} or 4xx {error: string}
+  - Error states:
+    - 402 → toast "Payment declined. Try a different card." (from product-spec.md L142)
+    - 422 → inline field error "Invalid card number" (from product-spec.md L145)
+  - Business rules: amount_cents minimum 50 (from product-spec.md L156)
+  - Persona: [Buyer] complete checkout in 3 fields max (from product-spec.md L167)
+- Acceptance:
+  - Form renders all fields specified in wireframe
+  - Submitting valid payment_method_id calls POST /api/checkout once
+  - 402 response shows toast within 200ms
+  - 422 response shows inline error tied to the offending field
+```
+Why this works: wireframe quoted verbatim, components carry library + variant, every constraint has a source ref, acceptance criteria are testable.
+## Worked example — synthesizer routing row
+### Weak finding
+```json
+{
+  "finding_id": "f-042",
+  "description": "Checkout has issues with persona requirements",
+  "target_phase": 1
+}
+```
+### Improved finding
+```json
+{
+  "finding_id": "f-042",
+  "source": "product-reality",
+  "feature_id": "checkout",
+  "target_phase": 1,
+  "target_task_or_step": "1.6",
+  "description": "Buyer persona constraint at product-spec.md L167 requires 3-field checkout, but spec also requires billing_address (5 fields) and shipping_address (5 fields). Constraint and required fields are mutually exclusive — spec needs disambiguation before Phase 4 can proceed.",
+  "evidence_path": "evidence/product-reality/checkout/findings.json#f-042"
+}
+```
+Why this works: contradiction stated concretely with line refs, evidence pointer is reachable, routing decision (`target_phase: 1`) is justified by the description text rather than asserted.
+## When you are tempted to break these rules
+- **"It's faster to summarize than to quote"** — the time you save now is paid with interest by the implementer who has to re-read the source to recover what you summarized away.
+- **"The receiver will figure it out"** — the receiver has no conversation history. It will guess, and the guess will be plausible-looking but wrong.
+- **"Adding the source ref is noisy"** — Phase 5 auditors and reviewers use source refs to spot-check. Without them, every claim is unverifiable.
+- **"This rule is critical, I should repeat it for emphasis"** — repetition signals fragility. State it once with the why; if the rule actually breaks the pipeline, escalate to HARD-GATE language at the rule's source instead of restating in every dispatch.

package/protocols/architecture-schema.md CHANGED Viewed

@@ -14,7 +14,7 @@ The synthesized `architecture.md` MUST contain these top-level headings, in this
 - `# Data Model`
 - `# Security`
 - `# Infrastructure`
-- `# MVP Scope`
+- `# Scope`
 - `# Out of Scope`
 For iOS builds, `# Frontend` MAY be titled `# App` and `# Backend` MAY be omitted if the app is fully on-device (the `refs.json` index reflects whatever headings exist). For web builds, all eight headings are required.
@@ -34,7 +34,7 @@ Every synthesized `architecture.md` MUST provide at least these subsection ancho
 ### Under `# Frontend`
-- `frontend/layout` — page hierarchy, routing, navigation structure. MUST state whether the product exposes a public, unauthenticated, content-indexable surface (yes / no / partial) — downstream SEO skill loading keys off this answer.
+- `frontend/layout` — page hierarchy, routing, navigation structure. MUST state whether the product exposes a public, unauthenticated, content-indexable surface (yes / no / partial) — downstream SEO skill loading keys off this answer. MUST include a navigation map listing every route with its nav element (sidebar item, tab, breadcrumb, direct URL only).
 - `frontend/components` — core component list and responsibilities.
 - `frontend/state` — state management approach (stores, context, local state boundaries).
 - `frontend/styling` — design tokens, CSS/styling approach.
@@ -42,7 +42,7 @@ Every synthesized `architecture.md` MUST provide at least these subsection ancho
 ### Under `# Backend`
 - `backend/services` — service boundaries and responsibilities.
-- `backend/api` — API contracts (routes, request/response shapes).
+- `backend/api` — API contracts (routes, request/response shapes). Each endpoint heading MUST include feature attribution annotations: `**POST /api/orders** (provides: order-placement)` or `**GET /api/inventory/{id}** (provides: inventory) (consumes: order-placement)`. Use the feature's kebab-case name from `product-spec.md`. `provides` = the feature that implements this endpoint; `consumes` = features that call it. These annotations are parsed by the graph indexer to emit `feature_provides_endpoint` / `feature_consumes_endpoint` edges — without them, the Product Owner cannot group features by API dependency.
 - `backend/persistence` — data layer, ORM choice, query patterns.
 ### Under `# Data Model`
@@ -56,6 +56,13 @@ Every synthesized `architecture.md` MUST provide at least these subsection ancho
 - `security/input-validation` — boundary validation rules.
 - `security/secrets` — secret storage and rotation.
+### Under `# Infrastructure`
+- `infrastructure/deployment` — deployment target (Vercel, AWS, self-hosted, TestFlight), CI/CD approach, environment strategy (dev/staging/prod).
+- `infrastructure/caching` — caching strategy (CDN, in-memory, database query cache), cache invalidation approach, what's cached and for how long. Stub with "N/A — no caching layer needed" if the product doesn't warrant it.
+- `infrastructure/background-jobs` — async processing model (email sending, webhook handling, scheduled tasks, queue system). Stub with "N/A — no background processing needed" if all operations are synchronous.
+- `infrastructure/env-vars` — required environment variables and secrets for each environment (dev/staging/prod). List every key the app needs to run, which service it authenticates with, and whether it's a secret or a config value. Format: `STRIPE_SECRET_KEY — Stripe payment processing — secret — required`.
 ## Ref format used by implementer prompts
 Phase 5.1 implementer prompts inject a `ARCHITECTURE REFS:` block in place of pasted architecture content. The block is parsed by the implementer agent, which uses the `Read` tool to fetch refs on demand.

package/protocols/cleanup.md CHANGED Viewed

@@ -52,3 +52,7 @@ After the cleanup agent finishes, spot-check that acceptance criteria still hold
 - Scope is sacred. Only files from the implementation changeset. Zero exceptions.
 - This runs AFTER implementation, BEFORE the metric loop.
 - If cleanup breaks acceptance criteria, revert and skip. Never block the metric loop on a cleanup failure.
+### Build Log Rotation
+At Phase 0, if `docs/plans/build-log.md` exceeds 500 lines, archive it to `docs/plans/build-log.prev.md` and start a fresh log. The archived log preserves full history for debugging. The 500-line threshold keeps the active log within a reasonable context budget for agents that read it.

package/protocols/decision-log.md CHANGED Viewed

@@ -11,12 +11,12 @@ Rows live in `docs/plans/decisions.jsonl`, one JSON object per line, append-only
   "decision_id": "D-<phase>-<seq>",
   "phase": "2.2",
   "timestamp": "<ISO8601>",
-  "decision": "chose SQLite over Postgres for MVP persistence",
+  "decision": "chose SQLite over Postgres for initial persistence",
   "chosen_approach": "SQLite with single-file .db in project root",
   "rejected_alternatives": [
     {
       "approach": "Postgres via Supabase",
-      "reason": "adds infra setup to Phase 0 prereqs; overkill for single-user MVP",
+      "reason": "adds infra setup to Phase 0 prereqs; overkill for single-user app",
       "revisit_criterion": "multi-user access OR >10k rows OR concurrent writes"
     }
   ],
@@ -96,12 +96,12 @@ Subagents return `deviation_row` objects in their structured result. The orchest
   "decision_id": "D-2-03",
   "phase": "2.2",
   "timestamp": "2026-04-13T16:05:41Z",
-  "decision": "chose SQLite over Postgres for MVP persistence",
+  "decision": "chose SQLite over Postgres for initial persistence",
   "chosen_approach": "SQLite with single-file .db in project root, migrations via drizzle-kit",
   "rejected_alternatives": [
     {
       "approach": "Postgres via Supabase",
-      "reason": "adds infra setup to Phase 0 prereqs; overkill for single-user MVP.",
+      "reason": "adds infra setup to Phase 0 prereqs; overkill for single-user app.",
       "revisit_criterion": "multi-user access OR >10k rows OR concurrent writes"
     },
     {
@@ -129,3 +129,7 @@ Subagents return `deviation_row` objects in their structured result. The orchest
 ## Ref Field Convention
 Every row carries a `ref` anchor (e.g., `architecture.md#backend/persistence` or `visual-design-spec.md#<anchor>`) that downstream readers use to widen context without pasting prose. The resume handler passes the row's short fields *plus* the ref — the resumed agent reads the anchor via its own Read tool if it needs the full context. This matches the existing `refs.json` pattern in `commands/build.md` (primary/secondary anchors handed to implementers instead of pasted content), and keeps rehydration token cost bounded to ~300 tokens for the top 5 rows regardless of how much architectural prose sits behind each anchor.
+### Malformed Row Handling
+When reading `decisions.jsonl` or `learnings.jsonl`, consumers MUST skip malformed rows (invalid JSON) with a warning logged to `build-log.md` rather than crashing. Shape: `"WARN: skipped malformed row at line N in [file]: [parse error]"`. This prevents a single corrupt append from blocking the entire LRR Aggregator or Learnings Loader.