npm - sweagent - Versions diffs - 0.0.2 → 0.0.4 - Mend

sweagent 0.0.2 → 0.0.4

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (11) hide show

package/README.md CHANGED Viewed

@@ -1,10 +1,10 @@
 <p align="center">
   <h1 align="center">sweagent</h1>
   <p align="center">
-    <strong>The planning layer that makes Cursor, Claude Code, and Codex 10x more effective.</strong>
+    <strong>The Deep Thinking layer that makes Cursor, Claude Code, and Codex 10x more effective.</strong>
   </p>
   <p align="center">
-    14 domain-specialized AI agent pipelines — Planning, Requirements, Data Modeling, API Design, Auth, Backend, Frontend, and more — each with dedicated orchestrators, sub-agents, and structured outputs. Generate implementation-ready blueprints, then hand them to your coding agent.
+    Deep, multi-stage reasoning before a single line of code is written. Domain-specialized agent pipelines generate structured blueprints -- requirements, data models, API contracts, auth flows, and architecture specs -- through iterative LLM calls and sub-agent decomposition. Hand the result to your coding agent.
   </p>
 </p>
@@ -16,15 +16,21 @@
 </p>
 <p align="center">
-  <a href="#what-is-sweagent">What is sweagent?</a> •
+  <a href="#the-problem">The Problem</a> •
   <a href="#use-with-cursor-claude-code-and-codex">Use with Coding Agents</a> •
-  <a href="#domain-agents">Domain Agents</a> •
+  <a href="#why-sweagent">Why sweagent?</a> •
+  <a href="#deep-reasoning-philosophy">Deep Reasoning</a> •
+  <a href="#how-it-works">How It Works</a> •
+  <a href="#features">Features</a> •
+  <a href="#planning-pipeline">Planning Pipeline</a> •
   <a href="#full-pipeline">Full Pipeline</a> •
-  <a href="#installation">Installation</a> •
+  <a href="#domain-agent-modules">Modules</a> •
   <a href="#getting-started">Getting Started</a> •
+  <a href="#installation">Installation</a> •
+  <a href="#mcp-server">MCP Server</a> •
   <a href="#architecture">Architecture</a> •
   <a href="#api-reference">API Reference</a> •
-  <a href="#domain-agent-modules">Modules</a> •
+  <a href="#reference">Reference</a> •
   <a href="#examples">Examples</a> •
   <a href="#contributing">Contributing</a>
 </p>
@@ -33,65 +39,59 @@
 ## Table of Contents
-- [What is sweagent?](#what-is-sweagent)
+- [The Problem](#the-problem)
 - [Use with Cursor, Claude Code, and Codex](#use-with-cursor-claude-code-and-codex)
 - [Why sweagent?](#why-sweagent)
-- [Domain Agents](#domain-agents)
-- [Full Pipeline](#full-pipeline)
-- [Planning Pipeline](#planning-pipeline)
+- [Deep Reasoning Philosophy](#deep-reasoning-philosophy)
+- [How It Works](#how-it-works)
 - [Features](#features)
-- [Installation](#installation)
+- [Planning Pipeline](#planning-pipeline)
+- [Full Pipeline](#full-pipeline)
+- [Domain Agent Modules](#domain-agent-modules)
 - [Getting Started](#getting-started)
+- [Installation](#installation)
+- [MCP Server](#mcp-server)
 - [Architecture](#architecture)
 - [API Reference](#api-reference)
-- [Domain Agent Modules](#domain-agent-modules)
+- [Reference](#reference)
 - [Examples](#examples)
-- [Configuration Reference](#configuration-reference)
-- [FAQ](#faq)
-- [Troubleshooting](#troubleshooting)
 - [Contributing](#contributing)
 - [License](#license)
 ---
-## What is sweagent?
+## The Problem
-AI coding agents -- Claude Code, Codex, Cursor -- are powerful executors, but they fail at planning. Hand one a vague requirement and it guesses a tech stack, skips data modeling, forgets auth, and produces half-finished code. Enterprise teams need the same rigor from AI that they expect from senior engineers: structured discovery, explicit requirements, deliberate design, and traceable decisions.
+AI coding agents -- Cursor, Claude Code, Codex -- are powerful executors, but they fail at planning. Hand one a vague requirement and it guesses a tech stack, skips data modeling, forgets auth, and produces half-finished code. Enterprise teams need the same rigor from AI that they expect from senior engineers: structured discovery, explicit requirements, deliberate design, and traceable decisions.
-**sweagent** is a library of **14 domain-specialized AI agent pipelines** that handle every stage of software planning at professional quality. Each domain -- planning, requirements, data modeling, API design, auth, backend, frontend -- gets its own **orchestrator agent** with dedicated **sub-agents**, **tools**, and **multi-stage pipelines** that produce structured, reviewable outputs.
+**Without sweagent**, a coding agent receives `"build a task manager"` and immediately starts writing code:
-| Stage               | Agent                         | Sub-Agents                                                 | Output                                            |
-| ------------------- | ----------------------------- | ---------------------------------------------------------- | ------------------------------------------------- |
-| **Planning**        | `runPlanningAgent`            | --                                                         | Implementation-ready markdown plan (11 sections)  |
-| **Requirements**    | `runRequirementGathererAgent` | --                                                         | Structured JSON (actors, flows, stories, modules) |
-| **Data Modeling**   | `runDataModelerAgent`         | `entity-analyzer`, `relationship-mapper`, `schema-refiner` | MongoDB/PostgreSQL schemas                        |
-| **DB Design**       | `runDbDesignerAgent`          | `entity-analyzer`, `schema-refiner`                        | MongoDB schemas with RBAC permissions             |
-| **API Design**      | `runApiDesignerAgent`         | `endpoint-analyzer`, `contract-designer`                   | REST and/or GraphQL API design                    |
-| **Auth Design**     | `runAuthDesignerAgent`        | `security-analyzer`, `flow-designer`                       | Auth strategy, flows, middleware, RBAC            |
-| **Backend Arch.**   | `runBackendArchitectAgent`    | `framework-selector`, `service-planner`                    | Backend architecture (Express/Apollo)             |
-| **Express Builder** | `runExpressBuilderAgent`      | `route-generator`, `middleware-configurator`               | Express.js REST API config                        |
-| **Apollo Builder**  | `runApolloBuilderAgent`       | `schema-generator`, `resolver-planner`                     | Apollo GraphQL subgraph config                    |
-| **Frontend Arch.**  | `runFrontendArchitectAgent`   | `page-planner`, `component-analyzer`                       | Frontend architecture (React/Next.js)             |
-| **React Builder**   | `runReactBuilderAgent`        | `graphql-analyzer`, `config-validator`                     | React + Vite app config from GraphQL              |
-| **Next.js Builder** | `runNextjsBuilderAgent`       | `route-planner`, `api-route-generator`                     | Next.js App Router config                         |
-| **Execution Plan**  | `runExecutionPlannerAgent`    | `edge-case-analyzer`, `testing-strategist`                 | Phased implementation plan                        |
-| **Hello World**     | `runHelloWorldAgent`          | --                                                         | Template module for custom agents                 |
+- Picks a random framework (maybe Express, maybe Fastify, who knows)
+- Invents a database schema on the fly, misses relationships
+- Forgets authentication entirely
+- Skips error handling and edge cases
+- Produces something that sort-of runs but needs a rewrite
+**With sweagent**, the coding agent receives a structured blueprint before writing a single line:
+- **11-section markdown plan** with tech stack, data models, API routes, auth flow, implementation order, edge cases, and testing checklist
+- **Structured JSON requirements** with actors, user flows, stories, and module breakdowns
+- **Database schemas** with exact field types, relationships, indexes, and validation rules
+- **API contracts** with endpoints, methods, request/response shapes, and auth requirements
+- **Frontend architecture** with pages, components, routing, and state management
 Each pipeline walks through structured stages -- discovery, analysis, design, synthesis -- not a single LLM call. The result is a professional-grade artifact that a coding agent can execute step-by-step, or that a human architect can review and approve.
 ```typescript
-import { runPlanningWithResult } from 'sweagent';
+import { runPlanningAgent } from 'sweagent';
-// Generate an implementation-ready plan -- validated by an LLM judge
-const { planning, plan } = await runPlanningWithResult({
+const result = await runPlanningAgent({
   input: 'Task manager app with user auth, task CRUD, assignments, and a dashboard',
   model: { provider: 'openai', model: 'gpt-4o-mini' },
 });
-if (planning) {
-  console.log('Plan is implementation-ready. Hand it to your coding agent.');
-  console.log(plan); // Full markdown blueprint
-}
+console.log('Plan is implementation-ready. Hand it to your coding agent.');
+console.log(result.output); // Full markdown blueprint
 ```
 TypeScript-first, built on the Vercel AI SDK, ships with all provider SDKs (OpenAI, Anthropic, Google). Set your API keys and go.
@@ -100,89 +100,99 @@ TypeScript-first, built on the Vercel AI SDK, ships with all provider SDKs (Open
 ## Use with Cursor, Claude Code, and Codex
-Coding agents are powerful executors -- but they build faster and better when they start from a structured plan instead of a vague prompt. sweagent generates the blueprints; your coding agent implements them.
+sweagent is an MCP server. Install it once, add one config to your IDE, and all 13 domain agents are available directly in your chat -- planning, requirements, data modeling, API design, auth, architecture, and more. No scripts, no code, no file juggling.
 ```mermaid
 flowchart LR
-  Requirement["Your idea"] --> sweagent["sweagent"]
-  sweagent --> Plan["plan.md / JSON spec"]
-  Plan --> Cursor["Cursor"]
-  Plan --> ClaudeCode["Claude Code"]
-  Plan --> Codex["Codex"]
-  Cursor --> Code["Production code"]
-  ClaudeCode --> Code
-  Codex --> Code
+  You["You type a prompt"] -->|"MCP"| Server["sweagent"]
+  Server --> Plan["plan"]
+  Server --> Req["gather_requirements"]
+  Server --> Data["design_data_model"]
+  Server --> Api["design_api"]
+  Server --> More["... 9 more tools"]
+  Plan --> Agent["Your coding agent implements it"]
+  Req --> Agent
+  Data --> Agent
+  Api --> Agent
+  More --> Agent
 ```
-### With Cursor
+### 1. Install
-Generate a plan, save it to your project, and reference it in Cursor chat or `.cursor/rules/`:
+```bash
+npm install -g sweagent
+```
-```typescript
-import { runPlanningWithResult } from 'sweagent';
-import { writeFileSync } from 'fs';
+### 2. Add the config for your IDE
-const { planning, plan } = await runPlanningWithResult({
-  input: 'E-commerce with users, products, cart, checkout, admin dashboard',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-});
+**Cursor** -- create `.cursor/mcp.json` in your project root:
-writeFileSync('plan.md', plan);
-// Open plan.md in Cursor and say: "Implement this plan step by step"
-// Or copy plan.md to .cursor/rules/ so every agent session uses it as context
+```json
+{
+  "mcpServers": {
+    "sweagent": {
+      "command": "sweagent",
+      "env": { "OPENAI_API_KEY": "your-openai-api-key" }
+    }
+  }
+}
 ```
-### With Claude Code
-Generate a plan and save it as `CLAUDE.md` or a reference file. Claude Code automatically reads `CLAUDE.md` for project context:
-```typescript
-import { runPlanningWithResult } from 'sweagent';
-import { writeFileSync } from 'fs';
+**VS Code (Copilot)** -- create `.vscode/mcp.json` in your project root:
-const { plan } = await runPlanningWithResult({
-  input: 'SaaS dashboard with multi-tenancy, billing, and analytics',
-  model: { provider: 'anthropic', model: 'claude-sonnet-4-20250514' },
-});
+```json
+{
+  "servers": {
+    "sweagent": {
+      "command": "sweagent",
+      "env": { "OPENAI_API_KEY": "your-openai-api-key" }
+    }
+  }
+}
+```
-// Option 1: Save as CLAUDE.md for automatic context
-writeFileSync('CLAUDE.md', `# Implementation Plan\n\n${plan}`);
+**Windsurf** -- edit `~/.codeium/windsurf/mcp_config.json`:
-// Option 2: Save as plan.md and reference it
-writeFileSync('plan.md', plan);
-// Then tell Claude Code: "Read plan.md and implement phase 1"
+```json
+{
+  "mcpServers": {
+    "sweagent": {
+      "command": "sweagent",
+      "env": { "OPENAI_API_KEY": "your-openai-api-key" }
+    }
+  }
+}
 ```
-### With Codex
+**Claude Desktop** -- edit `~/Library/Application Support/Claude/claude_desktop_config.json` (macOS) or `%APPDATA%\Claude\claude_desktop_config.json` (Windows):
-Codex works best with structured, machine-readable specs. Use the Requirement Gatherer or Data Modeler for JSON output:
+```json
+{
+  "mcpServers": {
+    "sweagent": {
+      "command": "sweagent",
+      "env": { "OPENAI_API_KEY": "your-openai-api-key" }
+    }
+  }
+}
+```
-```typescript
-import { runRequirementGathererAgent } from 'sweagent';
-import { writeFileSync } from 'fs';
+> **Don't want a global install?** Replace `"command": "sweagent"` with `"command": "npx"` and add `"args": ["-y", "sweagent"]`. See [MCP Server](#mcp-server) for all options including from-source setup.
-const result = await runRequirementGathererAgent({
-  input: 'Task manager with teams, Kanban boards, and time tracking',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
-});
+### 3. Restart your IDE and start prompting
-// Structured JSON with actors, flows, stories, modules, DB schema, API design
-writeFileSync('requirements.json', result.output);
-// Feed requirements.json to Codex as context for implementation
-```
+Open the chat and try:
-### Why this works
+- _"Use the plan tool to plan a task manager app with teams, Kanban boards, and time tracking."_
+- _"Use the gather_requirements tool to extract structured requirements for an e-commerce platform."_
+- _"Use the design_data_model tool to design a PostgreSQL schema for a SaaS billing system."_
-Without sweagent, a coding agent receives "build a task manager" and immediately starts guessing. With sweagent, it receives:
+The agent calls sweagent, gets back a structured blueprint, and can immediately start implementing it.
-- **11-section markdown plan** with tech stack, data models, API routes, auth flow, implementation order, edge cases, and testing checklist
-- **Structured JSON requirements** with actors, user flows, stories, and module breakdowns
-- **Database schemas** with exact field types, relationships, indexes, and validation rules
-- **API contracts** with endpoints, methods, request/response shapes, and auth requirements
-- **Frontend architecture** with pages, components, routing, and state management
+### Full reference
-The coding agent stops guessing and starts executing a professional-grade blueprint.
+- [MCP Server](#mcp-server) -- all 13 tools, input parameters, multi-provider setup, troubleshooting
+- [Getting Started](#getting-started) and [Full Pipeline](#full-pipeline) -- programmatic API for scripted pipelines, CI, or chaining multiple agents in code
 ---
@@ -190,7 +200,7 @@ The coding agent stops guessing and starts executing a professional-grade bluepr
 ### 1. Domain-specialized agents, not generic wrappers
-Each module is a self-contained agent pipeline purpose-built for its domain. The DB Designer doesn't reuse the Planning Agent's prompts -- it has its own `entity-analyzer` and `schema-refiner` sub-agents, its own tools (`design_database`, `validate_schema`), and its own output schema. The React Builder has a `graphql-analyzer` and `config-validator`. Every domain gets the specialized treatment it deserves.
+Each module is a self-contained agent pipeline purpose-built for its domain. The Data Modeler doesn't reuse the Planning Agent's prompts -- it has its own `entity-analyzer` and `schema-refiner` sub-agents, its own tools (`design_schema`, `validate_data_model`), and its own output schema. The React Builder has a `graphql-analyzer` and `config-validator`. Every domain gets the specialized treatment it deserves.
 ### 2. Multi-stage pipelines with structured outputs
@@ -198,7 +208,7 @@ Every domain agent progresses through deliberate stages -- discovery, requiremen
 ### 3. Sub-agent orchestration for complex domains
-When a domain is too complex for a single agent, sweagent delegates to specialized sub-agents. The DB Designer orchestrator spawns an `entity-analyzer` to extract entities and relationships, then a `schema-refiner` to normalize and validate the schema. The React Builder uses a `graphql-analyzer` to parse the schema and a `config-validator` to verify the output. Sub-agents run in isolation with their own context, tools, and models -- then return condensed results to the orchestrator.
+When a domain is too complex for a single agent, sweagent delegates to specialized sub-agents. The Data Modeler orchestrator spawns an `entity-analyzer` to extract entities and relationships, then a `schema-refiner` to normalize and validate the schema. The React Builder uses a `graphql-analyzer` to parse the schema and a `config-validator` to verify the output. Sub-agents run in isolation with their own context, tools, and models -- then return condensed results to the orchestrator.
 ### 4. Enterprise-quality output, not bullet points
@@ -214,571 +224,641 @@ Long-running agents fail when they lose context. sweagent encodes patterns for s
 ---
-## Domain Agents
+## Deep Reasoning Philosophy
+Coding agents generate code fast. But speed without depth produces fragile, incomplete software. sweagent applies deep reasoning -- structured, multi-stage, decomposed -- so your coding agent receives a blueprint that has been thoroughly worked through before implementation begins.
-sweagent ships with 14 domain agent modules organized across the full software planning pipeline. Each is a complete pipeline with its own orchestrator, tools, sub-agents, and output format.
+### Multi-Stage Reasoning
+sweagent never produces one-shot answers. The Planning Agent progresses through four deliberate stages -- discovery, requirements, design, synthesis -- with dedicated LLM calls at each step. Each stage consumes the output of the previous one, building context incrementally rather than cramming everything into a single prompt. Requirements inform design decisions, design decisions shape API contracts, and API contracts feed the implementation order.
 ```mermaid
 flowchart LR
-  subgraph discovery [Discovery and Planning]
-    Planning["Planning"]
+  Input["Requirement"] --> Discovery["Discovery"]
+  Discovery --> Requirements["Requirements"]
+  Requirements --> SubReq1["entity-analyzer"]
+  Requirements --> SubReq2["page-planner"]
+  Requirements --> SubReq3["flow-designer"]
+  SubReq1 --> Design["Design"]
+  SubReq2 --> Design
+  SubReq3 --> Design
+  Design --> SubDes1["endpoint-analyzer"]
+  Design --> SubDes2["contract-designer"]
+  SubDes1 --> Synthesis["Synthesis"]
+  SubDes2 --> Synthesis
+  Synthesis --> Output["Blueprint"]
+```
+### Sub-Agent Decomposition
+When a domain is too complex for a single agent, sweagent delegates to specialized sub-agents that run in isolation with their own context, tools, and models. The Data Modeler spawns an `entity-analyzer` to extract entities, a `relationship-mapper` for cardinality, and a `schema-refiner` to normalize the result. The React Builder uses a `graphql-analyzer` to parse schema structure and a `config-validator` to check the output. Each sub-agent returns condensed results to its orchestrator, keeping context windows focused and reasoning sharp.
+### Inference-Time Depth
+Unlike single-prompt planning tools, sweagent deliberately spends more inference-time compute for higher-quality output. The Planning Agent alone makes 12+ LLM calls across 4 stages and 7 sub-agents. The Data Modeler adds 3 more sub-agent calls. The full pipeline chains 7+ domain agents end-to-end, each with its own multi-call pipeline. This is not a design accident -- more structured reasoning steps produce measurably better blueprints than a single large prompt, the same way a senior engineer produces better architecture by working through each layer separately rather than designing everything at once.
+---
+## How It Works
+sweagent is not a single agent. It is a system of domain-specialized agent pipelines organized across the full software planning lifecycle. Each module can run independently, or you can chain them into a full-stack specification pipeline.
+```mermaid
+flowchart TB
+  subgraph input [Your Idea]
+    Idea["Natural language requirement"]
+  end
+  subgraph discovery [Discovery Layer]
     ReqGatherer["Requirement Gatherer"]
-    ExecPlanner["Execution Planner"]
+    Planning["Planning Agent"]
   end
-  subgraph data [Data Layer]
-    DataModeler["Data Modeler"]
-    DbDesigner["DB Designer"]
+  subgraph bridge [Bridge]
+    FromReqs["from-requirements"]
   end
-  subgraph api [API Layer]
+  subgraph specialists [Specialist Architects]
+    DataModeler["Data Modeler"]
     ApiDesigner["API Designer"]
     AuthDesigner["Auth Designer"]
+    BackendArch["Backend Architect"]
+    FrontendArch["Frontend Architect"]
   end
-  subgraph backend [Backend]
-    BackendArch["Backend Architect"]
+  subgraph builders [Framework Builders]
     ExpressBuilder["Express Builder"]
     ApolloBuilder["Apollo Builder"]
-  end
-  subgraph frontend [Frontend]
-    FrontendArch["Frontend Architect"]
     ReactBuilder["React Builder"]
     NextjsBuilder["Next.js Builder"]
   end
+  subgraph execution [Execution]
+    ExecPlanner["Execution Planner"]
+    CodingAgent["Cursor / Claude Code / Codex"]
+  end
+  Idea --> ReqGatherer
+  Idea --> Planning
+  ReqGatherer -->|"FinalRequirement JSON"| FromReqs
+  FromReqs -->|"PlanningContext"| Planning
   ReqGatherer --> DataModeler
-  ReqGatherer --> DbDesigner
   DataModeler --> ApiDesigner
-  DbDesigner --> ApiDesigner
   ApiDesigner --> AuthDesigner
   AuthDesigner --> BackendArch
-  BackendArch --> ExpressBuilder
-  BackendArch --> ApolloBuilder
   ApiDesigner --> FrontendArch
-  FrontendArch --> ReactBuilder
-  FrontendArch --> NextjsBuilder
+  BackendArch -->|"Express selected"| ExpressBuilder
+  BackendArch -->|"Apollo selected"| ApolloBuilder
+  FrontendArch -->|"React selected"| ReactBuilder
+  FrontendArch -->|"Next.js selected"| NextjsBuilder
   Planning --> ExecPlanner
+  ExecPlanner --> CodingAgent
+  builders --> CodingAgent
+  specialists --> CodingAgent
 ```
-### Planning Agent
-Turns a natural-language project description into an implementation-ready markdown plan through 4 stages and 8+ LLM calls. Covers tech stack, data models, API routes, implementation order, edge cases, and testing checklists. Optional LLM validation judges completeness.
+### Three usage modes
-```typescript
-import { runPlanningWithResult } from 'sweagent';
+**1. Quick plan** -- Planning Agent standalone. One call, one markdown plan. Best for getting a coding agent started fast.
-const { planning, plan } = await runPlanningWithResult({
-  input: 'E-commerce: users, products, cart, checkout, admin dashboard',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-});
-// planning === true means the plan passed all validation criteria
-// plan is the full markdown blueprint
-```
+**2. Structured requirements** -- Requirement Gatherer produces typed JSON (actors, flows, stories, modules, database, API). Feed the JSON to specialist modules (Data Modeler, API Designer, etc.) for detailed specs per layer. Use `runPlanningFromRequirements` to bridge requirement-gatherer output into the planning pipeline, skipping redundant discovery stages.
-**Stages:** Discovery, Requirements (4 LLM calls), Design (2 LLM calls), Synthesis | **Output:** Markdown | **Modes:** One-shot, validated, interactive chat
+**3. Full pipeline** -- Chain all agents together. Each agent's output feeds the next: requirements -> data model -> API design -> auth -> backend architecture -> frontend architecture. Save all specs and hand the directory to your coding agent.
 ---
-### Requirement Gatherer Agent
-Produces structured JSON requirements -- not prose. Extracts actors with permissions, user flows with step-by-step sequences, user stories with acceptance criteria, and module breakdowns with CRUD operations, database schemas, and API designs.
-```typescript
-import { runRequirementGathererAgent } from 'sweagent';
-const result = await runRequirementGathererAgent({
-  input: 'Project management tool with teams, Kanban boards, and time tracking',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
-});
-// result.output contains structured JSON: actors, flows, stories, modules
-```
+## Features
-**Stages:** Discovery, Requirements, Design, Synthesis | **Output:** Structured JSON | **Schemas:** Actors, Flows, Stories, Modules, Database, API
+| Feature                     | Description                                                                                                                                                                                                                                                       |
+| --------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
+| **Domain Agent Modules**    | Planning, Requirement Gatherer, Data Modeler, API Designer, Auth Designer, Backend Architect, Express Builder, Apollo Builder, Frontend Architect, React Builder, Next.js Builder, Execution Planner, and Hello World template -- each a self-contained pipeline. |
+| **Multi-Stage Pipelines**   | Every domain agent progresses through structured stages (discovery, requirements, design, synthesis) with dedicated LLM calls at each step. No single-shot prompts.                                                                                               |
+| **Sub-Agent Orchestration** | Complex domains delegate to specialized sub-agents (`entity-analyzer`, `schema-refiner`, `graphql-analyzer`, `config-validator`) that run in isolation and return condensed results.                                                                              |
+| **Structured Outputs**      | Requirements as typed JSON (actors, flows, stories, modules). DB schemas with field-level detail. Frontend configs with pages, hooks, and branding. Plans with 11 sections.                                                                                       |
+| **Multi-Provider Models**   | Unified API for OpenAI, Anthropic, and Google. One `createModel()` call, zero provider lock-in.                                                                                                                                                                   |
+| **Type-Safe Tools**         | Define tools with Zod schemas; full type inference and validation before execution. Minimal, workflow-oriented tool sets.                                                                                                                                         |
+| **Agent Framework**         | Iterative agent loop with tool calling, step callbacks, and configurable max iterations.                                                                                                                                                                          |
+| **MCP Protocol**            | Connect to Model Context Protocol servers over HTTP or stdio. Lazy connection, typed tool invocation.                                                                                                                                                             |
+| **Vision**                  | Image inputs via `model.generateVision()` for vision-capable models.                                                                                                                                                                                              |
+| **Zero Extra Deps**         | All provider SDKs (OpenAI, Anthropic, Google) included. Set API keys and run.                                                                                                                                                                                     |
 ---
-### DB Designer Agent
+## Planning Pipeline
-An orchestrator agent that delegates to specialized sub-agents for entity analysis and schema refinement. Produces MongoDB-style schemas with modules, fields, relationships, indexes, and validation rules.
+The planning module is the centerpiece for AI coding agents. It turns a natural-language project description into a structured, implementation-ready markdown plan through four stages.
-```typescript
-import { runDbDesignerAgent } from 'sweagent';
+### How it works
-const result = await runDbDesignerAgent({
-  input: 'E-commerce: users, orders, products. Admins manage products.',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
-});
-// result.output contains the full database schema
+```mermaid
+flowchart LR
+  Input["User Requirement"] --> Discovery["Discovery"]
+  Discovery --> Requirements["Requirements"]
+  Requirements --> Design["Design"]
+  Design --> Synthesis["Synthesis"]
+  Synthesis --> Output["plan.md"]
 ```
-**Sub-agents:** `entity-analyzer` (extracts entities and relationships), `schema-refiner` (normalizes and validates) | **Tools:** `design_database`, `design_database_pro`, `redesign_database`, `validate_schema` | **Output:** MongoDB schema JSON
----
-### React Builder Agent
-Generates complete frontend application configuration from a GraphQL schema. A `graphql-analyzer` sub-agent parses the schema structure, and a `config-validator` sub-agent verifies the output. Produces app config, modules, pages, fields, and API hooks.
+### Stages
-```typescript
-import { runReactBuilderAgent } from 'sweagent';
+| Stage            | What it produces                                   | Sections                                                            |
+| ---------------- | -------------------------------------------------- | ------------------------------------------------------------------- |
+| **Discovery**    | Understands the project, asks clarifying questions | Project overview                                                    |
+| **Requirements** | 4 sequential LLM calls to flesh out the spec       | Tech stack, feature decisions, data models, pages/routes, auth flow |
+| **Design**       | 2 sequential LLM calls for technical design        | API routes, implementation details                                  |
+| **Synthesis**    | Assembles the final plan                           | Implementation order, edge cases, testing checklist                 |
-const result = await runReactBuilderAgent({
-  input: 'GraphQL schema: type User { id: ID! name: String! } type Task { ... }',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
-});
-// result.output contains frontend config JSON
-```
+### Output
-**Sub-agents:** `graphql-analyzer` (schema parsing), `config-validator` (output verification) | **Tools:** `generate_frontend`, `generate_feature_breakdown`, `validate_frontend_config` | **Output:** React app config JSON
+The plan is a markdown document with these sections:
----
+- **Overview** -- project scope and goals
+- **Tech Stack** -- languages, frameworks, database, auth approach
+- **Feature Decisions** -- what to build and what to defer
+- **Data Models** -- schemas, relationships, fields
+- **Pages and Routes** -- frontend structure
+- **Authentication Flow** -- auth strategy and implementation
+- **API Routes** -- endpoints, methods, request/response shapes
+- **Implementation Details** -- architecture decisions, file structure
+- **Execution Plan** -- phased implementation order
+- **Edge Cases** -- error handling, boundary conditions
+- **Testing Checklist** -- what to verify at each phase
-### Data Modeler Agent
+### Two modes
-Designs data models for MongoDB or PostgreSQL with entities, fields, indexes, and relationships. Uses three sub-agents for entity analysis, relationship mapping, and schema refinement.
+**One-shot mode** -- pass a requirement, get a plan:
 ```typescript
-import { runDataModelerAgent } from 'sweagent';
+import { runPlanningAgent } from 'sweagent';
-const result = await runDataModelerAgent({
-  input: 'SaaS platform with organizations, users, projects, and billing',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
+const result = await runPlanningAgent({
+  input: 'Fitness app with workouts, nutrition tracking, and social features',
+  model: { provider: 'anthropic', model: 'claude-sonnet-4-20250514' },
 });
-// result.output: DataModelDesign JSON with entities, fields, indexes, relationships
+console.log(result.output); // Full plan markdown
 ```
-**Sub-agents:** `entity-analyzer`, `relationship-mapper`, `schema-refiner` | **Tools:** `design_schema`, `design_schema_pro`, `refine_schema`, `validate_data_model` | **Output:** Data model JSON (MongoDB or PostgreSQL)
----
-### API Designer Agent
-Designs REST and/or GraphQL APIs from data models, producing endpoint definitions with request/response contracts, auth requirements, and operation details.
+**Interactive chat mode** -- multi-turn conversation where you refine the plan:
 ```typescript
-import { runApiDesignerAgent } from 'sweagent';
+import { processPlanningChat } from 'sweagent';
+import type { PlanningContext } from 'sweagent';
-const result = await runApiDesignerAgent({
-  input: 'Design REST API for a task manager with users, projects, and tasks',
+let context: PlanningContext | null = null;
+// Turn 1: describe the project
+const turn1 = await processPlanningChat('Build a task manager with teams', context, {
   model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
 });
-// result.output: ApiDesign JSON with REST endpoints and/or GraphQL operations
-```
+context = turn1.context;
+console.log(turn1.message); // Assistant asks clarifying questions
+console.log(turn1.pendingQuestions); // ["What auth provider?", ...]
-**Sub-agents:** `endpoint-analyzer`, `contract-designer` | **Tools:** `design_api`, `design_api_pro`, `validate_api` | **Output:** API design JSON (REST/GraphQL)
+// Turn 2: answer and advance
+const turn2 = await processPlanningChat('Use NextAuth with GitHub OAuth', context, {
+  model: { provider: 'openai', model: 'gpt-4o-mini' },
+});
+context = turn2.context;
----
+// Continue until turn.planMarkdown is set (plan complete)
+```
-### Auth Designer Agent
+### Requirements to plan (the bridge)
-Designs authentication and authorization systems with strategies, flows, middleware, roles, and security policies.
+When you already have structured requirements from the Requirement Gatherer, `runPlanningFromRequirements` converts the `FinalRequirement` JSON into a `PlanningContext` pre-filled at the design stage, skipping the redundant discovery and requirements stages:
 ```typescript
-import { runAuthDesignerAgent } from 'sweagent';
+import { runRequirementGathererAgent, runPlanningFromRequirements } from 'sweagent';
-const result = await runAuthDesignerAgent({
-  input: 'JWT auth with email/password, Google OAuth, role-based access (admin, member)',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
+const model = { provider: 'openai', model: 'gpt-4o-mini' } as const;
+// Step 1: Gather structured requirements (actors, flows, stories, modules, DB)
+const reqResult = await runRequirementGathererAgent({
+  input: 'Task manager with teams, Kanban boards, and time tracking',
+  model,
   maxIterations: 15,
 });
-// result.output: AuthDesign JSON with strategy, flows, middleware, roles, policies
-```
-**Sub-agents:** `security-analyzer`, `flow-designer` | **Tools:** `design_auth`, `validate_auth` | **Output:** Auth design JSON
+// Step 2: Convert requirements into a planning context and generate the plan
+// Skips discovery + requirements stages; begins at design
+const planResult = await runPlanningFromRequirements({
+  requirement: JSON.parse(reqResult.output),
+  model,
+});
+console.log(planResult.output); // Full implementation plan as markdown
+```
 ---
-### Frontend Architect Agent
+## Full Pipeline
-Plans frontend architecture including pages, components, routing, and state management. Routes to React Builder or Next.js Builder based on framework selection.
+Chain multiple agents together to go from a vague idea to implementation-ready specs for every layer of your stack. Each agent's output feeds the next.
 ```typescript
-import { runFrontendArchitectAgent } from 'sweagent';
+import {
+  runRequirementGathererAgent,
+  runDataModelerAgent,
+  runApiDesignerAgent,
+  runAuthDesignerAgent,
+  runBackendArchitectAgent,
+  runFrontendArchitectAgent,
+  runExecutionPlannerAgent,
+} from 'sweagent';
+import { writeFileSync } from 'fs';
-const result = await runFrontendArchitectAgent({
-  input: 'Dashboard app with analytics, settings, and user management pages',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
+const model = { provider: 'openai', model: 'gpt-4o-mini' } as const;
+// Step 1: Gather structured requirements
+const requirements = await runRequirementGathererAgent({
+  input: 'Project management SaaS with teams, Kanban boards, time tracking, and billing',
+  model,
   maxIterations: 15,
 });
-// result.output: FrontendDesign JSON with pages, components, state management, routing
-```
-**Sub-agents:** `page-planner`, `component-analyzer`, `framework-selector` | **Output:** Frontend design JSON
----
-### Backend Architect Agent
-Plans backend architecture including framework selection, services, middleware, routes, and folder structure. Routes to Express Builder or Apollo Builder based on framework choice.
-```typescript
-import { runBackendArchitectAgent } from 'sweagent';
-const result = await runBackendArchitectAgent({
-  input: 'REST API backend with user auth, CRUD operations, and file uploads',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
+// Step 2: Design the data model from requirements
+const dataModel = await runDataModelerAgent({
+  input: `Design a data model based on these requirements:\n${requirements.output}`,
+  model,
   maxIterations: 15,
 });
-// result.output: BackendDesign JSON with framework, services, middleware, routes
-```
-**Sub-agents:** `framework-selector`, `service-planner` | **Tools:** `design_backend`, `validate_backend` | **Output:** Backend design JSON
----
-### Express Builder Agent
+// Step 3: Design the API from the data model
+const apiDesign = await runApiDesignerAgent({
+  input: `Design REST API for this data model:\n${dataModel.output}`,
+  model,
+  maxIterations: 15,
+});
-Generates Express.js REST API configuration with routers, models, middleware, and environment variables.
+// Step 4: Design auth from the requirements and API
+const authDesign = await runAuthDesignerAgent({
+  input: `Design auth for this project:\nRequirements: ${requirements.output}\nAPI: ${apiDesign.output}`,
+  model,
+  maxIterations: 15,
+});
-```typescript
-import { runExpressBuilderAgent } from 'sweagent';
+// Step 5: Plan backend architecture
+const backendDesign = await runBackendArchitectAgent({
+  input: `Design backend:\nData model: ${dataModel.output}\nAPI: ${apiDesign.output}\nAuth: ${authDesign.output}`,
+  model,
+  maxIterations: 15,
+});
-const result = await runExpressBuilderAgent({
-  input: 'Express API for e-commerce with products, orders, and user auth',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
+// Step 6: Plan frontend architecture
+const frontendDesign = await runFrontendArchitectAgent({
+  input: `Design frontend:\nAPI: ${apiDesign.output}\nRequirements: ${requirements.output}`,
+  model,
   maxIterations: 15,
 });
-// result.output: ExpressConfig JSON with routers, models, middleware, env vars
+// Save all specs for your coding agent
+writeFileSync('specs/requirements.json', requirements.output);
+writeFileSync('specs/data-model.json', dataModel.output);
+writeFileSync('specs/api-design.json', apiDesign.output);
+writeFileSync('specs/auth-design.json', authDesign.output);
+writeFileSync('specs/backend-design.json', backendDesign.output);
+writeFileSync('specs/frontend-design.json', frontendDesign.output);
+// Now hand the specs/ directory to Cursor, Claude Code, or Codex
+// "Implement the backend using specs/backend-design.json and specs/data-model.json"
 ```
-**Sub-agents:** `route-generator`, `middleware-configurator` | **Tools:** `generate_express`, `scaffold_express`, `validate_express` | **Output:** Express config JSON
+You can also run individual agents standalone -- each works independently with natural-language input. The pipeline approach gives you maximum control over each design decision.
 ---
-### Apollo Builder Agent
+## Domain Agent Modules
-Generates Apollo GraphQL subgraph configuration with modules, types, resolvers, datasources, and Federation v2 support.
+Each module is a self-contained domain agent with its own orchestrator, pipeline stages, tools, sub-agents, and output format. All are exported from the main package.
+| Stage               | Agent                         | Sub-Agents                                                 | Output                                            |
+| ------------------- | ----------------------------- | ---------------------------------------------------------- | ------------------------------------------------- |
+| **Planning**        | `runPlanningAgent`            | --                                                         | Implementation-ready markdown plan (11 sections)  |
+| **Requirements**    | `runRequirementGathererAgent` | --                                                         | Structured JSON (actors, flows, stories, modules) |
+| **Data Modeling**   | `runDataModelerAgent`         | `entity-analyzer`, `relationship-mapper`, `schema-refiner` | MongoDB/PostgreSQL schemas                        |
+| **API Design**      | `runApiDesignerAgent`         | `endpoint-analyzer`, `contract-designer`                   | REST and/or GraphQL API design                    |
+| **Auth Design**     | `runAuthDesignerAgent`        | `security-analyzer`, `flow-designer`                       | Auth strategy, flows, middleware, RBAC            |
+| **Backend Arch.**   | `runBackendArchitectAgent`    | `framework-selector`, `service-planner`                    | Backend architecture (Express/Apollo)             |
+| **Express Builder** | `runExpressBuilderAgent`      | `route-generator`, `middleware-configurator`               | Express.js REST API config                        |
+| **Apollo Builder**  | `runApolloBuilderAgent`       | `schema-generator`, `resolver-planner`                     | Apollo GraphQL subgraph config                    |
+| **Frontend Arch.**  | `runFrontendArchitectAgent`   | `page-planner`, `component-analyzer`, `framework-selector` | Frontend architecture (React/Next.js)             |
+| **React Builder**   | `runReactBuilderAgent`        | `graphql-analyzer`, `config-validator`                     | React + Vite app config from GraphQL              |
+| **Next.js Builder** | `runNextjsBuilderAgent`       | `route-planner`, `api-route-generator`                     | Next.js App Router config                         |
+| **Execution Plan**  | `runExecutionPlannerAgent`    | `edge-case-analyzer`, `testing-strategist`                 | Phased implementation plan                        |
+| **Hello World**     | `runAgent` + `helloWorldTool` | --                                                         | Template module for custom agents                 |
+---
+### Planning Agent
+Turns a natural-language project description into an implementation-ready markdown plan through 4 stages and 12+ LLM calls. Covers tech stack, data models, API routes, implementation order, edge cases, and testing checklists.
+| Attribute         | Detail                                                                  |
+| ----------------- | ----------------------------------------------------------------------- |
+| **Stages**        | Discovery, Requirements (4 LLM calls), Design (2 LLM calls), Synthesis  |
+| **Sub-Agents**    | --                                                                      |
+| **Tools**         | -- (pipeline stages, not tool-based)                                    |
+| **Output Format** | Markdown plan (11 sections)                                             |
+| **Modes**         | One-shot (`runPlanningAgent`), interactive chat (`processPlanningChat`) |
 ```typescript
-import { runApolloBuilderAgent } from 'sweagent';
+import { runPlanningAgent } from 'sweagent';
-const result = await runApolloBuilderAgent({
-  input: 'Apollo subgraph for a task manager with users, projects, and tasks',
+const result = await runPlanningAgent({
+  input: 'E-commerce: users, orders, products. Admins manage products.',
   model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
 });
-// result.output: SubgraphConfig JSON with modules, types, operations, datasources
+console.log(result.output); // Full markdown blueprint
 ```
-**Sub-agents:** `schema-generator`, `resolver-planner` | **Tools:** `generate_subgraph`, `scaffold_subgraph`, `validate_subgraph` | **Output:** Apollo subgraph config JSON
+See [Planning Pipeline](#planning-pipeline) for stage-by-stage details.
 ---
-### Next.js Builder Agent
+### Requirement Gatherer Agent
-Generates Next.js App Router configuration with pages, layouts, API routes, server actions, and middleware.
+Produces structured JSON requirements -- not prose. Unlike the Planning module (markdown output), the Requirement Gatherer extracts typed data that downstream systems can consume programmatically.
+| Attribute         | Detail                                                                                |
+| ----------------- | ------------------------------------------------------------------------------------- |
+| **Stages**        | Discovery, Requirements, Design, Synthesis                                            |
+| **Sub-Agents**    | --                                                                                    |
+| **Tools**         | Stage-specific tools                                                                  |
+| **Output Format** | Structured JSON                                                                       |
+| **Schemas**       | Actors, User Flows, User Stories, Modules, Database Design, API Design                |
+| **Modes**         | One-shot (`runRequirementGathererAgent`), interactive chat (`processRequirementChat`) |
+**Output structure:** Actors (with permissions), User Flows (step-by-step sequences), User Stories (with acceptance criteria), Modules (with CRUD operations), Database Design (schemas, relationships), API Design (REST/GraphQL endpoints).
 ```typescript
-import { runNextjsBuilderAgent } from 'sweagent';
+import { runRequirementGathererAgent } from 'sweagent';
-const result = await runNextjsBuilderAgent({
-  input: 'Next.js app for project management with teams, tasks, and dashboards',
+const result = await runRequirementGathererAgent({
+  input: 'Project management tool with teams and Kanban boards',
   model: { provider: 'openai', model: 'gpt-4o-mini' },
   maxIterations: 15,
 });
-// result.output: NextjsConfig JSON with pages, layouts, API routes, server actions
+// result.output: structured JSON with actors, flows, stories, modules
 ```
-**Sub-agents:** `route-planner`, `api-route-generator` | **Tools:** `generate_nextjs`, `validate_nextjs` | **Output:** Next.js config JSON
 ---
-### Execution Planner Agent
+### Data Modeler Agent
-Creates phased implementation execution plans from plan sections, with edge case analysis and testing checklists.
+Designs data models for MongoDB or PostgreSQL with entities, fields, indexes, and relationships. Uses three sub-agents for entity analysis, relationship mapping, and schema refinement.
+| Attribute         | Detail                                                                       |
+| ----------------- | ---------------------------------------------------------------------------- |
+| **Pattern**       | Orchestrator with sub-agents                                                 |
+| **Sub-Agents**    | `entity-analyzer`, `relationship-mapper`, `schema-refiner`                   |
+| **Tools**         | `design_schema`, `design_schema_pro`, `refine_schema`, `validate_data_model` |
+| **Output Format** | Data model JSON (entities, fields, indexes, relationships)                   |
+| **Databases**     | MongoDB, PostgreSQL                                                          |
 ```typescript
-import { runExecutionPlannerAgent } from 'sweagent';
+import { runDataModelerAgent } from 'sweagent';
-const result = await runExecutionPlannerAgent({
-  input: 'Create execution plan for the task manager project',
+const result = await runDataModelerAgent({
+  input: 'SaaS platform with organizations, users, projects, and billing',
   model: { provider: 'openai', model: 'gpt-4o-mini' },
   maxIterations: 15,
 });
-// result.output: ExecutionPlan JSON with phases, edge cases, testing checklist
+// result.output: DataModelDesign JSON with entities, fields, indexes, relationships
 ```
-**Sub-agents:** `edge-case-analyzer`, `testing-strategist` | **Tools:** `create_execution_plan`, `validate_execution_plan` | **Output:** Execution plan JSON
 ---
-### Hello World (Template)
+### API Designer Agent
-Minimal example module with a single greeting tool. Use as a starting point when building your own domain agent module.
+Designs REST and/or GraphQL APIs from data models, producing endpoint definitions with request/response contracts, auth requirements, and operation details.
+| Attribute         | Detail                                                                                                  |
+| ----------------- | ------------------------------------------------------------------------------------------------------- |
+| **Pattern**       | Orchestrator with sub-agents                                                                            |
+| **Sub-Agents**    | `endpoint-analyzer` (derives endpoints from data model), `contract-designer` (designs request/response) |
+| **Tools**         | `design_api`, `design_api_pro`, `validate_api`                                                          |
+| **Output Format** | API design JSON (REST endpoints and/or GraphQL operations)                                              |
 ```typescript
-import { createModel, runAgent, helloWorldTool } from 'sweagent';
+import { runApiDesignerAgent } from 'sweagent';
-const result = await runAgent({
-  model: createModel({ provider: 'openai', model: 'gpt-4o-mini' }),
-  tools: [helloWorldTool],
-  systemPrompt: 'You are helpful.',
-  input: 'Say hello',
+const result = await runApiDesignerAgent({
+  input: 'Design REST API for a task manager with users, projects, and tasks',
+  model: { provider: 'openai', model: 'gpt-4o-mini' },
+  maxIterations: 15,
 });
+// result.output: ApiDesign JSON with REST endpoints and/or GraphQL operations
 ```
 ---
-## Planning Pipeline
-The planning module is the centerpiece for AI coding agents. It turns a natural-language project description into a structured, implementation-ready markdown plan through four stages.
-### How it works
+### Auth Designer Agent
-```mermaid
-flowchart LR
-  Input["User Requirement"] --> Discovery["Discovery"]
-  Discovery --> Requirements["Requirements"]
-  Requirements --> Design["Design"]
-  Design --> Synthesis["Synthesis"]
-  Synthesis --> Plan["plan.md"]
-  Plan --> Validate["LLM Validator"]
-  Validate --> Output["planning: bool, plan: string"]
-```
+Designs authentication and authorization systems with strategies, flows, middleware, roles, and security policies.
-### Stages
+| Attribute         | Detail                                                                                     |
+| ----------------- | ------------------------------------------------------------------------------------------ |
+| **Pattern**       | Orchestrator with sub-agents                                                               |
+| **Sub-Agents**    | `security-analyzer` (analyzes security requirements), `flow-designer` (designs auth flows) |
+| **Tools**         | `design_auth`, `validate_auth`                                                             |
+| **Output Format** | Auth design JSON (strategy, flows, middleware, roles, policies)                            |
-| Stage            | What it produces                                   | Sections                                                            |
-| ---------------- | -------------------------------------------------- | ------------------------------------------------------------------- |
-| **Discovery**    | Understands the project, asks clarifying questions | Project overview                                                    |
-| **Requirements** | 4 sequential LLM calls to flesh out the spec       | Tech stack, feature decisions, data models, pages/routes, auth flow |
-| **Design**       | 2 sequential LLM calls for technical design        | API routes, implementation details                                  |
-| **Synthesis**    | Assembles the final plan                           | Implementation order, edge cases, testing checklist                 |
+```typescript
+import { runAuthDesignerAgent } from 'sweagent';
-### Output
+const result = await runAuthDesignerAgent({
+  input: 'JWT auth with email/password, Google OAuth, role-based access (admin, member)',
+  model: { provider: 'openai', model: 'gpt-4o-mini' },
+  maxIterations: 15,
+});
+// result.output: AuthDesign JSON with strategy, flows, middleware, roles, policies
+```
-The plan is a markdown document with these sections:
+---
-- **Overview** — project scope and goals
-- **Tech Stack** — languages, frameworks, database, auth approach
-- **Feature Decisions** — what to build and what to defer
-- **Data Models** — schemas, relationships, fields
-- **Pages and Routes** — frontend structure
-- **Authentication Flow** — auth strategy and implementation
-- **API Routes** — endpoints, methods, request/response shapes
-- **Implementation Details** — architecture decisions, file structure
-- **Execution Plan** — phased implementation order
-- **Edge Cases** — error handling, boundary conditions
-- **Testing Checklist** — what to verify at each phase
+### Backend Architect Agent
-### Two modes
+Plans backend architecture including framework selection, services, middleware, routes, and folder structure. Routes to Express Builder or Apollo Builder based on framework choice.
-**One-shot mode** — pass a requirement, get a plan:
+| Attribute         | Detail                                                                          |
+| ----------------- | ------------------------------------------------------------------------------- |
+| **Pattern**       | Orchestrator with sub-agents                                                    |
+| **Sub-Agents**    | `framework-selector`, `service-planner`                                         |
+| **Tools**         | `design_backend`, `validate_backend`                                            |
+| **Output Format** | Backend design JSON (framework, services, middleware, routes, folder structure) |
+| **Frameworks**    | Express, Apollo, or both                                                        |
 ```typescript
-import { runPlanningAgent } from 'sweagent';
+import { runBackendArchitectAgent } from 'sweagent';
-const result = await runPlanningAgent({
-  input: 'Fitness app with workouts, nutrition tracking, and social features',
-  model: { provider: 'anthropic', model: 'claude-sonnet-4-20250514' },
+const result = await runBackendArchitectAgent({
+  input: 'REST API backend with user auth, CRUD operations, and file uploads',
+  model: { provider: 'openai', model: 'gpt-4o-mini' },
+  maxIterations: 15,
 });
-console.log(result.output); // Full plan markdown
+// result.output: BackendDesign JSON with framework, services, middleware, routes
 ```
-**With validation** — run the plan through an LLM judge that checks completeness:
+---
-```typescript
-import { runPlanningWithResult } from 'sweagent';
+### Express Builder Agent
-const { planning, plan } = await runPlanningWithResult({
-  input: 'Fitness app with workouts, nutrition tracking, and social features',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-});
-// planning === true means the plan passed all validation criteria
-```
+Generates Express.js REST API configuration with routers, models, middleware, and environment variables.
-**Interactive chat mode** — multi-turn conversation where you refine the plan:
+| Attribute         | Detail                                                      |
+| ----------------- | ----------------------------------------------------------- |
+| **Pattern**       | Orchestrator with sub-agents                                |
+| **Sub-Agents**    | `route-generator`, `middleware-configurator`                |
+| **Tools**         | `generate_express`, `scaffold_express`, `validate_express`  |
+| **Output Format** | Express config JSON (routers, models, middleware, env vars) |
 ```typescript
-import { processPlanningChat } from 'sweagent';
-import type { PlanningContext } from 'sweagent';
-let context: PlanningContext | null = null;
-// Turn 1: describe the project
-const turn1 = await processPlanningChat('Build a task manager with teams', context, {
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-});
-context = turn1.context;
-console.log(turn1.message); // Assistant asks clarifying questions
-console.log(turn1.pendingQuestions); // ["What auth provider?", ...]
+import { runExpressBuilderAgent } from 'sweagent';
-// Turn 2: answer and advance
-const turn2 = await processPlanningChat('Use NextAuth with GitHub OAuth', context, {
+const result = await runExpressBuilderAgent({
+  input: 'Express API for e-commerce with products, orders, and user auth',
   model: { provider: 'openai', model: 'gpt-4o-mini' },
+  maxIterations: 15,
 });
-context = turn2.context;
-// Continue until turn.planMarkdown is set (plan complete)
+// result.output: ExpressConfig JSON with routers, models, middleware, env vars
 ```
-### Validation criteria
+---
-The LLM validator (`validatePlanForCodingAgent`) checks that the plan includes:
+### Apollo Builder Agent
-1. Clear project overview and scope
-2. Tech stack specified (languages, frameworks, database, auth)
-3. Implementation order or phased steps
-4. Concrete actionable steps (files, routes, APIs, or models)
-5. Data model, authentication, and API surface addressed
+Generates Apollo GraphQL subgraph configuration with modules, types, resolvers, datasources, and Federation v2 support.
----
+| Attribute         | Detail                                                                |
+| ----------------- | --------------------------------------------------------------------- |
+| **Pattern**       | Orchestrator with sub-agents                                          |
+| **Sub-Agents**    | `schema-generator`, `resolver-planner`                                |
+| **Tools**         | `generate_subgraph`, `scaffold_subgraph`, `validate_subgraph`         |
+| **Output Format** | Apollo subgraph config JSON (modules, types, operations, datasources) |
-## Full Pipeline
+```typescript
+import { runApolloBuilderAgent } from 'sweagent';
-Chain multiple agents together to go from a vague idea to implementation-ready specs for every layer of your stack. Each agent's output feeds the next.
-```typescript
-import {
-  runRequirementGathererAgent,
-  runDataModelerAgent,
-  runApiDesignerAgent,
-  runAuthDesignerAgent,
-  runBackendArchitectAgent,
-  runFrontendArchitectAgent,
-  runExecutionPlannerAgent,
-} from 'sweagent';
-import { writeFileSync } from 'fs';
-const model = { provider: 'openai', model: 'gpt-4o-mini' } as const;
-// Step 1: Gather structured requirements
-const requirements = await runRequirementGathererAgent({
-  input: 'Project management SaaS with teams, Kanban boards, time tracking, and billing',
-  model,
+const result = await runApolloBuilderAgent({
+  input: 'Apollo subgraph for a task manager with users, projects, and tasks',
+  model: { provider: 'openai', model: 'gpt-4o-mini' },
   maxIterations: 15,
 });
+// result.output: SubgraphConfig JSON with modules, types, operations, datasources
+```
-// Step 2: Design the data model from requirements
-const dataModel = await runDataModelerAgent({
-  input: `Design a data model based on these requirements:\n${requirements.output}`,
-  model,
-  maxIterations: 15,
-});
+---
-// Step 3: Design the API from the data model
-const apiDesign = await runApiDesignerAgent({
-  input: `Design REST API for this data model:\n${dataModel.output}`,
-  model,
-  maxIterations: 15,
-});
+### Frontend Architect Agent
-// Step 4: Design auth from the requirements and API
-const authDesign = await runAuthDesignerAgent({
-  input: `Design auth for this project:\nRequirements: ${requirements.output}\nAPI: ${apiDesign.output}`,
-  model,
-  maxIterations: 15,
-});
+Plans frontend architecture including pages, components, routing, and state management. Routes to React Builder or Next.js Builder based on framework selection.
-// Step 5: Plan backend architecture
-const backendDesign = await runBackendArchitectAgent({
-  input: `Design backend:\nData model: ${dataModel.output}\nAPI: ${apiDesign.output}\nAuth: ${authDesign.output}`,
-  model,
-  maxIterations: 15,
-});
+| Attribute         | Detail                                                              |
+| ----------------- | ------------------------------------------------------------------- |
+| **Pattern**       | Orchestrator with sub-agents                                        |
+| **Sub-Agents**    | `page-planner`, `component-analyzer`, `framework-selector`          |
+| **Output Format** | Frontend design JSON (pages, components, state management, routing) |
+| **Frameworks**    | React + Vite, Next.js                                               |
-// Step 6: Plan frontend architecture
-const frontendDesign = await runFrontendArchitectAgent({
-  input: `Design frontend:\nAPI: ${apiDesign.output}\nRequirements: ${requirements.output}`,
-  model,
+```typescript
+import { runFrontendArchitectAgent } from 'sweagent';
+const result = await runFrontendArchitectAgent({
+  input: 'Dashboard app with analytics, settings, and user management pages',
+  model: { provider: 'openai', model: 'gpt-4o-mini' },
   maxIterations: 15,
 });
+// result.output: FrontendDesign JSON with pages, components, state management, routing
+```
-// Save all specs for your coding agent
-writeFileSync('specs/requirements.json', requirements.output);
-writeFileSync('specs/data-model.json', dataModel.output);
-writeFileSync('specs/api-design.json', apiDesign.output);
-writeFileSync('specs/auth-design.json', authDesign.output);
-writeFileSync('specs/backend-design.json', backendDesign.output);
-writeFileSync('specs/frontend-design.json', frontendDesign.output);
+---
-// Now hand the specs/ directory to Cursor, Claude Code, or Codex
-// "Implement the backend using specs/backend-design.json and specs/data-model.json"
-```
+### React Builder Agent
-You can also run individual agents standalone -- each works independently with natural-language input. The pipeline approach gives you maximum control over each design decision.
+Generates complete frontend application configuration from a GraphQL schema. A `graphql-analyzer` sub-agent parses the schema structure, and a `config-validator` sub-agent verifies the output. Produces app config, modules, pages, fields, and API hooks.
----
+| Attribute         | Detail                                                                                                      |
+| ----------------- | ----------------------------------------------------------------------------------------------------------- |
+| **Pattern**       | Orchestrator with sub-agents                                                                                |
+| **Sub-Agents**    | `graphql-analyzer` (parses GraphQL schema structure), `config-validator` (validates frontend config output) |
+| **Tools**         | `generate_frontend`, `generate_feature_breakdown`, `validate_frontend_config`                               |
+| **Output Format** | React app config JSON (app, modules, pages, fields, API hooks, branding)                                    |
+| **Schemas**       | App config, User config, Page config, Field config, Branding                                                |
-## Features
+```typescript
+import { runReactBuilderAgent } from 'sweagent';
-| Feature                     | Description                                                                                                                                                                                                                                                                    |
-| --------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
-| **14 Domain Agent Modules** | Planning, Requirement Gatherer, Data Modeler, DB Designer, API Designer, Auth Designer, Backend Architect, Express Builder, Apollo Builder, Frontend Architect, React Builder, Next.js Builder, Execution Planner, and Hello World template -- each a self-contained pipeline. |
-| **Multi-Stage Pipelines**   | Every domain agent progresses through structured stages (discovery, requirements, design, synthesis) with dedicated LLM calls at each step. No single-shot prompts.                                                                                                            |
-| **Sub-Agent Orchestration** | Complex domains delegate to specialized sub-agents (`entity-analyzer`, `schema-refiner`, `graphql-analyzer`, `config-validator`) that run in isolation and return condensed results.                                                                                           |
-| **Plan Validation**         | LLM-based judge validates that planning output meets all criteria for a coding agent to start implementing.                                                                                                                                                                    |
-| **Structured Outputs**      | Requirements as typed JSON (actors, flows, stories, modules). DB schemas with field-level detail. Frontend configs with pages, hooks, and branding. Plans with 11 sections.                                                                                                    |
-| **Multi-Provider Models**   | Unified API for OpenAI, Anthropic, and Google. One `createModel()` call, zero provider lock-in.                                                                                                                                                                                |
-| **Type-Safe Tools**         | Define tools with Zod schemas; full type inference and validation before execution. Minimal, workflow-oriented tool sets.                                                                                                                                                      |
-| **Agent Framework**         | Iterative agent loop with tool calling, step callbacks, and configurable max iterations.                                                                                                                                                                                       |
-| **MCP Protocol**            | Connect to Model Context Protocol servers over HTTP or stdio. Lazy connection, typed tool invocation.                                                                                                                                                                          |
-| **Vision**                  | Image inputs via `model.generateVision()` for vision-capable models.                                                                                                                                                                                                           |
-| **Zero Extra Deps**         | All provider SDKs (OpenAI, Anthropic, Google) included. Set API keys and run.                                                                                                                                                                                                  |
+const result = await runReactBuilderAgent({
+  input: 'GraphQL schema: type User { id: ID! name: String! } type Task { ... }',
+  model: { provider: 'openai', model: 'gpt-4o-mini' },
+  maxIterations: 15,
+});
+// result.output: frontend config JSON with pages, fields, hooks, branding
+```
 ---
-## Installation
+### Next.js Builder Agent
-### Prerequisites
+Generates Next.js App Router configuration with pages, layouts, API routes, server actions, and middleware.
-- **Node.js** >= 18.0.0
-- **npm** >= 8.0.0 (or yarn, pnpm, bun)
+| Attribute         | Detail                                                                       |
+| ----------------- | ---------------------------------------------------------------------------- |
+| **Pattern**       | Orchestrator with sub-agents                                                 |
+| **Sub-Agents**    | `route-planner`, `api-route-generator`                                       |
+| **Tools**         | `generate_nextjs`, `validate_nextjs`                                         |
+| **Output Format** | Next.js config JSON (pages, layouts, API routes, server actions, middleware) |
-### Install
+```typescript
+import { runNextjsBuilderAgent } from 'sweagent';
-```bash
-npm install sweagent
+const result = await runNextjsBuilderAgent({
+  input: 'Next.js app for project management with teams, tasks, and dashboards',
+  model: { provider: 'openai', model: 'gpt-4o-mini' },
+  maxIterations: 15,
+});
+// result.output: NextjsConfig JSON with pages, layouts, API routes, server actions
 ```
-Or with yarn, pnpm, or bun:
+---
-```bash
-yarn add sweagent
-pnpm add sweagent
-bun add sweagent
-```
+### Execution Planner Agent
-All AI provider SDKs (OpenAI, Anthropic, Google) are included; no extra packages needed.
+Creates phased implementation execution plans from plan sections, with edge case analysis and testing checklists.
-### From source
+| Attribute         | Detail                                                                                  |
+| ----------------- | --------------------------------------------------------------------------------------- |
+| **Pattern**       | Orchestrator with sub-agents                                                            |
+| **Sub-Agents**    | `edge-case-analyzer`, `testing-strategist`                                              |
+| **Tools**         | `create_execution_plan`, `validate_execution_plan`                                      |
+| **Output Format** | Execution plan JSON (phases, edge cases, testing checklist, security/performance notes) |
-```bash
-git clone https://github.com/sijeeshmiziha/sweagent.git
-cd sweagent
-npm install
-```
+```typescript
+import { runExecutionPlannerAgent } from 'sweagent';
-### Environment setup
+const result = await runExecutionPlannerAgent({
+  input: 'Create execution plan for the task manager project',
+  model: { provider: 'openai', model: 'gpt-4o-mini' },
+  maxIterations: 15,
+});
+// result.output: ExecutionPlan JSON with phases, edge cases, testing checklist
+```
-Create a `.env` file in your project root:
+---
-```bash
-# At least one provider API key is required
-OPENAI_API_KEY=sk-...
-ANTHROPIC_API_KEY=sk-ant-...
-GOOGLE_GENERATIVE_AI_API_KEY=...
-```
+### Hello World (Template)
-### Verify
+Minimal example module with a single greeting tool. Use as a starting point when building your own domain agent module.
-```bash
-# If installed as a package
-echo 'import { createModel, runAgent, helloWorldTool } from "sweagent";
-const model = createModel({ provider: "openai", model: "gpt-4o-mini" });
-const result = await runAgent({ model, tools: [helloWorldTool], systemPrompt: "You are helpful.", input: "Say hello" });
-console.log(result.output);' > test.mjs
-node --env-file=.env test.mjs
+```typescript
+import { createModel, runAgent, helloWorldTool } from 'sweagent';
-# If cloned from source
-npm run example -- examples/hello-world/01-hello-world.ts
+const result = await runAgent({
+  model: createModel({ provider: 'openai', model: 'gpt-4o-mini' }),
+  tools: [helloWorldTool],
+  systemPrompt: 'You are helpful.',
+  input: 'Say hello',
+});
 ```
 ---
@@ -880,15 +960,14 @@ const result = await runAgent({
 Generate an implementation plan for a coding agent:
 ```typescript
-import { runPlanningWithResult } from 'sweagent';
+import { runPlanningAgent } from 'sweagent';
-const { planning, plan } = await runPlanningWithResult({
+const result = await runPlanningAgent({
   input: 'E-commerce site: users, products, cart, checkout, admin dashboard',
   model: { provider: 'openai', model: 'gpt-4o-mini' },
 });
-console.log('Ready for coding agent:', planning);
-console.log(plan);
+console.log(result.output); // Full markdown blueprint
 ```
 ### Level 6: MCP integration
@@ -906,730 +985,709 @@ const result = await client.callTool('tool_name', { arg: 'value' });
 ---
-## Architecture
+## Installation
-### System overview
+### Prerequisites
-```mermaid
-graph TB
-  subgraph Client[Client Application]
-    App["Your App / Cursor / Claude Code / Codex"]
-  end
+- **Node.js** >= 18.0.0
+- **npm** >= 8.0.0 (or yarn, pnpm, bun)
-  subgraph DomainAgents[Domain Agent Modules]
-    Planning["Planning"]
-    ReqGatherer["Requirement Gatherer"]
-    DataModeler["Data Modeler"]
-    DbDesigner["DB Designer"]
-    ApiDesigner["API Designer"]
-    AuthDesigner["Auth Designer"]
-    BackendArch["Backend Architect"]
-    ExpressBuilder["Express Builder"]
-    ApolloBuilder["Apollo Builder"]
-    FrontendArch["Frontend Architect"]
-    ReactBuilder["React Builder"]
-    NextjsBuilder["Next.js Builder"]
-    ExecPlanner["Execution Planner"]
-  end
+### Install
-  subgraph Framework[Shared Framework]
-    Models["Model Abstraction"]
-    ToolFW["Tool Framework"]
-    AgentLoop["Agent Loop"]
-    SubAgentOrch["Sub-Agent Orchestration"]
-    MCP["MCP Protocol"]
-  end
+```bash
+npm install sweagent
+```
-  subgraph Providers[AI Providers]
-    OpenAI["OpenAI"]
-    Anthropic["Anthropic"]
-    Google["Google"]
-  end
+Or with yarn, pnpm, or bun:
-  App --> DomainAgents
-  DomainAgents --> Framework
-  Framework --> Providers
+```bash
+yarn add sweagent
+pnpm add sweagent
+bun add sweagent
 ```
-### Domain agent pipeline flow
+All AI provider SDKs (OpenAI, Anthropic, Google) are included; no extra packages needed.
-Each domain agent follows a structured pipeline. The Planning Agent is representative:
+### From source
-```mermaid
-flowchart LR
-  Input["User Requirement"] --> Discovery["Discovery Stage"]
-  Discovery --> Requirements["Requirements Stage"]
-  Requirements --> Design["Design Stage"]
-  Design --> Synthesis["Synthesis Stage"]
-  Synthesis --> Plan["Structured Output"]
-  Plan --> Validate["LLM Validator"]
-  Validate --> Output["Validated Result"]
+```bash
+git clone https://github.com/sijeeshmiziha/sweagent.git
+cd sweagent
+npm install
 ```
-### Orchestrator with sub-agents
+### Environment setup
-Domain agents like DB Designer and React Builder delegate to specialized sub-agents:
+Create a `.env` file in your project root:
-```mermaid
-sequenceDiagram
-  participant User
-  participant Orchestrator
-  participant Model
-  participant Tools
-  participant SubAgent1 as entity-analyzer
-  participant SubAgent2 as schema-refiner
-  User->>Orchestrator: Natural-language requirement
-  Orchestrator->>Model: Messages + Domain Tools
-  Model-->>Orchestrator: Tool call design_database
-  Orchestrator->>Tools: Execute design_database
-  Tools-->>Orchestrator: Initial schema
-  Orchestrator->>SubAgent1: Analyze entities and relationships
-  SubAgent1-->>Orchestrator: Structured entity analysis
-  Orchestrator->>SubAgent2: Refine and validate schema
-  SubAgent2-->>Orchestrator: Validated schema
-  Orchestrator->>Model: Compile final output
-  Model-->>Orchestrator: Final result
-  Orchestrator-->>User: Production-grade schema
+```bash
+# At least one provider API key is required
+OPENAI_API_KEY=sk-...
+ANTHROPIC_API_KEY=sk-ant-...
+GOOGLE_GENERATIVE_AI_API_KEY=...
 ```
-### Agent execution loop
+### Verify
-```mermaid
-sequenceDiagram
-  participant User
-  participant Agent
-  participant Model
-  participant Tools
+```bash
+# If installed as a package
+echo 'import { createModel, runAgent, helloWorldTool } from "sweagent";
+const model = createModel({ provider: "openai", model: "gpt-4o-mini" });
+const result = await runAgent({ model, tools: [helloWorldTool], systemPrompt: "You are helpful.", input: "Say hello" });
+console.log(result.output);' > test.mjs
+node --env-file=.env test.mjs
-  User->>Agent: Input + Tools + System Prompt
-  loop Until Complete or Max Iterations
-    Agent->>Model: Messages + Tool Schemas
-    Model-->>Agent: Response (Text or Tool Calls)
-    alt Tool Calls Present
-      Agent->>Tools: Execute Tool Calls
-      Tools-->>Agent: Tool Results
-      Agent->>Agent: Append Results to Messages
-    else Final Answer
-      Agent-->>User: Output + Steps + Usage
-    end
-  end
+# If cloned from source
+npm run example -- examples/hello-world/01-hello-world.ts
 ```
 ---
-## Engineering Deep Dive
+## MCP Server
-### The problem: long-running coding agents
+sweagent is also a **Model Context Protocol (MCP) server**. Install it once and every MCP-compatible IDE or tool -- Cursor, VS Code, Windsurf, Claude Desktop, and more -- can call any of the 13 domain agents directly from the chat interface. No wrapper scripts, no code to write.
-Coding agents that work across many steps or sessions fail in two ways: they try to do too much in one shot (leaving partial, undocumented work), or they declare the job done too early. Each new session starts with no memory of the last. To make progress across sessions, each run needs a way to get up to speed quickly and leave a clean state for the next run.
+```mermaid
+flowchart LR
+  IDE["Cursor / VS Code / Windsurf / Claude Desktop"] -->|"MCP stdio"| Server["sweagent MCP Server"]
+  Server --> Plan["plan"]
+  Server --> Req["gather_requirements"]
+  Server --> Data["design_data_model"]
+  Server --> Api["design_api"]
+  Server --> Auth["design_auth"]
+  Server --> More["... 8 more tools"]
+```
-### Incremental progress pattern
+### Quick Start
-Each session does bounded work: one feature or one clear subtask. The agent updates a progress file and commits what it did. The next session reads progress and git history, chooses the next unfinished item, and continues. This avoids one-shotting the whole project and reduces premature completion.
+**1. Install sweagent from npm:**
-### Feature list approach
+```bash
+npm install -g sweagent
+```
-A structured list of requirements (e.g. in JSON) with a status per item gives the agent a clear definition of "done." Agents only mark items passing after verification, keeping scope explicit and making it easy to resume across context windows.
+This installs the `sweagent` command globally on your machine. Requires Node.js >= 18.
-### Clean state principle
+**2. Set your API key:**
-Every session should end with code that is buildable, documented, and easy to continue from. No half-implemented features, no stray debug code, clear commit messages or progress notes. The orchestration prompts in production modules encode these expectations.
+You need at least one AI provider API key. Export it in your shell or pass it via the IDE config (shown below):
-### Error hierarchy
+```bash
+export OPENAI_API_KEY=sk-...
+# or ANTHROPIC_API_KEY, or GOOGLE_GENERATIVE_AI_API_KEY
+```
-| Class               | When                                             |
-| ------------------- | ------------------------------------------------ |
-| **LibraryError**    | Base; all others extend it.                      |
-| **ModelError**      | Model creation or invoke failed.                 |
-| **ToolError**       | Tool not found or tool execution failed.         |
-| **ValidationError** | Zod validation failed.                           |
-| **AgentError**      | Agent reached max iterations without completing. |
-| **SubagentError**   | Subagent config or run failed.                   |
+**3. Add the config to your IDE** (pick your IDE below) **and restart.**
-All accept an optional `cause` for chaining.
+**4. Verify** -- ask the chat agent: _"Use the hello_world tool to test the sweagent server."_
-### Provider adapter pattern
+> **No global install?** You can skip step 1 and use `npx -y sweagent` instead. The IDE configs below show both options.
-Models are created via `createModel({ provider, model, ... })`. A shared AI SDK adapter wraps the Vercel AI SDK's `generateText` and normalizes messages, tool schemas, and responses. Each provider has a thin factory that passes the correct `LanguageModel` into this adapter. Provider-specific logic stays in one place; everything else is provider-agnostic.
+### How it runs
-### Tool execution safety
+The MCP server communicates over **stdio**. Your IDE starts it automatically -- you do not run these commands yourself. Under the hood, the IDE runs one of:
-Inputs are validated with Zod before any tool runs. Invalid input produces a **ToolError** with the parse error; the handler is never called with bad data. Handler errors are caught and rethrown as **ToolError** with the original error as cause. The agent loop receives structured tool results (including error payloads) so the model can see failures and retry or adjust.
+```bash
+# If you installed globally (npm install -g sweagent)
+sweagent
----
+# If you prefer npx (no install needed, downloads on first use)
+npx -y sweagent
-## API Reference
+# If you cloned the repo and built from source
+node --env-file=.env dist/stdio.js
+```
-All public APIs are exported from the main package: `import { ... } from 'sweagent'`.
+### Setup with Cursor
-### Models
+Create `.cursor/mcp.json` in your project root:
-**createModel(config)** — Create a model instance.
+**Option A -- Global install (recommended):**
-```typescript
-import { createModel } from 'sweagent';
+```json
+{
+  "mcpServers": {
+    "sweagent": {
+      "command": "sweagent",
+      "env": {
+        "OPENAI_API_KEY": "your-openai-api-key"
+      }
+    }
+  }
+}
+```
-const model = createModel({
-  provider: 'openai' | 'anthropic' | 'google',
-  model: string,            // e.g. 'gpt-4o', 'claude-sonnet-4-20250514'
-  apiKey?: string,          // Uses env var by default
-  temperature?: number,
-  maxOutputTokens?: number,
-  baseUrl?: string,
-});
+**Option B -- npx (no install needed):**
-const response = await model.invoke(messages, { tools });
-// response: { text, toolCalls, usage, finishReason }
+```json
+{
+  "mcpServers": {
+    "sweagent": {
+      "command": "npx",
+      "args": ["-y", "sweagent"],
+      "env": {
+        "OPENAI_API_KEY": "your-openai-api-key"
+      }
+    }
+  }
+}
 ```
-**Supported models (examples):**
-| Provider  | Models                                               |
-| --------- | ---------------------------------------------------- |
-| OpenAI    | `gpt-4o`, `gpt-4o-mini`, `gpt-4-turbo`               |
-| Anthropic | `claude-sonnet-4-20250514`, `claude-3-opus-20240229` |
-| Google    | `gemini-1.5-pro`, `gemini-1.5-flash`                 |
+**Option C -- From source (local clone):**
-**Vision:** `model.generateVision(prompt, images, options)` for image inputs.
+```json
+{
+  "mcpServers": {
+    "sweagent": {
+      "command": "node",
+      "args": ["--env-file=.env", "dist/stdio.js"]
+    }
+  }
+}
+```
----
+Cursor auto-discovers `.cursor/mcp.json`. After saving, restart Cursor or reload the window. The sweagent tools appear in the Cursor chat tool list.
-### Tools
+### Setup with VS Code (GitHub Copilot)
-**defineTool(config)** — Define a type-safe tool with Zod schema and handler.
+Create `.vscode/mcp.json` in your project root:
-```typescript
-import { defineTool } from 'sweagent';
-import { z } from 'zod';
+**Option A -- Global install:**
-const tool = defineTool({
-  name: 'my_tool',
-  description: 'What the tool does',
-  input: z.object({ key: z.string() }),
-  handler: async (parsed, context) => ({ result: parsed.key }),
-});
+```json
+{
+  "servers": {
+    "sweagent": {
+      "command": "sweagent",
+      "env": {
+        "OPENAI_API_KEY": "your-openai-api-key"
+      }
+    }
+  }
+}
 ```
-**createToolSet(tools)** — Build a record of tools for the agent (key = tool name).
+**Option B -- npx:**
-**getTool(toolSet, name)** / **getTools(toolSet)** — Look up one or all tools.
-**executeTool(tool, input, options)** — Run a single tool with input.
-**executeToolByName(toolSet, name, input, options)** — Run by name; throws if tool missing.
+```json
+{
+  "servers": {
+    "sweagent": {
+      "command": "npx",
+      "args": ["-y", "sweagent"],
+      "env": {
+        "OPENAI_API_KEY": "your-openai-api-key"
+      }
+    }
+  }
+}
+```
-**zodToJsonSchema(schema)** — Convert a Zod schema to JSON Schema (e.g. for MCP).
+VS Code discovers MCP servers from `.vscode/mcp.json` automatically. You can also configure servers globally via **MCP: Open User Configuration** in the Command Palette. After saving, open the Copilot chat panel -- the sweagent tools are available to the agent.
----
+### Setup with Windsurf
-### Agents
+Edit (or create) the Windsurf MCP config file at `~/.codeium/windsurf/mcp_config.json`:
-**runAgent(config)** — Run the agent loop until the model returns no tool calls or max iterations is reached.
+**Option A -- Global install:**
-```typescript
-import { runAgent } from 'sweagent';
+```json
+{
+  "mcpServers": {
+    "sweagent": {
+      "command": "sweagent",
+      "env": {
+        "OPENAI_API_KEY": "your-openai-api-key"
+      }
+    }
+  }
+}
+```
-const result = await runAgent({
-  model,
-  tools: createToolSet({ ... }),
-  systemPrompt: string,
-  input: string,
-  maxIterations?: number,   // default 10
-  onStep?: (step: AgentStep) => void,
-});
+**Option B -- npx:**
-// result: { output, steps, totalUsage, messages }
+```json
+{
+  "mcpServers": {
+    "sweagent": {
+      "command": "npx",
+      "args": ["-y", "sweagent"],
+      "env": {
+        "OPENAI_API_KEY": "your-openai-api-key"
+      }
+    }
+  }
+}
 ```
----
+Restart Windsurf after saving. The sweagent tools appear in the Cascade chat.
-### Subagents
+### Setup with Claude Desktop
-**defineSubagent(config)** — Define a subagent (name must be kebab-case).
+Edit the Claude Desktop config file:
-```typescript
-import { defineSubagent } from 'sweagent';
+- **macOS:** `~/Library/Application Support/Claude/claude_desktop_config.json`
+- **Windows:** `%APPDATA%\Claude\claude_desktop_config.json`
-const def = defineSubagent({
-  name: 'my-subagent',
-  description: 'What this subagent does',
-  systemPrompt: '...',
-  tools?: Record<string, Tool>,
-  model?: ModelConfig,
-  maxIterations?: number,
-  disallowedTools?: string[],
-  onStep?: (step) => void,
-});
+**Option A -- Global install:**
+```json
+{
+  "mcpServers": {
+    "sweagent": {
+      "command": "sweagent",
+      "env": {
+        "OPENAI_API_KEY": "your-openai-api-key"
+      }
+    }
+  }
+}
 ```
-**runSubagent(definition, input, options)** — Run the subagent in isolation.
+**Option B -- npx:**
-**createSubagentTool(definition, options)** — Expose one subagent as a tool.
+```json
+{
+  "mcpServers": {
+    "sweagent": {
+      "command": "npx",
+      "args": ["-y", "sweagent"],
+      "env": {
+        "OPENAI_API_KEY": "your-openai-api-key"
+      }
+    }
+  }
+}
+```
-**createSubagentToolSet(definitions, options)** — Build a record of subagent tools (`subagent_<name>`).
+Restart Claude Desktop after saving. The sweagent tools appear in the tool list (hammer icon) in a new conversation.
----
+### Available Tools
-### Planning
+All 13 domain agents are exposed as MCP tools:
-**runPlanningAgent(config)** — One-shot mode: single input, auto-advances through all stages, returns plan markdown.
+| Tool                  | Description                                                                                           |
+| --------------------- | ----------------------------------------------------------------------------------------------------- |
+| `plan`                | Generate a full software plan (discovery, requirements, design, synthesis) from a project description |
+| `gather_requirements` | Extract structured requirements (actors, flows, stories, modules) from a project description          |
+| `design_data_model`   | Design a database schema (MongoDB or PostgreSQL) with entities, relations, and indexes                |
+| `design_api`          | Design REST or GraphQL API contracts (endpoints, request/response schemas) from requirements          |
+| `design_auth`         | Design authentication and authorization strategy (providers, roles, permissions, flows)               |
+| `architect_backend`   | Design backend architecture (folder structure, services, middleware, deployment)                      |
+| `architect_frontend`  | Design frontend architecture (components, state management, routing, styling)                         |
+| `build_express`       | Generate Express.js REST API configuration and boilerplate from an API design                         |
+| `build_apollo`        | Generate Apollo GraphQL subgraph configuration and resolvers from an API design                       |
+| `build_react`         | Generate React + Vite application configuration and components from a GraphQL schema                  |
+| `build_nextjs`        | Generate Next.js App Router configuration and pages from requirements                                 |
+| `plan_execution`      | Create a phased execution plan with edge-case analysis and testing strategy                           |
+| `hello_world`         | Test agent that greets users -- use to verify the MCP server is working                               |
-```typescript
-import { runPlanningAgent } from 'sweagent';
+### Tool Input Parameters
-const result = await runPlanningAgent({
-  input: string,
-  model?: ModelConfig,
-  maxIterations?: number,
-  onStep?: (step: AgentStep) => void,
-  logger?: Logger,
-});
-// result: AgentResult { output, steps, totalUsage, messages }
-```
+Every tool accepts the same input shape:
-**runPlanningWithResult(config)** — Runs the planning agent then validates the output with an LLM judge.
+| Parameter     | Type                                  | Required | Description                                                                            |
+| ------------- | ------------------------------------- | -------- | -------------------------------------------------------------------------------------- |
+| `input`       | `string`                              | Yes      | Natural language description of what to build or design                                |
+| `provider`    | `"openai" \| "anthropic" \| "google"` | No       | LLM provider (defaults to `openai`)                                                    |
+| `model`       | `string`                              | No       | Model name, e.g. `gpt-4o-mini`, `claude-sonnet-4-20250514` (defaults to `gpt-4o-mini`) |
+| `temperature` | `number` (0--1)                       | No       | Sampling temperature                                                                   |
-```typescript
-import { runPlanningWithResult } from 'sweagent';
+### Verify the Server
-const result = await runPlanningWithResult({
-  input: string,
-  model?: ModelConfig,
-  logger?: Logger,
-});
-// result: { planning: boolean, plan: string }
-```
+After configuring your IDE, verify the connection by asking the chat agent:
-**processPlanningChat(userMessage, context, config)** — Multi-turn chat mode. Pass `null` context on the first turn.
+> Use the hello_world tool to verify the sweagent MCP server is working.
-```typescript
-import { processPlanningChat } from 'sweagent';
+If the server is running correctly, the agent will call the `hello_world` tool and return a greeting.
-const result = await processPlanningChat(userMessage, context, {
-  model?: ModelConfig,
-  maxIterations?: number,
-  onStep?: (step: AgentStep) => void,
-  logger?: Logger,
-});
-// result: PlanChatTurnResult { message, context, pendingQuestions, planMarkdown }
-```
+### Using Multiple Providers
-**validatePlanForCodingAgent(planMarkdown, model, logger)** — LLM-based validation. Returns `{ valid: boolean, feedback?: string }`.
+Pass `provider` and `model` to any tool call to override the default (OpenAI gpt-4o-mini). Make sure the corresponding API key is set in the `env` block of your MCP config:
-**assemblePlan(projectName, sections)** — Assemble `PlanSections` into a single markdown string.
+```json
+{
+  "mcpServers": {
+    "sweagent": {
+      "command": "npx",
+      "args": ["-y", "sweagent"],
+      "env": {
+        "OPENAI_API_KEY": "your-openai-api-key",
+        "ANTHROPIC_API_KEY": "your-anthropic-api-key",
+        "GOOGLE_GENERATIVE_AI_API_KEY": "your-google-api-key"
+      }
+    }
+  }
+}
+```
-**writePlanToFile(markdown, outputPath)** — Write plan markdown to a file.
+Then tell the agent which provider to use:
-**PlanningContextBuilder** — Fluent builder for `PlanningContext`:
+> Use the plan tool with provider "anthropic" and model "claude-sonnet-4-20250514" to plan a task manager app with teams, Kanban boards, and time tracking.
-```typescript
-import { createPlanningContextBuilder } from 'sweagent';
+### Troubleshooting MCP
-const context = createPlanningContextBuilder()
-  .withStage('requirements')
-  .withProjectDescription('Task manager app')
-  .withSections({ overview: '## Overview\n...' })
-  .build();
-```
----
+**Server not appearing in tool list** -- Restart your IDE after saving the config file. Check that the config file is in the correct location for your IDE.
-### MCP
+**API key errors** -- Make sure the API key is set in the `env` block of your MCP config, not just in a `.env` file (unless you are using the `--env-file` flag with the node command).
-**BaseMcpClient** — Base class for MCP clients. Lazy connection, `callTool(name, args)` for invocation.
+**npx timeout or failure** -- Run `npx sweagent` manually in a terminal to check for errors. Make sure Node.js >= 18 is installed.
-**BaseMcpClient.resolveConfig(options, resolveOpts)** — Build config from options and env (e.g. `MCP_URL`, `MCP_COMMAND`, `MCP_ARGS`).
+**Tool returns an error** -- The MCP server catches errors and returns them as text. Check that the `input` parameter contains a meaningful project description, not an empty string.
 ---
-### Errors
+## Architecture
-| Class               | When                                             |
-| ------------------- | ------------------------------------------------ |
-| **LibraryError**    | Base; all others extend it.                      |
-| **ModelError**      | Model creation or invoke failed.                 |
-| **ToolError**       | Tool not found or tool execution failed.         |
-| **ValidationError** | Zod validation failed.                           |
-| **AgentError**      | Agent reached max iterations without completing. |
-| **SubagentError**   | Subagent config or run failed.                   |
+### System overview
-All accept an optional `cause` for chaining.
+```mermaid
+graph TB
+  subgraph Client[Client Application]
+    App["Your App / Cursor / Claude Code / Codex"]
+  end
----
+  subgraph DomainAgents[Domain Agent Modules]
+    Planning["Planning"]
+    ReqGatherer["Requirement Gatherer"]
+    DataModeler["Data Modeler"]
+    ApiDesigner["API Designer"]
+    AuthDesigner["Auth Designer"]
+    BackendArch["Backend Architect"]
+    ExpressBuilder["Express Builder"]
+    ApolloBuilder["Apollo Builder"]
+    FrontendArch["Frontend Architect"]
+    ReactBuilder["React Builder"]
+    NextjsBuilder["Next.js Builder"]
+    ExecPlanner["Execution Planner"]
+  end
-## Domain Agent Modules
+  subgraph Framework[Shared Framework]
+    Models["Model Abstraction"]
+    ToolFW["Tool Framework"]
+    AgentLoop["Agent Loop"]
+    SubAgentOrch["Sub-Agent Orchestration"]
+    MCP["MCP Protocol"]
+  end
-Each module is a self-contained domain agent with its own orchestrator, pipeline stages, tools, sub-agents, and output format. All are exported from the main package.
+  subgraph Providers[AI Providers]
+    OpenAI["OpenAI"]
+    Anthropic["Anthropic"]
+    Google["Google"]
+  end
-### Planning
+  App --> DomainAgents
+  DomainAgents --> Framework
+  Framework --> Providers
+```
-The primary module for powering AI coding agents. Generates implementation-ready markdown plans from natural-language project descriptions.
+### Domain agent pipeline flow
-| Attribute         | Detail                                                                                                       |
-| ----------------- | ------------------------------------------------------------------------------------------------------------ |
-| **Stages**        | Discovery, Requirements (4 LLM calls), Design (2 LLM calls), Synthesis                                       |
-| **Sub-Agents**    | --                                                                                                           |
-| **Tools**         | -- (pipeline stages, not tool-based)                                                                         |
-| **Output Format** | Markdown plan (11 sections)                                                                                  |
-| **Validation**    | LLM judge checks completeness and actionability                                                              |
-| **Modes**         | One-shot (`runPlanningAgent`), validated (`runPlanningWithResult`), interactive chat (`processPlanningChat`) |
+Each domain agent follows a structured pipeline. The Planning Agent is representative:
-**Output sections:** Overview, Tech Stack, Feature Decisions, Data Models, Pages and Routes, Authentication Flow, API Routes, Implementation Details, Execution Plan, Edge Cases, Testing Checklist.
+```mermaid
+flowchart LR
+  Input["User Requirement"] --> Discovery["Discovery Stage"]
+  Discovery --> Requirements["Requirements Stage"]
+  Requirements --> Design["Design Stage"]
+  Design --> Synthesis["Synthesis Stage"]
+  Synthesis --> Plan["Structured Output"]
+  Plan --> Validate["LLM Validator"]
+  Validate --> Output["Validated Result"]
+```
-```typescript
-import { runPlanningWithResult } from 'sweagent';
+### Orchestrator with sub-agents
-const { planning, plan } = await runPlanningWithResult({
-  input: 'E-commerce: users, orders, products. Admins manage products.',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-});
-// planning: boolean -- did the plan pass validation?
-// plan: string -- full markdown blueprint
-```
+Domain agents like Data Modeler and React Builder delegate to specialized sub-agents:
-See [Planning Pipeline](#planning-pipeline) for stage-by-stage details.
+```mermaid
+sequenceDiagram
+  participant User
+  participant Orchestrator
+  participant Model
+  participant Tools
+  participant SubAgent1 as entity-analyzer
+  participant SubAgent2 as schema-refiner
----
+  User->>Orchestrator: Natural-language requirement
+  Orchestrator->>Model: Messages + Domain Tools
+  Model-->>Orchestrator: Tool call design_database
+  Orchestrator->>Tools: Execute design_database
+  Tools-->>Orchestrator: Initial schema
+  Orchestrator->>SubAgent1: Analyze entities and relationships
+  SubAgent1-->>Orchestrator: Structured entity analysis
+  Orchestrator->>SubAgent2: Refine and validate schema
+  SubAgent2-->>Orchestrator: Validated schema
+  Orchestrator->>Model: Compile final output
+  Model-->>Orchestrator: Final result
+  Orchestrator-->>User: Production-grade schema
+```
-### Requirement Gatherer
+### Agent execution loop
-Produces structured JSON requirements -- not prose. Unlike the Planning module (markdown output), the Requirement Gatherer extracts typed data that downstream systems can consume programmatically.
+```mermaid
+sequenceDiagram
+  participant User
+  participant Agent
+  participant Model
+  participant Tools
-| Attribute         | Detail                                                                                |
-| ----------------- | ------------------------------------------------------------------------------------- |
-| **Stages**        | Discovery, Requirements, Design, Synthesis                                            |
-| **Sub-Agents**    | --                                                                                    |
-| **Tools**         | Stage-specific tools                                                                  |
-| **Output Format** | Structured JSON                                                                       |
-| **Schemas**       | Actors, User Flows, User Stories, Modules, Database Design, API Design                |
-| **Modes**         | One-shot (`runRequirementGathererAgent`), interactive chat (`processRequirementChat`) |
+  User->>Agent: Input + Tools + System Prompt
+  loop Until Complete or Max Iterations
+    Agent->>Model: Messages + Tool Schemas
+    Model-->>Agent: Response (Text or Tool Calls)
+    alt Tool Calls Present
+      Agent->>Tools: Execute Tool Calls
+      Tools-->>Agent: Tool Results
+      Agent->>Agent: Append Results to Messages
+    else Final Answer
+      Agent-->>User: Output + Steps + Usage
+    end
+  end
+```
-**Output structure:** Actors (with permissions), User Flows (step-by-step sequences), User Stories (with acceptance criteria), Modules (with CRUD operations), Database Design (schemas, relationships), API Design (REST/GraphQL endpoints).
+### Engineering Deep Dive
-```typescript
-import { runRequirementGathererAgent } from 'sweagent';
+#### The problem: long-running coding agents
-const result = await runRequirementGathererAgent({
-  input: 'Project management tool with teams and Kanban boards',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
-});
-// result.output: structured JSON with actors, flows, stories, modules
-```
+Coding agents that work across many steps or sessions fail in two ways: they try to do too much in one shot (leaving partial, undocumented work), or they declare the job done too early. Each new session starts with no memory of the last. To make progress across sessions, each run needs a way to get up to speed quickly and leave a clean state for the next run.
----
+#### Incremental progress pattern
-### DB Designer
+Each session does bounded work: one feature or one clear subtask. The agent updates a progress file and commits what it did. The next session reads progress and git history, chooses the next unfinished item, and continues. This avoids one-shotting the whole project and reduces premature completion.
-An orchestrator agent that delegates to specialized sub-agents for entity analysis and schema refinement. Produces MongoDB-style project schemas with modules, fields, relationships, indexes, and validation rules.
+#### Feature list approach
-| Attribute         | Detail                                                                                                                                                                                           |
-| ----------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
-| **Pattern**       | Orchestrator with sub-agents                                                                                                                                                                     |
-| **Sub-Agents**    | `entity-analyzer` (extracts entities and relationships), `schema-refiner` (normalizes and validates schemas)                                                                                     |
-| **Tools**         | `design_database` (text requirements to schema), `design_database_pro` (structured requirements to schema), `redesign_database` (modify existing schemas), `validate_schema` (schema validation) |
-| **Output Format** | MongoDB schema JSON (modules, fields, relationships)                                                                                                                                             |
+A structured list of requirements (e.g. in JSON) with a status per item gives the agent a clear definition of "done." Agents only mark items passing after verification, keeping scope explicit and making it easy to resume across context windows.
-```typescript
-import { runDbDesignerAgent } from 'sweagent';
+#### Clean state principle
-const result = await runDbDesignerAgent({
-  input: 'E-commerce: users, orders, products. Admins manage products.',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
-});
-// result.output: MongoDB schema with modules, fields, relationships
-```
+Every session should end with code that is buildable, documented, and easy to continue from. No half-implemented features, no stray debug code, clear commit messages or progress notes. The orchestration prompts in production modules encode these expectations.
----
+### Provider adapter pattern
-### React Builder
+Models are created via `createModel({ provider, model, ... })`. A shared AI SDK adapter wraps the Vercel AI SDK's `generateText` and normalizes messages, tool schemas, and responses. Each provider has a thin factory that passes the correct `LanguageModel` into this adapter. Provider-specific logic stays in one place; everything else is provider-agnostic.
-An orchestrator agent that generates complete frontend application configuration from a GraphQL schema. Uses a `graphql-analyzer` sub-agent to parse schema structure and a `config-validator` to verify the output against frontend config schemas.
+### Tool execution safety
-| Attribute         | Detail                                                                                                                                                      |
-| ----------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------- |
-| **Pattern**       | Orchestrator with sub-agents                                                                                                                                |
-| **Sub-Agents**    | `graphql-analyzer` (parses GraphQL schema structure), `config-validator` (validates frontend config output)                                                 |
-| **Tools**         | `generate_frontend` (GraphQL to frontend config), `generate_feature_breakdown` (module/operation breakdown), `validate_frontend_config` (config validation) |
-| **Output Format** | React app config JSON (app, modules, pages, fields, API hooks, branding)                                                                                    |
-| **Schemas**       | App config, User config, Page config, Field config, Branding                                                                                                |
+Inputs are validated with Zod before any tool runs. Invalid input produces a **ToolError** with the parse error; the handler is never called with bad data. Handler errors are caught and rethrown as **ToolError** with the original error as cause. The agent loop receives structured tool results (including error payloads) so the model can see failures and retry or adjust.
-```typescript
-import { runReactBuilderAgent } from 'sweagent';
+### Error hierarchy
-const result = await runReactBuilderAgent({
-  input: 'GraphQL schema: type User { id: ID! name: String! } type Task { ... }',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
-});
-// result.output: frontend config JSON with pages, fields, hooks, branding
-```
+| Class               | When                                             |
+| ------------------- | ------------------------------------------------ |
+| **LibraryError**    | Base; all others extend it.                      |
+| **ModelError**      | Model creation or invoke failed.                 |
+| **ToolError**       | Tool not found or tool execution failed.         |
+| **ValidationError** | Zod validation failed.                           |
+| **AgentError**      | Agent reached max iterations without completing. |
+| **SubagentError**   | Subagent config or run failed.                   |
+All accept an optional `cause` for chaining.
 ---
-### Data Modeler
+## API Reference
-Designs data models for MongoDB or PostgreSQL with entities, fields, indexes, and relationships. Three sub-agents handle entity analysis, relationship mapping, and schema refinement.
+All public APIs are exported from the main package: `import { ... } from 'sweagent'`.
-| Attribute         | Detail                                                                       |
-| ----------------- | ---------------------------------------------------------------------------- |
-| **Pattern**       | Orchestrator with sub-agents                                                 |
-| **Sub-Agents**    | `entity-analyzer`, `relationship-mapper`, `schema-refiner`                   |
-| **Tools**         | `design_schema`, `design_schema_pro`, `refine_schema`, `validate_data_model` |
-| **Output Format** | Data model JSON (entities, fields, indexes, relationships)                   |
-| **Databases**     | MongoDB, PostgreSQL                                                          |
+### Models
+**createModel(config)** -- Create a model instance.
 ```typescript
-import { runDataModelerAgent } from 'sweagent';
+import { createModel } from 'sweagent';
-const result = await runDataModelerAgent({
-  input: 'SaaS platform with organizations, users, projects, and billing',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
+const model = createModel({
+  provider: 'openai' | 'anthropic' | 'google',
+  model: string,            // e.g. 'gpt-4o', 'claude-sonnet-4-20250514'
+  apiKey?: string,          // Uses env var by default
+  temperature?: number,
+  maxOutputTokens?: number,
+  baseUrl?: string,
 });
-```
----
-### API Designer
-Designs REST and/or GraphQL APIs from data models. Produces endpoint definitions with request/response contracts and auth requirements.
+const response = await model.invoke(messages, { tools });
+// response: { text, toolCalls, usage, finishReason }
+```
-| Attribute         | Detail                                                                                                  |
-| ----------------- | ------------------------------------------------------------------------------------------------------- |
-| **Pattern**       | Orchestrator with sub-agents                                                                            |
-| **Sub-Agents**    | `endpoint-analyzer` (derives endpoints from data model), `contract-designer` (designs request/response) |
-| **Tools**         | `design_api`, `design_api_pro`, `validate_api`                                                          |
-| **Output Format** | API design JSON (REST endpoints and/or GraphQL operations)                                              |
+**Supported models (examples):**
-```typescript
-import { runApiDesignerAgent } from 'sweagent';
+| Provider  | Models                                               |
+| --------- | ---------------------------------------------------- |
+| OpenAI    | `gpt-4o`, `gpt-4o-mini`, `gpt-4-turbo`               |
+| Anthropic | `claude-sonnet-4-20250514`, `claude-3-opus-20240229` |
+| Google    | `gemini-1.5-pro`, `gemini-1.5-flash`                 |
-const result = await runApiDesignerAgent({
-  input: 'Design REST API for task manager with users, projects, tasks',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
-});
-```
+**Vision:** `model.generateVision(prompt, images, options)` for image inputs.
 ---
-### Auth Designer
-Designs authentication and authorization systems with strategies, flows, middleware, roles, and security policies.
+### Tools
-| Attribute         | Detail                                                                                     |
-| ----------------- | ------------------------------------------------------------------------------------------ |
-| **Pattern**       | Orchestrator with sub-agents                                                               |
-| **Sub-Agents**    | `security-analyzer` (analyzes security requirements), `flow-designer` (designs auth flows) |
-| **Tools**         | `design_auth`, `validate_auth`                                                             |
-| **Output Format** | Auth design JSON (strategy, flows, middleware, roles, policies)                            |
+**defineTool(config)** -- Define a type-safe tool with Zod schema and handler.
 ```typescript
-import { runAuthDesignerAgent } from 'sweagent';
+import { defineTool } from 'sweagent';
+import { z } from 'zod';
-const result = await runAuthDesignerAgent({
-  input: 'JWT auth with email/password, Google OAuth, admin and member roles',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
+const tool = defineTool({
+  name: 'my_tool',
+  description: 'What the tool does',
+  input: z.object({ key: z.string() }),
+  handler: async (parsed, context) => ({ result: parsed.key }),
 });
 ```
----
-### Frontend Architect
+**createToolSet(tools)** -- Build a record of tools for the agent (key = tool name).
-Plans frontend architecture including pages, components, routing, and state management. Routes to React Builder or Next.js Builder based on framework selection.
+**getTool(toolSet, name)** / **getTools(toolSet)** -- Look up one or all tools.
-| Attribute         | Detail                                                              |
-| ----------------- | ------------------------------------------------------------------- |
-| **Pattern**       | Orchestrator with sub-agents                                        |
-| **Sub-Agents**    | `page-planner`, `component-analyzer`, `framework-selector`          |
-| **Output Format** | Frontend design JSON (pages, components, state management, routing) |
-| **Frameworks**    | React + Vite, Next.js                                               |
+**executeTool(tool, input, options)** -- Run a single tool with input.
-```typescript
-import { runFrontendArchitectAgent } from 'sweagent';
+**executeToolByName(toolSet, name, input, options)** -- Run by name; throws if tool missing.
-const result = await runFrontendArchitectAgent({
-  input: 'Dashboard with analytics, settings, and user management',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
-});
-```
+**zodToJsonSchema(schema)** -- Convert a Zod schema to JSON Schema (e.g. for MCP).
 ---
-### Backend Architect
-Plans backend architecture including framework selection, services, middleware, routes, and folder structure. Routes to Express Builder or Apollo Builder.
+### Agents
-| Attribute         | Detail                                                                          |
-| ----------------- | ------------------------------------------------------------------------------- |
-| **Pattern**       | Orchestrator with sub-agents                                                    |
-| **Sub-Agents**    | `framework-selector`, `service-planner`                                         |
-| **Tools**         | `design_backend`, `validate_backend`                                            |
-| **Output Format** | Backend design JSON (framework, services, middleware, routes, folder structure) |
-| **Frameworks**    | Express, Apollo, or both                                                        |
+**runAgent(config)** -- Run the agent loop until the model returns no tool calls or max iterations is reached.
 ```typescript
-import { runBackendArchitectAgent } from 'sweagent';
+import { runAgent } from 'sweagent';
-const result = await runBackendArchitectAgent({
-  input: 'REST API with user auth, CRUD operations, and file uploads',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
+const result = await runAgent({
+  model,
+  tools: createToolSet({ ... }),
+  systemPrompt: string,
+  input: string,
+  maxIterations?: number,   // default 10
+  onStep?: (step: AgentStep) => void,
 });
+// result: { output, steps, totalUsage, messages }
 ```
 ---
-### Express Builder
-Generates Express.js REST API configuration with routers, models, middleware, and environment variables.
+### Subagents
-| Attribute         | Detail                                                      |
-| ----------------- | ----------------------------------------------------------- |
-| **Pattern**       | Orchestrator with sub-agents                                |
-| **Sub-Agents**    | `route-generator`, `middleware-configurator`                |
-| **Tools**         | `generate_express`, `scaffold_express`, `validate_express`  |
-| **Output Format** | Express config JSON (routers, models, middleware, env vars) |
+**defineSubagent(config)** -- Define a subagent (name must be kebab-case).
 ```typescript
-import { runExpressBuilderAgent } from 'sweagent';
+import { defineSubagent } from 'sweagent';
-const result = await runExpressBuilderAgent({
-  input: 'Express API for e-commerce with products, orders, and user auth',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
+const def = defineSubagent({
+  name: 'my-subagent',
+  description: 'What this subagent does',
+  systemPrompt: '...',
+  tools?: Record<string, Tool>,
+  model?: ModelConfig,
+  maxIterations?: number,
+  disallowedTools?: string[],
+  onStep?: (step) => void,
 });
 ```
----
+**runSubagent(definition, input, options)** -- Run the subagent in isolation.
-### Apollo Builder
+**createSubagentTool(definition, options)** -- Expose one subagent as a tool.
-Generates Apollo GraphQL subgraph configuration with modules, types, resolvers, datasources, and Federation v2 support.
+**createSubagentToolSet(definitions, options)** -- Build a record of subagent tools (`subagent_<name>`).
-| Attribute         | Detail                                                                |
-| ----------------- | --------------------------------------------------------------------- |
-| **Pattern**       | Orchestrator with sub-agents                                          |
-| **Sub-Agents**    | `schema-generator`, `resolver-planner`                                |
-| **Tools**         | `generate_subgraph`, `scaffold_subgraph`, `validate_subgraph`         |
-| **Output Format** | Apollo subgraph config JSON (modules, types, operations, datasources) |
+---
+### Planning
+**runPlanningAgent(config)** -- One-shot mode: single input, auto-advances through all stages, returns plan markdown.
 ```typescript
-import { runApolloBuilderAgent } from 'sweagent';
+import { runPlanningAgent } from 'sweagent';
-const result = await runApolloBuilderAgent({
-  input: 'Apollo subgraph for task manager with users, projects, tasks',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
+const result = await runPlanningAgent({
+  input: string,
+  model?: ModelConfig,
+  maxIterations?: number,
+  onStep?: (step: AgentStep) => void,
+  logger?: Logger,
 });
+// result: AgentResult { output, steps, totalUsage, messages }
 ```
----
+**processPlanningChat(userMessage, context, config)** -- Multi-turn chat mode. Pass `null` context on the first turn.
-### Next.js Builder
+```typescript
+import { processPlanningChat } from 'sweagent';
-Generates Next.js App Router configuration with pages, layouts, API routes, server actions, and middleware.
+const result = await processPlanningChat(userMessage, context, {
+  model?: ModelConfig,
+  maxIterations?: number,
+  onStep?: (step: AgentStep) => void,
+  logger?: Logger,
+});
+// result: PlanChatTurnResult { message, context, pendingQuestions, planMarkdown }
+```
-| Attribute         | Detail                                                                       |
-| ----------------- | ---------------------------------------------------------------------------- |
-| **Pattern**       | Orchestrator with sub-agents                                                 |
-| **Sub-Agents**    | `route-planner`, `api-route-generator`                                       |
-| **Tools**         | `generate_nextjs`, `validate_nextjs`                                         |
-| **Output Format** | Next.js config JSON (pages, layouts, API routes, server actions, middleware) |
+**runPlanningFromRequirements(config)** -- Convert requirement-gatherer output to a plan, skipping redundant stages.
 ```typescript
-import { runNextjsBuilderAgent } from 'sweagent';
+import { runPlanningFromRequirements } from 'sweagent';
-const result = await runNextjsBuilderAgent({
-  input: 'Next.js app for project management with teams and dashboards',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
+const result = await runPlanningFromRequirements({
+  requirement: FinalRequirement,  // JSON from requirement-gatherer
+  model?: ModelConfig,
+  onStep?: (step: AgentStep) => void,
+  logger?: Logger,
 });
+// result: AgentResult { output (plan markdown), steps, totalUsage, messages }
 ```
----
-### Execution Planner
+**assemblePlan(projectName, sections)** -- Assemble `PlanSections` into a single markdown string.
-Creates phased implementation execution plans with edge case analysis and testing checklists.
+**writePlanToFile(markdown, outputPath)** -- Write plan markdown to a file.
-| Attribute         | Detail                                                                                  |
-| ----------------- | --------------------------------------------------------------------------------------- |
-| **Pattern**       | Orchestrator with sub-agents                                                            |
-| **Sub-Agents**    | `edge-case-analyzer`, `testing-strategist`                                              |
-| **Tools**         | `create_execution_plan`, `validate_execution_plan`                                      |
-| **Output Format** | Execution plan JSON (phases, edge cases, testing checklist, security/performance notes) |
+**PlanningContextBuilder** -- Fluent builder for `PlanningContext`:
 ```typescript
-import { runExecutionPlannerAgent } from 'sweagent';
+import { createPlanningContextBuilder } from 'sweagent';
-const result = await runExecutionPlannerAgent({
-  input: 'Create execution plan for the task manager project',
-  model: { provider: 'openai', model: 'gpt-4o-mini' },
-  maxIterations: 15,
-});
+const context = createPlanningContextBuilder()
+  .withStage('requirements')
+  .withProjectDescription('Task manager app')
+  .withSections({ overview: '## Overview\n...' })
+  .build();
 ```
 ---
-### Hello World (Template)
+### MCP
-Minimal example module with a single greeting tool. Use as a starting point when building your own domain agent module.
+sweagent ships as a standalone **MCP server** that exposes all 13 domain agents as tools. See [MCP Server](#mcp-server) for IDE setup and tool reference.
-```typescript
-import { createModel, runAgent, helloWorldTool } from 'sweagent';
+**BaseMcpClient** -- Base class for MCP clients. Lazy connection, `callTool(name, args)` for invocation.
-const result = await runAgent({
-  model: createModel({ provider: 'openai', model: 'gpt-4o-mini' }),
-  tools: [helloWorldTool],
-  systemPrompt: 'You are helpful.',
-  input: 'Say hello',
-});
-```
+**BaseMcpClient.resolveConfig(options, resolveOpts)** -- Build config from options and env (e.g. `MCP_URL`, `MCP_COMMAND`, `MCP_ARGS`).
 ---
-## Examples
-The [examples directory](./examples/README.md) contains runnable scripts organized by domain agent. Use the interactive launcher or run a file directly:
-```bash
-# Interactive launcher -- pick a domain agent, then an example
-npm run example:interactive
+### Errors
-# Run a specific domain agent example
-npm run example -- examples/planning/01-planning-agent.ts
-npm run example -- examples/db-designer/01-db-designer-agent.ts
-npm run example -- examples/react-builder/01-react-builder-agent.ts
-```
+| Class               | When                                             |
+| ------------------- | ------------------------------------------------ |
+| **LibraryError**    | Base; all others extend it.                      |
+| **ModelError**      | Model creation or invoke failed.                 |
+| **ToolError**       | Tool not found or tool execution failed.         |
+| **ValidationError** | Zod validation failed.                           |
+| **AgentError**      | Agent reached max iterations without completing. |
+| **SubagentError**   | Subagent config or run failed.                   |
-| Domain Agent             | Examples                                                                             | What it produces                                                         |
-| ------------------------ | ------------------------------------------------------------------------------------ | ------------------------------------------------------------------------ |
-| **Planning**             | 01 Planning Agent                                                                    | Implementation-ready markdown plan through 4-stage pipeline              |
-| **Requirement Gatherer** | 01 Requirement Gatherer Agent                                                        | Structured JSON requirements (actors, flows, stories, modules)           |
-| **Data Modeler**         | 01 Data Modeler Agent                                                                | MongoDB/PostgreSQL data model with entities, fields, indexes             |
-| **DB Designer**          | 01 DB Designer Agent                                                                 | MongoDB schemas via `entity-analyzer` and `schema-refiner` sub-agents    |
-| **API Designer**         | 01 API Designer Agent                                                                | REST/GraphQL API design with endpoints and contracts                     |
-| **Auth Designer**        | 01 Auth Designer Agent                                                               | Auth strategy, flows, middleware, roles, and policies                    |
-| **Backend Architect**    | 01 Backend Architect Agent                                                           | Backend architecture with framework selection and services               |
-| **Express Builder**      | 01 Express Builder Agent                                                             | Express.js config with routers, models, and middleware                   |
-| **Apollo Builder**       | 01 Apollo Builder Agent                                                              | Apollo GraphQL subgraph config with types and resolvers                  |
-| **Frontend Architect**   | 01 Frontend Architect Agent                                                          | Frontend architecture with pages, components, and routing                |
-| **React Builder**        | 01 React Builder Agent                                                               | Frontend config via `graphql-analyzer` and `config-validator` sub-agents |
-| **Next.js Builder**      | 01 Next.js Builder Agent                                                             | Next.js App Router config with pages, layouts, and API routes            |
-| **Execution Planner**    | 01 Execution Planner Agent                                                           | Phased implementation plan with edge cases and testing checklist         |
-| **Core Framework**       | 01 Basic Model, 02 All Providers, 03 Tool Calling, 04 Multi-Tool Agent, 05 Subagents | Models, tools, agent loop, sub-agent delegation                          |
-| **Hello World**          | 01 Hello World                                                                       | Minimal agent with greeting tool (module template)                       |
+All accept an optional `cause` for chaining.
 ---
-## Configuration Reference
+## Reference
 ### Environment variables
@@ -1644,21 +1702,14 @@ npm run example -- examples/react-builder/01-react-builder-agent.ts
 | `REQUIREMENT`                          | Project requirement for planning/requirement-gatherer examples |
 | `MAX_ITERATIONS`                       | Max agent iterations for examples                              |
-### ModelConfig
-`provider`, `model`, `apiKey?`, `temperature?`, `maxOutputTokens?`, `baseUrl?`
+### Config types
-### AgentConfig
+- **ModelConfig** -- `provider`, `model`, `apiKey?`, `temperature?`, `maxOutputTokens?`, `baseUrl?`
+- **AgentConfig** -- `model`, `tools`, `systemPrompt`, `input`, `maxIterations?`, `onStep?`
+- **PlanningAgentConfig** -- `input`, `model?`, `maxIterations?`, `onStep?`, `logger?`
+- **PlanFromRequirementsConfig** -- `requirement`, `model?`, `onStep?`, `logger?`
-`model`, `tools`, `systemPrompt`, `input`, `maxIterations?`, `onStep?`
-### PlanningAgentConfig
-`input`, `model?`, `maxIterations?`, `onStep?`, `logger?`
----
-## FAQ
+### FAQ
 **Which AI provider should I use?**
 All work well. Choose by existing infrastructure and pricing. The API is the same regardless of provider.
@@ -1675,34 +1726,48 @@ Target is Node.js. For browsers, proxy API calls through your backend and keep k
 **How do I add a new provider?**
 Implement a factory that returns a model conforming to the internal `Model` interface (e.g. via `createAIModel` and the provider's AI SDK binding) and register it in `createModel`.
----
-## Troubleshooting
+### Troubleshooting
-### API key errors
+**API key errors** -- Ensure the key is set: `echo $OPENAI_API_KEY` (or the relevant env var). If using `.env`, load it: `tsx --env-file=.env your-script.ts` or `node --env-file=.env your-script.js`.
-- Ensure the key is set: `echo $OPENAI_API_KEY` (or the relevant env var).
-- If using `.env`, load it: `tsx --env-file=.env your-script.ts` or `node --env-file=.env your-script.js`.
+**Model not found** -- Use the exact model id for the provider (e.g. `gpt-4o-mini`, `claude-sonnet-4-20250514`). Confirm your account has access to that model.
-### Model not found
+**Agent hits max iterations** -- Increase `maxIterations` or simplify the task. Check that tools return clear, parseable results so the model can decide the next step.
-- Use the exact model id for the provider (e.g. `gpt-4o-mini`, `claude-sonnet-4-20250514`).
-- Confirm your account has access to that model.
+**Tool not found** -- Tools must be in the same object passed to `runAgent` under the name the model uses (e.g. `createToolSet({ calculator: calculatorTool })` means the model calls `calculator`).
-### Agent hits max iterations
+---
-- Increase `maxIterations` or simplify the task.
-- Check that tools return clear, parseable results so the model can decide the next step.
+## Examples
-### Tool not found
+The [examples directory](./examples/) contains runnable scripts organized by domain agent. Use the interactive launcher or run a file directly:
-- Tools must be in the same object passed to `runAgent` under the name the model uses (e.g. `createToolSet({ calculator: calculatorTool })` means the model calls `calculator`).
+```bash
+# Interactive launcher -- pick a domain agent, then an example
+npm run example:interactive
-### Planning module returns `planning: false`
+# Run a specific domain agent example
+npm run example -- examples/planning/01-planning-agent.ts
+npm run example -- examples/data-modeler/01-data-modeler-agent.ts
+npm run example -- examples/react-builder/01-react-builder-agent.ts
+```
-- The LLM validator found missing sections. Check the `plan` string for gaps (no tech stack? no implementation order?).
-- Try a more capable model (e.g. `gpt-4o` instead of `gpt-4o-mini`).
-- Provide a more detailed project description as input.
+| Domain Agent             | Examples                                              | What it produces                                                         |
+| ------------------------ | ----------------------------------------------------- | ------------------------------------------------------------------------ |
+| **Planning**             | 01 Planning Agent                                     | Implementation-ready markdown plan through 4-stage pipeline              |
+| **Requirement Gatherer** | 01 Requirement Gatherer Agent                         | Structured JSON requirements (actors, flows, stories, modules)           |
+| **Data Modeler**         | 01 Data Modeler Agent                                 | MongoDB/PostgreSQL data model with entities, fields, indexes             |
+| **API Designer**         | 01 API Designer Agent                                 | REST/GraphQL API design with endpoints and contracts                     |
+| **Auth Designer**        | 01 Auth Designer Agent                                | Auth strategy, flows, middleware, roles, and policies                    |
+| **Backend Architect**    | 01 Backend Architect Agent                            | Backend architecture with framework selection and services               |
+| **Express Builder**      | 01 Express Builder Agent                              | Express.js config with routers, models, and middleware                   |
+| **Apollo Builder**       | 01 Apollo Builder Agent                               | Apollo GraphQL subgraph config with types and resolvers                  |
+| **Frontend Architect**   | 01 Frontend Architect Agent                           | Frontend architecture with pages, components, and routing                |
+| **React Builder**        | 01 React Builder Agent                                | Frontend config via `graphql-analyzer` and `config-validator` sub-agents |
+| **Next.js Builder**      | 01 Next.js Builder Agent                              | Next.js App Router config with pages, layouts, and API routes            |
+| **Execution Planner**    | 01 Execution Planner Agent                            | Phased implementation plan with edge cases and testing checklist         |
+| **Core Framework**       | 01-05: Model, Providers, Tools, Multi-Tool, Subagents | Models, tools, agent loop, sub-agent delegation                          |
+| **Hello World**          | 01 Hello World                                        | Minimal agent with greeting tool (module template)                       |
 ---
@@ -1735,11 +1800,11 @@ npm run build
 **Support**
-- [GitHub Issues](https://github.com/sijeeshmiziha/sweagent/issues) — Bugs and features
-- [GitHub Discussions](https://github.com/sijeeshmiziha/sweagent/discussions) — Questions
+- [GitHub Issues](https://github.com/sijeeshmiziha/sweagent/issues) -- Bugs and features
+- [GitHub Discussions](https://github.com/sijeeshmiziha/sweagent/discussions) -- Questions
 ---
 ## License
-MIT License — see [LICENSE](LICENSE) for details.
+MIT License -- see [LICENSE](LICENSE) for details.