e2e-ai 1.4.2 → 1.5.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -505,25 +505,21 @@ E2E_AI_API_KEY=key-... # API key for push command
505
505
 
506
506
  ## AI Agents
507
507
 
508
- Eight specialized agents live in `agents/*.md`. Each has:
509
- - **YAML frontmatter**: model, max_tokens, temperature
510
- - **System prompt**: role + context
511
- - **Input/Output schemas**: what the agent receives and must produce
512
- - **Rules**: numbered constraints (e.g. "never remove assertions")
513
- - **Examples**: concrete input/output pairs
514
-
515
- | Agent | Input | Output | Used by |
516
- |-------|-------|--------|---------|
517
- | `transcript-agent` | codegen + transcript JSON | Structured narrative with intent mapping | `scenario` |
518
- | `scenario-agent` | narrative + issue context | YAML test scenario | `scenario` |
519
- | `playwright-generator-agent` | scenario + project context | `.test.ts` file | `generate` |
520
- | `refactor-agent` | test + project context | Improved test file | `refine` |
521
- | `self-healing-agent` | failing test + error output | Diagnosis + patched test | `heal` |
522
- | `qa-testcase-agent` | test + scenario + issue data | QA markdown + test case JSON | `qa` |
523
- | `feature-analyzer-agent` | AST scan result | Features, workflows, components JSON | `analyze` |
524
- | `scenario-planner-agent` | Features + workflows | Complete QA map with scenarios JSON | `analyze` |
525
-
526
- You can customize agent behavior by editing the `.md` files directly. The frontmatter `model` field is the default model for that agent (overridable via `--model` or `config.llm.agentModels`).
508
+ Nine specialized agents live in `agents/*.md`, numbered by pipeline order. Each has a system prompt, input/output schemas, rules, and examples.
509
+
510
+ | # | File | Input | Output | Used by |
511
+ |---|------|-------|--------|---------|
512
+ | 0 | `0.init-agent` | Codebase scan | `.e2e-ai/context.md` | `init` (AI chat) |
513
+ | 1.1 | `1_1.transcript-agent` | codegen + transcript JSON | Structured narrative with intent mapping | `scenario` |
514
+ | 1.2 | `1_2.scenario-agent` | narrative + issue context | YAML test scenario | `scenario` |
515
+ | 2 | `2.playwright-generator-agent` | scenario + project context | `.test.ts` file | `generate` |
516
+ | 3 | `3.refactor-agent` | test + project context | Improved test file | `refine` |
517
+ | 4 | `4.self-healing-agent` | failing test + error output | Diagnosis + patched test | `heal` |
518
+ | 5 | `5.qa-testcase-agent` | test + scenario + issue data | QA markdown + test case JSON | `qa` |
519
+ | 6.1 | `6_1.feature-analyzer-agent` | AST scan result | Features, workflows, components JSON | `analyze` |
520
+ | 6.2 | `6_2.scenario-planner-agent` | Features + workflows | Complete QA map with scenarios JSON | `analyze` |
521
+
522
+ Agents are loaded by bare name (e.g., `loadAgent('scenario-agent')`); the numbered prefix is resolved automatically. You can customize agent behavior by editing the `.md` files in `.e2e-ai/agents/`.
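As an illustration of how bare-name loading resolves the numbered prefix, here is a minimal sketch mirroring the suffix match in the bundled `resolveAgentFile` (file names are taken from the agent table above; the standalone function below is illustrative, not the package's exported API):

```typescript
// A bare agent name like "scenario-agent" matches a file such as
// "1_2.scenario-agent.md": exact match is tried first, then any file
// ending in ".<agentName>.md" (i.e., with a numbered prefix).
const agentFiles = [
  "0.init-agent.md",
  "1_1.transcript-agent.md",
  "1_2.scenario-agent.md",
  "2.playwright-generator-agent.md",
];

function resolveAgentFile(files: string[], agentName: string): string | null {
  const exact = `${agentName}.md`;
  if (files.includes(exact)) return exact;
  const suffix = `.${agentName}.md`;
  return files.find((f) => f.endsWith(suffix)) ?? null;
}

console.log(resolveAgentFile(agentFiles, "scenario-agent")); // "1_2.scenario-agent.md"
console.log(resolveAgentFile(agentFiles, "init-agent"));     // "0.init-agent.md"
```

A local override in `.e2e-ai/agents/` wins over the packaged copy, so renaming or renumbering a local file does not break lookups as long as the bare-name suffix is preserved.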
527
523
 
528
524
  ## Output Directory Structure
529
525
 
@@ -598,6 +594,16 @@ If e2e-ai is installed globally or as a project dependency, you can use the bina
598
594
 
599
595
  ### Available Tools
600
596
 
597
+ #### Orchestration (workflow automation)
598
+
599
+ | Tool | Description | Input |
600
+ |------|-------------|-------|
601
+ | `e2e_ai_plan_workflow` | Plan an automation workflow — returns an ordered todo list of steps | `goal`, `key?`, `from?`, `skip?`, `voice?`, `trace?`, `scanDir?` |
602
+ | `e2e_ai_execute_step` | Execute a single pipeline step | `step`, `key?`, `voice?`, `trace?`, `scanDir?`, `output?`, `extraArgs?` |
603
+ | `e2e_ai_get_workflow_guide` | Get the full workflow guide explaining how the pipeline works | (none) |
604
+
605
+ #### Project setup
606
+
601
607
  | Tool | Description | Input |
602
608
  |------|-------------|-------|
603
609
  | `e2e_ai_scan_codebase` | Scan project for test files, configs, fixtures, path aliases, and sample test content | `projectRoot?` (defaults to cwd) |
@@ -605,16 +611,35 @@ If e2e-ai is installed globally or as a project dependency, you can use the bina
605
611
  | `e2e_ai_read_agent` | Load an agent prompt by name — returns system prompt + config | `agentName` (e.g. `scenario-agent`) |
606
612
  | `e2e_ai_get_example` | Get the example context markdown template | (none) |
607
613
 
608
- ### Usage with AI Assistants
609
-
610
- Once configured, an AI assistant can:
611
-
612
- 1. **Scan your project** to understand its test structure, fixtures, and conventions
613
- 2. **Read agent prompts** to understand how each pipeline step works
614
- 3. **Validate context files** to ensure they have the right format before running commands
615
- 4. **Get the example template** as a starting point for writing `e2e-ai.context.md`
616
-
617
- This enables AI assistants to help you set up e2e-ai, debug pipeline issues, and generate better project context files.
614
+ ### How AI Orchestration Works
615
+
616
+ The MCP server includes built-in orchestration instructions that teach AI assistants (Claude Code, Cursor, etc.) how to run e2e-ai workflows autonomously. The protocol is:
617
+
618
+ 1. **Plan.** The AI calls `e2e_ai_plan_workflow` with your goal. It returns an ordered step list.
619
+ 2. **Approve.** The AI presents the plan to you for review. You can adjust steps before proceeding.
620
+ 3. **Execute.** The AI runs each step one at a time via `e2e_ai_execute_step`, reporting results between steps. If a step fails, it stops and asks you how to proceed.
621
+
622
+ Each step is executed as a separate job (ideally a subagent) to keep context clean. The AI never runs multiple pipeline steps at once.
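The plan, approve, execute loop can be sketched as the sequence of tool calls an assistant would issue. Tool names and argument keys come from the tool tables in this README; the payload wrapper shapes below are illustrative assumptions, not the server's actual wire format:

```typescript
// Hypothetical request payloads for the two orchestration tools.
interface PlanArgs { goal: string; key?: string; skip?: string[]; voice?: boolean }
interface StepArgs { step: string; key?: string; voice?: boolean }

const planCall: { tool: string; args: PlanArgs } = {
  tool: "e2e_ai_plan_workflow",
  args: { goal: "Run the full test pipeline", key: "PROJ-101", voice: false },
};

// After user approval: one e2e_ai_execute_step call per step, strictly
// in order (voice disabled here, so the transcribe step is absent).
const approvedSteps = ["record", "scenario", "generate", "refine", "test", "heal", "qa"];
const stepCalls: { tool: string; args: StepArgs }[] = approvedSteps.map((step) => ({
  tool: "e2e_ai_execute_step",
  args: { step, key: "PROJ-101", voice: false },
}));

console.log(`${planCall.tool} -> ${stepCalls.length} execute_step calls`);
```

The one-call-per-step shape is what lets an orchestrator dispatch each step to a fresh subagent: every `StepArgs` payload carries the full context that step needs.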
623
+
624
+ **Example interaction:**
625
+
626
+ > **You:** "Run the full test pipeline for PROJ-101"
627
+ >
628
+ > **AI:** *Calls `e2e_ai_plan_workflow`*, then presents:
629
+ > 1. `record` — Launch browser codegen + voice recording
630
+ > 2. `transcribe` — Transcribe voice via Whisper
631
+ > 3. `scenario` — Generate YAML test scenario
632
+ > 4. `generate` — Generate Playwright test
633
+ > 5. `refine` — Refactor test with AI
634
+ > 6. `test` — Run Playwright test
635
+ > 7. `heal` — Self-heal if failing (can skip if test passes)
636
+ > 8. `qa` — Generate QA documentation
637
+ >
638
+ > "Does this look right? Ready to start?"
639
+ >
640
+ > **You:** "Skip voice, go ahead"
641
+ >
642
+ > **AI:** *Removes transcribe, executes each step sequentially*
618
643
 
619
644
  ## Library API
620
645
 
@@ -0,0 +1,117 @@
1
+ import {
2
+ getPackageRoot,
3
+ getProjectRoot
4
+ } from "./cli-kx32qnf3.js";
5
+
6
+ // src/agents/loadAgent.ts
7
+ import { readFileSync, existsSync, readdirSync } from "node:fs";
8
+ import { join } from "node:path";
9
+ function resolveAgentFile(dir, agentName) {
10
+ const exact = join(dir, `${agentName}.md`);
11
+ if (existsSync(exact))
12
+ return exact;
13
+ try {
14
+ const files = readdirSync(dir);
15
+ const suffix = `.${agentName}.md`;
16
+ const match = files.find((f) => f.endsWith(suffix));
17
+ if (match)
18
+ return join(dir, match);
19
+ } catch {}
20
+ return null;
21
+ }
22
+ function loadAgent(agentName, config) {
23
+ const localDir = join(getProjectRoot(), ".e2e-ai", "agents");
24
+ const packageDir = join(getPackageRoot(), "agents");
25
+ const filePath = resolveAgentFile(localDir, agentName) ?? resolveAgentFile(packageDir, agentName);
26
+ if (!filePath) {
27
+ throw new Error(`Agent file not found for "${agentName}" in ${localDir} or ${packageDir}`);
28
+ }
29
+ let content;
30
+ try {
31
+ content = readFileSync(filePath, "utf-8");
32
+ } catch {
33
+ throw new Error(`Agent file not readable: ${filePath}`);
34
+ }
35
+ const { frontmatter, body } = parseFrontmatter(content);
36
+ const agentConfig = extractConfig(frontmatter);
37
+ let systemPrompt = body;
38
+ if (config) {
39
+ const contextPath = join(getProjectRoot(), ".e2e-ai", "context.md");
40
+ if (existsSync(contextPath)) {
41
+ const projectContext = readFileSync(contextPath, "utf-8").trim();
42
+ if (projectContext) {
43
+ systemPrompt = `${body}
44
+
45
+ ## Project Context
46
+
47
+ ${projectContext}`;
48
+ }
49
+ }
50
+ if (config.llm.agentModels[agentName]) {
51
+ agentConfig.model = config.llm.agentModels[agentName];
52
+ }
53
+ }
54
+ const sections = parseSections(body);
55
+ return {
56
+ name: frontmatter.agent ?? agentName,
57
+ systemPrompt,
58
+ inputSchema: sections["Input Schema"],
59
+ outputSchema: sections["Output Schema"],
60
+ rules: sections["Rules"],
61
+ example: sections["Example"],
62
+ config: agentConfig
63
+ };
64
+ }
65
+ function parseFrontmatter(content) {
66
+ const match = content.match(/^---\n([\s\S]*?)\n---\n([\s\S]*)$/);
67
+ if (!match)
68
+ return { frontmatter: {}, body: content };
69
+ const frontmatter = {};
70
+ for (const line of match[1].split(`
71
+ `)) {
72
+ const colonIdx = line.indexOf(":");
73
+ if (colonIdx === -1)
74
+ continue;
75
+ const key = line.slice(0, colonIdx).trim();
76
+ let value = line.slice(colonIdx + 1).trim();
77
+ if (value.startsWith('"') && value.endsWith('"'))
78
+ value = value.slice(1, -1);
79
+ if (value === "true")
80
+ value = true;
81
+ if (value === "false")
82
+ value = false;
83
+ if (!isNaN(Number(value)) && value !== "")
84
+ value = Number(value);
85
+ frontmatter[key] = value;
86
+ }
87
+ return { frontmatter, body: match[2] };
88
+ }
89
+ function extractConfig(frontmatter) {
90
+ return {
91
+ model: frontmatter.model,
92
+ maxTokens: frontmatter.max_tokens ?? 4096,
93
+ temperature: frontmatter.temperature ?? 0.2
94
+ };
95
+ }
96
+ function parseSections(body) {
97
+ const sections = {};
98
+ const headingRegex = /^##\s+(.+)$/gm;
99
+ const headings = [];
100
+ let match;
101
+ while ((match = headingRegex.exec(body)) !== null) {
102
+ headings.push({ title: match[1].trim(), index: match.index });
103
+ }
104
+ const systemMatch = body.match(/^#\s+System Prompt\n([\s\S]*?)(?=\n##\s|$)/m);
105
+ if (systemMatch) {
106
+ sections["System Prompt"] = systemMatch[1].trim();
107
+ }
108
+ for (let i = 0;i < headings.length; i++) {
109
+ const start = headings[i].index + body.slice(headings[i].index).indexOf(`
110
+ `) + 1;
111
+ const end = i + 1 < headings.length ? headings[i + 1].index : body.length;
112
+ sections[headings[i].title] = body.slice(start, end).trim();
113
+ }
114
+ return sections;
115
+ }
116
+
117
+ export { loadAgent };
package/dist/cli.js CHANGED
@@ -1,7 +1,7 @@
1
1
  #!/usr/bin/env node
2
2
  import {
3
3
  loadAgent
4
- } from "./cli-98db6h2q.js";
4
+ } from "./cli-hjczkpxm.js";
5
5
  import {
6
6
  getPackageRoot,
7
7
  getProjectRoot,
@@ -9057,33 +9057,35 @@ function registerInit(program2) {
9057
9057
  program2.command("init").description("Initialize e2e-ai configuration for your project").option("--non-interactive", "Skip interactive prompts, use defaults").action(async (cmdOpts) => {
9058
9058
  const projectRoot = getProjectRoot();
9059
9059
  const e2eDir = join13(projectRoot, ".e2e-ai");
9060
- header("e2e-ai init");
9061
- const answers = cmdOpts?.nonInteractive ? getDefaultAnswers() : await askConfigQuestions();
9062
- const config = buildConfigFromAnswers(answers);
9060
+ const nonInteractive = !!cmdOpts?.nonInteractive;
9063
9061
  const configPath = join13(e2eDir, "config.ts");
9064
- if (fileExists(configPath)) {
9065
- warn(`Config already exists: ${configPath}`);
9066
- const overwrite = cmdOpts?.nonInteractive ? false : await dist_default4({ message: "Overwrite existing config?", default: false });
9067
- if (!overwrite) {
9068
- info("Skipping config generation");
9069
- } else {
9070
- writeFile(configPath, generateConfigFile(config));
9071
- success(`Config written: ${configPath}`);
9072
- }
9062
+ const isReInit = fileExists(configPath);
9063
+ header("e2e-ai init");
9064
+ if (isReInit) {
9065
+ info(`Existing .e2e-ai/ detected — preserving config and context.
9066
+ `);
9067
+ await copyAgentsToLocal(projectRoot, nonInteractive);
9068
+ await copyWorkflowGuide(projectRoot, nonInteractive);
9073
9069
  } else {
9070
+ const answers = nonInteractive ? getDefaultAnswers() : await askConfigQuestions();
9071
+ const config = buildConfigFromAnswers(answers);
9074
9072
  writeFile(configPath, generateConfigFile(config));
9075
9073
  success(`Config written: ${configPath}`);
9074
+ await copyAgentsToLocal(projectRoot, nonInteractive);
9075
+ await copyWorkflowGuide(projectRoot, nonInteractive);
9076
9076
  }
9077
- await copyAgentsToLocal(projectRoot, !!cmdOpts?.nonInteractive);
9078
- copyWorkflowGuide(projectRoot);
9079
9077
  console.log("");
9080
9078
  success(`Initialization complete!
9081
9079
  `);
9082
- console.log(import_picocolors2.default.bold("Next steps:"));
9083
- console.log(` 1. Use the ${import_picocolors2.default.cyan("init-agent")} in your AI tool to generate ${import_picocolors2.default.cyan(".e2e-ai/context.md")}`);
9084
- console.log(` (or use the MCP server: ${import_picocolors2.default.cyan("e2e_ai_scan_codebase")} + ${import_picocolors2.default.cyan("e2e_ai_read_agent")})`);
9085
- console.log(` 2. Review the generated ${import_picocolors2.default.cyan(".e2e-ai/context.md")}`);
9086
- console.log(` 3. Run: ${import_picocolors2.default.cyan("e2e-ai run --key PROJ-101")}`);
9080
+ if (!isReInit) {
9081
+ console.log(import_picocolors2.default.bold("Next steps:"));
9082
+ console.log(` 1. Use the ${import_picocolors2.default.cyan("init-agent")} in your AI tool to generate ${import_picocolors2.default.cyan(".e2e-ai/context.md")}`);
9083
+ console.log(` (or use the MCP server: ${import_picocolors2.default.cyan("e2e_ai_scan_codebase")} + ${import_picocolors2.default.cyan("e2e_ai_read_agent")})`);
9084
+ console.log(` 2. Review the generated ${import_picocolors2.default.cyan(".e2e-ai/context.md")}`);
9085
+ console.log(` 3. Run: ${import_picocolors2.default.cyan("e2e-ai run --key PROJ-101")}`);
9086
+ } else {
9087
+ console.log(import_picocolors2.default.dim("Config and context.md were preserved. Only agents and workflow were checked."));
9088
+ }
9087
9089
  });
9088
9090
  }
9089
9091
  function getDefaultAnswers() {
@@ -9199,8 +9201,8 @@ async function copyAgentsToLocal(projectRoot, nonInteractive) {
9199
9201
  return 0;
9200
9202
  }
9201
9203
  const overwrite = await dist_default4({
9202
- message: `Agent files already exist in .e2e-ai/agents/ (${existingFiles.length} files). Overwrite?`,
9203
- default: false
9204
+ message: `Update agents to latest version? (${agentFiles.length} files, currently ${existingFiles.length} in .e2e-ai/agents/)`,
9205
+ default: true
9204
9206
  });
9205
9207
  if (!overwrite) {
9206
9208
  info("Skipping agent copy");
@@ -9216,14 +9218,26 @@ async function copyAgentsToLocal(projectRoot, nonInteractive) {
9216
9218
  success(`Agents copied to .e2e-ai/agents/ (${agentFiles.length} files)`);
9217
9219
  return agentFiles.length;
9218
9220
  }
9219
- function copyWorkflowGuide(projectRoot) {
9221
+ async function copyWorkflowGuide(projectRoot, nonInteractive) {
9220
9222
  const packageRoot = getPackageRoot();
9221
9223
  const source = join13(packageRoot, "templates", "workflow.md");
9222
9224
  const target = join13(projectRoot, ".e2e-ai", "workflow.md");
9223
9225
  if (!existsSync2(source))
9224
9226
  return;
9225
- if (existsSync2(target))
9226
- return;
9227
+ if (existsSync2(target)) {
9228
+ if (nonInteractive) {
9229
+ info("Workflow guide already exists, skipping");
9230
+ return;
9231
+ }
9232
+ const overwrite = await dist_default4({
9233
+ message: "Update workflow.md to latest version?",
9234
+ default: true
9235
+ });
9236
+ if (!overwrite) {
9237
+ info("Skipping workflow guide update");
9238
+ return;
9239
+ }
9240
+ }
9227
9241
  const content = readFileSync2(source, "utf-8");
9228
9242
  writeFile(target, content);
9229
9243
  success("Workflow guide written to .e2e-ai/workflow.md");
package/dist/mcp.js CHANGED
@@ -1,7 +1,7 @@
1
1
  #!/usr/bin/env node
2
2
  import {
3
3
  loadAgent
4
- } from "./cli-98db6h2q.js";
4
+ } from "./cli-hjczkpxm.js";
5
5
  import {
6
6
  getPackageRoot
7
7
  } from "./cli-kx32qnf3.js";
@@ -14856,7 +14856,8 @@ class StdioServerTransport {
14856
14856
  }
14857
14857
 
14858
14858
  // src/mcp.ts
14859
- import { readFileSync as readFileSync2 } from "node:fs";
14859
+ import { execSync } from "node:child_process";
14860
+ import { existsSync as existsSync2, readFileSync as readFileSync2 } from "node:fs";
14860
14861
  import { join as join2 } from "node:path";
14861
14862
 
14862
14863
  // src/utils/scan.ts
@@ -14957,13 +14958,364 @@ function validateContext(content) {
14957
14958
  }
14958
14959
 
14959
14960
  // src/mcp.ts
14960
- var server = new McpServer({
14961
- name: "e2e-ai",
14962
- version: "1.1.2"
14963
- });
14961
+ var SERVER_INSTRUCTIONS = `
14962
+ # e2e-ai — Orchestration Guide
14963
+
14964
+ You have access to e2e-ai, an AI-powered E2E test automation tool. Follow this protocol when the user asks you to perform any e2e-ai automation.
14965
+
14966
+ ## Core Principle: Plan → Approve → Execute Step-by-Step
14967
+
14968
+ NEVER run multiple pipeline steps at once. Each step is a separate job with its own context.
14969
+
14970
+ ## Protocol
14971
+
14972
+ 1. **Plan first.** Call \`e2e_ai_plan_workflow\` with the user's goal. This returns a structured todo list of steps.
14973
+ 2. **Check prerequisites.** The plan includes a \`ready\` boolean and \`missingPrerequisites\` array. If \`ready\` is false, show the user what's missing (API keys, config, etc.) and **wait for them to fix it** before proceeding. Do NOT attempt to execute any step while prerequisites are missing.
14974
+ 3. **Present the plan.** Show the user the ordered step list with descriptions. Ask for confirmation or adjustments before proceeding.
14975
+ 4. **Execute one step at a time.** For each step in the approved plan:
14976
+ a. Tell the user which step you're about to run and why.
14977
+ b. Call \`e2e_ai_execute_step\` with the step name and parameters.
14978
+ c. Report the result to the user (success, key output, any warnings).
14979
+ d. If the step fails, stop and discuss with the user before continuing.
14980
+ e. Move to the next step only after the current one succeeds.
14981
+ 5. **Use subagents when available.** If your AI platform supports subagents (e.g., Claude Code Agent tool), dispatch each step as a dedicated subagent to preserve context. Each subagent should:
14982
+ - Receive only the context it needs (step name, key, relevant file paths)
14983
+ - Call \`e2e_ai_execute_step\` to do its work
14984
+ - Return the result to the orchestrator
14985
+
14986
+ ## Step Dependencies
14987
+
14988
+ Steps produce artifacts that feed into later steps. The pipeline handles this automatically — each step picks up where the previous one left off. Do not skip steps unless the plan says a step can be skipped.
14989
+
14990
+ ## Interactive Steps
14991
+
14992
+ The \`record\` step opens a browser and requires user interaction. When the plan includes \`record\`:
14993
+ - Tell the user they need to interact with the browser window
14994
+ - The step will block until they close the codegen window
14995
+ - After recording completes, proceed with the next step
14996
+
14997
+ ## When Things Fail
14998
+
14999
+ - If \`test\` fails and \`heal\` is in the plan, that's expected — heal will attempt to fix it
15000
+ - If \`heal\` exhausts all retries, stop and show the user the last error output
15001
+ - For any other failure, stop and ask the user how to proceed
15002
+
15003
+ ## Available Workflows
15004
+
15005
+ - **Full test pipeline**: record → transcribe → scenario → generate → refine → test → heal → qa
15006
+ - **From existing recording**: transcribe → scenario → generate → refine → test → heal → qa
15007
+ - **AI-only (no recording)**: scenario → generate → refine → test → heal → qa
15008
+ - **Generate from scenario**: generate → refine → test → heal → qa
15009
+ - **Test + heal loop**: test → heal
15010
+ - **Scanner pipeline**: scan → analyze → push
15011
+ - **Single step**: any individual command
15012
+
15013
+ Always use \`e2e_ai_plan_workflow\` to determine the right steps — don't guess.
15014
+ `.trim();
15015
+ var TEST_PIPELINE_STEPS = [
15016
+ {
15017
+ name: "record",
15018
+ description: "Launch Playwright codegen in the browser. Optionally records voice narration for richer test scenarios.",
15019
+ produces: "codegen .ts file + optional .wav voice recording",
15020
+ requires: "none",
15021
+ interactive: true
15022
+ },
15023
+ {
15024
+ name: "transcribe",
15025
+ description: "Transcribe the voice recording via OpenAI Whisper. Merges timestamped voice comments into the codegen file.",
15026
+ produces: "transcript JSON + annotated codegen file",
15027
+ requires: "voice recording from record step",
15028
+ interactive: false,
15029
+ canSkip: "No voice recording exists or voice is disabled"
15030
+ },
15031
+ {
15032
+ name: "scenario",
15033
+ description: "AI analyzes the codegen + transcript and generates a structured YAML test scenario with semantic steps and expected results.",
15034
+ produces: "YAML test scenario file",
15035
+ requires: "codegen file (+ optional transcript)",
15036
+ interactive: false
15037
+ },
15038
+ {
15039
+ name: "generate",
15040
+ description: "AI converts the YAML scenario into a complete Playwright .test.ts file using project conventions from context.md.",
15041
+ produces: "Playwright .test.ts file",
15042
+ requires: "YAML scenario file",
15043
+ interactive: false
15044
+ },
15045
+ {
15046
+ name: "refine",
15047
+ description: "AI refactors the test: replaces raw selectors with semantic alternatives, adds proper timeouts, uses project helpers.",
15048
+ produces: "improved .test.ts file (in-place)",
15049
+ requires: "Playwright .test.ts file",
15050
+ interactive: false
15051
+ },
15052
+ {
15053
+ name: "test",
15054
+ description: "Run the Playwright test with trace/video/screenshot capture. Reports pass/fail status.",
15055
+ produces: "test results + trace files",
15056
+ requires: "Playwright .test.ts file",
15057
+ interactive: false
15058
+ },
15059
+ {
15060
+ name: "heal",
15061
+ description: "If the test failed, AI diagnoses the failure and patches the test. Retries up to 3 times with different strategies.",
15062
+ produces: "patched .test.ts file (if test was failing)",
15063
+ requires: "failing test + error output",
15064
+ interactive: false,
15065
+ canSkip: "Test already passes"
15066
+ },
15067
+ {
15068
+ name: "qa",
15069
+ description: "Generate formal QA documentation: markdown test case with preconditions, steps table, and optional Zephyr XML export.",
15070
+ produces: "QA markdown + optional Zephyr XML",
15071
+ requires: "Playwright .test.ts file + scenario",
15072
+ interactive: false
15073
+ }
15074
+ ];
15075
+ var SCANNER_PIPELINE_STEPS = [
15076
+ {
15077
+ name: "scan",
15078
+ description: "Scan the codebase AST: extract routes, components, hooks, imports, and dependency graph.",
15079
+ produces: "ast-scan.json with full codebase structure",
15080
+ requires: "none",
15081
+ interactive: false
15082
+ },
15083
+ {
15084
+ name: "analyze",
15085
+ description: "AI analyzes the AST scan to identify features, workflows, components, and generate test scenarios.",
15086
+ produces: "qa-map.json with features, workflows, scenarios",
15087
+ requires: "ast-scan.json from scan step",
15088
+ interactive: false
15089
+ },
15090
+ {
15091
+ name: "push",
15092
+ description: "Push the QA map to a remote API endpoint for integration with external tools.",
15093
+ produces: "push confirmation with version info",
15094
+ requires: "qa-map.json from analyze step + API config",
15095
+ interactive: false
15096
+ }
15097
+ ];
15098
+ var ALL_STEPS = [...TEST_PIPELINE_STEPS, ...SCANNER_PIPELINE_STEPS];
15099
+ var STEP_REQUIREMENTS = {
15100
+ record: { envVars: [] },
15101
+ transcribe: { envVars: [{ name: "OPENAI_API_KEY", reason: "Whisper transcription requires OpenAI API key" }] },
15102
+ scenario: { envVars: [
15103
+ { name: "OPENAI_API_KEY", reason: "LLM calls require OpenAI API key", onlyIf: () => getProvider() === "openai" },
15104
+ { name: "ANTHROPIC_API_KEY", reason: "LLM calls require Anthropic API key", onlyIf: () => getProvider() === "anthropic" }
15105
+ ] },
15106
+ generate: { envVars: [
15107
+ { name: "OPENAI_API_KEY", reason: "LLM calls require OpenAI API key", onlyIf: () => getProvider() === "openai" },
15108
+ { name: "ANTHROPIC_API_KEY", reason: "LLM calls require Anthropic API key", onlyIf: () => getProvider() === "anthropic" }
15109
+ ] },
15110
+ refine: { envVars: [
15111
+ { name: "OPENAI_API_KEY", reason: "LLM calls require OpenAI API key", onlyIf: () => getProvider() === "openai" },
15112
+ { name: "ANTHROPIC_API_KEY", reason: "LLM calls require Anthropic API key", onlyIf: () => getProvider() === "anthropic" }
15113
+ ] },
15114
+ test: { envVars: [] },
15115
+ heal: { envVars: [
15116
+ { name: "OPENAI_API_KEY", reason: "LLM calls require OpenAI API key", onlyIf: () => getProvider() === "openai" },
15117
+ { name: "ANTHROPIC_API_KEY", reason: "LLM calls require Anthropic API key", onlyIf: () => getProvider() === "anthropic" }
15118
+ ] },
15119
+ qa: { envVars: [
15120
+ { name: "OPENAI_API_KEY", reason: "LLM calls require OpenAI API key", onlyIf: () => getProvider() === "openai" },
15121
+ { name: "ANTHROPIC_API_KEY", reason: "LLM calls require Anthropic API key", onlyIf: () => getProvider() === "anthropic" }
15122
+ ] },
15123
+ scan: { envVars: [] },
15124
+ analyze: { envVars: [
15125
+ { name: "OPENAI_API_KEY", reason: "LLM calls require OpenAI API key", onlyIf: () => getProvider() === "openai" },
15126
+ { name: "ANTHROPIC_API_KEY", reason: "LLM calls require Anthropic API key", onlyIf: () => getProvider() === "anthropic" }
15127
+ ] },
15128
+ push: { envVars: [
15129
+ { name: "E2E_AI_API_URL", reason: "Push requires API URL (set E2E_AI_API_URL or push.apiUrl in config)" },
15130
+ { name: "E2E_AI_API_KEY", reason: "Push requires API key (set E2E_AI_API_KEY or push.apiKey in config)" }
15131
+ ] }
15132
+ };
15133
+ function getProvider() {
15134
+ return process.env.AI_PROVIDER ?? "openai";
15135
+ }
15136
+ function checkPrerequisites(stepNames) {
15137
+ const issueMap = new Map;
15138
+ for (const stepName of stepNames) {
15139
+ const reqs = STEP_REQUIREMENTS[stepName];
15140
+ if (!reqs)
15141
+ continue;
15142
+ for (const envReq of reqs.envVars) {
15143
+ if (envReq.onlyIf && !envReq.onlyIf())
15144
+ continue;
15145
+ if (!process.env[envReq.name]) {
15146
+ const key = `env:${envReq.name}`;
15147
+ if (issueMap.has(key)) {
15148
+ issueMap.get(key).stepsAffected.push(stepName);
15149
+ } else {
15150
+ issueMap.set(key, {
15151
+ type: "env_var",
15152
+ name: envReq.name,
15153
+ reason: envReq.reason,
15154
+ stepsAffected: [stepName]
15155
+ });
15156
+ }
15157
+ }
15158
+ }
15159
+ if (reqs.files) {
15160
+ for (const fileReq of reqs.files) {
15161
+ if (!existsSync2(fileReq.path)) {
15162
+ const key = `file:${fileReq.path}`;
15163
+ if (issueMap.has(key)) {
15164
+ issueMap.get(key).stepsAffected.push(stepName);
15165
+ } else {
15166
+ issueMap.set(key, {
15167
+ type: "file",
15168
+ name: fileReq.label,
15169
+ reason: `File not found: ${fileReq.path}`,
15170
+ stepsAffected: [stepName]
15171
+ });
15172
+ }
15173
+ }
15174
+ }
15175
+ }
15176
+ }
15177
+ const missing = Array.from(issueMap.values());
15178
+ return { ready: missing.length === 0, missing };
15179
+ }
15180
+ function planWorkflow(goal, options) {
15181
+ const goalLower = goal.toLowerCase();
15182
+ const notes = [];
15183
+ const isScannerGoal = /\b(scan|analyze|qa.?map|feature.?analy|push.?qa|codebase.?scan)\b/.test(goalLower);
15184
+ const isSingleStep = ALL_STEPS.some((s) => goalLower === s.name || goalLower === `run ${s.name}`);
15185
+ let stepDefs;
15186
+ if (isScannerGoal && !isSingleStep) {
15187
+ stepDefs = [...SCANNER_PIPELINE_STEPS];
15188
+ if (!/\bpush\b/.test(goalLower)) {
15189
+ stepDefs = stepDefs.filter((s) => s.name !== "push");
15190
+ notes.push("Push step excluded — add it if you want to upload the QA map to a remote API.");
15191
+ }
15192
+ } else if (isSingleStep) {
15193
+ const stepName = ALL_STEPS.find((s) => goalLower.includes(s.name)).name;
15194
+ stepDefs = ALL_STEPS.filter((s) => s.name === stepName);
15195
+ } else {
15196
+ stepDefs = [...TEST_PIPELINE_STEPS];
15197
+ if (options.from) {
15198
+ const fromIdx = stepDefs.findIndex((s) => s.name === options.from);
15199
+ if (fromIdx > 0) {
15200
+ const skipped = stepDefs.slice(0, fromIdx).map((s) => s.name);
15201
+ stepDefs = stepDefs.slice(fromIdx);
15202
+ notes.push(`Starting from "${options.from}" — skipping: ${skipped.join(", ")}`);
15203
+ }
15204
+ } else {
15205
+ if (/\b(from recording|existing recording|already recorded)\b/.test(goalLower)) {
15206
+ stepDefs = stepDefs.filter((s) => s.name !== "record");
15207
+ notes.push("Skipping record — using existing recording files.");
15208
+ }
15209
+ if (/\b(from scenario|existing scenario|manual scenario|yaml)\b/.test(goalLower)) {
15210
+ stepDefs = stepDefs.filter((s) => !["record", "transcribe", "scenario"].includes(s.name));
15211
+ notes.push("Starting from generate — using existing scenario YAML.");
15212
+ }
15213
+ if (/\b(generate.?only|just.?generate|no.?record)\b/.test(goalLower)) {
15214
+ stepDefs = stepDefs.filter((s) => !["record", "transcribe"].includes(s.name));
15215
+ }
15216
+ if (/\b(test.?and.?heal|test.?heal|heal.?loop|fix.?test|self.?heal)\b/.test(goalLower)) {
15217
+ stepDefs = stepDefs.filter((s) => ["test", "heal"].includes(s.name));
+ }
+ if (/\b(refine|refactor)\b/.test(goalLower) && !/\brun\b/.test(goalLower)) {
+ stepDefs = stepDefs.filter((s) => s.name === "refine");
+ }
+ if (/\bqa\b/.test(goalLower) && /\b(doc|only|generate)\b/.test(goalLower)) {
+ stepDefs = stepDefs.filter((s) => s.name === "qa");
+ }
+ }
+ }
+ if (options.skip?.length) {
+ stepDefs = stepDefs.filter((s) => !options.skip.includes(s.name));
+ notes.push(`Skipping: ${options.skip.join(", ")}`);
+ }
+ if (options.voice === false) {
+ stepDefs = stepDefs.filter((s) => s.name !== "transcribe");
+ notes.push("Voice disabled — transcribe step removed.");
+ }
+ const cliBase = "e2e-ai";
+ const steps = stepDefs.map((s, i) => {
+ const args = [s.name];
+ if (options.key && !["scan", "analyze", "push"].includes(s.name)) {
+ args.push("--key", options.key);
+ }
+ if (s.name === "record") {
+ if (options.voice === false)
+ args.push("--no-voice");
+ if (options.trace === false)
+ args.push("--no-trace");
+ }
+ if (s.name === "scan" && options.scanDir) {
+ args.push("--scan-dir", options.scanDir);
+ }
+ return {
+ order: i + 1,
+ name: s.name,
+ description: s.description,
+ command: `${cliBase} ${args.join(" ")}`,
+ produces: s.produces,
+ interactive: s.interactive,
+ canSkip: s.canSkip
+ };
+ });
+ const pipeline2 = isScannerGoal ? "scanner" : isSingleStep ? "single" : "test";
+ if (!options.key && pipeline2 === "test" && steps.length > 1) {
+ notes.push("No --key provided. Use --key <ISSUE-KEY> to organize files by issue.");
+ }
+ const prereqs = checkPrerequisites(steps.map((s) => s.name));
+ return { goal, pipeline: pipeline2, ready: prereqs.ready, missingPrerequisites: prereqs.missing, steps, notes };
+ }
+ function executeStep(stepName, options) {
+ const args = [stepName];
+ if (options.key && !["scan", "analyze", "push"].includes(stepName)) {
+ args.push("--key", options.key);
+ }
+ if (stepName === "record") {
+ if (options.voice === false)
+ args.push("--no-voice");
+ if (options.trace === false)
+ args.push("--no-trace");
+ }
+ if (stepName === "scan" && options.scanDir) {
+ args.push("--scan-dir", options.scanDir);
+ }
+ if (options.output) {
+ args.push("--output", options.output);
+ }
+ if (options.extraArgs?.length) {
+ args.push(...options.extraArgs);
+ }
+ const pkgRoot = getPackageRoot();
+ const cliBin = join2(pkgRoot, "dist", "cli.js");
+ const command = `node ${cliBin} ${args.join(" ")}`;
+ try {
+ const stdout = execSync(command, {
+ cwd: process.cwd(),
+ encoding: "utf-8",
+ timeout: 300000,
+ env: { ...process.env },
+ stdio: ["pipe", "pipe", "pipe"]
+ });
+ return { success: true, output: stdout, command };
+ } catch (err) {
+ const stderr = err.stderr?.toString() ?? "";
+ const stdout = err.stdout?.toString() ?? "";
+ return {
+ success: false,
+ output: `EXIT CODE: ${err.status ?? "unknown"}
+
+ STDOUT:
+ ${stdout}
+
+ STDERR:
+ ${stderr}`,
+ command
+ };
+ }
+ }
+ var server = new McpServer({ name: "e2e-ai", version: "1.5.0" }, { instructions: SERVER_INSTRUCTIONS });
  server.registerTool("e2e_ai_scan_codebase", {
  title: "Scan Codebase",
- description: "Scan a project directory for test files, configs, fixtures, path aliases, and sample test content",
+ description: "Scan a project directory for test files, configs, fixtures, path aliases, and sample test content. Use this during project setup or to understand test infrastructure.",
  inputSchema: exports_external.object({
  projectRoot: exports_external.string().optional().describe("Project root directory (defaults to cwd)")
  })
@@ -14976,7 +15328,7 @@ server.registerTool("e2e_ai_scan_codebase", {
  });
  server.registerTool("e2e_ai_validate_context", {
  title: "Validate Context",
- description: "Validate that a context markdown file contains all required sections",
+ description: "Validate that a context markdown file contains all required sections (Application, Test Infrastructure, Feature Methods, Import Conventions, Selector Conventions, Test Structure Template, Utility Patterns).",
  inputSchema: exports_external.object({
  content: exports_external.string().describe("The markdown content of the context file to validate")
  })
@@ -14988,7 +15340,7 @@ server.registerTool("e2e_ai_validate_context", {
  });
  server.registerTool("e2e_ai_read_agent", {
  title: "Read Agent",
- description: "Read an agent prompt definition by name. Returns the agent name, system prompt, and config (model, max_tokens, temperature).",
+ description: "Read an agent prompt definition by name. Returns the agent system prompt and config. Agents: transcript-agent, scenario-agent, playwright-generator-agent, refactor-agent, self-healing-agent, qa-testcase-agent, feature-analyzer-agent, scenario-planner-agent, init-agent.",
  inputSchema: exports_external.object({
  agentName: exports_external.string().describe("Agent name (e.g. scenario-agent, playwright-generator-agent)")
  })
@@ -15014,7 +15366,7 @@ server.registerTool("e2e_ai_read_agent", {
  });
  server.registerTool("e2e_ai_get_example", {
  title: "Get Example Context",
- description: "Returns the full example context markdown file that shows the expected format for .e2e-ai/context.md",
+ description: "Returns the full example context markdown file that shows the expected format for .e2e-ai/context.md.",
  inputSchema: exports_external.object({})
  }, async () => {
  try {
@@ -15030,6 +15382,108 @@ server.registerTool("e2e_ai_get_example", {
  };
  }
  });
+ server.registerTool("e2e_ai_plan_workflow", {
+ title: "Plan Workflow",
+ description: "Plan an e2e-ai automation workflow. Call this FIRST when the user asks to run any automation. " + "Returns an ordered list of steps (todo list) that should be executed one at a time. " + "Present the plan to the user for approval before executing any step.",
+ inputSchema: exports_external.object({
+ goal: exports_external.string().describe('What the user wants to achieve. Examples: "run full pipeline for PROJ-101", ' + '"generate test from existing recording", "scan codebase and analyze features", ' + '"heal failing test PROJ-101", "refactor test PROJ-101"'),
+ key: exports_external.string().optional().describe("Issue key (e.g. PROJ-101, LIN-42)"),
+ from: exports_external.string().optional().describe("Start from a specific step (skip all prior steps)"),
+ skip: exports_external.array(exports_external.string()).optional().describe('Steps to skip (e.g. ["transcribe", "heal"])'),
+ voice: exports_external.boolean().optional().describe("Enable voice recording (default: true)"),
+ trace: exports_external.boolean().optional().describe("Enable trace capture (default: true)"),
+ scanDir: exports_external.string().optional().describe("Directory to scan (for scanner pipeline)")
+ })
+ }, async ({ goal, key, from, skip, voice, trace, scanDir }) => {
+ const plan = planWorkflow(goal, { key, from, skip, voice, trace, scanDir });
+ return {
+ content: [{
+ type: "text",
+ text: JSON.stringify(plan, null, 2)
+ }]
+ };
+ });
+ server.registerTool("e2e_ai_execute_step", {
+ title: "Execute Pipeline Step",
+ description: "Execute a single e2e-ai pipeline step. Call this ONE STEP AT A TIME from an approved plan. " + "Each step produces artifacts consumed by later steps. " + "If your AI platform supports subagents, run each step in a dedicated subagent to preserve context. " + 'The "record" step is interactive and will open a browser window — the user must interact with it.',
+ inputSchema: exports_external.object({
+ step: exports_external.string().describe("Step name: record, transcribe, scenario, generate, refine, test, heal, qa, scan, analyze, push"),
+ key: exports_external.string().optional().describe("Issue key (e.g. PROJ-101)"),
+ voice: exports_external.boolean().optional().describe("Enable voice recording (record step only)"),
+ trace: exports_external.boolean().optional().describe("Enable trace capture (record step only)"),
+ scanDir: exports_external.string().optional().describe("Directory to scan (scan step only)"),
+ output: exports_external.string().optional().describe("Custom output path (scan/analyze steps)"),
+ extraArgs: exports_external.array(exports_external.string()).optional().describe("Additional CLI arguments")
+ })
+ }, async ({ step, key, voice, trace, scanDir, output, extraArgs }) => {
+ const validSteps = ALL_STEPS.map((s) => s.name);
+ if (!validSteps.includes(step)) {
+ return {
+ content: [{
+ type: "text",
+ text: `Error: Unknown step "${step}". Valid steps: ${validSteps.join(", ")}`
+ }],
+ isError: true
+ };
+ }
+ const prereqs = checkPrerequisites([step]);
+ if (!prereqs.ready) {
+ const lines = prereqs.missing.map((m) => `- ${m.type === "env_var" ? `Set ${m.name}` : m.name}: ${m.reason}`);
+ return {
+ content: [{
+ type: "text",
+ text: JSON.stringify({
+ step,
+ success: false,
+ blocked: true,
+ missingPrerequisites: prereqs.missing,
+ message: `Cannot run "${step}" — missing prerequisites:
+ ${lines.join(`
+ `)}
+
+ Ask the user to provide these before retrying.`
+ }, null, 2)
+ }],
+ isError: true
+ };
+ }
+ const result = executeStep(step, { key, voice, trace, scanDir, output, extraArgs });
+ return {
+ content: [{
+ type: "text",
+ text: JSON.stringify({
+ step,
+ success: result.success,
+ command: result.command,
+ output: result.output
+ }, null, 2)
+ }]
+ };
+ });
+ server.registerTool("e2e_ai_get_workflow_guide", {
+ title: "Get Workflow Guide",
+ description: "Returns the e2e-ai workflow guide explaining how the pipeline works, step by step. Useful for understanding what each step does and how they connect.",
+ inputSchema: exports_external.object({})
+ }, async () => {
+ try {
+ const guidePath = join2(getPackageRoot(), "templates", "workflow.md");
+ if (!existsSync2(guidePath)) {
+ return {
+ content: [{ type: "text", text: "Error: workflow.md not found in templates" }],
+ isError: true
+ };
+ }
+ const content = readFileSync2(guidePath, "utf-8");
+ return {
+ content: [{ type: "text", text: content }]
+ };
+ } catch (err) {
+ return {
+ content: [{ type: "text", text: `Error: ${err.message}` }],
+ isError: true
+ };
+ }
+ });
  async function main() {
  const transport = new StdioServerTransport;
  await server.connect(transport);
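The new `planWorkflow` helper selects pipeline steps by matching regexes against the user's stated goal, then applies the `skip` and `voice` options. A minimal standalone sketch of that selection logic — simplified from the bundled source, with `filterSteps` as a hypothetical name and the step list reduced to the core test pipeline:

```javascript
// Simplified sketch of planWorkflow's goal-based step filtering.
// The regex rules mirror the bundled code; everything else is illustrative.
const ALL_STEPS = ["record", "transcribe", "scenario", "generate", "refine", "test", "heal", "qa"];

function filterSteps(goal, options = {}) {
  const goalLower = goal.toLowerCase();
  let steps = [...ALL_STEPS];
  // "refactor test PROJ-101" narrows the plan to the refine step alone,
  // unless the goal also says "run" (which implies the full pipeline).
  if (/\b(refine|refactor)\b/.test(goalLower) && !/\brun\b/.test(goalLower)) {
    steps = steps.filter((s) => s === "refine");
  }
  // Explicitly skipped steps are dropped from whatever remains.
  if (options.skip?.length) {
    steps = steps.filter((s) => !options.skip.includes(s));
  }
  // Disabling voice removes the transcribe step, since there is no audio.
  if (options.voice === false) {
    steps = steps.filter((s) => s !== "transcribe");
  }
  return steps;
}

console.log(filterSteps("refactor test PROJ-101")); // single-step plan: refine only
console.log(filterSteps("run full pipeline", { voice: false })); // full pipeline minus transcribe
```

The real implementation layers more goal patterns (qa-only, scanner pipeline, `--from` offsets) on the same filter-then-map pattern before turning each surviving step into a CLI command string.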
package/package.json CHANGED
@@ -1,7 +1,7 @@
  {
  "name": "e2e-ai",
  "description": "AI-powered test automation pipeline — record, transcribe, generate, heal and ship Playwright tests from a single CLI",
- "version": "1.4.2",
+ "version": "1.5.0",
  "private": false,
  "type": "module",
  "bin": {
@@ -14,6 +14,10 @@ record → transcribe → scenario → generate → refine → test → heal →
 
  **In short:** You record yourself testing in the browser (optionally narrating what you're doing), and e2e-ai turns that into a production-ready Playwright test with QA documentation.
 
+ **Two ways to run it:**
+ - **CLI**: Run commands directly (`e2e-ai run --key PROJ-101`)
+ - **AI assistant**: Ask your AI tool (Claude Code, Cursor, etc.) — the MCP server guides it through the pipeline step by step, asking for your approval before starting
+
  ---
 
  ## Setup
@@ -198,6 +202,27 @@ This is independent from the test pipeline — use it to get an overview of your
 
  ---
 
+ ## AI-Assisted Workflow (MCP)
+
+ If you have the e2e-ai MCP server configured, you can ask your AI assistant to run the pipeline for you. The MCP server teaches the AI how to orchestrate the workflow:
+
+ 1. **You say:** "Run the full test pipeline for PROJ-101" (or any variation)
+ 2. **AI plans:** Calls `e2e_ai_plan_workflow` → gets an ordered step list
+ 3. **AI shows plan:** Presents the steps and asks for your approval
+ 4. **You adjust:** "Skip voice" / "Start from generate" / "Looks good, go"
+ 5. **AI executes:** Runs each step one at a time via `e2e_ai_execute_step`, reporting results between steps
+
+ Each step runs as a separate subagent (when supported by the AI platform) to keep context clean and focused. If a step fails, the AI stops and asks you what to do.
+
+ **Example prompts you can give your AI assistant:**
+ - "Run the full pipeline for PROJ-101"
+ - "Generate a test from the existing recording for PROJ-101, skip voice"
+ - "Just run test and heal for PROJ-101"
+ - "Scan the codebase and analyze features"
+ - "Refactor the test for PROJ-101"
+
+ ---
+
  ## File Structure
 
  After running the pipeline for `PROJ-101`:
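Between the plan and execute calls above, the AI works from the JSON object that `e2e_ai_plan_workflow` returns. Reconstructed from the `planWorkflow` return statement in the bundled server, the plan has roughly this shape — the field names come from the source, but every value below is an illustrative example, not real output:

```javascript
// Illustrative plan object; field names match planWorkflow's return value,
// values are made up for the example.
const examplePlan = {
  goal: "run full pipeline for PROJ-101",
  pipeline: "test",             // "scanner" | "single" | "test"
  ready: true,                  // false when checkPrerequisites reports gaps
  missingPrerequisites: [],     // populated with what is missing and why
  steps: [{
    order: 1,
    name: "record",
    description: "Record a browser session",  // illustrative text
    command: "e2e-ai record --key PROJ-101",
    produces: ["codegen"],                    // illustrative artifact list
    interactive: true,                        // record opens a browser window
    canSkip: false
  }],
  notes: ["Skipping: heal"]     // human-readable plan adjustments
};
console.log(examplePlan.steps[0].command);
```

The AI presents `steps` as the todo list, and the `interactive` and `canSkip` flags tell it which steps need the user at the keyboard and which can be dropped on request.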
@@ -207,7 +232,16 @@ After running the pipeline for `PROJ-101`:
  config.ts ← your configuration
  context.md ← project context (teach AI your conventions)
  workflow.md ← this file
- agents/ ← AI agent prompts (customizable)
+ agents/ ← AI agent prompts (numbered by pipeline order)
+ 0.init-agent.md
+ 1_1.transcript-agent.md
+ 1_2.scenario-agent.md
+ 2.playwright-generator-agent.md
+ 3.refactor-agent.md
+ 4.self-healing-agent.md
+ 5.qa-testcase-agent.md
+ 6_1.feature-analyzer-agent.md
+ 6_2.scenario-planner-agent.md
  PROJ-101/ ← working files (codegen, recordings)
 
  e2e/
File without changes