npm - testdriverai - Versions diffs - 7.2.73 → 7.2.75 - Mend

testdriverai 7.2.73 → 7.2.75

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (10)

package/ai/agents/{test-writer.md → testdriver.md} +111 -85
package/interfaces/cli/commands/init.js +70 -95
package/interfaces/cli/commands/setup.js +6 -6
package/mcp-server/dist/mcp-app.html +2 -2
package/mcp-server/dist/provision-types.d.ts +1 -1
package/mcp-server/dist/provision-types.js +2 -2
package/mcp-server/dist/server.mjs +25 -49
package/package.json +1 -1
package/sdk.d.ts +16 -1
package/sdk.js +19 -2

package/ai/agents/{test-writer.md → testdriver.md} RENAMED

@@ -1,13 +1,16 @@
 ---
+name: testdriver
 description: An expert at creating and refining automated tests using TestDriver.ai
-capabilities:
-  [
-    "create tests",
-    "refine tests",
-    "debug tests",
-    "use MCP workflow",
-    "visual verification",
-  ]
+tools:
+mcp-servers:
+  testdriver:
+    command: npx
+    args:
+      - -p
+      - testdriverai@beta
+      - testdriverai-mcp
+    env:
+      TD_API_KEY: ${TD_API_KEY}
 ---
 # TestDriver Expert
@@ -35,11 +38,12 @@ Use this agent when the user asks to:
 ### Workflow
 1. **Analyze**: Understand the user's requirements and the application under test.
-2. **Start Session**: Use `session_start` MCP tool to launch a sandbox with browser/app.
-3. **Interact**: Use MCP tools (`find`, `click`, `type`, etc.) - each returns a screenshot showing the result.
-4. **Verify**: Use `check` after actions and `assert` for test conditions.
-5. **Commit**: Use `commit` to write recorded commands to a test file.
-6. **Verify Test**: Use `verify` to run the generated test from scratch.
+2. **Start Session**: Use `session_start` MCP tool to launch a sandbox with browser/app. Specify `testFile` to track where code should be written.
+3. **Interact**: Use MCP tools (`find`, `click`, `type`, etc.) - each returns a screenshot AND generated code.
+4. **⚠️ WRITE CODE IMMEDIATELY**: After EVERY successful action, append the generated code to the test file RIGHT AWAY. Do NOT wait until the end.
+5. **Verify Actions**: Use `check` after actions to verify they succeeded (for YOUR understanding only).
+6. **Add Assertions**: Use `assert` for test conditions that should be in the final test file.
+7. **⚠️ RUN THE TEST YOURSELF**: Use `npx vitest run <testFile>` to run the test - do NOT tell the user to run it. Iterate until it passes.
 ## Prerequisites
@@ -108,12 +112,16 @@ describe("My Test Suite", () => {
     await testdriver.provision.chrome({
       url: "https://example.com",
     });
+    await testdriver.screenshot(); // Capture initial page state
     // Find elements and interact
     const button = await testdriver.find("Sign In button");
+    await testdriver.screenshot(); // Capture before click
     await button.click();
+    await testdriver.screenshot(); // Capture after click
     // Assert using natural language
+    await testdriver.screenshot(); // Capture before assertion
     const result = await testdriver.assert("the dashboard is visible");
     expect(result).toBeTruthy();
   });
@@ -177,9 +185,9 @@ await element.mouseUp(); // release mouse
 element.found(); // check if found (boolean)
 ```
-### Screenshots
+### Screenshots for Debugging
-Use `screenshot()` **only when the user explicitly asks** to see what the screen looks like. Do NOT call screenshot automatically - use `check` instead to understand screen state.
+**Use `screenshot()` liberally throughout your tests** to capture the screen state at key moments. This makes debugging much easier when tests fail - you can see exactly what the screen looked like at each step.
 ```javascript
 // Capture a screenshot - saved to .testdriver/screenshots/<test-file>/
@@ -190,6 +198,14 @@ console.log("Screenshot saved to:", screenshotPath);
 await testdriver.screenshot(1, false, true);
 ```
+**When to add screenshots:**
+- After provisioning (initial page load)
+- Before and after clicking important elements
+- After typing text into fields
+- Before assertions (to see what the AI is evaluating)
+- After any action that changes the page state
+- When debugging a flaky or failing test
 **Screenshot file organization:**
 ```
@@ -210,107 +226,106 @@ await testdriver.screenshot(1, false, true);
 ### Key Advantages
 - **No need to restart** - continue from current state
-- **Automatic command recording** - successful commands are logged
-- **Code generation** - convert recorded commands to test files
+- **Generated code with every action** - each tool returns the code to add to your test
 - **Use `check` to verify** - understand screen state without explicit screenshots
+### ⚠️ CRITICAL: Write Code Immediately & Run Tests Yourself
+**Every MCP tool response includes "ACTION REQUIRED: Append this code..." - you MUST write that code to the test file IMMEDIATELY before proceeding to the next action.**
+**When ready to validate, RUN THE TEST YOURSELF using `npx vitest run`. Do NOT tell the user to run it.**
 ### Step 1: Start a Session
 ```
-session_start({ type: "chrome", url: "https://your-app.com/login" })
+session_start({ type: "chrome", url: "https://your-app.com/login", testFile: "tests/login.test.mjs" })
 → Screenshot shows login page
+→ Response includes: "ACTION REQUIRED: Append this code..."
+→ ⚠️ IMMEDIATELY write to tests/login.test.mjs:
+   await testdriver.provision.chrome({ url: "https://your-app.com/login" });
+   await testdriver.screenshot(); // Capture initial page state
 ```
 This provisions a sandbox with Chrome and navigates to your URL. You'll see a screenshot of the initial page.
 ### Step 2: Interact with the App
-Find elements and interact with them:
+Find elements and interact with them. **Write code to file after EACH action, including screenshots for debugging:**
 ```
-find({ description: "email input field" })
-→ Returns: screenshot with element highlighted, coordinates, and a ref ID
-click({ elementRef: "el-123456" })
-→ Returns: screenshot with click marker
+find_and_click({ description: "email input field" })
+→ Returns: screenshot with element highlighted
+→ ⚠️ IMMEDIATELY append to test file:
+   await testdriver.find("email input field").click();
+   await testdriver.screenshot(); // Capture after click
 type({ text: "user@example.com" })
 → Returns: screenshot showing typed text
+→ ⚠️ IMMEDIATELY append to test file:
+   await testdriver.type("user@example.com");
+   await testdriver.screenshot(); // Capture after typing
 ```
-Or combine find + click in one step:
-```
-find_and_click({ description: "Sign In button" })
-```
-### Step 3: Verify Actions Succeeded
+### Step 3: Verify Actions Succeeded (For Your Understanding)
-After each action, use `check` to verify it worked:
+After actions, use `check` to verify they worked. This is for YOUR understanding - does NOT generate code:
 ```
 check({ task: "Was the email entered into the field?" })
 → Returns: AI analysis comparing previous screenshot to current state
 ```
-### Step 4: Add Assertions
+### Step 4: Add Assertions (Generates Code)
-Use `assert` for pass/fail conditions that get recorded in test files:
+Use `assert` for pass/fail conditions. This DOES generate code for the test file:
 ```
 assert({ assertion: "the dashboard is visible" })
 → Returns: pass/fail with screenshot
+→ ⚠️ IMMEDIATELY append to test file:
+   await testdriver.screenshot(); // Capture before assertion
+   const assertResult = await testdriver.assert("the dashboard is visible");
+   expect(assertResult).toBeTruthy();
 ```
-### Step 5: Commit to Test File
+### Step 5: Run the Test Yourself
-When your sequence works, save it:
+**⚠️ YOU must run the test - do NOT tell the user to run it:**
+```bash
+npx vitest run tests/login.test.mjs
 ```
-commit({
-  testFile: "tests/login.test.mjs",
-  testName: "Login Flow",
-  testDescription: "User can log in with email and password"
-})
-```
-### Step 6: Verify the Test
-Run the generated test from scratch to ensure it works:
-```
-verify({ testFile: "tests/login.test.mjs" })
-```
+Analyze the output, fix any issues, and iterate until the test passes.
 ### MCP Tools Reference
 | Tool | Description |
 |------|-------------|
-| `session_start` | Start sandbox with browser/app, capture initial screenshot |
-| `session_status` | Check session health, time remaining, command count |
+| `session_start` | Start sandbox with browser/app, returns screenshot + provision code |
+| `session_status` | Check session health and time remaining |
 | `session_extend` | Add more time before session expires |
 | `find` | Locate element by description, returns ref for later use |
-| `click` | Click on element ref or coordinates |
+| `click` | Click on element ref |
 | `find_and_click` | Find and click in one action |
 | `type` | Type text into focused field |
 | `press_keys` | Press keyboard shortcuts (e.g., `["ctrl", "a"]`) |
 | `scroll` | Scroll page (up/down/left/right) |
-| `check` | AI analysis of whether a task completed |
-| `assert` | AI-powered boolean assertion (pass/fail for test files) |
+| `check` | AI analysis of screen state - for YOUR understanding only, does NOT generate code |
+| `assert` | AI-powered boolean assertion - GENERATES CODE for test files |
 | `exec` | Execute JavaScript, shell, or PowerShell in sandbox |
 | `screenshot` | Capture screenshot - **only use when user explicitly asks** |
-| `commit` | Write recorded commands to test file |
-| `verify` | Run test file from scratch |
-| `get_command_log` | View recorded commands before committing |
 ### Tips for MCP Workflow
-1. **Work incrementally** - Don't try to build the entire test at once
-2. **Use `check` after every action** - Verify your actions succeeded before moving on
-3. **Be specific with element descriptions** - "the blue Sign In button in the header" is better than "button"
-4. **Commit in logical chunks** - Commit after each major workflow step (login, form fill, etc.)
-5. **Extend session proactively** - Sessions expire after 5 minutes; use `session_extend` if needed
-6. **Review the command log** - Use `get_command_log` to see what will be committed
+1. **⚠️ Write code IMMEDIATELY** - After EVERY action, append generated code to test file RIGHT AWAY
+2. **⚠️ Run tests YOURSELF** - Use `npx vitest run` - do NOT tell user to run tests
+3. **⚠️ Add screenshots liberally** - Include `await testdriver.screenshot()` after every significant action for debugging
+4. **Work incrementally** - Don't try to build the entire test at once
+5. **Use `check` after actions** - Verify your actions succeeded before moving on (for YOUR understanding)
+6. **Use `assert` for test verifications** - These generate code that goes in the test file
+7. **Be specific with element descriptions** - "the blue Sign In button in the header" is better than "button"
+8. **Extend session proactively** - Sessions expire after 5 minutes; use `session_extend` if needed
 ## Recommended Development Workflow
@@ -325,17 +340,21 @@ verify({ testFile: "tests/login.test.mjs" })
 it("should incrementally build test", async (context) => {
   const testdriver = TestDriver(context);
   await testdriver.provision.chrome({ url: "https://example.com" });
+  await testdriver.screenshot(); // Capture initial state
   // Step 1: Find and inspect
   const element = await testdriver.find("Some button");
   console.log("Element found:", element.found());
   console.log("Coordinates:", element.x, element.y);
   console.log("Confidence:", element.confidence);
+  await testdriver.screenshot(); // Capture after find
   // Step 2: Interact
   await element.click();
+  await testdriver.screenshot(); // Capture after click
   // Step 3: Assert and log
+  await testdriver.screenshot(); // Capture before assertion
   const result = await testdriver.assert("Something happened");
   console.log("Assertion result:", result);
   expect(result).toBeTruthy();
@@ -417,33 +436,42 @@ const date = await testdriver.exec("pwsh", "Get-Date", 5000);
 ### Capturing Screenshots
+**Add screenshots liberally throughout your tests** for debugging. When a test fails, you'll have a visual trail showing exactly what happened at each step.
 ```javascript
-// Capture a screenshot and save to file
-const screenshot = await testdriver.screenshot();
-const filepath = "screenshot.png";
-fs.writeFileSync(filepath, Buffer.from(screenshot, "base64"));
-console.log("Screenshot saved to:", filepath);
+// Basic screenshot - automatically saved to .testdriver/screenshots/<test-file>/
+await testdriver.screenshot();
 // Capture with mouse cursor visible
-const screenshotWithMouse = await testdriver.screenshot(1, false, true);
-fs.writeFileSync(
-  "screenshot-with-mouse.png",
-  Buffer.from(screenshotWithMouse, "base64"),
-);
-console.log("Screenshot with mouse saved to: screenshot-with-mouse.png");
+await testdriver.screenshot(1, false, true);
+// Recommended pattern: screenshot after every significant action
+await testdriver.provision.chrome({ url: "https://example.com" });
+await testdriver.screenshot(); // After page load
+await testdriver.find("Login button").click();
+await testdriver.screenshot(); // After click
+await testdriver.type("user@example.com");
+await testdriver.screenshot(); // After typing
+await testdriver.screenshot(); // Before assertion
+const result = await testdriver.assert("dashboard is visible");
 ```
 ## Tips for Agents
-1. **Use MCP tools for development** - Don't write test files manually; use the MCP workflow to build tests interactively
-2. **Always check `sdk.d.ts`** for method signatures and types when debugging generated tests
-3. **Look at test samples** in `node_modules/testdriverai/test` for working examples
-4. **Use `check` to understand screen state** - This is how you verify what the sandbox shows. Only use `screenshot` when the user asks to see the screen.
-5. **Use `check` after actions, `assert` for test files** - `check` gives detailed AI analysis, `assert` gives boolean pass/fail
-6. **Be specific with element descriptions** - "blue Sign In button in the header" > "button"
-7. **Start simple** - get one step working before adding more
-8. **Commit working sequences** - Don't lose progress; use `commit` after each successful interaction sequence
-9. **Always `await` async methods** - TestDriver will warn if you forget, but for TypeScript projects, add `@typescript-eslint/no-floating-promises` to your ESLint config to catch missing `await` at compile time:
+1. **⚠️ WRITE CODE IMMEDIATELY** - After EVERY successful MCP action, append the generated code to the test file RIGHT AWAY. Do NOT wait until the session ends.
+2. **⚠️ RUN TESTS YOURSELF** - Do NOT tell the user to run tests. YOU must run the tests using `npx vitest run <testFile>`. Analyze the output and iterate until the test passes.
+3. **⚠️ ADD SCREENSHOTS LIBERALLY** - Include `await testdriver.screenshot()` throughout your tests: after provision, before/after clicks, after typing, and before assertions. This creates a visual trail that makes debugging failures much easier.
+4. **Use MCP tools for development** - Build tests interactively with visual feedback
+5. **Always check `sdk.d.ts`** for method signatures and types when debugging generated tests
+6. **Look at test samples** in `node_modules/testdriverai/test` for working examples
+7. **Use `check` to understand screen state** - This is how you verify what the sandbox shows during MCP development.
+8. **Use `check` after actions, `assert` for test files** - `check` gives detailed AI analysis (no code), `assert` gives boolean pass/fail (generates code)
+9. **Be specific with element descriptions** - "blue Sign In button in the header" > "button"
+10. **Start simple** - get one step working before adding more
+11. **Always `await` async methods** - TestDriver will warn if you forget, but for TypeScript projects, add `@typescript-eslint/no-floating-promises` to your ESLint config to catch missing `await` at compile time:
    ```json
    // eslint.config.js (for TypeScript projects)
@@ -453,5 +481,3 @@ console.log("Screenshot with mouse saved to: screenshot-with-mouse.png");
      }
    }
    ```
-10. **Use `verify` to validate tests** - After committing, run `verify` to ensure the generated test works from scratch.
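
The revised agent guidance repeats one pattern: capture a screenshot after every significant action. Below is a minimal sketch of how that pattern could be factored into a helper. `withScreenshot` and `createStubDriver` are hypothetical names introduced for illustration and are not part of the package; the only SDK method the helper assumes is `screenshot()`, which appears in the diffed documentation. The stub stands in for a real TestDriver instance so the sketch runs without a sandbox.

```javascript
// Hypothetical helper: run one action, then capture a screenshot so a
// failing test leaves a visual trail. Not part of the TestDriver SDK.
async function withScreenshot(testdriver, action) {
  const result = await action(testdriver);
  await testdriver.screenshot(); // the only SDK call this helper assumes
  return result;
}

// Stub driver that records calls, standing in for a real TestDriver instance.
function createStubDriver() {
  const calls = [];
  return {
    calls,
    async type(text) {
      calls.push(`type:${text}`);
    },
    async screenshot() {
      calls.push("screenshot");
    },
  };
}

// Run one wrapped action and return the recorded call order.
async function demo() {
  const driver = createStubDriver();
  await withScreenshot(driver, (td) => td.type("user@example.com"));
  return driver.calls;
}
```

In a real test the stub would be replaced by the `testdriver` object created from the Vitest context, and each wrapped action would then leave a screenshot under `.testdriver/screenshots/<test-file>/`, as the documentation above describes.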

package/interfaces/cli/commands/init.js CHANGED

@@ -21,6 +21,7 @@ class InitCommand extends BaseCommand {
     await this.createGitHubWorkflow();
     await this.createGitignore();
     await this.createVscodeMcpConfig();
+    await this.createVscodeExtensions();
     await this.installDependencies();
     await this.copySkills();
     await this.createAgents();
@@ -149,7 +150,10 @@ class InitCommand extends BaseCommand {
       try {
         execSync(`setx ${key} "${value}"`, { stdio: "ignore" });
         console.log(
-          chalk.green(`  ✓ Set ${key} as user environment variable\n`),
+          chalk.green(`  ✓ Set ${key} as user environment variable`),
+        );
+        console.log(
+          chalk.gray(`    Restart your terminal for changes to take effect\n`),
         );
       } catch (error) {
         console.log(
@@ -184,7 +188,10 @@ class InitCommand extends BaseCommand {
         );
         fs.writeFileSync(profilePath, updated);
         console.log(
-          chalk.green(`  ✓ Updated ${key} in ${profilePath}\n`),
+          chalk.green(`  ✓ Updated ${key} in ${profilePath}`),
+        );
+        console.log(
+          chalk.gray(`    Run: source ${profilePath}  (or open a new terminal)\n`),
         );
         return;
       }
@@ -193,7 +200,10 @@ class InitCommand extends BaseCommand {
     // Append to profile
     fs.appendFileSync(profilePath, `\n${exportLine}\n`);
     console.log(
-      chalk.green(`  ✓ Added ${key} to ${profilePath}\n`),
+      chalk.green(`  ✓ Added ${key} to ${profilePath}`),
+    );
+    console.log(
+      chalk.gray(`    Run: source ${profilePath}  (or open a new terminal)\n`),
     );
   }
@@ -346,11 +356,8 @@ test('should login and add item to cart', async (context) => {
     if (!fs.existsSync(configFile)) {
       const configContent = `import { defineConfig } from 'vitest/config';
 import TestDriver from 'testdriverai/vitest';
-import { config } from 'dotenv';
-// Load environment variables from .env file
-config();
+// Note: dotenv is loaded automatically by the TestDriver SDK
 export default defineConfig({
   test: {
     testTimeout: 300000,
@@ -478,10 +485,10 @@ jobs:
     if (!fs.existsSync(mcpConfigFile)) {
       const mcpConfig = {
-        mcpServers: {
+        servers: {
           testdriver: {
             command: "npx",
-            args: ["testdriverai-mcp"],
+            args: ["-p", "testdriverai@beta", "testdriverai-mcp"],
             env: {
               TD_API_KEY: "${TD_API_KEY}",
             },
@@ -499,6 +506,36 @@ jobs:
     }
   }
+  /**
+   * Create VSCode extensions recommendations
+   */
+  async createVscodeExtensions() {
+    const vscodeDir = path.join(process.cwd(), ".vscode");
+    const extensionsFile = path.join(vscodeDir, "extensions.json");
+    // Create .vscode directory if it doesn't exist
+    if (!fs.existsSync(vscodeDir)) {
+      fs.mkdirSync(vscodeDir, { recursive: true });
+      console.log(chalk.gray(`  Created directory: ${vscodeDir}`));
+    }
+    if (!fs.existsSync(extensionsFile)) {
+      const extensionsConfig = {
+        recommendations: [
+          "vitest.explorer",
+        ],
+      };
+      fs.writeFileSync(
+        extensionsFile,
+        JSON.stringify(extensionsConfig, null, 2) + "\n",
+      );
+      console.log(chalk.green(`  Created extensions config: ${extensionsFile}`));
+    } else {
+      console.log(chalk.gray("  Extensions config already exists, skipping..."));
+    }
+  }
   /**
    * Copy TestDriver skills from the package to the project
    */
@@ -557,7 +594,7 @@ jobs:
   }
   /**
-   * Create TestDriver agents in GitHub Copilot format
+   * Copy TestDriver agents to .github/agents
    */
   async createAgents() {
     const agentsDestDir = path.join(process.cwd(), ".github", "agents");
@@ -576,98 +613,36 @@ jobs:
       }
     }
+    if (!agentsSourceDir) {
+      console.log(chalk.yellow("  ⚠️  Agents directory not found, skipping agents copy..."));
+      return;
+    }
     // Create .github/agents directory if it doesn't exist
     if (!fs.existsSync(agentsDestDir)) {
       fs.mkdirSync(agentsDestDir, { recursive: true });
       console.log(chalk.gray(`  Created directory: ${agentsDestDir}`));
     }
-    // If we found source agents, convert them to .agent.md format
-    if (agentsSourceDir) {
-      const agentFiles = fs.readdirSync(agentsSourceDir).filter(f => f.endsWith(".md"));
-      for (const agentFile of agentFiles) {
-        const sourcePath = path.join(agentsSourceDir, agentFile);
-        const agentName = agentFile.replace(".md", "");
-        const destPath = path.join(agentsDestDir, `${agentName}.agent.md`);
-        if (!fs.existsSync(destPath)) {
-          const sourceContent = fs.readFileSync(sourcePath, "utf8");
-          // Parse the source frontmatter and body
-          const frontmatterMatch = sourceContent.match(/^---\n([\s\S]*?)\n---\n([\s\S]*)$/);
-          if (frontmatterMatch) {
-            const frontmatterText = frontmatterMatch[1];
-            const body = frontmatterMatch[2];
-            // Extract description from frontmatter
-            const descMatch = frontmatterText.match(/description:\s*["']?(.*?)["']?$/m);
-            const description = descMatch ? descMatch[1] : `TestDriver ${agentName} agent`;
-            // Create GitHub Copilot agent format
-            const agentContent = `---
-name: ${agentName}
-description: ${description}
-tools:
-  - testdriver/*
-mcp-servers:
-  testdriver:
-    command: npx
-    args:
-      - testdriverai-mcp
-    env:
-      TD_API_KEY: \${TD_API_KEY}
----
-${body}`;
-            fs.writeFileSync(destPath, agentContent);
-            console.log(chalk.green(`  Created agent: ${destPath}`));
-          }
-        } else {
-          console.log(chalk.gray(`  Agent ${agentName}.agent.md already exists, skipping...`));
-        }
+    // Copy agent files with .agent.md extension
+    const agentFiles = fs.readdirSync(agentsSourceDir).filter(f => f.endsWith(".md"));
+    let copiedCount = 0;
+    for (const agentFile of agentFiles) {
+      const sourcePath = path.join(agentsSourceDir, agentFile);
+      const agentName = agentFile.replace(".md", "");
+      const destPath = path.join(agentsDestDir, `${agentName}.agent.md`);
+      if (!fs.existsSync(destPath)) {
+        fs.copyFileSync(sourcePath, destPath);
+        copiedCount++;
       }
-    } else {
-      // Create a default test-writer agent if no source found
-      const defaultAgentPath = path.join(agentsDestDir, "test-writer.agent.md");
-      if (!fs.existsSync(defaultAgentPath)) {
-        const defaultAgentContent = `---
-name: test-writer
-description: An expert at creating and refining automated tests using TestDriver.ai
-tools:
-  - testdriver/*
-mcp-servers:
-  testdriver:
-    command: npx
-    args:
-      - testdriverai-mcp
-    env:
-      TD_API_KEY: \${TD_API_KEY}
----
-# TestDriver Expert
-You are an expert at writing automated tests using the TestDriver library. Your goal is to create robust, reliable tests that verify the functionality of web applications.
-## Workflow
-1. **Start Session**: Use \`session_start\` to provision a sandbox with browser
-2. **Interact**: Use \`find\`, \`click\`, \`type\` etc. - each returns a screenshot
-3. **Verify**: Use \`check\` after actions and \`assert\` for test conditions
-4. **Build Test**: Append generated code to your test file
-5. **Validate**: Use \`verify\` to run the test from scratch
-## Tips
-- Be specific with element descriptions: "blue Sign In button in the header" > "button"
-- Use \`check\` after actions to verify they succeeded
-- Start simple - get one step working before adding more
-`;
+    }
-        fs.writeFileSync(defaultAgentPath, defaultAgentContent);
-        console.log(chalk.green(`  Created default agent: ${defaultAgentPath}`));
-      }
+    if (copiedCount > 0) {
+      console.log(chalk.green(`  Copied ${copiedCount} agent(s) to ${agentsDestDir}`));
+    } else {
+      console.log(chalk.gray("  Agents already exist, skipping..."));
     }
   }
@@ -705,7 +680,7 @@ You are an expert at writing automated tests using the TestDriver library. Your
     console.log("  1. Run your tests:");
     console.log(chalk.gray("     npx vitest run\n"));
     console.log("  2. Use AI agents to write tests:");
-    console.log(chalk.gray("     Open VSCode/Cursor and use @test-writer agent\n"));
+    console.log(chalk.gray("     Open VSCode/Cursor and use @testdriver agent\n"));
     console.log("  3. MCP server configured:");
     console.log(chalk.gray("     TestDriver tools available via MCP in .vscode/mcp.json\n"));
     console.log(
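
The `createAgents` hunk above replaces frontmatter rewriting with a plain file copy. Below is a minimal standalone sketch of that copy step, assuming only Node's built-in `fs`, `os`, and `path` modules. `copyAgents` is a hypothetical name (the real method lives on `InitCommand` and also logs through `chalk`), and the sketch uses `path.basename` where the diff uses `String.replace`.

```javascript
const fs = require("fs");
const os = require("os");
const path = require("path");

// Sketch of the simplified copy step: every *.md file in sourceDir is copied
// to destDir as <name>.agent.md unless that file already exists.
// Returns the number of files copied.
function copyAgents(sourceDir, destDir) {
  fs.mkdirSync(destDir, { recursive: true });
  let copiedCount = 0;
  for (const file of fs.readdirSync(sourceDir)) {
    if (!file.endsWith(".md")) continue;
    // path.basename strips the trailing ".md"; the diff uses replace(".md", "")
    const agentName = path.basename(file, ".md");
    const destPath = path.join(destDir, `${agentName}.agent.md`);
    if (!fs.existsSync(destPath)) {
      fs.copyFileSync(path.join(sourceDir, file), destPath);
      copiedCount++;
    }
  }
  return copiedCount;
}

// Exercise the function against throwaway directories.
const root = fs.mkdtempSync(path.join(os.tmpdir(), "agents-"));
const src = path.join(root, "src");
const dest = path.join(root, ".github", "agents");
fs.mkdirSync(src);
fs.writeFileSync(path.join(src, "testdriver.md"), "---\nname: testdriver\n---\n");
const firstRun = copyAgents(src, dest); // copies testdriver.agent.md
const secondRun = copyAgents(src, dest); // destination exists, nothing copied
```

Because existing destination files are skipped, running the step twice is idempotent. That also means an agent file already present from an earlier `init` would not be refreshed by this logic.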

package/interfaces/cli/commands/setup.js CHANGED

@@ -47,7 +47,7 @@ class SetupCommand extends Command {
     this.installSkills(sourceSkills, path.join(CLAUDE_HOME, "skills"));
     this.installAgents(sourceAgents, path.join(CLAUDE_HOME, "agents"));
-    this.installMcp();
+    this.installClaudeMcp();
     this.installCursorMcp();
     await this.promptForApiKey();
@@ -125,7 +125,7 @@ class SetupCommand extends Command {
   /**
    * Add testdriver MCP server to ~/.claude.json
    */
-  installMcp() {
+  installClaudeMcp() {
     let config = {};
     if (fs.existsSync(CLAUDE_MCP_FILE)) {
@@ -182,13 +182,13 @@ class SetupCommand extends Command {
       }
     }
-    if (!config.mcpServers) {
-      config.mcpServers = {};
+    if (!config.servers) {
+      config.servers = {};
     }
-    const alreadyConfigured = config.mcpServers["testdriver-cloud"];
+    const alreadyConfigured = config.servers["testdriver-cloud"];
-    Object.assign(config.mcpServers, CURSOR_MCP_SERVER_CONFIG);
+    Object.assign(config.servers, CURSOR_MCP_SERVER_CONFIG);
     fs.writeFileSync(CURSOR_MCP_FILE, JSON.stringify(config, null, 2) + "\n");
     if (alreadyConfigured) {
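
The hunk above moves the Cursor MCP entry from the `mcpServers` key to `servers`. Below is a minimal sketch of that merge as a pure function. `mergeServers` and the example entry are hypothetical; the actual contents of `CURSOR_MCP_SERVER_CONFIG` are not shown in this diff.

```javascript
// Sketch of the merge in the hunk above: ensure a `servers` map exists,
// note whether the named entry was already present, then merge the new entry.
function mergeServers(config, serverConfig, name) {
  if (!config.servers) {
    config.servers = {};
  }
  const alreadyConfigured = Boolean(config.servers[name]);
  Object.assign(config.servers, serverConfig);
  return alreadyConfigured;
}

// Hypothetical entry for illustration only; not the package's real config.
const exampleServer = {
  "testdriver-cloud": { url: "https://example.invalid/mcp" },
};

const config = {};
const wasConfiguredFirst = mergeServers(config, exampleServer, "testdriver-cloud");
const wasConfiguredSecond = mergeServers(config, exampleServer, "testdriver-cloud");
```

Note that an entry left under the old `mcpServers` key by an earlier version would not be touched by this logic, since only `servers` is read and written.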

package/mcp-server/dist/mcp-app.html CHANGED

@@ -90,8 +90,8 @@ Note: This type uses \`Record<K, string | undefined>\` rather than \`Partial<Rec
 for compatibility with Zod schema generation. Both are functionally equivalent for validation.`);c.object({method:c.literal("ui/open-link"),params:c.object({url:c.string().describe("URL to open in the host's browser")})});var fI=c.object({isError:c.boolean().optional().describe("True if the host failed to open the URL (e.g., due to security policy).")}).passthrough(),pI=c.object({isError:c.boolean().optional().describe("True if the host rejected or failed to deliver the message.")}).passthrough();c.object({method:c.literal("ui/notifications/sandbox-proxy-ready"),params:c.object({})});var fo=c.object({connectDomains:c.array(c.string()).optional().describe("Origins for network requests (fetch/XHR/WebSocket)."),resourceDomains:c.array(c.string()).optional().describe("Origins for static resources (scripts, images, styles, fonts)."),frameDomains:c.array(c.string()).optional().describe("Origins for nested iframes (frame-src directive)."),baseUriDomains:c.array(c.string()).optional().describe("Allowed base URIs for the document (base-uri directive).")}),po=c.object({camera:c.object({}).optional().describe("Request camera access (Permission Policy `camera` feature)."),microphone:c.object({}).optional().describe("Request microphone access (Permission Policy `microphone` feature)."),geolocation:c.object({}).optional().describe("Request geolocation access (Permission Policy `geolocation` feature)."),clipboardWrite:c.object({}).optional().describe("Request clipboard write access (Permission Policy `clipboard-write` feature).")});c.object({method:c.literal("ui/notifications/size-changed"),params:c.object({width:c.number().optional().describe("New width in pixels."),height:c.number().optional().describe("New height in pixels.")})});var vI=c.object({method:c.literal("ui/notifications/tool-input"),params:c.object({arguments:c.record(c.string(),c.unknown().describe("Complete tool call arguments as key-value pairs.")).optional().describe("Complete tool call arguments as key-value 
pairs.")})}),hI=c.object({method:c.literal("ui/notifications/tool-input-partial"),params:c.object({arguments:c.record(c.string(),c.unknown().describe("Partial tool call arguments (incomplete, may change).")).optional().describe("Partial tool call arguments (incomplete, may change).")})}),gI=c.object({method:c.literal("ui/notifications/tool-cancelled"),params:c.object({reason:c.string().optional().describe('Optional reason for the cancellation (e.g., "user action", "timeout").')})}),$I=c.object({fonts:c.string().optional()}),_I=c.object({variables:mI.optional().describe("CSS variables for theming the app."),css:$I.optional().describe("CSS blocks that apps can inject.")}),bI=c.object({method:c.literal("ui/resource-teardown"),params:c.object({})});c.record(c.string(),c.unknown());var ts=c.object({text:c.object({}).optional().describe("Host supports text content blocks."),image:c.object({}).optional().describe("Host supports image content blocks."),audio:c.object({}).optional().describe("Host supports audio content blocks."),resource:c.object({}).optional().describe("Host supports resource content blocks."),resourceLink:c.object({}).optional().describe("Host supports resource link content blocks."),structuredContent:c.object({}).optional().describe("Host supports structured content.")}),yI=c.object({experimental:c.object({}).optional().describe("Experimental features (structure TBD)."),openLinks:c.object({}).optional().describe("Host supports opening external URLs."),serverTools:c.object({listChanged:c.boolean().optional().describe("Host supports tools/list_changed notifications.")}).optional().describe("Host can proxy tool calls to the MCP server."),serverResources:c.object({listChanged:c.boolean().optional().describe("Host supports resources/list_changed notifications.")}).optional().describe("Host can proxy resource reads to the MCP server."),logging:c.object({}).optional().describe("Host accepts log 
messages."),sandbox:c.object({permissions:po.optional().describe("Permissions granted by the host (camera, microphone, geolocation)."),csp:fo.optional().describe("CSP domains approved by the host.")}).optional().describe("Sandbox configuration applied by the host."),updateModelContext:ts.optional().describe("Host accepts context updates (ui/update-model-context) to be included in the model's context for future turns."),message:ts.optional().describe("Host supports receiving content messages (ui/message) from the view.")}),kI=c.object({experimental:c.object({}).optional().describe("Experimental features (structure TBD)."),tools:c.object({listChanged:c.boolean().optional().describe("App supports tools/list_changed notifications.")}).optional().describe("App exposes MCP-style tools that the host can call."),availableDisplayModes:c.array(Ut).optional().describe("Display modes the app supports.")});c.object({method:c.literal("ui/notifications/initialized"),params:c.object({}).optional()});c.object({csp:fo.optional().describe("Content Security Policy configuration."),permissions:po.optional().describe("Sandbox permissions requested by the UI."),domain:c.string().optional().describe("Dedicated origin for view sandbox."),prefersBorder:c.boolean().optional().describe("Visual boundary preference - true if UI prefers a visible border.")});c.object({method:c.literal("ui/request-display-mode"),params:c.object({mode:Ut.describe("The display mode being requested.")})});var II=c.object({mode:Ut.describe("The display mode that was actually set. May differ from requested if not supported.")}).passthrough(),wI=c.union([c.literal("model"),c.literal("app")]).describe("Tool visibility scope - who can access the tool.");c.object({resourceUri:c.string().optional(),visibility:c.array(wI).optional().describe(`Who can access this tool. Default: ["model", "app"]
 - "model": Tool visible to and callable by the agent
 - "app": Tool callable by the app from this server only`)});c.object({mimeTypes:c.array(c.string()).optional().describe('Array of supported MIME types for UI resources.\nMust include `"text/html;profile=mcp-app"` for MCP Apps support.')});c.object({method:c.literal("ui/message"),params:c.object({role:c.literal("user").describe('Message role, currently only "user" is supported.'),content:c.array(Ct).describe("Message content blocks (text, image, etc.).")})});c.object({method:c.literal("ui/notifications/sandbox-resource-ready"),params:c.object({html:c.string().describe("HTML content to load into the inner iframe."),sandbox:c.string().optional().describe("Optional override for the inner iframe's sandbox attribute."),csp:fo.optional().describe("CSP configuration from resource metadata."),permissions:po.optional().describe("Sandbox permissions from resource metadata.")})});var SI=c.object({method:c.literal("ui/notifications/tool-result"),params:Li.describe("Standard MCP tool execution result.")}),Ef=c.object({toolInfo:c.object({id:Pt.optional().describe("JSON-RPC id of the tools/call request."),tool:hr.describe("Tool definition including name, inputSchema, etc.")}).optional().describe("Metadata of the tool call that instantiated this App."),theme:cI.optional().describe("Current color theme preference."),styles:_I.optional().describe("Style configuration for theming the app."),displayMode:Ut.optional().describe("How the UI is currently displayed."),availableDisplayModes:c.array(Ut).optional().describe("Display modes the host supports."),containerDimensions:c.union([c.object({height:c.number().describe("Fixed container height in pixels.")}),c.object({maxHeight:c.union([c.number(),c.undefined()]).optional().describe("Maximum container height in pixels.")})]).and(c.union([c.object({width:c.number().describe("Fixed container width in pixels.")}),c.object({maxWidth:c.union([c.number(),c.undefined()]).optional().describe("Maximum container width in 
pixels.")})])).optional().describe(`Container dimensions. Represents the dimensions of the iframe or other
-container holding the app. Specify either width or maxWidth, and either height or maxHeight.`),locale:c.string().optional().describe("User's language and region preference in BCP 47 format."),timeZone:c.string().optional().describe("User's timezone in IANA format."),userAgent:c.string().optional().describe("Host application identifier."),platform:c.union([c.literal("web"),c.literal("desktop"),c.literal("mobile")]).optional().describe("Platform type for responsive design decisions."),deviceCapabilities:c.object({touch:c.boolean().optional().describe("Whether the device supports touch input."),hover:c.boolean().optional().describe("Whether the device supports hover interactions.")}).optional().describe("Device input capabilities."),safeAreaInsets:c.object({top:c.number().describe("Top safe area inset in pixels."),right:c.number().describe("Right safe area inset in pixels."),bottom:c.number().describe("Bottom safe area inset in pixels."),left:c.number().describe("Left safe area inset in pixels.")}).optional().describe("Mobile safe area boundaries in pixels.")}).passthrough(),xI=c.object({method:c.literal("ui/notifications/host-context-changed"),params:Ef.describe("Partial context update containing only changed fields.")});c.object({method:c.literal("ui/update-model-context"),params:c.object({content:c.array(Ct).optional().describe("Context content blocks (text, image, etc.)."),structuredContent:c.record(c.string(),c.unknown().describe("Structured content for machine-readable context data.")).optional().describe("Structured content for machine-readable context data.")})});c.object({method:c.literal("ui/initialize"),params:c.object({appInfo:Ai.describe("App identification (name and version)."),appCapabilities:kI.describe("Features and capabilities this app provides."),protocolVersion:c.string().describe("Protocol version this app supports.")})});var zI=c.object({protocolVersion:c.string().describe('Negotiated protocol version string (e.g., 
"2025-11-21").'),hostInfo:Ai.describe("Host application identification and version."),hostCapabilities:yI.describe("Features and capabilities provided by the host."),hostContext:Ef.describe("Rich context about the host environment.")}).passthrough();function ZI(e){let t=document.documentElement;t.setAttribute("data-theme",e),t.style.colorScheme=e}function UI(e,t=document.documentElement){for(let[n,r]of Object.entries(e))r!==void 0&&t.style.setProperty(n,r)}function OI(e){if(document.getElementById("__mcp-host-fonts"))return;let t=document.createElement("style");t.id="__mcp-host-fonts",t.textContent=e,document.head.appendChild(t)}class NI extends U_{constructor(n,r={},i={autoResize:!0}){super(i);se(this,"_appInfo");se(this,"_capabilities");se(this,"options");se(this,"_hostCapabilities");se(this,"_hostInfo");se(this,"_hostContext");se(this,"sendOpenLink",this.openLink);this._appInfo=n,this._capabilities=r,this.options=i,this.setRequestHandler(Ci,a=>(console.log("Received ping:",a.params),{})),this.onhostcontextchanged=()=>{}}getHostCapabilities(){return this._hostCapabilities}getHostVersion(){return this._hostInfo}getHostContext(){return this._hostContext}set ontoolinput(n){this.setNotificationHandler(vI,r=>n(r.params))}set ontoolinputpartial(n){this.setNotificationHandler(hI,r=>n(r.params))}set ontoolresult(n){this.setNotificationHandler(SI,r=>n(r.params))}set ontoolcancelled(n){this.setNotificationHandler(gI,r=>n(r.params))}set onhostcontextchanged(n){this.setNotificationHandler(xI,r=>{this._hostContext={...this._hostContext,...r.params},n(r.params)})}set onteardown(n){this.setRequestHandler(bI,(r,i)=>n(r.params,i))}set oncalltool(n){this.setRequestHandler(Ws,(r,i)=>n(r.params,i))}set onlisttools(n){this.setRequestHandler(qs,(r,i)=>n(r.params,i))}assertCapabilityForMethod(n){}assertRequestHandlerCapability(n){switch(n){case"tools/call":case"tools/list":if(!this._capabilities.tools)throw Error(`Client does not support tool capability (required for 
${n})`);return;case"ping":case"ui/resource-teardown":return;default:throw Error(`No handler for method ${n} registered`)}}assertNotificationCapability(n){}assertTaskCapability(n){throw Error("Tasks are not supported in MCP Apps")}assertTaskHandlerCapability(n){throw Error("Task handlers are not supported in MCP Apps")}async callServerTool(n,r){return await this.request({method:"tools/call",params:n},Li,r)}sendMessage(n,r){return this.request({method:"ui/message",params:n},pI,r)}sendLog(n){return this.notification({method:"notifications/message",params:n})}updateModelContext(n,r){return this.request({method:"ui/update-model-context",params:n},tr,r)}openLink(n,r){return this.request({method:"ui/open-link",params:n},fI,r)}requestDisplayMode(n,r){return this.request({method:"ui/request-display-mode",params:n},II,r)}sendSizeChanged(n){return this.notification({method:"ui/notifications/size-changed",params:n})}setupSizeChangedNotifications(){let n=!1,r=0,i=0,a=()=>{n||(n=!0,requestAnimationFrame(()=>{n=!1;let s=document.documentElement,u=s.style.width,l=s.style.height;s.style.width="fit-content",s.style.height="fit-content";let m=s.getBoundingClientRect();s.style.width=u,s.style.height=l;let d=window.innerWidth-s.clientWidth,p=Math.ceil(m.width+d),g=Math.ceil(m.height);(p!==r||g!==i)&&(r=p,i=g,this.sendSizeChanged({width:p,height:g}))}))};a();let o=new ResizeObserver(a);return o.observe(document.documentElement),o.observe(document.body),()=>o.disconnect()}async connect(n=new N_(window.parent,window.parent),r){var i;await super.connect(n);try{let a=await this.request({method:"ui/initialize",params:{appCapabilities:this._capabilities,appInfo:this._appInfo,protocolVersion:T_}},zI,r);if(a===void 0)throw Error(`Server sent invalid initialize result: ${a}`);this._hostCapabilities=a.hostCapabilities,this._hostInfo=a.hostInfo,this._hostContext=a.hostContext,await 
this.notification({method:"ui/notifications/initialized"}),(i=this.options)!=null&&i.autoResize&&this.setupSizeChangedNotifications()}catch(a){throw this.close(),a}}}const je=document.querySelector(".main"),Ee=document.getElementById("screenshot-container"),$e=document.getElementById("screenshot"),_t=document.getElementById("overlays"),Me=document.getElementById("action-status"),qe=document.getElementById("session-info"),vo=document.getElementById("loading-overlay"),TI=vo.querySelector(".loading-text"),rt=document.createElement("div");rt.id="target-info";rt.className="target-info hidden";let _i=0,qn=0,We=null;function PI(e){return e.structuredContent??{}}function ho(e="Waiting for screenshot..."){TI.textContent=e,vo.classList.remove("hidden")}function yt(){vo.classList.add("hidden")}function Df(e){var n,r,i;e.theme&&ZI(e.theme),(n=e.styles)!=null&&n.variables&&UI(e.styles.variables),(i=(r=e.styles)==null?void 0:r.css)!=null&&i.fonts&&OI(e.styles.css.fonts),e.safeAreaInsets&&(je.style.paddingTop=`${e.safeAreaInsets.top}px`,je.style.paddingRight=`${e.safeAreaInsets.right}px`,je.style.paddingBottom=`${e.safeAreaInsets.bottom}px`,je.style.paddingLeft=`${e.safeAreaInsets.left}px`);const t=e.containerDimensions;t&&("height"in t?(document.documentElement.style.height="100vh",je.style.height="100%"):"maxHeight"in t&&t.maxHeight&&(document.documentElement.style.maxHeight=`${t.maxHeight}px`,je.style.maxHeight="100%"),"width"in t?(document.documentElement.style.width="100vw",je.style.width="100%"):"maxWidth"in t&&t.maxWidth&&(document.documentElement.style.maxWidth=`${t.maxWidth}px`,je.style.maxWidth="100%"))}function is(e,t,n){return t===0?e:e/t*n}function jI(){Ee.style.transform="none",Ee.classList.remove("zoomed")}function Rf(e){requestAnimationFrame(()=>{var i,a;_t.innerHTML="";const 
t=$e.clientWidth,n=$e.clientHeight;if(console.info("addOverlays:",{action:e.action,hasElement:!!e.element,hasClickPosition:!!e.clickPosition,displayedWidth:t,displayedHeight:n,naturalWidth:_i,naturalHeight:qn}),t===0||n===0){console.warn("addOverlays: Dimensions not ready, retrying..."),setTimeout(()=>Rf(e),50);return}if((e.action==="find"||e.action==="find_and_click"||e.action==="findall")&&e.element){const o=document.createElement("div");o.className="element-target",o.style.left=`${t/2}px`,o.style.top=`${n/2}px`;const s=document.createElement("div");s.className="crosshair-h",o.appendChild(s);const u=document.createElement("div");u.className="crosshair-v",o.appendChild(u);const l=document.createElement("div");l.className="element-label",l.textContent=((i=e.element)==null?void 0:i.description)||"Element",(a=e.element)!=null&&a.confidence&&(l.textContent+=` (${Math.round(e.element.confidence*100)}%)`),o.appendChild(l),_t.appendChild(o),console.info("addOverlays: Added element target at center")}if(e.clickPosition){const o=e.clickPosition.centerX??e.clickPosition.x,s=e.clickPosition.centerY??e.clickPosition.y;if(o!==void 0&&s!==void 0&&_i>0){const u=document.createElement("div");u.className="click-marker";const l=is(o,_i,t),m=is(s,qn,n);u.style.left=`${l}px`,u.style.top=`${m}px`;const d=document.createElement("div");d.className="click-ripple",u.appendChild(d),_t.appendChild(u),console.info("addOverlays: Added click marker at",{clickX:o,clickY:s,scaledX:l,scaledY:m})}}if(e.scrollDirection){const o=document.createElement("div");o.className=`scroll-indicator scroll-${e.scrollDirection}`,o.textContent=e.scrollDirection==="up"?"↑":e.scrollDirection==="down"?"↓":e.scrollDirection==="left"?"←":"→",_t.appendChild(o)}jI(),delete Ee.dataset.focalX,delete Ee.dataset.focalY})}function EI(e){_t.innerHTML="";const t=e.action||"unknown",n=e.success?"✓":"✗",r=e.success?"success":"error";let i=`${n} ${t}`;if(e.duration&&(i+=` (${e.duration}ms)`),e.assertion&&(i+=`: 
"${e.assertion}"`),e.text&&e.action==="type"&&(i+=`: "${e.text}"`),e.error&&(i+=` - ${e.error}`),Me.textContent=i,Me.className=r,e.debuggerUrl&&(We=e.debuggerUrl),e.session){const a=e.session.expiresIn?Math.round(e.session.expiresIn/1e3):0;if(qe.innerHTML="",We){const o=document.createElement("a");o.href=We,o.target="_blank",o.rel="noopener noreferrer",o.textContent=`${a}s remaining`,o.className="debugger-link",o.title=`Open debugger: ${We}`,qe.appendChild(o)}else qe.textContent=`${a}s remaining`;qe.className=a<30?"warning":""}else if(We){qe.innerHTML="";const a=document.createElement("a");a.href=We,a.target="_blank",a.rel="noopener noreferrer",a.textContent="Open Debugger",a.className="debugger-link",a.title=We,qe.appendChild(a)}else e.action==="session_start"&&(qe.textContent="Session started");if(e.element&&(e.action==="find"||e.action==="find_and_click")){const a=e.element;let o=`<strong>Target:</strong> "${a.description||"Element"}"`;if(a.centerX!==void 0&&a.centerY!==void 0&&(o+=` <span class="target-coords">(${Math.round(a.centerX)}, ${Math.round(a.centerY)})</span>`),a.confidence!==void 0){const s=Math.round(a.confidence*100);o+=` <span class="target-confidence ${s>=70?"high":s>=40?"medium":"low"}">${s}%</span>`}a.ref&&(o+=` <span class="target-ref">ref: ${a.ref}</span>`),rt.innerHTML=o,rt.classList.remove("hidden")}else rt.classList.add("hidden");e.imageUrl?(ho("Loading image..."),$e.onerror=()=>{console.error("Image failed to load"),$e.alt="Image failed to load",Ee.style.display="none",yt()},$e.onload=()=>{console.info("Image loaded:",$e.naturalWidth,"x",$e.naturalHeight),_i=$e.naturalWidth,qn=$e.naturalHeight,Ee.style.display="block",Rf(e),yt()},$e.src=e.imageUrl,$e.style.display="block"):($e.style.display="none",Ee.style.display="none",yt())}const Oe=new NI({name:"TestDriver Screenshot",version:"1.0.0"});async function DI(e){try{console.info("Fetching screenshot from resource:",e);const n=(await 
Oe.request({method:"resources/read",params:{uri:e}},Js)).contents[0];if(!n||!("blob"in n))return console.error("Resource did not contain blob data"),null;const r=`data:${n.mimeType||"image/png"};base64,${n.blob}`;return console.info("Screenshot fetched successfully, blob length:",n.blob.length),r}catch(t){return console.error("Failed to fetch screenshot resource:",t),null}}Oe.onteardown=async()=>(console.info("TestDriver app being torn down"),{});Oe.ontoolinput=e=>{console.info("Received tool input:",e),Me.textContent="Running action...",Me.className="loading",Ee.style.display="none",ho("Running action...")};Oe.ontoolresult=async e=>{console.info("Received tool result:",e),console.info("structuredContent:",e.structuredContent);const t=PI(e);console.info("Extracted data keys:",Object.keys(t)),console.info("Has imageUrl:",!!t.imageUrl),console.info("Has screenshotResourceUri:",!!t.screenshotResourceUri),console.info("Has croppedImageResourceUri:",!!t.croppedImageResourceUri);const n=t.screenshotResourceUri||t.croppedImageResourceUri;if(n&&!t.imageUrl){ho("Fetching image...");const r=await DI(n);r&&(t.imageUrl=r)}EI(t)};Oe.ontoolcancelled=e=>{console.info("Tool cancelled:",e.reason),Me.textContent=`Cancelled: ${e.reason}`,Me.className="error",yt()};Oe.onerror=e=>{console.error("App error:",e),Me.textContent=`Error: ${e}`,Me.className="error",yt()};Oe.onhostcontextchanged=Df;Oe.connect().then(()=>{const e=Oe.getHostContext();e&&Df(e)});const $i=document.querySelector(".screenshot-wrapper");$i&&$i.parentNode&&$i.parentNode.insertBefore(rt,$i.nextSibling);</script>
-  <style rel="stylesheet" crossorigin>*{box-sizing:border-box;margin:0;padding:0}html,body{font-family:var(--font-sans, system-ui, -apple-system, sans-serif);font-size:var(--font-text-sm-size, 14px);line-height:var(--font-text-sm-line-height, 1.5);background:var(--color-background-primary, #fff);color:var(--color-text-primary, #1a1a1a);overflow:hidden;margin:0;padding:0}body{height:100%;display:flex;flex-direction:column}.main{width:100%;max-width:100%;display:flex;flex-direction:column;overflow:hidden}.screenshot-wrapper{position:relative;width:100%;flex:1;min-height:0;background:var(--color-background-secondary, #f5f5f5);border-radius:var(--border-radius-md, 8px);overflow:hidden;display:flex;flex-direction:column}.loading-overlay{display:flex;flex-direction:column;align-items:center;justify-content:center;gap:12px;padding:40px;flex:1;min-height:100px;background:var(--color-background-secondary, #f5f5f5)}.loading-overlay.hidden{display:none}.loading-spinner{width:32px;height:32px;border:3px solid var(--color-border-primary, #e5e5e5);border-top-color:#3b82f6;border-radius:50%;animation:spin .8s linear infinite}@keyframes spin{to{transform:rotate(360deg)}}.loading-text{font-size:var(--font-text-sm-size, 14px);color:var(--color-text-secondary, #666)}.screenshot-container{position:relative;width:100%;flex:1;min-height:0;display:flex;align-items:center;justify-content:center}#screenshot{display:block;max-width:100%;max-height:100%;width:auto;height:auto;object-fit:contain}#overlays{position:absolute;top:0;left:0;width:100%;height:100%;pointer-events:none}.element-target{position:absolute;width:24px;height:24px;margin-left:-12px;margin-top:-12px;background:#3b82f64d;border:3px solid #3b82f6;border-radius:50%;box-shadow:0 0 0 2px #fff,0 2px 8px #0000004d;animation:target-pulse 1.5s ease-in-out infinite}@keyframes target-pulse{0%,to{transform:scale(1);border-color:#3b82f6;box-shadow:0 0 0 2px #fff,0 2px 8px 
#0000004d}50%{transform:scale(1.1);border-color:#60a5fa;box-shadow:0 0 0 2px #fff,0 0 12px #3b82f680}}.crosshair-h,.crosshair-v{position:absolute;background:#3b82f6}.crosshair-h{width:40px;height:2px;left:50%;top:50%;margin-left:-20px;margin-top:-1px}.crosshair-v{width:2px;height:40px;left:50%;top:50%;margin-left:-1px;margin-top:-20px}.element-label{position:absolute;top:100%;left:50%;transform:translate(-50%);margin-top:24px;background:#3b82f6;color:#fff;font-size:11px;padding:4px 8px;border-radius:4px;white-space:nowrap;max-width:300px;overflow:hidden;text-overflow:ellipsis;box-shadow:0 2px 8px #0003}.click-marker{position:absolute;width:16px;height:16px;margin-left:-8px;margin-top:-8px;background:#ef4444;border:2px solid white;border-radius:50%;box-shadow:0 2px 8px #0000004d}.click-ripple{position:absolute;top:50%;left:50%;width:40px;height:40px;margin-left:-20px;margin-top:-20px;border:2px solid #ef4444;border-radius:50%;animation:ripple 1s ease-out infinite}@keyframes ripple{0%{transform:scale(.5);opacity:1}to{transform:scale(2);opacity:0}}.scroll-indicator{position:absolute;font-size:48px;color:#3b82f6;text-shadow:0 2px 8px rgba(0,0,0,.3);animation:scroll-bounce .5s ease-in-out}.scroll-up{top:20px;left:50%;transform:translate(-50%)}.scroll-down{bottom:20px;left:50%;transform:translate(-50%)}.scroll-left{left:20px;top:50%;transform:translateY(-50%)}.scroll-right{right:20px;top:50%;transform:translateY(-50%)}@keyframes scroll-bounce{0%,to{opacity:1}50%{opacity:.5}}.status-bar{display:flex;justify-content:space-between;align-items:center;padding:8px 12px;background:var(--color-background-secondary, #f5f5f5);border-top:1px solid var(--color-border-primary, #e5e5e5);font-size:var(--font-text-xs-size, 12px)}#action-status{font-weight:var(--font-weight-medium, 500)}#action-status.success{color:#10b981}#action-status.error{color:#ef4444}#action-status.loading,#session-info{color:var(--color-text-secondary, 
#666)}#session-info.warning{color:#f59e0b}.debugger-link{color:#3b82f6;text-decoration:none;transition:color .15s ease}.debugger-link:hover{color:#2563eb;text-decoration:underline}.target-info{padding:8px 12px;background:var(--color-background-secondary, #f5f5f5);border-bottom:1px solid var(--color-border-primary, #e5e5e5);font-size:var(--font-text-sm-size, 14px);color:var(--color-text-primary, #1a1a1a)}.target-info.hidden{display:none}.target-info strong{color:#3b82f6;font-weight:var(--font-weight-medium, 500)}.target-coords{color:var(--color-text-secondary, #666);font-family:var(--font-mono, ui-monospace, monospace);font-size:var(--font-text-xs-size, 12px);margin-left:4px}.target-confidence{display:inline-block;padding:1px 6px;border-radius:10px;font-size:var(--font-text-xs-size, 11px);font-weight:var(--font-weight-medium, 500);margin-left:6px}.target-confidence.high{background:#dcfce7;color:#166534}.target-confidence.medium{background:#fef3c7;color:#92400e}.target-confidence.low{background:#fee2e2;color:#991b1b}.target-ref{color:var(--color-text-tertiary, #999);font-family:var(--font-mono, ui-monospace, monospace);font-size:var(--font-text-xs-size, 11px);margin-left:8px}@media(prefers-color-scheme:dark){html:not([data-theme=light]) .element-target{border-color:#60a5fa;background:#60a5fa33;box-shadow:0 0 0 2px #1f2937,0 2px 8px #00000080}html:not([data-theme=light]) .crosshair-h,html:not([data-theme=light]) .crosshair-v{background:#60a5fa}html:not([data-theme=light]) .element-label{background:#2563eb}html:not([data-theme=light]) .click-marker{background:#f87171;border-color:#1f2937}html:not([data-theme=light]) .click-ripple{border-color:#f87171}html:not([data-theme=light]) .scroll-indicator,html:not([data-theme=light]) .debugger-link{color:#60a5fa}html:not([data-theme=light]) .debugger-link:hover{color:#93c5fd}html:not([data-theme=light]) .target-info strong{color:#60a5fa}html:not([data-theme=light]) 
.target-confidence.high{background:#166534;color:#dcfce7}html:not([data-theme=light]) .target-confidence.medium{background:#92400e;color:#fef3c7}html:not([data-theme=light]) .target-confidence.low{background:#991b1b;color:#fee2e2}}</style>
+container holding the app. Specify either width or maxWidth, and either height or maxHeight.`),locale:c.string().optional().describe("User's language and region preference in BCP 47 format."),timeZone:c.string().optional().describe("User's timezone in IANA format."),userAgent:c.string().optional().describe("Host application identifier."),platform:c.union([c.literal("web"),c.literal("desktop"),c.literal("mobile")]).optional().describe("Platform type for responsive design decisions."),deviceCapabilities:c.object({touch:c.boolean().optional().describe("Whether the device supports touch input."),hover:c.boolean().optional().describe("Whether the device supports hover interactions.")}).optional().describe("Device input capabilities."),safeAreaInsets:c.object({top:c.number().describe("Top safe area inset in pixels."),right:c.number().describe("Right safe area inset in pixels."),bottom:c.number().describe("Bottom safe area inset in pixels."),left:c.number().describe("Left safe area inset in pixels.")}).optional().describe("Mobile safe area boundaries in pixels.")}).passthrough(),xI=c.object({method:c.literal("ui/notifications/host-context-changed"),params:Ef.describe("Partial context update containing only changed fields.")});c.object({method:c.literal("ui/update-model-context"),params:c.object({content:c.array(Ct).optional().describe("Context content blocks (text, image, etc.)."),structuredContent:c.record(c.string(),c.unknown().describe("Structured content for machine-readable context data.")).optional().describe("Structured content for machine-readable context data.")})});c.object({method:c.literal("ui/initialize"),params:c.object({appInfo:Ai.describe("App identification (name and version)."),appCapabilities:kI.describe("Features and capabilities this app provides."),protocolVersion:c.string().describe("Protocol version this app supports.")})});var zI=c.object({protocolVersion:c.string().describe('Negotiated protocol version string (e.g., 
"2025-11-21").'),hostInfo:Ai.describe("Host application identification and version."),hostCapabilities:yI.describe("Features and capabilities provided by the host."),hostContext:Ef.describe("Rich context about the host environment.")}).passthrough();function ZI(e){let t=document.documentElement;t.setAttribute("data-theme",e),t.style.colorScheme=e}function UI(e,t=document.documentElement){for(let[n,r]of Object.entries(e))r!==void 0&&t.style.setProperty(n,r)}function OI(e){if(document.getElementById("__mcp-host-fonts"))return;let t=document.createElement("style");t.id="__mcp-host-fonts",t.textContent=e,document.head.appendChild(t)}class NI extends U_{constructor(n,r={},i={autoResize:!0}){super(i);se(this,"_appInfo");se(this,"_capabilities");se(this,"options");se(this,"_hostCapabilities");se(this,"_hostInfo");se(this,"_hostContext");se(this,"sendOpenLink",this.openLink);this._appInfo=n,this._capabilities=r,this.options=i,this.setRequestHandler(Ci,a=>(console.log("Received ping:",a.params),{})),this.onhostcontextchanged=()=>{}}getHostCapabilities(){return this._hostCapabilities}getHostVersion(){return this._hostInfo}getHostContext(){return this._hostContext}set ontoolinput(n){this.setNotificationHandler(vI,r=>n(r.params))}set ontoolinputpartial(n){this.setNotificationHandler(hI,r=>n(r.params))}set ontoolresult(n){this.setNotificationHandler(SI,r=>n(r.params))}set ontoolcancelled(n){this.setNotificationHandler(gI,r=>n(r.params))}set onhostcontextchanged(n){this.setNotificationHandler(xI,r=>{this._hostContext={...this._hostContext,...r.params},n(r.params)})}set onteardown(n){this.setRequestHandler(bI,(r,i)=>n(r.params,i))}set oncalltool(n){this.setRequestHandler(Ws,(r,i)=>n(r.params,i))}set onlisttools(n){this.setRequestHandler(qs,(r,i)=>n(r.params,i))}assertCapabilityForMethod(n){}assertRequestHandlerCapability(n){switch(n){case"tools/call":case"tools/list":if(!this._capabilities.tools)throw Error(`Client does not support tool capability (required for 
${n})`);return;case"ping":case"ui/resource-teardown":return;default:throw Error(`No handler for method ${n} registered`)}}assertNotificationCapability(n){}assertTaskCapability(n){throw Error("Tasks are not supported in MCP Apps")}assertTaskHandlerCapability(n){throw Error("Task handlers are not supported in MCP Apps")}async callServerTool(n,r){return await this.request({method:"tools/call",params:n},Li,r)}sendMessage(n,r){return this.request({method:"ui/message",params:n},pI,r)}sendLog(n){return this.notification({method:"notifications/message",params:n})}updateModelContext(n,r){return this.request({method:"ui/update-model-context",params:n},tr,r)}openLink(n,r){return this.request({method:"ui/open-link",params:n},fI,r)}requestDisplayMode(n,r){return this.request({method:"ui/request-display-mode",params:n},II,r)}sendSizeChanged(n){return this.notification({method:"ui/notifications/size-changed",params:n})}setupSizeChangedNotifications(){let n=!1,r=0,i=0,a=()=>{n||(n=!0,requestAnimationFrame(()=>{n=!1;let s=document.documentElement,u=s.style.width,l=s.style.height;s.style.width="fit-content",s.style.height="fit-content";let m=s.getBoundingClientRect();s.style.width=u,s.style.height=l;let d=window.innerWidth-s.clientWidth,p=Math.ceil(m.width+d),g=Math.ceil(m.height);(p!==r||g!==i)&&(r=p,i=g,this.sendSizeChanged({width:p,height:g}))}))};a();let o=new ResizeObserver(a);return o.observe(document.documentElement),o.observe(document.body),()=>o.disconnect()}async connect(n=new N_(window.parent,window.parent),r){var i;await super.connect(n);try{let a=await this.request({method:"ui/initialize",params:{appCapabilities:this._capabilities,appInfo:this._appInfo,protocolVersion:T_}},zI,r);if(a===void 0)throw Error(`Server sent invalid initialize result: ${a}`);this._hostCapabilities=a.hostCapabilities,this._hostInfo=a.hostInfo,this._hostContext=a.hostContext,await 
this.notification({method:"ui/notifications/initialized"}),(i=this.options)!=null&&i.autoResize&&this.setupSizeChangedNotifications()}catch(a){throw this.close(),a}}}const je=document.querySelector(".main"),Ee=document.getElementById("screenshot-container"),$e=document.getElementById("screenshot"),_t=document.getElementById("overlays"),Me=document.getElementById("action-status"),qe=document.getElementById("session-info"),vo=document.getElementById("loading-overlay"),TI=vo.querySelector(".loading-text"),rt=document.createElement("div");rt.id="target-info";rt.className="target-info hidden";let _i=0,qn=0,We=null;function PI(e){return e.structuredContent??{}}function ho(e="Waiting for screenshot..."){TI.textContent=e,vo.classList.remove("hidden")}function yt(){vo.classList.add("hidden")}function Df(e){var n,r,i;e.theme&&ZI(e.theme),(n=e.styles)!=null&&n.variables&&UI(e.styles.variables),(i=(r=e.styles)==null?void 0:r.css)!=null&&i.fonts&&OI(e.styles.css.fonts),e.safeAreaInsets&&(je.style.paddingTop=`${e.safeAreaInsets.top}px`,je.style.paddingRight=`${e.safeAreaInsets.right}px`,je.style.paddingBottom=`${e.safeAreaInsets.bottom}px`,je.style.paddingLeft=`${e.safeAreaInsets.left}px`);const t=e.containerDimensions;t&&("height"in t?(document.documentElement.style.height="100vh",je.style.height="100%"):"maxHeight"in t&&t.maxHeight&&(document.documentElement.style.maxHeight=`${t.maxHeight}px`,je.style.maxHeight="100%"),"width"in t?(document.documentElement.style.width="100vw",je.style.width="100%"):"maxWidth"in t&&t.maxWidth&&(document.documentElement.style.maxWidth=`${t.maxWidth}px`,je.style.maxWidth="100%"))}function is(e,t,n){return t===0?e:e/t*n}function jI(){Ee.style.transform="none",Ee.classList.remove("zoomed")}function Rf(e){requestAnimationFrame(()=>{var i,a;_t.innerHTML="";const 
t=$e.clientWidth,n=$e.clientHeight;if(console.info("addOverlays:",{action:e.action,hasElement:!!e.element,hasClickPosition:!!e.clickPosition,displayedWidth:t,displayedHeight:n,naturalWidth:_i,naturalHeight:qn}),t===0||n===0){console.warn("addOverlays: Dimensions not ready, retrying..."),setTimeout(()=>Rf(e),50);return}if((e.action==="find"||e.action==="find_and_click"||e.action==="findall")&&e.element){const o=document.createElement("div");o.className="element-target",o.style.left=`${t/2}px`,o.style.top=`${n/2}px`;const s=document.createElement("div");s.className="crosshair-h",o.appendChild(s);const u=document.createElement("div");u.className="crosshair-v",o.appendChild(u);const l=document.createElement("div");l.className="element-label",l.textContent=((i=e.element)==null?void 0:i.description)||"Element",(a=e.element)!=null&&a.confidence&&(l.textContent+=` (${Math.round(e.element.confidence*100)}%)`),o.appendChild(l),_t.appendChild(o),console.info("addOverlays: Added element target at center")}if(e.clickPosition){const o=e.clickPosition.centerX??e.clickPosition.x,s=e.clickPosition.centerY??e.clickPosition.y;if(o!==void 0&&s!==void 0&&_i>0){const u=document.createElement("div");u.className="click-marker";const l=is(o,_i,t),m=is(s,qn,n);u.style.left=`${l}px`,u.style.top=`${m}px`;const d=document.createElement("div");d.className="click-ripple",u.appendChild(d),_t.appendChild(u),console.info("addOverlays: Added click marker at",{clickX:o,clickY:s,scaledX:l,scaledY:m})}}if(e.scrollDirection){const o=document.createElement("div");o.className=`scroll-indicator scroll-${e.scrollDirection}`,o.textContent=e.scrollDirection==="up"?"↑":e.scrollDirection==="down"?"↓":e.scrollDirection==="left"?"←":"→",_t.appendChild(o)}jI(),delete Ee.dataset.focalX,delete Ee.dataset.focalY})}function EI(e){_t.innerHTML="";const t=e.action||"unknown",n=e.success?"✓":"✗",r=e.success?"success":"error";let i=`${n} ${t}`;if(e.duration&&(i+=` (${e.duration}ms)`),e.assertion&&(i+=`: 
"${e.assertion}"`),e.text&&e.action==="type"&&(i+=`: "${e.text}"`),e.error&&(i+=` - ${e.error}`),Me.textContent=i,Me.className=r,e.debuggerUrl&&(We=e.debuggerUrl),e.session){const a=e.session.expiresIn?Math.round(e.session.expiresIn/1e3):0;if(qe.innerHTML="",We){const o=document.createElement("a");o.href=We,o.target="_blank",o.rel="noopener noreferrer",o.textContent=`${a}s remaining`,o.className="debugger-link",o.title=`Open debugger: ${We}`,qe.appendChild(o)}else qe.textContent=`${a}s remaining`;qe.className=a<30?"warning":""}else if(We){qe.innerHTML="";const a=document.createElement("a");a.href=We,a.target="_blank",a.rel="noopener noreferrer",a.textContent="Open Debugger",a.className="debugger-link",a.title=We,qe.appendChild(a)}else e.action==="session_start"&&(qe.textContent="Session started");if(e.element&&(e.action==="find"||e.action==="find_and_click")){const a=e.element;let o=`<strong>Target:</strong> "${a.description||"Element"}"`;if(a.centerX!==void 0&&a.centerY!==void 0&&(o+=` <span class="target-coords">(${Math.round(a.centerX)}, ${Math.round(a.centerY)})</span>`),a.confidence!==void 0){const s=Math.round(a.confidence*100);o+=` <span class="target-confidence ${s>=70?"high":s>=40?"medium":"low"}">${s}%</span>`}rt.innerHTML=o,rt.classList.remove("hidden")}else rt.classList.add("hidden");e.imageUrl?(ho("Loading image..."),$e.onerror=()=>{console.error("Image failed to load"),$e.alt="Image failed to load",Ee.style.display="none",yt()},$e.onload=()=>{console.info("Image loaded:",$e.naturalWidth,"x",$e.naturalHeight),_i=$e.naturalWidth,qn=$e.naturalHeight,Ee.style.display="block",Rf(e),yt()},$e.src=e.imageUrl,$e.style.display="block"):($e.style.display="none",Ee.style.display="none",yt())}const Oe=new NI({name:"TestDriver Screenshot",version:"1.0.0"});async function DI(e){try{console.info("Fetching screenshot from resource:",e);const n=(await Oe.request({method:"resources/read",params:{uri:e}},Js)).contents[0];if(!n||!("blob"in n))return 
console.error("Resource did not contain blob data"),null;const r=`data:${n.mimeType||"image/png"};base64,${n.blob}`;return console.info("Screenshot fetched successfully, blob length:",n.blob.length),r}catch(t){return console.error("Failed to fetch screenshot resource:",t),null}}Oe.onteardown=async()=>(console.info("TestDriver app being torn down"),{});Oe.ontoolinput=e=>{console.info("Received tool input:",e);const t=e.arguments,n=[];t&&(t.description&&n.push(`"${t.description}"`),t.text&&n.push(`"${t.text}"`),t.url&&n.push(`${t.url}`),t.direction&&n.push(`${t.direction}`),t.assertion&&n.push(`"${t.assertion}"`),t.task&&n.push(`"${t.task}"`),t.keys&&n.push(`[${t.keys.join("+")}]`),t.type&&n.push(`${t.type}`));const r=n.length>0?n.join(" "):"action";Me.textContent=`Running ${r}...`,Me.className="loading",Ee.style.display="none",ho(`Running ${r}...`)};Oe.ontoolresult=async e=>{console.info("Received tool result:",e),console.info("structuredContent:",e.structuredContent);const t=PI(e);console.info("Extracted data keys:",Object.keys(t)),console.info("Has imageUrl:",!!t.imageUrl),console.info("Has screenshotResourceUri:",!!t.screenshotResourceUri),console.info("Has croppedImageResourceUri:",!!t.croppedImageResourceUri);const n=t.screenshotResourceUri||t.croppedImageResourceUri;if(n&&!t.imageUrl){ho("Fetching image...");const r=await DI(n);r&&(t.imageUrl=r)}EI(t)};Oe.ontoolcancelled=e=>{console.info("Tool cancelled:",e.reason),Me.textContent=`Cancelled: ${e.reason}`,Me.className="error",yt()};Oe.onerror=e=>{console.error("App error:",e),Me.textContent=`Error: ${e}`,Me.className="error",yt()};Oe.onhostcontextchanged=Df;Oe.connect().then(()=>{const e=Oe.getHostContext();e&&Df(e)});const $i=document.querySelector(".screenshot-wrapper");$i&&$i.parentNode&&$i.parentNode.insertBefore(rt,$i.nextSibling);</script>
+  <style rel="stylesheet" crossorigin>*{box-sizing:border-box;margin:0;padding:0}html,body{font-family:var(--font-sans, system-ui, -apple-system, sans-serif);font-size:var(--font-text-sm-size, 14px);line-height:var(--font-text-sm-line-height, 1.5);background:var(--color-background-primary, #fff);color:var(--color-text-primary, #1a1a1a);overflow:hidden;margin:0;padding:0}body{height:100%;display:flex;flex-direction:column}.main{width:100%;max-width:100%;display:flex;flex-direction:column;overflow:hidden}.screenshot-wrapper{position:relative;width:100%;flex:1;min-height:0;background:var(--color-background-secondary, #f5f5f5);border-radius:var(--border-radius-md, 8px);overflow:hidden;display:flex;flex-direction:column}.loading-overlay{display:flex;flex-direction:column;align-items:center;justify-content:center;gap:12px;padding:40px;flex:1;min-height:100px;background:var(--color-background-secondary, #f5f5f5)}.loading-overlay.hidden{display:none}.loading-spinner{width:32px;height:32px;border:3px solid var(--color-border-primary, #e5e5e5);border-top-color:#3b82f6;border-radius:50%;animation:spin .8s linear infinite}@keyframes spin{to{transform:rotate(360deg)}}.loading-text{font-size:var(--font-text-sm-size, 14px);color:var(--color-text-secondary, #666)}.screenshot-container{position:relative;width:100%;flex:1;min-height:0;display:flex;align-items:center;justify-content:center}#screenshot{display:block;max-width:100%;max-height:100%;width:auto;height:auto;object-fit:contain}#overlays{position:absolute;top:0;left:0;width:100%;height:100%;pointer-events:none}.element-target{position:absolute;width:24px;height:24px;margin-left:-12px;margin-top:-12px;background:#3b82f64d;border:3px solid #3b82f6;border-radius:50%;box-shadow:0 0 0 2px #fff,0 2px 8px #0000004d;animation:target-pulse 1.5s ease-in-out infinite}@keyframes target-pulse{0%,to{transform:scale(1);border-color:#3b82f6;box-shadow:0 0 0 2px #fff,0 2px 8px 
#0000004d}50%{transform:scale(1.1);border-color:#60a5fa;box-shadow:0 0 0 2px #fff,0 0 12px #3b82f680}}.crosshair-h,.crosshair-v{position:absolute;background:#3b82f6}.crosshair-h{width:40px;height:2px;left:50%;top:50%;margin-left:-20px;margin-top:-1px}.crosshair-v{width:2px;height:40px;left:50%;top:50%;margin-left:-1px;margin-top:-20px}.element-label{position:absolute;top:100%;left:50%;transform:translate(-50%);margin-top:24px;background:#3b82f6;color:#fff;font-size:11px;padding:4px 8px;border-radius:4px;white-space:nowrap;max-width:300px;overflow:hidden;text-overflow:ellipsis;box-shadow:0 2px 8px #0003}.click-marker{position:absolute;width:16px;height:16px;margin-left:-8px;margin-top:-8px;background:#ef4444;border:2px solid white;border-radius:50%;box-shadow:0 2px 8px #0000004d}.click-ripple{position:absolute;top:50%;left:50%;width:40px;height:40px;margin-left:-20px;margin-top:-20px;border:2px solid #ef4444;border-radius:50%;animation:ripple 1s ease-out infinite}@keyframes ripple{0%{transform:scale(.5);opacity:1}to{transform:scale(2);opacity:0}}.scroll-indicator{position:absolute;font-size:48px;color:#3b82f6;text-shadow:0 2px 8px rgba(0,0,0,.3);animation:scroll-bounce .5s ease-in-out}.scroll-up{top:20px;left:50%;transform:translate(-50%)}.scroll-down{bottom:20px;left:50%;transform:translate(-50%)}.scroll-left{left:20px;top:50%;transform:translateY(-50%)}.scroll-right{right:20px;top:50%;transform:translateY(-50%)}@keyframes scroll-bounce{0%,to{opacity:1}50%{opacity:.5}}.status-bar{display:flex;justify-content:space-between;align-items:center;padding:8px 12px;background:var(--color-background-secondary, #f5f5f5);border-top:1px solid var(--color-border-primary, #e5e5e5);font-size:var(--font-text-xs-size, 12px)}#action-status{font-weight:var(--font-weight-medium, 500)}#action-status.success{color:#10b981}#action-status.error{color:#ef4444}#action-status.loading,#session-info{color:var(--color-text-secondary, 
#666)}#session-info.warning{color:#f59e0b}.debugger-link{color:#3b82f6;text-decoration:none;transition:color .15s ease}.debugger-link:hover{color:#2563eb;text-decoration:underline}.target-info{padding:8px 12px;background:var(--color-background-secondary, #f5f5f5);border-bottom:1px solid var(--color-border-primary, #e5e5e5);font-size:var(--font-text-sm-size, 14px);color:var(--color-text-primary, #1a1a1a)}.target-info.hidden{display:none}.target-info strong{color:#3b82f6;font-weight:var(--font-weight-medium, 500)}.target-coords{color:var(--color-text-secondary, #666);font-family:var(--font-mono, ui-monospace, monospace);font-size:var(--font-text-xs-size, 12px);margin-left:4px}.target-confidence{display:inline-block;padding:1px 6px;border-radius:10px;font-size:var(--font-text-xs-size, 11px);font-weight:var(--font-weight-medium, 500);margin-left:6px}.target-confidence.high{background:#dcfce7;color:#166534}.target-confidence.medium{background:#fef3c7;color:#92400e}.target-confidence.low{background:#fee2e2;color:#991b1b}@media(prefers-color-scheme:dark){html:not([data-theme=light]) .element-target{border-color:#60a5fa;background:#60a5fa33;box-shadow:0 0 0 2px #1f2937,0 2px 8px #00000080}html:not([data-theme=light]) .crosshair-h,html:not([data-theme=light]) .crosshair-v{background:#60a5fa}html:not([data-theme=light]) .element-label{background:#2563eb}html:not([data-theme=light]) .click-marker{background:#f87171;border-color:#1f2937}html:not([data-theme=light]) .click-ripple{border-color:#f87171}html:not([data-theme=light]) .scroll-indicator,html:not([data-theme=light]) .debugger-link{color:#60a5fa}html:not([data-theme=light]) .debugger-link:hover{color:#93c5fd}html:not([data-theme=light]) .target-info strong{color:#60a5fa}html:not([data-theme=light]) .target-confidence.high{background:#166534;color:#dcfce7}html:not([data-theme=light]) .target-confidence.medium{background:#92400e;color:#fef3c7}html:not([data-theme=light]) 
.target-confidence.low{background:#991b1b;color:#fee2e2}}</style>
 </head>
 <body>
   <main class="main">

package/mcp-server/dist/provision-types.d.ts CHANGED

@@ -150,7 +150,7 @@ export declare const SessionStartInputSchema: z.ZodObject<{
     os: z.ZodDefault<z.ZodEnum<["linux", "windows"]>>;
     /** Keep sandbox alive duration in ms (default: 5 minutes) */
     keepAlive: z.ZodDefault<z.ZodNumber>;
-    /** Path to test file being built */
+    /** Path to test file - when provided, you MUST append generated code to this file after each action */
     testFile: z.ZodOptional<z.ZodString>;
     /** Reconnect to last sandbox */
     reconnect: z.ZodDefault<z.ZodBoolean>;

package/mcp-server/dist/provision-types.js CHANGED

@@ -121,8 +121,8 @@ export const SessionStartInputSchema = z.object({
     os: z.enum(["linux", "windows"]).default("linux").describe("Sandbox OS"),
     /** Keep sandbox alive duration in ms (default: 5 minutes) */
     keepAlive: z.number().default(300000).describe("Keep sandbox alive for this many ms"),
-    /** Path to test file being built */
-    testFile: z.string().optional().describe("Path to test file being built"),
+    /** Path to test file - when provided, you MUST append generated code to this file after each action */
+    testFile: z.string().optional().describe("Path to test file. When provided, append generated code from each action to this file immediately."),
     /** Reconnect to last sandbox */
     reconnect: z.boolean().default(false).describe("Reconnect to last sandbox"),
     /** API endpoint URL */
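The schema fields in this hunk carry defaults (`os` is "linux", `keepAlive` is 300000 ms, `reconnect` is false) while `testFile` stays optional. A minimal sketch of that behavior in plain JavaScript, without zod, might look like the following. The helper name `applySessionStartDefaults` is hypothetical and is not part of the package; it only illustrates how an omitted field would be filled in.

```javascript
// Hypothetical helper (not part of testdriverai): mirrors the defaults
// visible in the SessionStartInputSchema diff, without depending on zod.
function applySessionStartDefaults(input = {}) {
  // Like zod's .default(), only `undefined` triggers the fallback value.
  const withDefault = (value, fallback) =>
    value === undefined ? fallback : value;

  const os = withDefault(input.os, "linux");
  if (os !== "linux" && os !== "windows") {
    throw new TypeError(`os must be "linux" or "windows", got ${JSON.stringify(os)}`);
  }

  const keepAlive = withDefault(input.keepAlive, 300000); // 5 minutes in ms
  if (typeof keepAlive !== "number") {
    throw new TypeError("keepAlive must be a number of milliseconds");
  }

  const result = {
    os,
    keepAlive,
    reconnect: withDefault(input.reconnect, false),
  };

  // testFile is optional: it is only carried through when provided.
  if (input.testFile !== undefined) {
    if (typeof input.testFile !== "string") {
      throw new TypeError("testFile must be a string path");
    }
    result.testFile = input.testFile;
  }
  return result;
}
```

Calling `applySessionStartDefaults({ testFile: "tests/login.test.mjs" })` would fill in the three defaults and keep the path, which is the case the new description targets.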

package/mcp-server/dist/server.mjs CHANGED

@@ -226,14 +226,22 @@ function requireActiveSession() {
  * Images: imageUrl (data URL) goes to structuredContent for UI to display
  * The croppedImage from find() is small (~10KB) so it's acceptable as data URL
  *
- * If generatedCode is provided, it's appended to the text response so the agent
- * can add it to their test file.
+ * If generatedCode is provided, it's appended to the text response with instructions
+ * for the agent to write it to the test file.
  */
 function createToolResult(success, textContent, structuredData, generatedCode) {
-    // Build text content - append generated code if provided
+    // Build text content - append generated code if provided with directive instructions
     let fullText = textContent;
     if (generatedCode && success) {
-        fullText += `\n\nAdd to test file:\n${generatedCode}`;
+        // Get the test file from the current session
+        const session = sessionManager.getCurrentSession();
+        const testFile = session?.testFile;
+        if (testFile) {
+            fullText += `\n\n⚠️ ACTION REQUIRED: Append this code to ${testFile}:\n\`\`\`javascript\n${generatedCode}\n\`\`\``;
+        }
+        else {
+            fullText += `\n\n⚠️ ACTION REQUIRED: Append this code to the test file:\n\`\`\`javascript\n${generatedCode}\n\`\`\``;
+        }
     }
     const content = [{ type: "text", text: fullText }];
     logger.debug("createToolResult", {
@@ -244,9 +252,17 @@ function createToolResult(success, textContent, structuredData, generatedCode) {
         hasGeneratedCode: !!generatedCode
     });
     // structuredContent goes to UI (includes imageUrl for display)
+    // Always include success flag so UI can display correct status indicator
+    // Include generatedCode and testFile in structured data so agents can programmatically handle it
+    const session = sessionManager.getCurrentSession();
     return {
         content,
-        structuredContent: structuredData,
+        structuredContent: {
+            ...structuredData,
+            success,
+            generatedCode: generatedCode && success ? generatedCode : undefined,
+            testFile: session?.testFile || undefined,
+        },
     };
 }
 // Create MCP server
@@ -322,6 +338,10 @@ registerAppTool(server, "session_start", {
     title: "Session Start",
     description: `Start a new TestDriver session and provision a sandbox with browser or app.
+⚠️ IMPORTANT - Test File Parameter:
+When 'testFile' is provided, you MUST append the generated code to that file after EVERY successful action.
+Each tool response includes "ACTION REQUIRED: Append this code..." - you must write that code to the test file immediately.
 Provision types:
 - chrome: Launch Chrome browser (default). Use 'url' for starting page.
 - chromeExtension: Launch Chrome with an extension. Use 'extensionPath' or 'extensionId'.
@@ -1369,50 +1389,6 @@ Only use 'screenshot' when you explicitly want to show something to the human us
         return createToolResult(false, `Screenshot failed: ${error}`, { error: String(error) });
     }
 });
-// Verify
-server.registerTool("verify", {
-    description: "Run the test file from scratch to verify it works",
-    inputSchema: z.object({
-        testFile: z.string().describe("Path to test file to run"),
-    }),
-}, async (params) => {
-    const startTime = Date.now();
-    logger.info("verify: Starting", { testFile: params.testFile });
-    const session = sessionManager.getCurrentSession();
-    if (!fs.existsSync(params.testFile)) {
-        logger.warn("verify: Test file not found", { testFile: params.testFile });
-        return createToolResult(false, `Test file not found: ${params.testFile}`, { error: "Test file not found" });
-    }
-    const { execSync } = await import("child_process");
-    try {
-        logger.info("verify: Running vitest", { testFile: params.testFile });
-        const output = execSync(`npx vitest run "${params.testFile}" --reporter=verbose`, {
-            encoding: "utf-8",
-            timeout: 300000,
-            cwd: process.cwd(),
-            env: { ...process.env },
-        });
-        const duration = Date.now() - startTime;
-        logger.info("verify: Test passed", { testFile: params.testFile, duration });
-        return createToolResult(true, `✓ Test passed!\n\n${output}`, {
-            action: "verify",
-            success: true,
-            session: getSessionData(session),
-            duration,
-        });
-    }
-    catch (error) {
-        const duration = Date.now() - startTime;
-        logger.error("verify: Test failed", { testFile: params.testFile, error: error.message, duration });
-        return createToolResult(false, `✗ Test failed!\n\n${error.stdout || error.message}`, {
-            action: "verify",
-            success: false,
-            error: error.stdout || error.message,
-            session: getSessionData(session),
-            duration,
-        });
-    }
-});
 // Start the server
 async function main() {
     logger.info("Starting TestDriver MCP Server", {
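With this change, `createToolResult` places `success`, `generatedCode`, and `testFile` in `structuredContent`, so a client could act on them without parsing the "ACTION REQUIRED" text. The sketch below shows one way a consumer might do that. `appendGeneratedCode` and its `fallbackTestFile` parameter are hypothetical names for illustration and do not exist in the package.

```javascript
const fs = require("node:fs");

// Hypothetical client-side helper (not part of testdriverai): appends the
// generated code from a tool result's structuredContent to the test file.
// Returns true when something was written, false when there was nothing to write.
function appendGeneratedCode(structuredContent, fallbackTestFile) {
  const { success, generatedCode, testFile } = structuredContent ?? {};

  // The server only sets generatedCode on success, but check both anyway.
  if (!success || !generatedCode) {
    return false;
  }

  // testFile is undefined when session_start was called without it.
  const target = testFile || fallbackTestFile;
  if (!target) {
    throw new Error("No test file available to append generated code to");
  }

  // appendFileSync creates the file if it does not exist yet.
  fs.appendFileSync(target, `${generatedCode}\n`, "utf8");
  return true;
}
```

Because the `verify` tool is removed in this release, running the resulting file (for example with the project's own test runner) is left to the caller.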

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "testdriverai",
-  "version": "7.2.73",
+  "version": "7.2.75",
   "description": "Next generation autonomous AI agent for end-to-end testing of web & desktop",
   "main": "sdk.js",
   "types": "sdk.d.ts",

package/sdk.d.ts CHANGED

@@ -867,7 +867,22 @@ export interface DashcamAPI {
 }
 export default class TestDriverSDK {
-  constructor(apiKey: string, options?: TestDriverOptions);
+  /**
+   * Create a new TestDriverSDK instance
+   * Automatically loads environment variables from .env file via dotenv.
+   *
+   * @param apiKey - API key (optional, defaults to TD_API_KEY environment variable)
+   * @param options - SDK configuration options
+   *
+   * @example
+   * // API key loaded automatically from TD_API_KEY in .env
+   * const client = new TestDriver();
+   *
+   * @example
+   * // Or pass API key explicitly
+   * const client = new TestDriver('your-api-key');
+   */
+  constructor(apiKey?: string, options?: TestDriverOptions);
   /**
    * Whether the SDK is currently connected to a sandbox
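The constructor now accepts an optional `apiKey`, and the sdk.js hunk in this release also accepts an options object as the first argument. A minimal sketch of that resolution order, extracted into a standalone function, is shown below. `resolveConstructorArgs` and its `env` parameter are hypothetical and exist only to make the logic testable in isolation; the real code reads `process.env` directly inside the constructor.

```javascript
// Hypothetical standalone function mirroring the argument handling added to
// the TestDriverSDK constructor in sdk.js. Not an export of the package.
function resolveConstructorArgs(apiKey, options = {}, env = process.env) {
  // Support options-only calls such as: new TestDriver({ os: 'windows' })
  if (typeof apiKey === "object" && apiKey !== null) {
    options = apiKey;
    apiKey = null;
  }

  // An explicit key wins; otherwise fall back to the TD_API_KEY variable.
  const resolvedApiKey = apiKey || env.TD_API_KEY;

  return { apiKey: resolvedApiKey, options };
}
```

Note that the declared type `constructor(apiKey?: string, options?: TestDriverOptions)` does not describe the options-only form, so TypeScript callers may need to pass `undefined` as the first argument.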

package/sdk.js CHANGED

@@ -5,6 +5,9 @@ const crypto = require("crypto");
 const { formatter } = require("./sdk-log-formatter");
 const logger = require("./agent/lib/logger");
+// Load .env file into process.env by default
+require("dotenv").config();
 /**
  * Get the file path of the caller (the file that called TestDriver)
  * @returns {string|null} File path or null if not found
@@ -1233,13 +1236,18 @@ function createChainablePromise(promise) {
  * TestDriver SDK
  *
  * This SDK provides programmatic access to TestDriver's AI-powered testing capabilities.
+ * Automatically loads environment variables from .env file via dotenv.
  *
  * @example
  * const TestDriver = require('testdriverai');
  *
- * const client = new TestDriver(process.env.TD_API_KEY);
+ * // API key loaded automatically from TD_API_KEY in .env
+ * const client = new TestDriver();
  * await client.connect();
  *
+ * // Or pass API key explicitly
+ * const client = new TestDriver('your-api-key');
+ *
  * // New API
  * const element = await client.find('Submit button');
  * await element.click();
@@ -1264,9 +1272,18 @@ const { createMarkdownLogger } = require("./interfaces/logger.js");
 class TestDriverSDK {
   constructor(apiKey, options = {}) {
+    // Support calling with just options: new TestDriver({ os: 'windows' })
+    if (typeof apiKey === 'object' && apiKey !== null) {
+      options = apiKey;
+      apiKey = null;
+    }
+    // Use provided API key or fall back to environment variable
+    const resolvedApiKey = apiKey || process.env.TD_API_KEY;
     // Set up environment with API key
     const environment = {
-      TD_API_KEY: apiKey,
+      TD_API_KEY: resolvedApiKey,
       TD_API_ROOT: options.apiRoot || "https://testdriver-api.onrender.com",
       TD_RESOLUTION: options.resolution || "1366x768",
       TD_ANALYTICS: options.analytics !== false,