npm - testdriverai - Versions diffs - 7.3.32 → 7.3.34 - Mend

testdriverai 7.3.32 → 7.3.34

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (34)

package/.github/copilot-instructions.md +641 -0
package/.github/skills/testdriver:assert/SKILL.md +31 -0
package/.github/skills/testdriver:aws-setup/SKILL.md +1 -68
package/.github/skills/testdriver:client/SKILL.md +33 -0
package/.github/skills/testdriver:debugging-with-screenshots/SKILL.md +175 -45
package/.github/skills/testdriver:find/SKILL.md +87 -0
package/.github/skills/testdriver:parse/SKILL.md +124 -6
package/.github/skills/testdriver:quickstart/SKILL.md +2 -17
package/.github/skills/testdriver:reusable-code/SKILL.md +9 -0
package/.github/skills/testdriver:running-tests/SKILL.md +4 -0
package/.github/skills/testdriver:screenshot/SKILL.md +84 -3
package/.github/skills/testdriver:scroll/SKILL.md +36 -0
package/.github/skills/testdriver:testdriver/SKILL.md +194 -86
package/CHANGELOG.md +8 -0
package/docs/_data/examples-manifest.json +72 -64
package/docs/v7/examples/ai.mdx +1 -1
package/docs/v7/examples/assert.mdx +1 -1
package/docs/v7/examples/chrome-extension.mdx +1 -1
package/docs/v7/examples/drag-and-drop.mdx +1 -1
package/docs/v7/examples/element-not-found.mdx +1 -1
package/docs/v7/examples/hover-image.mdx +1 -1
package/docs/v7/examples/hover-text.mdx +1 -1
package/docs/v7/examples/installer.mdx +1 -1
package/docs/v7/examples/launch-vscode-linux.mdx +1 -1
package/docs/v7/examples/match-image.mdx +1 -1
package/docs/v7/examples/press-keys.mdx +1 -1
package/docs/v7/examples/scroll-keyboard.mdx +1 -1
package/docs/v7/examples/scroll-until-image.mdx +1 -1
package/docs/v7/examples/scroll-until-text.mdx +1 -1
package/docs/v7/examples/scroll.mdx +1 -1
package/docs/v7/examples/type.mdx +1 -1
package/docs/v7/examples/windows-installer.mdx +1 -1
package/package.json +1 -2
package/sdk.js +1 -1

package/.github/skills/testdriver:aws-setup/SKILL.md CHANGED

@@ -256,7 +256,7 @@ jobs:
         run: npm ci
       - name: Run Windows tests with self-hosted instances
-        run: vitest run examples/*.test.mjs
+        run: npx vitest run examples/*.test.mjs
         env:
           TD_API_KEY: ${{ secrets.TD_API_KEY }}
           TD_OS: windows
@@ -393,73 +393,6 @@ You can customize the AMI to include additional software or configurations:
   **Security**: Never use the default password in production. Always rotate passwords before creating custom AMIs.
 </Warning>
-## Custom Lifecycle Hooks
-You can run custom scripts after instance spawning by creating your own Vitest setup file. This is useful for:
-- Installing additional software via SSM before tests run
-- Configuring network settings or proxies
-- Running cleanup scripts after tests complete
-- Collecting logs or artifacts from instances
-### Post-Spawn Hooks
-To run commands after the instance is ready but before the test executes, create a setup file positioned **after** `setup-aws`:
-```javascript post-spawn-setup.mjs
-import { execSync } from 'child_process';
-import { beforeEach } from 'vitest';
-beforeEach(async (context) => {
-  // context.ip is set by setup-aws
-  if (!context.ip) return;
-  console.log(`Instance ready at ${context.ip}, running post-spawn setup...`);
-  // Run SSM commands on the instance
-  execSync(`aws ssm send-command \
-    --region "${process.env.AWS_REGION}" \
-    --targets "Key=tag:Name,Values=td-*" \
-    --document-name "AWS-RunPowerShellScript" \
-    --parameters 'commands=["choco install my-software -y"]'`);
-});
-```
-```javascript vitest.config.mjs
-setupFiles: [
-  'testdriverai/vitest/setup',
-  'testdriverai/vitest/setup-aws',  // Instance spawns here
-  './post-spawn-setup.mjs'          // Your hooks run AFTER instance is ready
-]
-```
-<Note>
-  Vitest executes setup files in array order. By positioning your file after `setup-aws`, the `context.ip` is already available in your `beforeEach` hook.
-</Note>
-### Post-Test Cleanup
-To run cleanup after each test completes (while the instance is still running):
-```javascript post-spawn-setup.mjs
-import { execSync } from 'child_process';
-import { beforeEach, afterEach } from 'vitest';
-beforeEach(async (context) => {
-  if (!context.ip) return;
-  // ... post-spawn setup
-});
-afterEach(async (context) => {
-  if (!context.ip) return;
-  console.log('Running post-test cleanup...');
-  execSync('./scripts/collect-logs.sh', {
-    env: { ...process.env, INSTANCE_IP: context.ip }
-  });
-});
-```
 ## Security Best Practices
 ### Network Security

package/.github/skills/testdriver:client/SKILL.md CHANGED

@@ -44,9 +44,37 @@ const testdriver = new TestDriver(apiKey, options)
       Enable or disable console logging
     </ParamField>
+    <ParamField path="autoScreenshots" type="boolean" default="true">
+      Automatically capture screenshots before and after each command. Screenshots are saved to `.testdriver/screenshots/<test>/` with descriptive filenames that include the line number and action name. Format: `<seq>-<action>-<phase>-L<line>-<description>.png`
+    </ParamField>
     <ParamField path="environment" type="object">
       Additional environment variables to pass to the sandbox
     </ParamField>
+    <ParamField path="ai" type="object">
+      Global AI sampling configuration. Controls how the AI model generates responses for `find()` verification and `assert()` calls. Can be overridden per call.
+      <Expandable title="properties">
+        <ParamField path="temperature" type="number">
+          Controls randomness in AI responses. `0` = deterministic (best for verification), higher values = more creative. Default: `0` for find verification, model default for assert.
+        </ParamField>
+        <ParamField path="top" type="object">
+          Nucleus and top-k sampling parameters
+          <Expandable title="properties">
+            <ParamField path="p" type="number">
+              Top-P (nucleus sampling). Limits token choices to the smallest set whose cumulative probability exceeds P. Lower values = more focused responses. Range: 0-1.
+            </ParamField>
+            <ParamField path="k" type="number">
+              Top-K sampling. Limits token choices to the top K most likely tokens. `1` = always pick the most likely token. `0` = disabled (consider all tokens).
+            </ParamField>
+          </Expandable>
+        </ParamField>
+      </Expandable>
+    </ParamField>
   </Expandable>
 </ParamField>
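
The `ai` option added in this hunk is described as a global default that "can be overridden per call". The diff does not show how the two levels are combined, so the following is only a minimal sketch of one plausible resolution, assuming per-call fields win and the nested `top` object is merged field by field. `resolveAiConfig` is a hypothetical helper for illustration, not part of the testdriverai API.

```javascript
// Hypothetical helper (not part of testdriverai): combine a global `ai`
// config with a per-call override, assuming per-call fields take precedence.
function resolveAiConfig(globalAi = {}, callAi = {}) {
  return {
    ...globalAi,
    ...callAi,
    // `top` is nested, so merge it separately to keep unspecified p/k values.
    top: { ...(globalAi.top || {}), ...(callAi.top || {}) },
  };
}

// Global config taken from the constructor example in this diff.
const globalAi = { temperature: 0, top: { p: 0.9, k: 40 } };

// A per-call override that only tightens top-k.
const resolved = resolveAiConfig(globalAi, { top: { k: 1 } });
console.log(resolved); // { temperature: 0, top: { p: 0.9, k: 1 } }
```

A shallow merge alone would drop `p` whenever a call passes only `k`, which is why the sketch merges `top` on its own. Whether the package behaves this way is an assumption.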
@@ -63,6 +91,11 @@ const testdriver = new TestDriver({
   analytics: true
 });
+// With AI config for stricter verification
+const testdriver = new TestDriver({
+  ai: { temperature: 0, top: { p: 0.9, k: 40 } }
+});
 // Or pass API key explicitly
 const testdriver = new TestDriver('your-api-key', {
   os: 'windows'

package/.github/skills/testdriver:debugging-with-screenshots/SKILL.md CHANGED

@@ -8,20 +8,60 @@ description: View and analyze saved screenshots using MCP commands for test debu
 TestDriver MCP provides powerful commands to view and analyze screenshots saved during test execution. This enables rapid debugging, test development, and comparison workflows without manually opening image files.
+<Note>
+  **Automatic Screenshots (Default: Enabled)**: TestDriver automatically captures screenshots before and after every command. Screenshots are named with the line number and action, making it easy to trace exactly which line of code produced each screenshot. For example: `001-click-before-L42-submit-button.png`
+</Note>
 ## MCP Commands
 ### list_local_screenshots
-List all screenshots saved in the `.testdriver/screenshots/` directory:
+List and filter screenshots saved in the `.testdriver/screenshots/` directory:
 ```
 list_local_screenshots()
 ```
-**Optional Parameters:**
+**Filter Parameters:**
 <ParamField path="directory" type="string" optional>
-  Filter screenshots by subdirectory (e.g., specific test file). If omitted, lists all screenshots.
+  Filter screenshots by test file or subdirectory (e.g., "login.test", "mcp-screenshots"). If omitted, lists all screenshots.
+</ParamField>
+<ParamField path="line" type="number" optional>
+  Filter by exact line number from test file (e.g., 42 matches L42 in filename).
+</ParamField>
+<ParamField path="lineRange" type="object" optional>
+  Filter by line number range. Example: `{ start: 10, end: 20 }` matches screenshots from lines 10-20.
+</ParamField>
+<ParamField path="action" type="string" optional>
+  Filter by action type: `click`, `find`, `type`, `assert`, `provision`, `scroll`, `hover`, etc.
+</ParamField>
+<ParamField path="phase" type="string" optional>
+  Filter by phase: `"before"` (state before action) or `"after"` (state after action).
+</ParamField>
+<ParamField path="pattern" type="string" optional>
+  Regex pattern to match against filename. Example: `"login|signin"` or `"button.*click"`.
+</ParamField>
+<ParamField path="sequence" type="number" optional>
+  Filter by exact sequence number.
+</ParamField>
+<ParamField path="sequenceRange" type="object" optional>
+  Filter by sequence range. Example: `{ start: 1, end: 10 }` matches first 10 screenshots.
+</ParamField>
+<ParamField path="limit" type="number" optional>
+  Maximum number of results to return (default: 50).
+</ParamField>
+<ParamField path="sortBy" type="string" optional>
+  Sort results by: `"modified"` (newest first, default), `"sequence"` (execution order), or `"line"` (line number).
 </ParamField>
 **Returns:**
@@ -29,24 +69,31 @@ list_local_screenshots()
 Array of screenshot metadata including:
 - `path` - Full absolute path to the screenshot file
 - `relativePath` - Path relative to `.testdriver/screenshots/`
-- `testFile` - The test file that created this screenshot
-- `filename` - Screenshot filename
-- `size` - File size in bytes
+- `name` - Screenshot filename
+- `sizeBytes` - File size in bytes
 - `modified` - Last modification timestamp
-- `created` - Creation timestamp
+- `sequence` - Sequential number (from auto-screenshots)
+- `action` - Action type (click, find, etc.)
+- `phase` - Before/after phase
+- `lineNumber` - Line number from test file
+- `description` - Element or action description
-**Example Response:**
+**Example Responses:**
 ```json
+// Basic listing
 [
   {
-    "path": "/Users/user/project/.testdriver/screenshots/login.test/screenshot-1737633600000.png",
-    "relativePath": "login.test/screenshot-1737633600000.png",
-    "testFile": "login.test",
-    "filename": "screenshot-1737633600000.png",
-    "size": 145632,
+    "path": "/Users/user/project/.testdriver/screenshots/login.test/001-click-before-L42-submit-button.png",
+    "relativePath": "login.test/001-click-before-L42-submit-button.png",
+    "name": "001-click-before-L42-submit-button.png",
+    "sizeBytes": 145632,
     "modified": "2026-01-23T10:00:00.000Z",
-    "created": "2026-01-23T10:00:00.000Z"
+    "sequence": 1,
+    "action": "click",
+    "phase": "before",
+    "lineNumber": 42,
+    "description": "submit-button"
   }
 ]
 ```
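
To make the filter parameters above concrete, here is a minimal sketch that applies a subset of them (`line`, `lineRange`, `action`, `phase`, `pattern`, `sortBy`, `limit`) to metadata objects shaped like the documented return value. `filterScreenshots` is a hypothetical function written for illustration; it is not the MCP server's implementation, and details such as case-sensitive regex matching are assumptions.

```javascript
// Hypothetical client-side filter over metadata shaped like the "Returns" list.
// Illustrates the documented filter semantics only.
function filterScreenshots(items, opts = {}) {
  const { line, lineRange, action, phase, pattern, sortBy = 'modified', limit = 50 } = opts;
  const regex = pattern ? new RegExp(pattern) : null;

  const matches = items.filter((s) =>
    (line === undefined || s.lineNumber === line) &&
    (!lineRange || (s.lineNumber >= lineRange.start && s.lineNumber <= lineRange.end)) &&
    (!action || s.action === action) &&
    (!phase || s.phase === phase) &&
    (!regex || regex.test(s.name))
  );

  const comparators = {
    modified: (a, b) => new Date(b.modified) - new Date(a.modified), // newest first
    sequence: (a, b) => a.sequence - b.sequence,                     // execution order
    line: (a, b) => a.lineNumber - b.lineNumber,                     // source order
  };
  return matches.sort(comparators[sortBy]).slice(0, limit);
}

// Sample metadata using filenames from the directory listing in this diff.
const shots = [
  { name: '001-find-before-L15-email-input.png', sequence: 1, action: 'find', phase: 'before', lineNumber: 15, modified: '2026-01-23T10:00:00.000Z' },
  { name: '003-click-before-L16-email-input.png', sequence: 3, action: 'click', phase: 'before', lineNumber: 16, modified: '2026-01-23T10:00:02.000Z' },
  { name: '004-click-after-L16-email-input.png', sequence: 4, action: 'click', phase: 'after', lineNumber: 16, modified: '2026-01-23T10:00:03.000Z' },
];

const beforeClicks = filterScreenshots(shots, { action: 'click', phase: 'before' });
console.log(beforeClicks.map((s) => s.name)); // [ '003-click-before-L16-email-input.png' ]
```

In this sketch, sorting by `sequence` with a `limit` returns the earliest entries, so fetching the last few screenshots of a run would need a descending sort or a slice from the end.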
@@ -75,34 +122,85 @@ view_local_screenshot({ path: "/full/path/to/screenshot.png" })
 ### Test Debugging After Failures
-When a test fails, view the saved screenshots to understand what went wrong:
+When a test fails, use powerful filtering to quickly find relevant screenshots:
-1. **List screenshots from the failed test:**
+**1. Find screenshots at the failing line:**
 ```
-list_local_screenshots({ directory: "login.test" })
+// If test failed at line 42
+list_local_screenshots({ line: 42 })
+// View before and after states at that line
+view_local_screenshot({ path: ".testdriver/screenshots/login.test/005-click-before-L42-submit-button.png" })
+view_local_screenshot({ path: ".testdriver/screenshots/login.test/006-click-after-L42-submit-button.png" })
 ```
-2. **View screenshots in chronological order** (sorted by creation time) to trace the test execution:
+**2. See what happened leading up to the failure:**
 ```
-view_local_screenshot({ path: ".testdriver/screenshots/login.test/screenshot-1737633600000.png" })
-view_local_screenshot({ path: ".testdriver/screenshots/login.test/screenshot-1737633610000.png" })
-view_local_screenshot({ path: ".testdriver/screenshots/login.test/screenshot-1737633620000.png" })
+// Get screenshots from lines 35-45 to see context
+list_local_screenshots({ directory: "login.test", lineRange: { start: 35, end: 45 } })
 ```
-3. **Analyze the UI state** at each step to identify where things went wrong
+**3. Find all assertion screenshots:**
+```
+// See what the screen looked like during assertions
+list_local_screenshots({ action: "assert" })
+```
+**4. View the final state before failure:**
+```
+// Get the last 5 screenshots in execution order
+list_local_screenshots({ directory: "login.test", sortBy: "sequence", limit: 5 })
+```
-4. **Compare expected vs actual** - if you added descriptive filenames with `screenshot("step-name")`, you can easily identify key moments
+### Finding Specific Actions
+When debugging element interactions:
+```
+// Find all click actions
+list_local_screenshots({ action: "click" })
+// Find what the screen looked like BEFORE each click
+list_local_screenshots({ action: "click", phase: "before" })
+// Find screenshots related to a specific element using regex
+list_local_screenshots({ pattern: "submit|button" })
+// Find all type actions (for form filling issues)
+list_local_screenshots({ action: "type" })
+```
+### Understanding Test Flow
+View screenshots in execution order to trace test behavior:
+```
+// Get screenshots in execution order
+list_local_screenshots({ directory: "checkout.test", sortBy: "sequence" })
+// Get just the first 10 actions
+list_local_screenshots({ sequenceRange: { start: 1, end: 10 }, sortBy: "sequence" })
+// Get just the last 10 actions
+list_local_screenshots({ directory: "checkout.test", sortBy: "sequence", limit: 10 })
+```
 ### Interactive Test Development
 While building tests using MCP tools, view screenshots to verify your test logic:
-1. **After a test run**, list screenshots to see what was captured:
+1. **After a test run**, filter screenshots to see specific actions:
 ```
-list_local_screenshots()
+// See all assertions
+list_local_screenshots({ action: "assert" })
+// See what happened at a specific line you're debugging
+list_local_screenshots({ line: 25 })
 ```
 2. **Review key points** in the test execution:
@@ -117,21 +215,31 @@ view_local_screenshot({ path: ".testdriver/screenshots/my-test.test/after-login.
 ### Comparison and Analysis
-Compare screenshots across multiple test runs to identify flaky behavior or UI changes:
+Compare screenshots to identify issues:
-1. **List screenshots from multiple test runs** (note: each test run clears the folder, so copy screenshots elsewhere for comparison if needed)
+**Using phase filtering for before/after comparison:**
-2. **View screenshots side-by-side** to spot differences:
+```
+// See state before all clicks
+list_local_screenshots({ action: "click", phase: "before" })
+// See state after all clicks
+list_local_screenshots({ action: "click", phase: "after" })
 ```
-view_local_screenshot({ path: ".testdriver/screenshots/test.test/before-click.png" })
-// Analyze first run
-view_local_screenshot({ path: ".testdriver/screenshots-backup/test.test/before-click.png" })
-// Compare with previous run
+**Using line-based debugging:**
+```
+// Something went wrong around line 50
+list_local_screenshots({ lineRange: { start: 45, end: 55 } })
 ```
-3. **Identify timing issues** - if element positions or states vary between runs, you may have timing/race condition issues
+**Using regex patterns:**
+```
+// Find screenshots related to login functionality
+list_local_screenshots({ pattern: "login|signin|email|password" })
+```
 ## Best Practices
@@ -188,20 +296,37 @@ Understanding the directory structure helps with efficient screenshot viewing:
 .testdriver/
   screenshots/
     login.test/              # Test file name (without .mjs extension)
-      screenshot-1737633600000.png   # Auto-generated timestamp filename
-      initial-state.png              # Custom descriptive filename
-      after-click.png
+      001-find-before-L15-email-input.png     # Auto: before find() at line 15
+      002-find-after-L15-email-input.png      # Auto: after find() at line 15
+      003-click-before-L16-email-input.png    # Auto: before click() at line 16
+      004-click-after-L16-email-input.png     # Auto: after click() at line 16
+      login-complete.png                       # Manual: screenshot("login-complete")
     checkout.test/
-      screenshot-1737633700000.png
-      product-page.png
-    profile.test/
-      screenshot-1737633800000.png
+      001-find-before-L12-add-to-cart.png
+      002-find-after-L12-add-to-cart.png
+      ...
 ```
+### Automatic Screenshot Naming Format
+`<seq>-<action>-<phase>-L<line>-<description>.png`
+| Component | Description | Example |
+|-----------|-------------|---------|
+| `seq` | Sequential number | `001`, `002` |
+| `action` | Command name | `click`, `type`, `find` |
+| `phase` | Before, after, or error | `before`, `after`, `error` |
+| `L<line>` | Line number from test file | `L42` |
+| `description` | Element/action description | `submit-button` |
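
The naming format in the table above maps directly onto the `sequence`, `action`, `phase`, `lineNumber`, and `description` metadata fields. As a rough sketch of that mapping, the following parses a filename with a regular expression. `parseScreenshotName` is a hypothetical helper, and the pattern assumes action names contain only lowercase letters, which holds for the examples in this diff but is not stated by the package.

```javascript
// Regex for the documented format: <seq>-<action>-<phase>-L<line>-<description>.png
// Assumes action names are lowercase letters with no hyphens.
const AUTO_SCREENSHOT = /^(\d+)-([a-z]+)-(before|after|error)-L(\d+)-(.+)\.png$/;

// Hypothetical helper: returns parsed fields, or null for names that do not
// follow the automatic format (for example manual screenshot() names).
function parseScreenshotName(name) {
  const m = AUTO_SCREENSHOT.exec(name);
  if (!m) return null;
  return {
    sequence: Number(m[1]),
    action: m[2],
    phase: m[3],
    lineNumber: Number(m[4]),
    description: m[5],
  };
}

console.log(parseScreenshotName('001-click-before-L42-submit-button.png'));
// { sequence: 1, action: 'click', phase: 'before', lineNumber: 42, description: 'submit-button' }
console.log(parseScreenshotName('login-complete.png')); // null
```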
+### Key Points
 - Each test file gets its own subdirectory
-- Filenames are either timestamps (default) or custom names you provide
+- Automatic screenshots include line numbers for easy tracing
+- Manual `screenshot()` calls use custom names you provide
 - Folders are cleared at the start of each test run
 - All screenshots are PNG format
+- Disable automatic screenshots with `autoScreenshots: false` if needed
 ## Integration with Test Development
@@ -248,11 +373,16 @@ When tests fail or behave unexpectedly:
   </Accordion>
   <Accordion title="Too many screenshots">
-    If you have hundreds of screenshots making it hard to find what you need:
+    If you have hundreds of screenshots making it hard to find what you need, use filtering:
-    - Use the `directory` parameter to filter by test file
-    - Consider adding more descriptive filenames in your tests
-    - Clean up old screenshot folders: `rm -rf .testdriver/screenshots/*`
+    - Filter by test file: `list_local_screenshots({ directory: "my-test.test" })`
+    - Filter by line number: `list_local_screenshots({ line: 42 })` or `list_local_screenshots({ lineRange: { start: 40, end: 50 } })`
+    - Filter by action: `list_local_screenshots({ action: "click" })`
+    - Filter by phase: `list_local_screenshots({ phase: "before" })`
+    - Use regex: `list_local_screenshots({ pattern: "submit|login" })`
+    - Limit results: `list_local_screenshots({ limit: 10 })`
+    - Sort by line: `list_local_screenshots({ sortBy: "line" })`
+    - Clean up old folders: `rm -rf .testdriver/screenshots/*`
   </Accordion>
   <Accordion title="Screenshots from old test runs">

package/.github/skills/testdriver:find/SKILL.md CHANGED

@@ -52,6 +52,30 @@ const element = await testdriver.find(description, options)
     <ParamField path="zoom" type="boolean" default={false}>
       Enable two-phase zoom mode for better precision in crowded UIs with many similar elements.
     </ParamField>
+    <ParamField path="ai" type="object">
+      AI sampling configuration for this find call (overrides global `ai` config from constructor).
+      <Expandable title="properties">
+        <ParamField path="temperature" type="number">
+          Controls randomness. `0` = deterministic. Default: `0` for find verification.
+        </ParamField>
+        <ParamField path="top" type="object">
+          Sampling parameters
+          <Expandable title="properties">
+            <ParamField path="p" type="number">
+              Top-P (nucleus sampling). Range: 0-1.
+            </ParamField>
+            <ParamField path="k" type="number">
+              Top-K sampling. `1` = most deterministic.
+            </ParamField>
+          </Expandable>
+        </ParamField>
+      </Expandable>
+    </ParamField>
   </Expandable>
 </ParamField>
@@ -220,6 +244,69 @@ This is useful for:
   ```
 </Check>
+## Confidence Threshold
+Require a minimum AI confidence score for element matches. If the confidence is below the threshold, `find()` treats the result as not found:
+```javascript
+// Require at least 90% confidence
+const element = await testdriver.find('submit button', { confidence: 0.9 });
+if (!element.found()) {
+  // AI found something but wasn't confident enough
+  throw new Error('Could not confidently locate submit button');
+}
+await element.click();
+```
+This is useful for:
+- Critical test steps where an incorrect click could cause cascading failures
+- Distinguishing between similar elements (e.g., multiple buttons)
+- Failing fast when the UI has changed unexpectedly
+```javascript
+// Combine with timeout for robust polling with confidence gate
+const element = await testdriver.find('success notification', {
+  confidence: 0.85,
+  timeout: 15000,
+});
+```
+<Tip>
+  The `confidence` value is a float between 0 and 1 (e.g., `0.9` = 90%). The AI returns its confidence with each find result, which you can also read from `element.confidence` after a successful find.
+</Tip>
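
The gating rule described in this section (a match below the threshold is treated as not found) can be sketched as follows. This is an illustration only: `applyConfidenceThreshold` is a hypothetical function, and it works on a plain `{ found, confidence }` object rather than the real element returned by `find()`, whose internals are not shown in this diff.

```javascript
// Hypothetical sketch of the confidence gate described above.
// `match` is a plain object here, not a real testdriverai element.
function applyConfidenceThreshold(match, threshold) {
  // No threshold given: accept whatever was matched.
  if (threshold === undefined) return match;
  const confident = match.found && match.confidence >= threshold;
  // Below the threshold, report the match as not found but keep the score.
  return confident ? match : { ...match, found: false };
}

const weak = applyConfidenceThreshold({ found: true, confidence: 0.8 }, 0.9);
console.log(weak.found); // false

const strong = applyConfidenceThreshold({ found: true, confidence: 0.95 }, 0.9);
console.log(strong.found); // true
```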
+## Element Type
+Use the `type` option to hint what kind of element you're looking for. This wraps your description into a more specific prompt for the AI, improving match accuracy — especially when users provide short or ambiguous descriptions.
+```javascript
+// Find text on the page
+const label = await testdriver.find('Sign In', { type: 'text' });
+// AI prompt becomes: The text "Sign In"
+// Find an image
+const logo = await testdriver.find('company logo', { type: 'image' });
+// AI prompt becomes: The image "company logo"
+// Find a UI element (button, input, checkbox, etc.)
+const btn = await testdriver.find('Submit', { type: 'ui' });
+// AI prompt becomes: The UI element "Submit"
+// No wrapping — same as omitting the option
+const el = await testdriver.find('the blue submit button', { type: 'any' });
+```
+| Type | Prompt sent to AI |
+|------|----|
+| `"text"` | `The text "..."` |
+| `"image"` | `The image "..."` |
+| `"ui"` | `The UI element "..."` |
+| `"any"` | Original description (no wrapping) |
+<Tip>
+  This is particularly useful for short descriptions like `"Submit"` or `"Login"` where the AI may not know whether to look for a button, a link, or visible text. Specifying `type` removes the ambiguity.
+</Tip>
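
The type-to-prompt table above amounts to a small lookup. A minimal sketch of that wrapping, using the prompt strings from the table: `wrapDescription` is a hypothetical name, and treating unknown types the same as `"any"` is an assumption.

```javascript
// Prompt prefixes taken from the table above; "any" has no prefix.
const PROMPT_PREFIX = new Map([
  ['text', 'The text'],
  ['image', 'The image'],
  ['ui', 'The UI element'],
]);

// Hypothetical helper: wrap a description according to the `type` hint.
// Unknown types fall through unwrapped, like "any" (an assumption).
function wrapDescription(description, type = 'any') {
  const prefix = PROMPT_PREFIX.get(type);
  return prefix ? `${prefix} "${description}"` : description;
}

console.log(wrapDescription('Sign In', 'text'));                // The text "Sign In"
console.log(wrapDescription('Submit', 'ui'));                   // The UI element "Submit"
console.log(wrapDescription('the blue submit button', 'any'));  // the blue submit button
```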
 ## Polling for Dynamic Elements
 For elements that may not be immediately visible, use the `timeout` option to automatically poll: