npm - @mindstone/mcp-server-browser-automation - Versions diffs - 0.1.7 → 0.1.8 - Mend

@mindstone/mcp-server-browser-automation 0.1.7 → 0.1.8

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (2) hide show

package/README.md +90 -12
package/package.json +2 -2

package/README.md CHANGED Viewed

@@ -1,8 +1,87 @@
-# Browser Automation MCP Server
+# @mindstone/mcp-server-browser-automation
-Headless browser control via accessibility snapshots — navigate pages, fill forms, click elements, take screenshots, and manage tabs using the [agent-browser](https://www.npmjs.com/package/agent-browser) CLI.
+[![npm version](https://img.shields.io/npm/v/@mindstone/mcp-server-browser-automation.svg)](https://www.npmjs.com/package/@mindstone/mcp-server-browser-automation)
+[![License: FSL-1.1-MIT](https://img.shields.io/badge/License-FSL--1.1--MIT-blue.svg)](./LICENSE)
-## Installation
+Browser control you can watch: open pages, sign in, click around, fill forms, take screenshots, and keep a reusable browser session.
+*Best for practical web tasks where the user needs to see, approve, or reuse browser state instead of running a full browser-testing stack.*
+## Status
+- **Version:** [0.1.7](./CHANGELOG.md) · [npm](https://www.npmjs.com/package/@mindstone/mcp-server-browser-automation)
+- **Auth:** None ([`server.json`](./server.json))
+- **Tools:** [18](./src/tools/) (navigation, observation, interaction, sessions)
+- **Surface:** browser-automation
+- **Machine-readable:** [`STATUS.json`](./STATUS.json)
+## Why this exists
+Microsoft's Playwright MCP is a strong choice for broad browser automation and testing. This connector is deliberately smaller and more visible.
+Use it when an assistant needs to work through ordinary websites in a way a person can follow: open a real browser, let the user complete a login, click through admin screens, fill forms, take screenshots, and come back to the same session later. The point is trust and day-to-day usefulness, not exposing every browser-testing capability.
+## Example interaction
+> "Open https://example.com, tell me the page title, and take a screenshot."
+Tools the host calls:
+1. `browser_navigate` — opens the URL in the configured browser session.
+2. `browser_get_page_info` — returns the current URL and page title.
+3. `browser_screenshot` — captures a PNG screenshot.
+Response (trimmed):
+```json
+{
+  "ok": true,
+  "url": "https://example.com/",
+  "title": "Example Domain"
+}
+```
+## Requirements
+- Node.js 20+
+- npm
+- The `agent-browser` CLI on `PATH`, or `npx` available so the server can install it automatically.
+<!-- BEGIN INSTALL_LINKS: do not edit by hand; regenerated by scripts/gen-install-links.mjs -->
+## One-click install
+[![Add to Cursor](https://img.shields.io/badge/Add_to_Cursor-black?style=for-the-badge&logo=cursor&logoColor=white)](cursor://anysphere.cursor-deeplink/mcp/install?name=Browser%20Automation&config=eyJ0eXBlIjoic3RkaW8iLCJjb21tYW5kIjoibnB4IiwiYXJncyI6WyIteSIsIkBtaW5kc3RvbmUvbWNwLXNlcnZlci1icm93c2VyLWF1dG9tYXRpb24iXSwiZW52Ijp7IkFHRU5UX0JST1dTRVJfU0VTU0lPTl9OQU1FIjoibWNwIiwiQUdFTlRfQlJPV1NFUl9TSE9XX1dJTkRPVyI6InRydWUifX0)
+[![Add to VS Code](https://img.shields.io/badge/Add_to_VS_Code-007ACC?style=for-the-badge&logo=visual-studio-code&logoColor=white)](vscode:mcp/install?%7B%22name%22%3A%22Browser%20Automation%22%2C%22command%22%3A%22npx%22%2C%22args%22%3A%5B%22-y%22%2C%22%40mindstone%2Fmcp-server-browser-automation%22%5D%2C%22env%22%3A%7B%22AGENT_BROWSER_SESSION_NAME%22%3A%22mcp%22%2C%22AGENT_BROWSER_SHOW_WINDOW%22%3A%22true%22%7D%7D)
+[![Add to VS Code Insiders](https://img.shields.io/badge/Add_to_VS_Code_Insiders-24bfa5?style=for-the-badge&logo=visual-studio-code&logoColor=white)](vscode-insiders:mcp/install?%7B%22name%22%3A%22Browser%20Automation%22%2C%22command%22%3A%22npx%22%2C%22args%22%3A%5B%22-y%22%2C%22%40mindstone%2Fmcp-server-browser-automation%22%5D%2C%22env%22%3A%7B%22AGENT_BROWSER_SESSION_NAME%22%3A%22mcp%22%2C%22AGENT_BROWSER_SHOW_WINDOW%22%3A%22true%22%7D%7D)
+After clicking the button, your host will prompt you to fill: `AGENT_BROWSER_SESSION_NAME`, `AGENT_BROWSER_SHOW_WINDOW`.
+<details>
+<summary>Manual config for Claude Desktop / Claude Code / Goose / Continue.dev (Browser Automation)</summary>
+```json
+{
+  "mcpServers": {
+    "Browser Automation": {
+      "command": "npx",
+      "args": [
+        "-y",
+        "@mindstone/mcp-server-browser-automation"
+      ],
+      "env": {
+        "AGENT_BROWSER_SESSION_NAME": "mcp",
+        "AGENT_BROWSER_SHOW_WINDOW": "true"
+      }
+    }
+  }
+}
+```
+</details>
+<!-- END INSTALL_LINKS -->
+## Quick Start
+### npx
 ```bash
 npx -y @mindstone/mcp-server-browser-automation
@@ -15,14 +94,12 @@ npm install -g @mindstone/mcp-server-browser-automation
 mcp-server-browser-automation
 ```
-## Requirements
 This server requires the `agent-browser` CLI binary to control the browser.
 ### Binary Resolution
 1. **PATH lookup** (preferred): If `agent-browser` is on your PATH, it is used directly.
-2. **npx fallback**: If the binary is not found, the server automatically falls back to `npx -y agent-browser@0.17`.
+2. **npx fallback**: If the binary is not found, the server automatically falls back to `npx -y agent-browser@0.26.0`.
 ### Installing agent-browser
@@ -39,7 +116,8 @@ No API keys or credentials are required. The server communicates with the browse
 | Variable | Required | Description |
 |---|---|---|
 | `AGENT_BROWSER_SESSION_NAME` | No | Session name for browser persistence (default: `mcp`) |
-| `BROWSER_AUTOMATION_ALLOW_EVAL` | No | Set to `1` to register the `browser_evaluate` tool. Off by default. See [Security considerations](#security-considerations). |
+| `AGENT_BROWSER_SHOW_WINDOW` | No | Set to `false` to run without a visible browser window. Default is visible (`true`). |
+| `BROWSER_AUTOMATION_ALLOW_EVAL` | No | Set to `1` to register the `browser_evaluate` tool. Off by default. See [Security notes](#security-notes). |
 ### MCP Host Configuration
@@ -75,7 +153,7 @@ No API keys or credentials are required. The server communicates with the browse
 - **browser_scroll** — Scroll the page in a direction
 - **browser_select** — Select an option from a dropdown
 - **browser_hover** — Hover over an element
-- **browser_evaluate** — Execute JavaScript in the page context (gated; see [Security considerations](#security-considerations))
+- **browser_evaluate** — Execute JavaScript in the page context (gated; see [Security notes](#security-notes))
 ### Session Management
 - **browser_tabs** — List open tabs or switch to a tab
@@ -91,7 +169,7 @@ The typical workflow uses accessibility snapshots for reliable element targeting
 3. `browser_click` / `browser_fill` → interact using @ref references
 4. `browser_screenshot` → visual verification
-## Security considerations
+## Security notes
 Browser automation has a large attack surface: the agent-browser CLI controls a real headless browser that loads URLs you pass it, runs page-side JavaScript, and persists cookies and session state across runs. Read this section before deploying.
@@ -103,7 +181,7 @@ Browser automation has a large attack surface: the agent-browser CLI controls a
 BROWSER_AUTOMATION_ALLOW_EVAL=1 mcp-server-browser-automation
 ```
-Without this env var, `browser_evaluate` is **not** in the tools list at all — the LLM cannot even see it. When enabled, the tool is annotated `destructiveHint: true` so MCP hosts can (and should) require explicit user confirmation before each invocation.
+Without this env var, `browser_evaluate` is **not** in the tools list at all — the LLM cannot even see it. When enabled, the tool is marked so MCP hosts can require explicit user confirmation before each invocation.
 ### URL scheme deny-list
@@ -129,6 +207,6 @@ To override the session name (for example, to keep separate profiles per project
 - **Require host confirmation** for `browser_authenticate` and any flow that may navigate to authenticated sites — otherwise prompt injection in fetched content can drive the browser at sites the user is logged into.
 - **Treat returned page content as untrusted** — accessibility snapshots, screenshots, and JavaScript-evaluation outputs come from arbitrary websites and may contain prompt-injection attempts.
-## License
+## Licence
-FSL-1.1-MIT
+[FSL-1.1-MIT](./LICENSE) — Functional Source License, Version 1.1, with MIT future licence. The software converts to MIT licence on 2030-04-08.

package/package.json CHANGED Viewed

@@ -1,8 +1,8 @@
 {
   "name": "@mindstone/mcp-server-browser-automation",
-  "version": "0.1.7",
+  "version": "0.1.8",
   "mcpName": "io.github.mindstone/mcp-server-browser-automation",
-  "description": "Browser automation MCP server \u2014 visible-by-default browser control via accessibility snapshots, navigation, form filling, screenshots, and tab management. Set AGENT_BROWSER_SHOW_WINDOW=false to run quietly.",
+  "description": "Browser automation MCP server — visible-by-default browser control via accessibility snapshots, navigation, form filling, screenshots, and tab management. Set AGENT_BROWSER_SHOW_WINDOW=false to run quietly.",
   "license": "FSL-1.1-MIT",
   "type": "module",
   "bin": {