@mercuryo-ai/agentbrowse-cli (npm) 0.1.7 → 0.1.8

Files changed (2)

package/README.md +57 -30
package/package.json +4 -4

package/README.md

@@ -1,59 +1,86 @@
 # @mercuryo-ai/agentbrowse-cli
-Browser-only CLI for AI agents.
+Browser automation CLI for AI agents.
-Choose this package when the task is navigation, observation, extraction, or
-browser actions on visible page state. If the task includes protected login,
-identity, or payment forms, pair it with `@mercuryo-ai/magicpay-cli` or use
-`@mercuryo-ai/magicpay-agent-cli`.
+AgentBrowse CLI is the operator-facing way to work with a real web page.
-## Before You Start
+Use it when you need to:
-- Node.js 18 or newer
-- A MagicPay account. Create it at `https://agents.mercuryo.io/signup`, then
-  create an API key in the dashboard.
-- A browser the agent can launch itself or attach to over the Chrome DevTools
-  Protocol (CDP)
+- launch a browser or attach to an existing CDP session;
+- inspect the current page and act on visible controls;
+- navigate directly to a known URL;
+- extract structured data from the page;
+- capture screenshots or recover a stuck browser session.
-## Install And Initialize
+AgentBrowse works well as a standalone browser tool. It can also sit next to
+MagicPay when a broader flow later reaches a protected login, identity, or
+payment step.
+Open source:
+- library and docs: `https://github.com/MercuryoAI/agentbrowse`
+## Install
 ```bash
 npm i -g @mercuryo-ai/agentbrowse-cli@latest
-agentbrowse init <magicpay-api-key>
-agentbrowse --version
 ```
-`agentbrowse init` stores the shared MagicPay config used by goal-based
-`observe` and `extract`. After setup, start a fresh agent session if the new
-CLI is not visible immediately.
+## Core Loop
-## Verify And First Use
+1. `agentbrowse launch [url]` or `agentbrowse attach <cdp-url>`
+2. `agentbrowse observe`
+3. `agentbrowse act <targetRef> <action> [value]`
+4. repeat `observe` after meaningful page changes
+5. `agentbrowse navigate <url>` if the destination is already known
+6. `agentbrowse extract '<schema-json>' [scopeRef]` when you need structured
+   output
+7. `agentbrowse screenshot` or `agentbrowse browser-status` for evidence and
+   debugging
+8. `agentbrowse close` when you are done
-Verify the install first:
+## When You Need API Setup
+Simple browser tasks work without API setup.
+Goal-based `observe` and `extract` use the shared agent config. Set it up once
+with:
 ```bash
-agentbrowse --version
+agentbrowse init <api-key>
 ```
-Then run a simple browser-only task:
+You can also provide `MAGICPAY_API_KEY` in the environment. Use
+`agentbrowse doctor` only if semantic commands still fail after `init` and you
+want to inspect `~/.magicpay/config.json`.
+## First Task
+Start with a plain browser task:
 ```bash
 agentbrowse launch "https://example.com"
-agentbrowse observe "find the next visible action to continue"
+agentbrowse observe
 ```
-Use `agentbrowse doctor` only when `init` succeeded but goal-based `observe` or
-`extract` still fail because the local MagicPay config looks wrong.
+If you want a focused or structured result:
+```bash
+agentbrowse init <api-key>
+agentbrowse observe "find the next visible action to continue"
+agentbrowse extract '{"summary":"string"}'
+```
 ## Command Groups
-- Setup: `init`, `doctor`
-- Browser lifecycle: `launch`, `attach`, `browser-status`, `close`
-- Task loop: `navigate`, `observe`, `act`, `extract`, `screenshot`
+- Session setup: `launch`, `attach`, `navigate`
+- Page work: `observe`, `act`, `extract`
+- Diagnostics and cleanup: `screenshot`, `browser-status`, `close`
+- Semantic runtime setup: `init`, `doctor`, `--version`
-## Choose Another Package When
+## Companion Tools
-- You already have the browser on a protected login, identity, or payment form:
+- If the page is already at a protected login, identity, or payment form:
   use `@mercuryo-ai/magicpay-cli`
-- You want one CLI that covers both browser work and protected fills:
+- If one CLI should cover both browser work and protected fills:
   use `@mercuryo-ai/magicpay-agent-cli`
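
The "Core Loop" added to the README in 0.1.8 can be sketched as a small shell script. This is only a sketch, assuming the subcommands behave as the new README describes. The target reference `t1`, the action name `click`, the `run` and `core_loop` helpers, and the `DRY_RUN` switch are hypothetical illustrations, not documented values. By default the script only prints the commands it would run, so it can be read or tried without a browser or an installed CLI.

```bash
#!/usr/bin/env bash
# Sketch of the README's Core Loop. Commands are printed, not executed,
# unless DRY_RUN=0 is set and agentbrowse is installed.
set -euo pipefail

DRY_RUN="${DRY_RUN:-1}"

# run: print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    printf '+'
    printf ' %s' "$@"
    printf '\n'
  else
    "$@"
  fi
}

# core_loop: launch, observe, act, observe again, extract, capture, close.
core_loop() {
  local url="$1"
  local target_ref="$2"   # hypothetical: take the real ref from `observe` output
  local action="$3"       # hypothetical action name, not confirmed by the README

  run agentbrowse launch "$url"
  run agentbrowse observe
  run agentbrowse act "$target_ref" "$action"
  run agentbrowse observe                      # re-observe after the page changed
  run agentbrowse extract '{"summary":"string"}'
  run agentbrowse screenshot
  run agentbrowse close
}

core_loop "https://example.com" "t1" "click"
```

In a real session the target reference would come from the preceding `observe` output rather than being hard-coded, and goal-based `observe` or `extract` would first need `agentbrowse init <api-key>` as the README notes.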

package/package.json

@@ -1,7 +1,7 @@
 {
   "name": "@mercuryo-ai/agentbrowse-cli",
-  "version": "0.1.7",
-  "description": "AgentBrowse CLI — browser-only operator shell on the shared MagicPay home",
+  "version": "0.1.8",
+  "description": "Browser automation CLI for AI agents",
   "type": "module",
   "license": "MIT",
   "bin": {
@@ -37,9 +37,9 @@
   "dependencies": {
     "@browserbasehq/stagehand": "^3.0.0",
     "dotenv": "^16.4.0",
-    "@mercuryo-ai/magicpay-home": "0.1.6",
     "@mercuryo-ai/agentbrowse": "0.2.53",
-    "@mercuryo-ai/magicpay-sdk": "0.1.0-test.5"
+    "@mercuryo-ai/magicpay-home": "0.1.6",
+    "@mercuryo-ai/magicpay-sdk": "0.1.0-test.6"
   },
   "devDependencies": {
     "@types/node": "^22.0.0",