@quanta-intellect/vessel-browser 0.1.65 → 0.1.68

package/README.md CHANGED
@@ -16,7 +16,7 @@ Open source chromium-based browser for persistent web agents. Linux is the most
  Vessel gives external agent harnesses a real browser with durable state, MCP control, and a human-visible supervisory UI. It is built for long-running workflows where the agent drives and the human audits, intervenes, and redirects when needed.
 
  - **Built for agent harnesses** such as Hermes Agent, OpenClaw, and other MCP clients
- - **Keeps browser state alive** with named sessions, bookmarks, checkpoints, and structured page visibility
+ - **Keeps browser state alive** with named sessions, editable bookmarks, annotated checkpoints, action undo, and structured page visibility
  - **Keeps humans in the loop** with approvals, runtime controls, and a visible browser instead of a headless black box
 
  *Vessel is in active development and currently makes no security assurances. Use and deploy it with care.*
@@ -89,14 +89,16 @@ Today, Vessel provides the browser shell, page visibility, and supervisory surfa
  - **Agent-first browser model** — Vessel is designed around an agent driving the browser while a human watches, intervenes, and redirects
  - **Human-visible browser UI** — pages render like a normal browser so agent activity stays legible instead of disappearing into a headless run
  - **Command Bar** (`Ctrl+L`) — a secondary operator surface for harness-driven workflows and future runtime commands
- - **Supervisor Sidebar** (`Ctrl+Shift+L`) — live supervision across five tabs: Supervisor, Bookmarks, Checkpoints, Chat, and Automate
+ - **Supervisor Sidebar** (`Ctrl+Shift+L`) — live supervision across seven tabs: Supervisor, Bookmarks, Checkpoints, Chat, Automate, History, and Changes
  - **Chat Assistant** — built-in conversational AI in the sidebar Chat tab; supports Anthropic, OpenAI, Ollama, llama.cpp, Mistral, xAI, Google Gemini, OpenRouter, and any OpenAI-compatible endpoint; reads the current page automatically; has full access to the same browser tools as external agents; multi-turn session history; configure provider, model, and API key in Settings
  - **Automation Kits** (Premium) — parameterized workflow templates in the sidebar Automate tab; fill in a short form and the built-in agent executes the workflow autonomously; bundled kits include Research & Collect (multi-source research with bookmark saving) and Price Scout (cross-retailer price comparison); designed for a future kit marketplace
  - **Dev Tools Panel** (`F12`) — inspect console output, network requests, and MCP/agent activity in a resizable panel at the bottom of the window; export logs by category and date range as JSON
- - **Agent-Meaningful Bookmarks** — bookmarks carry structured context the agent can read and act on: `intent` (what the page is for), `expectedContent` (what to expect on the page), `keyFields` (important form fields), `agentHints` (arbitrary directives), and a stored `pageSchema`; all fields are searchable
+ - **Action Undo / Rollback** — restore the browser to the session snapshot captured immediately before the last successful mutating agent action; available from the Supervisor tab and through the `undo_last_action` tool
+ - **Agent-Meaningful Bookmarks** — bookmarks carry structured context the agent can read and act on: `intent` (what the page is for), `expectedContent` (what to expect on the page), `keyFields` (important form fields), `agentHints` (arbitrary directives), and a stored `pageSchema`; humans can create and edit this metadata directly in the Bookmarks tab, and all fields are searchable
  - **Page Schema Inference** — Vessel automatically infers a typed schema for every page: `pageType` (article, product, form, search, checkout, login, dashboard), `primaryEntity` (structured fields for products and articles), `formFields` (with names, types, labels, selectors), and `actionButtons` (with inferred intents: submit, addToCart, login, etc.); schema is attached to every content extraction result
  - **Bookmarks for Agents** — save pages into folders, attach one-line folder summaries, and search bookmarks over MCP instead of dumping the entire library
  - **Named Session Persistence** — save cookies, localStorage, and current tab layout under a reusable name, then reload it after a restart
+ - **Annotated Checkpoints** — capture and restore short-lived browser recovery points with names and editable notes, so humans and agents can mark why a checkpoint matters before risky flows
  - **Page Highlights** — agents can visually highlight text or elements on any page with labeled, color-coded markers that persist across navigation; highlight count and navigation controls appear in the sidebar; cleared explicitly or via tool call
  - **Agent Transcript Dock** — floating transcript overlay anchored to the browser chrome; configurable display modes (off, summary, full) set in Settings; shows live agent thinking and status updates without occupying sidebar space
  - **Workflow Flow Tracking** — agents can declare a named multi-step workflow at runtime using `flow_start`; progress is tracked step-by-step with `flow_advance` and visible in the sidebar throughout execution
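
The Action Undo / Rollback entry above describes restoring the snapshot captured immediately before the last successful mutating action. A minimal sketch of that behaviour follows. `SessionSnapshot` and `UndoSlot` are hypothetical names introduced for illustration; the README does not show Vessel's real snapshot format or implementation.

```typescript
// Hypothetical snapshot shape; Vessel's actual session snapshot is not documented here.
interface SessionSnapshot {
  url: string;
  cookies: Record<string, string>;
}

// Holds at most one pre-action snapshot, matching "undo the last mutating action".
class UndoSlot {
  private previous: SessionSnapshot | null = null;

  // Runs a mutating action. The pre-action snapshot is stored only when the
  // action returns normally, so a failed action leaves the undo slot untouched.
  run(
    current: SessionSnapshot,
    action: (s: SessionSnapshot) => SessionSnapshot,
  ): SessionSnapshot {
    // Copy first, in case the action mutates `current` in place.
    const before: SessionSnapshot = {
      url: current.url,
      cookies: { ...current.cookies },
    };
    const after = action(current); // throws on failure
    this.previous = before;
    return after;
  }

  // Returns the stored snapshot (or null) and clears the slot.
  undo(): SessionSnapshot | null {
    const restored = this.previous;
    this.previous = null;
    return restored;
  }
}
```

Keeping a single slot rather than a stack mirrors the wording "last successful mutating agent action"; whether Vessel keeps deeper history is not stated.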
@@ -104,6 +106,7 @@ Today, Vessel provides the browser shell, page visibility, and supervisory surfa
  - **Popup Recovery Tools** — agents can explicitly dismiss common popups, newsletter gates, and consent walls instead of brute-forcing generic clicks
  - **Form Autofill Profiles** — save reusable personal or work profiles in Settings and fill common contact, address, and organization fields on the current page; Vessel matches fields using labels, names, placeholders, and autocomplete hints
  - **Page Diff / "What Changed?"** — Vessel remembers the last snapshot of a page and surfaces a `Changed` badge in the address bar when the title, headings, or main content differ on a later visit; expand it to see a compact summary of what changed since the last snapshot
+ - **What Changed Timeline** (Premium) — the sidebar Changes tab keeps a per-page history of recent change bursts, showing when each update was detected and a compact summary of what changed
  - **Per-Tab Ad Blocking Controls** — tabs default to ad blocking on, but agents can selectively disable and re-enable blocking when a page misbehaves
  - **Domain Policy** — allowlist or blocklist domains globally in Settings; agents cannot navigate to blocked domains
  - **Agent Credential Vault** (Premium) — encrypted credential storage for agent-driven logins; credentials are filled directly into login forms via a "blind fill" pattern and are never sent to AI providers; user consent dialog before every use; TOTP 2FA support; domain-scoped access; append-only audit log
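
The Page Diff entry above says a `Changed` badge appears when the title, headings, or main content differ from the last snapshot. The comparison could look roughly like the sketch below. `PageSnapshot`, `PageDiff`, and `diffSnapshots` are hypothetical names, and the exact comparison rules Vessel applies are an assumption.

```typescript
// Hypothetical snapshot fields, named after the three things the README says are compared.
interface PageSnapshot {
  title: string;
  headings: string[];
  mainContent: string;
}

interface PageDiff {
  changed: boolean;   // would drive the `Changed` badge
  summary: string[];  // compact, human-readable change lines
}

function diffSnapshots(prev: PageSnapshot, next: PageSnapshot): PageDiff {
  const summary: string[] = [];

  if (prev.title !== next.title) {
    summary.push(`title: "${prev.title}" -> "${next.title}"`);
  }

  // Set-based comparison: detects added and removed headings, not reordering.
  const prevHeadings = new Set(prev.headings);
  const nextHeadings = new Set(next.headings);
  const added = next.headings.filter((h) => !prevHeadings.has(h));
  const removed = prev.headings.filter((h) => !nextHeadings.has(h));
  if (added.length > 0) summary.push(`headings added: ${added.join(", ")}`);
  if (removed.length > 0) summary.push(`headings removed: ${removed.join(", ")}`);

  // Ignore leading and trailing whitespace so trivial layout noise is not flagged.
  if (prev.mainContent.trim() !== next.mainContent.trim()) {
    summary.push("main content changed");
  }

  return { changed: summary.length > 0, summary };
}
```

A timeline such as the one in the Changes tab could then be modelled as a list of `PageDiff` entries with detection timestamps, although the README does not describe how that history is stored.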
@@ -152,7 +155,7 @@ Main Process Renderer (SolidJS)
 
  Each browser tab is a separate `WebContentsView` managed by the main process. The browser chrome (SolidJS) runs in its own view layered on top. All communication between renderer and main goes through typed IPC channels via `contextBridge`.
 
- The sidebar Automate tab renders kit forms entirely in the renderer and passes the rendered prompt to the built-in agent via the same `query()` path used by the Chat tab — no additional IPC surface is needed.
+ The sidebar Automate tab renders kit forms entirely in the renderer and passes the rendered prompt to the built-in agent via the same `query()` path used by the Chat tab — no additional IPC surface is needed. The Changes tab reads the current page's diff timeline through IPC and unlocks persisted history for Premium users.
 
  ## Getting Started
 
@@ -234,8 +237,8 @@ Vessel is designed to act as the browser runtime that your external agent harnes
  2. Open Settings (`Ctrl+,`) to confirm MCP status, copy the endpoint, or change the MCP port
  3. Optional: set an Obsidian vault path, create autofill profiles, or adjust session preferences
  4. Start Hermes Agent or OpenClaw and point it at Vessel — the easiest way is `vessel-browser-mcp --stdio` as the MCP command (auth is resolved automatically), or connect directly to `http://127.0.0.1:<mcpPort>/mcp` with the bearer token from `~/.config/vessel/mcp-auth.json`
- 5. Use the Supervisor panel in Vessel's sidebar to pause the agent, change approval mode, review pending approvals, checkpoint, or restore the browser session while the harness runs
- 6. Use the Bookmarks panel to organize saved pages into folders and expose those bookmarks back to the agent over MCP
+ 5. Use the Supervisor panel in Vessel's sidebar to pause the agent, change approval mode, review pending approvals, checkpoint, undo the last mutating action, or restore the browser session while the harness runs
+ 6. Use the Bookmarks panel to organize saved pages into folders, edit agent-facing bookmark metadata, and expose those bookmarks back to the agent over MCP
 
  Notes:
 
@@ -249,6 +252,7 @@ Notes:
  - Settings now show MCP runtime status, active endpoint, startup warnings, and allow changing the MCP port with an immediate server restart
  - Settings also include reusable Form Autofill profiles for one-click filling of common contact and address forms on the active page
  - The address bar can also show a `Changed` badge when Vessel detects that a previously visited page has meaningfully changed since the last saved snapshot
+ - Premium users can open the sidebar Changes tab for the full What Changed timeline for the active page
  - Agents can selectively disable ad blocking for a problematic tab, reload, retry the flow, and turn blocking back on later
  - Agents can persist authenticated state with named sessions, for example `github-logged-in`, and reload that state in later runs
  - The intended control plane is an external harness driving Vessel through MCP
@@ -306,6 +310,7 @@ Page interaction and recovery tools exposed today include:
  - `vessel_set_ad_blocking`
  - `vessel_wait_for`
  - `vessel_screenshot` (Premium) — capture the full page as an image for visual AI analysis
+ - `undo_last_action` — restore the browser to the snapshot captured before the last successful mutating agent action
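
As a rough illustration of how a harness might invoke the new `undo_last_action` tool, the sketch below builds the JSON-RPC body and HTTP headers for an MCP `tools/call` request. It assumes the tool takes no arguments (the README lists none) and that the endpoint uses MCP's HTTP transport with the bearer token mentioned in Getting Started. `buildUndoRequest` and `buildHeaders` are hypothetical helper names, and a real client would complete the MCP `initialize` handshake before calling any tool.

```typescript
// JSON-RPC 2.0 envelope for an MCP `tools/call` request.
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

function buildUndoRequest(id: number): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    // Assumption: `undo_last_action` needs no arguments.
    params: { name: "undo_last_action", arguments: {} },
  };
}

// The token value must come from the user's own Vessel configuration;
// how the auth file is laid out is not described in the README.
function buildHeaders(bearerToken: string): Record<string, string> {
  return {
    "Content-Type": "application/json",
    // MCP's Streamable HTTP transport expects clients to accept both types.
    Accept: "application/json, text/event-stream",
    Authorization: `Bearer ${bearerToken}`,
  };
}
```

The request body and headers can then be passed to any HTTP client and posted to `http://127.0.0.1:<mcpPort>/mcp`, with `<mcpPort>` taken from Settings.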
 
  Page highlight tools:
 
@@ -521,7 +526,7 @@ src/
  │ └── src/
  │ ├── components/
  │ │ ├── chrome/ # TitleBar, TabBar, AddressBar, AgentTranscriptDock
- │ │ ├── ai/ # CommandBar, Sidebar (Supervisor/Bookmarks/Checkpoints/Chat/Automate)
+ │ │ ├── ai/ # CommandBar, Sidebar (Supervisor/Bookmarks/Checkpoints/Chat/Automate/History/Changes)
  │ │ ├── devtools/ # DevTools panel (Console, Network, Activity)
  │ │ └── shared/ # Settings panel
  │ ├── stores/ # SolidJS signal stores (tabs, ai, ui, runtime, bookmarks, etc.)