agent-browser 0.22.2 → 0.22.3
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +18 -2
- package/bin/agent-browser-darwin-arm64 +0 -0
- package/bin/agent-browser-darwin-x64 +0 -0
- package/bin/agent-browser-linux-arm64 +0 -0
- package/bin/agent-browser-linux-musl-arm64 +0 -0
- package/bin/agent-browser-linux-musl-x64 +0 -0
- package/bin/agent-browser-linux-x64 +0 -0
- package/bin/agent-browser-win32-x64.exe +0 -0
- package/package.json +1 -1
- package/skills/agent-browser/SKILL.md +12 -0
package/README.md
CHANGED
|
@@ -125,6 +125,9 @@ agent-browser pdf <path> # Save as PDF
|
|
|
125
125
|
agent-browser snapshot # Accessibility tree with refs (best for AI)
|
|
126
126
|
agent-browser eval <js> # Run JavaScript (-b for base64, --stdin for piped input)
|
|
127
127
|
agent-browser connect <port> # Connect to browser via CDP
|
|
128
|
+
agent-browser stream enable [--port <port>] # Start runtime WebSocket streaming
|
|
129
|
+
agent-browser stream status # Show runtime streaming state and bound port
|
|
130
|
+
agent-browser stream disable # Stop runtime WebSocket streaming
|
|
128
131
|
agent-browser close # Close browser (aliases: quit, exit)
|
|
129
132
|
```
|
|
130
133
|
|
|
@@ -925,13 +928,26 @@ Stream the browser viewport via WebSocket for live preview or "pair browsing" wh
|
|
|
925
928
|
|
|
926
929
|
### Enable Streaming
|
|
927
930
|
|
|
928
|
-
|
|
931
|
+
For an already-running session, enable streaming at runtime:
|
|
932
|
+
|
|
933
|
+
```bash
|
|
934
|
+
agent-browser stream enable
|
|
935
|
+
agent-browser stream status
|
|
936
|
+
agent-browser stream disable
|
|
937
|
+
```
|
|
938
|
+
|
|
939
|
+
`stream enable` binds an available localhost port automatically unless you pass `--port <port>`.
|
|
940
|
+
Use `stream status` to inspect whether streaming is enabled, which port is active, whether a browser is attached, and whether screencasting is active.
|
|
941
|
+
|
|
942
|
+
If you want streaming to be available immediately when the daemon starts, set `AGENT_BROWSER_STREAM_PORT` before the first command in that session:
|
|
929
943
|
|
|
930
944
|
```bash
|
|
931
945
|
AGENT_BROWSER_STREAM_PORT=9223 agent-browser open example.com
|
|
932
946
|
```
|
|
933
947
|
|
|
934
|
-
|
|
948
|
+
The environment variable only affects daemon startup. For sessions that are already running, use `agent-browser stream enable` instead.
|
|
949
|
+
|
|
950
|
+
Once enabled, the WebSocket server streams the browser viewport and accepts input events.
|
|
935
951
|
|
|
936
952
|
### WebSocket Protocol
|
|
937
953
|
|
|
Binary file
|
|
Binary file
|
|
Binary file
|
|
Binary file
|
|
Binary file
|
|
Binary file
|
|
Binary file
|
package/package.json
CHANGED
|
@@ -171,6 +171,12 @@ agent-browser screenshot --screenshot-dir ./shots # Save to custom directory
|
|
|
171
171
|
agent-browser screenshot --screenshot-format jpeg --screenshot-quality 80
|
|
172
172
|
agent-browser pdf output.pdf # Save as PDF
|
|
173
173
|
|
|
174
|
+
# Live preview / streaming
|
|
175
|
+
agent-browser stream enable # Start runtime WebSocket streaming on an auto-selected port
|
|
176
|
+
agent-browser stream enable --port 9223 # Bind a specific localhost port
|
|
177
|
+
agent-browser stream status # Inspect enabled state, port, connection, and screencasting
|
|
178
|
+
agent-browser stream disable # Stop runtime streaming and remove the .stream metadata file
|
|
179
|
+
|
|
174
180
|
# Clipboard
|
|
175
181
|
agent-browser clipboard read # Read text from clipboard
|
|
176
182
|
agent-browser clipboard write "Hello, World!" # Write text to clipboard
|
|
@@ -192,6 +198,12 @@ agent-browser diff url <url1> <url2> --wait-until networkidle # Custom wait str
|
|
|
192
198
|
agent-browser diff url <url1> <url2> --selector "#main" # Scope to element
|
|
193
199
|
```
|
|
194
200
|
|
|
201
|
+
## Runtime Streaming
|
|
202
|
+
|
|
203
|
+
Use `agent-browser stream enable` when you need a live WebSocket preview for an already-running session. This is the preferred runtime path because it does not require restarting the daemon. `stream enable` creates the server, `stream status` reports the bound port and connection state, and `stream disable` tears it down cleanly.
|
|
204
|
+
|
|
205
|
+
If streaming must be present from the first daemon command, `AGENT_BROWSER_STREAM_PORT` still works at daemon startup, but that environment variable is not retroactive for sessions that are already running.
|
|
206
|
+
|
|
195
207
|
## Batch Execution
|
|
196
208
|
|
|
197
209
|
Execute multiple commands in a single invocation by piping a JSON array of string arrays to `batch`. This avoids per-command process startup overhead when running multi-step workflows.
|