npm - @obtoai/agent-bridge - Versions diffs - 0.1.0-beta.4 → 0.1.0-beta.5 - Mend

@obtoai/agent-bridge 0.1.0-beta.4 → 0.1.0-beta.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (2) hide show

package/README.md +40 -29
package/package.json +1 -1

package/README.md CHANGED Viewed

@@ -1,8 +1,8 @@
 # @obtoai/agent-bridge
-A local daemon that lets a coding agent — [Claude Code](https://claude.ai/code) or [OpenAI Codex](https://developers.openai.com/codex) — running on your machine be driven from the [OBTO Agent Bridge](https://obto.co) web UI, even when you're away from the keyboard.
+A local daemon that lets coding agents — [Claude Code](https://claude.ai/code), [OpenAI Codex](https://developers.openai.com/codex), or [opencode](https://opencode.ai) — running on your machine be driven from the [OBTO Agent Bridge](https://obto.co) web UI, even when you're away from the keyboard.
-You post a message on a thread from your phone or laptop. The daemon (running on your machine, no port forwarding required) receives it over a long-lived HTTPS stream, spawns or resumes an agent session in your project directory, and the response posts back to the bridge thread within seconds.
+You post a message on a thread from your phone or laptop. The daemon (running on your machine, no port forwarding required) receives it over a long-lived HTTPS stream, spawns or resumes a session for the agent that thread is bound to, and the response posts back to the bridge thread within seconds.
 ## Status
@@ -11,10 +11,11 @@ You post a message on a thread from your phone or laptop. The daemon (running on
 ## What you'll need
 - macOS, Linux, or Windows, **Node.js 18.17+**
-- One coding agent installed, with your own auth:
-  - **Claude** — Claude Code / the Claude Agent SDK, billed to your Anthropic account; or
-  - **Codex** — the `codex` CLI (`npm i -g @openai/codex`), signed in to your OpenAI/ChatGPT account
-- An invite from `support@obto.co` (gives you an `accountId`, browser username/password, and an API token)
+- At least one coding agent installed on the machine (the daemon drives whichever ones it finds, with your own auth):
+  - **Claude** — Claude Code / the Claude Agent SDK, billed to your Anthropic account.
+  - **Codex** — the `codex` CLI (`npm i -g @openai/codex`), signed in to your OpenAI/ChatGPT account.
+  - **opencode** — `npm i -g opencode-ai` (the `opencode` CLI; the daemon bundles the `@opencode-ai/sdk`). Auth is your own provider key (Anthropic by default; override with env vars below).
+- An invite from `support@obto.co` (gives you an `accountId`, browser username/password, and an API token).
 ## Install
@@ -34,18 +35,25 @@ npx @obtoai/agent-bridge <command>
 obto-bridge init
 ```
-Walks you through a few questions: your account ID, API token, an agent name (to distinguish multiple machines on one account), which coding agent to drive (`claude` or `codex`), the project directory to work in, and whether to relay tool-permission requests via the bridge. (The server URL is a built-in default; advanced / self-hosted users can override it with the `BRIDGE_BASE_URL` env var.)
+Walks you through a few questions: your account ID, API token, an agent name (to distinguish multiple machines on one account), a *fallback* agent (`claude`, `codex`, or `opencode` — used only for legacy events without an explicit agent), the project directory to work in, and whether to relay tool-permission requests via the bridge. (The server URL is a built-in default; advanced / self-hosted users can override it with the `BRIDGE_BASE_URL` env var.)
 Config lands at `~/.obto-bridge/config.json` (mode 0600). Safe to commit your account ID; **never commit the `apiToken`**.
-### claude vs codex
+### Agents (claude / codex / opencode)
-Both drive real coding work on your machine; they differ in how they report back:
+v1.1 makes the daemon **agent-agnostic per event**: at startup it detects which of `claude`, `codex`, and `opencode` are installed on the machine, advertises that to the bridge, and routes each incoming reply to the right driver based on what the thread is bound to in the UI. You can switch a thread's agent live from the thread header; each engine keeps its own session for that thread, so flipping claude→codex→claude resumes each side's context.
-- **claude** — the fuller integration. Posts status updates, questions, and results as it works (via an in-process MCP tool), and supports the human-in-the-loop tool-permission relay.
-- **codex** — runs the task and delivers one final answer per turn. No mid-task updates and no per-tool relay (the Codex SDK exposes neither); it runs unattended inside a sandbox (`workspace-write` by default, override with `BRIDGE_CODEX_SANDBOX`).
+How the three differ in how they report back:
-One daemon drives one agent. To run both, use two daemons on two accounts.
+- **claude** — the fullest integration. Posts status updates, mid-task questions, and final results as it works (via an in-process MCP tool), and supports the human-in-the-loop tool-permission relay.
+- **codex** — runs the turn and delivers one final answer per turn. No mid-task updates and no per-tool relay (the Codex SDK exposes neither). Runs unattended inside a sandbox (`workspace-write` by default, override with `BRIDGE_CODEX_SANDBOX`).
+- **opencode** — same capture-model shape as codex: one final answer per turn, no mid-task chatter. Defaults to provider `anthropic` and model `claude-sonnet-4-5`; override with `BRIDGE_OPENCODE_PROVIDER` and `BRIDGE_OPENCODE_MODEL`.
+Picking a model is done in the bridge UI's **+ New thread** dialog and the thread-header switcher — not in the daemon config.
+### Multi-daemon (running across more than one machine)
+You can run the same account's daemon on more than one machine (e.g. a Mac and a Windows box). Each daemon advertises its `agentId` (machine name) + capabilities on connect; threads are atomically **first-touch claimed** by whichever daemon gets the event first, and every other daemon skips the event cleanly. No duplicate replies, no special configuration — just install + start the daemon on each machine.
 ## Run
@@ -56,14 +64,16 @@ obto-bridge start
 You'll see two log lines and then the daemon waits silently:
 ```
-{"msg":"starting daemon","data":{"accountId":"acc_...","agentId":"my-mac",...}}
+{"msg":"starting daemon","data":{"accountId":"acc_...","agentId":"my-mac","capabilities":["claude","codex"],...}}
 {"msg":"sse stream connected","data":{"status":200}}
 ```
+`capabilities` is the list of agents this daemon will accept — the bridge UI offers exactly the union across your connected machines.
 Now open the bridge UI in any browser, log in with the browser credentials from your invite, and either:
-- Reply on an existing thread — daemon resumes the session bound to that thread
-- Start a new thread via the **+ New thread** button — daemon spawns a fresh session in your project directory
+- Reply on an existing thread — daemon resumes the session bound to that thread (and to whichever agent the thread currently uses).
+- Start a new thread via the **+ New thread** button — pick Claude, Codex, or Opencode; the daemon spawns a fresh session in your project directory.
 Within ~5–10 seconds you should see the agent's reply appear back on the thread.
@@ -72,40 +82,41 @@ Within ~5–10 seconds you should see the agent's reply appear back on the threa
 | Command | What it does |
 |---|---|
 | `obto-bridge whoami` | Verify your token works + show your account info |
-| `obto-bridge status` | List active thread→session bindings |
+| `obto-bridge status` | List bindings per (thread, agent) — one row per engine that's ever driven a thread |
 | `obto-bridge logout` | Wipe `~/.obto-bridge/config.json` |
 ## How it actually works
 ```
-Your phone        OBTO server                      Your machine
-─────────         ───────────                      ────────────
+Your phone        OBTO server                      Your machine(s)
+─────────         ───────────                      ───────────────
 [reply form] ──►  /api/reply ─► Mongo (durable)
-                  └─►  RabbitMQ (publish bridge.<acct>.reply.<thread>)
+                  └─►  RabbitMQ (publish bridge.<acct>.reply.<thread>,
+                                 payload carries agent + agentId)
                                                 ◄── /api/bridge/stream  (SSE, Bearer auth)
-                                                    └─► daemon process
-                                                        └─► Claude Agent SDK
-                                                            └─► session JSONL in
-                                                                ~/.claude/projects/...
-                  /api/message ◄────  bridge_post (in-process MCP tool from daemon)
+                                                    └─► daemon (dispatches per payload.agent)
+                                                        ├─► Claude Agent SDK   → ~/.claude/projects/...
+                                                        ├─► @openai/codex-sdk  → ~/.codex/sessions/...
+                                                        └─► @opencode-ai/sdk   → opencode server
+                  /api/message ◄────  bridge_post (in-process MCP tool, Claude only)
 [poll: /api/messages] ◄──── (4s loop)
 ```
 Key bits:
 - The daemon **never** holds RabbitMQ credentials; broker access stays server-side. Per-account routing key isolation enforced by `BridgeAuth`.
-- The daemon's spawned Claude session uses an **in-process MCP server** (`mcp__bridge__bridge_post`) — not the platform's hosted MCP, so the daemon's tools don't depend on a long-lived OBTO MCP proxy session.
-- Each bridge **thread** binds to its own agent **session ID** at first message. Subsequent messages on the same thread resume the same session, so the agent keeps full context. Your interactive sessions are unaffected — they live in separate session stores.
+- For the **claude** driver, the spawned Claude session uses an **in-process MCP server** (`mcp__bridge__bridge_post`) — not the platform's hosted MCP, so the daemon's tools don't depend on a long-lived OBTO MCP proxy session. For **codex** and **opencode**, the SDKs can't auto-approve a write tool when run unattended, so the daemon captures the final response and posts it to the thread on the agent's behalf.
+- Each bridge **thread** binds to its own session ID **per agent**. Subsequent messages on the same thread + same agent resume the same engine-specific session, so the agent keeps full context. Switching the thread's agent in the UI starts (or resumes) the other engine's session — each side's state stays intact. Your interactive sessions are unaffected — they live in separate session stores.
 - Per-thread serialization means rapid bursts on the same thread are handled in order, never racing the same session.
-- With **codex**, there is no in-process MCP tool — the Codex SDK can't auto-approve a write tool when run unattended, so the daemon captures Codex's final response and posts it to the thread on the agent's behalf.
+- Multi-daemon races are killed by atomic first-touch claim against the thread record on the bridge.
 ## Agent costs
-The daemon runs your chosen agent on your machine with **your** credentials — Anthropic for `claude` (whatever Claude Code uses: `ANTHROPIC_API_KEY` or your Claude.ai session), or your OpenAI/ChatGPT account for `codex`. Every bridge-driven turn is a normal API call billed to you. We don't proxy.
+The daemon runs your chosen agent on your machine with **your** credentials — Anthropic for `claude` (whatever Claude Code uses: `ANTHROPIC_API_KEY` or your Claude.ai session); your OpenAI/ChatGPT account for `codex`; whichever provider you've configured `opencode` to call (Anthropic by default for this daemon). Every bridge-driven turn is a normal API call billed to you. We don't proxy.
 ## Data handling
-**Your model traffic never touches us.** The daemon runs on your machine and calls Anthropic or OpenAI with *your own* credentials. Your prompts, your code, and the model's responses pass directly between your machine and the model provider, under your own API account and its terms. OBTO does not proxy, route, or see that traffic.
+**Your model traffic never touches us.** The daemon runs on your machine and calls Anthropic, OpenAI, or whichever provider opencode is configured for, with *your own* credentials. Your prompts, your code, and the model's responses pass directly between your machine and the model provider, under your own API account and its terms. OBTO does not proxy, route, or see that traffic.
 **What the bridge stores.** For threads to work, the messages you and the agent post are saved in OBTO's database — that's what makes a thread durable and readable from your phone. Threads are strictly scoped to your account; one tenant can never see another's. Your daemon's API token is stored server-side only as a SHA-256 hash; the plaintext token never leaves your local config file.

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@obtoai/agent-bridge",
-  "version": "0.1.0-beta.4",
+  "version": "0.1.0-beta.5",
   "description": "Local consumer for the OBTO Agent Bridge. Receives bridge events over SSE and drives a coding agent (Claude Code or OpenAI Codex) on your machine.",
   "license": "Apache-2.0",
   "author": "OBTO Inc.",