npm - @stage-labs/metro - Versions diffs - 0.1.0-beta.13 → 0.1.0-beta.15 - Mend

@stage-labs/metro 0.1.0-beta.13 → 0.1.0-beta.15

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (45) hide show

package/README.md +76 -189
package/dist/broker/claims.js +144 -0
package/dist/{broker.js → broker/history-stream.js} +46 -99
package/dist/cli/config.js +115 -121
package/dist/cli/index.js +51 -64
package/dist/cli/messenger-api.js +214 -0
package/dist/cli/messenger-transcribe.js +43 -0
package/dist/cli/messenger-uploads.js +116 -0
package/dist/cli/monitor-api.js +205 -0
package/dist/cli/tail.js +49 -118
package/dist/cli/webhook.js +103 -3
package/dist/{codex-rc.js → codex-rc/client.js} +12 -32
package/dist/codex-rc/protocol.js +38 -0
package/dist/dispatcher/server.js +122 -0
package/dist/dispatcher.js +52 -83
package/dist/history.js +49 -27
package/dist/ipc.js +28 -10
package/dist/lines.js +54 -0
package/dist/local-identity.js +80 -0
package/dist/paths.js +58 -12
package/dist/trains/protocol.js +99 -0
package/dist/trains/supervisor.js +210 -0
package/dist/tunnel.js +39 -1
package/docs/broker.md +88 -136
package/docs/monitor.md +88 -10
package/docs/uri-scheme.md +10 -7
package/examples/README.md +32 -0
package/examples/telegram.ts +121 -0
package/package.json +6 -5
package/skills/metro/SKILL.md +67 -213
package/dist/cache.js +0 -69
package/dist/cli/actions.js +0 -206
package/dist/cli/skill.js +0 -62
package/dist/monitor.js +0 -194
package/dist/registry.js +0 -48
package/dist/stations/claude.js +0 -45
package/dist/stations/codex.js +0 -68
package/dist/stations/discord.js +0 -216
package/dist/stations/index.js +0 -129
package/dist/stations/telegram-md.js +0 -34
package/dist/stations/telegram-upload.js +0 -113
package/dist/stations/telegram.js +0 -234
package/dist/stations/webhook.js +0 -103
package/dist/webhooks.js +0 -41
package/docs/users.md +0 -226

package/docs/broker.md CHANGED Viewed

@@ -1,43 +1,36 @@
 # Metro broker
-Multi-user event routing. Turns metro from "one daemon → one stdout consumer" into "one daemon → N independently-subscribed users (Claude Code, Codex, anything) with durable, replayable delivery".
-## Why
-Today the dispatcher writes every inbound event to **its own stdout**, which only the parent process (one Claude Code, monitoring the daemon via `Monitor`) can read. Consequences:
-- **Throughput bottleneck**: bursts of inbound messages serialize behind whatever the single user is currently doing.
-- **No real sub-users**: `Agent`-tool sub-users can call `metro send` (IPC works from anywhere), but they cannot *receive* events — they have no stdout subscription.
-- **No multi-instance**: a second `claude` window or a separate `codex` process can't join in; the stream has one reader.
-- **No durability**: a user crashes mid-conversation → events emitted during the gap are lost; on restart it starts deaf.
-The fix is to treat metro as a tiny **durable message broker** over the event log that already exists ([history.ts](../src/history.ts), [user-registry.json](../src/registry.ts)).
+Multi-user event routing on top of the event log. Turns "one daemon → one stdout consumer"
+into "one daemon → N independently-subscribed users (Claude Code, Codex, anything) with
+durable, replayable delivery".
 ## Core idea
 One concept — a **claim** — and three on-disk files you can `cat`:
-| Concern              | File                                    | Role                                                                                                                              |
-|----------------------|-----------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------|
-| Event log            | `$METRO_STATE_DIR/history.jsonl`        | Append-only JSONL — every inbound/outbound/edit/react. Already exists. The single source of truth.                                |
-| Claims               | `$METRO_STATE_DIR/claims.json`          | `{ <line>: <user-id> }` — flat map. A line in here is *exclusively* owned by that user. Absence = broadcast. New.                 |
-| Per-mode cursor      | `$METRO_STATE_DIR/cursors/<key>`        | Byte offset into `history.jsonl` — last-emitted position for one tail mode. New. Updated atomically after each emit.              |
+| Concern              | File                                    | Role                                                                                                              |
+|----------------------|-----------------------------------------|-------------------------------------------------------------------------------------------------------------------|
+| Event log            | `$METRO_STATE_DIR/history.jsonl`        | Append-only JSONL — every event (chat messages, webhooks, reactions, transcripts, …). Single source of truth.     |
+| Claims               | `$METRO_STATE_DIR/claims.json`          | `{ <line>: <user-id> }` — flat map. A line in here is *exclusively* owned by that user. Absence = broadcast.      |
+| Per-mode cursor      | `$METRO_STATE_DIR/cursors/<key>`        | Byte offset into `history.jsonl` — last-emitted position for one tail mode. Updated atomically after each emit.   |
-Cursor keys are derived from the *effective mode* (not from `userSelf()`), so `--all` and `--unclaimed` don't collide with a personal `--as=<id>` tail:
+Cursor keys are derived from the *effective mode* (not from `userSelf()`), so `--all` and
+`--unclaimed` don't collide with a personal `--as=<id>` tail:
-| Tail invocation                  | Cursor key                         |
-|----------------------------------|------------------------------------|
-| `metro tail --as=<id>`           | `<userSlug(id)>`                   |
-| `metro tail --as=<id> --strict`  | `<userSlug(id)>--strict`           |
-| `metro tail --as=<id> --include-webhooks` | `<userSlug(id)>--with-webhooks` (or `…--strict--with-webhooks`) |
-| `metro tail --unclaimed`         | `_unclaimed`                       |
-| `metro tail --all`               | `_all`                             |
+| Tail invocation                            | Cursor key                                                      |
+|--------------------------------------------|-----------------------------------------------------------------|
+| `metro tail --as=<id>`                     | `<userSlug(id)>`                                                |
+| `metro tail --as=<id> --strict`            | `<userSlug(id)>--strict`                                        |
+| `metro tail --as=<id> --include-webhooks`  | `<userSlug(id)>--with-webhooks` (or `…--strict--with-webhooks`) |
+| `metro tail --unclaimed`                   | `_unclaimed`                                                    |
+| `metro tail --all`                         | `_all`                                                          |
-The `_` prefix on the mode-keys can't collide with a real `userSelf()` slug (which always contains a station name like `claude-user-…`). Switching modes mid-stream keeps each cursor independent — a `tail --all` from a `CLAUDECODE=1` shell does **not** advance the personal `--as=<me>` cursor.
+The `_` prefix on the mode-keys can't collide with a real `userSelf()` slug. `--chat=<line>`
+and `--station=<name>` are post-filters applied **after** cursor advancement, so they don't
+need their own cursor keys.
-`--chat=<line>` and `--station=<name>` are post-filters applied **after** cursor advancement, so they don't need their own cursor keys.
-Subscribers do not register with the daemon. They tail the log; the broker semantics emerge from one filtering rule applied at read time:
+Subscribers do not register with the daemon. They tail the log; the broker semantics emerge
+from one filtering rule applied at read time:
 > An event is delivered to a user when its `line` is **claimed by that user** *or* **claimed by no one**.
@@ -48,8 +41,6 @@ That single rule covers every case the design needs to handle:
 - **Operator observability** — `metro tail` with no `--as` (or `--all`) shows everything regardless of claims; doesn't take ownership.
 - **Sub-user onboarding** — sub-user claims its assigned chat before reading; parent stops receiving that chat without any coordination.
-There is no separate concept for "subscription" or "fan-out mode" — claims and their absence cover both. The dispatcher writes; tails filter; claims gate exclusivity. Three primitives, one rule.
 ```
                                 ┌──────────────────────────┐
    inbound (Discord/TG/web) ──► │  dispatcher              │ ──► history.jsonl  ◄── metro tail --as claude-A
@@ -69,146 +60,107 @@ metro tail [--as <user-id>] [--follow] [--strict | --unclaimed | --all] [--inclu
 metro claim   <line> [--as <user-id>]      # add/overwrite — last writer wins
 metro release <line>                       # remove (line returns to broadcast)
 metro claims                               # print current claims.json
-# Outbound actions auto-claim the line on first contact when topology is 1:1 (DM, claude/codex line).
-# Group / public / webhook lines are skipped by default — pass --claim to force.
-metro send  <line> <text>          [--no-claim] [--claim]
-metro reply <line> <msg-id> <text> [--no-claim] [--claim]
-metro edit  <line> <msg-id> <text> [--no-claim] [--claim]
-metro react <line> <msg-id> <emoji> [--no-claim] [--claim]
-# Or disable globally: METRO_NO_AUTO_CLAIM=1
-# Lease/ack — optional, v2. When enabled, an event is "in flight" with the
-# claimant; if no ack in N seconds the cursor isn't advanced and the next
-# `metro tail` re-emits.
-metro ack <event-id> --as <user-id>
 ```
-`--as <user-id>` defaults to `userSelf()` ([history.ts:121](../src/history.ts#L121)) — the same stable identity already used in routing-aware code.
+`--as <user-id>` defaults to `userSelf()` ([history.ts](../src/history.ts)) — the same stable
+identity already used in routing-aware code.
 ### Subscription modes
-The same `metro tail` command serves four distinct callers — a working user, a strict worker, a router, and a human observer. Each maps to one mutually-exclusive flag controlling the claim-aware filter:
+The same `metro tail` command serves four distinct callers — a working user, a strict
+subscriber, a router, and a human observer. Each maps to one mutually-exclusive flag:
 | Mode               | Flag                       | Predicate                                                                       | Who uses it                                                |
 |--------------------|----------------------------|---------------------------------------------------------------------------------|------------------------------------------------------------|
-| **Mine + free**     | `--as <id>` (default)      | `(claims[line] == <id> ∨ line ∉ claims) ∧ station ≠ 'webhook'`                  | Default working user. Zero-config single-user setup.       |
-| **Mine only**       | `--as <id> --strict`        | `claims[line] == <id> ∧ station ≠ 'webhook'`                                    | Disciplined worker that won't race on unclaimed events.    |
-| **Unclaimed only**  | `--unclaimed`              | `line ∉ claims`                                                                 | Router/first-responder user that finds work to claim.      |
-| **All**             | (no `--as`) or `--all`     | `true`                                                                          | Operator/auditor/debugger; never takes ownership.          |
+| **Mine + free**    | `--as <id>` (default)      | `(claims[line] == <id> ∨ line ∉ claims) ∧ station ≠ 'webhook'`                  | Default working user. Zero-config single-user setup.       |
+| **Mine only**      | `--as <id> --strict`       | `claims[line] == <id> ∧ station ≠ 'webhook'`                                    | Disciplined subscriber that won't race on unclaimed events.|
+| **Unclaimed only** | `--unclaimed`              | `line ∉ claims`                                                                 | Router/first-responder pattern that finds work to claim.   |
+| **All**            | (no `--as`) or `--all`     | `true`                                                                          | Operator/auditor/debugger; never takes ownership.          |
-Webhooks (`station == 'webhook'`) are excluded from the personal modes by default — they're broadcast traffic (GitHub pushes, Intercom pings, etc.) that should flow to the *router* (`--unclaimed`) or *operator* (`--all`) feed, not firehose into every `--as <id>` tail. Opt back in with `metro tail --as <id> --include-webhooks` when you genuinely want a worker to see them.
+Webhooks (`station == 'webhook'`) are excluded from the personal modes by default — they're
+broadcast traffic (GitHub pushes, Intercom pings, etc.) that should flow to the *router*
+(`--unclaimed`) or *operator* (`--all`) feed, not firehose into every `--as <id>` tail. Opt
+back in with `metro tail --as <id> --include-webhooks`.
-Two UX defaults worth being explicit about:
-1. **`--as <id>` with no mode flag = "mine + free".** Single-user setups (the common case) get zero-config metro: nothing claimed yet, so the only tail sees everything. Adding a second user means claiming first — surfaced in docs, not enforced by the daemon. `--strict` is the opt-in for setups that want stricter separation.
-2. **No `--as` = "all".** Matches the unix `tail -f` mental model. Operators just want to read the log without registering an identity or accidentally taking ownership of anything.
-`--unclaimed` is the genuinely new primitive: it enables a "router" user pattern where one process watches for ownerless events and either responds directly or claims and delegates. It works with or without `--as` — with `--as`, outbound replies are still attributed correctly.
-Direct messages between users (`event.to == user-line`) always pass the filter regardless of mode — they're inherently 1:1 and can't be "claimed" by someone else.
+Direct messages between users (`event.to == user-line`) always pass the filter regardless of
+mode — they're inherently 1:1 and can't be "claimed" by someone else.
 ### Auto-claim on outbound
-`metro send`, `reply`, `edit`, and `react` claim the target `<line>` for the actor (`userSelf()`) the first time they touch it, atomically — same lockfile as `metro claim`. The intent: when a user picks up a conversation by replying, subsequent inbound events on that line route to them without any explicit `metro claim` call.
-Auto-claim only fires when **the line topology is 1:1** (DM, or a Claude/Codex cross-user line). Shared lines — group chats, public channels, webhook streams — would lock out other workers, so they're skipped by default:
-| Line                                            | Classification | Auto-claim default? | How                                                          |
-|-------------------------------------------------|----------------|---------------------|--------------------------------------------------------------|
-| `metro://telegram/<positive-id>` (incl. topics) | DM             | Yes                 | Telegram chat-id > 0 ⇒ private chat                          |
-| `metro://telegram/<negative-id>` / `-100…`      | group          | **No**              | Telegram chat-id < 0 ⇒ group/supergroup                      |
-| `metro://discord/<channel-id>` (no guild)       | DM             | Yes                 | Recent inbound payload `guildId == null`                     |
-| `metro://discord/<channel-id>` (in guild)       | group          | **No**              | Recent inbound payload `guildId != null`                     |
-| `metro://discord/<channel-id>` (no inbound)     | unknown        | Yes (conservative)  | No metadata cached — treat as DM-eligible until proven group |
-| `metro://claude/...` / `metro://codex/...`      | 1:1            | Yes                 | Cross-user notify is inherently 1:1 by construction          |
-| `metro://webhook/<id>`                          | broadcast      | **Never**           | Webhook lines are conceptually a stream, not a conversation  |
-- If the line is already claimed by **someone else** (and topology check passed), the action still proceeds (sending doesn't require ownership) but the claim is **not overwritten**. A single-line stderr note (`auto-claim skipped: line owned by <other-id>`) signals the no-op.
-- On a group-line skip you'll see `auto-claim skipped: <line> is a group/public line; pass --claim to take it explicitly` on stderr.
-- Opt-out per command with `--no-claim`, or globally with the env var `METRO_NO_AUTO_CLAIM=1`.
-- Opt-IN for groups: `--claim` forces auto-claim even on a group/public line (operator explicitly takes responsibility).
-- Cross-user sends (`metro send metro://claude/... ...` from a different user) auto-claim the target line too — the sender is taking ownership of the conversation.
-This default plus the webhook-exclusion above means: a webhook or a busy group channel flowing through the daemon won't auto-claim under any worker, so the router pattern (`--unclaimed`) can still see them.
+Outbound paths call `tryAutoClaim` ([broker/claims.ts](../src/broker/claims.ts)) to claim the target `<line>`
+for the actor (`userSelf()`) the first time it's touched, atomically — same lockfile as
+`metro claim`. Auto-claim only fires when **the line topology is 1:1** (DM, or a
+Claude/Codex cross-user line). Shared lines are skipped:
+| Line                                            | Classification | Auto-claim? | How                                                          |
+|-------------------------------------------------|----------------|-------------|--------------------------------------------------------------|
+| `metro://telegram/<positive-id>` (incl. topics) | DM             | Yes         | Telegram chat-id > 0 ⇒ private chat                          |
+| `metro://telegram/<negative-id>` / `-100…`      | group          | **No**      | Telegram chat-id < 0 ⇒ group/supergroup                      |
+| `metro://discord/<channel-id>` (no guild)       | DM             | Yes         | Recent inbound payload `guildId == null`                     |
+| `metro://discord/<channel-id>` (in guild)       | group          | **No**      | Recent inbound payload `guildId != null`                     |
+| `metro://discord/<channel-id>` (no inbound)     | unknown        | Yes         | No metadata cached — treat as DM-eligible until proven group |
+| `metro://claude/...` / `metro://codex/...`      | 1:1            | Yes         | Cross-user notify is inherently 1:1 by construction          |
+| `metro://webhook/<id>`                          | broadcast      | **Never**   | Webhook lines are a stream, not a conversation               |
+- If the line is already claimed by someone else (and topology check passed), the action
+  still proceeds but the claim is **not overwritten**.
+- Auto-claim writes happen after the action succeeds, so a failed call never writes to `claims.json`.
 ### `metro tail` mechanics
-- Reads `history.jsonl`, applies the mode predicate + any `--chat`/`--station` filters (AND), prints one JSONL line per event to stdout.
-- With `--follow`: stays open, watches the file via `fs.watch`, emits new matching lines as they're appended.
-- Maintains a per-user cursor (byte offset) at `cursors/<user-id>`. On startup, resumes from cursor; on each emitted line, the offset is advanced *after* the write succeeds. Byte offsets give O(1) resume — no file scan.
-- `--since <offset>` overrides the cursor; `--since=tail` starts from EOF, ignoring backlog. Useful for fresh-start without losing the persisted cursor.
-- Claim lookups read `claims.json` once per emitted event. The file is small (a few KB) and OS-cached; cost is sub-microsecond per event.
+- Reads `history.jsonl`, applies the mode predicate + any `--chat`/`--station` filters (AND),
+  prints one JSONL line per event to stdout.
+- With `--follow`: stays open, watches the file via `fs.watch`, emits new matching lines.
+- Maintains a per-mode cursor (byte offset) at `cursors/<key>`. On startup, resumes from cursor;
+  on each emitted line, the offset is advanced *after* the write succeeds. O(1) resume.
+- `--since <offset>` overrides the cursor; `--since=tail` starts from EOF, ignoring backlog.
+- Claim lookups read `claims.json` once per emitted event. Small (~KB), OS-cached; sub-microsecond cost.
 ### `metro claim` semantics
-- Pure metadata edit on `claims.json`. Does **not** notify the daemon — claims are read by tails, not the dispatcher (see "Dispatcher changes" below).
-- Re-claiming a line re-assigns it (last writer wins). `metro claims` prints the current map so a human can audit.
+- Pure metadata edit on `claims.json`. Does **not** notify the daemon — claims are read by
+  tails, not the dispatcher.
+- Re-claiming a line re-assigns it (last writer wins). `metro claims` prints the current map.
 - Releasing a line returns it to broadcast — every matching tail picks it up again.
-- Writes to `claims.json` are wrapped in an `O_EXCL` lockfile to serialize concurrent `metro claim` invocations on the same host.
+- Writes to `claims.json` are wrapped in an `O_EXCL` lockfile to serialize concurrent writes
+  on the same host.
-## Dispatcher changes
+## Dispatcher
-Almost none. `emit()` still appends to history, pushes to codex-rc, and writes to stdout. The broker model lives entirely on the read side — claims and cursors are consulted by `metro tail`, not by the dispatcher. The dispatcher doesn't need to know who's listening or who's claimed what.
-This is the design's key simplification: **the daemon stays dumb**. It's still a single-writer to a JSONL file. All the routing intelligence is in `metro tail`'s filter, which reads two small files (`claims.json` and its own cursor) on each event.
-No new sockets. No fan-out bookkeeping. No coupling between subscriber count and daemon state.
-## What this enables
-- **Sub-users that actually receive events**: `Agent` spawns a sub-user whose first action is `metro tail --as <its-id> --chat <line> --follow &` — it then `Monitor`s that background process and gets *only* its assigned chat's events.
-- **Two manual Claude Code windows**: each runs `metro tail --as claude-A` / `claude-B`, claims disjoint chats. No coordination beyond `metro claim`.
-- **Codex alongside Claude**: same model — `metro tail --as codex-1 --station telegram` etc. The codex-rc push becomes optional: a Codex worker can subscribe via `metro tail` directly and bypass the rc file.
-- **Crash recovery**: process dies → restarts → `metro tail` resumes from cursor → backlog replays in order. No double-replies (the cursor is advanced on emit, not on reply).
-- **Replay for new joiners**: `metro tail --as new-user --since <offset-from-5-min-ago>` lets a freshly-spawned process backfill recent history before going live.
+The dispatcher stays dumb. `emit()` appends to history, pushes to codex-rc, and writes to
+stdout. All routing intelligence lives in `metro tail`'s filter, which reads two small files
+(`claims.json` + its own cursor) per event. No new sockets, no fan-out bookkeeping, no
+coupling between subscriber count and daemon state.
 ## Concurrency
-Multiple processes already write `history.jsonl` today: the daemon's `emit()` and every short-lived CLI invocation (`metro send`/`reply`/`react` — see [actions.ts](../src/cli/actions.ts)). It works because `appendFileSync` opens with `O_APPEND`, and POSIX guarantees that `O_APPEND` writes atomically seek-to-end-and-write in one operation — concurrent writers produce whole lines in some order, never interleaved halves. Node issues one `write(2)` per `appendFileSync` call, and our entries (even fat webhook payloads) stay well under per-syscall atomicity limits on both Linux (~2GB) and macOS (`INT_MAX`). The broker model adds **only readers**, so the existing safety property is preserved.
+Multiple processes write `history.jsonl`: the daemon's `emit()` and short-lived auto-claim
+writers. `appendFileSync` opens with `O_APPEND`; POSIX guarantees atomic seek-to-end-and-write
+per `write(2)`. Concurrent writers produce whole lines in some order, never interleaved halves.
+Node issues one `write(2)` per `appendFileSync` call, and history entries stay well under
+per-syscall atomicity limits (~2GB on Linux, `INT_MAX` on macOS).
-`claims.json` is read on every event by every tail, but writes are infrequent (`metro claim`/`release`). An `O_EXCL` lockfile around writes is enough; tails do an unlocked read with a malformed-JSON retry (one read can race with one write; the retry resolves it).
+`claims.json` is read on every event by every tail; writes are infrequent. An `O_EXCL`
+lockfile around writes is enough; tails do an unlocked read with a malformed-JSON retry.
 ## Isolation
-`METRO_STATE_DIR` isolates state-dir-scoped artifacts (`history.jsonl`, `claims.json`, `cursors/`, `lines.json`, `bot-ids.json`, the daemon socket, the webhook port). It does **not** isolate platform credentials: `metro send`, `reply`, `edit`, and `react` always read bot tokens from `$XDG_CONFIG_HOME/metro/.env` (defaulting to `~/.config/metro/.env`) and post directly to Discord/Telegram regardless of where `METRO_STATE_DIR` points.
-This means a test invocation with `METRO_STATE_DIR=/tmp/metro-test metro send …` will hit the **production** Discord/Telegram bot with production tokens. To avoid leaking real messages from a test/sandbox:
-- Use lines whose channel/chat IDs you know don't exist (the platform will 4xx before any side-effect).
-- Or unset/move `~/.config/metro/.env` for the test process — `metro send` will fail fast with a missing-token error.
-- Or use `metro tail` + manual `history.jsonl` seeding to exercise the read path without any platform contact.
-The auto-claim write happens **after** platform-API success, so a failed `metro send` never writes to `claims.json`. (Tests can rely on this: a failing send leaves the test state dir unchanged apart from the `history.jsonl` line the daemon would emit, if one were running.)
+`METRO_STATE_DIR` isolates state-dir-scoped artifacts (`history.jsonl`, `claims.json`,
+`cursors/`, `lines.json`, `bot-ids.json`, the daemon socket, the webhook port). It does **not**
+isolate platform credentials — those are owned by the train and read from `~/.metro/.env`.
-## Failure modes & guardrails
+## Failure modes
 | Failure                             | Behavior                                                                                          |
 |-------------------------------------|---------------------------------------------------------------------------------------------------|
 | Process crashes mid-event           | Cursor not advanced → event redelivered on next `metro tail`. At-least-once.                      |
 | Two users claim same line           | `claims.json` last-write-wins. `metro claims` shows current owner; humans resolve.                |
-| No user claims a chat               | Event broadcasts to every tail whose filters match. Two tails without filters → both reply (operator error — claim should have been set first). |
-| User silently slow (no ack)         | v1: not detected. v2: `metro ack` + lease TTL — cursor doesn't advance, next `metro tail` re-emits, can surface "X went dark on chat Y" via an inbound event from another user. |
-| `history.jsonl` grows unboundedly   | Existing concern; out of scope for this doc. (Rotate by date, prune by age.)                      |
-## Migration
-All changes are additive. With no subscribers, the dispatcher behaves exactly as today (parent reads stdout, single-user throughput, no routing). The broker model layers on top:
-1. Ship `metro tail` (read-only, no daemon changes, no claim file). Users can subscribe and filter; multi-cast works for everything.
-2. Ship `metro claim`/`release`/`claims` + claim-aware filtering in `metro tail`. Exclusivity works.
-3. Optional v2: lease/ack — only if silent drops become a real problem.
-Each step is independently shippable.
-## Open questions
-- **Routing-key granularity**: claims map to `user-id` (orgId-level — same across sessions/devices) rather than `user-line` (`<user-id>/<session-id>`). This means two Claude Code windows logged into the same account share claims. The session-scoped alternative is more flexible but requires the claimant to write its current `selfLine()` into `claims.json` and refresh it when the session changes. **Default: user-id.** Override per-claim with `metro claim <line> --as <full-line>` if needed.
-- **codex-rc deprecation**: today the dispatcher mirrors every event into a codex-rc file so Codex sees them. Once `metro tail` exists, Codex workers could subscribe directly. The rc-push stays for compatibility; the next major version can drop it.
+| No user claims a chat               | Event broadcasts to every tail whose filters match.                                               |
+| `history.jsonl` grows unboundedly   | Out of scope here. (Rotate by date, prune by age.)                                                |
 ## Non-goals
-- **Strict ordering across chats**: events within one `line` are ordered by JSONL append order; cross-chat ordering is best-effort. Subscribers shouldn't rely on it.
-- **Exactly-once delivery**: at-least-once via cursor + redelivery. Idempotency is the subscriber's problem (the daemon already mints stable `msg_*` ids).
-- **Authn between users**: any process with filesystem access to `$METRO_STATE_DIR` can tail and claim. Same trust model as today.
-- **Remote users**: broker is local-only. Cross-host fan-out is a separate problem (likely solved by running metro on each host and bridging at the chat layer).
+- **Strict ordering across chats**: events within one `line` are ordered by JSONL append order; cross-chat ordering is best-effort.
+- **Exactly-once delivery**: at-least-once via cursor + redelivery. Idempotency is the subscriber's problem (the daemon mints stable `msg_*` ids).
+- **Authn between users**: any process with filesystem access to `$METRO_STATE_DIR` can tail and claim. Same trust model as the host.
+- **Remote users**: broker is local-only. Cross-host fan-out is solved by running metro on each host and bridging at the chat layer.

package/docs/monitor.md CHANGED Viewed

@@ -5,18 +5,27 @@ dashboard, a curl one-liner) to view live daemon state without touching the JSON
 directly.
 These endpoints mount on the **existing** webhook HTTP server (default port `8420`).
-There is no separate daemon, no separate port, no extra process to launch.
+There is no separate daemon, no separate port, no extra process to launch. The
+implementation lives in [`src/cli/monitor-api.ts`](../src/cli/monitor-api.ts) (re-exported
+by `src/cli/tail.ts` for backwards compatibility) and is wired into the HTTP server in
+[`src/dispatcher/server.ts`](../src/dispatcher/server.ts) via `handleMonitorRequest`.
 ## Routes
-| Method | Path         | Returns                                                                                  |
-|--------|--------------|------------------------------------------------------------------------------------------|
-| GET    | `/api/state` | JSON snapshot — `{ claims, lines, recent_history (last 100), bot_ids }`.                 |
-| GET    | `/api/tail`  | Server-Sent Events stream — `history.jsonl` entries, claim-aware filtered.                |
-Both routes are **read-only**. The daemon never mutates state on receipt. The handlers
-read the same files the broker reads (`history.jsonl`, `claims.json`, `bot-ids.json`)
-under whatever `METRO_STATE_DIR` resolves to.
+| Method | Path                            | Returns                                                                                  |
+|--------|---------------------------------|------------------------------------------------------------------------------------------|
+| GET    | `/api/state`                    | JSON snapshot — `{ claims, lines, recent_history (last 100), bot_ids }`.                 |
+| GET    | `/api/tail`                     | Server-Sent Events stream — `history.jsonl` entries, claim-aware filtered.                |
+| POST   | `/api/call/<train>/<action>`    | Forward an action call to a train via `forward-call` IPC; returns `{result}`.            |
+| POST   | `/api/messenger/send`           | In-daemon chat: emits a history entry on `metro://messenger/owner`. Accepts `{text?, attachments?[], as?}`. |
+| POST   | `/api/messenger/register`       | Store an Expo push token so agent replies push to the phone.                              |
+| POST   | `/api/messenger/upload`         | Raw binary upload (up to 25 MiB). Body = file bytes; headers `Content-Type`, optional `X-Filename`. Returns `{id, url, kind, mime, size, name?}`. |
+| GET    | `/api/messenger/files/<name>`   | Stream a previously uploaded attachment. Accepts `?token=…` as an alternative to the bearer header for `<img>` / `<audio>` tags. |
+`/api/state` and `/api/tail` are read-only. `/api/call/<train>/<action>` is the single
+write endpoint — it never touches the on-disk history; the train running on the daemon
+emits its own outbound event after delivering the message, which then flows through the
+normal SSE stream like any other entry.
 ## Authentication
@@ -47,7 +56,8 @@ Returns a one-shot JSON snapshot:
     "metro://telegram/-100…"
   ],
   "recent_history": [/* most-recent-first, up to 100 HistoryEntry objects */],
-  "bot_ids": { "discord": "1234567890", "telegram": "987654321" }
+  "bot_ids": { "discord": "1234567890", "telegram": "987654321" },
+  "version": "x.y.z"
 }
 ```
@@ -55,6 +65,19 @@ Returns a one-shot JSON snapshot:
 - `lines` — the set of conversation URIs seen across recent history and current claims (good-enough proxy for "what lines exist right now"). Subject to refinement; not authoritative.
 - `recent_history` — same shape as `HistoryEntry` in `src/history.ts`, ordered most-recent-first, capped at 100 entries.
 - `bot_ids` — verbatim contents of `bot-ids.json`.
+- `version` — the daemon's package version (handy for clients gating on capabilities).
+### Pagination
+For backlog scrolling, pass `?before=<N>&limit=<M>` (both non-negative integers, `limit` clamped to 500):
+```bash
+curl -H "Authorization: Bearer $METRO_MONITOR_TOKEN" \
+  "https://monitor.metro.box/api/state?before=200&limit=100" | jq .recent_history
+```
+When `before` is set, only `recent_history` is returned (the older slice). Without
+`before`, the full snapshot above is returned.
 ### Example
@@ -117,6 +140,61 @@ curl -N \
   "https://monitor.metro.box/api/tail?since=0"
 ```
+## `POST /api/call/<train>/<action>`
+Forwards an action call to a train (same as `metro call <train> <action> <args>` on the
+command line) via the daemon's existing `forward-call` IPC. Use this from the mobile or
+web app to send a message, react, edit, etc. without needing shell access.
+### Request
+```http
+POST /api/call/discord/send HTTP/1.1
+Host: monitor.metro.box
+Authorization: Bearer <METRO_MONITOR_TOKEN>
+Content-Type: application/json
+{"args": {"line": "metro://discord/123", "text": "hello from the web"}}
+```
+The body is one of:
+- `{"args": <object|array|string>}` — explicit `args` wrapper (recommended).
+- `<object>` — any other JSON object is forwarded as the args verbatim (useful for terse
+  clients).
+- Empty body — forwarded as `{}`.
+`<train>` must match a train running under `~/.metro/trains/`; `<action>` is whatever
+that train expects (`send`, `react`, `edit`, …).
+### Response
+| Status | Body                                                | Meaning                                                  |
+|--------|-----------------------------------------------------|----------------------------------------------------------|
+| 200    | `{"result": <whatever the train returned>}`         | Train accepted the call and returned a result.           |
+| 400    | `{"error": "bad JSON body: …"}`                      | Body was not valid JSON.                                 |
+| 401    | `{"error": "unauthorized"}`                          | Missing / wrong bearer token.                            |
+| 405    | `{"error": "method not allowed"}`                    | Wrong verb (only `POST` is accepted on this path).       |
+| 500    | `{"error": "…"}`                                     | Daemon IPC unavailable (e.g. socket missing).            |
+| 502    | `{"error": "…"}`                                     | Train returned an error or the IPC handshake malformed.  |
+Request bodies larger than 256 KiB are rejected with HTTP 500.
+### Example: send a message
+```bash
+curl -X POST \
+  -H "Authorization: Bearer $METRO_MONITOR_TOKEN" \
+  -H "Content-Type: application/json" \
+  -d '{"args":{"line":"metro://discord/123","text":"hi"}}' \
+  https://monitor.metro.box/api/call/discord/send
+```
+Because the daemon's `send` adapter writes a history entry once the message lands, an
+active `/api/tail` subscriber will receive the corresponding event a moment later
+(`from` = local user). UIs typically clear the input on HTTP 200 and let the SSE replay
+show the sent message.
 ## Exposing publicly via Cloudflare tunnel
 The daemon listens on `127.0.0.1:8420` only. To reach `/api/*` from a phone or a

package/docs/uri-scheme.md CHANGED Viewed

@@ -58,7 +58,7 @@ Override either segment with `METRO_USER_ID` / `METRO_USER_SESSION_ID` env vars.
 ### User registry
-The daemon persists every `(station, user-id, session)` tuple it sees to `$METRO_STATE_DIR/user-registry.json`. `metro stations` prints the count of seen users and sessions per station. Run it to discover what's reachable rather than guessing topic names.
+The daemon persists every `(station, user-id, session)` tuple it sees to `$METRO_STATE_DIR/user-registry.json`. `metro lines` lists the recently-seen conversation URIs.
 ## Webhook station
@@ -87,9 +87,9 @@ After setup, `metro webhook list` prints `https://webhook.yourdomain.com/wh/<id>
 Messages on chat lines are referenced by **line + message id** (two args), not as part of the URI. So:
 ```bash
-metro reply  metro://discord/123…  4567  "ack"
-metro edit   metro://discord/123…  9876  "fixed typo"
-metro react  metro://telegram/-100…/42  4567  👍
+metro call discord send '{"line":"metro://discord/123","text":"ack","replyTo":"4567"}'
+metro call discord edit '{"line":"metro://discord/123","messageId":"9876","text":"fixed typo"}'
+metro call telegram react '{"line":"metro://telegram/-100/42","messageId":"4567","emoji":"👍"}'
 ```
 ## Properties
@@ -102,7 +102,7 @@ metro react  metro://telegram/-100…/42  4567  👍
 ## API
 ```ts
-import { Line } from './stations/index.js';            // value namespace + type
+import { Line } from './lines.js';            // value namespace + type
 const l: Line = Line.discord('1234567890');     // typed Line
 Line.parse(l);                                   // { station: 'discord', path: ['1234567890'] } | null
@@ -119,6 +119,9 @@ Line.isLocal(l);                                 // true for any metro://{claude
 ## Adding a new station
+A "station" in this URI scheme is just a namespace — anything a train emits with
+`metro://<name>/<path>` works. There is no required registration with core:
 1. Pick a lowercase station name (`slack`, `matrix`, …).
-2. Add a `Line.<station>(...)` formatter and a parser that returns your typed payload.
-3. Document the path grammar in the table above.
+2. In your train, emit envelopes with `line: "metro://<name>/<id>"`.
+3. Optionally add a `Line.<station>(...)` formatter to `src/lines.ts` for type-safe construction.

package/examples/README.md ADDED Viewed

@@ -0,0 +1,32 @@
+# Example train
+`telegram.ts` is a starting point, not runtime code. Copy to
+`~/.metro/trains/<name>.ts`, edit, save, restart the daemon:
+```
+cp telegram.ts ~/.metro/trains/telegram.ts
+echo 'TELEGRAM_BOT_TOKEN=…' >> ~/.metro/.env
+metro
+```
+For a Discord port: swap the API base + auth header (`Bot $TOKEN`), install
+`discord.js` for the gateway, and emit the same envelope shape — the
+stdin/stdout protocol below is platform-independent. Action names and payload
+shapes are entirely up to you.
+## Protocol (JSON lines over stdio)
+```
+metro  ─── stdin (one JSON line per action call) ──>  train
+       <── stdout (one JSON line per event OR response) ── train
+```
+- **Inbound event** (train → metro): `{ kind, station, line, from, from_name?, message_id?, line_name?, reply_to?, text?, is_private?, payload? }` — snake_case on the wire. Metro mints `id` + `display` if absent and translates to camelCase for `history.jsonl` / the broker (`HistoryEntry` in `src/history.ts`).
+- **Call** (metro → train): `{ "op": "call", "id": "req_abc", "action": "send", "args": {...} }`.
+- **Response** (train → metro): `{ "op": "response", "id": "req_abc", "result": {...} }` or `{ ..., "error": "..." }`.
+Anything on stdout without an `op` is treated as an inbound event.
+## Lifecycle
+Metro scans `~/.metro/trains/*.{ts,js,mjs}` at boot — one subprocess per file. Crashed trains restart with backoff (1s → 5s → 30s, up to 5 consecutive failures). `metro trains list` shows state. Restart the daemon to pick up edits. `~/.metro/.env` is auto-loaded into each train's `process.env`.