npm - switchroom - Versions diffs - 0.12.12 → 0.12.14 - Mend

switchroom 0.12.12 → 0.12.14

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (13) hide show

package/README.md +107 -371
package/dist/cli/switchroom.js +397 -245
package/dist/host-control/main.js +1 -1
package/dist/vault/approvals/kernel-server.js +20 -0
package/dist/vault/broker/server.js +111 -1
package/package.json +1 -1
package/telegram-plugin/dist/gateway/gateway.js +208 -26
package/telegram-plugin/gateway/approval-callback.ts +24 -13
package/telegram-plugin/gateway/gateway.ts +149 -17
package/telegram-plugin/gateway/pending-permission-decisions.ts +112 -0
package/telegram-plugin/gateway/vault-grant-inbound-builders.ts +117 -0
package/telegram-plugin/tests/pending-permission-decisions.test.ts +73 -0
package/telegram-plugin/tests/vault-save-inbound-builders.test.ts +96 -0

package/README.md CHANGED Viewed

@@ -1,5 +1,5 @@
 <p align="center">
-  <img src="docs/assets/switchroom-hero-wide.png" alt="Switchroom — opinionated Telegram UX for Claude Code on your Pro or Max subscription" width="100%">
+  <img src="docs/assets/switchroom-hero-wide.png" alt="Switchroom: opinionated Telegram UX for Claude Code on your Pro or Max subscription" width="100%">
 </p>
 # Switchroom
@@ -10,176 +10,76 @@
 [![Trigger evals](https://img.shields.io/endpoint?url=https%3A%2F%2Fgist.githubusercontent.com%2Fmekenthompson%2F002f3482b19111d35e57c1903b3733e2%2Fraw%2Fswitchroom-trigger-evals.json)](https://github.com/switchroom/switchroom/actions/workflows/ci-evals.yml)
 [![Quality evals](https://img.shields.io/endpoint?url=https%3A%2F%2Fgist.githubusercontent.com%2Fmekenthompson%2F002f3482b19111d35e57c1903b3733e2%2Fraw%2Fswitchroom-quality-evals.json)](https://github.com/switchroom/switchroom/actions/workflows/ci-evals.yml)
-**A switchboard for your Pro or Max.** Your Claude subscription, as a fleet of always-on specialist agents you talk to from Telegram. Opinionated UX, done properly.
-> *I loved OpenClaw + Telegram. I wanted my Claude subscription. And the UX done properly. So I built this.*
+**A switchboard for your Pro or Max.** Your Claude subscription, run as a fleet of always-on specialist agents you talk to from Telegram. Opinionated UX, done properly.
 [Latest release notes →](CHANGELOG.md)
-## See what your agent is doing
-Every time an agent starts work, a **progress card** pins into its Telegram topic and updates in place as tools execute. Each Read, Bash, Edit, Grep is visible as it happens, with elapsed time so you can tell if something's stuck. Sub-agents surface in the same card. When the agent finishes, the card flips to Done and unpins.
-No silent gaps. No ghosts. No squinting into a black box.
-<p align="center"><img src="docs/diagrams/progress-card-anatomy.svg" width="700" alt="Annotated progress card: pin badge, user quote, last 5 steps, collapsed older, in-flight pulse, elapsed timer, sub-agent indent"></p>
-```
-⚙️ Working… · ⏱ 12s
-💬 refactor the auth module to use JWT
-  ─ ─ ─
-  … (+3 more earlier steps)
-  ✅ Read src/auth/session.ts
-  ✅ Grep "cookie" (in src/)
-  🤖 Edit src/auth/jwt.ts · 4s
-```
-The card is the headline UX. The rest of the product is in service of it.
+## Why this exists
-- Cards update at most once every 5 seconds. Fast enough to follow, not so fast it floods.
-- Last 5 steps stay visible. Older ones collapse into `(+N more earlier steps)`.
-- Running steps show elapsed time so a stuck tool is obvious.
-- Tool labels are deterministic, written by a `PreToolUse` hook, so the card never lies about what's running.
-- Two agents working at once? Each gets its own card, labelled `(1/2)` and `(2/2)`.
-### Sub-agent visibility
+> *I loved OpenClaw + Telegram. I wanted my Claude subscription. And the UX done properly. So I built this.*
-When an agent delegates to a sub-agent — Opus plans, Sonnet implements — the sub-agent's work shows up indented inside the parent's pinned card. One pinned surface per task, however many processes it spawns underneath. Nothing gets buried in a side-channel you have to go look for.
+You had the obvious idea. Run Claude Code agents 24/7 on a cheap Linux box, talk to them from Telegram, use the Pro or Max subscription you already pay for.
-### Right, so what's this about
+Two ways to do that today. Both miss:
-So you had the bright idea. Run Claude Code agents 24/7 on a cheap Linux box, talk to them from Telegram, use the Claude Pro or Max subscription you're already paying for. Sensible. Obvious, even.
+- **OpenClaw + Telegram.** Great UX. But it hits the Anthropic API on your own key, so a token bill ticks over in the background. You signed up to use your subscription, not to buy API credits on top of it.
+- **Claude Code's built-in Telegram channel.** Uses the subscription correctly. But it's an MVP black box. Send a message, wait, eventually something comes back. What did the agent actually do? Which tools ran? Did it get stuck? No idea.
-Then you tried OpenClaw. Followed the docs, spun it up, got it running, only to realise halfway through that you're pinging the Anthropic API on your own key and your token bill is quietly ticking over in the background. Bit of a bait and switch, that one. You signed up for "use your subscription," not "buy API credits on top of your subscription."
+Switchroom is the third option. Subscription-honest, and the UX done properly. The headline: the agent is never a black box and never silent. You always see what it is doing, what it is waiting on, and when it is done.
-So you gave Claude Code's built-in Telegram channel a crack instead. Sent a message. Waited. Something happened, maybe. Eventually a reply came back. What did the agent actually do? No idea. Which tools ran? No idea. Did it get stuck, crash, spawn a sub-agent, read half your repo? No idea. It's an MVP black box of death, and I got sick of squinting into it.
+## See it work
-So I built this.
-## What you get
-| Feature | What it does |
-|---|---|
-| **Progress cards** | Pinned, in-place, every tool call visible. The headline UX. |
-| **Claude Pro/Max auth** | OAuth, not API keys. No per-token billing. Fleet-wide active account + fallback order; broker-owned refresh and credential fanout. |
-| **Approval kernel** | Inline allow/deny cards in Telegram for every gated tool. TTL'd grants, full audit trail. |
-| **Sub-agents** | Opus plans, Sonnet implements. Sub-agent work surfaces in the parent card. |
-| **Config cascade** | Defaults, then profiles, then per-agent YAML. Change one line, every agent updates. |
-| **Scheduled tasks** | Cron-syntax tasks that fire across reboots. Headless secret access via the vault broker. |
-| **Persistent memory** | Hindsight semantic memory with knowledge graphs and mental models. |
-| **Session continuity** | Resume across restarts with freshness gating and a wake-audit. |
-| **Encrypted vault** | AES-256-GCM for secrets. Optional auto-unlock keyed off `/etc/machine-id`. |
-| **Drive MCP** | Read Google Docs, Sheets, and Drive files inline. Per-agent OAuth, no shared key. |
-| **Card audit log** | Every progress-card edit appended to `card-events.jsonl` for retrospective debugging. |
-| **15 Telegram MCP tools** | Reply, stream replies, edit, pin, react, native checklists, sticker aliases, voice-in transcription, attachments, history. |
-## Architecture
-One long-running service per agent. Each agent runs the stock `claude` CLI — not a fork, not the Agents SDK, not a wrapped harness — authenticated directly with Anthropic via official OAuth. Switchroom is scaffolding and lifecycle around the CLI you'd run by hand: a Telegram bot, an approval broker, a vault broker, and Docker Compose for supervision. See [`docs/architecture.md`](docs/architecture.md) for the process model and how each layer maps to the `claude` CLI.
+Switchroom's UX is deterministic. Every agent topic carries a status line that is always one of three honest states, never a guess and never nothing:
 ```
-You (Telegram)
-    │
-    ▼
-@YourBot ──┬── switchroom-telegram MCP ──┬── agent supervisor ─── Claude Code CLI
-           │       (15 tools)            │     (per-agent)        │
-           │                             │                        ├─ .claude/agents/*.md (sub-agents)
-           ├─ Progress cards             ├─ Approval kernel ◄─────┤   settings.json (tools, hooks, MCP)
-           ├─ Pin / unpin lifecycle      │   (allow/deny broker)  ├─ Hindsight plugin (memory)
-           ├─ SQLite history             ├─ Vault broker ◄────────┤   Drive MCP, Playwright MCP, …
-           ├─ Card-events.jsonl audit    ├─ Auth broker ◄─────────┤   in-agent scheduler sidecar
-           ├─ Emoji reactions            │   (OAuth refresh,       └─ (cron, fires across reboots)
-           └─ Format conversion          │    sole creds writer)
-                                         ├─ hostd (host-control:
-                                         │   /restart, /update apply)
-                                         └─ Docker Compose restart (unless-stopped)
+🟡 quiet · no turns yet
+⚙️ working since 2m ago
+🟢 idle · last reply 5m ago
 ```
-See [`docs/architecture.md`](docs/architecture.md) for the process model, IPC layout, supervisor choice, and how each layer maps to the `claude` CLI.
-## Approvals & safety
-Tools that touch the world — Bash, Edit, Write, anything not on an agent's pre-approved allowlist — pause for explicit approval. Switchroom's **approval kernel** (shipped in v0.5.1) routes every gated tool call through an inline Telegram card with the actual diff or command shown. Tap Allow and the tool resumes. Tap Deny and the agent gets a clean refusal it can recover from.
-<p align="center"><img src="docs/diagrams/approval-grant-flow.svg" width="700" alt="Approval grant flow: agent tool call pauses at the kernel, broker writes pending grant to sqlite, user taps Allow on the Telegram card, broker releases the gate, tool resumes"></p>
-- **Inline cards.** Allow / Deny / Allow once / Allow for 1h. No leaving Telegram.
-- **TTL'd grants.** "Allow Bash for 1h" expires automatically. No silent permanent escalation.
-- **Audit trail.** Every grant, denial, and expiry written to a per-agent log you can replay.
-- **Per-agent allowlist.** `switchroom agent grant <name> <tool>` for the boring ones you don't want to be asked about.
-The kernel runs as an out-of-process broker over a unix socket. The agent process never decides its own permissions; it asks and waits.
-### Compliance posture
-Switchroom never intercepts auth, never proxies inference, never patches the CLI. The `claude` binary you run is the one Anthropic ships. See the [Compliance Attestation](docs/compliance-attestation.md) for the full analysis against Anthropic's April 2026 third-party policy.
-## Survives real life
-Each agent is a long-running service. They survive reboots, network drops, and your laptop closing. But "always on" isn't enough on its own. Things still die. The product has to handle that gracefully or the illusion breaks.
-<p align="center"><img src="docs/diagrams/wake-audit-lifecycle.svg" width="700" alt="Wake-audit lifecycle: kill, crash-pane snapshot, auto-restart, agent boots with SWITCHROOM_PENDING_TURN, acks with three options"></p>
-- **Auto-restart.** Agent containers come up with `restart: unless-stopped`, and each service has a healthcheck — a crashed or wedged agent is brought back automatically. No silent dropped work.
-- **Resume protocol.** When an agent reboots mid-turn, `start.sh` exports `SWITCHROOM_PENDING_TURN=true` plus the original chat / message ids. The agent's first action on boot is to acknowledge the gap and ask the user how to proceed (start over, summarise and continue, or drop it).
-- **Wake-audit.** On every fresh boot the agent checks for owed replies, orphan sub-agents, and stale in-progress todos. If everything's clean it stays quiet. If it owed you a reply, it tells you.
-- **Token refresh.** The `switchroom-auth-broker` daemon owns the refresh loop and is the sole writer of every `credentials.json`. Per-account quota state fans out across the fleet in seconds; `auth.fallback_order` cycles when an account is exhausted.
+While the agent works, the reply streams in place (about once a second, not one delayed dump at the end) and tool steps render as they run with `MM:SS` durations. Long-running work gets a Steer button so you can redirect mid-turn without killing it. When an agent delegates, a sub-agent block shows the live fleet, up to 5 members, each with its role and current tool:
-## How it stacks up
+```
+⚙️ Working…  ·  00:12
+refactor the auth module to use JWT
-| | Switchroom | Claude Code channels | OpenClaw | NanoClaw |
-|---|---|---|---|---|
-| Progress visibility | Live cards, pinned | Black box | None | None |
-| Runtime | Claude Code CLI | Claude Code CLI | Custom runtime | Agents SDK |
-| Auth | Pro/Max OAuth | Pro/Max OAuth | API key | API key |
-| Sub-agent tracking | Yes, in card | No | No | No |
-| Parallel task display | Labelled cards `(1/N)` | No | No | No |
-| Approval UX | Inline Telegram cards | None | None | None |
-| Config | YAML with cascade | None | JSON/TOML | Env vars |
-| Setup | `switchroom setup` | Built-in (limited) | Docker compose | Docker compose |
+✅ Read session.ts
+✅ Grep "cookie"
+🔵 Edit jwt.ts  ·  00:04
-The wedge against OpenClaw and NanoClaw isn't the substrate — it's the stock `claude` CLI under your subscription, instead of a custom runtime under your API key.
+Sub-agents · 2 running
+🔵 implement  ·  Edit jwt.ts
+🔵 test-runner  ·  Bash npm test
+                                   [ Steer ]
+```
-## Install
+<p align="center"><img src="docs/diagrams/deterministic-status-anatomy.svg" width="760" alt="Deterministic status anatomy: the never-silent state strip (quiet, working, idle) feeding a working-status message with streamed tool steps at MM:SS, a sub-agent fleet block, and a Steer button. Posted and edited in the topic, never pinned."></p>
-Runs on the box you already have. The supported production runtime is Linux + Docker. **Canonical target: Ubuntu 24.04 LTS with ≥4 GiB RAM** (8 GiB recommended once you run more than one agent). Other Debian-derivatives work with the same script; non-apt distros need a manual prereq install. macOS (Docker Desktop) works for development but is not yet release-validated.
+Older sub-agents collapse to `+N more` once the live set passes 5. Tool arguments are sanitised before they render: file paths show the basename only, token-shaped strings get redacted, so a secret never lands in a status message. The card is posted and edited in the topic, not pinned, so it never leaves a stale `⚙️ Working…` stranded at the top of your chat. [Full UX behaviour →](docs/telegram-plugin.md)
-> **Full new-user walkthrough — [`docs/install.md`](docs/install.md).** Zero to first Telegram message in ~15 minutes. Includes the [BotFather walkthrough](docs/botfather-walkthrough.md). Read that first if you're installing from scratch.
+## Quickstart
-> **Heads up on the package name.** The npm package was originally `switchroom-ai`. It's now just `switchroom`. The old name is deprecated and will stop receiving updates — `npm install -g switchroom` is the current path.
+Runs on the box you already have. Supported production runtime is Linux + Docker. Canonical target: Ubuntu 24.04 LTS, 4 GiB RAM minimum, 8 GiB once you run more than one agent. macOS (Docker Desktop) works for development, not yet release-validated.
-### Fresh Linux box — one script
+**Fresh Linux box, one script:**
 ```bash
 curl -fsSL https://github.com/switchroom/switchroom/raw/main/scripts/install-deps.sh | sudo bash
 ```
-Installs Docker Engine + Compose v2, Node.js 20.11+, Bun, and the `@anthropic-ai/claude-code` + `switchroom` CLIs. Idempotent. Adds the invoking user to the `docker` group. Tested on Ubuntu 24.04 LTS and 26.04 LTS. Warns (does not block) on hosts under 4 GiB RAM.
-Then log out and back in so the docker group takes effect, and:
+Installs Docker Engine + Compose v2, Node.js 20.11+, Bun, and the `@anthropic-ai/claude-code` + `switchroom` CLIs. Idempotent. Log out and back in so the docker group takes effect, then:
 ```bash
 switchroom setup                       # interactive: Telegram + vault + first agent
-switchroom apply                       # regenerate ~/.switchroom/compose/docker-compose.yml
-docker compose -p switchroom -f ~/.switchroom/compose/docker-compose.yml up -d   # bring the fleet up
-switchroom auth add default --via-claude   # OAuth your Claude Pro/Max account — run AFTER the fleet is up
+switchroom apply                       # generate ~/.switchroom/compose/docker-compose.yml
+docker compose -p switchroom -f ~/.switchroom/compose/docker-compose.yml up -d
+switchroom auth add default --via-claude   # OAuth your Pro/Max account, AFTER the fleet is up
 switchroom auth use default                # make it the fleet-wide active account
 ```
-Auth comes *after* the fleet is up on purpose: the `switchroom-auth-broker` is the sole writer of credentials and doesn't exist until the compose stack is running (`switchroom auth …` beforehand just prints a "fleet not up" hint). After this you talk to the agent from Telegram and don't touch the server again. To catch a running host up later — pull images, refresh scaffolds, recreate — use **`switchroom update`**, not a raw `docker compose up` (a bare compose-up on a live fleet skips the operator restart-marker, so the boot cards render as crashes).
+Auth comes after the fleet is up on purpose. The `switchroom-auth-broker` is the sole writer of credentials and does not exist until the compose stack is running. After this you talk to the agent from Telegram and never touch the server again. To catch a running host up later use `switchroom update`, not a raw `docker compose up` (a bare compose-up on a live fleet skips the operator restart-marker, so boot cards render as crashes).
-### Already have Docker + Node ≥ 20.11
-```bash
-sudo npm install -g bun @anthropic-ai/claude-code switchroom
-switchroom setup
-```
-`bun` is a hard runtime dep — the `switchroom` CLI's entrypoint is a Bun script. A Node-only CLI build is on the roadmap but not yet shipped.
-### From inside Claude Code (the on-ramp)
-If you already use Claude Code, this is the shortest path. Inside any session:
+**Already in Claude Code?** Shortest path. Inside any session:
 ```
 /plugin marketplace add switchroom/switchroom
@@ -187,237 +87,88 @@ If you already use Claude Code, this is the shortest path. Inside any session:
 /switchroom:setup
 ```
-`/switchroom:setup` walks you through deps, `switchroom setup` (Telegram + vault + first agent), and `switchroom agent start`. Day-to-day: `/switchroom:start`, `/switchroom:stop`, `/switchroom:status`. See [`docs/publishing.md`](docs/publishing.md).
-### Static binary (planned, not yet shipped)
-Pre-built single-binary releases (no Node or Bun required on the host) are scaffolded in [`install.sh`](install.sh) and referenced from the GitHub Releases page, but **the release workflow that publishes those binaries still does not exist as of v0.12.0**. Use the one-script or npm paths above. Tracking work: switching the release pipeline to actually upload `switchroom-linux-{amd64,arm64}` and `switchroom-macos-{amd64,arm64}` on every tag.
-### One-shot happy path (no wizard)
-If you already have Telegram credentials in `~/.switchroom/switchroom.yaml` and one Anthropic account already added, skip `switchroom setup`. `agent create --profile` writes a minimal entry; the new agent inherits the fleet-wide active account automatically — no per-agent OAuth flow:
-```bash
-switchroom agent create coach --profile health-coach
-switchroom apply && docker compose -p switchroom -f ~/.switchroom/compose/docker-compose.yml up -d
-```
-## Example configuration
-```yaml
-switchroom:
-  version: 1
-telegram:
-  # Per-agent bot token (DM-only by default).
-  bot_token: "vault:telegram-bot-token"
-memory:
-  backend: hindsight
-defaults:
-  model: claude-opus-4-7   # or claude-sonnet-4-6, claude-haiku-4-5
-  tools: { allow: [all] }
-  subagents:
-    worker:
-      description: "Implementation tasks"
-      model: sonnet
-      background: true
-      isolation: worktree
-  schedule:
-    - cron: "0 8 * * 1-5"
-      prompt: "Morning briefing"
-  session:
-    max_idle: 2h
-agents:
-  assistant:
-    topic_name: "General"
-    memory: { collection: general }
-  coach:
-    topic_name: "Coach"
-    extends: advisor
-    soul:
-      name: Coach
-```
-See [docs/configuration.md](docs/configuration.md) for the full reference.
-## Vault broker (cron secrets)
-Scheduled tasks run headless inside the agent container, so they can't prompt for the vault passphrase. The vault broker is a long-running container (`switchroom-vault-broker`) that holds the vault decrypted in memory after a one-time interactive unlock. Cron tasks fetch the specific keys they declare via a per-agent unix socket. The passphrase never sits on disk.
-**Declare per-cron secrets in `switchroom.yaml`:**
+Full new-user walkthrough, zero to first Telegram message in ~15 minutes, plus the npm-only path, the no-wizard one-shot, the BotFather steps, and static-binary status: **[docs/install.md](docs/install.md)**.
-```yaml
-agents:
-  scout:
-    schedule:
-      - cron: "0 8 * * *"
-        prompt: "Morning brief."
-        secrets: [openai_api_key, polygon_api_key]   # only these may be read
-```
-`secrets: []` (the default) means the cron has no vault access.
-**Bootstrap once per host:**
-```bash
-switchroom apply                        # writes broker into docker-compose.yml
-docker compose -p switchroom -f ~/.switchroom/compose/docker-compose.yml up -d switchroom-vault-broker
-switchroom vault broker unlock          # prompt for passphrase, primes broker
-```
-Or just run `switchroom vault get <key>` from a TTY. The broker offers to take the unlocked state with `[Y/n]` so you don't have to remember a separate unlock command.
-**Identity model (v0.7+).** Path-as-identity. The broker binds one socket per agent at `/run/switchroom/broker/<agent>/sock` inside its own container, hosted via a per-agent named volume that's also mounted at `/run/switchroom/broker/` inside `agent-<agent>`. The agent name is parsed unspoofably from the bind path — see `src/vault/broker/peercred.ts:socketPathToAgent()`. A compromised agent cannot pose as another agent's cron because it only ever sees its own socket on its mount. ACL is bind-time, never wire-time.
-The broker locks on `SIGTERM` (so a container restart zeros the in-memory state) and on demand via `switchroom vault broker lock`. Use `switchroom vault get <key> --no-broker` to bypass and prompt locally.
-Vault file (post-v0.7.12) lives at `~/.switchroom/vault/vault.enc` — a directory, not a single file, so atomic rename can use the parent as the staging dir. See [docs/vault.md](docs/vault.md) for the layout rationale.
-### Auto-unlock on boot (opt-in)
-By default, the broker holds the unlocked state in memory only. Every restart (host reboot, service crash, reconcile that re-renders the unit) wipes it and requires `switchroom vault broker unlock` again. For unattended hosts where this is too painful, switchroom can encrypt the passphrase with a key derived from `/etc/machine-id` and have the broker unlock itself at boot:
-```bash
-switchroom vault broker enable-auto-unlock   # one-time setup, prompts for passphrase
-```
-Done. The wizard prompts for your vault passphrase, encrypts it with AES-256-GCM keyed off `/etc/machine-id`, writes the result to `~/.switchroom/vault-auto-unlock` (mode 0600), flips `vault.broker.autoUnlock: true` in `switchroom.yaml`, restarts the broker, and verifies the vault came up unlocked. Every subsequent boot the broker reads + decrypts + unlocks itself.
-Disable with `switchroom vault broker disable-auto-unlock`.
-**Security tradeoff. Read this before enabling.** The encrypted blob lives at mode 0600 in your home directory; the encryption key is derived from `/etc/machine-id` plus a per-file random salt. Disk theft is safe (the blob doesn't decrypt on any other machine) and other UNIX users on the same box can't read it. But root on the host *can* read both the blob and the machine-id, so once root is on the machine the passphrase is recoverable. Same blast radius as the running broker process (anything with code-exec as you can already attach to the broker socket and exfiltrate secrets), but it shifts the convenience-vs-security knob: auto-unlock means a lost laptop is a lost vault even if the vault file itself is encrypted at rest. Use only on hosts you trust. See [docs/auto-unlock.md](docs/auto-unlock.md) for the full threat model and recovery instructions.
-## CLI reference
-```bash
-switchroom setup                              # Interactive wizard
-switchroom doctor                             # Health check
-switchroom apply                              # Reconcile + regenerate docker-compose.yml (self-elevates via sudo for scaffolds). Does NOT run docker — prints the `up` command
-switchroom update [--check|--status|--rebuild] # Operator catch-up: pull images + apply + recreate fleet + doctor
-switchroom restart [agent] [--force]          # Bounce agent(s); drains in-flight turn by default
-switchroom version                            # Show versions + running agent health summary
-switchroom agent list                         # Status of all agents
-switchroom agent status <name>                # Status of one agent
-switchroom agent add [name]                   # Wizard: scaffold a new agent end-to-end (#543)
-switchroom agent create <name> [--profile <p>] # Scaffold + install timers; --profile writes yaml entry
-switchroom agent bootstrap <name> --profile <p> --bot-token <t>  # One-shot scaffold + auth + start
-switchroom agent reconcile <name|all>         # Re-apply switchroom.yaml (without pulling/building)
-switchroom agent start|stop|restart <name>    # Lifecycle (with preflight)
-switchroom agent interrupt <name>             # Cancel in-flight turn without restarting
-switchroom agent unquarantine <name>          # Clear a crash-quarantine and resume supervision
-switchroom agent rename <old> <new>           # Rename an agent slug (#168)
-switchroom agent destroy <name>               # Remove from compose + scaffold dir
-switchroom agent attach <name>                # Interactive tmux session
-switchroom agent send <name> <slash-cmd>      # Inject a slash command into the agent's tmux pane
-switchroom agent logs <name> [-f]             # View logs
-switchroom agent grant <name> <tool>          # Grant a tool permission
-switchroom agent permissions <name>           # Show allow/deny list
-switchroom agent dangerous <name> [off]       # Toggle full tool access
-switchroom soul path|show|reset <name>        # Manage the agent's user-owned SOUL.md (persona)
-switchroom hostd install|status|uninstall|audit # Host-control daemon (/restart, /update apply, …)
-switchroom drive connect|disconnect <agent>   # Per-agent Google Drive OAuth
-```
+## What you get
-`switchroom --help` lists every verb (also `deps`, `issues`, `migrate`).
+| Feature | What it does |
+|---|---|
+| **Deterministic UX** | Every topic shows an honest state: quiet, working since, or idle. Never a black box, never silent. The headline. |
+| **Live step streaming** | The reply streams in place while tools run, with durations and a Steer button to redirect mid-turn. Not pinned, no stale cards. |
+| **Sub-agent fleet view** | Delegated workers surface as live rows (up to 5, then `+N more`), each with role and current tool. Args sanitised before render. |
+| **Claude Pro/Max auth** | OAuth, not API keys. No per-token billing. Fleet-wide active account plus fallback order, broker-owned refresh and credential fanout. |
+| **Vault + approval kernel** | AES-256-GCM secrets. Per-agent least-privilege ACL. Anything extra needs your tap on an inline Telegram Approve/Deny card. |
+| **Sub-agents** | Opus plans, Sonnet implements. Sub-agent work surfaces in the fleet block. |
+| **Config cascade** | Defaults, then profiles, then per-agent YAML. Change one line, every agent updates. |
+| **Scheduled tasks** | Cron-syntax tasks that fire across reboots. Headless secret access through the vault broker. |
+| **Persistent memory** | Hindsight semantic memory, including a per-user mental model that carries across conversations. |
+| **Always-on** | Long-running service per agent. Survives reboots, network drops, your laptop closing. Resumes mid-turn with a wake-audit. |
+| **17 Telegram MCP tools** | reply, stream, react, edit, pin, delete, forward, typing, history, checklists (+update), stickers, GIFs, attachment download, ask-user, vault request access/save. |
-Profiles live in `profiles/` at the repo root. Bundled ones for `--profile`: `coding`, `default`, `executive-assistant`, `health-coach` (the `_base/` dir is framework-internal render templates and is not a user-selectable profile).
+## Subscription-honest, and safe by default
-`switchroom agent create <name> --profile <profile>` does two things in one step:
+**Stock CLI, real OAuth.** Each agent runs the unmodified `claude` binary, authenticated with Anthropic through the same OAuth flow you use on the desktop app. No API key. No harness. No patched CLI. No proxied inference. One bill, the one you already pay. See the [Compliance Attestation](docs/compliance-attestation.md) for the full analysis against Anthropic's April 2026 third-party policy.
-1. Adds an entry to `switchroom.yaml` under `agents:` with `extends: <profile>` and a derived `topic_name` (capitalized agent name). Edit the yaml afterwards to change the topic name, emoji, tools, etc.
-2. Scaffolds the agent directory and registers the agent in `docker-compose.yml` on next `switchroom apply` (same as running `agent create` on an entry that already exists in yaml).
+**Least privilege, and you hold the keys.** This is the core opinion. An agent never holds the vault passphrase and never sees a secret it was not given. Secrets live in an AES-256-GCM vault. Each agent reaches the vault broker over its own socket whose identity is the bind path, so a compromised agent cannot pose as another. An agent can read a key only if you listed it for that agent's task: that is the standing, least-privilege ACL. Anything beyond that is just-in-time. The agent asks, you get an inline Telegram Approve/Deny card showing exactly what and why, and only your tap mints a scoped, expiring grant. The agent cannot self-elevate.
-If the agent is already in yaml, `--profile` must match the existing `extends:` value or it errors. If the yaml entry has no `extends:` and you pass `--profile`, the flag is written in additively with a warning. Running `agent create` with no `--profile` on a missing entry keeps the old "Agent not defined in switchroom.yaml" error, now with a hint to use `--profile`.
+<p align="center"><img src="docs/diagrams/approval-grant-flow.svg" width="700" alt="Approval grant flow: agent requests a key it does not have, broker stages a pending grant, you tap Allow on the Telegram card, broker mints a scoped TTL grant, the read proceeds"></p>
-Model aliases: the bare names `opus`, `sonnet`, `haiku` are accepted alongside the full IDs (`claude-opus-4-7`, `claude-sonnet-4-6`, `claude-haiku-4-5`). Use whichever reads cleaner in your config.
+**The approval kernel gates risky actions too.** A separate kernel daemon handles action approvals on the same model: an agent that wants to write to a Google Doc requests it, ends its turn, and waits. You see the diff and tap Allow or Deny. Nothing destructive happens on the agent's say-so alone. TTL'd decisions expire, and every grant and denial is logged.
-### Authentication (one OAuth, many agents)
+### How agents collaborate on files
-The **Anthropic account is the unit of authentication.** One OAuth flow per account, then every agent in the fleet inherits the fleet-wide active account. The `switchroom-auth-broker` daemon owns the refresh loop and is the sole writer of every `credentials.json`. Per-account quota state fans out across the fleet in seconds. See [`docs/auth.md`](docs/auth.md) for the full operator guide.
+Switchroom's position: agents should collaborate with you on real documents, not paste walls of text into chat. The supported path is Google Drive. An agent reads and proposes changes, and every write or suggestion is gated through the approval kernel exactly like a credential request, so an agent never silently edits your Drive. Today Drive is opt-in per agent: you connect a Google account and enable it for the agents that should have it (`switchroom drive connect <agent>`), and the broker enforces that allowlist at runtime. It is not on by default for a fresh agent.
-```bash
-switchroom auth add <label> --via-claude          # New account, broader scope — recommended for first-time
-switchroom auth add <label> --from-oauth          # Narrow scope=user:inference (rejected by agents in server: mode)
-switchroom auth add <label> --from-agent <name>   # Seed from an existing agent's creds
-switchroom auth add <label> --from-credentials <path>  # Import a credentials.json
-switchroom auth add <label> --via-claude --replace     # Re-auth an existing label (drift recovery)
-switchroom auth list                              # Accounts + health + which one is fleet-active
-switchroom auth show [agent]                      # Full snapshot (fleet + agents + consumers), or one agent
-switchroom auth use <label>                       # Fleet-wide active swap
-switchroom auth rotate                            # Cycle to next non-exhausted in fallback_order
-switchroom auth rm <label>                        # Remove an account (refused if it's the only one)
-switchroom auth agent override <agent> <label>    # Edge case: one agent on a different account
-switchroom auth agent override <agent> --clear    # Back to fleet active
-switchroom auth refresh [label]                   # Diagnostic: force a refresh tick
-```
-The same surface is reachable from Telegram in any agent's chat: `/auth show` (read-only), `/auth use <label>`, `/auth rotate`. Mutating verbs are admin-gated against the per-agent `admin: true` flag (the same flag that gates `/agents`, `/restart`, `/update`, etc.). One knob to make an agent the fleet control panel.
+## Survives real life
-### Workspace (agent bootstrap layer)
+Always-on is not enough on its own. Things still die. The product has to handle that or the illusion breaks.
-Each agent has a workspace directory (`~/.switchroom/agents/<name>/workspace/`) with editable stable files (`AGENTS.md`, `USER.md`, `IDENTITY.md`, `TOOLS.md`) and dynamic files (`MEMORY.md`, `memory/YYYY-MM-DD.md`, `HEARTBEAT.md`) injected into the model's context at turn time.
+<p align="center"><img src="docs/diagrams/wake-audit-lifecycle.svg" width="700" alt="Wake-audit lifecycle: kill, crash-pane snapshot, auto-restart, agent boots with SWITCHROOM_PENDING_TURN, acks with three options"></p>
-`SOUL.md` (the persona) is a special case since v0.12.0: it's **user-owned and seeded once** — switchroom writes it at first scaffold (from the setup wizard's persona prompts or the profile default) and then *never overwrites it*, the deliberate inverse of the switchroom-managed `CLAUDE.md`. Edit it freely; `switchroom update` won't touch it. Use `switchroom soul reset <agent>` to re-seed from the profile (it backs the old one up first). See [docs/configuration.md § Persona & SOUL.md ownership](docs/configuration.md#persona--soulmd-ownership).
+- **Auto-restart.** Agent containers run with `restart: unless-stopped` and only start once the auth-broker's healthcheck passes. The vault broker, approval kernel, and auth broker each have their own healthchecks, so a wedged dependency is caught instead of silently breaking agents.
+- **Resume protocol.** When an agent reboots mid-turn it boots with `SWITCHROOM_PENDING_TURN` plus the original chat ids. Its first action is to acknowledge the gap and ask how to proceed: start over, summarise and continue, or drop it.
+- **Wake-audit.** On every fresh boot the agent checks for owed replies, orphan sub-agents, and stale todos. Clean means it stays quiet. If it owed you a reply, it tells you.
+- **Token refresh.** The `switchroom-auth-broker` owns the refresh loop and is the sole writer of every `credentials.json`. Per-account quota state fans out across the fleet in seconds, and `auth.fallback_order` cycles when an account is exhausted.
-```bash
-switchroom workspace path <agent>                 # Print the workspace dir
-switchroom workspace show <agent> [file]          # Print one workspace file (default AGENTS.md)
-switchroom workspace edit <agent> [file]          # Open in $EDITOR (default AGENTS.md)
-switchroom workspace render <agent> --stable      # Dump the stable bootstrap block (for start.sh)
-switchroom workspace render <agent> --dynamic     # Dump the dynamic block (for UserPromptSubmit)
-switchroom workspace search <agent> <query...>    # BM25-lite search over workspace markdown
-switchroom workspace commit <agent> [-m <msg>]    # Git checkpoint of workspace state
-switchroom workspace status <agent>               # git status on the workspace
-```
+## How it stacks up
-### Observability
+| | Switchroom | Claude Code channels | OpenClaw | NanoClaw |
+|---|---|---|---|---|
+| Progress visibility | Deterministic status, never silent | Black box | None | None |
+| Runtime | Claude Code CLI | Claude Code CLI | Custom runtime | Agents SDK |
+| Auth | Pro/Max OAuth | Pro/Max OAuth | API key | API key |
+| Sub-agent tracking | Yes, live fleet block | No | No | No |
+| Parallel display | Shared fleet status, up to 5 rows | No | No | No |
+| Approval UX | Inline Telegram cards | None | None | None |
+| Config | YAML with cascade | None | JSON/TOML | Env vars |
+| Setup | `switchroom setup` | Built-in (limited) | Docker compose | Docker compose |
-```bash
-switchroom debug turn <agent>                     # Dump the exact prompt layering from the last turn
-switchroom memory setup|search|stats|reflect      # Hindsight memory
-```
+The wedge against OpenClaw and NanoClaw is not the substrate. It is the stock `claude` CLI under your subscription, instead of a custom runtime under your API key. [vs OpenClaw](docs/vs-openclaw.md) and [vs NanoClaw](docs/vs-nanoclaw.md).
-The progress card driver also writes a per-agent `card-events.jsonl` audit log: every edit, pin, unpin, and tool-label transition the user sees in Telegram, captured locally so a debug session doesn't depend on Telegram's history. Tail it like any other journal.
+## Architecture
-### Other
+One long-running service per agent. Each agent runs the stock `claude` CLI, not a fork, not the Agents SDK, not a wrapped harness, authenticated directly with Anthropic over official OAuth. Switchroom is scaffolding and lifecycle around the CLI you would run by hand: a Telegram bot, an approval kernel, a vault broker, an auth broker, and Docker Compose for supervision.
-```bash
-switchroom topics sync|list|cleanup               # Telegram forum topics
-switchroom vault init|set|get|list|remove         # Encrypted secrets
-switchroom handoff <agent>                        # Cross-session handoff summarizer
-switchroom web                                    # Web dashboard
 ```
-### Migrating credentials from OpenClaw
-`scripts/import-openclaw-credentials.ts` is a one-shot migration script that lifts `/data/openclaw-config/credentials/` into the Switchroom vault. It ships with a small set of default mappings for filenames OpenClaw documents out of the box.
-User-specific credential filenames (your custom bot tokens, SSH keys, and so on) belong in a local overlay file, not the source repository. Create `~/.switchroom/import-openclaw.yaml`:
-```yaml
-# ~/.switchroom/import-openclaw.yaml
-files:
-  telegram-bot-token-mybot: telegram/mybot-bot-token
-  discord-bot-token-mybot: discord/mybot-bot-token
-  my-server-ssh-key: ssh/my-server
-skip:
-  compass-mac-cookies.json: "auto-managed by compass skill (8h TTL cache)"
-secrets_env:
-  X_BEARER_TOKEN: x-api/bearer-token
-directories:
-  garmin-tokens: garmin/tokens
+You (Telegram)
+    │
+    ▼
+@YourBot ──┬── switchroom-telegram MCP ──┬── agent supervisor ─── Claude Code CLI
+           │       (17 tools)            │     (per-agent)        │
+           │                             │                        ├─ .claude/agents/*.md (sub-agents)
+           ├─ Deterministic status       ├─ Approval kernel ◄─────┤   settings.json (tools, hooks, MCP)
+           │   (quiet/working/idle)      │   (action grants)      ├─ Hindsight plugin (memory)
+           ├─ Live step streaming        ├─ Vault broker ◄────────┤   Drive MCP, Playwright MCP, …
+           ├─ Sub-agent fleet block      │   (per-agent ACL)      ├─ in-agent scheduler sidecar
+           ├─ SQLite history             ├─ Auth broker ◄─────────┤   (cron, fires across reboots)
+           └─ Emoji reactions            │   (OAuth refresh,       │
+                                         │    sole creds writer)   │
+                                         ├─ hostd (host-control:   │
+                                         │   /restart, /update)    │
+                                         └─ Docker Compose restart (unless-stopped)
 ```
-Overlay entries win on collision with built-in defaults. Unknown files that appear in neither defaults nor the overlay surface as `warn` entries so nothing is silently dropped. Run `bun scripts/import-openclaw-credentials.ts --help` for flags including `--mapping <path>` to override the default overlay location.
+See [`docs/architecture.md`](docs/architecture.md) for the process model, IPC layout, supervisor choice, and how each layer maps to the `claude` CLI.
 ## Documentation
@@ -425,43 +176,27 @@ Overlay entries win on collision with built-in defaults. Unknown files that appe
 |---|---|
 | **[Install](docs/install.md)** | Zero-to-first-message new-user walkthrough |
 | **[BotFather walkthrough](docs/botfather-walkthrough.md)** | Step-by-step bot creation in Telegram |
-| **[Changelog](CHANGELOG.md)** | Release notes, every version |
-| **[Configuration](docs/configuration.md)** | Full field reference, cascade semantics, profiles |
+| **[Configuration](docs/configuration.md)** | Full field reference, cascade semantics, profiles, example config |
+| **[CLI reference](docs/cli-reference.md)** | Every verb, grouped, with behaviour and usage |
 | **[Vault](docs/vault.md)** | Architecture, per-cron secrets, ACL, audit log, threat model |
-| **[Telegram Plugin](docs/telegram-plugin.md)** | Progress cards, 15 MCP tools, native checklists, sticker aliases, voice-in |
+| **[Telegram Plugin](docs/telegram-plugin.md)** | Deterministic UX, 17 MCP tools, checklists, sticker aliases, voice-in |
 | **[Sub-Agents](docs/sub-agents.md)** | Model routing, delegation patterns, frontmatter spec |
 | **[Scheduling](docs/scheduling.md)** | Cron tasks (in-agent scheduler sidecar), model selection |
 | **[Session Management](docs/session-optimization.md)** | Continuity, compaction, freshness policy |
-| **[OpenClaw alternative](docs/vs-openclaw.md)** | Switchroom vs OpenClaw |
-| **[NanoClaw alternative](docs/vs-nanoclaw.md)** | Switchroom vs NanoClaw |
 | **[Compliance](docs/compliance-attestation.md)** | Anthropic compliance analysis |
+| **[Changelog](CHANGELOG.md)** | Release notes, every version |
 | **[Telemetry](docs/posthog.md)** | What Switchroom reports to PostHog and how to opt out |
-## Telemetry
-Switchroom reports anonymous usage events and errors to PostHog so I can spot regressions and understand which commands are used. **No personal data, code, or message content leaves your machine.** The anonymous ID lives at `~/.switchroom/analytics-id` and is a random UUID. Not tied to your username, email, IP, or machine identifier (we pass `disableGeoip: true` on every event).
-To opt out, set this in your shell profile:
-```bash
-export SWITCHROOM_TELEMETRY_DISABLED=1
-```
-Full event catalogue, dashboard links, and the source module at [docs/posthog.md](docs/posthog.md).
 ## FAQ
 **Can I use a Claude Pro or Max subscription instead of an API key?**
-Yes. That's the whole point. Switchroom runs the unmodified `claude` CLI with the same OAuth flow you use on the desktop app. No API key. No per-token billing.
+Yes. That is the whole point. Switchroom runs the unmodified `claude` CLI with the same OAuth flow you use on the desktop app. No API key. No per-token billing.
 **How is this different from Claude Code's built-in Telegram channel?**
-The built-in channel is message in, message out, with zero visibility into what the agent is doing in between. Switchroom adds live progress cards that pin to the top of each topic and update as tools execute. You can always see what's happening, which is the bit the built-in channel gets wrong.
+The built-in channel is message in, message out, with no visibility into what the agent is doing in between. Switchroom shows a deterministic status on every topic (quiet, working, idle) and streams the steps in place as tools run. You always know the state, which is the bit the built-in channel gets wrong.
 **Does it work with multiple agents at the same time?**
-Yes. Each agent gets its own Telegram forum topic. When multiple agents are working simultaneously, each has its own pinned progress card labelled `(1/N)`, `(2/N)` and so on.
-**Can I see what sub-agents are doing?**
-Yes. When an agent delegates to a sub-agent (a worker, a researcher), the sub-agent's activity shows up in its own indented section of the parent's progress card. You see the full hierarchy, not just the top-level agent.
+Yes. Each agent gets its own Telegram forum topic with its own status. When an agent delegates, its sub-agents show up as live rows in a shared fleet block (up to 5, then `+N more`), each with role and current tool.
 **What does it cost to run?**
 A cheap Linux VPS (around $6/mo on Hetzner, DigitalOcean, wherever), plus your existing Claude Pro ($20/mo) or Max ($100/mo) subscription. Switchroom itself is MIT-licensed, free.
@@ -469,8 +204,9 @@ A cheap Linux VPS (around $6/mo on Hetzner, DigitalOcean, wherever), plus your e
 **Is this against Anthropic's terms of service?**
 No. Switchroom uses the official `claude` binary with the official OAuth flow. See [docs/compliance-attestation.md](docs/compliance-attestation.md) for the full analysis.
-**Is Switchroom an alternative to OpenClaw?**
-Yes. Same use case, but it uses your Claude subscription via OAuth instead of an API key, and runs the native `claude` binary instead of a custom runtime in Docker. See [vs-openclaw](docs/vs-openclaw.md).
+## Telemetry
+Switchroom reports anonymous usage events and errors to PostHog so I can spot regressions and see which commands are used. **No personal data, code, or message content leaves your machine.** The anonymous ID at `~/.switchroom/analytics-id` is a random UUID, not tied to your username, email, IP, or machine identifier. Opt out with `export SWITCHROOM_TELEMETRY_DISABLED=1`. Full event catalogue at [docs/posthog.md](docs/posthog.md).
 ## License