npm - @madarco/agentbox - Versions diffs - 0.14.0 → 0.16.0 - Mend

@madarco/agentbox 0.14.0 → 0.16.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (66) hide show

package/CHANGELOG.md +108 -0
package/dist/{_cloud-attach-GUBB5RH2.js → _cloud-attach-5KJWOASL.js} +4 -4
package/dist/{chunk-RSKG7AFU.js → chunk-3WCEB6RE.js} +2 -2
package/dist/{chunk-XKH7NTT7.js → chunk-DBBUDKKB.js} +248 -5
package/dist/chunk-DBBUDKKB.js.map +1 -0
package/dist/{chunk-TCS5HXJX.js → chunk-GXJNJUEV.js} +1090 -527
package/dist/chunk-GXJNJUEV.js.map +1 -0
package/dist/{chunk-LDMYHWUS.js → chunk-NW2UZQV6.js} +10 -6
package/dist/chunk-NW2UZQV6.js.map +1 -0
package/dist/{chunk-TBSIJVSN.js → chunk-PIK47622.js} +37 -17
package/dist/chunk-PIK47622.js.map +1 -0
package/dist/{chunk-BKU34KYY.js → chunk-QXFNLKJJ.js} +9 -3
package/dist/{chunk-BKU34KYY.js.map → chunk-QXFNLKJJ.js.map} +1 -1
package/dist/{chunk-BYCLD6D6.js → chunk-SB4QTF2T.js} +98 -54
package/dist/chunk-SB4QTF2T.js.map +1 -0
package/dist/{chunk-VATTS2MR.js → chunk-SENASAU4.js} +10 -6
package/dist/{chunk-VATTS2MR.js.map → chunk-SENASAU4.js.map} +1 -1
package/dist/{dist-34RKQ74M.js → dist-4IQFJJQI.js} +5 -5
package/dist/{dist-4DPOL5A7.js → dist-7YB7BMNG.js} +5 -5
package/dist/{dist-3IMQNTTV.js → dist-SL2QSMBE.js} +5 -5
package/dist/{dist-J2IHD5T7.js → dist-VHI5QOSQ.js} +6 -6
package/dist/{dist-57M6ZA7H.js → dist-XC47DSCR.js} +5 -5
package/dist/index.js +1043 -333
package/dist/index.js.map +1 -1
package/dist/{prepared-state-MQHD3M5F-Q27AZU53.js → prepared-state-MQHD3M5F-2LANTRL7.js} +2 -2
package/package.json +6 -5
package/runtime/docker/Dockerfile.box +21 -2
package/runtime/docker/apps/cli/share/agentbox-setup/SKILL.md +112 -29
package/runtime/docker/packages/ctl/dist/bin.cjs +10353 -8575
package/runtime/docker/packages/sandbox-docker/scripts/agentbox-checkpoint-cleanup +5 -2
package/runtime/docker/packages/sandbox-docker/scripts/linear-shim +181 -0
package/runtime/docker/packages/sandbox-docker/scripts/ntn-shim +95 -0
package/runtime/e2b/agentbox-checkpoint-cleanup +5 -2
package/runtime/e2b/agentbox-setup-skill.md +112 -29
package/runtime/e2b/ctl.cjs +10353 -8575
package/runtime/e2b/linear-shim +181 -0
package/runtime/e2b/ntn-shim +95 -0
package/runtime/e2b/scripts/build-template.sh +13 -7
package/runtime/hetzner/agentbox-checkpoint-cleanup +5 -2
package/runtime/hetzner/agentbox-setup-skill.md +112 -29
package/runtime/hetzner/ctl.cjs +10353 -8575
package/runtime/hetzner/linear-shim +181 -0
package/runtime/hetzner/ntn-shim +95 -0
package/runtime/hetzner/scripts/install-box.sh +19 -9
package/runtime/relay/bin.cjs +3707 -2828
package/runtime/vercel/agentbox-checkpoint-cleanup +5 -2
package/runtime/vercel/agentbox-setup-skill.md +112 -29
package/runtime/vercel/ctl.cjs +10353 -8575
package/runtime/vercel/linear-shim +181 -0
package/runtime/vercel/ntn-shim +95 -0
package/runtime/vercel/scripts/provision.sh +13 -7
package/share/agentbox-setup/SKILL.md +112 -29
package/share/host-skills/agentbox-info/SKILL.md +22 -2
package/dist/chunk-BYCLD6D6.js.map +0 -1
package/dist/chunk-LDMYHWUS.js.map +0 -1
package/dist/chunk-TBSIJVSN.js.map +0 -1
package/dist/chunk-TCS5HXJX.js.map +0 -1
package/dist/chunk-XKH7NTT7.js.map +0 -1
/package/dist/{_cloud-attach-GUBB5RH2.js.map → _cloud-attach-5KJWOASL.js.map} +0 -0
/package/dist/{chunk-RSKG7AFU.js.map → chunk-3WCEB6RE.js.map} +0 -0
/package/dist/{dist-34RKQ74M.js.map → dist-4IQFJJQI.js.map} +0 -0
/package/dist/{dist-4DPOL5A7.js.map → dist-7YB7BMNG.js.map} +0 -0
/package/dist/{dist-3IMQNTTV.js.map → dist-SL2QSMBE.js.map} +0 -0
/package/dist/{dist-J2IHD5T7.js.map → dist-VHI5QOSQ.js.map} +0 -0
/package/dist/{dist-57M6ZA7H.js.map → dist-XC47DSCR.js.map} +0 -0
/package/dist/{prepared-state-MQHD3M5F-Q27AZU53.js.map → prepared-state-MQHD3M5F-2LANTRL7.js.map} +0 -0

package/runtime/vercel/linear-shim ADDED Viewed

@@ -0,0 +1,181 @@
+#!/usr/bin/env bash
+# agentbox `linear` shim — translates a strict subset of `linear`
+# (@schpet/linear-cli, v2) subcommands into `agentbox-ctl integration
+# linear <op>` so the host's authenticated `linear` runs the operation and
+# only the result crosses back into the box. The in-box agent never sees a
+# Linear API token.
+#
+# Installed at /usr/local/bin/linear (real `linear` is not in the box).
+#
+# This shim ships only what documented agent flows need; anything outside
+# the subset below is rejected with a clear error. Add ops deliberately —
+# the relay is gated by `integrations.linear.enabled` and an explicit op
+# allowlist in @agentbox/integrations.
+#
+# Three classes of upstream subcommand are EXPLICITLY rejected even though
+# they exist on the host CLI, because proxying them would defeat the
+# security model:
+#   - `auth token` PRINTS the raw API token to stdout — proxying it would
+#     hand the box the host's Linear credential. The only auth-family op
+#     we proxy is `auth whoami` (identity only), via `linear whoami`.
+#   - `auth login/logout/migrate/default` would mutate host auth state.
+#   - `issue delete` / `team delete` / `team create` are destructive and
+#     off-list (widen deliberately, as gated writes, only if needed).
+set -euo pipefail
+# Path is a constant in production; the env override exists purely to let
+# unit tests substitute a stub `agentbox-ctl` on PATH without rewriting the
+# shim. Mirrors gh-shim / git-shim / ntn-shim.
+CTL="${AGENTBOX_CTL_PATH:-/usr/local/bin/agentbox-ctl}"
+die() {
+  printf 'agentbox linear shim: %s\n' "$*" >&2
+  exit 2
+}
+handle_auth() {
+  local sub="${1-}"; shift || true
+  case "$sub" in
+    whoami)
+      exec "$CTL" integration linear whoami -- "$@"
+      ;;
+    token)
+      die "'auth token' leaks the raw API key — refused. Use 'linear whoami' for identity."
+      ;;
+    login|logout|migrate|default)
+      die "'auth $sub' is not proxied (the host owns auth; run it on the host)."
+      ;;
+    '')
+      die "missing subcommand for 'auth'. Supported: whoami"
+      ;;
+    *)
+      die "unsupported 'auth $sub' (allowed: whoami)"
+      ;;
+  esac
+}
+handle_issue_comment() {
+  local sub="${1-}"; shift || true
+  case "$sub" in
+    add)
+      exec "$CTL" integration linear issue.comment -- "$@"
+      ;;
+    '')
+      die "missing subcommand for 'issue comment'. Supported: add"
+      ;;
+    *)
+      die "unsupported 'issue comment $sub' (allowed: add)"
+      ;;
+  esac
+}
+handle_issue() {
+  local sub="${1-}"; shift || true
+  case "$sub" in
+    list)
+      exec "$CTL" integration linear issue.list -- "$@"
+      ;;
+    mine)
+      exec "$CTL" integration linear issue.mine -- "$@"
+      ;;
+    view)
+      exec "$CTL" integration linear issue.view -- "$@"
+      ;;
+    query)
+      exec "$CTL" integration linear issue.query -- "$@"
+      ;;
+    create)
+      exec "$CTL" integration linear issue.create -- "$@"
+      ;;
+    update)
+      exec "$CTL" integration linear issue.update -- "$@"
+      ;;
+    comment)
+      handle_issue_comment "$@"
+      ;;
+    delete)
+      die "'issue delete' is not proxied (destructive; off-list by default)."
+      ;;
+    '')
+      die "missing subcommand for 'issue'. Supported: list, mine, view, query, create, update, comment add"
+      ;;
+    *)
+      die "unsupported 'issue $sub' (allowed: list, mine, view, query, create, update, comment add)"
+      ;;
+  esac
+}
+handle_team() {
+  local sub="${1-}"; shift || true
+  case "$sub" in
+    list)
+      exec "$CTL" integration linear team.list -- "$@"
+      ;;
+    create|delete)
+      die "'team $sub' is not proxied (destructive; off-list by default)."
+      ;;
+    '')
+      die "missing subcommand for 'team'. Supported: list"
+      ;;
+    *)
+      die "unsupported 'team $sub' (allowed: list)"
+      ;;
+  esac
+}
+# Top-level dispatch. `linear`'s real subcommands are
+# `auth issue team project cycle milestone initiative label document api schema`;
+# we expose only the read-safe ones plus a few gated writes (no destructive
+# ops, no auth token).
+if [ $# -eq 0 ]; then
+  die "no subcommand. Supported: whoami, auth whoami, issue {list,mine,view,query,create,update,comment add}, team list, api <query>, --version"
+fi
+case "$1" in
+  --version|-v)
+    # Tools that sniff "linear --version" succeed with our shim line. The
+    # real version lives host-side and is reported by the relay's
+    # readiness probe (`assertIntegrationReady`).
+    printf 'linear version 0.0.0 (agentbox-shim)\n'
+    ;;
+  --help|-h)
+    printf 'agentbox linear shim — strict subset.\n' >&2
+    printf 'Supported: whoami, auth whoami, issue {list,mine,view,query,create,update,comment add}, team list, api <query>, --version\n' >&2
+    printf 'Anything else is rejected. Run host `linear --help` for full upstream docs.\n' >&2
+    ;;
+  whoami)
+    shift
+    exec "$CTL" integration linear whoami -- "$@"
+    ;;
+  auth)
+    shift
+    handle_auth "$@"
+    ;;
+  issue)
+    shift
+    handle_issue "$@"
+    ;;
+  team)
+    shift
+    handle_team "$@"
+    ;;
+  api)
+    shift
+    # `linear api` accepts pre-positional flags (`--variable`,
+    # `--variables-json`, `--paginate`, `--silent`) before the GraphQL
+    # query, so we don't require the FIRST arg to be a non-flag — only
+    # that some arg is present. The relay's refuseGraphqlNonQuery
+    # enforces query-only by rejecting any positional whose first
+    # keyword is `mutation`/`subscription` (and any `--variable
+    # key=@<path>` host-file load), so we don't duplicate that check
+    # here. Writes go through the dedicated issue.* ops.
+    if [ $# -eq 0 ]; then
+      die "'api' requires a positional <query> (e.g. '{ teams { id } }')"
+    fi
+    exec "$CTL" integration linear api -- "$@"
+    ;;
+  *)
+    die "'$1' is not proxied (supported: whoami, issue {list,mine,view,query,create,update,comment add}, team list, api <query>, --version)"
+    ;;
+esac

package/runtime/vercel/ntn-shim ADDED Viewed

@@ -0,0 +1,95 @@
+#!/usr/bin/env bash
+# agentbox `ntn` shim — translates a strict subset of `ntn` (the official
+# Notion CLI) subcommands into `agentbox-ctl integration notion <op>` so the
+# host's authenticated `ntn` runs the operation and only the result crosses
+# back into the box. The in-box agent never sees a Notion token.
+#
+# Installed at /usr/local/bin/ntn (real `ntn` is not in the box). The same
+# shim is symlinked as /usr/local/bin/notion — the per-service surface name
+# from docs/integrations_backlog.md — both invocations behave identically.
+#
+# This shim ships only what documented agent flows need; anything outside
+# the subset below is rejected with a clear error. Add ops deliberately —
+# the relay is gated by `integrations.notion.enabled` and an explicit op
+# allowlist in @agentbox/integrations.
+set -euo pipefail
+# Path is a constant in production; the env override exists purely to let
+# unit tests substitute a stub `agentbox-ctl` on PATH without rewriting the
+# shim. Mirrors gh-shim / git-shim.
+CTL="${AGENTBOX_CTL_PATH:-/usr/local/bin/agentbox-ctl}"
+die() {
+  printf 'agentbox notion shim: %s\n' "$*" >&2
+  exit 2
+}
+handle_pages() {
+  local op="${1-}"; shift || true
+  case "$op" in
+    create)
+      exec "$CTL" integration notion page.create -- "$@"
+      ;;
+    update)
+      exec "$CTL" integration notion page.update -- "$@"
+      ;;
+    '')
+      die "missing subcommand for 'pages'. Supported: create, update"
+      ;;
+    *)
+      die "unsupported 'pages $op' (allowed: create, update)"
+      ;;
+  esac
+}
+# Top-level dispatch. `ntn`'s real subcommands are
+# `api datasources files pages login logout whoami workers`; we expose only
+# the read-safe ones plus `pages {create,update}`.
+if [ $# -eq 0 ]; then
+  die "no subcommand. Supported: whoami, api <endpoint>, pages {create,update}, --version"
+fi
+case "$1" in
+  --version|-v)
+    # Tools that sniff "ntn version" succeed with our shim line. The real
+    # version lives host-side and is reported by the relay's readiness probe
+    # (`assertIntegrationReady`).
+    printf 'ntn version 0.0.0 (agentbox-shim)\n'
+    ;;
+  --help|-h)
+    printf 'agentbox notion shim — strict subset.\n' >&2
+    printf 'Supported: whoami, api <path> [inputs] [-d JSON], pages {create, update}, --version\n' >&2
+    printf 'api is read-only: GET to any endpoint; POST only to v1/search and\n' >&2
+    printf 'v1/{databases,data_sources}/{id}/query. Writes go through `pages`.\n' >&2
+    printf 'Anything else is rejected. Run host `ntn --help` for full upstream docs.\n' >&2
+    ;;
+  whoami)
+    shift
+    exec "$CTL" integration notion whoami -- "$@"
+    ;;
+  api)
+    shift
+    # Forward verbatim to mirror real `ntn api` (options may precede the path;
+    # `ls`/`help`/`--spec`/`--docs` and `-d <JSON>` bodies are all valid). The
+    # relay's refuseUnsafeApiCall is the security boundary: GET to any endpoint,
+    # POST only to read endpoints (v1/search, v1/databases/{id}/query,
+    # v1/data_sources/{id}/query); every other method/endpoint is refused.
+    # Writes go through the dedicated `pages create/update` ops.
+    exec "$CTL" integration notion api -- "$@"
+    ;;
+  pages)
+    shift
+    handle_pages "$@"
+    ;;
+  comment|comments)
+    # The T1 connector intentionally has no comment op — `ntn` exposes no
+    # top-level `comment` subcommand and Notion's REST POST /v1/comments
+    # takes a structured JSON body that doesn't trivially map from CLI
+    # flags. Tracked as a focused follow-up in docs/notion_backlog.md.
+    die "comment ops not supported yet (deferred from T2; see docs/notion_backlog.md)"
+    ;;
+  *)
+    die "'$1' is not proxied (supported: whoami, api <endpoint>, pages {create,update}, --version)"
+    ;;
+esac

package/runtime/vercel/scripts/provision.sh CHANGED Viewed

@@ -22,6 +22,8 @@
 #   /tmp/agentbox-open                 -- in-box xdg-open shim
 #   /tmp/agentbox-gh-shim              -- in-box `gh` shim (routes to host gh)
 #   /tmp/agentbox-git-shim             -- in-box `git` shim (routes via relay)
+#   /tmp/agentbox-ntn-shim             -- in-box `ntn`/`notion` shim (routes to host ntn)
+#   /tmp/agentbox-linear-shim          -- in-box `linear` shim (routes to host linear; rejects `auth token`)
 #   /tmp/agentbox-custom-CLAUDE.md     -- /etc/claude-code/CLAUDE.md content
 #   /tmp/agentbox-managed-settings.json -- /etc/claude-code/managed-settings.json
 #   /tmp/agentbox-codex-hooks.json     -- /usr/local/share/agentbox/codex-hooks.json
@@ -96,10 +98,10 @@ visudo -cf /etc/sudoers >/dev/null
 done_ "vscode user + sudoers"
 step "agentbox base dirs + /workspace ownership"
-mkdir -p /workspace /run/agentbox /var/log/agentbox /etc/agentbox /etc/claude-code \
+mkdir -p /workspace /run/agentbox /var/log/agentbox /var/lib/agentbox /etc/agentbox /etc/claude-code \
          /usr/local/share/agentbox
 chmod 755 /workspace
-chown vscode:vscode /workspace /run/agentbox /var/log/agentbox
+chown vscode:vscode /workspace /run/agentbox /var/log/agentbox /var/lib/agentbox
 done_ "agentbox base dirs + /workspace ownership"
 step "node setcap (bind <1024 without root)"
@@ -317,15 +319,19 @@ done_ "dnf cleanup"
 # the bake there is no relay, so they must not shadow the real binaries until
 # provisioning is done. Installed from /tmp just before the trim step removes the
 # sources.
-step "relay shims (gh + git)"
-install -m 0755 /tmp/agentbox-gh-shim  /usr/local/bin/gh
-install -m 0755 /tmp/agentbox-git-shim /usr/local/bin/git
-done_ "relay shims (gh + git)"
+step "relay shims (gh + git + ntn + linear)"
+install -m 0755 /tmp/agentbox-gh-shim     /usr/local/bin/gh
+install -m 0755 /tmp/agentbox-git-shim    /usr/local/bin/git
+install -m 0755 /tmp/agentbox-ntn-shim    /usr/local/bin/ntn
+ln -sf /usr/local/bin/ntn /usr/local/bin/notion
+install -m 0755 /tmp/agentbox-linear-shim /usr/local/bin/linear
+done_ "relay shims (gh + git + ntn + linear)"
 step "trim /tmp/agentbox-*"
 rm -f /tmp/agentbox-ctl /tmp/agentbox-vnc-start \
       /tmp/agentbox-checkpoint-cleanup /tmp/agentbox-open \
-      /tmp/agentbox-gh-shim /tmp/agentbox-git-shim \
+      /tmp/agentbox-gh-shim /tmp/agentbox-git-shim /tmp/agentbox-ntn-shim \
+      /tmp/agentbox-linear-shim \
       /tmp/agentbox-custom-CLAUDE.md /tmp/agentbox-managed-settings.json \
       /tmp/agentbox-codex-hooks.json /tmp/agentbox-setup-skill.md
 mv /tmp/agentbox-provision.sh /var/log/agentbox/provision.sh 2>/dev/null || true

package/share/agentbox-setup/SKILL.md CHANGED Viewed

@@ -46,35 +46,56 @@ Look at `/workspace`:
 - **Tasks** = one-shot. `pnpm install`, DB migrations, codegen, fixture loaders, install apt packages. Wire dependent services with `needs:` so they wait for the task to finish successfully.
 - Names: must match `[A-Za-z0-9_-]+`. Task names and service names share a namespace — no collisions.
 - No cycles in `needs:`.
-- **Always generate a dependency-install task** and make it the root of the `needs:` graph (every service that needs deps gets `needs: [install, …]`). Future boxes start from a snapshot of the final filesystem so they won't need this, but updates or moving to a cloud provider might need to rebuild the container from scratch. The filesystem can be then later captured by `agentbox-ctl checkpoint --set-default`. The task must be **idempotent and self-healing**: `agentbox-ctl` re-runs pending tasks on every box stop/start (the daemon dies with the container and is relaunched), so a plain `rm -rf node_modules && install` would wipe + reinstall on every start. Guard the rebuild with a marker file *inside* `node_modules` (the `.agentbox-installed` convention AgentBox uses internally): rebuild only when the marker is absent (fresh box), and be a fast no-op once it exists. Detect the package manager from the lockfile — never hardcode `pnpm`. See the worked example below.
+- **Always generate a dependency-install task** and make it the root of the `needs:` graph (every service that needs deps gets `needs: [install, …]`). Future boxes start from a snapshot of the final filesystem so they won't need this, but updates or moving to a cloud provider might need to rebuild the container from scratch. The filesystem can be then later captured by `agentbox-ctl checkpoint --set-default`. The task must be **idempotent**: `agentbox-ctl` re-runs pending tasks on every box stop/start (the daemon dies with the container and is relaunched), so an unguarded install would reinstall on every start. The clean way is the **`run_once: true`** field — the supervisor stores a marker keyed by a hash of the command and skips warm boots automatically (the marker lives at `/var/lib/agentbox/tasks/<name>`, on the box rootfs, captured by checkpoints, never polluting `/workspace`). Editing the command re-runs it. Detect the package manager from the lockfile — never hardcode `pnpm`. See the worked example below.
 - **Add a comment to the beginning** of the file to explain what you did and what issues you encountered, so that future run might use this information in case the project evolves and you need to update the agentbox.yaml file.
 ### Stateful services: data persistence & re-seeding (read this for databases)
+**Declare a containerized dependency with the `image:` service form** — AgentBox
+generates the `docker start`-or-`run` shell (no hand-written `docker run … || docker
+start …`). The container runs in the box's dockerd; a published port is reachable
+from other in-box services at `127.0.0.1:<host port>`:
+```yaml
+services:
+  postgres:
+    image:                            # bare string (image: postgres:17-alpine) or a mapping:
+      name: postgres:17-alpine
+      ports: ["5432:5432"]
+      env:
+        POSTGRES_PASSWORD: postgres
+        POSTGRES_DB: app
+      args: "-c max_connections=200"  # string or ["-c","max_connections=200"]
+      container_name: app_db          # optional; default = service name
+    ready_when: { port: 5432 }
+    restart: always
+```
+The container is reused by name across box stop/start. (Changing `image`/`env`
+reuses the existing container as-is; `docker rm <container_name>` + `agentbox-ctl
+reload` to apply.) Install the DB client the migrate/seed tasks need (e.g.
+`postgresql-client`) in the `install` task and reach the DB over TCP — don't
+`docker exec` the container (nested exec fails with a `setns` error in a box).
 **A checkpoint does NOT capture docker-in-docker data.** `agentbox checkpoint` is a `docker commit` of the box's writable filesystem (the system + `/workspace`). The in-box `dockerd` keeps its storage in a *separate* per-box volume (`/var/lib/docker`), which is **not** part of that image — it's fresh on every new box and wiped on `agentbox destroy`. So a database or cache you run as a **docker container** (e.g. `docker run … postgres`) starts **empty on every new box** created from a checkpoint (every `agentbox claude` / `agentbox create`), even though `/workspace` and any marker files you wrote were restored. (A DB run as a **native process** with its data dir on the box filesystem — e.g. `postgres -D /var/lib/postgresql/data` — *is* captured by the checkpoint, since it lives in the writable layer.)
-**Consequence for migrate/seed tasks of a containerized DB: do not gate them on a filesystem marker.** A marker like `node_modules/.agentbox-installed` is correct for deps (they live in `/workspace`, which the checkpoint captures), but **wrong** for DB data living in a docker volume: the marker is restored from the checkpoint while the DB is empty, so a marker-guarded seed wrongly skips and the app boots against an empty database. Instead, **gate on the actual data** — connect to the DB and check whether a sentinel table/row exists, and seed only when it's missing:
+**Consequence for migrate/seed tasks of a containerized DB: do NOT use `run_once: true` (the marker form).** A command-hash marker is correct for deps (they live in `/workspace`, which the checkpoint captures), but **wrong** for DB data living in a docker volume: the marker is restored from the checkpoint while the DB is empty, so a marker-guarded seed wrongly skips and the app boots against an empty database. Instead use the **`run_once: { check: <cmd> }`** form — the probe runs first and the seed runs unless the probe exits 0, and **no marker is written** (the DB is the source of truth). Gate on the actual data:
 ```yaml
   seed:
-    # Re-seed when the DB is empty. The postgres data lives in the in-box
-    # docker volume, which is NOT captured by `agentbox checkpoint` — so a box
-    # started from a checkpoint has the workspace warm but an empty DB. We can't
-    # use a filesystem marker here (it would be restored while the DB is blank);
-    # instead probe the DB and seed only if the data is absent. Fast no-op once
+    # Re-seed when the DB is empty. The postgres data lives in the in-box docker
+    # volume, which is NOT captured by `agentbox checkpoint` — so a box started
+    # from a checkpoint has the workspace warm but an empty DB. The marker form
+    # would be restored while the DB is blank and wrongly skip; the `check` probe
+    # gates on the data itself. Exit 0 = already seeded, skip. Fast no-op once
     # the data is present.
-    command: |
-      set -e
-      export PGPASSWORD=postgres
-      # Probe for existing data. If the table is missing the query errors,
-      # stderr is suppressed, stdout is empty, the grep fails — so we seed.
-      if psql -h 127.0.0.1 -p 5432 -U postgres -d app -tAc \
-          "SELECT EXISTS (SELECT 1 FROM users LIMIT 1)" 2>/dev/null | grep -q t; then
-        echo "data present — skip seed"
-        exit 0
-      fi
-      pnpm db:seed
+    command: pnpm db:seed
     needs: [install, migrate]
+    run_once:
+      check: |
+        export PGPASSWORD=postgres
+        psql -h 127.0.0.1 -p 5432 -U postgres -d app -tAc \
+          "SELECT EXISTS (SELECT 1 FROM users LIMIT 1)" 2>/dev/null | grep -q t
 ```
 **Lifecycle nuance (this is why the data check, not a marker, is right):**
@@ -148,22 +169,19 @@ tasks:
   # Idempotent install. /workspace is the container's writable filesystem, so
   # node_modules persists across pause/stop/start and is captured by
   # `agentbox checkpoint`. The host's node_modules is macOS-native and is
-  # never copied in, so force a clean Linux build the first time — but skip
-  # on every subsequent box start (agentbox-ctl re-runs pending tasks after
-  # stop/start). Adjust the lockfile detection to the project's package
-  # manager.
+  # never copied in, so the first Linux install runs; `run_once: true` then
+  # skips it on every subsequent box start (the supervisor stores a marker
+  # keyed by a hash of the command). Adjust the lockfile detection to the
+  # project's package manager.
   install:
     command: |
       set -e
-      MARKER=node_modules/.agentbox-installed
-      [ -f "$MARKER" ] && { echo "deps installed (marker present) — skip"; exit 0; }
-      apt-get update && apt-get install -y postgresql-client
-      rm -rf node_modules
+      sudo apt-get update && sudo apt-get install -y postgresql-client
       if [ -f pnpm-lock.yaml ]; then
         corepack enable >/dev/null 2>&1 || true
         pnpm install --frozen-lockfile || pnpm install
       fi
-      touch "$MARKER"
+    run_once: true
   migrate:
     command: pnpm db:migrate
@@ -192,6 +210,36 @@ services:
       factor: 2
 ```
+## 6b. Bringing extra host files/folders into the box
+Two ways to copy host files in (both COPY — never a live mount, so the box can't
+write back to the host):
+- **`carry:` block** (declarative, in `agentbox.yaml`) — for files/dirs every box
+  should get at create time. Each entry is `{ src, dest }` with optional `mode`,
+  `user`, `optional`, and `exclude:` (a list of tar globs / bare dir names to drop
+  when copying a directory). Heavy regenerable dirs (`.git`, `node_modules`, `bin`,
+  `obj`, `packages`, `dist`, `.next`, `target`) are dropped by default; `exclude:`
+  is additive. Each carry entry is capped at `box.cpMaxBytes` (default 100 MiB
+  after excludes) — the same limit `agentbox cp` enforces.
+- **`agentbox-ctl cp fromHost <hostPath> <boxPath>`** (ad-hoc, from inside the box)
+  — for a one-off copy. Prompts the user on the host to approve.
+**The per-copy size limit (important for large/legacy folders).** A single copy is
+blocked above `box.cpMaxBytes` (default **100 MB**) *after* default excludes, so it
+fails loud instead of silently hanging. When blocked you get a `du`-style tree of
+the biggest remaining folders/subfolders. To get under the limit, EITHER:
+- **drop what the box can regenerate** (the default excludes already remove
+  `node_modules`/`.git`/build output; add more with `--exclude=<glob-or-name>`), OR
+- **copy the heavy folders one at a time** so each copy is under the limit, OR
+- pass `--yes` to copy the whole thing anyway (only when you really need it all).
+Example: a 2.4 GB legacy folder is mostly `packages/` (NuGet) + `.git`; those are
+excluded by default, and what's left can be split:
+`agentbox-ctl cp fromHost ../legacy/src /workspace/legacy/src` then
+`... cp fromHost ../legacy/Database /workspace/legacy/Database`.
 ## 7. Validate before handing off
 - check with `agentbox-ctl reload` and then `agentbox-ctl status` that everything is running as expected.
@@ -228,6 +276,41 @@ On Vercel: this actually STOPS the sandbox, so warn the user about it. Also the
 - For Nextjs/Vite/Tasnstack projects, makes sure to forward also websocket for hot reload.
-- Service like flask, nextjs, BETTER_AUTH_URL, NEXT_PUBLIC_APP_URL should use the <boxname>.localhost url for the local development so that on the host it will use the same url as the box.
+- Service like flask, nextjs, BETTER_AUTH_URL, NEXT_PUBLIC_APP_URL should use the `<boxname>.localhost` url for the local development so that on the host it will use the same url as the box. Render this automatically instead of hand-writing `sed` — see section 6c.
+- The `install` task above uses `run_once: true`, so it is a no-op on warm boots. Do **not** wrap it in a manual marker check too. To force a one-off rebuild, run `agentbox-ctl run-task install --force` (which bypasses the run_once marker), or edit the command (a changed command invalidates the hash and re-runs).
+## 11. Pin URLs / render config files (env, secrets)
+Many apps hard-code a hostname (e.g. `optima.localhost`) or read a gitignored `.env`. Instead of long `sed` commands in a task, use the built-ins:
+- **`agentbox-ctl render <src>`** — a declarative `sed` for files already in the workspace. `--env` substitutes `{{AGENTBOX_*}}` placeholders; `--rules <name>` applies a named rule-set from the top-level `replacements:` block; `--rule 'from=>to'` / `--rule-regex 'pat=>repl'` are inline. Write to `--out <path>` (or `--in-place`). The whitelist placeholders are `{{AGENTBOX_BOX_NAME}}`, `{{AGENTBOX_BOX_HOST}}` (= `<boxname>.localhost`), `{{AGENTBOX_BOX_ID}}`, `{{AGENTBOX_BOX_KIND}}`, `{{AGENTBOX_HOST_WORKSPACE}}`, `{{AGENTBOX_PROJECT_ROOT}}`.
+  Render a gitignored `.env` from a committed `env.example` on every boot, pinning the URLs to this box:
+  ```yaml
+  replacements:
+    box-host:
+      - { from: 'optima\.localhost', to: '{{AGENTBOX_BOX_HOST}}', regex: true }  # {{AGENTBOX_BOX_HOST}} = <box>.localhost
+  tasks:
+    env:
+      # The render is idempotent (the rules re-pin the same lines every boot), so
+      # no `run_once:` guard is needed — it self-corrects on a checkpoint-started
+      # box that carries a different box's host in .env.
+      command: agentbox-ctl render apps/saas/env.example --out apps/saas/.env --env --rules box-host
+  ```
+  Note: an `run_once: { check: <cmd> }` probe runs verbatim via `bash -c` with the box env — use shell vars like `$AGENTBOX_BOX_NAME`, NOT `{{…}}` placeholders (those are only expanded by `render`/carry, never by the supervisor).
+  **Generated secrets:** put `{{AGENTBOX_AUTO_SECRET}}` in the template for a value like `BETTER_AUTH_SECRET` instead of shelling out to `openssl rand`. Unnamed → a fresh 32-byte base64url secret each render (stable when you render the template→`.env` once). `{{AGENTBOX_AUTO_SECRET:better-auth}}` → generated once, persisted at `/var/lib/agentbox/secrets/<name>`, reused on every render (stable even if you render every boot). Example `env.example` line: `BETTER_AUTH_SECRET="{{AGENTBOX_AUTO_SECRET:better-auth}}"`.
+- **`carry:` + `replaceEnvs`/`replace`/`rules`** — for a host-only file (e.g. a real `.env` with secrets that never lives in the repo), carry it in and render it host-side in one step (file entries only):
-- The `install` task is intentionally a no-op once `node_modules/.agentbox-installed` exists. Do **not** remove the marker guard to "force a fresh install" — that reinstalls on every box start. To force a one-off rebuild, delete `node_modules` (or just the marker) then run `agentbox-ctl reload`.
+  ```yaml
+  carry:
+    - src: ~/secrets/optima.env
+      dest: /workspace/apps/saas/.env
+      replaceEnvs: true
+      rules: [box-host]
+  ```

package/share/host-skills/agentbox-info/SKILL.md CHANGED Viewed

@@ -174,10 +174,29 @@ Wrap step 2 in a loop to babysit a box across many turns. Use the narrow `wait-f
 Implications for you, the host-side agent:
 - Inside the box you can `git commit … && git push` exactly as normal. No setup needed.
-- Pushes are gated host-side: the relay can require a confirm prompt for destructive operations (the user sees it in the dashboard footer, ~25 s TTL). If a push appears to hang, tell the user to check the dashboard.
+- Pushes are gated host-side: the relay can require a confirm prompt for destructive operations (the user sees it in the dashboard footer, ~25 s TTL). If a push appears to hang, it's waiting on this approval — see "Answering host-action approvals" below.
 - The relay process is started lazily by the first `agentbox create` / `agentbox claude` and persists across runs (PID at `~/.agentbox/relay.pid`, log at `~/.agentbox/relay.log`). You normally don't need to manage it.
 - For HTTPS origins (`https://github.com/...`), pushing usually needs a credential — recommend the user run `gh auth login` and `gh auth setup-git` once on the host. After that, host `git push` uses gh's OAuth token automatically. SSH origins (`git@github.com:...`) keep using the host's SSH agent as before.
+## Answering approvals (orchestrator path)
+When you are **orchestrating boxes unattended** (no human watching the dashboard footer), a box blocks on two kinds of approval and `agent approvals` / `agent approve` cover **both**:
+- **Relay host-action approvals** — `git push` / `cp` / `gh pr` write / checkpoint. You answer them yourself; you're a host process that already holds the user's git/file credentials, so approving grants nothing you don't already have.
+- **In-TUI agent prompts** — Claude plan-mode approval, `AskUserQuestion`, a tool-permission dialog. Previously you had to craft `drive keypress` sends by hand; now `approve` enacts the right keystrokes for you.
+```bash
+agentbox agent approvals 1 --json          # list everything box 1 is blocked on: each row has an id + kind
+agentbox agent approve <id>                # answer that exact prompt (default = approve / first option)
+agentbox agent approve <id> --option 2     # in-TUI question/plan: pick option 2 (or --option "Risk first")
+agentbox agent approve <id> --deny         # reject (relay: deny; in-TUI: Escape)
+agentbox agent approvals 1 --wait 600000   # block until something is pending, then act
+```
+`kind` is `host-action` (relay), or `plan` / `question` / `permission` (in-TUI). Relay rows carry `command`/`argv`; `question` rows carry the option labels; `plan` rows the plan body.
+**The id is a safety token — inspect, then approve that exact id.** `approve <id>` answers the specific prompt you listed; if a *different* prompt has since taken its place, the recomputed id won't match and the approve is **refused** (it never answers the wrong thing). So always `approvals` → read the `command`/`argv`/options → `approve <id>`, one at a time. Do not blanket-approve whatever a box asks (that defeats the gate against a prompt-injected box laundering a malicious push), and never hand-`curl` `/admin/prompts/answer` — these commands are the supported surface. In-TUI keystroke mapping is best-effort and TUI-version-sensitive; if an approve doesn't take, fall back to `drive snapshot` + `drive keypress`.
 ## PRs through the host relay (`agentbox-ctl git pr …`)
 In-box agents can drive GitHub PRs from inside a box via the host's `gh` CLI. Same model as `git push`: the box has no GitHub token; the relay shells out to `gh` on the host with the user's authenticated gh identity. Requires `gh` installed on the host and `gh auth login` run once.
@@ -208,6 +227,7 @@ If a PR op appears to hang, tell the user to check the dashboard footer for the
 | `agentbox code [n\|name]` | Open VS Code / Cursor pointed at the box. |
 | `agentbox prepare --provider <name>` | One-time base image / snapshot build for `daytona` or `hetzner` or `vercel`. With no `--provider`, prints status across all providers. |
 | `agentbox prune --provider <name>` | Clean up orphan boxes / images / snapshots for a provider (docker + daytona supported; hetzner pending). |
+| `agentbox cp <src> <dst>` | Copy a file/dir host↔box (`box:/path` prefix picks direction). Heavy dirs (`.git`, `node_modules`, build output) are dropped by default; add `--exclude=<glob\|name>` or `--no-default-excludes`. Uploads over `box.cpMaxBytes` (100 MB, post-exclude) are **blocked** with a size breakdown — trim with `--exclude`, copy heavy folders one at a time, or pass `--yes`. |
 Per-project numeric index (`1`, `2`, …) and friendly name (`review`, `smoke`) both work wherever `<box>` is accepted. Index `1` is the first box created in the current workspace.
@@ -217,7 +237,7 @@ Per-project numeric index (`1`, `2`, …) and friendly name (`review`, `smoke`)
 2. **Use `-i` whenever the user asks for parallel agent work** rather than spawning multiple foreground sessions. Then point them at `agentbox dashboard` to watch progress.
 3. **Pick the provider deliberately.** `docker` is the fast default. `--provider hetzner` gives a real VPS (heavier, isolated, requires `agentbox prepare --provider hetzner` once). `--provider vercel` is the managed cloud option.
 4. **Cross-check before recommending a command.** If a flag isn't listed here, run `agentbox <command> --help` (it's safe and read-only) before suggesting it to the user.
-5. **`/agentbox-setup` is a different skill.** It runs *inside* a box to generate `/workspace/agentbox.yaml`. Don't conflate it with `/agentbox` (host-side fork) or this reference skill.
+5. **`/agentbox-setup` is a different skill.** It runs *inside* a box to generate `/workspace/agentbox.yaml`. Don't conflate it with `/agentbox` (host-side fork) or this reference skill. When authoring `agentbox.yaml`, prefer the declarative `run_once: true` / `run_once: { check }` task field over hand-rolled marker/probe guards, and `agentbox-ctl render` / carry `replaceEnvs` over `sed` for pinning env URLs to `{{AGENTBOX_BOX_HOST}}`.
 ## Reference