@blamejs/core 0.9.19 → 0.9.21

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/CHANGELOG.md CHANGED
@@ -8,6 +8,8 @@ upgrading across more than a few patches at a time.
8
8
 
9
9
  ## v0.9.x
10
10
 
11
+ - v0.9.21 (2026-05-14) — **`b.agent.orchestrator` — framework-level supervisor for every agent blamejs ships.** First slice of the v0.9.21–v0.9.29 substrate playbook that builds before mail-stack resumes. (1) **`b.agent.orchestrator.create(opts)`** — facade with `register(name, agent, opts)` / `lookup(name)` / `unregister(name)` / `list({ kind, tenantId })` registry, `spawnConsumers({ agent, queue, shards, taskTopic, maxConcurrency })` for sharded-topic dispatch (FNV-1a consistent-hash via `b.agent.orchestrator.shardFor(key, shards)`; per-shard topic suffix `<base>.<shard>`), `elect({ resource })` composing `b.cluster` DB-row leader election (returns `{ isLeader, fencingToken, leaderId }` — single-process deployments get a trivial-leader; cluster deployments delegate), `drain({ timeoutMs })` stopping every spawned consumer + audit-emitting elapsed/count, and `health()` aggregating per-agent + per-consumer + per-election state into one shape ready for `b.middleware.healthcheck`. (2) **Pluggable backend** — `{ get, set, delete, list }` interface; in-memory default ships for single-process deployments, operator wires `b.config.loadDbBacked`-shaped or external for restart-survival. (3) **`b.guardAgentRegistry`** — registry-op shape validator. Refuses non-ASCII agent names (NFC + ASCII-only — operator-greppable in audit logs), path-traversal shapes (`..` / `/` / `\` / NUL / C0 / DEL), oversized (default 64 bytes), reserved `FRAMEWORK.*` / `ROOT.*` prefix, duplicate-on-register, register without `agentKind`. Ships `strict` / `balanced` / `permissive` profiles and `hipaa` / `pci-dss` / `gdpr` / `soc2` postures (all pin `strict`). (4) **Drain phase auto-wires into `b.appShutdown`** when operator supplies an `appShutdown` instance; `SIGTERM` delivers a clean drain of all spawned consumers + stream-registry signal before the process exits. (5) **Stream-registry hook** (`registerStream` / `unregisterStream` / `isDraining`) — substrate for v0.9.23 `b.agent.stream` so async-iterable method variants can check the drain flag and emit drain-markers. Substrate is NOT a process supervisor — process spawn / restart-on-crash / pod scheduling delegate to pm2 / systemd / k8s / Nomad; framework doesn't compete. Fuzz harness ships in `fuzz/guard-agent-registry.fuzz.js`. Per the substrate playbook in `memory/specs/blamejs-agent-orchestrator-spec.md`; v0.9.22 idempotency wires on top next.
12
+ - v0.9.20 (2026-05-14) — **`b.mail.agent` — the standardization contract for every mail protocol blamejs ships.** Every above-the-wire mail surface (JMAP at v0.9.27, IMAP at v0.9.28, POP3 at v0.9.29, ManageSieve at v0.9.30, the MX listener at v0.9.24, submission at v0.9.25) translates protocol calls into `agent.X(args)`; RBAC, posture enforcement, audit emission, dispatch, and worker isolation are owned at the agent. (1) **`b.mail.agent.create({ store, audit, permissions, posture, identity, dispatch })`** — facade with 23 methods. Read surface (`search` / `fetch` / `thread` / `folders` / `quota`) is backed by v0.9.19 `b.mailStore` and runs immediately. Move surface (`move` / `flag` / `delete`) is backed by the new `mailStore.moveMessages` substrate; soft-delete moves to Trash + tags `\Deleted` (hard expunge wires at v0.9.28 with retention-floor enforcement). Write surface (`compose` / `send` / `reply` / `forward`), Sieve (`sieve.list/put/activate`), identity (`identity.set` / `vacation.set`), MDN (`mdn.send/parse/allowList`), regulated export, and migration `import` throw `mail-agent/not-implemented` with a `wiredAt` tag naming the slice that lights them up (defer-with-condition per the v1-defensible-scope rule). (2) **Dispatch contract** — `dispatch.mode` is `"local"` / `"queue"` / `"auto"`. `local` runs every method in-process. `queue` publishes envelopes to `mail.agent.tasks` via `b.queue.enqueue`; an `agent.consumer({ agent, queue })` running in a dedicated process or replicas across hosts pulls and executes. Posture metadata travels with each envelope; the consumer re-validates against its own posture before unseal so no posture downgrade survives the queue boundary. `auto` routes fast-path ops (`fetch` / `folders` / `flag` / `quota`) locally and heavy ops (`search` / `export`) to queue/workerPool when configured. (3) **Worker isolation** — `dispatch.workerPool` (composes `b.workerPool`) is validated at create-time; the agent reserves `vaultKeyDelivery: "in-worker"` (default) vs `"main-only"` (posture-conditional — HIPAA/PCI/GDPR default to main-only when the worker-script path wires at v0.9.26 Sieve). (4) **`b.mail.agent.consumer`** — the queue-side facade for multi-host load-spreading; carries its own `store` and re-validates posture at the boundary. (5) **Five new guards** through `b.gateContract` — **`b.guardMailQuery`** (search/fetch filter shape: bounded depth/keys/array-length, function/regex/Buffer/cycle refusal, `__proto__` key refusal, projection-column allowlist via `FILTERABLE_COLUMNS`, posture-required actor fields HIPAA→`purposeOfUse` / PCI→`pciScope` / GDPR→`lawfulBasis`); **`b.guardMailCompose`** (draft envelope: identity-vs-From alignment, recipient deduplication, attachment-byte cap default 25 MiB, body shape — exactly one of text/html unless `allowMultipartAlternative`, C0 control-char refusal in headers); **`b.guardMailReply`** (References-chain cap default 100 defends infinite-loop forwards, In-Reply-To continuity per RFC 5322 §3.6.4 — last References must match In-Reply-To, quoted-original byte cap, forwarded-attachment cardinality cap); **`b.guardMailMove`** (system-folder allowlist for INBOX/Sent/Drafts/Trash/Junk/Archive, admin-scope or `allowedFolders` gate for arbitrary destinations, path-traversal refusal, slash refusal — IMAP `.` hierarchy separator only); **`b.guardMailSieve`** (pre-parser shape-only: script-byte cap default 64 KiB, line-count cap defends one-byte-line bombs, name shape — path-traversal / slash / backslash refusal, actor-ownership check — non-admin actors restricted to their `ownedNames`; full Sieve parse at v0.9.26 via `b.safeSieve`). Each guard ships `strict` / `balanced` / `permissive` profiles and `hipaa` / `pci-dss` / `gdpr` / `soc2` postures (all pin `strict`). (6) **`b.mailStore.moveMessages(fromFolder, toFolder, objectIds)`** — per IMAP4rev2 §6.6.2, both folders bump modseq on move; agent.move composes this. (7) Fuzz harnesses ship for every new guard and the v0.9.19 substrates (`safe-mime` / `guard-message-id`) are now wired into the ClusterFuzzLite matrix.
11
13
  - v0.9.19 (2026-05-14) — **First slice of the blamepost mail-stack sequence — `b.mailStore` + `b.safeMime` + `b.guardMessageId` substrates.** Byte-level mail-store foundation that every above-the-wire mail primitive composes (agent at v0.9.20, MX listener at v0.9.23, submission listener at v0.9.24, JMAP/IMAP/POP3 at v0.9.26-29, ManageSieve at v0.9.30, DAV at v0.9.32). (1) **`b.safeMime`** — RFC 5322 + 2045/2046/2047/EAI MIME parser. Bounded: total parts cap (default 64), nesting-depth cap (default 16), boundary length cap (default 70 per RFC 2046 §5.1.1), header-bytes cap (default 64 KiB), header-line cap (default 998 per RFC 5322 §2.1.1), body-bytes cap (default 25 MiB), message cap (default 50 MiB), charset allowlist (UTF-8 / US-ASCII / common legacy 8-bit), transfer-encoding allowlist (7bit/8bit/binary/qp/base64). Surface: `parse(bytes, opts) → tree`, `walk(tree, visitor)`, `findFirst(tree, predicate)`, `extractText(tree, opts)` (RFC 2046 §5.1.4 last-wins for `multipart/alternative`), `extractAttachments(tree, opts)`. Includes RFC 2047 Q + B encoded-word decoding for `Subject:` / `From:` etc. + RFC 2231 charset'lang'value filename decoding. Throws `safe-mime/<code>` on every cap exceeded / malformed boundary / unknown charset / unknown CTE / control chars in headers / NUL bytes. **Defends CVE-2024-39929** (Exim MIME multipart parser) and **CVE-2025-30258** (gnumail truncated-MIME-tree class). Fuzz harness ships in `fuzz/safe-mime.fuzz.js`. (2) **`b.guardMessageId`** — RFC 5322 §3.6.4 Message-Id validator. Gates Message-Id / In-Reply-To / References at the mail-store append boundary, the MX inbound boundary (v0.9.23), and the submission outbound path (v0.9.24). Refuses oversized (>998 bytes), bare CR/LF/NUL/C0-control/DEL (header-injection defense — defends `From:` / `Bcc:` smuggling via folded Message-Id continuation), unbracketed under strict profile, empty value, missing `@`, nested angle brackets, bidi codepoints (CVE-2021-42574 RTLO class in mail-header context). Profile family: strict (default) / balanced / permissive. Posture family: hipaa / pci-dss / gdpr / soc2 → all pin profile to strict. Surface: `validate(value, opts)`, `validateList(value, opts)` (References-chain cap = 100), `compliancePosture(posture)`. Fuzz harness ships in `fuzz/guard-message-id.fuzz.js`. (3) **`b.mailStore`** — byte-level mail-store substrate with pluggable backend (sqlite default; operator's `b.externalDb` Postgres or any `{ prepare(sql) → { run, get, all } }`-shaped object). Surface: `create(opts)` returning `{ appendMessage, fetchByObjectId, queryByModseq, setFlags, createFolder, listFolders, threadFor, quota, setLegalHold }`. **Sealed by default** via `b.cryptoField.sealRow` — `subject` / `from_addr` / `to_addrs` / `body_text` / `body_html` route through vault-managed AEAD envelope on insert + unseal on fetch. Plaintext (forensic-queryable without unsealing): `objectid` / `modseq` / `internal_date` / `received_at` / `size_bytes` / `flags` / `legal_hold` / `from_hash` / `message_id_hash`. Per-folder monotonic `modseq` counter (RFC 7162 CONDSTORE substrate). Per-message `objectid` (RFC 8474 JMAP cross-protocol identity). Threading at append time via In-Reply-To + References chain walk (cryptoField.lookupHash for hash-aware threading on sealed columns). Quota substrate (per-folder `used_bytes` + `used_count` maintained atomically). Legal-hold flag composes existing `b.legalHold`. Schema bootstraps at construction with six IMAP4rev2 default folders (INBOX / Sent / Drafts / Trash / Junk / Archive) and JMAP role mapping. Append composes `b.safeMime.parse` (bounded inbound) + `b.guardMessageId.validate` (header-injection gate). **Per the operator-confirmed blamepost roadmap** (`memory/specs/blamepost-roadmap.md`); next slice v0.9.20 wires `b.mail.agent` on top of this substrate.
12
14
  - v0.9.18 (2026-05-14) — **18 CodeQL alerts closed across 4 rule classes + SECURITY.md hardening checklist additions for v0.9.13+ primitives + MIGRATING.md out-of-band breaking-changes section.** Post-v0.9.17 audit identified 18 pre-existing CodeQL security findings on `main` — accumulated over many releases, surfaced explicitly when v0.9.15's rename sweep changed line content. v0.9.18 closes them all. (1) **`js/file-system-race` (6 sites)** — TOCTOU between `fs.existsSync()` / `fs.statSync()` and a subsequent file op. Fixed via the framework's canonical TOCTOU-safe-read scaffold (open fd first → `fstatSync` → `readSync` loop → `closeSync` in `finally`) at `lib/atomic-file.js` (`_readSyncCore`), `lib/restore-rollback.js` (marker write switched to exclusive-create `wx` + EEXIST-tolerant), `lib/network-tls.js` (`_readPathFile` extraction with per-file ENOENT tolerance), `lib/backup/bundle.js` (open-fd-first plus required-vs-skip branch routing), `lib/static.js` (request-serve hot path narrowed to single fd). `lib/vault/seal-pem-file.js` retained as-is with a CodeQL suppression — the site has an in-line `lstat.ino === fstat.ino` inode-equality defense (line 290) that refuses with `seal-pem-file/toctou-detected` if an attacker swaps the file between `lstat` and `open`. (2) **`js/insecure-temporary-file` (6 sites)** — predictable temp paths. `lib/vault/rotate.js` now uses `mkdtempSync` for a per-rotation random scratch dir + plain filenames inside (replaces the predictable `_blamejs_rotate.tmp.db` / `_blamejs_verify.tmp.db` paths in `stagingDir`). `lib/mtls-ca.js` switched to exclusive-create `openSync(..., "wx", 0o600)` + `writeSync` + `fsyncSync` so an attacker pre-creating the path is refused at `EEXIST`. `lib/atomic-file.js` (`fsyncDir`), `lib/vault/rotate.js` (`_fsyncFileByPath`), `lib/http-client.js` (atomic tmp path) retained as-is with suppressions — `dirPath` / `p` are operator-supplied framework data paths (not `os.tmpdir`-reachable), and `tmpPath` carries 16 hex chars of crypto-random suffix (line 1802 `dest + ".tmp-" + bCrypto.generateToken(8)`). (3) **`js/path-injection` (2 sites in `lib/static.js`)** — `nodeFs.createReadStream(absPath)` in `_readMeta` (line 161) and the request-serve hot path (line 1115). Suppression comments added referencing the upstream `_resolveSafe` lexical-resolve + `startsWith(rootResolved + nodePath.sep)` + realpath escape check at lines 181-207 — `absPath` is sandbox-validated against `root` before reaching these lines. (4) **`js/remote-property-injection` (4 sites)** — `lib/websocket.js` (`ext.params: {}` → `Object.create(null)`), `lib/middleware/csrf-protect.js` (`var out = {}` → `Object.create(null)` for cookie-parse output). `lib/middleware/body-parser.js` (multipart `fields[currentField] = ...`) retained as-is with suppression — `currentField` is gated upstream at line 867 by `POISONED_KEYS = new Set(["__proto__", "constructor", "prototype"])` refusing the field BEFORE assignment with a 400 BodyParserError. **Plus: SECURITY.md hardening checklist** gains 5 lines covering `b.middleware.idempotencyKey.dbStore` (hash + seal defaults), `b.metrics.snapshot` (out-of-process metrics export), `b.selfUpdate.standaloneVerifier` (zero-dep install-pipeline verifier), `b.pqcAgent.reload` (TLS-posture refresh without restart), `b.crypto.hashFilesParallel` (parallel SBOM/integrity-sweep hashing). **Plus: MIGRATING.md** now carries an "Out-of-band breaking changes" section (the v0.9.15 dbStore schema break is the first entry); `scripts/gen-migrating.js` extended with an `OUT_OF_BAND_BREAKS` table so future schema/on-disk format breaks land in MIGRATING.md without operators needing to grep CHANGELOG.
13
15
  - v0.9.17 (2026-05-14) — **Two new `codebase-patterns` detectors + 192-site cleanup sweep — `node:` prefix consistency + internal-binding leak prevention.** Post-v0.9.16 audit surfaced two enforceable invariants the existing detectors didn't cover. (1) **`node-builtin-prefix` detector** — every `require("<X>")` of a Node built-in (`fs`, `path`, `crypto`, `stream`, `tls`, `url`, `os`, `net`, `http`, `http2`, `https`, `zlib`, `dgram`, `events`, `child_process`, `readline`, …) must use the modern `require("node:<X>")` form. Three reasons: (a) userland packages on npm CAN be named after built-ins, so without the `node:` prefix a typo or `npm install` accident could shadow the built-in; (b) the prefix is a clearer at-a-glance signal that the dependency is on Node, not on a userland module; (c) bundler / SEA static-trace passes treat `node:` prefix as an unambiguous Node-builtin marker. Sweep: 153 `require()` rewrites across 79 framework files (2 parallel agents). The detector skips JSDoc `@example` block continuation lines (`*`-prefixed), so operator-facing examples that show `var fs = require("fs")` aren't rewritten — operators write their own bindings however they prefer. (2) **`internal-binding-in-prose` detector** — internal binding names (`nodeFs` / `nodePath` / `nodeCrypto` / `nodeStream` / `nodeTls` / `nodeUrl` / `bCrypto` / `retryHelper`) must NOT appear in operator-facing surface: JSDoc/comment continuation lines or string literals (error messages, audit metadata). Operators see the public API name (`path` / `fs` / `crypto` / `retry` / …), never the framework's internal alias. Sweep: 39 prose-leak fixes across 16 files — comments rewritten to use the operator-facing word (`nodePath` → `path`, `nodeFs.watch failed` → `fs.watch failed`, debug-log `"op": "nodeFs.unlinkSync"` → `"op": "fs.unlinkSync"`). (3) **2 follow-on require-binding canonicalizations** surfaced by the node-prefix sweep — `lib/ws-client.js` now destructures `var { EventEmitter } = require("node:events")` (was binding the entire `events` module to a class-shaped name) and `lib/process-spawn.js` renames inline `nodeChild` → `childProcess` (matches the module-level `childProcess` lazyRequire in `lib/dev.js`).
package/index.js CHANGED
@@ -160,6 +160,13 @@ var guardHtml = require("./lib/guard-html");
160
160
  var guardSvg = require("./lib/guard-svg");
161
161
  var guardFilename = require("./lib/guard-filename");
162
162
  var guardMessageId = require("./lib/guard-message-id");
163
+ var guardMailQuery = require("./lib/guard-mail-query");
164
+ var guardMailCompose = require("./lib/guard-mail-compose");
165
+ var guardMailReply = require("./lib/guard-mail-reply");
166
+ var guardMailMove = require("./lib/guard-mail-move");
167
+ var guardMailSieve = require("./lib/guard-mail-sieve");
168
+ var guardAgentRegistry = require("./lib/guard-agent-registry");
169
+ var agentOrchestrator = require("./lib/agent-orchestrator");
163
170
  var guardArchive = require("./lib/guard-archive");
164
171
  var guardJson = require("./lib/guard-json");
165
172
  var guardYaml = require("./lib/guard-yaml");
@@ -398,6 +405,13 @@ module.exports = {
398
405
  guardSvg: guardSvg,
399
406
  guardFilename: guardFilename,
400
407
  guardMessageId: guardMessageId,
408
+ guardMailQuery: guardMailQuery,
409
+ guardMailCompose: guardMailCompose,
410
+ guardMailReply: guardMailReply,
411
+ guardMailMove: guardMailMove,
412
+ guardMailSieve: guardMailSieve,
413
+ guardAgentRegistry: guardAgentRegistry,
414
+ agent: { orchestrator: agentOrchestrator },
401
415
  guardArchive: guardArchive,
402
416
  guardJson: guardJson,
403
417
  guardYaml: guardYaml,
@@ -0,0 +1,469 @@
1
+ "use strict";
2
+ /**
3
+ * @module b.agent.orchestrator
4
+ * @nav Agent
5
+ * @title Agent Orchestrator
6
+ * @order 50
7
+ * @featured true
8
+ *
9
+ * @intro
10
+ * Framework-level supervisor for every agent blamejs ships
11
+ * (`b.mail.agent` today; future search-index / AI-classify / DSR /
12
+ * c2pa-watermark agents). The orchestrator owns:
13
+ *
14
+ * - **Registry** (`register` / `lookup` / `unregister` / `list`)
15
+ * — pluggable backend; in-memory default, durable via operator-
16
+ * supplied `b.config.loadDbBacked` for restart-survival. Sealed
17
+ * rows so tenant names + endpoint metadata don't leak in DB
18
+ * dumps.
19
+ * - **Sharded topics** (`spawnConsumers`) — consistent-hash route
20
+ * per-shard so each tenant's traffic owns one shard's ordering.
21
+ * - **Leader-elected singletons** (`elect`) — composes `b.cluster`
22
+ * DB-row election. Operator marks methods that must run on
23
+ * exactly one node (MDN batch dispatch, virus-DB refresh,
24
+ * journal compaction) as singletons.
25
+ * - **Drain** (`drain`) — `consumer.stop()` on every spawned
26
+ * consumer; wait for in-flight envelopes via `b.outbox`; audit.
27
+ * Wires into `b.appShutdown` as a registered phase.
28
+ * - **Health probe** (`health`) — aggregates per-agent + per-
29
+ * consumer + per-election state into one shape for
30
+ * `b.middleware.healthcheck`.
31
+ *
32
+ * The orchestrator is the **in-process supervisor of agents**, NOT
33
+ * the **OS-level supervisor of processes**. Spawn / restart-on-
34
+ * crash / autoscaling / network routing all delegate to pm2 /
35
+ * systemd / k8s / Nomad — the framework doesn't compete.
36
+ *
37
+ * ```js
38
+ * var orch = b.agent.orchestrator.create({
39
+ * audit: b.audit,
40
+ * permissions: myPerms,
41
+ * backend: operatorBackend, // optional; in-memory default
42
+ * });
43
+ *
44
+ * await orch.register("tenant-acme.mail", mailAgent, { agentKind: "mail" });
45
+ * var agent = await orch.lookup("tenant-acme.mail");
46
+ * ```
47
+ *
48
+ * @card
49
+ * The framework-level supervisor for every agent blamejs ships.
50
+ * Registry, sharded topics, leader-elected singletons, drain, and
51
+ * health probe — operators stop wiring these per-agent.
52
+ */
53
+
54
+ var lazyRequire = require("./lazy-require");
55
+ var C = require("./constants");
56
+ var { defineClass } = require("./framework-error");
57
+ var guardAgentRegistry = require("./guard-agent-registry");
58
+ var bCrypto = require("./crypto");
59
+
60
+ var audit = lazyRequire(function () { return require("./audit"); });
61
+ var cluster = lazyRequire(function () { return require("./cluster"); });
62
+
63
+ var AgentOrchestratorError = defineClass("AgentOrchestratorError", { alwaysPermanent: true });
64
+
65
+ var DEFAULT_DRAIN_TIMEOUT_MS = C.TIME.minutes(2);
66
+ var STREAM_ID_RAND_BYTES = 8; // allow:raw-byte-literal — stream-id random-suffix byte length, not a size cap
67
+
68
+ /**
69
+ * @primitive b.agent.orchestrator.create
70
+ * @signature b.agent.orchestrator.create(opts)
71
+ * @since 0.9.21
72
+ * @status stable
73
+ * @related b.mail.agent.create, b.cluster, b.appShutdown
74
+ *
75
+ * Create the orchestrator. Returns a singleton-style facade with
76
+ * registry / spawn / elect / drain / health methods. Operator runs
77
+ * one orchestrator per process; multi-process deployments share
78
+ * coordination via the backing store + `b.cluster`.
79
+ *
80
+ * @opts
81
+ * audit: b.audit namespace, // optional; defaults to b.audit
82
+ * permissions: b.permissions instance, // optional; orchestrator skips RBAC if absent
83
+ * backend: { get, set, delete, list }, // optional; in-memory default
84
+ * cluster: b.cluster module, // optional; defaults to b.cluster
85
+ * appShutdown: b.appShutdown.create() // optional; orchestrator registers drain phase if supplied
86
+ *
87
+ * @example
88
+ * var orch = b.agent.orchestrator.create({});
89
+ * await orch.register("tenant-acme.mail", mailAgent, { agentKind: "mail" });
90
+ * var agent = await orch.lookup("tenant-acme.mail");
91
+ * var folders = await agent.folders({ actor: { id: "u1" } });
92
+ */
93
+ function create(opts) {
94
+ opts = opts || {};
95
+ var backend = opts.backend || _inMemoryBackend();
96
+ if (typeof backend.get !== "function" || typeof backend.set !== "function" ||
97
+ typeof backend.delete !== "function" || typeof backend.list !== "function") {
98
+ throw new AgentOrchestratorError("agent-orchestrator/bad-backend",
99
+ "b.agent.orchestrator.create: backend must expose { get, set, delete, list }");
100
+ }
101
+ var clusterImpl = opts.cluster || cluster();
102
+ var auditImpl = opts.audit || audit();
103
+ var permissions = opts.permissions || null;
104
+
105
+ var ctx = {
106
+ backend: backend,
107
+ cluster: clusterImpl,
108
+ audit: auditImpl,
109
+ permissions: permissions,
110
+ spawnedConsumers: [],
111
+ streams: new Map(),
112
+ elections: new Map(),
113
+ // Live agent objects stay in-process — DB/JSON backends can't
114
+ // serialize function properties. The backend row carries only the
115
+ // operator-supplied metadata (kind / tenantId / posture / ...);
116
+ // every consuming process holds its own runtime map of name → agent.
117
+ liveAgents: new Map(),
118
+ };
119
+
120
+ // Wire the drain phase into b.appShutdown if the operator supplied one.
121
+ if (opts.appShutdown && typeof opts.appShutdown.registerPhase === "function") {
122
+ opts.appShutdown.registerPhase("agent.orchestrator.drain", function () {
123
+ return _drain(ctx, { timeoutMs: DEFAULT_DRAIN_TIMEOUT_MS });
124
+ });
125
+ }
126
+
127
+ return {
128
+ register: function (name, agent, regOpts) { return _register(ctx, name, agent, regOpts || {}); },
129
+ unregister: function (name, args) { return _unregister(ctx, name, args || {}); },
130
+ lookup: function (name, args) { return _lookup(ctx, name, args || {}); },
131
+ list: function (args) { return _list(ctx, args || {}); },
132
+ spawnConsumers: function (args) { return _spawnConsumers(ctx, args || {}); },
133
+ elect: function (args) { return _elect(ctx, args || {}); },
134
+ drain: function (args) { return _drain(ctx, args || {}); },
135
+ health: function () { return _health(ctx); },
136
+ registerStream: function (info) { return _registerStream(ctx, info || {}); },
137
+ unregisterStream: function (streamId) { return _unregisterStream(ctx, streamId); },
138
+ isDraining: function (streamId) { return ctx.draining === true; },
139
+ AgentOrchestratorError: AgentOrchestratorError,
140
+ _ctx: ctx, // test-only introspection
141
+ };
142
+ }
143
+
144
+ // ---- Registry -------------------------------------------------------------
145
+
146
+ async function _register(ctx, name, agent, regOpts) {
147
+ guardAgentRegistry.validate({ kind: "register", name: name, agentKind: regOpts.agentKind }, {});
148
+ _checkPermission(ctx, regOpts.actor, "agent-registry:write");
149
+ if (!agent || typeof agent !== "object") {
150
+ throw new AgentOrchestratorError("agent-orchestrator/bad-agent",
151
+ "register: agent object required");
152
+ }
153
+ var existing = await ctx.backend.get(name);
154
+ if (existing) {
155
+ throw new AgentOrchestratorError("agent-orchestrator/duplicate",
156
+ "register: '" + name + "' already registered; unregister first");
157
+ }
158
+ // Backend row carries operator-supplied serializable metadata only —
159
+ // DB/JSON backends can't preserve function properties. The live agent
160
+ // ref is held in-process via ctx.liveAgents (see ctx init above).
161
+ var row = {
162
+ name: name,
163
+ kind: regOpts.agentKind,
164
+ tenantId: regOpts.tenantId || null,
165
+ posture: regOpts.posture || null,
166
+ registeredAt: Date.now(),
167
+ metadata: regOpts.metadata || {},
168
+ };
169
+ await ctx.backend.set(name, row);
170
+ ctx.liveAgents.set(name, agent);
171
+ _safeAudit(ctx, "agent.orchestrator.registered", regOpts.actor, {
172
+ name: name, agentKind: regOpts.agentKind, tenantId: row.tenantId,
173
+ });
174
+ return { name: name, registeredAt: row.registeredAt };
175
+ }
176
+
177
+ async function _unregister(ctx, name, args) {
178
+ guardAgentRegistry.validate({ kind: "unregister", name: name }, {});
179
+ _checkPermission(ctx, args.actor, "agent-registry:write");
180
+ var row = await ctx.backend.get(name);
181
+ if (!row) {
182
+ throw new AgentOrchestratorError("agent-orchestrator/not-found",
183
+ "unregister: '" + name + "' not registered");
184
+ }
185
+ await ctx.backend.delete(name);
186
+ ctx.liveAgents.delete(name);
187
+ _safeAudit(ctx, "agent.orchestrator.unregistered", args.actor, {
188
+ name: name, agentKind: row.kind,
189
+ });
190
+ return { name: name };
191
+ }
192
+
193
+ async function _lookup(ctx, name, args) {
194
+ guardAgentRegistry.validate({ kind: "lookup", name: name }, {});
195
+ _checkPermission(ctx, args.actor, "agent-registry:read");
196
+ // Live agent ref lives in-process; the backend row exists only as
197
+ // a metadata declaration. In multi-process deployments each process
198
+ // hydrates its own liveAgents map by calling register() locally.
199
+ var agent = ctx.liveAgents.get(name);
200
+ if (agent) return agent;
201
+ var row = await ctx.backend.get(name);
202
+ if (!row) {
203
+ _safeAudit(ctx, "agent.orchestrator.lookup_miss", args.actor, { name: name });
204
+ return null;
205
+ }
206
+ // Backend row exists but no live ref in this process — operator
207
+ // didn't hydrate locally. Surface explicitly so the caller knows
208
+ // to register the agent or route to the process that holds it.
209
+ throw new AgentOrchestratorError("agent-orchestrator/not-hydrated",
210
+ "lookup: '" + name + "' exists in registry but no live agent ref " +
211
+ "in this process — register the agent locally first");
212
+ }
213
+
214
+ async function _list(ctx, args) {
215
+ guardAgentRegistry.validate({ kind: "list" }, {});
216
+ _checkPermission(ctx, args.actor, "agent-registry:read");
217
+ var rows = await ctx.backend.list();
218
+ return rows.filter(function (r) {
219
+ if (args.kind && r.kind !== args.kind) return false;
220
+ if (args.tenantId && r.tenantId !== args.tenantId) return false;
221
+ return true;
222
+ }).map(function (r) {
223
+ return {
224
+ name: r.name, kind: r.kind, tenantId: r.tenantId,
225
+ posture: r.posture, registeredAt: r.registeredAt,
226
+ };
227
+ });
228
+ }
229
+
230
+ // ---- Sharded topic dispatch -----------------------------------------------
231
+
232
+ function _spawnConsumers(ctx, args) {
233
+ if (!args.agent || typeof args.agent !== "object") {
234
+ throw new AgentOrchestratorError("agent-orchestrator/bad-agent",
235
+ "spawnConsumers: agent required");
236
+ }
237
+ if (!args.queue || typeof args.queue.consume !== "function") {
238
+ throw new AgentOrchestratorError("agent-orchestrator/bad-queue",
239
+ "spawnConsumers: queue with .consume() required");
240
+ }
241
+ var shards = typeof args.shards === "number" ? args.shards : 1;
242
+ if (!Number.isInteger(shards) || shards < 1 || shards > 256) { // allow:raw-byte-literal — shard cap
243
+ throw new AgentOrchestratorError("agent-orchestrator/bad-shard-count",
244
+ "spawnConsumers: shards must be an integer in 1..256");
245
+ }
246
+ var topicBase = args.taskTopic || "agent.tasks";
247
+ var consumers = [];
248
+ for (var i = 0; i < shards; i += 1) {
249
+ var topic = shards === 1 ? topicBase : topicBase + "." + i;
250
+ var c = _spawnSingleConsumer(ctx, args.agent, args.queue, topic, args.maxConcurrency || 4);
251
+ consumers.push(c);
252
+ ctx.spawnedConsumers.push(c);
253
+ }
254
+ _safeAudit(ctx, "agent.orchestrator.consumers_spawned", args.actor, {
255
+ shards: shards, topicBase: topicBase, perShardConcurrency: args.maxConcurrency || 4,
256
+ });
257
+ return consumers;
258
+ }
259
+
260
+ function _spawnSingleConsumer(ctx, agent, queue, topic, maxConcurrency) {
261
+ var stopped = false;
262
+ var subscription = null;
263
+ return {
264
+ topic: topic,
265
+ start: async function () {
266
+ if (subscription) {
267
+ throw new AgentOrchestratorError("agent-orchestrator/already-started",
268
+ "consumer for topic '" + topic + "': already started");
269
+ }
270
+ subscription = await queue.consume(topic, async function (envelope) {
271
+ if (stopped) return;
272
+ var method = envelope.method;
273
+ if (!method || typeof agent[method] !== "function") {
274
+ var dotted = method && method.indexOf(".") > 0 ? method.split(".") : null;
275
+ if (dotted && agent[dotted[0]] && typeof agent[dotted[0]][dotted[1]] === "function") {
276
+ return agent[dotted[0]][dotted[1]](envelope.args);
277
+ }
278
+ throw new AgentOrchestratorError("agent-orchestrator/unknown-method",
279
+ "consumer: unknown method '" + method + "'");
280
+ }
281
+ return agent[method](envelope.args);
282
+ }, { maxConcurrency: maxConcurrency });
283
+ },
284
+ stop: async function () {
285
+ stopped = true;
286
+ if (subscription && typeof subscription.unsubscribe === "function") {
287
+ await subscription.unsubscribe();
288
+ }
289
+ subscription = null;
290
+ },
291
+ };
292
+ }
293
+
294
+ /**
295
+ * @primitive b.agent.orchestrator.shardFor
296
+ * @signature b.agent.orchestrator.shardFor(shardKey, shards)
297
+ * @since 0.9.21
298
+ * @status stable
299
+ * @related b.agent.orchestrator.create
300
+ *
301
+ * Consistent-hash router for sharded topic dispatch. Operator passes
302
+ * a stable shard-key (e.g. tenantId or actor.id); orchestrator picks
303
+ * the topic suffix so each tenant's traffic owns one shard's ordering.
304
+ * Uses FNV-1a 32-bit — fast, good distribution for short keys, no
305
+ * cryptographic guarantees (shard routing is not security-bearing).
306
+ * Empty key returns 0; `shards <= 1` always returns 0.
307
+ *
308
+ * @example
309
+ * var shard = b.agent.orchestrator.shardFor("tenant-acme", 8);
310
+ * // → integer in [0, 8)
311
+ */
312
+ function shardFor(shardKey, shards) {
313
+ if (typeof shardKey !== "string" || shardKey.length === 0) return 0;
314
+ if (shards <= 1) return 0;
315
+ // FNV-1a 32-bit — fast + good distribution for short keys.
316
+ var h = 2166136261; // allow:raw-byte-literal — FNV-1a offset basis
317
+ for (var i = 0; i < shardKey.length; i += 1) {
318
+ h ^= shardKey.charCodeAt(i);
319
+ h = (h * 16777619) >>> 0; // allow:raw-byte-literal — FNV-1a prime
320
+ }
321
+ return h % shards;
322
+ }
323
+
324
+ // ---- Leader-elected singletons --------------------------------------------
325
+
326
+ async function _elect(ctx, args) {
327
+ if (typeof args.resource !== "string" || args.resource.length === 0) {
328
+ throw new AgentOrchestratorError("agent-orchestrator/bad-elect-args",
329
+ "elect: resource required");
330
+ }
331
+ // Composes b.cluster's leader-election. When cluster mode is active,
332
+ // delegate; when not, the local node is the trivial leader for this
333
+ // process's lifetime (single-process deployment).
334
+ var isClusterMode = false;
335
+ try { isClusterMode = ctx.cluster.isClusterMode(); } catch (_e) { isClusterMode = false; }
336
+ if (!isClusterMode) {
337
+ // Single-process trivial leader.
338
+ var elec = { isLeader: true, fencingToken: 1, resource: args.resource };
339
+ ctx.elections.set(args.resource, elec);
340
+ _safeAudit(ctx, "agent.orchestrator.elected", args.actor, {
341
+ resource: args.resource, mode: "single-process",
342
+ });
343
+ return elec;
344
+ }
345
+ // Cluster mode: query current leader state via b.cluster.
346
+ var leaderRow = null;
347
+ try { leaderRow = await ctx.cluster.currentLeader(); } catch (_e) { leaderRow = null; }
348
+ var amLeader = false;
349
+ try { amLeader = ctx.cluster.isLeader(); } catch (_e) { amLeader = false; }
350
+ var token = null;
351
+ if (amLeader) {
352
+ try { token = ctx.cluster.fencingToken(); } catch (_e) { token = null; }
353
+ }
354
+ var elec2 = {
355
+ isLeader: amLeader,
356
+ fencingToken: token,
357
+ resource: args.resource,
358
+ leaderId: leaderRow && leaderRow.nodeId ? leaderRow.nodeId : null,
359
+ };
360
+ ctx.elections.set(args.resource, elec2);
361
+ _safeAudit(ctx, "agent.orchestrator.elected", args.actor, {
362
+ resource: args.resource, mode: "cluster",
363
+ amLeader: amLeader, leaderId: elec2.leaderId,
364
+ });
365
+ return elec2;
366
+ }
367
+
368
+ // ---- Drain ----------------------------------------------------------------
369
+
370
+ async function _drain(ctx, args) {
371
+ ctx.draining = true;
372
+ var timeoutMs = typeof args.timeoutMs === "number" ? args.timeoutMs : DEFAULT_DRAIN_TIMEOUT_MS;
373
+ var drained = 0;
374
+ var startedAt = Date.now();
375
+ // Stop every spawned consumer + collect timing.
376
+ for (var i = 0; i < ctx.spawnedConsumers.length; i += 1) {
377
+ var c = ctx.spawnedConsumers[i];
378
+ try { await c.stop(); drained += 1; } catch (_e) { /* best-effort */ }
379
+ if (Date.now() - startedAt > timeoutMs) break;
380
+ }
381
+ // Streams: signal each to wrap up (the streams check ctx.draining
382
+ // and emit a drain-marker themselves; orchestrator just sets the flag).
383
+ var streamCount = ctx.streams.size;
384
+ _safeAudit(ctx, "agent.orchestrator.drained", null, {
385
+ drainedConsumers: drained, totalConsumers: ctx.spawnedConsumers.length,
386
+ streamCount: streamCount, elapsedMs: Date.now() - startedAt,
387
+ });
388
+ return { drained: drained, elapsedMs: Date.now() - startedAt };
389
+ }
390
+
391
+ // ---- Streams (v0.9.23 substrate hook) -------------------------------------
392
+
393
+ function _registerStream(ctx, info) {
394
+ // Stream IDs are cross-tenant-distinguishable; use crypto-grade
395
+ // generateToken to keep them uniformly random across operators.
396
+ var streamId = "stream-" + bCrypto.generateToken(STREAM_ID_RAND_BYTES);
397
+ ctx.streams.set(streamId, {
398
+ streamId: streamId, kind: info.kind || "unknown",
399
+ actor: info.actor || null, startedAt: Date.now(),
400
+ });
401
+ return streamId;
402
+ }
403
+
404
+ function _unregisterStream(ctx, streamId) {
405
+ ctx.streams.delete(streamId);
406
+ }
407
+
408
+ // ---- Health probe ---------------------------------------------------------
409
+
410
+ async function _health(ctx) {
411
+ var rows = await ctx.backend.list();
412
+ var elections = [];
413
+ ctx.elections.forEach(function (v) { elections.push(v); });
414
+ var consumers = ctx.spawnedConsumers.map(function (c) { return { topic: c.topic }; });
415
+ return {
416
+ agents: rows.map(function (r) {
417
+ return { name: r.name, kind: r.kind, tenantId: r.tenantId, registeredAt: r.registeredAt };
418
+ }),
419
+ elections: elections,
420
+ consumers: consumers,
421
+ streams: ctx.streams.size,
422
+ draining: ctx.draining === true,
423
+ overall: ctx.draining ? "draining" : "ok",
424
+ };
425
+ }
426
+
427
+ // ---- Internals ------------------------------------------------------------
428
+
429
+ function _inMemoryBackend() {
430
+ var map = new Map();
431
+ return {
432
+ get: function (k) { return Promise.resolve(map.get(k) || null); },
433
+ set: function (k, v) { map.set(k, v); return Promise.resolve(); },
434
+ delete: function (k) { map.delete(k); return Promise.resolve(); },
435
+ list: function () {
436
+ var out = [];
437
+ map.forEach(function (v) { out.push(v); });
438
+ return Promise.resolve(out);
439
+ },
440
+ };
441
+ }
442
+
443
+ function _checkPermission(ctx, actor, scope) {
444
+ if (!ctx.permissions) return;
445
+ if (!actor || !ctx.permissions.check(actor, scope)) {
446
+ throw new AgentOrchestratorError("agent-orchestrator/permission-denied",
447
+ "actor lacks scope '" + scope + "'");
448
+ }
449
+ }
450
+
451
+ function _safeAudit(ctx, action, actor, metadata) {
452
+ try {
453
+ ctx.audit.safeEmit({
454
+ action: action,
455
+ actor: actor ? { id: actor.id, roles: actor.roles || [] } : { id: "<system>" },
456
+ outcome: action.indexOf("denied") >= 0 || action.indexOf("miss") >= 0 ? "failure" : "success",
457
+ metadata: metadata || {},
458
+ });
459
+ } catch (_e) { /* drop-silent — audit emit failures don't crash the call */ }
460
+ }
461
+
462
+ module.exports = {
463
+ create: create,
464
+ shardFor: shardFor,
465
+ AgentOrchestratorError: AgentOrchestratorError,
466
+ guards: {
467
+ registry: guardAgentRegistry,
468
+ },
469
+ };