npm - switchroom - Versions diffs - 0.7.15 → 0.10.0 - Mend

switchroom 0.7.15 → 0.10.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (301) hide show

package/README.md +51 -59
package/bin/run-hook.sh +27 -11
package/bin/timezone-hook.sh +9 -7
package/dist/agent-scheduler/index.js +410 -133
package/dist/auth-broker/index.js +13932 -0
package/dist/cli/switchroom.js +26937 -5601
package/dist/host-control/main.js +12702 -0
package/dist/vault/approvals/kernel-server.js +467 -184
package/dist/vault/broker/server.js +1430 -724
package/examples/minimal.yaml +63 -0
package/examples/personal-google-workspace-mcp/.env.example +34 -0
package/examples/personal-google-workspace-mcp/README.md +194 -0
package/examples/personal-google-workspace-mcp/compose.yaml +66 -0
package/examples/switchroom.yaml +220 -0
package/package.json +7 -4
package/profiles/_base/settings.json.hbs +20 -5
package/profiles/_base/start.sh.hbs +16 -3
package/profiles/_shared/agent-self-service.md.hbs +126 -0
package/profiles/_shared/telegram-style.md.hbs +20 -90
package/profiles/_shared/vault-protocol.md.hbs +68 -0
package/profiles/default/CLAUDE.md +50 -96
package/profiles/default/CLAUDE.md.hbs +36 -6
package/profiles/default/workspace/SOUL.md.hbs +12 -5
package/skills/buildkite-agent-infrastructure/SKILL.md +30 -11
package/skills/buildkite-agent-runtime/SKILL.md +44 -11
package/skills/buildkite-api/SKILL.md +31 -8
package/skills/buildkite-cli/SKILL.md +27 -9
package/skills/buildkite-migration/SKILL.md +22 -9
package/skills/buildkite-pipelines/SKILL.md +26 -9
package/skills/buildkite-secure-delivery/SKILL.md +23 -9
package/skills/buildkite-test-engine/SKILL.md +25 -8
package/skills/docx/SKILL.md +1 -1
package/skills/docx/scripts/office/validators/__pycache__/__init__.cpython-313.pyc +0 -0
package/skills/docx/scripts/office/validators/__pycache__/base.cpython-313.pyc +0 -0
package/skills/file-bug/SKILL.md +34 -6
package/skills/humanizer/SKILL.md +15 -0
package/skills/humanizer-calibrate/SKILL.md +7 -1
package/skills/mcp-builder/SKILL.md +1 -1
package/skills/pdf/SKILL.md +1 -1
package/skills/pptx/SKILL.md +1 -1
package/skills/skill-creator/SKILL.md +21 -1
package/skills/skill-creator/scripts/__pycache__/__init__.cpython-313.pyc +0 -0
package/skills/skill-creator/scripts/__pycache__/generate_report.cpython-313.pyc +0 -0
package/skills/skill-creator/scripts/__pycache__/improve_description.cpython-313.pyc +0 -0
package/skills/skill-creator/scripts/__pycache__/run_eval.cpython-313.pyc +0 -0
package/skills/skill-creator/scripts/__pycache__/run_loop.cpython-313.pyc +0 -0
package/skills/skill-creator/scripts/__pycache__/utils.cpython-313.pyc +0 -0
package/skills/switchroom-cli/SKILL.md +63 -64
package/skills/switchroom-health/SKILL.md +23 -10
package/skills/switchroom-install/SKILL.md +3 -3
package/skills/switchroom-manage/SKILL.md +26 -19
package/skills/switchroom-runtime/SKILL.md +191 -0
package/skills/switchroom-status/SKILL.md +27 -2
package/skills/telegram-test-harness/SKILL.md +3 -0
package/skills/token-helpers/SKILL.md +24 -1
package/skills/webapp-testing/SKILL.md +31 -1
package/skills/xlsx/SKILL.md +1 -1
package/telegram-plugin/admin-commands/index.ts +7 -5
package/telegram-plugin/analytics-posthog.ts +191 -0
package/telegram-plugin/bridge/bridge.ts +69 -0
package/telegram-plugin/bridge/ipc-client.ts +4 -1
package/telegram-plugin/dist/bridge/bridge.js +194 -119
package/telegram-plugin/dist/gateway/gateway.js +23611 -19671
package/telegram-plugin/dist/server.js +245 -189
package/telegram-plugin/first-paint.ts +3 -24
package/telegram-plugin/gateway/auth-add-flow.ts +326 -0
package/telegram-plugin/gateway/auth-broker-client.ts +75 -0
package/telegram-plugin/gateway/auth-command.ts +794 -0
package/telegram-plugin/gateway/auth-line.ts +123 -0
package/telegram-plugin/gateway/boot-card.ts +169 -40
package/telegram-plugin/gateway/boot-issue-cache.ts +308 -0
package/telegram-plugin/gateway/boot-probes.ts +166 -123
package/telegram-plugin/gateway/boot-reason.ts +41 -7
package/telegram-plugin/gateway/boot-version.ts +66 -0
package/telegram-plugin/gateway/gateway.ts +3499 -1885
package/telegram-plugin/gateway/hostd-dispatch.ts +117 -0
package/telegram-plugin/gateway/ipc-protocol.ts +18 -0
package/telegram-plugin/gateway/pending-inbound-buffer.ts +106 -0
package/telegram-plugin/gateway/quarantine.ts +69 -0
package/telegram-plugin/gateway/quota-cache.ts +9 -4
package/telegram-plugin/gateway/reaction-trigger.ts +401 -0
package/telegram-plugin/gateway/recent-denials.test.ts +103 -0
package/telegram-plugin/gateway/recent-denials.ts +77 -0
package/telegram-plugin/gateway/startup-network-retry.ts +109 -31
package/telegram-plugin/gateway/vault-grant-inbound-builders.ts +125 -0
package/telegram-plugin/history.ts +91 -0
package/telegram-plugin/hooks/hooks.json +10 -0
package/telegram-plugin/hooks/sandbox-hint-posttool.mjs +130 -0
package/telegram-plugin/hooks/subagent-tracker-posttool.mjs +19 -2
package/telegram-plugin/hooks/subagent-tracker-pretool.mjs +22 -2
package/telegram-plugin/hooks/tool-label-pretool.mjs +11 -0
package/telegram-plugin/hooks/wedge-detect-posttool.mjs +303 -0
package/telegram-plugin/inbound-classifier.ts +50 -0
package/telegram-plugin/inline-keyboard-callbacks.ts +136 -0
package/telegram-plugin/node_modules/.vite/vitest/da39a3ee5e6b4b0d3255bfef95601890afd80709/results.json +1 -0
package/telegram-plugin/package.json +4 -2
package/telegram-plugin/permission-rule.ts +51 -0
package/telegram-plugin/permission-title.ts +56 -0
package/telegram-plugin/quota-check.ts +19 -41
package/telegram-plugin/registry/reaper.ts +223 -0
package/telegram-plugin/retry-api-call.ts +80 -0
package/telegram-plugin/runtime-metrics.ts +177 -0
package/telegram-plugin/scripts/build.mjs +0 -1
package/telegram-plugin/secret-detect/index.ts +24 -0
package/telegram-plugin/secret-detect/vault-error.test.ts +64 -12
package/telegram-plugin/secret-detect/vault-error.ts +78 -11
package/telegram-plugin/secret-detect/vault-write.ts +14 -2
package/telegram-plugin/server.js +41795 -0
package/telegram-plugin/session-tail.ts +6 -1
package/telegram-plugin/shared/bot-runtime.ts +5 -4
package/telegram-plugin/silence-poke.ts +420 -0
package/telegram-plugin/silent-end.ts +174 -0
package/telegram-plugin/stream-controller.ts +13 -0
package/telegram-plugin/stream-reply-handler.ts +7 -0
package/telegram-plugin/subagent-watcher.ts +213 -4
package/telegram-plugin/tests/auth-add-flow.test.ts +559 -0
package/telegram-plugin/tests/auth-code-redact.test.ts +8 -4
package/telegram-plugin/tests/auth-command-vernacular.test.ts +531 -0
package/telegram-plugin/tests/boot-card-issue-dedup.test.ts +247 -0
package/telegram-plugin/tests/boot-card-reason-to-render.test.ts +182 -0
package/telegram-plugin/tests/boot-card-reason.test.ts +65 -2
package/telegram-plugin/tests/boot-card-render.test.ts +146 -0
package/telegram-plugin/tests/boot-card-silent-on-operator.test.ts +103 -0
package/telegram-plugin/tests/boot-probes.test.ts +216 -10
package/telegram-plugin/tests/boot-version-string.test.ts +0 -0
package/telegram-plugin/tests/finalize-callback.test.ts +190 -0
package/telegram-plugin/tests/gateway-message-validator.test.ts +26 -0
package/telegram-plugin/tests/gateway-secret-detect.test.ts +12 -3
package/telegram-plugin/tests/gateway-startup-network-retry.test.ts +104 -0
package/telegram-plugin/tests/history-reaper.test.ts +378 -0
package/telegram-plugin/tests/hostd-dispatch.test.ts +129 -0
package/telegram-plugin/tests/inbound-classifier.test.ts +76 -0
package/telegram-plugin/tests/inbound-message-types.test.ts +267 -0
package/telegram-plugin/tests/issues-card.test.ts +49 -0
package/telegram-plugin/tests/pending-inbound-buffer.test.ts +132 -0
package/telegram-plugin/tests/permission-rule.test.ts +80 -1
package/telegram-plugin/tests/permission-title.test.ts +31 -0
package/telegram-plugin/tests/quota-check.test.ts +5 -35
package/telegram-plugin/tests/races.test.ts +179 -0
package/telegram-plugin/tests/reaction-trigger-flow.test.ts +353 -0
package/telegram-plugin/tests/reaction-trigger.test.ts +397 -0
package/telegram-plugin/tests/retry-api-call.test.ts +152 -1
package/telegram-plugin/tests/runtime-metrics.test.ts +145 -0
package/telegram-plugin/tests/sandbox-hint-posttool.test.ts +155 -0
package/telegram-plugin/tests/secret-detect-delete-must-surface-failures.test.ts +133 -0
package/telegram-plugin/tests/secret-detect-false-positives.test.ts +137 -0
package/telegram-plugin/tests/silence-poke.test.ts +493 -0
package/telegram-plugin/tests/silent-end.test.ts +206 -0
package/telegram-plugin/tests/subagent-tracker-hooks.test.ts +107 -0
package/telegram-plugin/tests/subagent-watcher-env-thresholds.test.ts +224 -0
package/telegram-plugin/tests/subagent-watcher-stall-terminal.test.ts +316 -0
package/telegram-plugin/tests/subagent-watcher.test.ts +263 -0
package/telegram-plugin/tests/turn-signal-tracker.test.ts +81 -0
package/telegram-plugin/tests/vault-approval-posture.test.ts +256 -0
package/telegram-plugin/tests/vault-grant-auto-resume.test.ts +73 -0
package/telegram-plugin/tests/vault-grant-inbound-builders.test.ts +226 -0
package/telegram-plugin/tests/vault-grant-union.test.ts +130 -0
package/telegram-plugin/tests/vault-key-regex-allows-slash.test.ts +140 -0
package/telegram-plugin/tests/vault-posture-quarantine.test.ts +104 -0
package/telegram-plugin/tests/vault-request-access-tool.test.ts +114 -0
package/telegram-plugin/tests/vault-request-access-unlock-resume.test.ts +106 -0
package/telegram-plugin/turn-signal-tracker.ts +100 -24
package/telegram-plugin/uat/SETUP.md +210 -35
package/telegram-plugin/uat/assertions.ts +264 -37
package/telegram-plugin/uat/driver-info.ts +57 -0
package/telegram-plugin/uat/driver.ts +590 -51
package/telegram-plugin/uat/harness.ts +140 -94
package/telegram-plugin/uat/load-env.test.ts +72 -0
package/telegram-plugin/uat/load-env.ts +48 -0
package/telegram-plugin/uat/login.ts +96 -53
package/telegram-plugin/uat/runners/agent-self-sufficiency.ts +457 -0
package/telegram-plugin/uat/runners/paraphrases.ts +231 -0
package/telegram-plugin/uat/runners/report.ts +150 -0
package/telegram-plugin/uat/runners/run-agent-self-sufficiency.sh +50 -0
package/telegram-plugin/uat/runners/scorer.test.ts +196 -0
package/telegram-plugin/uat/runners/scorer.ts +106 -0
package/telegram-plugin/uat/runners/skill-coverage.test.ts +100 -0
package/telegram-plugin/uat/runners/skill-coverage.ts +620 -0
package/telegram-plugin/uat/scenarios/ask-user-button-tap-dm.test.ts +141 -0
package/telegram-plugin/uat/scenarios/bg-sub-agent-dispatch-dm.test.ts +191 -0
package/telegram-plugin/uat/scenarios/fuzz-extended-dm.test.ts +255 -0
package/telegram-plugin/uat/scenarios/fuzz-human-style-dm.test.ts +275 -0
package/telegram-plugin/uat/scenarios/fuzz-random-prompts-dm.test.ts +146 -0
package/telegram-plugin/uat/scenarios/fuzz-status-ask-dm.test.ts +486 -0
package/telegram-plugin/uat/scenarios/jtbd-interrupt-marker-dm.test.ts +67 -0
package/telegram-plugin/uat/scenarios/jtbd-rapid-followup-dm.test.ts +100 -0
package/telegram-plugin/uat/scenarios/jtbd-soft-commit-dm.test.ts +67 -0
package/telegram-plugin/uat/scenarios/jtbd-status-query-dm.test.ts +49 -0
package/telegram-plugin/uat/scenarios/location-inbound-dm.test.ts +65 -0
package/telegram-plugin/uat/scenarios/midturn-silent-dm.test.ts +175 -0
package/telegram-plugin/uat/scenarios/reactions-dm.test.ts +142 -0
package/telegram-plugin/uat/scenarios/reactions-trigger-turn-dm.test.ts +96 -0
package/telegram-plugin/uat/scenarios/secret-redaction-deletes-original-dm.test.ts +123 -0
package/telegram-plugin/uat/scenarios/secret-redaction-no-false-positive-dm.test.ts +87 -0
package/telegram-plugin/uat/scenarios/silence-poke-soft-dm.test.ts +155 -0
package/telegram-plugin/uat/scenarios/silent-end-recovery-dm.test.ts +95 -0
package/telegram-plugin/uat/scenarios/smoke-dm-reply.test.ts +57 -0
package/telegram-plugin/uat/scenarios/subagent-watcher-no-rerun-dm.test.ts +135 -0
package/telegram-plugin/uat/scenarios/vault-approval-posture-telegram-id-dm.test.ts +191 -0
package/telegram-plugin/uat/scenarios/vault-audit-allow-dm.test.ts +108 -0
package/telegram-plugin/uat/scenarios/vault-grant-auto-resume-dm.test.ts +121 -0
package/telegram-plugin/uat/scenarios/vault-request-access-concurrent-dm.test.ts +161 -0
package/telegram-plugin/uat/scenarios/vault-request-access-end-to-end-dm.test.ts +158 -0
package/telegram-plugin/uat/scenarios/voice-inbound-dm.test.ts +65 -0
package/telegram-plugin/vault-approval-posture.ts +42 -0
package/telegram-plugin/welcome-text.ts +1 -0
package/telegram-plugin/active-pins-sweep.ts +0 -204
package/telegram-plugin/active-pins.ts +0 -146
package/telegram-plugin/auth-dashboard.ts +0 -1104
package/telegram-plugin/auth-slot-parser.ts +0 -497
package/telegram-plugin/card-event-log.ts +0 -138
package/telegram-plugin/dist/foreman/foreman.js +0 -31106
package/telegram-plugin/docs/multi-agent-card-design.md +0 -847
package/telegram-plugin/docs/pinned-progress-card-reliability.md +0 -144
package/telegram-plugin/foreman/foreman-create-flow.ts +0 -202
package/telegram-plugin/foreman/foreman-handlers.ts +0 -493
package/telegram-plugin/foreman/foreman.ts +0 -1165
package/telegram-plugin/foreman/setup-flow.ts +0 -345
package/telegram-plugin/foreman/setup-state.ts +0 -239
package/telegram-plugin/foreman/state.ts +0 -203
package/telegram-plugin/pin-event-log.ts +0 -76
package/telegram-plugin/progress-card-driver.ts +0 -2886
package/telegram-plugin/progress-card-pin-manager.ts +0 -589
package/telegram-plugin/progress-card-pin-watchdog.ts +0 -98
package/telegram-plugin/progress-card.ts +0 -1409
package/telegram-plugin/tests/HARNESS.md +0 -340
package/telegram-plugin/tests/_progress-card-harness.ts +0 -109
package/telegram-plugin/tests/active-pins-boot-reaper.test.ts +0 -211
package/telegram-plugin/tests/active-pins-sweep.test.ts +0 -309
package/telegram-plugin/tests/active-pins.test.ts +0 -187
package/telegram-plugin/tests/auth-account-identity-surface.test.ts +0 -118
package/telegram-plugin/tests/auth-dashboard-edge-cases.test.ts +0 -260
package/telegram-plugin/tests/auth-dashboard-restart-flow.test.ts +0 -140
package/telegram-plugin/tests/auth-dashboard-v3b.test.ts +0 -559
package/telegram-plugin/tests/auth-dashboard.test.ts +0 -1045
package/telegram-plugin/tests/auth-slot-commands.test.ts +0 -640
package/telegram-plugin/tests/bg-agent-progress-card-757.test.ts +0 -201
package/telegram-plugin/tests/boot-card-account-quota.test.ts +0 -137
package/telegram-plugin/tests/card-event-log.test.ts +0 -145
package/telegram-plugin/tests/first-paint.test.ts +0 -257
package/telegram-plugin/tests/foreman-create-flow.test.ts +0 -359
package/telegram-plugin/tests/foreman-handlers.test.ts +0 -347
package/telegram-plugin/tests/foreman-state.test.ts +0 -164
package/telegram-plugin/tests/foreman-write-ops.test.ts +0 -214
package/telegram-plugin/tests/harness-ordering-invariants.test.ts +0 -243
package/telegram-plugin/tests/pin-event-log.test.ts +0 -124
package/telegram-plugin/tests/progress-card-api-failure-during-deferred.test.ts +0 -73
package/telegram-plugin/tests/progress-card-close-paths-converge.test.ts +0 -272
package/telegram-plugin/tests/progress-card-cross-turn.test.ts +0 -258
package/telegram-plugin/tests/progress-card-delay-842.test.ts +0 -160
package/telegram-plugin/tests/progress-card-dispose-preservepending.test.ts +0 -81
package/telegram-plugin/tests/progress-card-draft-flag.test.ts +0 -80
package/telegram-plugin/tests/progress-card-driver-eviction.test.ts +0 -215
package/telegram-plugin/tests/progress-card-driver-fleet-shadow.test.ts +0 -123
package/telegram-plugin/tests/progress-card-driver-force-complete-parent-done.test.ts +0 -76
package/telegram-plugin/tests/progress-card-edit-timestamps-budget.test.ts +0 -62
package/telegram-plugin/tests/progress-card-memory-bounds.test.ts +0 -84
package/telegram-plugin/tests/progress-card-pin-failure-paths.test.ts +0 -139
package/telegram-plugin/tests/progress-card-pin-manager.test.ts +0 -773
package/telegram-plugin/tests/progress-card-pin-race-fast-turn.test.ts +0 -66
package/telegram-plugin/tests/progress-card-pin-sidecar-partial-write.test.ts +0 -64
package/telegram-plugin/tests/progress-card-pin-watchdog.test.ts +0 -190
package/telegram-plugin/tests/progress-card-sigterm-pin-flush.test.ts +0 -146
package/telegram-plugin/tests/real-gateway-f1-ladder-integrity.test.ts +0 -123
package/telegram-plugin/tests/real-gateway-f2-instant-draft.test.ts +0 -82
package/telegram-plugin/tests/real-gateway-f3-late-card.test.ts +0 -114
package/telegram-plugin/tests/real-gateway-harness.ts +0 -699
package/telegram-plugin/tests/real-gateway-i6-turn-flush-replay-dedup.test.ts +0 -313
package/telegram-plugin/tests/real-gateway-ipc-lifecycle.test.ts +0 -299
package/telegram-plugin/tests/real-gateway-spec.test.ts +0 -487
package/telegram-plugin/tests/real-gateway.smoke.test.ts +0 -101
package/telegram-plugin/tests/setup-flow.test.ts +0 -510
package/telegram-plugin/tests/setup-state.test.ts +0 -146
package/telegram-plugin/tests/sync-chat-running-subagents.test.ts +0 -116
package/telegram-plugin/tests/turn-end-regressions.test.ts +0 -489
package/telegram-plugin/tests/turn-flush-card-takeover.test.ts +0 -218
package/telegram-plugin/tests/turn-flush-prose-recovery.test.ts +0 -78
package/telegram-plugin/tests/two-zone-bg-carry-full-lifecycle.test.ts +0 -131
package/telegram-plugin/tests/two-zone-bg-detection.test.ts +0 -120
package/telegram-plugin/tests/two-zone-bg-done-when-all-terminal.test.ts +0 -116
package/telegram-plugin/tests/two-zone-bg-early-turn-end.test.ts +0 -87
package/telegram-plugin/tests/two-zone-bg-survives-next-turn.test.ts +0 -211
package/telegram-plugin/tests/two-zone-card-cap.test.ts +0 -62
package/telegram-plugin/tests/two-zone-card-fleet-row.test.ts +0 -101
package/telegram-plugin/tests/two-zone-card-header-phases.test.ts +0 -78
package/telegram-plugin/tests/two-zone-card-html-balance.test.ts +0 -110
package/telegram-plugin/tests/two-zone-card-lifecycle.test.ts +0 -128
package/telegram-plugin/tests/two-zone-card-sanitise.test.ts +0 -58
package/telegram-plugin/tests/two-zone-card-snapshot.test.ts +0 -133
package/telegram-plugin/tests/two-zone-concurrent-turns-isolation.test.ts +0 -155
package/telegram-plugin/tests/two-zone-phasefor-precedence.test.ts +0 -117
package/telegram-plugin/tests/two-zone-snapshot-extras.test.ts +0 -187
package/telegram-plugin/tests/two-zone-stuck-edit-throttle.test.ts +0 -149
package/telegram-plugin/tests/two-zone-stuck-header-escalation.test.ts +0 -101
package/telegram-plugin/tests/two-zone-stuck-per-member.test.ts +0 -114
package/telegram-plugin/tests/two-zone-stuck-recovery.test.ts +0 -105
package/telegram-plugin/tests/waiting-ux-harness.ts +0 -381
package/telegram-plugin/tests/waiting-ux.e2e.test.ts +0 -233
package/telegram-plugin/turn-flush-prose-recovery.ts +0 -40
package/telegram-plugin/two-zone-card.ts +0 -269
package/telegram-plugin/uat/scenarios/smoke-clerk-reply.test.ts +0 -61

package/telegram-plugin/turn-signal-tracker.ts CHANGED Viewed

@@ -1,46 +1,82 @@
 /**
- * Per-turn silent-gap tracker for streaming observability.
+ * Per-turn signal + outbound tracker for streaming observability.
  *
- * Tracks the longest contiguous interval within a turn where no user-visible
- * signal was sent. Signals include: progress-card edits, status-reaction
- * transitions, answer-lane updates, and fresh sendMessage calls.
+ * Tracks TWO things, keyed by chatId+threadId:
+ *
+ *   1. **Signal gap** — longest contiguous interval where no user-visible
+ *      signal of ANY kind was sent (progress-card edits, status-reaction
+ *      transitions, answer-lane updates, fresh sendMessage calls). The
+ *      original use case from #203.
+ *
+ *   2. **Outbound messages** (added 2026-05 for the conversational-turn-
+ *      UX redesign, issue #1122) — strictly user-visible MESSAGES that
+ *      the agent sent: `reply`, `stream_reply` first-emits, progress
+ *      card flushes that produce a fresh sendMessage. Status reactions
+ *      and message edits don't count here — they don't ping the device
+ *      and aren't what "outbound silence" means for the KPI.
  *
  * Keyed by chatId+threadId so concurrent turns in different chats don't
- * collide. Designed to be fully standalone (no grammy/bot dependency) so
- * it's testable with deterministic time injection via vi.useFakeTimers().
+ * collide. Fully standalone — no grammy/bot dependency, deterministic
+ * time injection via vi.useFakeTimers().
  *
  * Usage:
- *   signalTracker.reset(key, now)       // at turn start
- *   signalTracker.noteSignal(key, now)  // on every user-visible signal
- *   signalTracker.getLongestGap(key)    // at turn_end
- *   signalTracker.clear(key)            // after emitting (cleanup)
+ *   signalTracker.reset(key, now)                  // at turn start
+ *   signalTracker.noteSignal(key, now)             // any signal (legacy)
+ *   signalTracker.noteOutbound(key, now)           // outbound message only
+ *   signalTracker.getLongestGap(key)               // at turn_end (signal)
+ *   signalTracker.getOutboundMetrics(key)          // at turn_end (KPIs)
+ *   signalTracker.clear(key)                       // after emitting
  */
 export interface TurnSignalState {
-  /** The time the current gap started (i.e., the last signal time). */
+  /** The time the turn began. Used to compute TTFO. */
+  turnStartedAt: number
+  /** Time the current signal gap started (last signal time). */
   lastSignalAt: number
-  /** The longest gap observed so far (ms). */
+  /** Longest signal-gap (any signal) observed so far (ms). */
   longestGapMs: number
+  /** First outbound message timestamp this turn, or null if none yet. */
+  firstOutboundAt: number | null
+  /** Most recent outbound message timestamp, or null. */
+  lastOutboundAt: number | null
+  /** Total outbound messages sent this turn. */
+  outboundCount: number
+  /** Longest gap between consecutive outbound messages (ms). */
+  longestOutboundGapMs: number
+}
+export interface OutboundMetrics {
+  /** ms between turn start and first outbound message; null if none sent. */
+  ttfoMs: number | null
+  /** Total outbound messages this turn. */
+  outboundCount: number
+  /** Longest gap between outbound messages — i.e. the "silent stretch"
+   *  metric for the conversational-pacing KPI. 0 if <2 messages. */
+  longestOutboundGapMs: number
 }
-/**
- * Module-scoped map: `"chatId:threadId"` → state. Using a module-level map
- * keeps the tracker lightweight and avoids passing state through every
- * call-site while remaining mockable in tests via the exported functions.
- */
 const state = new Map<string, TurnSignalState>()
 /**
  * Begin tracking a new turn. Records `now` as the initial signal time and
- * resets the gap accumulator. Call at the start of each fresh turn.
+ * resets the gap accumulator + outbound state. Call at the start of each
+ * fresh turn.
  */
 export function reset(key: string, now: number): void {
-  state.set(key, { lastSignalAt: now, longestGapMs: 0 })
+  state.set(key, {
+    turnStartedAt: now,
+    lastSignalAt: now,
+    longestGapMs: 0,
+    firstOutboundAt: null,
+    lastOutboundAt: null,
+    outboundCount: 0,
+    longestOutboundGapMs: 0,
+  })
 }
 /**
- * Record a user-visible signal. Measures the gap since the last signal and
- * updates `longestGapMs` if this gap is larger.
+ * Record a user-visible signal (any kind: reaction, edit, send). Measures
+ * the gap since the last signal and updates `longestGapMs` if larger.
  */
 export function noteSignal(key: string, now: number): void {
   const entry = state.get(key)
@@ -51,8 +87,31 @@ export function noteSignal(key: string, now: number): void {
 }
 /**
- * Returns the longest gap observed during the current turn (ms).
- * Returns 0 if no tracking state exists for this key.
+ * Record a fresh outbound MESSAGE (reply, stream_reply first-emit, or
+ * card flush that produced a new sendMessage). Updates the
+ * outbound-specific metrics: TTFO on first call, outbound-gap on
+ * subsequent calls.
+ *
+ * Does not double-update the signal-gap stream — callers that note an
+ * outbound message should ALSO call `noteSignal()` to keep the legacy
+ * signal-gap accurate.
+ */
+export function noteOutbound(key: string, now: number): void {
+  const entry = state.get(key)
+  if (entry == null) return
+  if (entry.firstOutboundAt == null) {
+    entry.firstOutboundAt = now
+  } else if (entry.lastOutboundAt != null) {
+    const gap = now - entry.lastOutboundAt
+    if (gap > entry.longestOutboundGapMs) entry.longestOutboundGapMs = gap
+  }
+  entry.lastOutboundAt = now
+  entry.outboundCount += 1
+}
+/**
+ * Returns the longest gap observed during the current turn (ms) — legacy
+ * "any signal" metric. Returns 0 if no tracking state exists for this key.
  */
 export function getLongestGap(key: string): number {
   return state.get(key)?.longestGapMs ?? 0
@@ -67,8 +126,25 @@ export function getLastSignalAt(key: string): number | undefined {
 }
 /**
- * Remove state for this key. Call after emitting the turn_signal_gap metric.
+ * Returns the outbound-message KPI bundle for the conversational-pacing
+ * redesign. Zeroed-out if no tracking state exists.
  */
+export function getOutboundMetrics(key: string): OutboundMetrics {
+  const entry = state.get(key)
+  if (entry == null) {
+    return { ttfoMs: null, outboundCount: 0, longestOutboundGapMs: 0 }
+  }
+  const ttfoMs = entry.firstOutboundAt != null
+    ? entry.firstOutboundAt - entry.turnStartedAt
+    : null
+  return {
+    ttfoMs,
+    outboundCount: entry.outboundCount,
+    longestOutboundGapMs: entry.longestOutboundGapMs,
+  }
+}
+/** Remove state for this key. Call after emitting the turn-end metrics. */
 export function clear(key: string): void {
   state.delete(key)
 }

package/telegram-plugin/uat/SETUP.md CHANGED Viewed

@@ -112,49 +112,224 @@ If the driver account is locked entirely (e.g. SPAM_WAIT), only the
 account owner can resolve it via support@telegram.org. The harness has
 no recourse.
-## 5. Worktree-based agent install (NOT `switchroom agent add`)
+## 5. The `test-harness` agent (Phase 2a — DM focus)
-The UAT harness does **not** persistently install the test-harness
-agent through `switchroom agent add` (which writes a systemd unit + a
-persistent state dir — wrong shape for hermetic test runs). Instead,
-the harness `exec`s the agent as a child process per scenario with:
+Phase 2a tests run against a **persistent** `test-harness` agent
+created once via `switchroom agent add`. This pivots from the epic's
+original child-process-per-scenario plan (written before the Docker
+runtime landed) — the standard runtime now gets us most of the
+hermeticity we want without re-inventing the agent lifecycle. Forum
+topic + per-scenario STATE_DIR isolation rolls in with Phase 2b.
-- `STATE_DIR=$(mktemp -d)` — ephemeral; teardown rm-rfs it.
-- A unique `TELEGRAM_GATEWAY_PORT` (see port allocator note below).
-- `SWITCHROOM_AGENT_NAME=test-harness`.
-- The test bot token loaded from `telegram-test-bot-token`.
+### One-shot agent creation
-The Phase 1 scaffold stubs this out in `harness.ts`; Phase 2 wires it
-end-to-end.
+```bash
+# Resolve the driver's user_id once via mtcute (the helper prints
+# only the integer id to stdout; the session string never appears):
+cd ~/code/switchroom/telegram-plugin
+read -sp "Vault passphrase: " SWITCHROOM_VAULT_PASSPHRASE; echo
+export SWITCHROOM_VAULT_PASSPHRASE
+DRIVER_UID=$(bun uat/driver-info.ts)
+echo "Driver user_id: $DRIVER_UID"
+# Then create the agent. `--topology dm --allow-from $DRIVER_UID`
+# bypasses the @BotFather DM-pair flow and writes the driver's
+# user_id directly into allowFrom — so the bot will respond only
+# to DMs from the driver, never from arbitrary Telegram users
+# (important: the test bot's token is in vault scoped to
+# `test-harness` only, but the bot itself is publicly reachable
+# on Telegram).
+SWITCHROOM_BOT_TOKEN=$(switchroom vault get --no-broker telegram-test-bot-token) \
+  switchroom agent add test-harness \
+    --profile default \
+    --topology dm \
+    --bot-username meken_switchroom_test_bot \
+    --allow-from "$DRIVER_UID"
+unset SWITCHROOM_BOT_TOKEN SWITCHROOM_VAULT_PASSPHRASE
+# Verify the agent is up:
+switchroom agent status test-harness
+```
+`agent add` runs the n+1 wizard: scaffolds the per-agent dir under
+`~/.switchroom/agents/test-harness/`, refreshes the compose file,
+boots the container, runs a preflight. On success the agent is
+running and will reply to DMs from the driver user account.
+> **Hosts upgraded from before #1009.** If you set up the
+> `test-harness` agent on an older CLI build, its
+> `access.json` may carry the two pre-fix shapes — numeric
+> `allowFrom` (silently rejected by the gateway, #1001) and a
+> placeholder `groups: {"-100…"}` entry (404 boot-probe noise,
+> #1002). Both writers were corrected in #1009, but existing
+> scaffolds aren't auto-rewritten. To rebuild a clean access.json
+> on a host that hit the old shapes:
+>
+> ```bash
+> switchroom agent stop test-harness
+> rm ~/.switchroom/agents/test-harness/telegram/access.json
+> switchroom apply       # rewrites access.json via the fixed buildAccessJson
+> switchroom agent start test-harness
+> ```
+>
+> Fresh agent-add invocations on current main don't need this.
+### When this agent should be running
+- During UAT runs: yes. Scenarios fail with `expectMessage` timeouts
+  if the agent isn't responding.
+- Idle: harmless to leave running. It consumes one Claude turn only
+  when DMed by the driver — no scheduled work, no MCP polls.
+### Resetting state between runs
+Phase 2a accepts mild state pollution across scenarios (the agent's
+history accumulates). To reset hard:
-## 6. Port allocator vs unix sockets
+```bash
+switchroom agent stop test-harness
+rm -rf ~/.switchroom/agents/test-harness/state
+switchroom agent start test-harness
+```
+Phase 2b adds per-scenario state-dir scoping so this becomes
+automatic.
+### Optional: force progress-card on every turn (Phase 2c+ card scenarios)
+The gateway's `progress_card.delay_ms` defaults to 45 s, so short DM
+turns (most of UAT) never trigger the pinned card and the card-
+lifecycle scenarios (`progress-card-dm.test.ts`) skip themselves.
+To unskip — and validate `expectPinnedCard` / `waitForCardPhase`
+against real Telegram — override the delay on `test-harness` only:
+Edit `~/.switchroom/switchroom.yaml`, find the `test-harness:`
+block, and add the highlighted lines:
+```yaml
+  test-harness:
+    extends: default
+    topic_name: Test Harness
+    channels:
+      telegram:
+        progress_card:
+          delay_ms: 1000     # short — make every turn flash a card
+```
+Then apply + restart:
+```bash
+switchroom apply
+switchroom agent restart test-harness
+```
-Phase 1 commits to a **process-wide port allocator** (see
-`uat/port-allocator.ts`) rather than unix sockets. Rationale:
+Production agents keep the 45 s default; this override is test-only.
+Once configured, unskip the card scenario by changing
+`describe.skip(...)` → `describe(...)` in
+`scenarios/progress-card-dm.test.ts`.
-- The gateway already speaks IP loopback to the bridge; switching to
-  unix sockets is a code change in `gateway/` we don't want bundled
-  with the UAT scaffold work.
-- Tests only ever run from one harness process, so a node-local
-  monotonic counter starting at a high ephemeral port (default 47000)
-  is enough to avoid collisions with the system + with sibling
-  scenarios in the same run.
-- The allocator also `bind()`s a probe socket and releases it before
-  returning, which catches "port already in use by another process"
-  before the agent boots and produces a confusing crash.
+## 6. Running scenarios — env setup
-If we ever want concurrent harness runs from CI, swap to unix sockets;
-the harness API takes a `transport` shape so it's a one-line change.
+The harness reads four env vars at `spinUp()` time. The recommended
+workflow is to materialise them once into `.env`
+— the harness loads that file automatically on import (see
+`load-env.ts`). The file is gitignored repo-wide (`.env*` in
+`/.gitignore`); never commit a populated copy.
+Vault file perms (root:root 0600) mean the operator can't read
+`vault.enc` directly. Sourcing through the `test-harness` agent
+container — which already has these keys in its ACL — is the
+cleanest path:
+```bash
+cd ~/code/switchroom
+read -sp "Vault passphrase: " SWITCHROOM_VAULT_PASSPHRASE; echo
+export SWITCHROOM_VAULT_PASSPHRASE
+( umask 077 && {
+  echo "TELEGRAM_API_ID=$(docker exec switchroom-test-harness switchroom vault get telegram-uat-api-id)"
+  echo "TELEGRAM_API_HASH=$(docker exec switchroom-test-harness switchroom vault get telegram-uat-api-hash)"
+  echo "TELEGRAM_UAT_DRIVER_SESSION=$(docker exec switchroom-test-harness switchroom vault get telegram-uat-driver-session)"
+  echo "TELEGRAM_TEST_BOT_USERNAME=meken_switchroom_test_bot"
+} > .env )
+unset SWITCHROOM_VAULT_PASSPHRASE
+```
+> `umask 077` in the subshell guarantees the file is never
+> world-readable between creation and the redirection's implicit
+> chmod.
+> The `docker exec` path requires `test-harness` to have the three
+> `telegram-uat-*` keys in its `schedule[*].secrets` ACL (see
+> `~/.switchroom/switchroom.yaml`). If `vault get` returns
+> `VAULT-BROKER-DENIED`, add them and `switchroom apply`. The legacy
+> `vault get --no-broker` path no longer works for non-root operators
+> because the vault file is owned by the broker container's root user.
+After the `.env` is in place, just run the suite — no per-shell
+export dance:
+```bash
+bun test telegram-plugin/uat/scenarios/
+```
+To rotate or refresh the file, repeat the block above. The harness
+prefers existing `process.env` entries over `.env` values, so a
+one-off env override still works (`TELEGRAM_API_ID=99999 bun test ...`).
+The vault passphrase is unset before the test run so a misbehaving
+scenario can't smuggle it into a chat message. The session string in
+`.env` is bearer-equivalent to the driver account — treat the file
+as a long-lived secret.
 ## 7. Verification checklist before running scenarios
-- [ ] `switchroom vault get telegram-test-bot-token` returns a token.
-- [ ] `switchroom vault get telegram-uat-driver-session` returns a
-      session string (the command output may be redacted by the
-      vault — that's fine, you only need exit code 0).
-- [ ] `$SWITCHROOM_UAT_CHAT_ID` exported and is a negative int.
-- [ ] Test bot is admin in the supergroup.
-- [ ] Driver user is admin in the supergroup.
-- [ ] Topics enabled in the supergroup.
+- [ ] `switchroom vault list` shows `telegram-test-bot-token`,
+      `telegram-uat-api-id`, `telegram-uat-api-hash`,
+      `telegram-uat-driver-session` (and `telegram-uat-chat-id` for
+      Phase 2b).
+- [ ] `switchroom agent status test-harness` reports the agent active.
+- [ ] Driver user can DM `@meken_switchroom_test_bot` from Telegram
+      and get a reply (manual sanity check before automating).
+When all three are checked, the env block above + `bun run test:uat`
+is safe to run.
+## 8. CI gate — `:robot: UAT fuzz` Buildkite step
+Since the buildkite gate landed, the fuzz subset of scenarios
+(`fuzz-random-prompts-dm.test.ts`, `fuzz-extended-dm.test.ts`,
+`fuzz-human-style-dm.test.ts`) runs automatically on every PR that
+touches `telegram-plugin/`, `src/agents/`, or `telegram-plugin/uat/`.
+The step runs on a self-hosted Buildkite agent tagged
+`queue=uat-host` that lives on the same box as the `test-harness`
+agent. Secrets come from the Buildkite cluster secret store, not
+from local vault. See `.buildkite/README.md` § "UAT fuzz step" for
+agent setup + secret rotation.
+**Scope (CI):**
+| Scenario | In CI? | Why |
+|---|---|---|
+| `fuzz-random-prompts-dm` | ✅ gates PRs | JTBD-floor invariants; PR #1132. |
+| `fuzz-extended-dm` | ✅ gates PRs | Second-pass categories; PR #1134. |
+| `fuzz-human-style-dm` | ✅ gates PRs | Human-shape inbounds + meaningful-reply floor. |
+| `silent-end-recovery-dm` | ❌ local only | Passes, but the 5-min worst-case budget makes it costly to run every PR. Run nightly + ad-hoc. |
+| `jtbd-status-query-dm` | ❌ local only | Passes; defer to a follow-up that batches the cheap JTBD scenarios. |
+| `jtbd-soft-commit-dm` | ❌ local only | Already budget-tuned but real-Telegram timing flake risk; defer until we have flake telemetry. |
+| `jtbd-interrupt-marker-dm` | ❌ `describe.skip` | Suspected real bug per #1132 overnight. Investigate before unskipping. |
+| `jtbd-rapid-followup-dm` | ❌ `describe.skip` | Suspected real classification bug per #1132 overnight. Investigate before unskipping. |
+| vault / secret-redaction / voice / location / reactions / progress-card | ❌ local only | Need specific surfaces / config overrides not wired into the gate yet. |
+A local `bun run test:uat` runs the full include glob minus the two
+`describe.skip`'d JTBDs.
+## 9. Port allocator vs unix sockets (Phase 1 scaffold note)
-When all six are checked, `bun run test:uat` is safe to run.
+The Phase 1 `port-allocator.ts` is held in reserve for Phase 2b's
+child-process flow — Phase 2a (standard-runtime agent) doesn't need
+it. Kept rather than deleted because the allocator's bind-probe is
+the right shape for what 2b will need.