npm - instar - Versions diffs - 0.28.78 → 0.28.79 - Mend

instar 0.28.78 → 0.28.79


Files changed (65)

package/dashboard/index.html +170 -7
package/dist/commands/init.d.ts.map +1 -1
package/dist/commands/init.js +6 -4
package/dist/commands/init.js.map +1 -1
package/dist/commands/playbook.d.ts.map +1 -1
package/dist/commands/playbook.js +2 -1
package/dist/commands/playbook.js.map +1 -1
package/dist/commands/server.d.ts.map +1 -1
package/dist/commands/server.js +91 -8
package/dist/commands/server.js.map +1 -1
package/dist/commands/setup.d.ts.map +1 -1
package/dist/commands/setup.js +5 -3
package/dist/commands/setup.js.map +1 -1
package/dist/core/Config.d.ts.map +1 -1
package/dist/core/Config.js +2 -1
package/dist/core/Config.js.map +1 -1
package/dist/core/PostUpdateMigrator.d.ts.map +1 -1
package/dist/core/PostUpdateMigrator.js +4 -5
package/dist/core/PostUpdateMigrator.js.map +1 -1
package/dist/core/SessionManager.d.ts +38 -0
package/dist/core/SessionManager.d.ts.map +1 -1
package/dist/core/SessionManager.js +157 -23
package/dist/core/SessionManager.js.map +1 -1
package/dist/core/UpdateChecker.d.ts.map +1 -1
package/dist/core/UpdateChecker.js +3 -1
package/dist/core/UpdateChecker.js.map +1 -1
package/dist/core/UpgradeGuideProcessor.d.ts.map +1 -1
package/dist/core/UpgradeGuideProcessor.js +3 -1
package/dist/core/UpgradeGuideProcessor.js.map +1 -1
package/dist/core/types.d.ts +18 -0
package/dist/core/types.d.ts.map +1 -1
package/dist/core/types.js.map +1 -1
package/dist/lifeline/ServerSupervisor.d.ts.map +1 -1
package/dist/lifeline/ServerSupervisor.js +3 -1
package/dist/lifeline/ServerSupervisor.js.map +1 -1
package/dist/memory/SemanticMemory.d.ts +9 -0
package/dist/memory/SemanticMemory.d.ts.map +1 -1
package/dist/memory/SemanticMemory.js +131 -0
package/dist/memory/SemanticMemory.js.map +1 -1
package/dist/scheduler/JobRunHistory.d.ts +6 -0
package/dist/scheduler/JobRunHistory.d.ts.map +1 -1
package/dist/scheduler/JobRunHistory.js +11 -0
package/dist/scheduler/JobRunHistory.js.map +1 -1
package/dist/scheduler/JobScheduler.d.ts +23 -0
package/dist/scheduler/JobScheduler.d.ts.map +1 -1
package/dist/scheduler/JobScheduler.js +84 -0
package/dist/scheduler/JobScheduler.js.map +1 -1
package/dist/server/routes.d.ts.map +1 -1
package/dist/server/routes.js +56 -0
package/dist/server/routes.js.map +1 -1
package/dist/threadline/ThreadlineBootstrap.d.ts.map +1 -1
package/dist/threadline/ThreadlineBootstrap.js +3 -2
package/dist/threadline/ThreadlineBootstrap.js.map +1 -1
package/dist/threadline/relay/ConnectionManager.d.ts.map +1 -1
package/dist/threadline/relay/ConnectionManager.js +34 -7
package/dist/threadline/relay/ConnectionManager.js.map +1 -1
package/package.json +1 -1
package/scripts/pre-push-gate.js +26 -0
package/src/data/builtin-manifest.json +64 -64
package/upgrades/0.28.79.md +67 -0
package/upgrades/side-effects/0.28.79.md +310 -0
package/upgrades/side-effects/assembler-context-endpoint.md +67 -0
package/upgrades/side-effects/post-update-migrator-path-fix.md +52 -0
package/upgrades/side-effects/semantic-memory-corruption-recovery.md +98 -0
package/upgrades/side-effects/url-pathname-path-encoding-fix.md +45 -0

package/upgrades/side-effects/0.28.79.md ADDED

@@ -0,0 +1,310 @@
+# Side-Effects Review — Topic-binding-aware zombie kill + resume-failure fallback
+**Version / slug:** `zombie-kill-topic-binding`
+**Date:** `2026-05-04`
+**Author:** `Echo`
+**Second-pass reviewer:** `independent-review-subagent (concerns raised + resolved)`
+## Summary of the change
+Closes a two-stage failure mode that drops the user's first message after a
+conversational pause on Telegram-bound (and Slack/iMessage-bound) agents.
+**Root cause traced from Inspec/monroe-workspace logs.** When a Telegram agent
+finishes replying, Claude sits at the prompt waiting for the next user
+message. SessionManager's zombie-killer interprets "idle at prompt + no active
+processes for 15 minutes" as zombie and kills the session. When the user
+finally messages, the bridge tries to respawn with `--resume <UUID>`; the
+saved UUID was captured at kill time and sometimes crashes Claude during
+startup (`Session died during startup`). `waitForClaudeReady` times out, the
+initial message is logged "NOT injected", and the user's message is dropped.
+Five minutes later, the presence proxy fires its `tier-3 — session appears
+stopped` warning. The user has to send "unstick" or re-send to recover.
+**Fix in two layers:**
+- **Layer A — Topic-binding exemption (signal-vs-authority structural
+  exemption).** SessionManager gains an optional `topicBindingChecker`
+  callback. When the zombie-killer is about to act, it consults the checker;
+  if the session is bound to a live messaging topic the kill threshold is
+  raised from 15 minutes to a configurable bound threshold (default 240
+  minutes / 4h). The binding is an authoritative structural fact (the
+  TelegramAdapter's reverse map), not a judgment call. Default chosen to
+  cover normal conversational pauses through a workday without holding
+  per-session resources (Claude TUI ~200-500MB RSS, Anthropic connection)
+  indefinitely. Operators can override via `idlePromptKillMinutesBoundToTopic`.
+- **Layer B — Resume-failure fresh-spawn fallback.** When the readiness
+  probe fails AND tmux died during startup AND the spawn was using
+  `--resume`, SessionManager falls through once to a fresh-spawn carrying the
+  same initial message. A `resumeFailed` event is emitted; the bridge clears
+  the bad UUID from `TopicResumeMap` so the next user-driven respawn doesn't
+  retry the same broken UUID. The bridge listener gates the `remove()` on
+  UUID-equality with the failed UUID — so a fresh spawn that quickly saved a
+  *new* UUID won't have it wiped by a late-firing listener.
+**Files touched:**
+- `src/core/types.ts` — adds `idlePromptKillMinutesBoundToTopic?: number`.
+- `src/core/SessionManager.ts` — adds binding checker, bound threshold getter,
+  binding-aware kill decision, and `handleReadyAndInject` with single-retry
+  fresh-spawn fallback. Emits `resumeFailed` event.
+- `src/commands/server.ts` — wires the binding checker to consult Telegram /
+  Slack / iMessage adapters; subscribes to `resumeFailed` to clear the stale
+  UUID from `TopicResumeMap`.
+- `tests/unit/zombie-kill-topic-binding.test.ts` — new behavioral tests.
+- `tests/unit/spawn-resume-fallback.test.ts` — new behavioral tests.
+## Decision-point inventory
+- `SessionManager` zombie-kill decision (`isActuallyIdle && idleMs > threshold`) — **modified**: threshold is now binding-aware.
+- `SessionManager.spawnInteractiveSession` post-readiness initial-message inject — **modified**: adds a single fresh-spawn fallback when --resume crashes during startup.
+- `commands/server.ts` `injectionDropped` listener — **pass-through**: existing recovery path is preserved; the new `resumeFailed` listener is purely a UUID-cleanup hook, no block/allow surface.
+---
+## 1. Over-block
+**What legitimate inputs does this change reject that it shouldn't?**
+The only block-shaped surface this change touches is "kill vs don't-kill". The
+change *raises* the threshold for topic-bound sessions; it does not block
+anything new. The risk is the inverse of over-block: failing to kill a
+truly-zombied topic-bound session for up to 4h (the default bound threshold).
+Concrete scenario: A topic-bound session whose Claude process hangs internally
+(e.g., infinite loop in the TUI) but stays "alive" by `pane_current_command`
+will not be cleaned up by the zombie-killer for up to 4h. Mitigation: the
+bridge's `isSessionAlive` check on the next user message authoritatively
+detects truly-dead Claude processes and triggers a clean respawn; that is the
+fast path. The 4h threshold only matters for users who never message again.
+---
+## 2. Under-block
+**What failure modes does this still miss?**
+Layer A misses: a topic-bound session whose Claude has stopped responding to
+input but is still showing the prompt and registering as `alive` will not be
+killed promptly. As above — mitigated by user-driven respawn on next message.
+Layer B misses: if the fresh-spawn fallback also crashes during startup (e.g.,
+disk full, claudePath wrong, persistent corruption), we surface a degradation
+event but do not retry again. This is intentional — single retry only — to
+avoid spawn-loops. The bridge's existing `injectionDropped` recovery path
+will pick up the ball on the next inbound message.
+Layer B also does not cover: the case where `--resume` succeeds *enough* for
+tmux to stay alive but Claude itself is broken (won't render the prompt). In
+that case we still fall through to "best-effort inject anyway" preserving the
+prior behavior. That's not a regression.
+---
+## 3. Level-of-abstraction fit
+**Is this at the right layer?**
+Yes. SessionManager owns session lifecycle, so the kill threshold belongs
+there. The binding check is delegated to a callback (the same pattern as
+`subagentChecker` and `activeRecoveryChecker`) so SessionManager stays
+unaware of which messaging platform is asking — it only consumes a yes/no
+binding signal.
+The fresh-spawn fallback also belongs in SessionManager because it owns the
+spawn primitive. The bridge layer only consumes the `resumeFailed` event for
+its own state cleanup (`TopicResumeMap.remove`), which is unique to the
+bridge's responsibility.
+A higher-level alternative would have been to do the retry in the bridge
+(routes.ts `/internal/telegram-forward`). Rejected: that requires either
+refactoring `spawnInteractiveSession` to expose readiness to the caller (big
+churn across 15 callers) or duplicating the spawn-and-await logic in two
+places (drift risk). Keeping it in SessionManager is cheaper and isolates
+the fix to one method body.
+---
+## 4. Signal vs authority compliance
+**Required reference:** [docs/signal-vs-authority.md](../../docs/signal-vs-authority.md)
+**Does this change hold blocking authority with brittle logic?**
+- [x] No — this change has no block/allow surface in the judgment sense.
+The zombie-killer is not a judgment authority — it's a structural cleanup
+mechanism whose behavior is now parameterized by an authoritative structural
+fact (is this session in the topic→session reverse map). Per the principle
+doc:
+> When this principle does NOT apply: Hard-invariant validation … structural
+> validators at the boundary of the system are not decision points in the
+> sense this principle applies to.
+The binding lookup is a hard structural fact ("is this session ID in the
+TelegramAdapter's reverse map?"), not a judgment about what a message
+*means*. There is no LLM, no regex, no similarity score, no token list.
+The fresh-spawn fallback is a recovery flow control, not a decision point on
+content or intent. No principle violation.
+---
+## 5. Interactions
+**Does this interact with existing checks, recovery paths, or infrastructure?**
+- **Shadowing:**
+  - The zombie-killer's existing vetoes (`activeRecoveryChecker`,
+    `subagentChecker`, `pendingInjections`) all run BEFORE the new threshold
+    check. They are unaffected — bound sessions still respect compaction
+    recovery, subagent activity, and pending-injection events.
+  - The new `topicBindingChecker` runs AFTER `idlePromptSince` is established,
+    not before, so the existing first-idle hooks (paste-retry, error-nudge)
+    still fire normally on bound sessions.
+- **Double-fire:**
+  - `resumeFailed` and `injectionDropped` could both fire for the same session
+    if the resume crashes AND the recovered fresh-spawn also fails to
+    inject. In that case, the bridge's `injectionDropped` listener will
+    re-forward the user's text via `/internal/telegram-forward`, which
+    triggers a new spawn. This is the same path that already runs today on
+    crashed sessions; the change does not introduce a new loop.
+- **Races:**
+  - The fresh-spawn fallback inside `handleReadyAndInject` runs after a
+    `kill-session` to clean up any zombie pane. If a concurrent monitor tick
+    is in flight, it could observe the dead pane mid-cleanup and emit
+    `sessionComplete` for the failed session. We mark the failed session
+    `status: 'failed'` BEFORE emitting `resumeFailed` (and before the
+    recursive spawn), so reapers see consistent state through the full
+    handoff.
+  - The `resumeFailed` listener fires AFTER the fresh-spawn may have already
+    saved a new UUID via the proactive 8-second save. To avoid wiping the new
+    UUID, the listener gates `TopicResumeMap.remove(topicId)` on a
+    UUID-equality check (only remove when the stored UUID still matches the
+    failed one). Direct test: `tests/unit/resume-failed-uuid-gate.test.ts`.
+  - `topicBindingChecker` is read-only. No shared mutable state.
+- **Feedback loops:** None. The fresh-spawn fallback is one-shot.
+---
+## 6. External surfaces
+**Does this change anything visible outside the immediate code path?**
+- **Other agents on the same machine:** No. Each agent's SessionManager owns
+  its own kill threshold and its own binding map.
+- **Other users of the install base:** Yes. Every Telegram/Slack/iMessage
+  agent will now hold idle sessions for up to 4h by default instead of
+  cleaning them up at 15 minutes. Memory footprint and Claude API connection
+  count per agent may rise. Users with many concurrent topics on a
+  memory-constrained host can override `idlePromptKillMinutesBoundToTopic` in
+  config.json. The default is conservative for the common case (1-3 topics per agent).
+- **External systems:** No changes to Telegram/Slack/iMessage API surface.
+  Tunnel, GitHub, Cloudflare unaffected.
+- **Persistent state:** `TopicResumeMap` entries are cleared on resume
+  failure (one extra `remove` call per failure). State file format unchanged.
+- **Timing/runtime:** The bound threshold default (240 minutes / 4h) is
+  finite, so sessions cannot accumulate forever. The fresh-spawn fallback adds
+  at most one additional 90-second readiness window per spawn attempt.
+- **Logs:** New log lines on bound-zombie-kill (`(topic-bound, threshold Nm)`),
+  resume failure (`Resume failed for "X" — tmux died during startup. Falling
+  back to fresh spawn.`), and fresh-spawn success/failure. Format is
+  consistent with existing `[SessionManager]` lines.
+---
+## 7. Rollback cost
+**If this turns out wrong in production, what's the back-out?**
+Pure code change. No schema migration, no persistent state shape change, no
+data migration. Rollback path: revert the commit, ship as next patch. Agents
+will resume the prior 15-minute kill threshold on their next server restart.
+No user-visible regression during rollback window — at worst, the user sees
+the old "session appears stopped" pattern they reported.
+The new config option `idlePromptKillMinutesBoundToTopic` falls back to a
+hardcoded default (240), so a rollback that drops the field from disk
+config is a no-op.
+---
+## Conclusion
+This review produced no design changes — both layers passed signal-vs-
+authority compliance and the side-effects review on first read. The change
+is contained to SessionManager and one wiring call in `commands/server.ts`,
+with two new dedicated test files (8 new tests) plus 21 existing
+session-reap-detect tests still passing.
+The change is clear to ship pending second-pass review (required because it
+touches session lifecycle: spawn, kill, recovery).
+---
+## Second-pass review (if required)
+**Reviewer:** independent-review-subagent
+**Independent read of the artifact: concern**
+I concur on layer A's signal-vs-authority compliance and on the overall shape of layer B, but I have specific concerns that should be resolved before ship:
+- **Threshold default 1440m (24h) is too aggressive a swing from 15m.** The healthy waiting state argument is sound, but 24h means each bound session holds a Claude TUI process (~200–500MB RSS) and an Anthropic connection for a full day even if the user never returns. For an agent with 8–10 concurrent Telegram topics on a 16GB host, that's 2–5GB of resident memory locked indefinitely, vs. the prior steady-state where idle topics released within 15m and only re-spawned on the next message. The artifact's mitigation ("config override available") puts the burden on every multi-topic operator to discover the new default and tune it down; the conservative default should solve the reported symptom without the resource cost. **Recommended resolution:** drop default to 240m (4h) — long enough that conversational pauses through normal work hours don't trip the kill, short enough that overnight idle sessions release. Keep the config knob for users who genuinely want 24h.
+- **Cleanup race between proactive UUID save and `resumeFailed` listener is plausible (low-likelihood but real).** Order of operations I traced:
+  1. `spawnSessionForTopic` calls `spawnInteractiveSession` (returns at line 1390 once tmux is created, before readiness probe).
+  2. Caller at server.ts:528 immediately removes the bad UUID from `TopicResumeMap`.
+  3. Caller at server.ts:537–549 schedules a `setTimeout(8s)` proactive UUID save against the same tmux name.
+  4. ~90s later, `handleReadyAndInject` decides resume failed, emits `resumeFailed`, listener tries to remove UUID (no-op — already gone).
+  5. Fallback recursively calls `spawnInteractiveSession` which creates a fresh Claude under the same tmux name.
+  The 8s proactive save fires while the failed Claude is still in startup-crash territory (no hook event yet, `claudeSessionId` empty → save is skipped). That's safe by accident, not by design. If the fresh-spawn fallback finishes quickly enough that a hook event lands before the resumeFailed listener fires, the listener could clear a fresh, valid UUID. The current emit-before-spawn ordering makes this unlikely, but it's not asserted by a test. **Recommended resolution:** add a test that runs the full sequence (proactive save scheduled → resume crash → fallback spawn → fresh hook event lands) and verifies `TopicResumeMap` ends with the *new* UUID, not empty. Or, more defensively, gate the listener's remove on a UUID-equality check (only clear if the stored UUID still matches `info.resumeSessionId`).
+- **Failed-session status update happens AFTER `emit('resumeFailed')`.** Lines 1431–1446 emit the event first, then mark `failed.status = 'failed'`. The artifact §5 Races claims "We mark the failed session `status: 'failed'` before kicking off the fresh spawn so reapers don't re-process it" — but the emit-then-mark order means a concurrent monitor tick that fires between emit and `state.saveSession(failed)` will see `status: 'running'` on a dead pane. The recursive spawn happens after the marking, so the practical impact is small (window is microseconds), but the artifact's claim doesn't match the code. **Recommended resolution:** either move the status update before the emit, or soften the artifact's claim.
+- **Test coverage gaps the artifact undersells.** All 8 new tests use mocked tmux with a single session; none cover (a) the fresh-spawn fallback itself failing — only `DegradationReporter.report` is exercised by code path, never by test, (b) concurrent monitor ticks during fallback (the race the artifact §5 itself flags), (c) multiple bound + unbound sessions on the same manager where the binding checker returns a mix, or (d) the listener's UUID-cleanup interacting with a happy-path remove on the same topic. The "21 existing session-reap-detect tests" don't cover any of this — they predate the change. **Recommended resolution:** add at least the "fallback also fails" test and a "mixed bound/unbound sessions" test before merge.
+- **Minor: ``tmuxSession.replace(`${path.basename(this.config.projectDir)}-`, '')`` at line 1457 is a string-first-occurrence replace.** If the agent's project directory basename happens to appear later in the session name (rare but possible, e.g. project `monroe`, session named `monroe-ai-monroe-debug`), only the first occurrence is stripped, which is correct. But the implicit assumption that `tmuxSession` always begins with `${projectBase}-` isn't enforced: if `name` was originally `null` and `tmuxSession` was `${projectBase}-interactive-${Date.now()}`, the recursive call passes `interactive-${Date.now()}` as `name`, creating a *different* tmux session name on retry (`projectBase-interactive-<sanitized>`). The recursive call would not reuse the same tmux name, breaking the implicit contract that the bridge's session→topic mapping still resolves. **Recommended resolution:** add a guard for the un-named case, or pass the tmuxSession name through more explicitly to ensure name preservation.
+None of these are blockers in the "stop the world" sense — layer A is sound and layer B is a clear improvement on dropping messages. But the threshold default and the cleanup-race test gap warrant a follow-up before this lands on a production-traffic agent.
+---
+### Author's resolution of second-pass concerns
+All five concerns were addressed in this same PR before commit:
+1. **Threshold default lowered from 1440 → 240 minutes (4h).** Source: `src/core/SessionManager.ts:65`. Long enough to cover normal conversational pauses through a workday; short enough to release resources from genuinely abandoned topics. Config knob `idlePromptKillMinutesBoundToTopic` preserved for operators who want a different value. Test updated to assert the new default.
+2. **UUID-equality gate on `resumeFailed` listener.** Source: `src/commands/server.ts` (search `UUID-equality gate`). Listener now reads stored UUID and only calls `remove()` when it matches `info.resumeSessionId`. New test file `tests/unit/resume-failed-uuid-gate.test.ts` covers all four cases: matching, replaced (the race), absent, and missing-topicId.
+3. **Order swapped: failed-status update now happens BEFORE `emit('resumeFailed')`.** Source: `src/core/SessionManager.ts` `handleReadyAndInject`. Artifact §5 claim now matches the code.
+4. **Test gaps closed.** Added: "fresh-spawn fallback also fails → degradation reported", "mixed bound + unbound sessions on the same manager", and the entire UUID-equality gate test file. Total new behavioral tests: 14 (was 8).
+5. **tmuxSession-name reconstruction fixed.** Source: `handleReadyAndInject` now threads the original `name` parameter through and passes it directly to the recursive `spawnInteractiveSession`. The fragile `tmuxSession.replace(prefix, '')` reconstruction is gone — auto-generated `interactive-${ts}` names round-trip correctly.
+Verified by re-running the focused test suite: 81 tests across 7 files passing.
+---
+## Evidence pointers
+**Repro evidence:**
+- `/Users/justin/Documents/Projects/monroe-workspace/logs/server.log`
+  - `2026-05-04T23:39:14Z` — zombie kill of healthy idle session
+  - `2026-05-05T00:48:16Z` — user message arrives, no live session
+  - `2026-05-05T00:48:16Z` — respawn-with-resume attempted (UUID `716881a4-...`)
+  - `2026-05-05T00:48:20Z` — Session died during startup
+  - `2026-05-05T00:48:20Z` — Claude not ready, message NOT injected
+  - 19 prior occurrences of the same `Claude not ready` log line going back to 2026-04-28.
+**Test evidence:**
+- `tests/unit/zombie-kill-topic-binding.test.ts` — 6 tests: unbound kill, bound exemption, bound + over-threshold kill, null-checker, mixed bound+unbound (added per reviewer), default 4h.
+- `tests/unit/spawn-resume-fallback.test.ts` — 4 tests: resume crash → fresh-spawn fallback, no-resume fresh spawn, no-fallback on prompt-detection false negative, both spawns fail → degradation reported (added per reviewer).
+- `tests/unit/resume-failed-uuid-gate.test.ts` — 4 tests (added per reviewer): clear when stored UUID matches, preserve when stored UUID has been replaced (race), no-op when no stored UUID, no-op when no telegramTopicId.
+- 7 related test files (81 tests) all green: `session-manager-behavioral`, `session-reap-detect`, `CompactionSentinel`, `bootstrap-file-threshold`, plus the three new files.
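
The two decision rules described in the review above (the binding-aware idle threshold of Layer A and the UUID-equality gate on the `resumeFailed` listener) can be sketched roughly as follows. This is a minimal illustration, not the package's code: the function names `idleKillThresholdMinutes`, `shouldKillIdleSession`, and `clearStaleResumeUuid`, the `IdleKillOptions` shape, and the use of a plain `Map` in place of `TopicResumeMap` are all assumptions made for the example. Only the 15-minute and 240-minute values, the `idlePromptKillMinutesBoundToTopic` option name, and the `isActuallyIdle && idleMs > threshold` shape come from the review text.

```typescript
// Hypothetical sketch of the rules described in the review; names are illustrative.

const UNBOUND_IDLE_KILL_MINUTES = 15;        // prior threshold, per the review
const DEFAULT_BOUND_IDLE_KILL_MINUTES = 240; // 4h default after second-pass review

// Assumed callback shape: returns true when the session is bound to a live topic.
type TopicBindingChecker = (sessionId: string) => boolean;

interface IdleKillOptions {
  idlePromptKillMinutesBoundToTopic?: number;
  topicBindingChecker?: TopicBindingChecker | null;
}

// Picks the kill threshold: bound sessions get the longer, configurable value.
function idleKillThresholdMinutes(sessionId: string, opts: IdleKillOptions): number {
  const bound = opts.topicBindingChecker?.(sessionId) ?? false;
  if (!bound) return UNBOUND_IDLE_KILL_MINUTES;
  return opts.idlePromptKillMinutesBoundToTopic ?? DEFAULT_BOUND_IDLE_KILL_MINUTES;
}

// Mirrors the `isActuallyIdle && idleMs > threshold` decision named in the review.
function shouldKillIdleSession(
  sessionId: string,
  idleMs: number,
  isActuallyIdle: boolean,
  opts: IdleKillOptions,
): boolean {
  const thresholdMs = idleKillThresholdMinutes(sessionId, opts) * 60_000;
  return isActuallyIdle && idleMs > thresholdMs;
}

// UUID-equality gate: only clear the stored UUID if it is still the failed one,
// so a fresh spawn that already saved a new UUID is left untouched.
function clearStaleResumeUuid(
  resumeMap: Map<number, string>, // stand-in for TopicResumeMap (assumption)
  topicId: number | undefined,
  failedUuid: string,
): boolean {
  if (topicId === undefined) return false;                  // no topic: no-op
  if (resumeMap.get(topicId) !== failedUuid) return false;  // replaced or absent: no-op
  return resumeMap.delete(topicId);
}
```

Under these assumptions, a bound session idle for 20 minutes is kept while an unbound one is reaped, and a late-firing listener cannot wipe a UUID that a fresh spawn has already replaced.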

package/upgrades/side-effects/assembler-context-endpoint.md ADDED

@@ -0,0 +1,67 @@
+# Side-Effects Review — Wire WorkingMemoryAssembler into session context API
+**Version / slug:** `assembler-context-endpoint`
+**Date:** 2026-04-28
+**Author:** gfrankgva (contributor)
+**Second-pass reviewer:** Echo (EchoOfDawn), 3 review rounds
+## Summary of the change
+Two files touched:
+1. `src/commands/server.ts` — WorkingMemoryAssembler construction is moved from line 3258 (before activitySentinel) to after activitySentinel initialization (~line 3475). This enables wiring `episodicMemory` via `activitySentinel.getEpisodicMemory()`, which was previously left as a TODO comment. The assembler now receives both `semanticMemory` and `episodicMemory`, making the 400-token episode budget functional in production. Construction is guarded by `if (semanticMemory || activitySentinel)` — skipped entirely in minimal-config setups where neither memory system is available.
+2. `src/server/routes.ts` — The two assembled-context endpoints (`/topic/context/:topicId?assembled=true` and `/session/context/:topicId`) are refactored to call a shared `assembleAndRespond()` helper instead of duplicating the assembly + response logic. The helper takes the assembler instance, topicId, options, and the Express response object. Auth confirmation is added to the JSDoc for the session context route.
+## Decision-point inventory
+- `WorkingMemoryAssembler` construction order — **modify** (move later in init sequence for dependency availability).
+- `WorkingMemoryAssembler` construction guard — **add** (skip when both `semanticMemory` and `activitySentinel` are undefined).
+- `episodicMemory` wiring — **add** (was commented out, now passed via `activitySentinel?.getEpisodicMemory()`).
+- `assembleAndRespond()` helper — **add** (extracts duplicated assembly logic).
+- Route handlers — **modify** (delegate to shared helper instead of inline assembly).
+---
+## 1. Over-block
+**What legitimate inputs does this change reject that it shouldn't?**
+None. The assembler degrades gracefully when episodicMemory is undefined (sentinel requires sharedIntelligence / LLM key). The helper produces identical output to the previous inline logic. Backwards compatibility is preserved: `?assembled=true` is opt-in, and the raw topic context path is unchanged.
+## 2. Under-block
+**What failure modes does this still miss?**
+If `activitySentinel.getEpisodicMemory()` returns an EpisodicMemory instance that later becomes invalid (e.g., sentinel is stopped mid-session), the assembler would hold a stale reference. However, EpisodicMemory is file-based (JSON under `state/episodes/`), so the instance remains usable even if the sentinel stops producing new digests — it just won't have fresh data.
+## 3. Level-of-abstraction fit
+**Is this at the right layer?**
+Yes. The assembler is a dependency-injected component — it receives its memory sources at construction time. Moving its initialization to the correct point in the dependency graph (after sentinel) is the natural fix. The shared helper is a local function within the route setup closure, keeping the DRY refactor scoped to the routes file.
+## 4. Blocking authority
+- [x] No — these are read-only API endpoints. They do not gate any operation.
+## 5. Interactions
+- **Init ordering**: Assembler now depends on `activitySentinel` being initialized first. If sentinel init fails (sharedIntelligence unavailable), `activitySentinel` is undefined and `getEpisodicMemory()` is not called — assembler gets `episodicMemory: undefined` and degrades gracefully.
+- **Route behavior**: Identical to prior implementation — the helper is a pure extraction refactor.
+## 6. External surfaces
+- **Agents**: Session-start hooks calling `/session/context/:topicId` now receive episode context (recent activity digests, themed episodes) in the assembled output. This is strictly additive — agents get richer context.
+- **Persistent state**: No modifications. Both endpoints are read-only.
+## 7. Rollback cost
+Pure code change. Revert restores the previous inline handlers and removes episodic wiring. No migration or data repair needed.
+---
+## Evidence pointers
+- Typecheck: `tsc --noEmit` — 0 errors.
+- Existing tests (14 integration + 3 E2E) cover both endpoints' happy paths, fallback behavior, and budget surfacing. The shared helper produces identical output, so existing test assertions remain valid.
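
The construction guard and optional episodic wiring described in this review can be pictured with a small sketch. Everything here is hypothetical: the `*Like` interfaces, the `AssemblerDeps` shape, and the `buildAssemblerDeps` function are placeholders invented for illustration, and the real `WorkingMemoryAssembler` constructor is not shown in the diff. Only the guard condition `semanticMemory || activitySentinel` and the `activitySentinel?.getEpisodicMemory()` call are taken from the review text.

```typescript
// Hypothetical stand-ins for the real memory classes (assumptions for this sketch).
interface SemanticMemoryLike { readonly kind: 'semantic' }
interface EpisodicMemoryLike { readonly kind: 'episodic' }
interface ActivitySentinelLike { getEpisodicMemory(): EpisodicMemoryLike }

// Assumed shape of what the assembler would receive at construction time.
interface AssemblerDeps {
  semanticMemory?: SemanticMemoryLike;
  episodicMemory?: EpisodicMemoryLike;
}

// Returns undefined in minimal-config setups where neither memory system exists;
// otherwise wires whatever is available and leaves the rest undefined.
function buildAssemblerDeps(
  semanticMemory?: SemanticMemoryLike,
  activitySentinel?: ActivitySentinelLike,
): AssemblerDeps | undefined {
  if (!semanticMemory && !activitySentinel) return undefined;
  return {
    semanticMemory,
    // Optional chaining: if the sentinel failed to initialize, this stays undefined.
    episodicMemory: activitySentinel?.getEpisodicMemory(),
  };
}
```

The point of the sketch is the ordering dependency: the sentinel has to exist before this function runs, which is why the review moves assembler construction later in the init sequence.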

package/upgrades/side-effects/post-update-migrator-path-fix.md ADDED

@@ -0,0 +1,52 @@
+# Side-Effects Review — PostUpdateMigrator path decoding fix
+**Version / slug:** `post-update-migrator-path-fix`
+**Date:** 2026-04-28
+**Author:** gfrankgva (contributor)
+## Summary of the change
+One file, one line:
+`src/core/PostUpdateMigrator.ts` — `getFreeTextGuardHook()` replaced `path.dirname(new URL(import.meta.url).pathname)` with `__dirname`. The former preserves `%20`-encoded spaces in the filesystem path, causing `fs.readFileSync` to fail when the project directory contains spaces. `__dirname` is already defined at module scope via `fileURLToPath(import.meta.url)`, which properly decodes percent-encoded characters.
+## Decision-point inventory
+- `getFreeTextGuardHook()` path construction — **fix** (replace URL.pathname with __dirname).
+---
+## 1. Over-block
+None. Pure bug fix — strictly widens the set of environments where the function works.
+## 2. Under-block
+None. `__dirname` handles all valid filesystem paths.
+## 3. Level-of-abstraction fit
+Correct. Uses the same `__dirname` already defined at module scope by the file itself.
+## 4. Blocking authority
+- [x] No — this is a path construction fix, not a gate.
+## 5. Interactions
+None. The function is called during hook installation — no racing, no shadowing.
+## 6. External surfaces
+The function returns the content of `free-text-guard.sh`, which is written to `.claude/hooks/`. No behavioral change to the hook content itself.
+## 7. Rollback cost
+Revert restores the bug — `readFileSync` would again fail on paths with spaces.
+---
+## Evidence pointers
+- Typecheck: `tsc --noEmit` — 0 errors.
+- Tests: All 7 `PostUpdateMigrator-buildStopHook` tests pass (were 2/7 failing before this fix).
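
The difference between the two path constructions named in this review can be reproduced with Node's standard `node:url` and `node:path` modules. The sketch below is illustrative only: the file URL is a made-up example, the helper names `dirnameViaPathname` and `dirnameViaFileURLToPath` are not from the package, and it assumes a POSIX host (on Windows, `fileURLToPath` expects a drive-letter URL).

```typescript
import * as path from 'node:path';
import { fileURLToPath } from 'node:url';

// Made-up module URL whose project directory contains a space (assumption).
const moduleUrl = 'file:///tmp/my%20project/dist/core/PostUpdateMigrator.js';

// Old approach: URL.pathname keeps percent-encoding, so "%20" stays in the path.
function dirnameViaPathname(url: string): string {
  return path.dirname(new URL(url).pathname);
}

// Fixed approach: fileURLToPath decodes percent-encoded characters first.
function dirnameViaFileURLToPath(url: string): string {
  return path.dirname(fileURLToPath(url));
}

console.log(dirnameViaPathname(moduleUrl));      // contains "my%20project"
console.log(dirnameViaFileURLToPath(moduleUrl)); // contains "my project"
```

A directory literally named `my%20project` does not exist on disk, which is why a read against the first form fails while the second resolves.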

package/upgrades/side-effects/semantic-memory-corruption-recovery.md ADDED

@@ -0,0 +1,98 @@
+# Side-Effects Review — SemanticMemory corruption detection and auto-recovery
+**Version / slug:** `semantic-memory-corruption-recovery`
+**Date:** 2026-04-27
+**Author:** gfrankgva (contributor)
+**Second-pass reviewer:** Echo (EchoOfDawn), 3 review rounds
+## Summary of the change
+Three files touched:
+1. `src/core/types.ts` — `SemanticMemoryConfig` gains an optional `autoRebuildMaxBytes?: number` field (default 50 MB). No existing code passes this field, so all callers keep current behavior.
+2. `src/memory/SemanticMemory.ts` — `open()` gains an integrity check block mirroring TopicMemory's pattern:
+   - After opening the DB, runs `PRAGMA integrity_check`. If result is not `'ok'`, or if the pragma itself throws (severely corrupt DB), triggers recovery.
+   - **Secondary probe read**: If `integrity_check` passes, reads 100 rows from each existing table. Catches torn interior pages that `integrity_check` misses (pages not reachable from the B-tree schema walk).
+   - Recovery: calls `quarantineCorruptDb()` which renames the DB to `.corrupt.<timestamp>`, removes WAL/SHM sidecars, writes a JSON marker file. Falls back to delete if rename fails.
+   - After schema creation and vector init, checks `_needsRebuild` flag. If JSONL exists and is within the size gate, rebuilds synchronously. If JSONL exceeds `autoRebuildMaxBytes`, logs warning, starts empty, and writes a `skipped-rebuild` marker file.
+3. `tests/unit/semantic-memory-corruption-recovery.test.ts` — Test file with 12 contract-style tests covering: open-without-throwing, quarantine preservation, marker shape, sidecar cleanup (strengthened WAL/SHM assertions), JSONL rebuild, no-JSONL fresh start, healthy-DB no-op, severe-corruption pragma-throws path, partial-corruption (valid header + 4KB corrupted data page in 5000-row DB), size-gate skip, skipped-rebuild marker file, and subsequent-open stability.
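The two-stage detection described in item 2 can be sketched roughly as follows. This is an illustration under stated assumptions, not the code shipped in `SemanticMemory.open()`: the `SqliteLike` interface, the `checkHealth` name, and the table list are hypothetical, and the sketch assumes the database handle returns pragma results as an array of row objects whose first column is `'ok'` on a healthy file.

```typescript
// Hypothetical structural view of a synchronous SQLite handle.
// Assumption: the real handle exposes something compatible with this shape.
interface SqliteLike {
  pragma(source: string): unknown;
  prepare(sql: string): { all(): unknown[] };
}

type Health = 'ok' | 'corrupt';

// Returns 'corrupt' if integrity_check reports anything other than 'ok',
// if the pragma throws, or if a bounded probe read of any table throws.
function checkHealth(db: SqliteLike, tables: string[], probeRows = 100): Health {
  try {
    const rows = db.pragma('integrity_check') as Array<Record<string, unknown>>;
    const first = rows[0];
    const healthy =
      rows.length === 1 && first !== undefined && Object.values(first)[0] === 'ok';
    if (!healthy) return 'corrupt';

    // Secondary probe: read up to probeRows rows from each table.
    // Table names are assumed to come from the schema, never from user input.
    for (const table of tables) {
      db.prepare(`SELECT * FROM "${table}" LIMIT ${probeRows}`).all();
    }
    return 'ok';
  } catch {
    // A throwing pragma or probe read is treated as severe corruption.
    return 'corrupt';
  }
}
```

A caller would quarantine the file and set a rebuild flag whenever the result is `'corrupt'`, and continue with normal pragma setup otherwise.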
+## Decision-point inventory
+- `SemanticMemoryConfig.autoRebuildMaxBytes` — **add** (type: optional number, default 50 MB).
+- `SemanticMemory.open()` integrity check block — **add** (new code path between DB constructor and pragma setup).
+- `SemanticMemory.quarantineCorruptDb()` — **add** (new private method).
+- `SemanticMemory._needsRebuild` — **add** (new private field, transient between integrity check and rebuild).
+- Auto-rebuild size gate — **add** (checks `fs.statSync(jsonlPath).size` against config limit).
+- `SemanticMemory` probe-read block — **add** (secondary detection after integrity_check passes; reads 100 rows from each existing table).
+- `SemanticMemory.writeSkippedRebuildMarker()` — **add** (new private method; writes `.skipped-rebuild.<ts>.marker.json` when size gate triggers).
+---
+## 1. Over-block
+**What legitimate inputs does this change reject that it shouldn't?**
+When JSONL exceeds `autoRebuildMaxBytes` (default 50 MB), the DB starts empty after corruption recovery. This means an operator with a large knowledge graph (> ~500k entities) would need to trigger `importFromJsonl()` manually after startup. This is deliberate — blocking server startup for minutes on a synchronous import is worse than starting with empty memory. The operator can rebuild during a maintenance window.
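The size gate described above might look something like the sketch below. The function name, the decision labels, and the reading of "50 MB" as 50 × 1024 × 1024 bytes are assumptions made for illustration; the review only states that `fs.statSync(jsonlPath).size` is compared against `autoRebuildMaxBytes`. Whether a file exactly at the limit is rebuilt is also an assumption here.

```typescript
import * as fs from 'node:fs';

type RebuildDecision = 'rebuild' | 'skip-too-large' | 'fresh-start';

// Assumption: "50 MB" is interpreted as 50 MiB in this sketch.
const DEFAULT_AUTO_REBUILD_MAX_BYTES = 50 * 1024 * 1024;

// Decide what to do after a corrupt DB has been quarantined.
function decideRebuild(
  jsonlPath: string,
  maxBytes: number = DEFAULT_AUTO_REBUILD_MAX_BYTES,
): RebuildDecision {
  // No JSONL log at all: nothing to rebuild from, start with an empty DB.
  if (!fs.existsSync(jsonlPath)) return 'fresh-start';

  // Within the gate: rebuild synchronously. Over the gate: start empty and
  // leave the import to the operator.
  const size = fs.statSync(jsonlPath).size;
  return size <= maxBytes ? 'rebuild' : 'skip-too-large';
}
```

On `'skip-too-large'` the review says a warning is logged and a `skipped-rebuild` marker is written, so the operator can later see why memory started empty.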
+The integrity check itself runs on every `open()` call, adding measurable startup latency for large DBs. TopicMemory pays the same cost, so consistency wins. If semantic DBs grow very large, a `quick_check` pragma (subset of integrity_check) could be a future optimization.
+## 2. Under-block
+**What failure modes does this still miss?**
+- **Mid-session corruption**: Only detected on `open()`. If a disk block goes bad during a running session, individual SQLite operations will throw but no automatic recovery triggers. Mid-session recovery would require connection pooling or shadow-DB switching, which is out of scope for this PR.
+- **Probe-read coverage**: The secondary probe reads 100 rows from each non-FTS table. Very large tables with corruption only in pages beyond the first 100 rows could theoretically pass the probe. In practice, 100 rows span multiple 4KB pages, making this unlikely. Full table scans at startup would have unacceptable latency on large DBs.
+- **JSONL truncation**: If the JSONL was itself truncated (disk-full event during a write), the rebuild will be partial — some entities may be missing. The `importFromJsonl()` method handles malformed lines gracefully (skips them), so the rebuild is best-effort. The quarantined DB is preserved for forensic comparison.
+- **Writes not flushed to JSONL**: All mutation paths in SemanticMemory go through `remember()` / `addEdge()` which write to JSONL first (append), then to DB. The JSONL is the source of truth. There is no path where the DB is written first.
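The best-effort behavior attributed to `importFromJsonl()` above (skipping malformed lines, for example a final line cut off by a disk-full event) can be illustrated with a small parser. This is a sketch of the general technique only: `parseJsonlTolerant` and its return shape are hypothetical, and the real record format is not shown in this review.

```typescript
interface ParsedJsonl {
  records: unknown[]; // successfully parsed lines, in file order
  skipped: number;    // count of non-empty lines that failed to parse
}

// Parse newline-delimited JSON, skipping lines that are not valid JSON
// instead of aborting the whole import.
function parseJsonlTolerant(text: string): ParsedJsonl {
  const records: unknown[] = [];
  let skipped = 0;
  for (const line of text.split('\n')) {
    const trimmed = line.trim();
    if (trimmed === '') continue; // blank lines carry no data
    try {
      records.push(JSON.parse(trimmed));
    } catch {
      skipped += 1; // e.g. a truncated final line
    }
  }
  return { records, skipped };
}
```

Reporting the skipped count alongside the records gives an operator a rough signal of how partial the rebuild was, which complements the quarantined DB kept for forensic comparison.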
+## 3. Level-of-abstraction fit
+**Is this at the right layer?**
+Yes. SemanticMemory owns its DB lifecycle — `open()` is the correct place for integrity checks, matching TopicMemory's pattern. The quarantine logic is a private method, not exposed to callers. The size-gate config is on the existing `SemanticMemoryConfig` interface, which is the established place for tuning knobs.
+The alternative (adding a "health check" service layer above SemanticMemory) would scatter recovery logic across modules and require SemanticMemory to expose its DB state — worse encapsulation.
+## 4. Blocking authority
+- [x] No — this is a startup-time recovery mechanism. It does not gate any runtime operation. The only "decision" is quarantine-vs-keep, which is always quarantine (corruption is binary).
+## 5. Interactions
+- **Shadowing:** No existing corruption detection to shadow — SemanticMemory had none before this PR.
+- **Double-fire:** `_needsRebuild` is reset after the rebuild block. A second `open()` on the recovered DB is a no-op (tested).
+- **Races:** `open()` is async but the integrity check is synchronous (better-sqlite3 is sync). No concurrent access during startup.
+- **Downstream consumers:** Callers of `SemanticMemory.open()` (currently only `src/commands/server.ts`) see no behavioral change on healthy DBs. On corrupt DBs, `open()` succeeds instead of potentially throwing — strictly better.
+## 6. External surfaces
+- **Agents:** After corruption recovery, the knowledge graph is either rebuilt from JSONL (common case) or starts empty (large JSONL). Agents may notice "fewer memories", but the server stays up, which is preferable to a crash loop.
+- **File system:** New files created during recovery: `.corrupt.<ts>` (quarantined DB), `.corrupt-recovery.<ts>.marker.json` (recovery marker), and, when the size gate skips the rebuild, `.skipped-rebuild.<ts>.marker.json`. These accumulate over time, so an operator might want periodic cleanup, but each occurrence is exceptional (disk errors).
+- **Persistent state:** The JSONL append log is never modified — only read during rebuild. The SQLite DB is replaced (quarantined + fresh). No other persistent state is touched.
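The file-system effects listed above could be produced along these lines. This is a sketch, not the actual `quarantineCorruptDb()`: the function name, the marker fields, and the assumption that the suffixes are appended to the DB path are all hypothetical, and it uses plain `fs` calls for self-containment where the real code is described as routing deletions through `SafeFsExecutor`.

```typescript
import * as fs from 'node:fs';

// Move a corrupt DB aside, drop its sidecars, and record what happened.
// Returns the path of the marker file that was written.
function quarantineSketch(dbPath: string, reason: string, now: number = Date.now()): string {
  const quarantined = `${dbPath}.corrupt.${now}`;
  let preserved = true;
  try {
    fs.renameSync(dbPath, quarantined);
  } catch {
    // Fallback described in the review: delete if the rename fails.
    fs.rmSync(dbPath, { force: true });
    preserved = false;
  }

  // WAL/SHM sidecars belong to the old file and must not attach to a fresh DB.
  for (const ext of ['-wal', '-shm']) {
    fs.rmSync(dbPath + ext, { force: true });
  }

  // Marker fields are illustrative; the real marker shape is not shown here.
  const markerPath = `${dbPath}.corrupt-recovery.${now}.marker.json`;
  const marker = {
    reason,
    quarantinedPath: preserved ? quarantined : null,
    recoveredAt: new Date(now).toISOString(),
  };
  fs.writeFileSync(markerPath, JSON.stringify(marker, null, 2));
  return markerPath;
}
```

Because every recovery leaves a timestamped pair of files behind, a periodic cleanup job can match on the `.corrupt.` and `.marker.json` suffixes without touching the live DB or the JSONL log.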
+## 7. Rollback cost
+Pure code change. Revert removes the integrity check — corrupt DBs would again cause `open()` to either throw or silently serve bad data. No migration, no data repair needed on rollback.
+---
+## 8. Destructive-tool containment compliance
+`quarantineCorruptDb()` deletes the corrupt DB's WAL/SHM sidecars and, when the rename fails, the DB itself. Per the Comprehensive Destructive-Tool Containment spec (PRs #98/#99), all destructive filesystem calls must go through `SafeFsExecutor`, so the direct `fs.unlinkSync` calls were replaced:
+- `fs.unlinkSync(this.config.dbPath)` → `SafeFsExecutor.safeUnlinkSync(this.config.dbPath, { operation: 'SemanticMemory.quarantineCorruptDb' })`
+- `fs.unlinkSync(this.config.dbPath + ext)` → `SafeFsExecutor.safeUnlinkSync(this.config.dbPath + ext, { operation: 'SemanticMemory.quarantineCorruptDb:sidecar' })`
+The test file uses `fs.rmSync` in `afterEach` cleanup only (temp directory in `os.tmpdir()`). Annotated with `// safe-git-allow:` escape comment per the lint spec.
+---
+## Evidence pointers
+- Typecheck: `tsc --noEmit` — 0 errors.
+- Lint: `node scripts/lint-no-direct-destructive.js` — 0 violations.
+- Tests: 12 contract tests covering all recovery paths including partial corruption (valid SQLite header + 4KB corrupted data page in 5000-row DB), size-gate behavior, and skipped-rebuild marker.
+- TopicMemory parity: pattern mirrors `TopicMemory.open()` which has been production-stable since v0.27.x.

package/upgrades/side-effects/url-pathname-path-encoding-fix.md ADDED

@@ -0,0 +1,45 @@
+# Side-Effects Review — Eliminate URL.pathname path encoding across the codebase
+**Version / slug:** `url-pathname-path-encoding-fix`
+**Date:** 2026-04-28
+**Author:** gfrankgva (contributor)
+## Summary of the change
+Systematic replacement of `new URL(import.meta.url).pathname` with `__dirname` (or `fileURLToPath()`) across 13 source files. The former preserves `%20`-encoded spaces in filesystem paths, causing `fs.readFileSync`, `path.resolve`, and similar operations to fail when the project directory contains spaces.
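The difference between the two approaches can be seen in a few lines. The module URL below is a made-up example, and the sketch assumes a POSIX system (on Windows, `fileURLToPath` expects a drive-letter URL).

```typescript
import { fileURLToPath } from 'node:url';
import * as path from 'node:path';

// Hypothetical module URL for a project checked out under "my project".
// In real ESM code this value would come from import.meta.url.
const moduleUrl = 'file:///home/dev/my%20project/dist/core/Config.js';

// URL.pathname keeps the percent-encoding, so the result is not a usable
// filesystem path when the directory name contains a space.
const encodedPath = new URL(moduleUrl).pathname;

// fileURLToPath decodes the URL into a real filesystem path.
const decodedPath = fileURLToPath(moduleUrl);

// A directory derived this way is safe to pass to path.resolve or fs calls.
const moduleDir = path.dirname(decodedPath);

console.log(encodedPath); // on POSIX: /home/dev/my%20project/dist/core/Config.js
console.log(decodedPath); // on POSIX: /home/dev/my project/dist/core/Config.js
console.log(moduleDir);   // on POSIX: /home/dev/my project/dist/core
```

Passing `encodedPath` to `fs.readFileSync` would look for a directory literally named `my%20project`, which is the failure mode this change removes.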
+**Files changed (source):**
+- `src/commands/init.ts` (4 occurrences)
+- `src/commands/playbook.ts` (1)
+- `src/commands/server.ts` (4)
+- `src/commands/setup.ts` (3)
+- `src/core/Config.ts` (1)
+- `src/core/PostUpdateMigrator.ts` (2)
+- `src/core/SessionManager.ts` (1)
+- `src/core/UpdateChecker.ts` (1)
+- `src/core/UpgradeGuideProcessor.ts` (1)
+- `src/threadline/ThreadlineBootstrap.ts` (1)
+- `src/lifeline/ServerSupervisor.ts` (1)
+**Files changed (tests):** 5 test files with unquoted `execSync` paths or test expectation updates.
+**Files changed (generated):** `src/data/builtin-manifest.json` — content hashes updated to reflect changed source files.
+**Files changed (infrastructure):** `scripts/pre-push-gate.js` — added regression guard (check 5) that prevents re-introduction of the `URL.pathname` antipattern.
+## Decision-point inventory
+- All `new URL(import.meta.url).pathname` usages — **fix** (replace with `__dirname` or `fileURLToPath`).
+- No behavioral changes — every replacement produces the same decoded path, just without the `%20` encoding bug.
+---
+## 1–7. Analysis
+This is a pure bug fix with no behavioral, architectural, or security implications. Every replacement produces the identical filesystem path when the path contains no spaces, and the correct path when it does. No new code paths, no new dependencies, no new failure modes. Fully reversible by reverting the commit.
+## Evidence pointers
+- Typecheck: `tsc --noEmit` — 0 errors.
+- Full test suite: 740 files passed, 0 failed, 17171 individual tests passed.
+- Zero instances of `new URL(import.meta.url).pathname` remain in `src/`.