npm - @agenr/openclaw-plugin - Versions diffs - 0.14.0 → 1.1.0 - Mend

@agenr/openclaw-plugin 0.14.0 → 1.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (8) hide show

package/CHANGELOG.md DELETED Viewed

@@ -1,2624 +0,0 @@
-# Changelog
-## [0.14.0] - 2026-03-24
-### Surgeon — Budget Model Overhaul
-- **`--budget` now means dollars, not tokens.** The cumulative token budget that double-counted context re-sends across turns is gone. The surgeon is now constrained by two things: cost (dollars) and context window (tokens). Config `surgeon.budget` is dollars per run. Default: $5.
-- **Context-aware limits.** New `surgeon.contextLimit` config field and `--context-limit` CLI flag. Auto-detects from `model.contextWindow * 0.85` if not set. The surgeon stops when context is full, not from artificial token counting.
-- **`costCap` removed.** Replaced by `budget`. Old `costCap` values are accepted as backward-compatible alias with deprecation warning.
-### Surgeon — Contradiction Improvements
-- **`coexists` relation for `log_conflict`.** The surgeon can now log reviewed false-positive pairs as coexisting rather than forcing them into `contradicts` or `supersedes`. Coexists conflicts are auto-resolved as `keep-both` so they don't accumulate as pending conflicts and future scans skip them.
-- **Improved contradiction candidate filtering.** Suppresses historical series pairs (release versions, prompt paths, roadmap snapshots) from claim divergence scanning. Prioritizes `current_state` claim divergence pairs.
-### Surgeon — Retirement Candidate Scoping
-- **Scoped candidate queries.** `query_candidates` now accepts `scope` parameter: `actionable` (default) filters to entries with high retirement probability — temporary, ephemeral, todos, low-importance events, mislabeled temporal artifacts. `all` shows the full candidate pool for deep sweeps.
-- **Improved candidate ordering.** Actionable scope returns temporary entries first, then ephemeral, then todos, then low-importance events. Durable permanent decisions and preferences are excluded from the default scope.
-- **Two-phase retirement in auto mode.** Surgeon starts with actionable candidates, widens to full pool only if budget remains after exhausting the actionable set.
-## [0.13.4] - 2026-03-23
-### Surgeon
-- **Tightened completion gating thresholds.** Final completion now requires 75% budget usage (was 25%). Phase completion requires 75% (was 50%). Safety valve raised to 5 rejections (was 3). Continuation attempts raised to 5 (was 3). These changes force the surgeon to work through substantially more of the corpus before accepting completion.
-## [0.13.3] - 2026-03-23
-### Surgeon
-- **`complete_pass` gating rejects premature completion.** The tool now validates budget utilization and candidate coverage before accepting completion. Final completion rejected if <25% budget used. Dedup phase rejected if <50% of clusters processed with budget remaining. Retirement phase rejected if <40 candidates evaluated with budget remaining. Safety valve accepts after 3 rejections per phase. Rejection messages tell the surgeon exactly what to do next.
-- **System prompt tightened.** Budget Awareness section now explicitly states that `complete_pass` will reject premature attempts, and that efficiency means spending budget on the right candidates, not spending less budget overall.
-## [0.13.2] - 2026-03-23
-### Surgeon
-- **Continuation loop prevents early exit.** If the surgeon model stops without calling `complete_pass` and has >10% budget remaining, a continuation prompt is injected to push it back to work. Up to 3 nudges before allowing exit. Eliminates the "surgeon quits at 1% budget" problem.
-- **Lowered default dedup similarity threshold from 0.82 to 0.60.** The threshold controls candidate surfacing, not merge execution — the surgeon agent still makes every merge decision. Lower threshold surfaces more candidates for review on large corpora.
-- **`reset` parameter for `query_dedup_clusters` and `query_contradiction_candidates`.** Query parameters are no longer permanently frozen after the first call. Pass `reset: true` to clear cached clusters and rebuild at a new threshold. Lets the surgeon start wide and narrow if noisy.
-- **Strengthened auto sweep prompts.** Contradictions phase always runs proactive scan (no more skipping when pending conflicts = 0). Budget discipline section added — surgeon must keep working while budget remains. Retirement throughput expectations: 500+ candidates on a 3K corpus, not 100.
-- **Dedup threshold guidance in prompts.** Surgeon is told the default is deliberately low, and can raise it via reset if too noisy.
-## [0.13.1] - 2026-03-23
-### MCP Server
-- **`agenr_store` tool added to MCP surface.** MCP consumers (Claude Desktop/Cowork, Claude Code, etc.) can now make structured memory writes with explicit type, subject, importance, expiry, tags, project, and scope — matching the fidelity of OpenClaw's store tool.
-- **Enriched tool descriptions with embedded doctrine.** All 5 MCP tool descriptions (`agenr_recall`, `agenr_store`, `agenr_extract`, `agenr_retire`, `agenr_update`) now carry usage guidance directly in their metadata — when to recall, query tips, importance calibration, lifetime selection, what to store vs. not store. MCP consumers no longer need external instruction files to use memory well.
-- **Fixed `scope` persistence bug.** Manual store writes were not persisting the `scope` field due to a missing column in the insert path. Fixed in the shared store persistence layer.
-### Surgeon
-- **Proactive contradiction discovery.** Surgeon can now discover contradictions proactively during sweeps, rather than only resolving pre-flagged conflicts.
-- **Supersession scan.** New surgeon pass identifies entries that have been superseded by newer, more complete information.
-- **Contradictions pass and auto sweep.** Combined contradictions + auto multi-pass mode where a single agent loop has all tools from all passes registered simultaneously.
-- **Dedup pass.** Dedicated surgeon dedup pass for identifying and merging near-duplicate entries.
-- **Graceful abort.** Surgeon now handles abort signals gracefully and removes artificial loop caps.
-- **Paginated retirement.** Retirement pass now paginates to handle large corpora without memory pressure.
-- **Inspect provenance.** New surgeon tool for inspecting entry provenance chains.
-- **Prompt loading fix.** Fixed surgeon prompt loading for bundled distribution.
-### Infrastructure
-- **Removed `agenr maintain` and `agenr consolidate` commands.** Surgeon is now the sole corpus-health entry point. Replace any cron jobs or scripts with `agenr surgeon run`.
-- **Watcher interval floor.** Enforced minimum watcher polling interval to prevent excessive resource usage.
-- **Default model migration.** Migrated defaults to gpt-5.4 models.
-- **Surgeon workflow limits and budget defaults.** Increased defaults for production workloads.
-## [0.13.0] - 2026-03-22
-### Architecture
-- **Hexagonal architecture restructure.** Complete reorganization of the codebase into a modular monolith with hexagonal (ports & adapters) architecture. `src/` now has four top-level directories: `modules/`, `shared/`, `edge/`, and `runtime/`.
-- **Five bounded context modules.** `recall`, `store`, `ingestion`, `surgeon`, and `eval` — each with `domain/`, `application/`, `ports/`, and `adapters/` layers.
-- **OpenClaw plugin as edge adapter.** Moved to `src/edge/openclaw/` with internal hex layering (`domain/`, `application/`, `adapters/`), alongside CLI, MCP, and watch edge adapters.
-- **Eliminated `src/memory/`.** The awkward middle layer between app and db is gone. Recall orchestration moved to the recall module; store pipeline moved to the store module.
-- **Eliminated `src/domain/`, `src/app/`, `src/prompts/`, `src/adapters/`.** Each module now owns its own domain logic, application workflows, and prompt templates.
-- **Runtime consolidation.** Collapsed 31 `*-defaults.ts` wiring files into 8 focused composition roots — one per module plus shared infrastructure.
-- **Module boundary tests.** New architecture boundary tests enforce: domain purity, no cross-module internal imports, ports-only cross-module communication.
-- **Operations owner matrix.** DB utility workflows assigned to owning modules or shared operations (`context/` → recall, `classify-entries/` → store, `clusters/` → surgeon, `watcher-report/` and `session-store-metrics/` → `src/shared/operations/`). Only genuinely cross-module admin tooling remains in `src/shared/operations/`.
-- **App-boundary allowlist dropped to zero.** All previously allowlisted cross-layer imports are now legitimate within-module imports or port-mediated cross-module calls.
-### Corpus Health
-- **Surgeon is now the sole corpus-health entry point.** Removed `agenr maintain` and standalone `agenr consolidate`. `agenr surgeon run` now owns deterministic cleanup, quality evolution, vector integrity checks, retirement, dedup/merge, and contradiction resolution.
-- **Maintenance and consolidation internals absorbed into surgeon.** Deterministic pre/post steps now run inside surgeon workflows, and clustering/merge infrastructure moved under `src/modules/surgeon/application/clustering/`.
-- **Migration note.** Replace any cron jobs or scripts invoking `agenr maintain` or `agenr consolidate` with `agenr surgeon run`.
-## [0.12.3] - 2026-03-22
-### Fixed
-- **Plugin DB tables in core schema.** `session_projects`, `seen_sessions`, and `session_identity_breadcrumbs` tables are now created as part of the main schema init, fixing `SQLITE_ERROR: no such table: session_projects` errors when calling `agenr_set_session_project` or `agenr_get_session_project`. Previously these tables depended on a lazy plugin-db init path that could fail to run.
-- **Tool schema Codex compatibility.** Replaced `Type.Union([Type.Literal(...)])` with `Type.String({ enum: [...] })` for the `expiry` and recall `context` fields in tool schemas. Codex and other providers that reject `anyOf`/`oneOf` at top level now get clean schemas.
-- **Support-aware cluster validation.** Consolidation cluster validation now uses support-aware entry resolution, preventing false merge candidates from stale or unsupported cluster assignments.
-## [0.12.2] - 2026-03-22
-### Recall Scoping Fixes
-- **Wildcard project passthrough.** `project: "*"` no longer gets silently dropped to `undefined` during recall request building. The wildcard marker now flows through the full stack to `hasWildcardProjectOverride` in `prepareRecallInputs`, enabling true cross-project recall.
-- **Default project fallback for unscoped recall.** When an agent calls `agenr_recall` without an explicit `project`, the session's default project is now used as a `universal_first` hint — searching the default project first with null-project fallback. Previously, unscoped recall defaulted to null-project-only entries, silently returning near-empty results for project-heavy corpora.
-- **Wildcard default when no project context.** When neither an explicit project nor a session default is available, unscoped recall now defaults to wildcard (`*`) cross-project search instead of null-project-only.
-### Browse Recall
-- **Temporal proximity rebalancing.** Browse mode recall now prioritizes temporal proximity over importance with a diversity pass, better surfacing recent entries during temporal exploration.
-- **Removed default 1d since window.** Browse mode no longer applies a default 1-day `since` window, allowing full temporal exploration of the corpus.
-### Update & Retire Improvements
-- **Expiry changes via `agenr_update`.** The update tool now supports changing an entry's expiry tier (`core` → `permanent`, etc.) without retiring and re-creating it. Entry history (confirmations, recall count, created_at) is preserved.
-- **Subject selectors for update and retire.** Both `agenr_update` and `agenr_retire` now accept `subject` as an alternative to `entry_id`. Subject matching is case-insensitive exact match; when multiple entries share a subject, the most recent is targeted.
-- **Agent action replay.** Retire and update operations now support recall target hints for agent action replay workflows.
-### Maintenance
-- **Vector integrity detection.** New `vector-integrity` maintain task detects and repairs drift between the vector shadow table and the entries table.
-## [0.12.1] - 2026-03-21
-### Post-Ingest Quality Fixes
-- **Active-only FTS index.** FTS triggers and rebuild helpers now scope to active entries only (`retired = 0, superseded_by IS NULL`). Retired entries no longer occupy FTS slots or get reindexed during maintenance.
-- **Active-only vector index.** The `idx_entries_embedding_shadow` partial index now excludes retired entries. Vector `top_k` queries no longer waste slots on retired embeddings that get post-filtered. Rebuild and health-check paths updated to match.
-- **Schema regression test.** Fresh `init`/`reset` must create active-only trigger and index definitions — validated by new schema test.
-### Passthrough Dedup Fix
-- **Normalized content hash dedup.** Passthrough now checks active `norm_content_hash` before insert, catching exact duplicates that survived because the previous `contentHash` included `source.file`. Identical content from different tool calls is now correctly deduplicated.
-- **Within-batch norm-hash tracking.** Local batch dedup tracks both `contentHash` and `normContentHash`, preventing same-batch duplicates with different synthetic source files.
-- **Granular skip tracking.** `stats.skipped` now counts by skipped candidates rather than matched set size.
-### Store Guidance
-- **Future-session test.** Updated `agenr_store` tool description, Memory Doctrine (`system-context.ts`), and SKILL.md with concrete store/don't-store guidance. Agents are now instructed to apply the "future-session test" before storing: will a fresh session need this to act differently, or is this just logging what happened?
-- **Importance calibration.** Added "importance is not recency" guidance — shipping events are 5-6, recurring operational hazards are 7-8.
-## [0.12.0] - 2026-03-20
-### Ingestion Overhaul
-- **Unified extraction pipeline.** Consolidated `src/extractor/` and `src/app/extract/` into `src/app/ingest/extraction/`. Removed legacy extraction runtime infrastructure and redundant module exports.
-- **Claim extraction enabled by default.** Structured claims (subject/attribute/predicate/object) are now extracted during ingest without requiring `--claims`. Added embedding validation for extracted entries.
-- **Batch claim extraction.** New batch mode for `backfill-claims` and eval commands processes multiple entries per LLM call.
-- **Within-batch dedup renamed and simplified.** Renamed pre-store dedup to within-batch dedup. Removed the DB-based deduplication layer — online store-time dedup handles cross-batch conflicts.
-- **Token usage tracking.** Threaded `tokenUsageTracker` through the full ingest pipeline for per-run cost reporting.
-- **Per-file and workflow timing breakdowns.** Ingest now reports extraction, queue wait, and store exec timing per file and in the summary. Wall-clock timing alongside cumulative timing shows the benefit of parallel workers.
-- **Per-file project inference.** When `--project` is not specified, each file infers its project from transcript metadata (session project block → cwd detection → label mapping → agent store projects). Matches watcher behavior.
-- **Source classification for ingest entries.** Entries extracted via `ingest` now get `source_class: "cli"` instead of being misclassified as `"watcher"`.
-- **Dedup quality warning tracking.** Expanded valuation negative signals and added tracking for dedup quality warnings in ingest results.
-- **Claim extraction config controls.** Added `claims` and `claimExtractionConcurrency` config fields to normalize ingest result reporting.
-### Session Metadata Enrichment
-- **Agent-stored entries as extraction seeds.** The OpenClaw adapter now parses successful `agenr_store` tool calls from session files, correlating with tool results to confirm success. These pre-validated entries seed the extraction LLM's "previously extracted" context, preventing duplicate/inferior re-extraction.
-- **Enhanced tool call summaries.** `agenr_store` → `[attempted brain store: decision: "subject" project=X]`, `agenr_recall` → `[recalled from brain: "query" project=X]`, `sessions_spawn` → `[spawned sub-agent: label (mode model=X)]`. Replaces generic one-liners with structured summaries.
-- **Session project parsing.** The OpenClaw adapter extracts `sessionProject` from the injected `current_session_project:` block in the first user message. Three-value semantics: string (authoritative project), null (explicitly none), undefined (not found).
-- **Models used tracking.** Collects all models from session records, model_change records, and assistant messages into `metadata.modelsUsed`.
-- **Project fallback chain.** `resolveFallbackProjectFromMetadata` now checks: session project → cwd detection (gated against configured projects) → session label mapping → agent store projects (unanimous only).
-- **Configured project slug validation.** Detected projects from `cwd` are now gated against configured project slugs. Random git repos (like `~/.openclaw/workspace`) no longer create phantom projects.
-### Maintenance Overhaul
-- **Removed edge-decay and clusters tasks.** Eliminated cluster scoring and graph-based edge-decay from the maintain workflow.
-- **Removed reflection synthesis.** Removed synthetic recall generation and the reflection synthesis system.
-- **Removed auto-resolve confidence threshold.** Simplified conflict resolution by removing the automatic resolution path.
-- **Deterministic retirement.** Added a deterministic retirement path for clearly stale entries with conflict-based protection for entries under active dispute.
-- **Recall recency grace periods.** Added grace periods to db audit retirement that protect recently-recalled entries from premature retirement.
-- **Configurable forgetting threshold.** Added `forgetting.lowScoreTriggerCount` config field for auto-forgetting trigger and improved retirement field consistency.
-### Consolidation Overhaul
-- **Migrated consolidation to `src/app/consolidate/`.** Moved from top-level into the app domain with updated documentation.
-- **Post-merge claim normalization.** Consolidation merges now re-extract structured claims and validate subject specificity.
-- **Removed cluster and graph scoring.** Recall scoring no longer uses cluster membership or graph augmentation signals.
-- **Replaced multiplicative recall scoring.** Switched from multiplicative to signal-averaging for recall score composition. Added recall engagement reporting.
-### Watcher Improvements
-- **OpenClaw watcher demotion policy.** Watcher-extracted entries from OpenClaw sessions get capped importance and temporary expiry with auto-retirement tracking and session metrics reporting.
-- **Source-aware watcher deferral.** Watcher now defers to tool-stored entries for the same session, with tracking and reporting of deferral decisions.
-- **Session key threading.** OpenClaw `session_key` is threaded through the watcher workflow into store options for cross-surface correlation.
-- **Watcher report command.** New `watcher-report` command analyzes tool/watcher overlap, coverage gaps, and readiness for transition from watcher to tool-primary ingestion.
-### Store Pipeline
-- **Source classification columns.** Added `source_class` and `session_key` columns to the entries table with normalized source classification.
-- **Batch store lookups.** Added batch lookup methods to the store repository with precomputed dedup resolutions.
-- **Entity normalization.** Added entity normalization during ingest extraction with entity hint threading through the dedup workflow.
-- **Per-task context types.** Replaced monolithic `TaskExecutionContext` with per-task context types and type-safe task dispatch in the maintain orchestrator.
-### Audit & Diagnostics
-- **DB audit command.** New `db audit` command for legacy structural and quality issue cleanup.
-- **Importance inflation audit.** Added per-entry attribution and preview logging for importance inflation detection.
-- **Contradiction blocking tracking.** Added infrastructure for tracking and reporting contradiction blocking decisions.
-- **Shadow threshold telemetry.** Added multi-threshold analysis for contradiction audit shadow thresholds.
-- **Unresolved reference detection.** Improved detection of dead cross-references with weak self-containedness signal in valuation.
-### Fixed
-- **Vector index rebuild.** Fixed libsql vector index rebuild not populating the shadow table after `DROP INDEX` + `CREATE INDEX`. Added `UPDATE entries SET embedding = embedding` to force re-indexing of existing rows.
-- **`resolveStoreKnownProjects` using paths instead of slugs.** Fixed the function to extract `.project` values from config instead of using directory path keys.
-- **Removed `"workspace"` from `DEFAULT_KNOWN_PROJECTS`.** Prevented phantom project inference from the OpenClaw workspace `.git` directory.
-### Dependencies
-- Updated `@mariozechner/pi-ai` to 0.60.0.
-- Fixed `streamSimple` import to use package root instead of dist path.
-## [0.11.3] - 2026-03-18
-### Fixed
-- Added `sessionStartSectionCaps` to `openclaw.plugin.json` config schema so OpenClaw validates it correctly.
-## [0.11.2] - 2026-03-18
-### Added
-- Added `sessionStartSectionCaps` plugin config option to control per-section caps for session-start memory injection. Set a section to `0` to disable it (e.g. `{ "preferences": 0, "recent": 0 }` to suppress noisy non-core sections while keeping core context). Sections: `core` (default 6), `active` (default 4), `preferences` (default 4), `recent` (default 2).
-## [0.11.1] - 2026-03-18
-### Fixed
-- Fixed `@agenr/openclaw-plugin` package missing `openclaw.extensions` field, which prevented OpenClaw from installing the plugin.
-## [0.11.0] - 2026-03-18
-### Changed
-- **Recall pipeline rebuilt from scratch.** Removed ~42,000 lines of heuristic post-processing (decision policies, confidence gates, category rollup assembly, suppression chains, lexical rescue, expansion heuristics, name-bound token matching) and replaced them with a clean ~1,750-line retrieval pipeline: normalize → embed → scoped vector search → lexical search → combine scores → rerank top-N → top-k → return.
-- Added a narrow second-stage reranker with query intent detection. The reranker operates only on a bounded candidate window and uses query-conditioned features (lexical specificity, scope match, metadata signals, temporal markers) instead of global corpus-wide score adjustments.
-- Improved lexical candidate admission: FTS now evaluates all query variants before trimming, subject overlap is scored separately from content overlap, and recent/state-like entries get query-conditioned admission preference.
-- Simplified the eval harness to measure ranked retrieval directly (hit@k, MRR, precision@k, scope bleed) instead of tracking deleted heuristic concepts.
-### Added
-- Added deterministic ingest-time classification: `entry_kind` (directive, state, identity, episode, reference) and `temporal_class` (ephemeral, durable) columns on entries, assigned at store time using type, subject, and content signals. No LLM calls — pure pattern matching.
-- Added tag-based project inference at store time: entries with a single known-project tag and no explicit project assignment are automatically promoted.
-- Added `agenr maintenance classify-entries` command to backfill classification metadata on existing databases.
-- Added rerank intent detection (`policy_seeking`, `status_seeking`, `recentness_seeking`, `topic_seeking`, `general`) used exclusively by the reranker to condition feature weights.
-### Fixed
-- Fixed exact-policy queries consistently losing rank to generic same-topic entries.
-- Fixed session handoff and status queries being polluted by handoff-infrastructure notes.
-- Fixed temporal/recency queries returning older durable entries instead of recent events.
-- Reduced project scope bleed in retrieval results.
-### Removed
-- Removed decision-policy state machine, confidence gates, and mode-dependent dispatch (~3,500 lines).
-- Removed category rollup assembly subsystem (~3,700 lines).
-- Removed suppression framework and reflection suppression (~800 lines).
-- Removed rescue mechanisms (change-lookup-prior-state-rescue, structured-state-rescue).
-- Removed query expansion heuristics, name-bound token matching, and topic-specific boosting.
-- Removed surface-selection heuristics and compact-bundle diagnostics.
-## [0.10.1] - 2026-03-17
-### Fixed
-- Fixed `agenr_recall` tool using mid-session recall scoping, which caused the session's configured project to be applied as a strict filter — excluding universal/personal memories when the agent didn't explicitly pass a `project` parameter (#552).
-### Changed
-- Separated recall reads from writes in the OpenClaw plugin tool path. Tool recall now uses a shared read-only connection (read-safe init, no schema work) and writes recall events asynchronously via a serialized write queue. This eliminates `SQLITE_BUSY` errors when the agent fires multiple `agenr_recall` calls in parallel (#553).
-## [0.10.0] - 2026-03-17
-### Changed
-- Refactored the codebase around clearer application, runtime, adapter, and domain boundaries, extracting major workflow orchestration into `src/app/`, adding explicit app ports/runtime defaults, and converting command modules into thinner presentation layers.
-- Reorganized large retrieval, ingest, maintain, config, watcher, and OpenClaw plugin surfaces into more focused modules, including modular category-rollup assembly, session continuity/handoff plumbing, watcher runtime pieces, and config normalization/schema layers.
-- Tightened architectural discipline and boundary enforcement with new architecture tests, updated contributor/architecture docs, and cleanup of internal architecture/research/history material from version control.
-### Added
-- Added shared app-layer slices and runtime defaults for recall, store, extract, ingest, maintain, evaluation, DB operations, watch flows, and OpenClaw recall feedback, giving the codebase clearer dependency seams for future work.
-- Added live-facing personal-topic contamination controls for family/parents recall, including shared personal-topic filtering, cleaner rollup evidence selection, and focused regression coverage for personal-family retrieval behavior.
-- Added broader recall and eval-harness support during the refactor, including modular recall-regression tooling, richer gold-harness diagnostics, and artifact checks used to debug live-vs-eval family contamination gaps.
-### Fixed
-- Fixed broad personal-family recall contamination across both rollup and non-rollup live CLI paths by suppressing project/debug/meta memories when grounded family evidence exists.
-- Fixed category-rollup evidence selection so adversarial family contamination cases no longer survive into selected evidence bundles and synthesized rollup summaries in the new regression coverage.
-- Fixed multiple adapter/runtime boundary leaks uncovered during the refactor by centralizing dependencies behind app/runtime seams instead of letting command and infrastructure code reach directly across layers.
-## [0.9.99] - 2026-03-15
-### Added
-- Added a full post-rebuild tightening tranche across recall evaluation and diagnosis, including truth-telling regression metrics and query-class breakdowns, null-project scope audit/recovery tooling, cloned-DB before/after probe workflows, strict scope-sensitive snapshot coverage, and graph diagnosis artifacts in `agenr-evals`.
-- Added targeted broad-recall upgrades for category and project rollups, including rollup-specific retrieval shaping, deterministic rollup summaries, semantic diversity, graph-neighborhood-aware assembly, project workstream interpretation, and bounded non-project neighborhood interpretation.
-- Added standing graph-sensitive coverage beyond project rollups, including family neighborhood completion and project continuity completion cases that let graph diagnosis distinguish helpful vs low-leverage graph participation.
-### Fixed
-- Fixed historical issue recall for exact repair/diagnosis questions by routing explicit `what fixed/caused/issue with ...` queries into historical lookup instead of collapsing into `abstain_or_narrow`, then tightened historical exactness scoring so older exact answers can beat newer broad policy distractors when appropriate.
-- Fixed strict project-scoped retrieval end to end by recovering a safe first slice of missing `project=agenr` rows, proving the recovery path in strict-scope probes, and then eliminating the remaining surfaced-result bleed and partial-bundle residuals.
-- Fixed project-rollup graph behavior so graph-backed assembly now backs off when base retrieval already has a sufficient bundle, preserves participation only when it materially completes the answer, and exposes explicit participation diagnostics that `agenr-evals` now interprets correctly.
-## [0.9.98] - 2026-03-15
-### Changed
-- Refactored the codebase into cleaner responsibility boundaries across the app/runtime, MCP server, init/setup flow, watch context refresh, OpenClaw plugin service calls, and shared store/retire plumbing, reducing entry-point sprawl while preserving the existing product surface.
-- Continued internal cleanup after the large module split by removing now-unused variables, functions, and imports across the refactored codebase.
-- Updated architecture documentation to match the new module organization and adapter boundaries so the repository structure and implementation seams are easier to navigate after the refactor.
-## [0.9.97] - 2026-03-15
-### Fixed
-- Fixed a follow-on watch/change issue discovered immediately after the 0.9.96 release and shipped the corrective patch as a fast follow-up release.
-## [0.9.96] - 2026-03-15
-### Fixed
-- Fixed the bulk rebuild ingest path so Phase 3 structured fields are no longer bypassed during ingest when claims are enabled. Bulk ingest now runs structured-claim normalization and optional claim extraction before insert, keeping `subject_key`, `claim_predicate`, and `claim_role` population aligned with the normal store path.
-- Fixed ingest/eval parity so bulk-mode and normal-mode claim population stay aligned, and added regression coverage proving bulk ingest can fill structured columns while still leaving `entry_supports` empty on first insert until later reinforcement/consolidation.
-## [0.9.95] - 2026-03-15
-### Added
-- Added a full Phase 3 structured-memory spine for canonical claims, including support accumulation, explicit `current_state` and `prior_state` anchors, and explicit committed `state_transition` anchors for replacement/migration changes.
-- Added bounded and then broader structured retrieval support over the new state fields, including direct structured lookup, subject-key recovery from weak footholds, hybrid candidate assembly, and structure-aware embedding composition so canonical anchors can win semantically as well as lexically.
-- Added pre-rebuild alignment work for consolidation, maintenance, conflicts, and runtime prompt guidance so rebuild-time operation follows the new canonical brain model instead of the legacy blob-brain assumptions.
-### Fixed
-- Fixed live recall crash behavior around retired related-entry hydration by making relation-neighbor expansion safe in the presence of retired superseded source rows, then removed active-to-retired `relations.supersedes` residue as a write-time invariant rather than leaving retrieval to tolerate it forever.
-- Fixed consolidation and maintenance behavior to preserve Phase 3 canonical invariants: same-identity structured claims now reinforce deterministically instead of synthesizing blob merges, current/prior/transition anchors remain distinct, canonical structured claims are protected from generic retirement/reflection demotion, and superseded cleanup distinguishes true supersession from expiry residue.
-- Fixed prompt/runtime alignment so extraction, dedup, consolidation, reflection, and rebuild-oriented prompt paths now encode the new structured-claim, role-aware, rebuild-required model instead of lagging behind older assumptions.
-## [0.9.94] - 2026-03-14
-### Added
-- Added a Phase 2 query-normalization and retrieval-mode layer that interprets raw recall phrasing together with explicit `agenr_recall` parameters, making current-state, historical, decision, preference/policy/workflow, change/transition, and abstain-or-narrow routing explicit and traceable.
-- Added noisy conversational/currentness/transition/param-interaction eval coverage plus surfaced-set diagnostics that measure what users actually see, including output-alignment reasons and bounded transition/currentness support shaping.
-### Fixed
-- Fixed recall precision and surface behavior across the full Phase 1/Phase 2 brain-quality tranche: adjacent-wrong suppression, lexical candidate survival, generic lookup query-shape handling, selected-support-only surfacing, abstain-or-narrow trust behavior, historical clear-winner qualification, and currentness/transition before-after pair handling.
-- Fixed explicit migration/replacement transition queries in both isolated eval and realistic snapshot/live-brain paths so prior-state rows remain available for bounded before/after answers without reopening broad stale-history retrieval.
-## [0.9.93] - 2026-03-13
-### Added
-- Added focused gold-eval coverage and regression protections for fallback surfaced-set suppression, blocker-shaped current-state routing, natural prerequisite pair assembly, residual holdout abstain/pair-selection cases, and currentness-update-posture bundle/commitment-state cases.
-- Added memory-formation eval diagnostics that preserve extractor warnings in gold harness artifacts, making provider/auth failures visible instead of collapsing them into misleading empty extraction results.
-### Fixed
-- Hardened fallback surfaced-set selection so answer-bearing support can suppress adjacent same-topic distractors without reopening broad ranking changes.
-- Fixed blocker-shaped `still blocked` status questions so they route into current-state/resolved-state handling instead of generic lexical fallback.
-- Fixed natural prerequisite phrasing and bounded pair assembly so answerable prerequisite questions stop abstaining or dropping complementary support.
-- Fixed residual holdout generic-intent abstain and prerequisite completeness cases, eliminating the last holdout distractor-intrusion failures.
-- Fixed currentness-update-posture residuals by routing bounded policy-bundle questions into pair handling and tightening commitment-state recognition for settled `code slice` phrasing.
-- Fixed sandbox eval credential resolution so inherited bad API-key env vars no longer override valid sandbox-config credentials during eval runs.
-## [0.9.92] - 2026-03-13
-### Added
-- Added an `agenr eval harness --mode recall-regression` path for replaying live-brain snapshot recall cases as focused regression suites, with targeted fixtures and assertions for rank expectations and trace inspection.
-- Added targeted full-recall regression coverage for lexical-candidate rescue and affinity-rerank edge cases, including generic-subject strong body matches and direct workspace repair answers with only soft meta wording.
-### Fixed
-- Tightened lexical candidate rescue so strong handoff-style body matches can survive vector-top-k misses and reach reranking in bounded recall flows.
-- Refined affinity reranking so soft meta signals such as `workspace` or procedural wording no longer blanket-demote otherwise direct operational answers, while preserving demotion pressure for genuinely meta/config memories on non-meta queries.
-## [0.9.91] - 2026-03-13
-### Added
-- Added a public `agenr eval harness --mode ingest-inspect` seam for structured ingestion inspection and private eval-pack orchestration against session-derived fixtures.
-### Fixed
-- Hardened ingestion hygiene against transcript-like session scaffolding and bookkeeping residue by adding a conservative pre-chunk sanitizer that preserves intended durable content while stripping obvious session junk.
-- Fixed ingestion parity harness measurement so paired normal/bulk comparisons reuse a single cached extraction artifact per fixture instead of comparing unrelated live extraction samples.
-- Fixed the shared extraction blind spot behind `ingestion-parity-002` by threading platform through raw-text extraction, narrowing procedural-noise valuation so replacement-state entries like "open todo" are not rejected as command noise, and tightening replacement/completed-state extraction guidance.
-- Restored full test-suite health after the ingest work by aligning the OpenClaw doctrine string with the expected wording, reapplying synthesized reflection demotions during reflection maintenance, and updating the stale reflection demotion unit test expectation.
-## [0.9.90] - 2026-03-12
-### Fixed
-- Hardened reflection maintenance so verification now fails closed instead of applying a reflection after repeated verification-parse failure.
-- Stopped reflection apply from destructively lowering canonical source-memory importance, preserving durable source salience even when synthesis recommends demotion.
-- Narrowed reflected-source suppression so recall retains the best reflected source instead of blanket-hiding all reflected sources whenever a reflection surfaces.
-## [0.9.89] - 2026-03-12
-### Changed
-- Strengthened the OpenClaw `prependSystemContext` agenr memory doctrine so it explicitly treats agenr as the only allowed durable memory workflow, forbids `memory_search`, `memory_get`, `MEMORY.md`, `memory/*.md`, and markdown journals for memory recall, and instructs agents to ignore any conflicting earlier prompt guidance.
-## [0.9.88] - 2026-03-12
-### Fixed
-- Updated the OpenClaw plugin for `openclaw@2026.3.11` by migrating agenr command lifecycle handling from the stale typed `api.on("command", ...)` path to named `api.registerHook(["command:new", "command:reset"], ...)` registration, restoring clean command-hook compatibility and removing current-release hook registration warnings.
-- Removed temporary OpenClaw prompt-build debug instrumentation after confirming `prependSystemContext` and `prependContext` injection is functioning in the sandbox.
-## [0.9.87] - 2026-03-12
-### Changed
-- Added an OpenClaw `prependSystemContext` agenr-first memory doctrine that supersedes legacy markdown-memory guidance without replacing OpenClaw's dynamically built system prompt.
-- Split OpenClaw prompt-build injection into stable doctrine (`prependSystemContext`) and dynamic recalled memory (`prependContext`), preserving existing session-start, mid-session, signal, and nudge behavior.
-- Added the `openclawMemoryDoctrine.enabled` plugin config gate, default-enabled, so operators can disable the stable doctrine block without turning off dynamic memory injection.
-- Updated OpenClaw plugin docs, architecture notes, and regression coverage to reflect the new doctrine/system-context path and its no-op-session behavior.
-## [0.9.86] - 2026-03-12
-### Fixed
-- Fixed OpenClaw session-start core injection to read from a strict core-only path, preventing generic recall behavior from leaking non-core material into startup memory.
-- Fixed OpenClaw mid-session wildcard recall scoping so explicit `project="*"` now behaves as an intentional unscoped bypass instead of being treated like a literal project slug.
-## [0.9.85] - 2026-03-11
-### Changed
-- Raised the default OpenAI task-model baseline from `gpt-4.1-nano` to `gpt-4.1-mini` across config defaults, setup flows, and benchmark defaults after memory-reliability validation showed `mini` is the minimum viable extraction model for the bounded memory-formation contract slice.
-- Preserved the recently landed memory-reliability improvements through downstream OpenClaw surface-contract alignment so prompt-visible memory now honors authoritative `surfacedIds`, preserves abstain posture, and correctly surfaces commitment-shaped startup memory without procedural-noise leakage.
-### Fixed
-- Closed the bounded currentness residual seam, bounded temporal-boundary continuity seam, memory-formation commitment-preservation tranche, and downstream OpenClaw memory-surface contract tranche as part of the current memory-reliability phase milestone.
-## [0.9.84] - 2026-03-09
-### Added
-- Added a public `agenr eval harness` CLI seam for private eval-run orchestration, including fail-closed sandbox validation that rejects default home config/DB paths and emits stable JSON metadata for harness tooling.
-## [0.9.83] - 2026-03-08
-### Fixed
-- Hardened OpenClaw session-start startup reads by preserving the `initDbForStartupReads` export in test mocks, updating startup-read assertion shapes to match the current read-safe API, aligning deferred handoff timing coverage with the current lifecycle, and eliminating a real startup PRAGMA vs `BEGIN IMMEDIATE` contention race in regression coverage.
-## [0.9.82] - 2026-03-08
-### Added
-- Added a GitHub Actions `Validate` workflow that runs on pull requests and `master` pushes, installs dependencies, runs `pnpm typecheck`, and then runs the full `pnpm check` gate.
-### Changed
-- Documented `pnpm check` as the canonical local validation command and called out `pnpm typecheck` as the minimum fast sanity check before trusting passing tests.
-- Hardened update-path validation and regression coverage so validation-gate checks no longer depend on ambient host project config or local previous-session state.
-## [0.9.81] - 2026-03-08
-### Added
-- Added in-place entry metadata updates for memory hygiene, starting with `importance`, via the new `agenr update --id <id> --importance <n>` CLI command.
-- Added the `agenr_update` MCP tool and native OpenClaw plugin tool so agents can demote stale-but-still-true entries without retiring them.
-- Added focused coverage for CLI and MCP update flows plus OpenClaw plugin tool wiring.
-### Fixed
-- Added startup-only `SQLITE_BUSY` retry/backoff for OpenClaw session-start browse and memory-index reads, reducing transient lock-induced degradation immediately after handoff recovery.
-- Session-start browse now logs explicit retry and unavailable states instead of collapsing lock failures into the ordinary `browse returned 0 entries` path.
-- Preserved existing startup fail-open rendering behavior by continuing to omit unavailable browse or memory-index sections while still completing session-start with the remaining recovered context.
-- Added focused regression coverage for strict startup browse execution, compatibility preservation in `runRecall()`, and session-start retry/degradation logging for browse and memory-index lock contention.
-## [0.9.79] - 2026-03-08
-### Fixed
-- Fixed OpenClaw session-start memory index observability so startup logs now distinguish successful empty loads from timeout, invalid-response, and error states instead of collapsing them all into `0 projects`.
-- Fixed the OpenClaw session-start core recall debug log to report configured `coreProjects` truthfully rather than mislabeling config length as "active projects".
-- Fixed OpenClaw session-start fallback recovery for newer TUI session keys such as `agent:main:tui-<uuid>` by allowing degraded same-family TUI predecessor acceptance when the current session identity resolves `family=tui` but cannot recover a stable lane.
-- Preserved strict lane matching for explicit predecessors and known-lane fallback cases, while keeping cross-family fallback candidates ineligible.
-- Added regression coverage for memory-index load result typing, session-start memory-index availability logging, markdown omission on unavailable index states, truthful core-project log wording, unknown-lane TUI fallback acceptance, alternate same-family TUI lane acceptance, cross-family rejection, and the new-key debug logging path.
-## [0.9.78] - 2026-03-08
-### Fixed
-- Restored OpenClaw session-start continuity for legacy TUI session keys that expose `family=tui` without explicit lane metadata by deriving a narrow compatibility lane from safe structured key forms such as `agent:main:tui` and `agent:main:macbook_tui`.
-- Added regression coverage ensuring legacy plain and aliased TUI keys recover same-lane explicit and fallback predecessors, while ambiguous non-lane identifiers still fail closed.
-## [0.9.77] - 2026-03-08
-### Fixed
-- Added DB-backed OpenClaw session identity breadcrumbs keyed by `sessionId`, allowing agenr to recover family and lane identity for reset-archived predecessors after OpenClaw renames live transcripts to `*.jsonl.reset.*`.
-- Restored same-family and same-lane OpenClaw session-start continuity across reset artifacts while preserving fail-closed rejection for different-family or different-lane predecessors.
-- Hardened OpenClaw session-start continuity so direct event identity now beats stale mirrored context, preventing `ctx.sessionKey` lane data from overriding an incoming event family or manufacturing a lane the event never asserted.
-- Hardened predecessor validation so structured identity from validated session keys and session ids now outranks stale manifest `origin.surface` metadata, preserving correct family and lane classification when those signals conflict.
-- Fixed session-store reverse lookup to resolve relative `sessionFile` entries against the OpenClaw sessions directory, preventing valid predecessors from collapsing to `unknown` due to path-base mismatches.
-- Explicit predecessor references that point at a base `*.jsonl` path now canonicalize back to the real archived transcript artifact when the live file is gone and only a supported reset or deleted artifact remains on disk.
-## [0.9.76] - 2026-03-07
-### Changed
-- Centralized recall, store, extract, retire, and trace orchestration behind shared internal service modules under `src/app/`, so the CLI, MCP server, and OpenClaw plugin reuse the same typed business logic instead of duplicating normalization and execution paths.
-- Centralized shared adapter tool-log normalization for Claude Code and OpenClaw so common tool-call extraction, summary rendering, and tool-result retention logic now live in one place while platform-specific behavior stays explicit where needed.
-- Split the DB schema internals into responsibility-based modules under `src/db/schema/` and centralized entry-read projections and row mapping in a shared helper so schema initialization, migrations, and read paths have clearer ownership with a stable public surface.
-- Refactored the OpenClaw `before_prompt_build` and session-start selector internals into stage-specific helper modules for startup context recovery, recall-data preparation, prompt rendering, heuristic analysis, and final selection, while preserving existing prompt composition behavior.
-- Added repository-wide formatting and lint tooling with Prettier, ESLint, and a top-level `pnpm check` quality gate that runs format check, lint, typecheck, and tests.
-- Refreshed the OpenClaw contributor and architecture docs to match the refactored module layout, including new contributor-orientation, project, and handoff lifecycle references.
-### Fixed
-- Restored the remaining watcher, config, vector-decoding, and OpenClaw plugin regression coverage needed for the consolidated post-`0.9.75` codebase to pass the full `pnpm check` verification path.
-- Stabilized OpenClaw `before_prompt_build` test expectations and watcher fixtures so the stricter typed helper boundaries introduced by the refactors remain covered without weakening runtime contracts.
-## [0.9.75] - 2026-03-07
-### Changed
-- OpenClaw session-start memory injection now runs browse/core recall through a dedicated startup selector that favors continuity utility over raw browse ordering, suppresses overlap with `Recent session`, and applies hard section caps before rendering.
-- Session-start startup memory now treats `fact`, `event`, `lesson`, `relationship`, and `reflection` intentionally instead of sending all residual non-core entries into one broad startup bucket.
-- The session-start composer now reserves space for `Memory Index` before rendering `Recent memory`, so aggressive char trimming keeps recovery breadcrumbs whenever they still fit.
-### Fixed
-- OpenClaw session-start dedupe now records only the entry IDs that actually survive selector filtering and prompt rendering, so startup-trimmed entries remain eligible for later mid-session recall.
-- Session-start selector freshness now treats missing or malformed `updated_at` values conservatively as stale instead of accidentally boosting them as fresh.
-- Tightened session-start exact-state detection to require structured operational state such as active branch/worktree/cwd, numbered PR or issue references, concrete path or file state, or explicit config/env/version state.
-- Added a narrow older-continuity escape hatch so importance-7 lessons, relationships, and exact-state or open-thread facts can survive beyond the freshness window when they still materially affect current work.
-- OpenClaw session-start browse candidate assembly now prefetches a small larger pool, caps `reflection` entries before the final 10-slot cutoff, and backfills remaining slots by existing browse score so durable non-reflection entries are not starved upstream of the selector.
-- Tightened OpenClaw session-start selector admission for historical decisions and preferences so high importance now boosts ranking only after current behavioral, exact-state, or real continuity evidence is present.
-- Made session-start `openThread` detection more precise by treating weak narrative words like `next`, `continue`, and `fix` as continuity signals only when they are very recent, while preserving stronger unfinished-work cues.
-- Removed a redundant post-classification cosmetic-preference filter from session-start selection and added regression coverage for stale high-importance decisions and weak versus strong continuity wording.
-## [0.9.74] - 2026-03-07
-### Added
-- Added a shared extractor-side valuation layer with source-aware priors for coding-agent transcripts and README/AGENTS/INSTALL-style repo docs, plus lightweight reason-coded tracing in verbose runs.
-- Added mixed-signal benchmark fixtures for README, AGENTS, INSTALL, repo cartography, and Codex transcript chatter so precision-first ingest regressions can be measured against real traps.
-- Added focused regression coverage for valuation punch-through and suppression behavior, including adversarial setup keyword traps, architecture-keyword cartography traps, release-rationale noise, and plain durable doc-rule fixtures.
-### Changed
-- Extraction-driven ingest now shares the same valuation semantics across watcher and bulk ingest by gating entries centrally before downstream dedup/store cleanup.
-- Tightened extractor valuation so durable constraints, rationale, conventions, and concrete architecture boundaries can still punch through, while weak cues no longer beat doc/profile demotions and procedural or transient negatives on their own.
-- Broadened doc source-profile detection to catch README/AGENTS/INSTALL/runbook/playbook-style filename variants, including benchmark fixture names, so doc-like priors apply consistently instead of falling back to the default profile.
-- Tightened the extraction system prompt to reject source paraphrase, setup/onboarding replay, repo cartography, and operational chatter while still extracting buried durable signal.
-- `agenr benchmark` fixture discovery now includes `.md`, `.markdown`, and `.txt` sources alongside `.jsonl` sessions.
-## [0.9.73] - 2026-03-06
-### Added
-- OpenClaw now persists an explicit per-session project state in the plugin DB and exposes `agenr_set_session_project`, `agenr_get_session_project`, and `agenr_clear_session_project` so the active session can be pinned, inspected, or explicitly cleared without relying on project inference.
-- Session-start OpenClaw recall is now project-aware for this MVP slice: fresh sessions inherit the previous session's explicit project or explicit clear-state, inject that state as authoritative prompt context, and scope startup recall accordingly. This is intentionally limited to explicit session state at session start, not broader inferred project recall.
-- OpenClaw watcher extraction now re-reads explicit session project state on each pass and uses it as the primary attribution source for project-worthy entries, while preserving cleared-state "no project" behavior.
-### Changed
-- `agenr_store` now applies project precedence as `project` passed on the tool call first, then the explicit OpenClaw session project when every entry in the batch is project-worthy, then the configured default project.
-- OpenClaw project attribution now fails closed for ambiguous, personal, or generic entries instead of over-scoping them into the active session project, and the same conservative project-worthiness gate is shared between tool-time and watcher-time attribution.
-### Fixed
-- Fresh-session handoff preservation now keeps an explicit cleared session-project state authoritative, so stale handoff summaries cannot silently re-pin the next session to a previously active project.
-## [0.9.72] - 2026-03-06
-### Fixed
-- Online dedup no longer allows a higher-tier incoming entry (`core > permanent > temporary`) to end in `SKIP` against a lower-tier existing row. Higher-tier `SKIP` outcomes are now overridden to a safe `UPDATE` that preserves existing content while promoting the stored expiry.
-- The near-exact semantic duplicate fast path now blocks `SKIP` for higher-tier incoming entries and only auto-promotes when the subjects still align; otherwise it falls through to a non-skip path instead of swallowing the stronger lifecycle.
-- Added regression coverage for higher-tier `SKIP` overrides in both the store pipeline and the CLI `store` path, while preserving allowed equal-tier and lower-tier `SKIP` behavior.
-- Extended `verify:dist` to fail release builds unless the bundled runtime contains the higher-tier `SKIP` guard and reproduces the live failure shape as an `UPDATE` in the built artifact.
-## [0.9.71] - 2026-03-06
-### Fixed
-- Release builds now clean `dist/` before bundling, so stale hashed chunks with obsolete online-dedup merge logic no longer survive into packed artifacts.
-- Added `prepack` dist verification that fails packaging unless the bundled CLI runtime keeps the higher expiry tier on online-dedup `UPDATE` merges and persists `expiry = ?` in the merge SQL helper.
-- Added CLI store-path regression coverage that seeds a permanent row, stores an incoming core entry through `runStoreCommand`, and verifies the merged row is persisted as `core`.
-## [0.9.70] - 2026-03-06
-### Fixed
-- Online dedup `UPDATE` merges now preserve the higher expiry tier between the stored row and the incoming entry, and persist the reconciled lifecycle to the database.
-- Consolidation merges now preserve the highest expiry tier across source entries in both the LLM merge path and the near-exact keeper update path, and invalid or missing LLM expiry output no longer promotes all-temporary clusters to `permanent`.
-- Consolidation and store row mappers now normalize raw DB expiry values conservatively before applying precedence comparisons.
-- Added regression coverage for expiry-tier precedence across online dedup updates, consolidation merges, consolidation DB-row normalization, and in-memory dedup merging.
-## [0.9.69] - 2026-03-05
-### Fixed
-- OpenClaw session-start memory injection now renders `expiry: "core"` entries in a dedicated `### Core Context` section before the normal recall sections, instead of flattening them into generic memory groups.
-- Session-start memory formatting now preserves the shared memory budget while preventing core entries from being duplicated in the non-core sections.
-## [0.9.68] - 2026-03-05
-### Added
-- Added standalone `agenr synthetic` command to generate synthetic recall signals against existing database entries without re-running ingest.
-- Added `agenr synthetic` options for source-file scoping, similarity/session/event thresholds, dry-run mode, JSON output, and optional post-pass quality seeding.
-- Added command coverage for standalone synthetic execution, dry-run no-write behavior, source-file scoping, and quality seeding flow.
-### Changed
-- `agenr backfill-claims` now runs claim extraction in parallel with a configurable `--concurrency` limit (default `5`), significantly reducing wall-clock time for large databases. The default `--batch-size` also increased from `10` to `50`.
-### Fixed
-- `agenr ingest` now runs synthetic recall generation by default after successful ingest; use `--no-synthetic` to disable the post-ingest synthetic pass.
-- Updated synthetic ingest documentation and tests for the default-on behavior, including explicit `--no-synthetic` disable coverage.
-- Claim extraction during ingest is now disabled by default for better ingest throughput; use `--claims` to opt in, or run `agenr backfill-claims` for bulk claim extraction.
-- Added `agenr ingest --contradiction` to explicitly enable contradiction checks during ingest regardless of config.
-- Clarified and tested ingest contradiction defaults: contradiction checks remain disabled by default unless `config.contradiction.enableDuringIngest === true` or `--contradiction` is passed.
-- OpenClaw plugin `agenr_store` now defaults `platform` to `openclaw` when the tool call omits `platform`, while preserving explicitly passed platform values.
-- Added OpenClaw tool registration tests covering `agenr_store` default platform injection and explicit platform pass-through.
-- Fixed OpenClaw session-start dedupe persistence: `hasSeenSession`/`markSessionSeen` now use a durable `seen_sessions` SQLite table with in-memory map fast-path caching.
-- Added OpenClaw session dedupe diagnostics (`hasSeenSession key=... found=... mapSize=...`) to detect map resets and repeated `isFirst=true` triggers.
-- Added seen-session cleanup and limits for OpenClaw dedupe state (evict rows older than 24 hours and cap persisted rows to `AGENR_OPENCLAW_MAX_SEEN_SESSIONS`).
-- Added OpenClaw regression tests for first/second-hit dedupe behavior, persistence across simulated module reload, stale-row cleanup, and max-row capping.
-## [0.9.67] - 2026-03-05
-### Added
-- Added hybrid mid-session recall presentation in OpenClaw plugin: inject top matches and surface remaining matches as a `[MEMORY CHECK]` subject summary nudge.
-- Added `midSessionRecall.mode` config with `hybrid` (default), `inject` (legacy), and `nudge` modes.
-- Added `midSessionRecall.injectMax` config (default `2`) to control how many entries are directly injected in hybrid mode.
-### Changed
-- Mid-session recall now defaults to `hybrid` mode instead of injecting all fresh matches.
-- Added nudge suppression when the most recent assistant turn already includes an `agenr_recall` tool call.
-## [0.9.66] - 2026-03-04
-### Added
-- Added `agenr recall --index --json` for project-grouped memory index output (`projects`, `totalEntries`).
-- Added `--null-project` as an explicit `agenr recall` filter for NULL-project-only queries.
-- Added OpenClaw plugin memory-index module at `src/openclaw-plugin/memory-index.ts` with dedicated formatter and unit tests.
-- Added pre-store embedding dedup layer for ingest (#474) with 0.90 threshold, sameType gate, event exclusion.
-- Added `survivorEmbeddings` return from pre-store dedup, threaded `EmbeddingCache` through write queue into `storeEntries`.
-- Added `embeddingCache?: EmbeddingCache` to `StoreEntriesOptions`.
-### Changed
-- OpenClaw session-start browse recall now uses stricter universal filters (`--null-project`, `--min-importance 7`) with a 30-day window and a 20-entry cap.
-- OpenClaw session-start memory index is now loaded through `agenr recall --index --json`.
-### Fixed
-- Capped consolidation reflection importance at max source importance (hard cap 8 unless source was 9+) to prevent importance inflation during merges (#472).
-- Added merge prompt guidance: "Merging entries does not make them more important - it makes them more concise."
-- Added retroactive fix to downgrade inflated importance-9 reflections to 7.
-- Fixed consolidate leaving superseded source entries as active; they are now properly retired with reason 'superseded' (#465).
-- Added maintain task to retroactively retire existing superseded-but-active ghost entries.
-## [0.9.65] - 2026-03-04
-### Added
-- Added `agenr memory-index --json [--db <path>]` to return a compact project memory index (`project`, `count`, `lastTouched`).
-- Added `--universal-only` to `agenr recall` to filter recall results to NULL-project entries.
-- Added OpenClaw plugin config `sessionStartBudgetChars` (default `12000`) to cap total session-start injected context.
-### Changed
-- Refactored OpenClaw session-start injection to lean mode: handoff context, universal browse recall only (`project IS NULL`, importance >= 7, limit 10), and memory index summary.
-- Removed session-start semantic seed recall and Phase 2 dedup assembly from `before_prompt_build`.
-- Added a strict session-start hard cap with truncation notice so injected context cannot exceed the configured budget.
-## [0.9.64] - 2026-03-04
-### Fixed
-- Truncated OpenClaw session-start recall entry content to 500 chars by default, with configurable per-entry limits and a total formatted output cap (`maxChars`, default 10000).
-- Truncated OpenClaw mid-session recall entry content to 500 chars by default, with configurable per-entry limits and a total formatted output cap (`maxChars`, default 4000).
-- Reduced OpenClaw session-start browse recall limit from 20 to 12 entries to lower prompt injection size.
-- Wired formatter options through `before-prompt-build` so recall injection limits are explicitly passable for future config overrides.
-- Removed misplaced/dead `[System Message]` handling in `mid-session-recall.ts`: fixed `normalizeBufferedMessage` truncation path and updated `buildQuery` to return an empty query for system-message-prefixed inputs.
-## [0.9.63] - 2026-03-03
-### Added
-- `agenr checkpoint export` - export agent-curated entries to JSONL for brain rebuild preservation
-- `agenr checkpoint import` - re-import checkpointed entries with dedup and recall event re-linking
-- 3-layer export curation: mechanical filters (contradictions, never-recalled) and staleness detection (near-duplicates)
-- Orphaned `recall_events` automatically re-linked to imported entries via `content_fingerprint` matching
-### Fixed
-- Heartbeat polls no longer trigger unnecessary LLM handoff calls (#458)
-## [0.9.62] - 2026-03-03
-### Fixed
-- Restored deleted integration and scoring coverage in `tests/db/recall.test.ts` after the Phase 4 field-removal rewrite.
-- Updated restored recall tests to the `recall_events` model: removed legacy `StoredEntry` recall fields, replaced metadata assertions with `recall_events` checks, and removed `useRecallEvents` and `recallStrength` legacy-path tests.
-- Restored `warmStartThreshold` and `syntheticFloor` coverage in `tests/db/recall-score-metrics.test.ts` and removed legacy compatibility assertions tied to deleted scoring paths.
-### Changed
-- **BREAKING**: Recall scoring now exclusively uses `recall_events` table; legacy entry-level `recall_count`, `last_recalled_at`, and `recall_intervals` fields are no longer read or written
-- `computeSpacingFactor` simplified to use only `maxGapDays` from RecallMetrics
-- `scoreEntry` / `scoreEntryWithBreakdown` now require `metricsMap` parameter (no longer optional)
-- Retirement, health, consolidation, maintenance, and `forgettingScore` queries derive recall activity from `recall_events` table
-- Removed legacy `recallStrength` function (replaced by `recallStrengthFromMetrics`)
-### Removed
-- `scoring.useRecallEvents` feature flag - recall_events is now the sole scoring path
-- `recall_count`, `last_recalled_at`, `recall_intervals` from `StoredEntry` type
-- Dual-write of entry-level recall columns in `updateRecallMetadata`
-- Legacy interval-parsing and imputation in `computeSpacingFactor`
-- In-memory recall_count mutation in CLI recall and MCP server handlers
-## [0.9.61] - 2026-03-03
-### Added
-- **Synthetic cold-start signals** (#417 Phase 3): `agenr ingest --bulk --synthetic` now runs a post-ingest cross-session mention analysis pass. Entries that appear across multiple session transcripts receive synthetic recall events so they start with meaningful recall profiles instead of flat zero-history signals.
-- `--synthetic` and `--synthetic-dry-run` flags for `agenr ingest`.
-- Quality score seeding from synthetic recall signals with four tiers (`0.6`, `0.65`, `0.7`, `0.8`) based on distinct session count and temporal spread. Seeding is guarded so only entries still at `quality_score = 0.5` are updated.
-### Improvements
-- Synthetic event generation writes flat `signal_value = 0.4` per event (frequency is carried by event count), caps ANN fan-out at top-5 neighbors per entry, and uses chunked transaction batches with `INSERT OR IGNORE` idempotency on the existing synthetic dedup index.
-## 0.9.60 (2026-03-03)
-### Features
-- feat(recall): add Phase 2 scoring switchover to read recall signals from `recall_events` behind `scoring.useRecallEvents` (#417)
-- feat(recall): add batch `RecallMetrics` aggregation (`getRecallMetricsBatch`) with chunked SQL + max-gap window query
-- feat(recall): add `recallStrengthFromMetrics` warm-start blending and live-first recency anchor (`lastLiveRecalledAt` fallback chain)
-- feat(recall): thread metrics-aware scoring through primary recall and cluster/graph/relation expansion paths
-### Tests
-- test(recall): add unit coverage for `getRecallMetricsBatch` counts, timestamps, signal sum, max-gap, and chunking
-- test(recall): add unit coverage for metrics-based recall strength, spacing override, and metrics-map scoring behavior
-- test(recall): add integration coverage for `useRecallEvents` on/off behavior, no-events fallback, session-start path, spacing override, and expansion candidates
-## 0.9.59 (2026-03-03)
-### Features
-- feat(schema): add `recall_events` table for durable recall event storage (#417)
-- feat(schema): application-level trigger to orphan recall events on entry deletion
-- feat(migration): migrate existing recall_count/recall_intervals data to recall_events
-- feat(recall): dual-write recall events on every recall alongside entry-level cache
-- feat(merge): re-link recall events during both LLM merge and rules-based merge
-### Tests
-- test(schema): recall_events table creation and indexes
-- test(migration): epoch-to-ISO conversion, migrated vs migrated_approx tagging, sentinel guard
-- test(trigger): entry deletion orphans recall events
-- test(recall): updateRecallMetadata inserts recall_events with correct fields
-- test(merge): recall events re-linked for both merge paths, fingerprint preserved
-## 0.9.58 (2026-03-03)
-### Improvements
-- Rewrite reflection synthesis prompt for fact preservation - LLM now preserves all numbers, versions, and paths; can reject incoherent clusters; anti-editorializing rules added (#443)
-## 0.9.57 (2026-03-03)
-### Improvements
-- Semantic coherence gate on cluster formation - clusters below 0.45 average pairwise cosine similarity are rejected to prevent cross-domain contamination in reflections (#441)
-## 0.9.56 (2026-03-03)
-### Bug Fixes
-- Deterministic importance inheritance for reflections - importance is now computed from source entries instead of hardcoded to 8 (#442)
-## 0.9.55 (2026-03-03)
-### Scoring Improvements
-- Fix session-start candidate selection to include importance-ordered entries, not just recency (#audit-R1)
-- Replace relative normalization with log-scale absolute thresholds in quality evolution (#audit-Q2)
-- Add cold-start awareness to quality blend ratio - sparse usage data no longer drags scores down (#audit-Q1/Q3)
-- Gentler decay penalty for never-recalled entries in quality evolution (#audit-Q4)
-- Drop SIGNAL_UNCLEAR from 0.4 to 0.2 for faster quality decay on unused entries (#audit-F1)
-- Increase feedback EMA alpha from 0.2 to 0.3 for more responsive quality updates (#audit-F2)
-- Add quality_score as factor in retirement forgettingScore (#audit-F6)
-- Add confirmation bonus (up to +10%) to recall scoring (#audit-R4)
-- Source-weighted initial quality: watcher entries start at 0.4, manual stores at 0.6 (#audit-F5)
-- Wire all new scoring parameters into ScoringConfig for config-only tuning
-## 0.9.54 (2026-03-03)
-### Bug Fixes
-- Fix checkpoint not being deleted after successful consolidation when merges occur (#434)
-- Add allPhasesCompleted tracking to prevent deferredPhase3Work from incorrectly persisting checkpoints
-## 0.9.53 (2026-03-03)
-### Performance
-- Consolidation now tracks changes per entry type, skipping dedup for unchanged types (#430)
-- Add 30-minute minimum entry age filter to prevent active session ingestion from triggering wasteful consolidation runs
-- Reduces LLM calls during cron when only some entry types have new data
-## 0.9.52 (2026-03-03)
-### Bug Fixes
-- Fix consolidation checkpoint not being deleted after successful completion (#431)
-- Fix toErrorMessage losing context for non-Error objects (#365)
-- Fix mapBufferToVector throwing on unaligned ArrayBufferView byteOffset (#366)
-- Fix resolveUserPath incorrectly expanding ~username paths (#367)
-- Fix isHeartbeatPoll not detecting heartbeat prompts, causing unnecessary mid-session recall
-- Fix resolveUserPath not handling Windows-style backslash paths (CodeRabbit)
-### Improvements
-- Add progress counter to reflection task during maintain (#404)
-## 0.9.51 (2026-03-03)
-### Bug Fixes
-- Fix runTraceTool passing unrecognized --json flag to trace CLI (#426)
-### Performance
-- Consolidation now skips entirely when no entries changed since last successful run (#427)
-- Uses watermark pattern (same as edge-decay) to detect entry changes
-- Reduces cron consolidation from ~10 minutes to <1 second when brain is stable
-- Checkpoint-aware: always resumes interrupted runs regardless of watermark
-## 0.9.50 (2026-03-02)
-### Features
-- agenr_trace tool for entry provenance (#386)
-  - New `agenr trace <entry_id>` CLI command
-  - New `agenr_trace` OpenClaw plugin tool
-  - Traces reflections back to source entries, and entries forward to reflections
-  - Also shows merge provenance from consolidation
-  - Supports lookup by entry_id, subject match, or --last flag
-  - New `src/db/provenance.ts` module with `getEntryProvenance()` and `findEntryBySubject()`
-## 0.9.49 (2026-03-02)
-### Features
-- Stale reflection detection and invalidation during maintain (#418)
-  - New `stale-reflections` maintain task runs before reflection synthesis
-  - Detects reflections with >50% retired or updated source entries
-  - Retires stale reflections automatically, freeing clusters for re-synthesis
-  - Pure DB queries (no LLM required), respects --dry-run and --apply flags
-  - New DB helpers: getActiveReflections, getSourceEntryStalenessCounts
-### Bug Fixes
-- Stronger reflection demotion in recall scoring (#410)
-  - Default REFLECTION_DEMOTION lowered from 0.85 to 0.70
-  - Cluster bonus no longer applied to reflection entries (breaks circular boost)
-  - Reflection entries excluded from cluster peer expansion
-- Break recall scoring feedback loop (#414)
-  - recallStrength capped at 0.5 (was 1.0) to prevent recall from dominating scores
-  - Removed RECALL_MILESTONES automatic importance promotion (counts 3, 10, 25)
-  - Importance now only changes via explicit updates or re-extraction
-### Tests
-- 5 new stale-reflections task tests (retired/updated majority, healthy, dry-run, no sources)
-- 3 new DB-layer tests for staleness detection helpers
-- 2 new cluster bonus tests for reflection skip behavior
-- Updated recall scoring tests for new defaults and removed milestones
-- Updated maintain task ordering tests for new stale-reflections task
-## 0.9.48 (2026-03-02)
-### Features
-- Reflections now inherit and synthesize tags from their source entries (#388)
-  - Tags appearing on 2+ source entries are included (all tags for small clusters)
-  - "reflection" tag always prepended for identifiability
-  - Capped at 10 tags per reflection, sorted by frequency
-  - Fixes FTS/tag-based recall blindness for all reflection entries
-### Bug Fixes
-- Cap freshness bonus at 1.0 for reflection entries to prevent synthesized
-  summaries from crowding out raw entries in recall results (#410)
-### Tests
-- 8 new tests for reflection tag inheritance (DB queries + task integration)
-## 0.9.47 (2026-03-02)
-### Bug Fixes
-- Added reflection type demotion in recall scoring (default 0.85x) to prevent reflections from crowding out specific raw entries in recall results
-- Added `scoring.reflectionDemotion` config knob (range 0-1) with normalization and recall score breakdown visibility
-## 0.9.46 (2026-03-02)
-### Improvements
-- Rewrote reflection verification prompt to evaluate synthesis quality instead of completeness (#411)
-  - Replaces fact-checker framing with synthesis quality reviewer
-  - Checks accuracy (misrepresentation) separately from theme coverage
-  - Omissions are expected and no longer penalized
-- Lowered default reflection coverage threshold from 0.7 to 0.5
-- Added parse error retry with JSON-only nudge and permissive fallback
-- Parallelized reflection cluster processing in configurable batches to reduce maintain runtime overhead (#409)
-  - Adds `maintain.reflectionConcurrency` (default `5`) for batch size control
-  - Moves per-cluster work into a standalone `processCluster` flow
-  - Adds per-batch progress logging with estimated minutes remaining
-  - Adds SQLITE_BUSY retry/backoff for reflection write transactions under concurrent load
-### Bug Fixes
-- Bumped OpenClaw plugin store timeout from 10s to 30s to prevent agenr_store timeouts with larger brains (#412)
-- Eliminated ~25% false negative verification failures caused by prompt bias
-## 0.9.45 (2026-03-02)
-### Features
-- Reduced default min cluster size from 3 to 2 for better pair clustering (#402)
-- Added subject-match clustering phase for entries sharing exact subjects (#402)
-- Lowered default cosine similarity threshold from 0.72 to 0.68 (#402)
-### Bug Fixes
-- `maintain --full` now bypasses cluster freshness guard (#399)
-## 0.9.44 (2026-03-02)
-### Features
-- Make recall scoring parameters configurable via `scoring.*` config keys (#392)
-  - Allows tuning recall weights, boosts, and thresholds without code changes
-  - Threads scoring config through recall and session-start recall paths
-- Add graph recall config knobs: graphBonus, graphSeedCount, graphNeighborLimit, graphMinEdgeWeight, graphMinSeedVectorSim (#392)
-- Suppress demoted source entries when their reflection is in recall results (#389)
-  - Prevents redundant results when both a reflection and its sources match a query
-  - Backfills results to maintain requested limit after suppression
-## 0.9.43 (2026-03-02)
-### Features
-- Embedding-based clustering fallback for orphan entries (#387)
-  - Assigns orphans to nearest existing cluster by embedding similarity
-  - Forms new clusters from orphans with similar embeddings
-  - Configurable similarity threshold (default 0.72), max cluster size (default 50)
-  - New `method` column on clusters table (`co_recall` | `embedding`)
-  - Reports fallback stats in detectAndStoreClusters result
-### Bug Fixes
-- Replace sigmoid scoring compression with sqrt for better score differentiation (#390)
-  - Old formula squashed all scores into 0.22-0.26 band (0.033 spread)
-  - New sqrt compression gives 0.50-0.67 range (0.17 spread, 5x improvement)
-  - FTS bonus moved from additive post-compression (0.15) to multiplicative pre-compression (1.3x)
-  - Importance/recall blend (70/30) replaces MAX - recall history now always contributes
-- Apply --limit flag to reflection task cluster count (#385)
-- Tighten extraction importance calibration to reduce inflation (#391)
-  - Added stronger downward pressure in calibration text
-  - Rebalanced examples: two 8s lowered to 7, added importance-6 example
-  - Production showed 30.7% of entries at 8+ vs 20% target
-### Tests
-- 8 new tests for embedding fallback clustering
-- 3 new tests for scoring (spread, FTS proportionality, recall contribution)
-- 1 new test for reflection limit enforcement
-## 0.9.42 (2026-03-02)
-### Features
-- **Periodic reflection and synthesis** (#269): New `reflection` maintain task that
-  synthesizes clusters of related entries into higher-level reflections using
-  two-step LLM prompting (question generation + synthesis) with a verification pass.
-  Source entries are importance-demoted (not retired) so raw knowledge remains
-  accessible. Co-recall edges are transferred to reflection entries for graph
-  continuity.
-- New entry type: `reflection` for synthesized knowledge
-- New `reflections` table for process metadata
-- Extended `entry_sources` with `action` column for unified provenance tracking
-- New `--full` flag for full-brain reflection (ignores change detection)
-- Configurable: reflection model, verification model, coverage threshold, demotion
-  amount, importance threshold, cluster size minimum
-- Re-synthesis guards: minimum cumulative importance threshold for changed entries
-### Bug Fixes
-- Fix parseSynthesisResponse bailing on first JSON candidate instead of trying alternatives
-- Fix verification prompt missing explicit check instructions
-- Fix demoteEntryImportance/retireReflectionEntry using wall clock instead of injected time
-- Fix transferCoRecallEdges silently dropping stronger edges (now uses MAX weight)
-- Fix verification ignoring the verified boolean (only checked coverage_score)
-- Extract shared JSON parsing utilities to reduce duplication
-- Add cluster churn detection for re-synthesis safety
-## 0.9.41 - 2026-03-01
-### Fixed
-- Recall score saturation: sigmoid compression + multiplicative boosts preserve ranking discrimination (#341)
-- Classifier now recognizes bare numbers, issue refs (#380), and version patterns (v0.9.22) as entities (#331)
-- Skip expensive recall and context injection when session-start is triggered by heartbeat poll (#340)
-- Extraction prompt now skips hypothetical examples, test data, and mock entities (#283)
-- Extraction prompt now skips meta-conversation about the knowledge base itself (#248)
-### Tests
-- Score saturation: ranking preservation tests, updated ratio assertions to ordering
-- Classifier: numeric reference detection with positive and negative cases
-- Heartbeat detection: pattern matching and session skip verification
-- Extraction prompt: anti-pattern content validation
-## 0.9.40 - 2026-03-01
-### Fixed
-- Add HH:MM:SS timestamps to all createLogger output lines (#380)
-### Tests
-- Updated logger tests to validate timestamp format and ranges
-## 0.9.39 - 2026-03-01
-### Changed
-- Migrated colocated test files out of `src/` into `tests/` with mirrored folder structure and updated relative imports to target `src/` from their new locations.
-- Merged colliding test suites into existing `tests/` files for:
-  - `openclaw-plugin/index`
-  - `openclaw-plugin/recall`
-  - `db/store`
-- Moved the large `src/commands/init.test.ts` suite to `tests/commands/init-src.test.ts` to preserve Vitest mock isolation while removing the `src/` colocated test file.
-- Removed `src/llm/__tests__/` after moving `stream-registry.test.ts` to `tests/llm/`.
-## 0.9.38 - 2026-03-01
-### Changed
-- Refactored `src/openclaw-plugin/index.ts` to decompose `register()` into thin wiring and move hook logic into new modules under `src/openclaw-plugin/hooks/`:
-  - `before-prompt-build.ts`: `handleBeforePromptBuild()` for session-start recall, mid-session recall, signals, and store nudging.
-  - `before-reset.ts`: `handleBeforeReset()` for feedback, quality checks, and reset handoff handling.
-  - `register-tools.ts`: `registerAgenrTools()` for `agenr_recall`, `agenr_store`, `agenr_extract`, and `agenr_retire` registration.
-  - `types.ts`: shared hook parameter interfaces.
-- Preserved existing plugin exports and behavior by keeping `__testing` in `index.ts` and passing explicit params into extracted handlers.
-- Added an explicit hook-wiring test to verify `register()` still registers `before_prompt_build`, `before_reset`, and `command`.
-## 0.9.37 - 2026-03-01
-### Added
-- Added `src/utils/logger.ts` with a shared `createLogger` factory and global verbose controls (`setVerbose`, `isVerbose`) to standardize stderr logging output.
-- Added `tests/utils/logger.test.ts` coverage for logger method shape and stderr output behavior for `info`, `warn`, `error`, and `debug`.
-### Changed
-- Replaced ad-hoc logging in targeted modules with the shared logger factory, including maintain helpers, OpenClaw plugin debug/handoff paths, DB conflict and contradiction flows, store pipeline diagnostics, claim extraction diagnostics, whole-file and extractor warnings, shutdown logging, and selected command-level logging paths.
-- Updated maintain LLM logging to use a dedicated `maintain-llm` logger prefix instead of raw prefixed strings.
-## 0.9.36 - 2026-03-01
-### Changed
-- Split `src/cli-main.ts` into focused command registration modules under `src/cli/`:
-  - `helpers.ts`: shared CLI helpers (`stderrLine`, `assertReadableFile`, `createEmptyStats`, `formatTaskModelLines`, `toReportKey`)
-  - `extract.ts`: extract types (`ExtractCommandOptions`, `CliDeps`), `runExtractCommand`, and extract command registration
-  - `store.ts`, `recall.ts`, `retire.ts`, `review.ts`, `edges.ts`, `eval.ts`, `watch.ts`, `todo.ts`, `ingest.ts`, `benchmark.ts`, `consolidate.ts`, `maintain.ts`, `conflicts.ts`, `backfill.ts`, `init.ts`, `mcp.ts`, `watcher.ts`, `db.ts`, `setup.ts`, `config.ts`, `auth.ts`: command-specific registration builders
-- Slimmed `src/cli-main.ts` to a composition root that wires `register*Command` functions and keeps the root status action inline.
-- Preserved `src/cli-main.ts` public extract exports by re-exporting `ExtractCommandOptions`, `CliDeps`, and `runExtractCommand` from `src/cli/extract.ts`.
-- Added CLI registration coverage test to verify `createProgram()` registers all expected top-level commands.
-## 0.9.35 - 2026-03-01
-### Changed
-- Split `src/db/recall.ts` into focused modules under `src/db/recall/`:
-  - `index.ts`: recall orchestration and public exports
-  - `types.ts`: recall types and shared defaults
-  - `score.ts`: recall scoring and recency math
-  - `filters.ts`: recall filter helpers
-  - `candidates.ts`: SQL candidate and FTS retrieval
-  - `graph.ts`: co-recall graph neighbor expansion
-  - `cluster.ts`: cluster peer expansion
-  - `metadata.ts`: recall metadata update queue and persistence
-  - `helpers.ts`: stored-entry mapping, tags, and recall text helpers
-- Updated all source and test imports to use `src/db/recall/index.ts`.
-- Added recall export coverage test to verify core public recall exports remain available after the split.
-## 0.9.34 - 2026-03-01
-### Changed
-- Split `src/commands/maintain.ts` into focused modules under `src/commands/maintain/`:
-  - `index.ts`: maintain orchestration and task registry wiring
-  - `types.ts`: shared maintain types/constants
-  - `helpers.ts`: option parsing, task execution, and shared utility helpers
-  - `report.ts`: maintain run reporting/rendering
-  - `history.ts`: maintain history rendering and history command
-  - `tasks/clusters.ts`, `tasks/quality.ts`, `tasks/edge-decay.ts`, `tasks/conflicts.ts`, `tasks/consolidation.ts`, `tasks/retirement.ts`, `tasks/snapshot.ts`: task executors and task-specific helpers
-- Updated maintain imports in CLI and tests to `src/commands/maintain/index.ts`.
-- Added maintain export coverage test to verify the public command API and type exports remain available after the split.
-## 0.9.33 - 2026-03-01
-### Changed
-- Split `src/commands/ingest.ts` into focused modules under `src/commands/ingest/`:
-  - `index.ts`: ingest command orchestration and reporting
-  - `file-resolver.ts`: input glob and file resolution helpers
-  - `bulk-store.ts`: bulk-mode dedup and store pipeline
-  - `target-processor.ts`: per-file extract/store processing and watch offset sync
-  - `pipeline.ts`: worker pool, first pass, and retry orchestration
-  - `progress.ts`: progress rendering and byte formatting helpers
-  - `helpers.ts`: ingest-specific retry, cleanup, and ingest-log helpers
-- Updated ingest imports across CLI, command runtime, and tests to use `src/commands/ingest/index.ts`.
-- Added an ingest export coverage test to verify the public `runIngestCommand` API and type exports remain available after the split.
-### Fixed
-- Ingest bulk mode now reports `StoreResult.total_entries` correctly as `added + skipped`.
-- Ingest command now initializes `embeddingApiKey` without a redundant always-true guard.
-- Progress rendering now uses ANSI clear-line (`\x1b[2K\r`) so shorter updates do not leave trailing characters.
-- File resolver now treats `[` and `]` as literal characters and fails fast on missing file paths with explicit `File not found` or `Directory not found` errors.
-- Force re-ingest and failed-file cleanup delete sequences now run inside transactions with rollback on failure.
-- Ingest worker loop now catches unexpected per-target exceptions and records file-level errors instead of aborting the whole pass.
-- Bulk ingest teardown now preserves both pipeline and cleanup errors by attaching cleanup failure as cause when both happen.
-- Chunk-failure counters now avoid double-counting the same file across retries by applying per-file deltas.
-## 0.9.32 - 2026-03-01
-### Changed
-- Split src/openclaw-plugin/index.ts (1,858 lines) into focused modules
-  - session-handoff.ts: transcript building, LLM summarization, handoff orchestration
-  - session-state.ts: session tracking, LRU, signal state
-  - plugin-db.ts: plugin database lifecycle
-  - index.ts: hook wiring and tool registration
-## 0.9.31 - 2026-03-01
-### Changed
-- Split src/db/store.ts (1,908 lines) into 4 focused modules under src/db/store/
-  - queries.ts: DB query helpers, hashing, similarity search, entry insertion
-  - online-dedup.ts: LLM-based online dedup pipeline
-  - planner.ts: entry action planning, mutation application, subject resolution
-  - index.ts: storeEntries orchestration and public API
-## 0.9.30 - 2026-03-01
-### Changed
-- Split src/extractor.ts (2,569 lines) into focused modules under src/extractor/
-  - parser.ts: entry validation, coercion, and schema mapping
-  - dedup.ts: LLM dedup pipeline and batch processing
-  - prefetch.ts: related entry pre-fetching and blocked subjects
-  - chunk-runner.ts: chunk extraction with retry logic
-  - debug.ts: extraction debug logging
-  - index.ts: pipeline orchestration and public API
-- Centralized all LLM prompts under src/prompts/ (extraction, maintain, consolidate, handoff)
-## 0.9.29 - 2026-03-01
-### Changed
-- Extracted shared utilities into dedicated modules: `parsePositiveInt`, `resolveUserPath`, vector math, sleep, `toErrorMessage`, and `isRecord`.
-- Deduplicated copy-pasted utility functions across 30+ files.
-- Fixed imports for `toNumber`, `toStringValue`, and `MILLISECONDS_PER_DAY` to use existing shared exports.
-## 0.9.28 (2026-03-01)
-### Added
-- `agenr maintain` now includes an opt-in `edge-decay` mechanical task that
-  attenuates co-recall edge weights over time when `maintain.edgeDecay: true`
-  is set in config. Runs with `--skip-llm`. (#270)
-- Added `--prune-edges` flag to `agenr maintain` for explicit pruning of
-  co-recall edges below the configured decay floor. Pruning is no longer
-  automatic. (#270)
-- Added `maintain` config section support in `AgenrConfig` with
-  `edgeDecay`, `edgeDecayFactor`, `edgeDecayFloor`, and `clusterStaleHours`.
-  (#270)
-- LLM-powered conflict resolution task in maintain command.
-- `maintain` model task added to config (for conflict resolution,
-  consolidation, retirement).
-- Shared LLM call wrapper for maintenance tasks.
-- Auto-resolve for high-confidence conflicts, recommend-only for ambiguous.
-- `--apply` flag overrides both confidence threshold and high-importance (>=9)
-  protection for conflict resolution.
-- Per-conflict atomic commits with failure tracking.
-- Consolidation task in maintain command wrapping existing consolidation
-  infrastructure. Default mode: dry-run assessment. With `--apply`: runs
-  full LLM-driven cluster merging via orchestrator. (#270)
-- Retirement task in maintain command using forgettability scoring
-  (`1 - retentionScore`). LLM confirms each retirement with `--apply`.
-  High-importance entries (>=9) require `--apply`, importance 10 always
-  protected. (#270)
-- Retirement prompt template for LLM-confirmed entry retirement decisions.
-- Co-recall edge cleanup on retirement (matches retireEntries behavior).
-- SQL pre-filters and LIMIT 5000 safety cap on retirement candidate query.
-- `--cron` flag for scheduled/unattended maintenance runs with
-  min-interval guard and quiet output. (#270)
-- Config fields: `consolidationSimThreshold`, `consolidationLimit`,
-  `retirementLimit`, `retirementMinAgeDays`, `retirementMinForgettingScore`.
-### Changed
-- Consolidation similarity threshold default lowered from 0.9 to 0.76 (matching
-  standalone `agenr consolidate` defaults). Floor lowered from 0.8 to 0.7. (#270)
-- `decayCoRecallEdges()` now supports floor, cutoff-date, and prune mode
-  options, returns full decay stats, and clamps low weights to floor by
-  default instead of deleting them. (#270)
-- Edge decay safety guards added to maintain task flow:
-  sabbatical guard (no recall in 14 days), reinforcement exemption (skip
-  edges reinforced since last decay), and idempotency watermark (skip if last
-  decay was under 20 hours ago). (#270)
-- Maintain task order updated to: quality -> edge-decay -> clusters ->
-  conflicts -> consolidation -> retirement -> snapshot. (#270)
-- `--limit` flag now controls all LLM tasks (conflicts, consolidation,
-  retirement), not just conflicts.
-- Maintain command description updated to reflect all available tasks.
-- Verbose LLM debug logging for maintain conflict resolution, retirement, and
-  consolidation tasks (model name, prompt preview, response preview, token
-  usage, parsed decisions). Uses `[AGENR:maintain-llm]` prefix.
-- Migrated maintain log prefixes to `[AGENR:tag]` convention:
-  `[MAINTAIN]` -> `[AGENR:maintain]`,
-  `[CONFLICT-RESOLVE]` -> `[AGENR:conflict-resolve]`,
-  `[CONFLICT-LOG]` -> `[AGENR:conflict-log]`.
-- `--cron` skip message now outputs valid JSON when `--json` is also passed.
-- `--cron` help text now mentions the 55-minute minimum interval guard.
-### Fixed
-- Retirement scoring uses forgettability (`1 - retentionScore`) to correctly
-  target stale entries instead of inverting polarity. (#270)
-- Retirement timestamps use injected `context.now` instead of SQLite
-  `datetime('now')` for testability.
-- Conflict task: resolve errors tracked separately from LLM errors.
-- Conflict task: failure count lookback filters for runs that included
-  conflicts task.
-- Conflict task: corrupted `summary_json` in previous runs handled
-  gracefully.
-- Config: `autoResolveConfidence` floor enforced at normalization level
-  (0.5).
-- Config: `retirementMinForgettingScore` floor enforced at 0.1.
-## 0.9.27 (2026-03-01)
-### Added
-- Programmatic conflict resolution API (`src/db/conflict-resolution.ts`) with
-  `resolveConflict()`, `getConflictWithEntries()`, and
-  `getPendingConflictsWithEntries()` functions. Enables automated conflict
-  resolution by the upcoming `agenr maintain` command (#270). (#270)
-- `resolution_reasoning` column on conflict_log table for storing why a
-  conflict was resolved (human or automated reasoning). (#270)
-- `resolved_by` column on conflict_log table for tracking resolution source
-  (user, auto, janitor). (#270)
-- Debug logging with `[CONFLICT-RESOLVE]` and `[CONFLICT-LOG]` prefixes
-  for conflict resolution pipeline observability.
-### Changed
-- Conflict resolution logic extracted from conflicts-ui.ts into reusable
-  conflict-resolution.ts module. UI behavior unchanged. (#270)
-## 0.9.26 (2026-02-28)
-### Added
-- agenr clusters command - discovers topic clusters from the co-recall graph
-  using label propagation community detection. Supports --detect, --detail,
-  --json, and --min-size flags. (#303)
-- Cluster-aware recall boost - entries from the same cluster as top recall
-  results receive a 0.10 score bonus, improving topical coherence of
-  recall results. Works alongside the existing co-recall graph boost. (#303)
-- Clusters summary section added to agenr health output showing total
-  clusters, largest cluster, and orphan entry count. (#303)
-- New clusters table and entries.cluster_id column for persistent
-  cluster storage. (#303)
-### Bug Fixes
-- Cluster-aware recall now batches cluster peer lookups into a single
-  fetch pass, avoiding per-peer N+1 entry queries. (#303)
-- Cluster-aware recall now applies the cluster bonus symmetrically to
-  existing scored entries in seed clusters and records `scores.cluster`
-  for both existing and injected entries. (#303)
-- Cluster detection now returns an explicit convergence flag, and the
-  `agenr clusters` command surfaces a warning when propagation does not
-  converge within the iteration cap. (#303)
-- Re-detection now clears `entries.cluster_id` only for rows that are
-  currently clustered, avoiding unnecessary full-table updates. (#303)
-- Cluster detection persistence now uses `db.transaction("write")` so
-  cluster resets, inserts, and member assignments run atomically on a
-  single libsql transaction connection. (#303)
-- Cluster peer lookup now excludes retired entries before applying limits,
-  preventing retired rows from consuming cluster boost peer slots. (#303)
-- `agenr clusters --min-size` now fails fast on invalid values instead of
-  silently defaulting, with a descriptive positive-integer error message. (#303)
-## 0.9.25 (2026-02-28)
-### Bug Fixes
-- Health command now reads conflict data from conflict_log table instead of
-  the never-incremented entries.contradictions column. Shows total conflicts,
-  breakdown by relation type, pending count, and auto-resolved count. (#334)
-- High-confidence coexists conflicts (>0.8) now auto-resolve instead of being
-  flagged for review. Previously, any coexists involving decision or lesson
-  types was flagged regardless of confidence, creating a false backlog. (#335)
-- OpenClaw plugin: system messages (e.g. subagent completions) now classify
-  as trivial for mid-session recall, preventing wasted embedding API calls
-  on garbage queries containing session IDs and boilerplate text.
-## 0.9.24 (2026-02-28)
-### Bug Fixes
-- Added storeNudge to openclaw.plugin.json configSchema so OpenClaw accepts
-  storeNudge config without validation errors
-## 0.9.23 (2026-02-28)
-### Added
-- `agenr db evolve-quality` command - computes quality scores from recall frequency,
-  co-recall graph connectivity, confirmations, importance, and time-based decay
-- Supports --dry-run and --json flags
-- Quality scores now differentiate entries instead of sitting static at 0.5
-- Bumped default store nudge maxPerSession from 3 to 5 for better coverage in longer sessions
-## 0.9.22 (2026-02-28)
-### Bug Fixes
-- OpenClaw plugin: added missing pronouns (You, She, They) to
-  FALSE_POSITIVE_NOUNS so short messages like "You there?" correctly
-  classify as trivial instead of triggering recall. (#329)
-### Improvements
-- OpenClaw plugin: standardized all log output with `[AGENR:tag]`
-  prefix for easy filtering from OpenClaw internal logs. (#329)
-## 0.9.21 (2026-02-28)
-### Improvements
-- OpenClaw plugin: simplified mid-session recall from three-tier
-  classification (trivial/normal/complex) to two-tier (trivial/recall).
-  All non-trivial messages now use a single recall limit of 8, eliminating
-  fragile regex-driven classification that missed edge cases like
-  "What's the status of X?" being classified as normal instead of complex.
-  Config fields normalLimit and complexLimit are deprecated in favor of
-  a single limit field. (#326)
-## 0.9.20 (2026-02-28)
-### Bug Fixes
-- OpenClaw plugin: mid-session recall query now uses the raw current
-  message instead of accumulating a sliding window with stop-word
-  stripping. Fixes garbled queries that returned irrelevant context.
-- OpenClaw plugin: tightened message classifier to avoid marking
-  conversational acks as complex. Messages with no entities, temporal
-  patterns, or recall phrases now correctly classify as normal or
-  trivial regardless of length.
-- OpenClaw plugin: expanded trivial phrase list and added word-count
-  gate so low-signal conversational messages skip recall entirely.
-- OpenClaw plugin: recall queries are capped at 200 chars to prevent
-  large data pastes from producing oversized embedding queries. (#323)
-## 0.9.19 (2026-02-28)
-### Features
-- OpenClaw plugin: store nudging - injects a system nudge when the
-  agent has not called agenr_store in 8+ turns (configurable via
-  storeNudge plugin config). Nudges are spaced by the threshold
-  interval, capped at 3 per session. (#290)
-## 0.9.18 (2026-02-28)
-### Features
-- OpenClaw plugin: subsequent-turn auto-recall with heuristic message
-  classifier. Messages are classified as trivial (skip), normal (5 results),
-  or complex (8 results) based on entity detection, temporal references,
-  and explicit recall phrases. Queries built from a sliding window of
-  recent messages with Jaccard similarity dedup. Recalled entries are
-  deduplicated against session-start context. Configurable via
-  midSessionRecall plugin config.
-### Bug Fixes
-- MCP: fix stdout corruption during store and contradiction checks.
-  Diagnostic logs in db/store.ts and db/contradiction.ts were writing to
-  stdout via console.log, corrupting MCP JSON-RPC framing. Routed all
-  diagnostic logging to stderr via console.error.
-### Changed
-- Removed agenr_store tool from MCP server. Coding agents should rely on
-  Watcher for knowledge ingest from session transcripts. Reduces MCP
-  surface area and eliminates stdout corruption risk.
-- Removed store option from agenr_extract MCP tool. Extract now returns
-  extracted entries without storing them.
-### Tests
-- 16-case message classifier test suite
-- Query builder, similarity check, and state management tests
-- Integration tests for mid-session recall in before_prompt_build
-- MCP stdout corruption test
-- Updated MCP server tests for removed store tool and extract store option
-## 0.9.17 - 2026-02-27
-### Changed
-- Optimized LLM dedup in consolidate clustering: batch up to 10 pairs per API
-  call with 5 concurrent batches. Reduces a 2400-pair dedup queue from ~60min
-  (sequential, 1 call per pair) to ~2min.
-## 0.9.16 - 2026-02-27
-### Added
-- Added progress logging throughout the consolidate pipeline. Pairwise
-  similarity scan, rules phases, cluster processing, LLM dedup checks, and
-  LLM merge calls now report progress so users can see the system is working.
-  Phase-level progress logs are always shown (not gated behind `--verbose`).
-- Added live cluster progress updates with ETA during consolidation phases so
-  long-running Phase 1, Phase 2, and Phase 3 work shows continuous terminal
-  activity.
-## 0.9.14 - 2026-02-27
-### Fixed
-- Fixed a pi-ai dual module registry bug that caused bundled `streamSimple`
-  calls and consolidate LLM dedup pre-screening to silently fail. `tsup`
-  split pi-ai imports across separate ESM instances with separate API
-  registries; `runSimpleStream` now imports from
-  `@mariozechner/pi-ai/dist/stream.js` so provider registration and stream
-  lookup use the same registry instance.
-- Reverted the 0.9.13 workaround that made consolidate loose-band LLM dedup
-  opt-in only when `--loose-threshold` was set.
-## 0.9.13 - 2026-02-27
-### Changed
-- LLM dedup pre-screening in consolidate is now opt-in via --loose-threshold.
-  Without it, consolidate uses subject-aware auto-union in the loose band but
-  skips LLM calls, avoiding potentially hundreds of sequential API round-trips.
-## 0.9.12 - 2026-02-27
-### Fixed
-- Register pi-ai API providers before streamSimple calls so consolidate
-  LLM dedup and merge can resolve the OpenAI/Anthropic streaming backend.
-  Root cause: tsup tree-shook the side-effect import; now uses explicit
-  registration via ensureApiProviders() guard.
-- Added 15s timeout to LLM dedup pre-screening calls in consolidate clustering
-  to prevent hangs from unresponsive LLM endpoints.
-## 0.9.10 - 2026-02-27
-### Added
-- Ingest creates co-recall edges between entries extracted from the same
-  session file, seeding the graph for graph-augmented recall after fresh
-  installs or DB resets (#300).
-- Ingest backfills co-recall edges for already-ingested files on re-ingest
-  without requiring --force (#300).
-- Consolidation clustering now uses in-memory pairwise cosine similarity
-  instead of per-entry SQLite vector queries, eliminating O(N) database
-  round-trips (#263).
-- Consolidation clustering now supports a loose similarity band with
-  subject-aware auto-union and optional LLM pre-screening to catch
-  semantically equivalent entries below the tight cosine threshold (#264).
-- Consolidation reports now include loose-band LLM dedup pre-screen call and
-  match counts across phases (#264).
-## 0.9.9 - 2026-02-27
-### Added
-- Graph-augmented recall: top embedding matches seed 1-hop traversal of
-  co-recall edges, pulling in associatively connected entries that similarity
-  alone would miss. Graph neighbors are scored with real embedding similarity
-  plus an additive graph bonus (0.15 * edge weight). Seeds are selected by
-  embedding similarity only to prevent recency bias (#297).
-## 0.9.8 - 2026-02-27
-### Added
-- Co-recall edge creation (Hebbian learning): entries that are both recalled and
-  used in the same session form weighted associations. Edges strengthen on
-  repeated co-occurrence and decay over time (#267 Phase 2).
-- Review queue: entries flagged for human review are tracked in a dedicated table.
-  Low-quality entries (quality_score < 0.2 after 10+ recalls) are auto-flagged
-  for retirement review (#267 Phase 2).
-- `agenr review` command to list, dismiss, or retire flagged entries.
-- `agenr edges` command to inspect co-recall edges.
-- Co-recall edge statistics and review queue summary in `agenr health` output.
-## 0.9.7 - 2026-02-27
-### Fixed
-- `agenr init` no longer pins MCP config to a version-specific path. It now
-  resolves the `agenr` shim or binary on PATH so upgrades take effect
-  automatically (#294).
-## 0.9.6 - 2026-02-27
-### Added
-- `--around <date>` flag for recall: shifts recency scoring to peak at a target
-  date instead of now. Entries closer to the target date rank higher regardless
-  of whether they are older or newer than it (#189).
-- `--around-radius <days>` flag: controls the window width for `--around` queries
-  (default: 14 days). Also auto-sets since/until bounds when not explicitly
-  provided.
-- `gaussianRecency` scoring function for temporal targeting.
-- `agenr_recall` native tool now exposes `around` and `aroundRadius` parameters
-  for temporal targeting in mid-session recall.
-- MCP server `agenr_recall` tool now exposes `around` and `aroundRadius`
-  parameters for temporal targeting.
-## 0.9.5 - 2026-02-27
-### Added
-- Feedback-driven recall scoring (#267 Phase 1): recalled entries are tracked per
-  session and compared against agent responses at session end. Entries that are
-  used get quality score boosts; unused entries drift slightly downward.
-- Correction signal: if the agent stores a contradicting entry via agenr_store
-  during the session, the recalled entry that was corrected gets a strong
-  negative signal (quality score drops toward 0).
-- Rolling quality_score (0-1) per entry, integrated into recall ranking formula.
-  Consistently useful entries rank higher over time.
-- Entry-type quality floor: facts and preferences cannot drop below 0.35 to
-  prevent background-context entries from being unfairly penalized.
-- Auto-strengthen: entries reaching recall count milestones (3, 10, 25) get
-  importance bumped by 1 (capped at 9, never auto-promotes to 10).
-- Quality score distribution in `agenr health` output.
-## 0.9.4 (2026-02-27)
-### Changed
-- refactor: removed top-level `model` config field; `models` is now required with all four task keys (extraction, claimExtraction, contradictionJudge, handoffSummary) always explicit (#277)
-- `resolveModelForTask` simplified to direct lookup (no fallback chain)
-- `isCompleteConfig` now checks for complete `models` instead of top-level `model`
-- Old configs with top-level `model` auto-upgrade on read (value populates all task models)
-- `config set model <value>` removed; use `config set models.extraction <value>` etc.
-## 0.9.3 (2026-02-26)
-### Added
-- feat: `config set` now supports per-task model overrides via dot-path keys (for example, `models.extraction`, `models.claimExtraction`) (#276)
-- Set value to `default` to remove an override and fall back to the global model
-- Schema: subject_entity, subject_attribute, subject_key, claim_predicate, claim_object, claim_confidence columns on entries table (#266)
-- Schema: conflict_log table for contradiction audit trail (#266)
-- Schema: idx_entries_subject_key partial index (#266)
-- SubjectIndex: in-memory subject key index with lazy initialization (#266)
-- ConflictLogEntry type definition (#266)
-- Claim extraction: dedicated LLM call extracts structured claims (subject/predicate/object) from entries at store time (#266)
-- Config: per-task model configuration via config.models (#266)
-- Setup: optional advanced per-task model selection in setup wizard (#266)
-- resolveModelForTask() helper for consistent model resolution (#266)
-- Tuned claim extraction prompt for entity normalization and reduced false no_claim results (#266)
-- Fixed benchmark fixtures for edge cases (#266)
-- Tuned claim extraction prompt: short-claim guidance, no-claim edge cases, entity hint from subject field (#266)
-- Fixed benchmark fixtures: fact-breed alt values, decision-release-strategy realistic object (#266)
-- Added alternate expected values to claim scorer for entity, attribute, predicate, object (#266)
-- Expanded predicate equivalence groups and soft matches (#266)
-- Removed personal information from claim extraction prompt and benchmark fixtures (#266)
-- Added contradiction detection core: LLM judge classifies entry pairs as supersedes/contradicts/coexists/unrelated (#266)
-- Added type-specific resolution: auto-supersede for facts and preferences, flag decisions and lessons for review (#266)
-- Added conflict log for tracking detected contradictions and their resolutions (#266)
-- Added contradiction judge benchmark: fixtures, scorer, and CLI flag --judge for regression testing the LLM judge (#266)
-- Rewrote contradiction judge prompt: singular vs additive attribute heuristic, shorter and more focused for nano (#266)
-- Tuned judge benchmark fixtures for clarity (#266)
-- Fixed contradiction judge prompt regression: reverted temperature setting, restored original prompt structure with targeted additive-attribute guidance (#266)
-- Added alternate accepted relations to ambiguous judge benchmark fixtures: sup-diet, sup-storage, edge-event-immutable, edge-similar-different (#266)
-- Integrated contradiction detection into store pipeline: runs after claim extraction on ADD decisions, resolves conflicts with type-specific rules (#266)
-- Contradiction detection is enabled by default for all store paths (watcher, plugin, CLI, ingest) (#266)
-- Added production logging for claim extraction, contradiction detection, and conflict resolution (#266)
-- Added `agenr backfill-claims` command to extract claims for existing entries, enabling subject index and contradiction detection on older knowledge (#266)
-- Added `agenr conflicts` command: local web UI for reviewing and resolving detected contradictions (#266, seeds #171)
-- Init wizard: per-task model selection for extraction, claim extraction, contradiction judge, and handoff summary (#266)
-### Fixed
-- fix: CLI banner now displays the current agenr version (#278)
-- fix: setup and init wizards now write explicitly selected task models even when they match defaults (#275)
-- fix: resolveConflict now reads autoSupersedeConfidence from config instead of hardcoding 0.85 (#275)
-- fix: claim fields explicitly propagated through mutation pipeline (#275)
-- fix: supersede + insert wrapped in transaction for online-dedup path (#275)
-- fix: entity names with slashes sanitized during claim extraction (#275)
-- fix: LLM judge errors now logged via console.warn instead of silent fallback (#275)
-- fix: entity alias resolution no longer depends on single-entity heuristic (#275)
-- Critical: conflicts UI "keep-new"/"keep-old" resolution was retiring the wrong entry (swarm review)
-- Contradiction detection: cap subject-index candidates to maxCandidates, sort by recency
-- Contradiction detection: always run both subject-index and embedding search (removed hardcoded < 3 gate)
-- Contradiction detection: parallelize classifyConflict LLM calls via Promise.all
-- Contradiction detection: high-confidence supersession with lower importance now flags for review instead of silent coexist
-- Contradiction detection: lowered default similarity threshold from 0.72 to 0.55 (matches real contradiction scores)
-- Contradiction detection: entity hint injection from DB for consistent claim extraction across sessions
-- Contradiction detection: fuzzy attribute matching fallback in subject index
-- Contradiction detection: cross-entity lookup for same-attribute conflicts across entity aliases
-- Subject index rebuild is now atomic (swap instead of clear-then-populate)
-- Conflicts UI: request body size limit (64KB), auth token on POST endpoints, safe browser open
-- Extracted shared LLM helpers (clampConfidence, resolveModelForLlmClient, extractToolCallArgs) to src/db/llm-helpers.ts
-- Removed unnecessary Float32Array conversions in contradiction detection pipeline
-- Replaced __pendingConflicts side-channel with scoped Map
-- Init wizard: change model without re-running auth setup (#275)
-## 0.9.2 (2026-02-26)
-### Fixed
-- fix(consolidate): fragmented clustering produced duplicate canonical entries instead of a single winner (#249)
-  - Phase 1 now over-fetches neighbors (3x) when type-filtered to preserve same-type neighborhood coverage
-  - Added a new Phase 3 post-merge dedup pass to merge near-duplicate canonical entries created in the same run
-  - Phase 3 disables idempotency and only processes clusters that include entries created during the current run
-## 0.9.1 (2026-02-26)
-### Changed
-- Renamed `agenr daemon` CLI command to `agenr watcher` - "watcher" better describes what it does
-- `agenr daemon` still works as a hidden compatibility command
-- Updated user-facing command output to say "watcher" instead of "daemon"
-### Internal
-- Renamed `src/commands/daemon.ts` to `src/commands/watcher.ts`
-- Renamed daemon command interfaces and exports from `Daemon*`/`runDaemon*` to `Watcher*`/`runWatcher*`
-## 0.9.0 (2026-02-25)
-### Features
-- Interactive onboarding wizard for `agenr init` (#170)
-  - Auth setup with API key links and connection testing
-  - Embeddings API key connectivity check during setup
-  - Platform auto-detection for OpenClaw and Codex (macOS, Linux, Windows)
-  - OpenClaw directory confirmation with custom path support
-  - DB isolation prompt for non-default OpenClaw paths (shared vs isolated)
-  - Project slug derivation with interactive edit
-  - Reconfigure mode with "keep current" defaults
-  - Change tracking for auth, model, embeddings, directory, and DB path
-- Global projects map in `~/.agenr/config.json` for OpenClaw and Codex
-  - Keyed by directory path (multiple instances can share the same project slug)
-  - Stores platform, project slug, and optional dbPath per instance
-  - Per-repo platforms (Cursor, Claude Code, Windsurf) unchanged
-- Current config display shows all registered projects with directories and DB isolation status
-- `resolveProjectFromGlobalConfig()` helper for O(1) project lookup by directory
-- Shared DB warning when same project slug and same database across instances
-- Fix: OpenClaw sessionsDir correctly resolves to agents/main/sessions
-- OpenClaw plugin auto-install with gateway restart during wizard
-- Isolated DB path auto-written to OpenClaw plugin config (no manual editing)
-- Session file scanner with recursive discovery, mtime filtering, size totals
-- Cost estimation before ingest using model pricing from @mariozechner/pi-ai
-- "Recent" ingest passes only last-7-day file paths; "full" uses directory glob
-- Bulk ingest integration (--workers 10 --concurrency 1 --whole-file)
-- Post-ingest consolidation prompt (merges near-duplicates from bulk ingest)
-- Watcher daemon setup on macOS with launchd (120s interval)
-- Re-ingest flow on model/auth change: stops watcher, resets DB, re-ingests
-- Expanded setup summary with plugin/ingest/consolidate/watcher status
-- Next steps section for skipped or failed wizard steps
-### Changed
-- Refactored setup.ts: extracted `runSetupCore()` for programmatic use
-- Subscription auth methods moved to "Advanced options" submenu
-- Default recommended model changed to gpt-4.1-mini
-- Non-interactive init behavior preserved when CLI flags are provided
-- Skip .gitignore writes for OpenClaw and Codex (not git repos)
-### Fixed
-- Test isolation: init wizard tests use isolated config path via `AGENR_CONFIG_PATH`
-- fix(init): `installOpenClawPlugin` no longer forces the `OPENCLAW_HOME`
-  environment variable on OpenClaw CLI calls, preventing production config
-  overwrites when targeting a non-default directory (e.g. sandbox)
-- fix(init): wizard now adds `"agenr"` to `plugins.allow` in the target
-  OpenClaw config, ensuring the plugin is explicitly trusted and suppressing
-  the auto-load warning
-## [0.8.40] - 2026-02-25
-### Added
-- New `agenr benchmark` CLI command to run extraction against benchmark session
-  fixtures, score results against rubric JSON, and report per-session plus overall
-  metrics (recall, partial recall, precision proxy, composite, pass rate)
-- New benchmark scoring engine in `src/benchmark/scorer.ts` with continuous
-  rule scoring, entry claiming, regex-based must-skip checks, count/importance
-  gates, and composite score penalties
-- New benchmark types in `src/benchmark/types.ts` and JSON output schema with
-  reproducibility metadata (`prompt_hash`, `fixture_hash`, model/version/runs)
-- Multi-run benchmark aggregation with mean/min/stdev composite reporting and
-  pass-rate tracking
-- New scorer regression test suite (`src/benchmark/scorer.test.ts`) covering
-  perfect pass, partial credit paths, regex fallback, count and ceiling penalties,
-  case-insensitive matching, entry-claiming, and rule specificity ordering
-### Fixed
-- Fix: benchmark chunk text now joins messages with newline separator instead of empty string
-- Fix: --user-only on ingest now yields zero chunks when no user messages found (was falling back to full content)
-- Fix: added --user-only and --context options to CLI docs
-### Changed
-- Extraction pipeline now accepts optional `temperature`, `logDir`, and `logAll`
-  parameters so benchmark runs can force deterministic temperature (`0`) and
-  capture per-chunk LLM request/response debug logs
-- Extraction prompt: strengthened skip-by-default opening, added EPHEMERAL vs DURABLE classification gate
-- Extraction prompt: shifted default importance from 7 to 6, recalibrated Score 6/7 descriptions across all calibration blocks
-- Extraction prompt: added 6 new anti-patterns (items 11-16) for concrete noise rejection (typos, version bumps, publish events, file observations, tautological facts)
-- Extraction prompt: added 6 new SKIP examples in few-shot section to rebalance extract:skip example ratio
-- Extraction prompt: added anti-consolidation instruction for personal facts with per-fact granularity and subject naming guidance
-- Extraction prompt: added project convention decisions to DURABLE classification list
-- Extraction prompt: improved anti-pattern #16 with extractable vs skip examples for file observations
-- Extraction prompt: added anti-pattern #17 for release-engineering session noise
-- Extraction prompt: whole-file calibration now distinguishes technical (consolidate) vs personal (granular) entries
-- Benchmark: now runs in whole-file mode to match real ingest behavior
-- Benchmark: relaxed rubric content_contains matching for paraphrase resilience
-- Extraction prompt: rewrote whole-file calibration with 3-step process (session triage, user message priority, constrained extraction)
-- Extraction prompt: added importance ceiling of 8 for coding/technical sessions, tightened inflation threshold from 30% to 20%
-- Extraction prompt: added anti-pattern #18 for agent capability/tool setup announcements
-- Extraction prompt: file contents read by agent during startup/exploration explicitly distinguished from user speech
-## [0.8.39] - 2025-02-25
-### Features
-- **ingest:** Add LLM debug logging via `--log-dir`, `--log-all`, and `--sample-rate` flags (#238)
-  - Captures raw LLM prompt input and response output per chunk
-  - Logs dedup before/after entry lists
-  - Best-effort writes, never blocks extraction
-  - Sampling defaults to 1-in-10 files; use `--log-all` for full capture
-### Tests
-- Add tests for ingest debug logging: file creation, sampling, dedup logs, graceful failure on bad logDir
-## [0.8.38] - 2026-02-24
-### Fixed
-- Handoff log line now shows model ID string instead of [object Object]
-- Upgraded handoff retirement and browse debug logs from logger.debug to
-  console.log for production visibility
-- Handoff transcript now strips OpenClaw/agenr injected context (memory
-  blocks, signals, conversation metadata, timestamp prefixes) before
-  sending to the LLM, preventing the summarizer from summarizing its own
-  metadata (#235)
-### Added
-- Opt-in `handoff.includeBackground` config flag for handoff summarizer: when
-  enabled, prior session messages are included as background context with strong
-  section headers so the LLM can orient without blending stale facts into the
-  current session summary (#235)
-- New system prompt variant with anti-hallucination instructions for background
-  context mode ("BACKGROUND CONTEXT (DO NOT SUMMARIZE)" / "SUMMARIZE THIS
-  SESSION ONLY" section headers)
-- Optional `handoff.logDir` config: when set, writes the full LLM request
-  transcript and response to files for prompt tuning and debugging (#235)
-### Changed
-- Default handoff behavior unchanged: current session only, no prior messages
-### Removed
-- All temporary [AGENR-PROBE] debug logging from openclaw-plugin (replaced with
-  clean operational logs where needed)
-## [0.8.37] - 2026-02-24
-### Fixed
-- openclaw-plugin: await runHandoffForSession in session_start handler instead of void
-  fire-and-forget; webchat /new goes through sessions.reset RPC which does not trigger
-  before_reset, so session_start is the only hook that fires on that path - making it
-  void meant the LLM summary was always dropped (closes #232)
-## [0.8.36] - 2026-02-24
-### Fixed
-- openclaw-plugin: await LLM upgrade in runHandoffForSession instead of fire-and-forget; the gateway awaits before_reset so the LLM call can and should block until the summary is stored (closes #230)
-- openclaw-plugin: raise Phase 1 fallback store success/failure logs from logger.debug to console.log for production visibility (extends #223)
-## [0.8.34] - 2026-02-24
-### Fixed
-- rebuild dist - --force flag missing from 0.8.33 artifact (stale build)
-## [0.8.33] - 2026-02-24
-### Fixed
-- retire --force flag skips confirmation prompts for programmatic retirement (#225)
-- runRetireTool now passes --force so high-importance handoff entries (imp >= 8) are properly retired (#225)
-## [0.8.32] - 2026-02-24
-### Fixed
-- summarizeSessionForHandoff: changed logger.debug to console.log for all skip-reason
-  and LLM call logging so output is visible in gateway.err.log at production log level
-  (closes #223)
-## [0.8.31] - 2026-02-24
-### Fixed
-- plugin: LLM handoff now logs transcript size, model, and summary length before/after the LLM call for observability (#221)
-- plugin: fallback handoff retirement now matches by subject+importance+tag only, dropping fragile content equality check that left stale fallback entries alongside LLM summaries (#221)
-## [0.8.30] - 2026-02-24
-### Fixed
-- Rebuild dist to include Phase 1A handoff trigger missing from 0.8.29.
-  The dist/ artifact in 0.8.29 was stale - runHandoffForSession() call added
-  in the 0.8.29 source (commit e3222c5) was not present in the published
-  package. No logic changes. Build-only fix.
-## [0.8.29] - 2026-02-24
-### Added
-- Phase 1A now triggers LLM handoff summarization (fire-and-forget) when a
-  previous session file is found at session start. This is a reliable
-  fallback for the before_reset/command hook paths that do not fire in
-  current OpenClaw versions due to a dispatch gap (openclaw/openclaw#25074).
-  The existing before_reset and command hook paths are unchanged.
-### Changed
-- runHandoffForSession source type now includes "session_start"
-### Tests
-- 5 new tests in session-handoff.test.ts covering the Phase 1A handoff
-  trigger path
-## [0.8.28] - 2026-02-24
-### Fixed
-- command hook fires before_reset handoff logic for RPC-triggered /new (closes #210)
-  - before_reset hook only fires in the in-process auto-reply path; sessions.reset RPC
-    path only fires the command hook
-  - new command hook handler reads and parses the session JSONL directly, then runs
-    the same Phase 1 fallback store + Phase 2 LLM upgrade logic
-  - dedup guard (Set<sessionId>) prevents double-writes when both hooks fire in
-    auto-reply path
-### Added
-- [AGENR-PROBE] debug logging throughout command hook path for observability
-  (to be removed in a future cleanup pass)
-- readAndParseSessionJsonl() helper to parse JSONL session files line by line
-- runHandoffForSession() shared helper extracted from before_reset for reuse
-### Tests
-- 5 new tests for command hook handoff behavior in index.test.ts
-## [0.8.27] - 2026-02-24
-### Changed
-- Add stderr debug probes to openclaw plugin to diagnose before_reset hook dispatch issue
-- Probes: register() entry, hook registrations, session_start handler, before_reset handler entry and guard points
-## [0.8.26] - 2026-02-23
-### Added
-- feat(openclaw-plugin): LLM-summarized multi-session handoff entry at
-  before_reset (#199). Builds a merged transcript from the current session
-  (via event.messages) and the most recent prior .reset.* file (if under
-  24h old), labeled with timestamps and surface (webchat/telegram/etc.)
-  from sessions.json. Summarizes via the configured LLM (from agenr config,
-  default gpt-4.1-nano) into a structured four-section handoff. Falls back
-  to raw text extraction on any failure. Handler is now properly async with
-  awaited store call.
-### Changed
-- fix(openclaw-plugin): before_reset handoff store now uses a two-phase flow to
-  avoid race windows with detached hook execution. Phase 1 stores fallback
-  exchange text immediately at importance 9, then Phase 2 asynchronously upgrades
-  to an importance 10 LLM summary when available.
-- fix(openclaw-plugin): when the LLM upgrade succeeds, fallback handoff entries
-  are looked up and retired (subject/content/tag match) before storing the
-  upgraded summary, preventing stale fallback carryover.
-- fix(openclaw-plugin): prior reset session surface lookup now maps
-  `*.jsonl.reset.*` files back to base `*.jsonl` paths via getBaseSessionPath,
-  and unknown surface fallback now uses "prior session" to improve prompt context.
-- fix(openclaw-plugin): capTranscriptLength now enforces a hard length cap even
-  when the current session alone exceeds 8000 chars.
-- chore(openclaw-plugin): added before_reset debug logs for missing sessionFile,
-  missing apiKey, and pre-LLM invocation traceability.
-### Tests
-- test(openclaw-plugin): added coverage for getBaseSessionPath, reset-path surface
-  resolution, capTranscriptLength edge cases (prior-only overflow, current-only
-  overflow, under-cap passthrough), missing-apiKey debug behavior, budget tail
-  slicing assertions, and buildMergedTranscript ordering.
-- test(openclaw-plugin): updated before_reset integration coverage for two-phase
-  fallback-plus-upgrade storage behavior and no-sessionFile debug path.
-## [0.8.25] - 2026-02-23
-### Changed
-- fix(openclaw-plugin): strip OpenClaw conversation metadata JSON blocks from
-  extractRecentTurns() output (issue #208)
-- fix(openclaw-plugin): extractRecentTurns() now reads JSONL bottom-up, ensuring
-  most recent turns are always included when maxTurns budget is exceeded
-- fix(openclaw-plugin): increase RECENT_TURN_MAX_CHARS from 150 to 300
-- fix(openclaw-plugin): normalize internal whitespace in extracted turns (collapse
-  newlines/spaces to single space) to keep " | " separator clean
-### Tests
-- test(openclaw-plugin): added tests for metadata stripping, bottom-up reading,
-  and whitespace normalization in extractRecentTurns()
-## [0.8.22] - 2026-02-23
-### Changed
-- feat(openclaw-plugin): replace thin-prompt/stash session-start recall with three-phase
-  cross-session context injection (issue #205)
-  - Phase 1A (always): reads last 7 user+assistant turns from most recently modified
-    session JSONL file in ~/.openclaw/agents/<agentId>/sessions/
-  - Phase 1B (always): runs agenr recall --browse --since 1d --limit 20, picks up
-    importance:10 handoff entry written at /new time
-  - Phase 2 (conditional): semantic recall seeded from Phase 1A turns + first user
-    message if >= 5 words; results deduplicated against Phase 1B by entry id
-  - Handoff entries retired after first use (one-time read)
-- feat(openclaw-plugin): added findPreviousSessionFile, extractRecentTurns,
-  buildSemanticSeed to src/openclaw-plugin/session-query.ts
-- feat(openclaw-plugin): findPreviousSessionFile uses parallel stat() calls for
-  performance on large sessions dirs
-- feat(openclaw-plugin): sessionsDir configurable via AgenrPluginConfig.sessionsDir;
-  defaults to ~/.openclaw/agents/<agentId>/sessions using ctx.agentId with "main"
-  fallback
-- feat(openclaw-plugin): RunRecallOptions extended with limit?: number to support
-  --limit flag in browse recall
-- refactor(openclaw-plugin): removed isThinPrompt, resolveSessionQuery,
-  sessionTopicStash, stashSessionTopic, shouldStashTopic, sweepInterval, clearStash,
-  readLatestArchivedUserMessages
-### Tests
-- test(openclaw-plugin): added unit tests for findPreviousSessionFile, extractRecentTurns,
-  buildSemanticSeed in session-query.test.ts
-- test(openclaw-plugin): added integration tests for three-phase before_prompt_build flow,
-  Phase 2 deduplication, and isFirstInSession guard in index.test.ts
-- test(openclaw-plugin): added second-message guard test (isFirstInSession prevents
-  re-injection on subsequent messages in same session)
-## [0.8.19] - 2026-02-23
-### Changed
-- feat(openclaw-plugin): `before_reset` handoff store content now uses a structured recent exchange summary (`U:`/`A:` turns) instead of user-only fragments, improving cross-session handoff clarity while keeping stash-based recall seeding unchanged (issue #196)
-- feat(openclaw-plugin): added `extractLastExchangeText(messages, maxTurns?)` in `src/openclaw-plugin/session-query.ts` to collect the last 5 user-turn windows with interleaved assistant context, per-turn truncation (200 chars), and chronological `U:`/`A:` formatting
-- chore(openclaw-plugin): exported `SESSION_QUERY_LOOKBACK` from `session-query.ts` for direct test assertions
-### Tests
-- test(openclaw-plugin): added `extractLastExchangeText` coverage for empty input, U/A formatting, per-message truncation, 5-user-turn collection window, and no-extractable-content behavior
-- test(openclaw-plugin): updated handoff-store integration assertion to verify stored content includes exchange context prefixes (`U:`/`A:`) rather than flattened user-only text
-## [0.8.18] - 2026-02-23
-### Changed
-- feat(openclaw-plugin): `before_prompt_build` now uses browse-mode recall (`--browse --since 1d`) for cold session starts where no stash/query seed is available, and keeps embed/query recall for substantive or stash-seeded starts (issue #196)
-- chore(openclaw-plugin): removed archived-session fallback query synthesis from session-start recall seeding, simplifying thin-prompt startup behavior to browse vs stash/embed paths only (issue #196)
-- feat(openclaw-plugin): `before_reset` now stores a fire-and-forget `event` memory entry (`session handoff ...`) with the latest user context to support next-session handoff continuity (issue #196)
-- feat(openclaw-plugin): session-start browse results now auto-retire surfaced handoff entries after context injection to avoid repeated carryover (`reason: consumed at session start`) (issue #196)
-- feat(openclaw-plugin): `runRecall` in `src/openclaw-plugin/recall.ts` now accepts an optional context options object and maps browse context to CLI browse args while preserving existing default/session-start call behavior for unchanged callers (issue #196)
-### Tests
-- test(openclaw-plugin): updated query-seeding coverage for new cold-start browse path and removed archive-fallback-specific expectations (issue #196)
-- test(openclaw-plugin): added regression coverage for before-reset handoff storage and session-start handoff auto-retire success, non-handoff skip, missing-id skip, and retire-failure resilience (issue #196)
-- test(openclaw-plugin): added plugin recall browse-args unit coverage to assert `runRecall` browse flag construction and query omission behavior (issue #196)
-## [0.8.17] - 2026-02-23
-### Changed
-- chore: rebuild dist to include browse mode CLI flag inadvertently omitted from 0.8.16 publish
-## [0.8.16] - 2026-02-23
-### Added
-- feat(recall): new temporal browse mode for recall via `agenr recall --browse` and MCP `agenr_recall` with `context="browse"` (issue #190)
-- docs(recall): added `docs/usage/recall.md` with browse-mode CLI and MCP usage examples
-### Changed
-- recall browse mode now uses a SQL-only path that requires no query and performs zero embedding/OpenAI API calls
-- browse mode does not increment recall metadata (`recall_count`, `last_recalled_at`, `recall_intervals`)
-- OpenClaw plugin tool wiring now maps `context="browse"` to the CLI `--browse` flag (and omits query/context positional args appropriately)
-### Tests
-- test(recall): added browse-mode coverage in DB recall, CLI command recall, MCP server recall, and OpenClaw plugin recall tool argument wiring
-## [0.8.15] - 2026-02-23
-### Fixed
-- fix(consolidate): switch GROUP_CONCAT separator from comma to pipe in buildClusters to prevent silent tag corruption when tag values contain commas (issue #155)
-- fix(consolidate): Tier 1 near-exact duplicate merge now preserves the highest importance across merged entries by raising the keeper's `importance` floor to the group max (issue #156)
-- fix(consolidate): Tier 1 near-exact duplicate merge now preserves oldest provenance by inheriting the oldest `created_at` across the merge group into the keeper (issue #156)
-- test(consolidate): new cluster.test.ts with pipe-separator roundtrip and comma-in-tag regression coverage (issue #155)
-- test(consolidate): added merge coverage for tag union transfer, keeper importance floor, and keeper `created_at` inheritance in rules consolidation tests (issue #156)
-## [0.8.13] - 2026-02-23
-### Fixed
-- fix(openclaw-plugin): session-start recall now falls back to reading the most recent archived OpenClaw session file (`*.reset.*`) when webchat `/new` bypasses `before_reset`. If stash-based seeding is unavailable and the opening prompt is short (< 40 characters), recall query text is built from the last 3 user messages in the archived session.
-## [0.8.12]
-### Fixed
-- fix(openclaw-plugin): strip OpenClaw metadata envelope from `before_prompt_build` prompts before session-start recall query resolution; query seeding now uses the user message after the final timestamp marker instead of prepended metadata, with last-match handling for repeated timestamp patterns
-## [0.8.11]
-### Changed
-- feat(plugin): resolveSessionQuery now blends the before_reset stash with the live prompt for session-start recall; when a stash exists and the live prompt is high-signal (>=40 chars / >=5 words), the query is stash + live prompt; when the live prompt is low-signal (common short opener like "did the plugin fire?"), the stash wins outright; no-stash behavior is unchanged (issue #181)
-## [0.8.10]
-### Added
-- feat(plugin): session-start recall now uses the inbound user message as the recall query seed, enabling vector similarity scoring instead of pure recency ranking; entries relevant to the actual conversation topic now surface at session start (issues #177, #178)
-- feat(plugin): before_reset hook captures the last 3 substantive user messages before a /new reset and stashes them in memory; the next session-start recall uses the stash as its query seed when the opening prompt is low-signal (issues #177, #178)
-- feat(plugin): session topic stash eviction sweep runs every 5 minutes; TTL is 1 hour
-### Changed
-- chore(plugin): session-start recall timeout increased from 5s to 10s to accommodate the embedding API call now required when a query is present
-- chore(plugin): session topic stash requires a minimum of 40 characters and 5 words to filter out low-signal conversational closers
-- refactor(plugin): session query helpers extracted from index.ts into session-query.ts
-### Fixed
-- fix(plugin): session-start recall no longer skips vector similarity scoring when a query is available; previously RecallQuery.text was always undefined at session start (issue #177)
-## [0.8.9]
-### Added
-- feat(extractor): broadened extraction prompt to capture personal user context (health, diet, family, occupation, location, values) even from casual or passing mentions; added 6-month durability test heuristic to distinguish durable personal facts from transient states (issue #173)
-- feat(extractor): new few-shot examples for RELATIONSHIP, PREFERENCE, FACT, and EVENT types covering personal context scenarios with scoring rationale
-### Fixed
-- fix(ingest): suppress redundant whole-file ignored-params warning; now fires once per ingest run via shared ExtractRunOnceFlags object instead of once per file (issue #168)
-- fix(ingest): silence SQLITE_ERROR vector-index-not-found pre-fetch error during bulk ingest when vector index is intentionally absent; all other pre-fetch errors still log (issue #168)
-- fix(ingest): detect .jsonl.reset.TIMESTAMP session files as JSONL adapter by extending suffix-stripping regex to handle both .deleted and .reset suffixes (issue #169)
-- fix(consolidate): added merge system prompt constraint that expiry must be exactly permanent or temporary, never a date or timestamp; complements existing runtime fallback (issue #172)
-- fix(daemon): daemon install plist now uses the runtime CLI path resolved from argv[1] via the injected argvFn, preventing hardcoded npm global paths from breaking pnpm installs (issue #174)
-## [0.8.8]
-### Fixed
-- fix(ingest): whole-file mode now uses model-aware output token budgets for context-fit checks and whole-file extraction calls, including support for gpt-5-nano, gpt-5.2-codex, and gpt-5.3-codex (issue #166)
-- fix(ingest): removed whole-file 100-entry truncation; extracted entries are no longer discarded and now only emit a verbose warning when entry count exceeds 500 before downstream dedup (issue #166)
-## [0.8.7]
-### Fixed
-- fix(ingest): detect `.jsonl.deleted.<timestamp>` session files as JSONL by stripping the `.deleted.*` suffix before extension lookup, restoring OpenClaw/Codex adapter routing instead of silent text fallback (issues #160, #163)
-- fix(ingest): pass the resolved ingest `verbose` flag into extraction calls so whole-file diagnostics are emitted with `--verbose`, including unknown-model context-window warnings and whole-file retry/fallback logs (issues #161, #162)
-- fix(ingest): emit an explicit `[whole-file]` verbose warning when auto mode receives zero parsed messages and falls back to chunked extraction (issue #163)
-## [0.8.5]
-### Added
-- feat(ingest): `--bulk` mode for large-scale ingests; drops FTS triggers and the vector index before writing, uses `batchSize=500` with `BEGIN IMMEDIATE` transactions per batch, and rebuilds FTS content + vector index in a single pass after all entries are written (issue #135)
-- feat(ingest): MinHash dedup (`src/db/minhash.ts`) - 128-hash signatures using 5-gram shingles and FNV32 with pre-seeded arrays; two-layer dedup combines an in-memory norm-content-hash Set (cross-batch per run) with per-candidate exact-hash + MinHash scan; new `norm_content_hash` and `minhash_sig` columns added via schema migration with automatic backfill
-- feat(ingest): crash recovery for interrupted bulk ingests; `_meta` flag (`bulk_ingest_state`) is set before teardown and cleared only after REINDEX succeeds; `checkAndRecoverBulkIngest()` detects an interrupted run on next startup, rebuilds missing FTS triggers and/or vector index, runs `PRAGMA integrity_check`, and clears the flag (issue #135)
-### Fixed
-- fix(bulk): `seenNormHashes` was updated inside the transaction before `COMMIT`, causing a rollback to poison the in-memory Set and silently skip affected entries on retry; fixed by moving the update to after `COMMIT` using a local `committedHashes` Set
-- fix(bulk): `bufferToMinhashSig` threw an unhandled `RangeError` on any `minhash_sig` blob that was not exactly 512 bytes (corrupt row, partial write, or schema version mismatch); fixed with a byte-length guard before conversion
-- fix(bulk): `rebuildVectorIndex` DROP+CREATE fallback was not atomic; if `CREATE INDEX` failed after `DROP` succeeded the vector index was permanently absent until recovery ran; fixed by wrapping the fallback in `BEGIN IMMEDIATE`
-- fix(bulk): backfill of `norm_content_hash` and `minhash_sig` ran unconditionally on every `agenr ingest` invocation; gated on `bulkMode` to avoid unnecessary write transactions on non-bulk runs
-- fix(bulk): backfill cap (5000 rows) was hit silently; warns to stderr when more rows remain so the user knows to run ingest again
-- fix(minhash): short-text MinHash fallback used raw `text` instead of normalized `chars`, causing near-duplicate short strings differing only in whitespace to score Jaccard ~0
-- fix(bulk): `getBulkIngestMeta` silently swallowed JSON parse errors, disabling crash recovery without any signal; now warns to stderr
-## [0.8.4]
-### Added
-- feat(openclaw-plugin): project scoping via config.project in openclaw.json; all session-start recall and store calls are scoped to the configured project when set (issue #71)
-- feat(openclaw-plugin): optional subject field in agenr_store schema; agents can now pass an explicit subject per entry rather than always relying on inference (issue #86)
-- feat(openclaw-plugin): platform normalization and source_file format warnings in runStoreTool; platform is inferred from source when missing, invalid values are warned and dropped, freeform source strings trigger a format hint (issue #145)
-### Fixed
-- fix(recall): cap final recall scores at 1.0 after FTS bonus; Math.min(1.0) applied in scoreEntryWithBreakdown (issue #64)
-- fix(mcp): correct misleading retire tool message; retired entries are hidden from all recall paths (issue #143)
-- fix(mcp): inferSubject now splits on punctuation followed by whitespace only, preventing truncation on file path periods (e.g. .ts, .js)
-- fix(openclaw-plugin): subject inference in runStoreTool processedEntries now uses the same safe regex as inferSubject
-### Changed
-- chore(openclaw-plugin): remove openclaw.plugin.json version field; package.json is now the single source of truth (issue #91)
-- chore(openclaw-plugin): remove formatRecallAsSummary dead code; writeAgenrMd was already removed, this cleans up the last orphaned export (issue #77)
-## [0.8.3]
-### Fixed
-- setup: custom model aliases (gpt-4.1-nano, gpt-4.1-mini) now appear in
-  the model picker when using openai-api-key auth (issue #136)
-- setup: revert hint null-normalization regression (details?.name ?? undefined)
-- setup: warn user when empty credential is entered during key rotation
-- setup: note that updated credential is saved but not re-validated
-- setup: openai-api-key now prioritizes gpt-4.1-nano, gpt-4.1-mini, and
-  gpt-5-nano in preferred model selection, and adds gpt-5-nano alias
-  resolution for OpenAI model lookup
-- setup: reconfigure now offers to update stored API key even when existing
-  credential is valid (issue #13)
-- embeddings: EmbeddingCache is now bounded with LRU eviction (default
-  max 5000 entries) to prevent unbounded heap growth during large ingests
-  (issue #57)
-- embeddings: EmbeddingCache constructor throws RangeError for maxSize < 1
-## [0.8.2] - 2026-02-22
-### Added
-- Per-platform extraction prompt addenda for codex/claude-code (code session rules with inline confidence caps) and plaud (meeting transcript rules)
-- plaud added to KNOWLEDGE_PLATFORMS and normalizeKnowledgePlatform
-- applyConfidenceCap now enforces importance cap for codex and claude-code platforms
-- All CLI --platform help text updated to include plaud
-## [0.8.1] - 2026-02-22
-### Fixed
-- fix(openclaw-plugin): sync plugin version in openclaw.plugin.json to match npm package (was stale at 0.7.21, now 0.8.1)
-## [0.8.0] - 2026-02-22
-### Added
-- feat(ingest): whole-file extraction mode for transcript ingest. `extractKnowledgeFromChunks` now supports `wholeFile: "auto" | "force" | "never"` with automatic fit detection against known model context windows and single-call extraction when a file fits.
-- feat(ingest): new `--whole-file` and `--chunk` ingest flags to force whole-file or chunked extraction mode.
-- feat(ingest): new whole-file utilities in `src/ingest/whole-file.ts` for context-window detection, mode resolution, overlap-free message reconstruction, and hard-cap truncation.
-### Changed
-- ingest: whole-file mode now reconstructs extraction text from parsed `messages` via `renderTranscriptLine` instead of joining chunk text, avoiding overlap duplication at chunk boundaries.
-- extractor: whole-file mode now skips embedding pre-fetch and skips post-extraction LLM dedup, applies a 100-entry hard cap by importance, and retries failed whole-file extraction attempts before falling back to chunked mode.
-- watch: watcher calls now set `watchMode: true`, which enforces chunked extraction even if whole-file mode is requested.
-- mcp: ingest-style extraction now forwards parsed `messages` into extraction so whole-file mode can be resolved consistently.
-## [0.7.21] - 2026-02-21
-### Fixed
-- fix(openclaw-plugin): sync plugin version in openclaw.plugin.json to match npm package (was stale at 0.7.7, now 0.7.21)
-## [0.7.20] - 2026-02-21
-### Added
-- feat(models): add gpt-4.1-nano, gpt-4.1-mini, and gpt-4.1 aliases for OpenAI
-  provider; gpt-4.1-nano is now the recommended fast/cheap extraction model
-  (--model gpt-4.1-nano or agenr config set model gpt-4.1-nano) (#127)
-### Changed
-- perf(ingest): pre-batch embedding calls in storeEntries; all entries in a
-  write-queue batch are now embedded in a single API call instead of one call
-  per entry, cutting embedding API round-trips from O(n) to O(1) per batch
-  and reducing ingest wall-clock time proportionally to batch size (#127)
-## [0.7.19] - 2026-02-21
-### Fixed
-- fix(ingest): WriteQueue backpressure deadlock when processing large session files; raised default highWatermark from 500 to 2000 and added configurable backpressure timeout (default 120s) that surfaces a clear error instead of hanging forever (#125)
-### Added
-- feat(ingest): --queue-high-watermark and --queue-backpressure-timeout-ms CLI flags for tuning write queue behavior on large ingest jobs
-- feat(ingest): verbose mode now logs "[X/N] file -- starting" before extraction begins, eliminating the silent gap during large-file processing (#124)
-## [0.7.18] - 2026-02-21
-### Fixed
-- fix(lockfile): suppress false-positive "Another agenr process is writing" warning during multi-worker ingest; `isDbLocked` now returns false when the lock is held by the current process (#121)
-## [0.7.17] - 2026-02-21
-### Performance
-- perf(ingest): two-phase extract+write pipeline eliminates SQLite write-lock contention; extraction workers run in parallel while a single background writer drains entries in batched transactions (#107)
-- feat(ingest): add `--workers` flag (default 10) for file-level parallelism; previously hardcoded to 1
-- The write queue retries each write sub-batch once on transient failure (2s delay) before surfacing the error to the outer file-level retry loop. Use `--no-retry` to disable all retries including the inner write retry.
-### Changed
-- ingest: `entriesStored` now counts `added + superseded` (previously only `added`); superseded entries are written before the previous entry is marked superseded
-## [0.7.16] - 2026-02-21
-### Fixed
-- docs(skill): comprehensive SKILL.md refresh covering all four tools, full importance scale, confidence-aware extraction, store optional params (subject, scope, tags, project), retire and extract tool docs
-## [0.7.15] - 2026-02-21
-### Fixed
-- fix(openclaw-plugin): agenr_recall tool now correctly passes --until flag to CLI (was silently dropped)
-- docs(skill): document all agenr_recall parameters in SKILL.md (since, until, types, platform, project, limit, context)
-## [0.7.14] - 2026-02-21
-### Added
-- feat(recall): added `until` upper date bound to recall query filtering in CLI, MCP, and DB recall paths (`since` + `until` now define an inclusive window)
-### Changed
-- fix(recall): recency decay now anchors to the `until` ceiling for historical windows while freshness boost remains anchored to real query time
-- fix(recall): centralized `parseSinceToIso` in `src/utils/time.ts` and removed duplicate implementations from recall CLI and MCP server
-- fix(recall): added inverted date-range validation - recall now returns a descriptive error when `since > until` instead of returning an empty list
-- fix(recall): interim 3x candidate over-fetch under date bounds to improve in-window recall coverage until SQL-level date filtering is added
-- fix(recall): corrupt `created_at` values are now safely excluded under date-bound filters instead of leaking invalid rows into filtered recall
-## [0.7.13] - 2026-02-21
-### Fixed
-- fix(extractor): added platform-aware extraction system prompt builder (`buildExtractionSystemPrompt`) and OpenClaw confidence addendum for role-labeled transcript handling
-- fix(extractor): added `applyConfidenceCap` hard-cap enforcement for OpenClaw `unverified` entries so tagged claims cannot exceed importance 5
-- fix(extractor): threaded `platform` through `extractKnowledgeFromChunks` and call sites in ingest/watch flows so OpenClaw-specific confidence behavior applies during transcript ingestion
-- fix(extractor): added OpenClaw confidence few-shot examples to `SYSTEM_PROMPT` to distinguish hedged unverified claims from tool-verified claims
-## [0.7.12] - 2026-02-21
-### Fixed
-- fix(recall): retired entries now correctly excluded from all recall queries -- missing `AND retired = 0` filter added to recall.ts and session-start.ts
-- fix(consolidate): retired entries excluded from Tier-1 and clustering queries in rules.ts and cluster.ts
-- fix(recall): consolidated duplicate parseSince implementations into shared utility supporting h/d/m/y units
-## [0.7.11] - 2026-02-20
-### Fixed
-- fix(init): agenr MCP command now resolved via process.execPath (node binary) and process.argv[1] (CLI entry script) instead of which lookup -- eliminates PATH failures in GUI clients like Codex that launch with restricted environments
-## [0.7.10] - 2026-02-20
-### Fixed
-- fix(init): codex platform now writes MCP entry directly to ~/.codex/config.toml instead of .mcp.json (which Codex does not read)
-- fix(init): openclaw platform no longer writes a .mcp.json file (OpenClaw native plugin handles MCP registration via openclaw plugins install agenr)
-- fix(init): agenr binary path is now resolved at init time via which or PNPM_HOME fallback -- GUI clients that launch with a restricted PATH will now find the correct binary
-- fix(init): codex config.toml write is idempotent -- re-running init replaces the agenr line without duplicating it
-- docs: remove redundant Memory (agenr) AGENTS.md block from OPENCLAW.md -- OpenClaw plugin handles agent instruction injection automatically via the built-in skill
-## [0.7.9] - 2026-02-20
-### Fixed
-- fix(openclaw-plugin): moved session-start recall injection from before_agent_start to before_prompt_build -- recall now fires exactly once per session instead of twice due to OpenClaw calling before_agent_start twice (once for model-resolve where prependContext is discarded, once for prompt-build where it is used)
-## [0.7.8] - 2026-02-20
-### Fixed
-- fix(openclaw-plugin): session-start recall dedup now keys on sessionId instead of a shared seen-Set -- each new session (including after /new) correctly receives injected context instead of being silently skipped on the second run
-- fix(extractor): normalizeImportance now defaults to 7 instead of 5 -- aligns runtime default with schema declaration and coaching guidance
-## [0.7.7] - 2026-02-20
-### Fixed
-- fix(extractor): rewrote importance score calibration in SYSTEM_PROMPT -- per-score definitions (5-10) replace undifferentiated 8-10 band
-- fix(extractor): added signal-cost framing -- 8+ fires real-time cross-session alerts; prompt now uses this as conservative filter
-- fix(extractor): made score 7 the explicit default workhorse; 8+ now requires cross-session justification
-- fix(extractor): added dev-session-observations rule -- verified/tested/confirmed patterns cap at 6 unless result is surprising or breaking
-- fix(extractor): resolved conflict between dev-session cap and explicit memory request rule ("remember this" overrides cap)
-- fix(extractor): removed "verified again today" from score-8 pnpm example to avoid contradicting dev-session rule
-- fix(extractor): added NOT-8 negative examples alongside existing NOT-9 callouts
-- fix(extractor): added 3 non-developer few-shot examples (health at 8, personal at 7, preference at 6) to prevent domain bias
-- fix(extractor): lowered 8+ calibration cap from 30% to 20%
-## [0.7.6] - 2026-02-20
-### Fixed
-- fix(plugin): `agenr_recall` now passes query as a positional argument instead of unsupported `--query`
-- fix(plugin): `agenr_recall` now uses `--type` (singular) instead of invalid `--types`
-- fix(plugin): removed unsupported `--threshold` forwarding from `agenr_recall`; threshold has no direct CLI equivalent
-- fix(plugin): `agenr_store` now sends entries array directly on stdin and passes `platform`/`project` as CLI flags
-- fix(plugin): `agenr_store` now infers missing `subject` from `content` before CLI spawn, matching MCP server behavior
-- fix(plugin): `agenr_retire` now calls `agenr retire --id <entry_id>` instead of subject matching with UUIDs
-- fix(cli): `agenr retire` now supports `--id <id>` and enforces exactly one of subject or `--id`
-- fix(plugin): `agenr_extract` now uses a two-step flow for `store=true` (`extract --json` then `store`) and injects source metadata before storing
-- fix(cli): `agenr store` now accepts the `--aggressive` flag used by plugin dedup config forwarding
-## [0.7.5] - 2026-02-20
-### Changed
-- fix(plugin): raise default signalMinImportance from 7 to 8 - default-importance stores (importance 7) no longer trigger mid-session signal interrupts
-- fix(plugin): lower default maxPerSignal from 5 to 3 - smaller batches
-- fix(dedup): lower DEFAULT_DEDUP_THRESHOLD from 0.80 to 0.72 - entries with cosine similarity 0.72-0.80 now reach LLM review instead of being stored as duplicates
-- fix(extractor): increase MAX_PREFETCH_RESULTS from 3 to 5 and lower PREFETCH_SIMILARITY_THRESHOLD from 0.78 to 0.72
-- fix(extractor): increase PREFETCH_CANDIDATE_LIMIT from 10 to 15 for broader elaborative encoding candidates
-- fix(extractor): tighten extractor prompt to suppress near-variant entries already captured in DB
-- fix(extractor): recalibrate importance scoring anchors so routine verifications and test-pass observations default to 6-7; reserve 8+ for cross-session alert-worthy updates
-### Added
-- feat(plugin): signalCooldownMs config - minimum ms between signal batches per session (default: 30000)
-- feat(plugin): signalMaxPerSession config - max total signal batches per session lifetime (default: 10)
-- feat(plugin): signalMaxAgeSec config - only surface entries created within last N seconds (default: 300)
-- feat(dedup): dedup.aggressive config in ~/.agenr/config.json - lower thresholds and more candidate lookups for high-noise environments
-- feat(dedup): dedup.threshold config - manual override for LLM dedup similarity threshold
-## [0.7.4] - 2026-02-20
-### Added
-- feat(plugin): native agenr_recall, agenr_store, agenr_extract, agenr_retire tools registered via api.registerTool() in the OpenClaw plugin - tools now appear in the agent toolset alongside exec, browser, etc.
-## [0.7.3] - 2026-02-20
-### Added
-- feat(plugin): bundled OpenClaw skill (skills/SKILL.md) - teaches agents when to call agenr_store and agenr_recall as MCP tools; automatically available when plugin is installed
-- feat(plugin): complete configSchema in openclaw.plugin.json (signalMinImportance, signalMaxPerSignal, signalsEnabled, dbPath)
-### Changed
-- fix(init): removed AGENTS.md auto-detection heuristic for openclaw platform - openclaw must be specified explicitly via --platform openclaw (AGENTS.md is also used by Codex; the heuristic was unreliable)
-- fix(init): agenr init --platform openclaw no longer writes to AGENTS.md - the OpenClaw plugin handles memory injection via prependContext; AGENTS.md write was redundant
-### Internal
-- chore(plugin): bump openclaw.plugin.json version to 0.7.3
-## [0.7.2] - 2026-02-20
-### Fixed
-- fix(store): within-batch deduplication - entries with the same subject+type+source file in a single storeEntries() call are now deduplicated before processing, preventing same-batch signal duplicates (entries from different source files with the same subject are kept as distinct)
-- fix(store): re-extraction guard - entries with the same subject+type+source_file extracted within 24 hours now increment confirmations instead of adding a new entry
-- fix(mcp): append-only MCP access log at ~/.agenr/mcp-access.log for observability of agenr_recall and agenr_store tool calls
-## [0.7.1] - 2026-02-20
-### Added
-- feat(init): new `agenr init` command to auto-wire project instructions, MCP config, and `.agenr/config.json` with project slug/platform/projectDir
-- feat(init): `--depends-on` support for dependency-aware project recall scope in `.agenr/config.json`
-### Changed
-- feat(mcp): default `agenr_recall` scope now reads `AGENR_PROJECT_DIR` + `.agenr/config.json` per call and auto-includes direct dependencies when `project` is omitted
-- feat(mcp): `project="*"` now bypasses configured project scope, while explicit `project` values stay strict (no dependency expansion)
-- feat(mcp): default `agenr_store` project now comes from configured project scope when caller omits `project`
-- docs: corrected setup guidance in `docs/guides/scenarios.md` and aligned MCP examples in `docs/MCP.md` with current init output
-### Removed
-- perf(mcp): removed public `since_seq` parameter/handler from `agenr_recall`
-- perf(plugin): removed redundant OpenClaw `writeAgenrMd` write path (session-start context is still injected via `prependContext`)
-- perf(signals): removed extra `agenr_recall` footer from signal notifications for lower token overhead
-## [0.7.0] - 2026-02-19
-### Added
-- feat(signals): mid-session signal delivery via `before_prompt_build` hook - notifies agents of new high-importance entries (imp >= 7) with compact 50-100 token notifications
-- feat(signals): `signal_watermarks` table for per-consumer rowid-based watermark tracking
-- feat(mcp): `since_seq` parameter on `agenr_recall` for watermark-based incremental recall without embedding cost
-- feat(plugin): `signalsEnabled`, `signalMinImportance`, `signalMaxPerSignal`, and `dbPath` plugin config options
-### Changed
-- refactor(plugin): plugin now opens a direct DB connection for sub-ms signal queries (vs CLI spawn)
-- refactor(plugin/types): expanded `PluginApi` and `AgenrPluginConfig` types for signal support
-## [0.6.15] - 2026-02-19
-### Changed
-- perf(db): file-backed DB clients now set `PRAGMA busy_timeout=3000` during WAL initialization, reducing immediate `SQLITE_BUSY` failures under write contention
-- perf(db): `initDb()` now explicitly sets `PRAGMA wal_autocheckpoint=1000` for WAL-enabled clients to make checkpoint behavior explicit and testable
-- perf(watch): watcher now supports `walCheckpointIntervalMs` (default `30000`) to rate-limit per-cycle WAL checkpoints while keeping shutdown checkpoint behavior unchanged
-### Fixed
-- test(watch): updated per-cycle checkpoint tests to pass `walCheckpointIntervalMs: 0` when asserting legacy always-checkpoint behavior
-- test(db): added coverage for file-backed `busy_timeout`, explicit `wal_autocheckpoint`, and `:memory:` busy-timeout exclusion
-- test(watch): added interval-gating, shutdown-checkpoint, and sentinel-bypass coverage for WAL checkpoint scheduling
-## [0.6.14] - 2026-02-19
-### Fixed
-- fix(daemon): launchd plist now uses `KeepAlive` with `Crashed`-only semantics and `ThrottleInterval` of 10 seconds so intentional daemon stops do not auto-restart while crash recovery remains enabled
-## [0.6.13] - 2026-02-19
-### Added
-- feat(daemon): `agenr daemon status` now includes watcher health details from `watcher.health.json` (heartbeat age, stalled warning, sessions watched, entries stored)
-### Changed
-- test(daemon): added daemon status health coverage for fresh/missing/stale/error health scenarios and deterministic heartbeat age output
-### Fixed
-- fix(consolidate): corrected `@libsql/client` arg typing in scoped filter paths by using `InValue[]` for SQL args
-- fix(daemon): status command now handles health read failures gracefully and still exits successfully
-## [0.6.12] - 2026-02-19
-### Added
-- feat(watch): new `src/watch/health.ts` heartbeat support with `WatcherHealth` schema, atomic `watcher.health.json` writes, resilient reads, and `isHealthy()` stale-heartbeat checks (5 minute threshold)
-- feat(watch): `runWatcher` now writes heartbeat health snapshots on startup and after every cycle, including PID, start time, last heartbeat timestamp, sessions watched, and total entries stored
-- feat(watch): directory-mode session switch events now increment `sessionsWatched` (including initial `null -> first` activation)
-### Changed
-- chore(watch): injected `writeHealthFileFn` dependency in watcher and watch command paths to keep heartbeat writes testable and mockable
-- test(watch): added `tests/watch/health.test.ts` (10 tests) and new watcher heartbeat assertion coverage in `tests/watch/watcher.test.ts`
-## [0.6.11] - 2026-02-19
-### Fixed
-- fix(shutdown): SIGINT/SIGTERM now wake the watcher immediately via a shared wake callback, so long polling sleeps are interrupted without waiting for the next interval
-- fix(watch): watcher now registers and always deregisters the shutdown wake callback in `runWatcher`, preventing stale wake handlers across normal exits (`--once`) and repeated runs
-- fix(watch): `runWatchCommand` now executes registered shutdown handlers on signal-triggered exits, keeps direct PID cleanup for clean `--once` exits, and adds a 5s force-exit timeout guard (`.unref()`) to avoid indefinite hangs
-## [0.6.10] - 2026-02-19
-### Fixed
-- OpenClaw plugin: AGENR.md now writes a compact summary (subjects only + entry count + recall instructions) instead of full content, preventing double-injection of full context if loaded into Project Context
-- Note: version 0.6.9 was published with a stale build and unpublished; 0.6.10 is the correct release of these changes
-## [0.6.9] - 2026-02-19
-### Fixed
-- OpenClaw plugin: session-seen guard prevents recall firing on every turn (fires once per session)
-- OpenClaw plugin: sessionKey now read from ctx (second handler arg) instead of event
-- OpenClaw plugin: DEFAULT_AGENR_PATH uses correct 2-level relative path to dist/cli.js
-- OpenClaw plugin: spawn strategy detects .js vs executable binary
-### Added
-- OpenClaw plugin: writes AGENR.md to ctx.workspaceDir after successful recall (fire-and-forget)
-## [0.6.8] - 2026-02-19
-### Fixed
-- fix(openclaw-plugin): OpenClaw plugin now uses api.on("before_agent_start") instead of api.registerHook("agent:bootstrap"). The previous approach registered the handler in the gateway bundle's internal handlers map, which is a different module instance from the embedded agent runner. The typed hook system (api.on) uses the shared global plugin registry and works correctly across both bundles.
-## [0.6.7] - 2026-02-19
-### Fixed
-- fix(openclaw-plugin): add name and description to registerHook opts to resolve OpenClaw hook registration warning
-## [0.6.6] - 2026-02-19
-### Added
-- feat(openclaw-plugin): OpenClaw plugin that injects agenr memory into agent sessions
-  - New src/openclaw-plugin/index.ts: plugin entry point, registers agent:bootstrap hook
-  - New src/openclaw-plugin/recall.ts: runs agenr CLI recall, formats JSON as markdown
-  - New src/openclaw-plugin/types.ts: local type aliases for OpenClaw SDK compatibility
-  - Memory injected as synthetic AGENR.md file into # Project Context in system prompt
-  - Grouped markdown output: Active Todos / Preferences and Decisions / Facts and Events
-  - Skips subagent and cron sessions automatically (sessionKey pattern check)
-  - Configurable: budget, enabled via openclaw.json plugins.entries.agenr.config
-  - 5 second timeout on recall; all errors swallowed silently to never block session start
-  - package.json "openclaw" key declares dist/openclaw-plugin/index.js as plugin extension
-## [0.6.5] - 2026-02-19
-### Added
-- feat(watch): watcher writes watcher.pid on start and deletes on exit
-- feat(ingest): ingest exits 1 with clear error if watcher is running
-- feat(watch): isWatcherRunning() helper with stale-PID detection in src/watch/pid.ts
-- feat(watch): deleteWatcherPid registered via onShutdown() as v0.6.6 graceful shutdown hook point
-### Fixed
-- fix(ingest): write conflicts between ingest and watcher are now blocked at the ingest entry point
-- fix(watch): watcher PID write failures now use error-level formatting for consistent clack error output
-- fix(ingest): watcher-running guard now reports via clack error output instead of raw stderr text
-## [0.6.4] - 2026-02-19
-### Added
-- feat(recall): spaced repetition recall strength via recall_intervals tracking
-- feat(recall): computeSpacingFactor() rewards entries with proven long inter-recall gaps
-- feat(schema): recall_intervals column (TEXT/JSON) added via COLUMN_MIGRATIONS
-- feat(types): recall_intervals field on StoredEntry, spacing field on RecallResult.scores
-### Fixed
-- fix(recall): legacy spacing imputation now anchors at created_at and lands exactly on last_recalled_at (including recall_count=1), restoring expected spacing bonuses
-- fix(recall): spacingFactor now applies to the recall-base component before importance comparison, preventing early saturation while keeping memoryStrength clamped to <= 1.0
-- fix(recall): updateRecallMetadata uses json_insert SQLite built-in for atomic array append, avoiding read-modify-write concurrency race
-- fix(recall): recall_intervals timestamps stored as Unix integer seconds (not ISO string) to prevent x1000 unit error in gap calculations
-- fix(recall): removed unused getScoreComponents() refactor artifact to avoid divergence from the active scoring path
-- fix(db): VACUUM database after db reset to reclaim freed pages immediately
-## [0.6.3] - 2026-02-19
-### Added
-- agenr db reset --full --confirm-reset: full clean-slate reset
-  - Deletes watch-state.json and review-queue.json after DB schema reset
-  - Creates a pre-reset DB backup before any destructive operation
-  - Prints backup path to stdout
-  - Dry-run mode when --confirm-reset is omitted
-- Extracted resetDb() into src/db/schema.ts (shared by db reset and db reset --full)
-- Added backupDb() helper in src/db/client.ts
-## [0.6.2] - 2026-02-19
-### Added
-- feat(extractor): elaborative encoding pre-fetch now runs before each chunk extraction, retrieves top-related memories from the vector index, and injects up to 3 references into the extractor prompt
-- feat(cli): `--no-pre-fetch` flag added to `agenr extract`, `agenr ingest`, and `agenr watch` to opt out of prompt memory pre-fetch
-- feat(cli): `--db` flag added to `agenr extract`, `agenr ingest`, and `agenr watch` for database path overrides
-- feat(recall): exported `fetchRelatedEntries()` thin wrapper for direct ANN vector candidate queries
-### Changed
-- tuning(extractor): pre-fetch similarity threshold set to `0.78` for `text-embedding-3-small` (1024 dimensions)
-- tuning(extractor): fresh-install pre-fetch skip threshold set to 20 non-superseded entries
-- tuning(extractor): pre-fetch timeout set to 5000ms to avoid chunk extraction stalls on hanging embedding calls
-### Security
-- prompt: injected related memories are explicitly reference-only and do not lower the SKIP threshold
-- runtime: pre-fetch is always best-effort and silently degrades to empty related-memory context on any error
-## [0.6.1] - 2026-02-19
-### Fixed
-- fix(watch): context file generation failed with CLIENT_CLOSED when context path is configured
-- fix(mcp): remove agenr_done tool (was not removed in v0.6.0 as intended)
-## [0.6.0] - 2026-02-18
-### Added
-- feat(consolidate): forgettingScore, protected subject patterns, and active forgetting pass with `--forget` deletion gate
-- feat(config): `forgetting.protect` never-forget registry plus `scoreThreshold`/`maxAgeDays`/`enabled` config defaults
-- feat(health): new `agenr health` command with read-only DB health and forgetting candidate summaries
-- feat(consolidate): `--report` pre-run consolidation stats mode (and report-only behavior with `--dry-run`)
-- feat(watch): `context-mini.md` and `context-hot.md` context variants on watch context refresh
-- feat(schema): retired, retired_at, retired_reason, suppressed_contexts columns
-- feat(recall): session-start context filtering respects suppressed_contexts
-- feat(db): retirements.json ledger for durable retirement across re-ingest
-- feat(mcp): entry IDs in agenr_recall output
-- feat(mcp): agenr_retire tool - retire any entry type by ID
-- feat(cli): agenr retire command with dry-run, persist, contains flags
-### Fixed
-- fix(health): initialize schema before health queries and support `--db` path override
-- fix(health): reduce scan memory usage by omitting `content` from health stats query
-- fix(consolidate): batch forgetting deletes, reuse assessed candidates, and avoid synchronous full `VACUUM`
-- fix(watch): use real recall score breakdown in generated context variants
-### Removed
-- `agenr_done` MCP tool removed; use `agenr_retire` instead (supports all entry types, not just todos)
-## [0.5.4] - 2026-02-18
-### Added
-- feat(todos): `agenr todo done` command to mark todos complete via CLI
-- feat(mcp): `agenr_done` MCP tool for completing todos from AI tools
-- feat(store): cross-type superseding - new entries can supersede entries of any type, not just same-type
-## [0.5.3] - 2026-02-18
-### Added
-- Explicit memory requests: "remember this/that" triggers importance >= 7, deterministic capture
-- Session label → project mapping via `labelProjectMap` config field
-- `normalizeLabel` utility for deterministic label normalization
-- `SYSTEM_PROMPT` exported from `src/extractor.ts` for testability
-### Fixed
-- `agenr eval recall` now returns correct results for all 5 query categories (was returning zero for 4 of 5 due to FTS literal match; replaced with SQL type filters and hybrid vector+FTS recall)
-## [0.5.2] - 2026-02-18
-### Added
-- `entries.project` column (with index) to tag knowledge by source project/repo (NULL for legacy entries)
-- Project auto-detection from transcript CWD in watch mode (tags entries at write time)
-- `--project` and `--exclude-project` filters/tags across commands:
-  - `agenr recall --project/--exclude-project [--strict]`
-  - `agenr context --project/--exclude-project [--strict]`
-  - `agenr store --project`
-  - `agenr ingest --project`
-  - `agenr consolidate --project/--exclude-project` (never merges across projects)
-  - `agenr db stats --project/--exclude-project`
-  - `agenr db export --project/--exclude-project`
-- MCP tool support for project:
-  - `agenr_recall` accepts optional `project` filter (comma-separated for multiple)
-  - `agenr_store` accepts optional `project` tag
-- `agenr eval recall` command for scoring regression checks (baseline save and compare)
-### Fixed
-- Recall scoring and session-start recall:
-  - Freshness boost for importance >= 6 (clamped to avoid amplifying noisy entries)
-  - Smooth exponential todo staleness decay (half-life 7 days; floors at 0.10 or 0.40 for importance >= 8)
-  - Session-start permanent window widened to 30 days (temporary remains shorter)
-  - Dynamic budget allocation based on available categories
-  - Recency tiebreaking within a 0.05 score dead-band applied to the recent category only
-- Watch ingestion now advances `byteOffset` by bytes actually read in each cycle, preventing duplicate processing when files grow during read.
-- Watch state saves are now atomic (temp file + rename), preventing partial-write corruption on process crashes.
-## [0.5.0] - 2026-02-17
-### Added
-- `_meta` table with schema version stamp for future migrations
-- `agenr db version` command to print schema version metadata
-- `agenr daemon start|stop|restart` commands
-- `agenr daemon install --dir/--platform/--node-path` options for explicit daemon configuration
-- `entries.platform` column (with index) to tag knowledge by platform (`openclaw|claude-code|codex`, NULL for legacy entries)
-- `--platform` filters/tags across commands:
-  - `agenr recall --platform`
-  - `agenr context --platform`
-  - `agenr store --platform`
-  - `agenr ingest --platform`
-  - `agenr consolidate --platform`
-  - `agenr db export --platform`
-- MCP tool support for platform:
-  - `agenr_recall` accepts optional `platform` filter
-  - `agenr_store` accepts optional `platform` tag
-### Changed
-- `agenr db stats` output now includes schema version
-- `agenr db stats` now includes per-platform breakdown
-- `agenr daemon install` now uses smart platform defaults and writes `watch --dir <path> --platform <name>` instead of `watch --auto`
-- `agenr daemon install` now prefers stable node symlinks (Homebrew) when `process.execPath` is version-specific; use `--node-path` to override
-- `agenr watch --auto` is deprecated; `agenr watch --platform <name>` is now the standard invocation and auto-resolves the default platform directory when `--dir` is omitted
-## [0.4.1] - 2026-02-17
-### Fixed
-- npx symlink handling: isDirectRun check now uses realpathSync to resolve npx symlinks correctly
-## [0.4.0] - 2026-02-15
-### Added
-- `agenr context` command - generate context files for AI tool integration
-- `agenr watch --context` - auto-refresh context file after each extraction cycle
-- `agenr daemon` - launchd daemon management for background watching
-- `agenr consolidate` - knowledge base cleanup with rule-based and LLM-assisted merging
-- Online dedup at write time (mem0-style dedup with 3 cosine bands)
-- Post-extraction LLM dedup pass
-- Concurrent chunk extraction
-- Smart filtering before chunking
-- Rate limit protection for chunk extraction
-- Graceful shutdown for long-running commands (SIGINT/SIGTERM)
-- Ingest auto-retry for failed files
-- Source adapter refactor with timestamp preservation
-- Watch WAL checkpointing
-### Changed
-- Embedding dimensions upgraded from 512 to 1024 (text-embedding-3-small)
-- `confidence` field renamed to `importance` for clarity
-### Fixed
-- Session-start recall no longer dominated by stale todos (todo staleness penalty)
-- Consolidate releases DB lock after WAL checkpoint, not before
-## [0.3.0] - 2026-02-15
-### Added
-- `agenr watch` - live file watcher with auto-extraction
-- `agenr ingest` - bulk ingestion of markdown, plaintext, and JSONL
-- `agenr mcp` - MCP server for cross-tool AI memory (recall, store, extract)
-## [0.2.0] - 2026-02-14
-### Added
-- `agenr store` - smart dedup with cosine similarity bands
-- `agenr recall` - recall with scoring and budget-constrained retrieval
-- `agenr db` subcommands (stats, export, reset, path)
-## [0.1.0] - 2026-02-14
-### Added
-- `agenr extract` - structured knowledge extraction from conversation transcripts
-- `agenr setup` - interactive configuration
-- `agenr auth status` - live connection testing
-- `agenr config` - configuration management