npm - @oomkapwn/enquire-mcp - Versions diffs - 3.8.8 → 3.9.0-rc.10 - Mend

@oomkapwn/enquire-mcp 3.8.8 → 3.9.0-rc.10

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (54) hide show

package/CHANGELOG.md +583 -0
package/README.md +15 -15
package/SECURITY.md +18 -12
package/dist/cli.d.ts.map +1 -1
package/dist/cli.js +58 -1
package/dist/cli.js.map +1 -1
package/dist/dql.d.ts +10 -0
package/dist/dql.d.ts.map +1 -1
package/dist/dql.js +13 -1
package/dist/dql.js.map +1 -1
package/dist/embed-db.d.ts +23 -2
package/dist/embed-db.d.ts.map +1 -1
package/dist/embed-db.js +39 -1
package/dist/embed-db.js.map +1 -1
package/dist/embed-pipeline.d.ts +16 -0
package/dist/embed-pipeline.d.ts.map +1 -1
package/dist/embed-pipeline.js +18 -6
package/dist/embed-pipeline.js.map +1 -1
package/dist/hnsw.d.ts +50 -0
package/dist/hnsw.d.ts.map +1 -1
package/dist/hnsw.js +75 -2
package/dist/hnsw.js.map +1 -1
package/dist/index.d.ts +1 -1
package/dist/index.d.ts.map +1 -1
package/dist/index.js +1 -1
package/dist/index.js.map +1 -1
package/dist/ocr.d.ts +97 -19
package/dist/ocr.d.ts.map +1 -1
package/dist/ocr.js +145 -25
package/dist/ocr.js.map +1 -1
package/dist/server.d.ts +16 -0
package/dist/server.d.ts.map +1 -1
package/dist/server.js +61 -0
package/dist/server.js.map +1 -1
package/dist/tool-registry.d.ts.map +1 -1
package/dist/tool-registry.js +3 -2
package/dist/tool-registry.js.map +1 -1
package/dist/tools/meta.d.ts +35 -0
package/dist/tools/meta.d.ts.map +1 -1
package/dist/tools/meta.js +131 -1
package/dist/tools/meta.js.map +1 -1
package/dist/tools/search.d.ts +44 -0
package/dist/tools/search.d.ts.map +1 -1
package/dist/tools/search.js +72 -13
package/dist/tools/search.js.map +1 -1
package/dist/watcher.d.ts +116 -0
package/dist/watcher.d.ts.map +1 -1
package/dist/watcher.js +283 -9
package/dist/watcher.js.map +1 -1
package/docs/COMPARISON.md +3 -3
package/docs/QUICKSTART.md +1 -1
package/docs/api.md +15 -2
package/docs/benchmarks.md +2 -2
package/package.json +4 -4

package/CHANGELOG.md CHANGED Viewed

@@ -2,6 +2,589 @@
 All notable changes to this project will be documented here. The format follows [Keep a Changelog](https://keepachangelog.com/en/1.1.0/), and the project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
+## [3.9.0-rc.10] — 2026-05-28
+> **TL;DR:** **Closes overclaim #16 — OCR offline enforcement is now REAL (CRITICAL), plus the OCR canvas-OOM DoS.** The TSDoc/CLI-help/SECURITY.md all claimed `serve` "makes zero outbound network calls" / "no runtime CDN download" / "throws if a language isn't installed" and referenced an `install-ocr-lang` subcommand — but the code did none of it (`createWorker` silently CDN-fetched; the subcommand didn't exist). This RC builds the guards the docs promised: a **pre-flight cache check that throws fail-closed before the worker is created**, a real **`install-ocr-lang <code>` subcommand**, a worker pinned read-only to the local tessdata cache, an **absolute canvas-dimension clamp** (the `scale` cap was a false OOM guard for giant MediaBoxes), page-range validation, and **OIA Check 4e** which fails CI if any doc claims the offline guarantee while a code guard is absent (regression-proofs the #16 class, like Check 4d did for SLSA). **+15 tests (positive + NEGATIVE controls), all CI-runnable without the OCR optional deps. 944 → 959 tests.**
+**Patch — audit-driven security (sprint RC 2): #16 + DoS.**
+### Fixed
+- **#16 OCR offline enforcement (CRITICAL — claimed-guarantee vs code-guard).** `src/ocr.ts`: `extractPdfWithOcr` now calls **`assertOcrLangsInstalled(langs, langPath)`** BEFORE loading any optional dep — it `existsSync`-checks every requested `<lang>.traineddata` in the local tessdata cache and throws (fail-closed), naming the exact `install-ocr-lang` command, if any is missing. The Tesseract worker is created with `langPath` + `cachePath` at the local cache and **`cacheMethod: "readOnly"`** (never writes/refetches). New **`resolveTessdataDir()`** (`$ENQUIRE_TESSDATA_DIR` → `$XDG_CACHE_HOME/enquire-mcp/tessdata` → `~/.cache/enquire-mcp/tessdata`). New CLI **`install-ocr-lang <code>`** subcommand (mirrors `install-model`) downloads `<code>.traineddata` from tessdata_fast into that dir — the ONLY OCR network call, explicit + opt-in, with strict `^[a-z0-9_]+$` code validation (no path-traversal / URL-injection). `serve` now makes **zero** outbound calls for OCR.
+- **OCR canvas-OOM DoS (HIGH).** The `scale ∈ [0.5,4]` clamp bounds the multiplier, not the absolute pixel count — a PDF with a giant MediaBox (spec allows 14400×14400 pt) rendered to a multi-GB single-page canvas → OOM. New **`clampOcrScale(w, h, scale)`** lowers the effective scale so the larger rendered side never exceeds **`MAX_OCR_CANVAS_DIM`** (5000 px).
+- **Inverted page range (LOW).** **`resolveOcrPageRange`** throws on an empty/inverted range (e.g. `pages:[5,2]`) instead of silently returning zero pages (which a caller could misread as "image-only scan").
+- **Docs corrected to the enforced reality.** Rewrote SECURITY.md "OCR network posture" (was: `install-ocr-lang` "Deferred" + "the only outbound call in serve mode" — both now false) with the code-guard list + a stable `<a id="ocr-network-posture">` anchor; fixed the api.md broken anchor; updated `--ocr-pdfs`/`--ocr-langs` CLI help + api.md to cite `install-ocr-lang`.
+### Structural defense (closes the #16 class)
+- **OIA Check 4e** (`scripts/oia-walk.mjs`) — the "claimed-guarantee vs code-guard" pattern applied to OCR (parallel to rc.8's SLSA Check 4d). If any of README/SECURITY.md/COMPARISON/api.md/llms.txt claims "zero outbound / no runtime CDN / install-ocr-lang" (non-roadmap), it asserts `src/ocr.ts` calls `assertOcrLangsInstalled` + sets `cacheMethod:"readOnly"` AND `src/cli.ts` registers `install-ocr-lang` — failing CI otherwise. **Verified non-vacuous** (all 3 guards detected present → silent for the right reason) **and with detection power** (would flag 4+ claim lines if a guard were removed). The generalized enforcement-verb grep remains a tracked ROADMAP item (this is the #16-specific guard, mirroring how 4d was #15-specific).
+### Tests added (+15, positive + NEGATIVE controls)
+`tests/ocr-offline.test.ts` (NEW) — `resolveTessdataDir` precedence (3), `ocrLangIsInstalled`/`assertOcrLangsInstalled` incl. multi-lang + missing-pack throw (5), `extractPdfWithOcr` pre-flight throw before any dep loads (1, the load-bearing #16 guard), `clampOcrScale` normal-unchanged + huge-MediaBox-shrinks (3), `resolveOcrPageRange` clamp + inverted-throws (3). All run without `tesseract.js`/`canvas`/`pdfjs` because the guards execute before those load.
+### Files changed
+- `src/ocr.ts` — `resolveTessdataDir`/`ocrLangIsInstalled`/`assertOcrLangsInstalled`/`clampOcrScale`/`resolveOcrPageRange`/`MAX_OCR_CANVAS_DIM`; pre-flight + readOnly worker + canvas clamp + page-range in `extractPdfWithOcr`; TSDoc corrected to the enforced behavior.
+- `src/cli.ts` — `install-ocr-lang` subcommand; `--ocr-pdfs`/`--ocr-langs` help cite it.
+- `scripts/oia-walk.mjs` — Check 4e + header enumeration (11 → 12 blocks).
+- `SECURITY.md`, `docs/api.md` — OCR posture rewrite + stable anchor + subcommand row.
+- `tests/ocr-offline.test.ts` (new).
+- test count 944 → 959 across README/COMPARISON/llms.txt/AGENTS/package.json/ROADMAP.
+- version bump 3.9.0-rc.9 → 3.9.0-rc.10 (7 surfaces).
+---
+## [3.9.0-rc.9] — 2026-05-28
+> **TL;DR:** **First RC of the post-audit sprint — input-validation security.** A second, five-agent comprehensive audit (core-retrieval code · server/transport/CLI code · docs/workflows/config · competitor landscape · repo-page/discoverability) ran against rc.8; `ROADMAP.md` is rewritten around its findings + the competitive read (we are capability-ahead of every Obsidian-MCP peer; the gap to the memory leaders is published benchmarks + discoverability, not tech). This RC ships the **P0 input-validation** findings: a real **ReDoS guard** on `obsidian_open_questions` (an always-registered tool that compiled a caller-supplied `pattern` straight into V8's backtracking engine and ran it over every line of every note — a remote DoS on `serve-http`), a defensive length cap on DQL `like`, and reconciliation of the bearer-token min-length check between the CLI and the transport. **No behavior change for legitimate callers. 927 → 944 tests** (+17, all with positive + negative controls).
+**Patch — audit-driven security (sprint RC 1 of N).**
+### The audit (sprint kickoff)
+Five parallel agents re-read the project end-to-end on rc.8. Net: **zero CRITICAL beyond the already-tracked #16 OCR overclaim**; the codebase's path-safety, FTS5 escaping, int8 quantization, RRF/IR-metric, bearer-compare, CORS, and P2-10/11 session-lifecycle layers were all re-confirmed solid. New actionable findings were sequenced into a phased sprint (see `ROADMAP.md` Tier 1): **rc.9 input-validation (this RC) → rc.10 OCR offline enforcement + canvas-OOM → rc.11 watcher/HNSW correctness → rc.12 structural defenses + state-driven docs + supply-chain → rc.13 remaining correctness → rc.14 discoverability**. Audit checkpoint after each RC.
+### Fixed (input-validation security)
+- **ReDoS in `obsidian_open_questions` (HIGH).** `tools/meta.ts` compiled `args.pattern` (zod `z.string().optional()`, no constraint) into a `RegExp` and ran it per-line across the whole vault. The tool is **always registered** (not gated), so any stdio or bearer-authenticated `serve-http` client could submit a catastrophic-backtracking pattern (`(a+)+$`, `(.*)*`) and freeze the single-threaded event loop. Fix: a dependency-free **`isCatastrophicRegex`** guard that rejects "star height ≥ 2" patterns (an unbounded/amplifying quantifier applied to a group whose body also has one — honoring char-classes + escapes) **before** compile, plus a hard **`MAX_QUESTION_PATTERN_LEN` = 200** cap mirrored on the zod schema. The safe default pattern is unaffected (regression-guarded in tests).
+- **DQL `like` length cap (defensive).** `dql.ts`'s `likeToRegex` is catastrophic-backtracking-**safe by construction** (it only ever emits `.*`, never a nested quantifier — re-confirmed by the audit), so this is **not** a ReDoS fix; it just bounds regex-compile/match CPU on an absurdly long user-supplied LIKE value via **`MAX_LIKE_PATTERN_LEN` = 512** (throws above it).
+- **Bearer min-length reconciliation.** `cli.ts` accepted any non-empty `--bearer-token` while `startHttpServer` independently threw on `< 16` — so a short token passed the CLI gate then failed deeper with a less-friendly error. The `≥16` check now also fires in the CLI action (before any server setup), giving the user the `gen-token` hint + a clean `exit(1)`. The transport-layer check stays as defense-in-depth.
+### ROADMAP refresh
+`ROADMAP.md` rewritten after the second audit + competitive/discoverability survey: sharpened "#1 in our spheres" thesis, the phased rc.9→rc.14 sprint, a Tier-3 push to **publish LongMemEval scores** (the #1 credibility lever — no Obsidian MCP has any) + a "forgetting-aware" note-staleness signal (a frontier every memory competitor fails), and a "Requires the maintainer" section for the account/OAuth-gated discoverability actions (Glama claim, MCP Registry re-submit, forum post).
+### Tests added (+17, all positive + negative controls)
+- `tests/redos-guard.test.ts` (NEW) — 13 catastrophic patterns flagged (NEGATIVE), 11 safe patterns accepted incl. the production default (POSITIVE regression guard), `readUnboundedQuantifier` unit cases, + 4 `getOpenQuestions` integration cases (rejects catastrophic/over-long; accepts safe/default). The catastrophic *integration* fixture is built at runtime (`String.fromCharCode`) so CodeQL's `js/redos` static pass doesn't flag a regex literal that the guard rejects before compile — keeps "0 new CodeQL alerts" true (caught by the advisory CodeQL gate on the first PR push).
+- `tests/dql.test.ts` — `likeToRegex` cap: normal pattern matches (POSITIVE), boundary at the cap passes, over-long throws (NEGATIVE).
+- `tests/cli.test.ts` — `serve-http` short-token → `exit(1)` + "≥16 chars" hint (NEGATIVE); no-token → "required" with the length error explicitly NOT firing (contrast control).
+### Files changed
+- `src/tools/meta.ts` — `isCatastrophicRegex` + `readUnboundedQuantifier` + `MAX_QUESTION_PATTERN_LEN` + guarded compile in `getOpenQuestions`.
+- `src/tool-registry.ts` — `.max(MAX_QUESTION_PATTERN_LEN)` on the `pattern` schema + import.
+- `src/dql.ts` — `MAX_LIKE_PATTERN_LEN` + cap in `likeToRegex` (exported for tests).
+- `src/cli.ts` — bearer `≥16` check in the `serve-http` action.
+- `ROADMAP.md` — full rewrite (post-audit).
+- `tests/redos-guard.test.ts` (new), `tests/dql.test.ts`, `tests/cli.test.ts`.
+- test count 927 → 944 across README/COMPARISON/llms.txt/AGENTS/package.json; README suite-timing ~5s → ~12s (audit LOW).
+- version bump 3.9.0-rc.8 → 3.9.0-rc.9 (7 surfaces).
+---
+## [3.9.0-rc.8] — 2026-05-28
+> **TL;DR:** **Integrity-batch #2 from the exhaustive file-by-file audit** (every `src/` module, every doc, every workflow, every script re-read on Opus 4.8). Closes the cheap-but-real drift the audit surfaced and adds the FIRST structural defense for the "claimed-guarantee vs code-guard" class introduced in rc.7: a new **OIA Check 4d** that reads `.github/workflows/release.yml`, computes the SLSA Build Level it actually earns, and fails CI if any doc claims a higher level. Also: a bench-harness honesty fix (a 5-sample "p99" that always returned the max — relabeled `max`), determinism fix (`Date.now()` tag → stable), the privacy-test soft-skips made VISIBLE via `ctx.skip()` + a CI tripwire that fails loudly if the native deps that gate them ever go missing in CI, two stale test-title positioning claims, a benchmarks rounding drift, a biome binary/schema unification (2.4.14/2.4.15 → 2.4.16), and a stale Node placeholder in the bug template. **Docs/tests/scripts/config only — zero `src/` runtime logic changed. 926 → 927 tests (+1 CI tripwire).**
+**Patch — audit-driven integrity (Tier 0, batch 2).**
+### Fixed
+- **S2 — OIA Check 4d: SLSA-level code-guard (structural defense for the rc.7 #15 class).** rc.7 *corrected* the SLSA-3→L2 overclaim by hand; this rc makes the regression **structurally impossible**. New `scripts/oia-walk.mjs` Check 4d Part A statically reads `release.yml`: `earnsL3 = /slsa-framework\/slsa-github-generator/`, `doesProvenance = /npm publish[^\n]*--provenance/` → `earnedLevel = earnsL3 ? 3 : doesProvenance ? 2 : 0`. It then greps the claim surfaces (README, package.json, llms.txt, COMPARISON, STABILITY) for an L3 claim (`/\bSLSA[-\s]?3\b|…L(?:evel\s*)?3\b|levels#build-l3/i`) and fails if any claim exceeds the earned level — with a roadmap-context skip so "L3 on the roadmap" stays legal. Part B (opt-out via `--skip-network`) checks the published attestation. This is the first concrete instance of the rc.7-promised "enforcement-verb code-guard" defense.
+- **S1 — bench "p99" was always the max (honesty fix).** `scripts/bench.mjs` runs `RUNS=5` then took `quantile(samples, 0.99)`, which on 5 sorted samples is unconditionally `samples[4]` = the maximum. Reporting it as "p99" overstated tail rigor. Relabeled to `max` in the return object, the table header, and `bench/results.md` (the *values* were always the max — only the label was wrong, so no number moved).
+- **M3 — bench determinism.** The write-path micro-bench used `#new-tag-${Date.now()}`, making every run mutate a different note and defeating run-to-run comparability. Pinned to `#new-tag-stable`.
+- **T1 — privacy tests: visible skips + a CI tripwire (the silent-skip class).** `tests/cli-privacy-filters.test.ts` guarded 6 security-critical privacy assertions behind `if (!distExists() || !canRunFts5) return;` — a SILENT pass when the build or `better-sqlite3` was absent, exactly the failure mode that hides regressions. Converted all 6 to `(ctx) => { if (…) return ctx.skip(); … }` so a skip is *visible* in the reporter, and added one **CI GUARD** test that hard-asserts (when `process.env.CI`) that the dist build AND a live FTS5 query both work — so if the native-dep preconditions ever vanish in CI, the suite fails loudly instead of silently skipping the privacy coverage. The single guard transitively protects every other native-dep soft-skip (same CI preconditions). **This is the +1 test (926 → 927).**
+- **W1 — stale positioning in test titles.** `tests/github-metadata-invariant.test.ts` had two `it(...)` titles still describing the pre-v3.7.8 "Memory layer for AI agents" lead and "v3.6.3 hype keywords" — while the assertions already pinned `ABOUT_LEADS_WITH = /^The most advanced Obsidian MCP/i`. Titles realigned to what the code actually checks (α-class TSDoc-drift sibling, but in test descriptions).
+- **S4 — benchmarks rounding drift.** `docs/benchmarks.md` line 30 said "+25 MRR / +16 NDCG@10" (rounded) while every other surface uses the precise measured "+24.7 MRR / +15.5 NDCG@10". Unified to the precise figures.
+- **C1 — biome binary/schema unification.** Installed binary was 2.4.14, `biome.json` `$schema` pinned 2.4.15, `package.json` devDep `^2.4.15`. Bumped all three to **2.4.16** (latest). Clean bump — `lint:fix` reformatted one long line I'd added to `oia-walk.mjs`; zero new rule violations.
+- **bug_report.yml Node placeholder.** `.github/ISSUE_TEMPLATE/bug_report.yml` example was `v20.11.0`, below the `engines.node >= 22.13.0` floor — a reporter copying it would file an unsupported version. → `v22.13.0`.
+### Why these are batched
+All nine are state-driven findings from re-reading the repo file-by-file (the methodology gap CLAUDE.md documents: change-driven sweeps miss files not actively edited). None touch `src/` runtime behavior — they harden the *audit apparatus* (S2), *measurement honesty* (S1/M3/S4), *test visibility* (T1/W1), and *toolchain/template hygiene* (C1/bug_report). Higher-risk items stay sequenced per plan: **#16 OCR offline enforcement → rc.9; H1 watcher per-file serialization → rc.10.**
+### Files changed
+- `scripts/oia-walk.mjs` — Check 4d SLSA-level guard (Part A static + Part B network) + honest header enumeration of all 8 checks / 11 blocks.
+- `scripts/bench.mjs` — `p99`→`max` (return obj + header); `Date.now()` tag → `#new-tag-stable`.
+- `bench/results.md` — `p50 / p99` → `p50 / max` column label.
+- `tests/cli-privacy-filters.test.ts` — 6 soft-skips → `ctx.skip()`; +1 CI GUARD tripwire.
+- `tests/github-metadata-invariant.test.ts` — 2 stale test titles realigned to assertions.
+- `docs/benchmarks.md` — +25/+16 → +24.7/+15.5.
+- `biome.json` + `package.json` — biome 2.4.15 → 2.4.16.
+- `.github/ISSUE_TEMPLATE/bug_report.yml` — Node placeholder v20.11.0 → v22.13.0.
+- `ROADMAP.md` — re-sequenced #16 OCR offline (rc.8 → rc.9) + Tier 1 watcher/H1 (rc.9 → rc.10) since rc.8 became the integrity-batch; noted Check 4d as partial progress on the structural drift-class item.
+- `README.md`, `docs/COMPARISON.md`, `llms.txt`, `AGENTS.md`, `package.json` — test count 926 → 927.
+- version bump 3.9.0-rc.7 → 3.9.0-rc.8 (7 surfaces).
+### Stats
+- **927 unit tests** (+1 CI tripwire) — all passing.
+- Lint clean (biome 2.4.16, 0 warnings). `tsc` strict clean. OIA clean (8 checks incl. new 4d). scope-completeness clean.
+---
+## [3.9.0-rc.7] — 2026-05-25
+> **TL;DR:** **Tier 0 integrity batch from a full project audit** (deep code audit of all 31 src/ modules + docs/workflows/config audit + competitive survey of the Obsidian-MCP / AI-memory / RAG-MCP landscapes). Fixes the two brand-critical overclaims the audit surfaced — **#15 SLSA-3** (badge linked to the slsa.dev **L3** spec + 8+ surfaces claimed "SLSA-3", but `release.yml` only runs `npm publish --provenance` = SLSA Build **L2**) and corrects pervasive version/RC drift + an undersold reranker number. Adds a public **ROADMAP.md**, gitignores the stray `false/` npm-cache tree, adds `CITATION.cff` version field, and documents a new overclaim anti-pattern (the "claimed-guarantee vs code-guard" class behind #15 + #16). **Docs/config-only; 926 tests unchanged. The OCR-offline-enforcement overclaim (#16, "implement" decision) ships in rc.8; the watcher live-update race (H1) in rc.9.**
+**Patch — audit-driven integrity (Tier 0).**
+### The audit
+Three parallel passes:
+1. **Deep code audit** (all `src/*.ts` + `src/tools/*.ts`, whole files): **zero CRITICAL**. The codebase is well-hardened (constant-time bearer compare, ReDoS-safe glob/like walkers, fail-closed `.base` predicates, transactional SQLite). Residual: 1 HIGH (watcher race, H1), 1 HIGH (OCR offline overclaim, #16), 5 MEDIUM, 5 LOW.
+2. **Docs/workflows/config audit**: SLSA-3 overclaim (#15), version drift, OIA self-count drift (docs say "6 checks", code has 8), reranker undersell, `false/` junk dir, no ROADMAP, missing OSS-health files.
+3. **Competitive survey**: enquire is technically ahead of every Obsidian-MCP peer (CRUD-only or REST-plugin-dependent); near-parity with local-RAG MCPs (knowledge-rag); behind AI-memory frameworks (mem0/cognee/Letta/Zep) only on **published LoCoMo numbers**, **entity knowledge graph**, and **discoverability** (8★). Letta's "filesystem memory scores 74% LoCoMo" validates our vault-as-memory thesis.
+### Fixed in this rc.7 (Tier 0)
+- **#15 SLSA-3 → SLSA L2 (overclaim instance #15).** Real mechanism is `npm publish --provenance` + GitHub OIDC = a Sigstore-signed provenance attestation = **SLSA Build Level 2** (hosted builder + non-forgeable-by-author provenance). Level 3 needs an isolated builder via `slsa-framework/slsa-github-generator`. Corrected every surface: README badge (now links to the L2 spec) + hero line + comparison table + releases row, package.json description + keyword (`slsa-3` → `build-provenance`), llms.txt (×2), docs/COMPARISON.md (×2). Earning real L3 is now a tracked **ROADMAP Tier 4** item, not a claim.
+- **Version/RC drift.** README "Pre-release: currently v3.9.0-rc.3" → rc.6; QUICKSTART version example → rc.6; benchmarks.md "still valid as of rc.3" → rc.6; AGENTS.md "OIA — 6 checks" → 8 (×2); CLAUDE.md OIA-walk description "6 cheap walks" → 8 + the rc.4 "(current)" marker corrected.
+- **Reranker undersold → measured numbers.** README (3 sites) + llms.txt: "+5-10 NDCG@10 typical" → **+15.5 NDCG@10 / +24.7 MRR measured** (the figure already in COMPARISON.md + benchmarks.md). The repo was undercutting its own measured, reproducible result by ~50%.
+- **`false/` npm-cache junk → `.gitignore`.** A stray `--cache false` / `npm_config_cache=false` mis-parse created an untracked `_cacache`/`_logs` tree at repo root; one `git add .` would have committed it.
+- **CITATION.cff** gains `version` (tracks the @latest stable line, deliberately not in version-consistency) + `date-released`.
+- **New `ROADMAP.md`** — public, tiered (Tier 0 integrity → Tier 1 correctness → Tier 2 LoCoMo benchmarks → Tier 3 GraphRAG-full / conversational write-back → Tier 4 discoverability + real SLSA-L3). Linked from README.
+- **New anti-pattern documented (CLAUDE.md):** "Never claim an ENFORCED guarantee the code doesn't actually enforce" — the class behind overclaim #15 (SLSA) + #16 (OCR offline). The invariant apparatus checks numeric/doc drift but had no defense for "we promise enforcement X; does a code path enforce X?". Candidate structural defense (deferred): an OIA enforcement-verb grep.
+### Deferred to the next RCs (tracked in ROADMAP.md)
+- **rc.8 — #16 OCR offline enforcement (HIGH, "implement" decision).** SECURITY.md claims "zero outbound network calls in serve mode" and `ocr.ts` TSDoc claims a pre-flight "throws if language not installed" check, but `extractPdfWithOcr` only warns then `createWorker` silently CDN-fetches; `install-ocr-lang` is referenced in 4 files but never existed. Implement: pre-flight cache check + `langPath` wiring + real `install-ocr-lang` subcommand + env-gated integration test.
+- **rc.9 — H1 watcher per-file serialization (HIGH).** Fire-and-forget `handle()` lets concurrent saves to one file interleave `applyDiff` + the shared `rowsByLabel` mutation → in-memory HNSW drift. Add a per-relPath promise queue + concurrent-event test. Plus M1 (HNSW `saveTo` live count), L2 (unlink kind).
+### Files changed
+- `README.md` — SLSA badge/hero/table/releases; reranker numbers (×3); RC currency; ROADMAP link.
+- `package.json` — description SLSA wording + `slsa-3`→`build-provenance` keyword.
+- `llms.txt` — SLSA (×2) + reranker number.
+- `docs/COMPARISON.md` — SLSA row + provenance paragraph.
+- `docs/QUICKSTART.md`, `docs/benchmarks.md` — RC currency.
+- `AGENTS.md`, `CLAUDE.md` — OIA check count (6→8); CLAUDE status rc.7 entry + new anti-pattern.
+- `CITATION.cff` — version + date-released.
+- `.gitignore` — `false/`.
+- `ROADMAP.md` — new file.
+- version bump 3.9.0-rc.6 → 3.9.0-rc.7 (7 surfaces).
+---
+## [3.9.0-rc.6] — 2026-05-25
+> **TL;DR:** **HNSW disk persistence on live update.** When the watcher applies HNSW live updates (`applyDiff`) during a serve session, the in-memory index diverges from the persisted `.hnsw.bin` sidecar. This rc re-persists the live-updated index at watcher **close time** so the next serve loads the up-to-date sidecar (~50ms) instead of rebuilding from embed-db (~25s on 50K chunks). Correctness was always guaranteed by the signature guard (a stale sidecar is ignored → safe rebuild); this is purely a restart-speed optimization. Chose close-time flush over a debounced during-serve timer: same restart benefit, no timer-lifecycle complexity, no mid-serve disk I/O. **+3 tests (2 POSITIVE + 1 NEGATIVE control); 926 unit tests total. No API breaks (additive).**
+**Patch — restart-speed optimization.**
+### Why close-time flush (not debounced during serve)
+The originally-planned design was "debounced `saveTo` ~30s after the last mutation". On reflection, close-time flush is the better risk-adjusted choice:
+- **Correctness is already guaranteed** by the signature guard. `loadHnswFromDisk` recomputes the embed-db signature at load time and rebuilds on mismatch. After live edits, the embed-db signature changes, so a STALE `.hnsw.bin` is simply ignored → safe (just slower) rebuild. So persisting-on-live-update is ONLY a speed optimization, never a correctness fix.
+- **The only benefit is restart speed**, and that benefit is identical whether you persist debounced-during-serve or once-at-close: either way the NEXT serve loads a current sidecar.
+- **Close-time is lower risk**: no `setTimeout`/`clearTimeout` lifecycle to leak on `close()`, no concurrent save-vs-mutate window mid-serve, no disk I/O churn during active use.
+- **Tradeoff**: an ungraceful `SIGKILL` (no graceful close) skips the flush — but the signature guard makes that safe (falls back to rebuild). A crash is rare; paying a one-time ~25s rebuild after a rare crash is an acceptable cost vs the complexity of a debounce timer.
+### Implementation
+`src/watcher.ts`:
+- New fields `hnswPersistFile: string | null` + `hnswDirty: boolean`.
+- `attachHnsw(hnsw, rowsByLabel, persistFile?)` — gains an optional `persistFile` param (the `<embed-db>.hnsw` sidecar base path). Omitted (or `--no-hnsw-persist`) → no flush.
+- `syncHnswForFile` sets `hnswDirty = true` after every successful `applyDiff`.
+- New `flushHnswToDisk(): Promise<boolean>` — no-op unless dirty + index + rowsByLabel + persistFile + embedDb all wired. Recomputes the embed-db signature so the persisted `.meta.json` matches what the next `loadHnswFromDisk` expects, then `await hnsw.saveTo(...)`. Fail-soft (a save error is logged + swallowed; signature guard → safe rebuild). Returns whether a flush happened.
+- `close()` awaits `flushHnswToDisk()` before closing the chokidar watcher.
+`src/server.ts`: both `attachHnsw` call sites (built-fresh + loaded-from-disk HNSW paths) now pass `persistFile` — gated on `opts.hnswPersist !== false` so `--no-hnsw-persist` correctly skips the close-time flush too.
+### Tests added (+3)
+`tests/watcher.test.ts` — new describe block `VaultWatcher HNSW disk persistence (v3.9.0-rc.6)`:
+- POSITIVE: `flushHnswToDisk is a no-op when no live update occurred (not dirty)` — no sidecar written.
+- POSITIVE: `close() flushes the live-updated index to a loadable sidecar with matching signature` — full integration: real EmbedDb + mock embedder + real `buildHnsw` + FtsIndex → file edit → `applyDiff` → `close()` → assert `.hnsw.bin` exists AND `loadHnswFromDisk(persistFile, postEditSignature)` returns non-null. This integration test also lifted `watcher.ts` branch coverage 55.05% → 59.58%.
+- NEGATIVE control: `flushHnswToDisk is a no-op when persistFile was omitted` — even with a live mutation, no `persistFile` → no flush.
+### Files changed
+- `src/watcher.ts` — `hnswPersistFile`/`hnswDirty` fields + `flushHnswToDisk()` + `attachHnsw` param + `close()` flush (+50 lines).
+- `src/server.ts` — pass `persistFile` to both `attachHnsw` call sites.
+- `tests/watcher.test.ts` — 3 new tests (~120 lines).
+- `scripts/check-per-file-coverage.mjs` — watcher coverage comment refreshed (55.05% → 59.58%; floor stays 53%).
+- `README.md`, `llms.txt`, `AGENTS.md`, `docs/COMPARISON.md`, `package.json` — test count 923 → 926.
+- version bump 3.9.0-rc.5 → 3.9.0-rc.6 (7 surfaces).
+### What's next
+- **v3.9.0 stable** — promote `@rc → @latest`. All architectural v3.9.0 items now shipped (OCR'd PDF watcher embed-sync rc.1, HNSW in-memory live update rc.2, R-10 adaptive refill rc.3, HNSW disk persistence rc.6). Gated on a fresh external audit on the v3.9.0-rc.2+ commit per `docs/audits/AUDIT-REQUEST-v3.9.0-rc.2-2026-05-25.md` (the v3.6.1 ≥2-independent-external-auditors rule).
+- **v3.9.x+ backlog** — `install-ocr-lang` subcommand (with env-gated integration test); HNSW filter-during-search (structural R-10 closure); serve-http parity residual (P1-3); the remaining P2/P3 items.
+---
+## [3.9.0-rc.5] — 2026-05-25
+> **TL;DR:** **OCR install-instruction unification — closes the μ-class doc inconsistency the v3.9.0-rc.4 fix itself introduced (overclaim #14 residual).** rc.4's fix for overclaim #14 replaced the (non-existent) `install-ocr-lang` references in `cli.ts`/`api.md` with a "download from github tessdata_fast" instruction — but `SECURITY.md:167` documented a *different* procedure ("run OCR once online, copy `tessdata/`"). Two divergent install paths. This rc.5 unifies all three surfaces on the canonical run-once-then-copy procedure (SECURITY.md is the single source of truth), and refreshes the stale `SECURITY.md` roadmap stamp ("(v3.8.0)" → "planned, not yet shipped as of v3.9.0") with the deferral rationale (the `install-ocr-lang` subcommand needs `langPath`/`cachePath` wiring in `src/ocr.ts` that CI can't exercise — tesseract.js + canvas are optional deps absent from the matrix). **Docs-only; 923 unit tests unchanged.**
+**Patch — docs consistency (audit-driven self-correction).**
+### Why this exists
+This is a self-audit finding on rc.4's own diff (the CLAUDE.md "post-merge re-sweep" rule since v3.7.15 — after every audit-driven release that closes a class finding, scan that patch's own diff for fresh instances of the same class). rc.4 closed overclaim #14 (the `install-ocr-lang` subcommand was referenced as if it existed) by swapping the references for a manual `tessdata_fast` download instruction. But that swap was hasty — it created a NEW inconsistency: `SECURITY.md` already documented the canonical "run OCR once online to populate the `tessdata/` cache, then copy to the offline host" procedure, and rc.4's `tessdata_fast` instruction diverged from it without specifying the exact cache dir.
+This is the **μ-class** (instruction inconsistency across docs) — same class swept in v3.7.20 task #24.
+### Fixes
+- **`src/cli.ts`** (`--ocr-pdfs` + `--ocr-langs` help text): now point at SECURITY.md's canonical procedure instead of a standalone `tessdata_fast` instruction.
+- **`docs/api.md`** (`--ocr-pdfs` flag row): same — references the canonical procedure.
+- **`SECURITY.md`**: added an explicit "**Current install procedure (canonical)**" paragraph (the run-once-then-copy approach, with `tessdata_fast` as a documented alternative). Refreshed the "**Roadmap (v3.8.0)**" heading → "**Roadmap (planned, not yet shipped as of v3.9.0 — re-targeted from the original v3.8.0 plan)**" and documented WHY `install-ocr-lang` is deferred: it requires wiring a stable `langPath`/`cachePath` into `src/ocr.ts`'s `createWorker`, and the network-download path can't be exercised in CI, so it needs an env-gated integration test before shipping.
+### Why NOT implement the full `install-ocr-lang` subcommand now
+The honest answer is testability. The subcommand would:
+1. Download `<lang>.traineddata` into a cache dir (network op — fine, mirrors `install-model`).
+2. Require `src/ocr.ts`'s `createWorker` to read from that same dir via `langPath`/`cachePath`.
+Step 2 is the risk: `src/ocr.ts` currently calls `createWorker(langs, undefined, { logger })` with no explicit `langPath`, so tesseract.js uses its default cache behavior. Changing that to a custom dir could break OCR in a way CI can't catch — there are no CI tests that actually run OCR (tesseract.js + `@napi-rs/canvas` are optional deps absent from the CI matrix; the only OCR test is env-gated). Shipping an untestable change to the OCR worker config violates the "audit BEFORE ship" discipline. Tracked as a v3.9.x backlog item that must land WITH an env-gated integration test (`ENQUIRE_LOAD_OCR_E2E=1`, same pattern as the reranker smoke).
+### Files changed
+- `src/cli.ts` — `--ocr-pdfs` + `--ocr-langs` help text reference SECURITY.md canonical procedure.
+- `docs/api.md` — `--ocr-pdfs` flag row reference.
+- `SECURITY.md` — canonical-procedure paragraph + roadmap re-target.
+- version bump 3.9.0-rc.4 → 3.9.0-rc.5 (7 surfaces).
+### What's next
+- **v3.9.0-rc.6** — HNSW disk persistence on live update (debounced `saveTo` ~30s after the last watcher mutation; recompute embed-db signature so the persisted `.hnsw.bin` tracks live state).
+- **v3.9.0 stable** — promote `@rc → @latest` after rc.6 + fresh external audit per `docs/audits/AUDIT-REQUEST-v3.9.0-rc.2-2026-05-25.md`.
+- **v3.9.x+** — `install-ocr-lang` subcommand (with env-gated integration test); HNSW filter-during-search (structural R-10 closure).
+---
+## [3.9.0-rc.4] — 2026-05-25
+> **TL;DR:** **Full state-driven self-audit on the v3.8.7 → v3.9.0-rc.3 cascade — closes 3 HIGH + 4 MEDIUM findings + documents overclaim instance #13 + recursion-pair shape #7 + extends META scope-completeness with 2 new defenses.** Audit caught: (1) CLAUDE.md header line said "deferred to v3.9.0+: ... OCR'd PDF watcher embed-sync, HNSW in-memory live update, R-10 adaptive refill" while the status section in the same file listed all three as SHIPPED (overclaim #13). (2) `docs/api.md:5` said "currently v3.9.0-rc.1" — we're on rc.3. (3) v3.9.0-rc.1/rc.2/rc.3 features absent from ALL user-facing docs (README, api.md, QUICKSTART, llms.txt, AGENTS.md) — the v3.8.8 META audit covered only NUMERIC drift, not FEATURE-MENTION drift. **+5 tests (3 POSITIVE + 2 NEGATIVE controls); 923 unit tests total.** All findings closed by the same PR.
+**Patch — full audit + docs-only fixes + 2 new structural defenses.**
+### What the audit found
+Phase 0 (reality snapshot): all 9 required CI gates green, 917 tests, lint clean, OIA clean, 7-surface version-consistency, 10/10 per-file floors, 0 vulns.
+Phase 1 (state-driven docs walk via parallel general-purpose agent): 3 HIGH + several MEDIUM/LOW findings.
+Phase 2 (code-doc consistency via parallel general-purpose agent): PASS — every v3.8.7 → v3.9.0-rc.3 CHANGELOG claim verified in the codebase.
+### HIGH findings (all closed in this rc.4)
+- **H-1 — Feature-mention drift**: v3.9.0-rc.1 (`--ocr-pdfs` + 2 sibling flags), v3.9.0-rc.2 (HNSW in-memory live update), v3.9.0-rc.3 (`adaptiveHnswRefill`) shipped in 3 RCs but appeared ONLY in CHANGELOG + CLAUDE.md. Zero hits in `README.md`, `docs/api.md` (flag table), `docs/QUICKSTART.md`, `docs/http-transport.md`, `llms.txt`, `AGENTS.md`. **Fix**: added the 3 OCR flags + 6 other previously-paragraph-only stable flags (`--include-pdfs`, `--enable-reranker`, `--reranker-model`, `--reranker-top-n`, `--use-hnsw`, `--hnsw-ef`, `--late-chunk-context`, `--no-hnsw-persist`, `--quantize-embeddings`) to `docs/api.md` flag table. Added rc.1/rc.2/rc.3 mention to README highlight reel + llms.txt bullet list + AGENTS.md watcher section.
+- **H-2 — Stale RC index**: `docs/api.md:5` said "currently v3.9.0-rc.1 — OCR'd PDF watcher embed-sync"; actual `@rc` is v3.9.0-rc.3. **Fix**: updated to mention all three RCs (OCR, HNSW live update, R-10 adaptive).
+- **H-3 — Ambiguous CI gate rendering in README**: README line 249 listed "lint · test ×2 [Node 22/24] · smoke · audit · coverage · version-consistency · docs · oia" as the 9 required gates, but the `test ×2` rendering reads as 1 entry visually → looks like 8 gates while claiming "9 required". **Fix**: rewrote to enumerate explicitly: "(1) lint, (2) test on Node 22, (3) test on Node 24, (4) smoke, …, (9) oia".
+### MEDIUM findings (closed in this rc.4)
+- **M-1 — Overclaim instance #13** (CLAUDE.md self-contradiction): `CLAUDE.md:9` said "**Still deferred to v3.9.0+:** ... OCR'd PDF watcher embed-sync, HNSW in-memory live update, R-10 adaptive refill" — but the status section in the same file (lines ~143–145) listed all three as SHIPPED. **Class**: stale future-tense deferral claim (vs the present-tense "as of vX.Y.Z" pattern OIA Check 7 catches since v3.8.3). **Fix**: rewrote the header to clearly separate "v3.9.0 RCs shipped on `@rc`" from "Still deferred to v3.9.x+" (HNSW filter-during-search, embed-db migrations, distributed rate-limit, HNSW disk persistence on live update).
+- **M-2 — Stale QUICKSTART version**: `docs/QUICKSTART.md:32` expected output `3.7.12` — bumped to mention both `3.9.0-rc.3` (`@rc`) and `3.8.8` (`@latest`).
+- **M-3 — Stale benchmarks version footer**: `docs/benchmarks.md:3` cited v3.7.x version stamps. **Fix**: appended "still valid as of v3.9.0-rc.3 — retrieval pipeline unchanged; v3.8.x→v3.9.0 work was correctness/hardening + watcher live-update, not algorithmic" so the page is no longer misleadingly date-stale.
+### META extension — 2 new scope-completeness defenses (recursion-pair shape #7)
+The v3.8.8 META audit (`scripts/scope-completeness-audit.mjs`) covered 5 NUMERIC-CLAIM patterns. The HIGH-1 finding above (3 OCR flags missing from `docs/api.md`) revealed that META's dimension coverage was incomplete. **Recursion-pair shape #7** documented: even after v3.8.8's META audit landed, drift in a different dimension (feature mentions) snuck in for 3 RCs.
+Added in rc.4:
+- **`runDeferredClaimAudit()`** — scans `CLAUDE.md` for `(?:Still\s+)?deferred\s+to\s+v\d+\.\d+\.\d+\+?:\s*([^.\n]+)` patterns. For each item named in such a line, checks whether the same file contains a "shipped" status entry mentioning that item. If both present → finding. Closes overclaim #13 class structurally.
+- **`runCliFlagCoverageAudit()`** — extracts every `.option("--name", …)` from `src/cli.ts`; verifies each appears in `docs/api.md` (substring match). Subcommand-specific flags (`--bearer-token`, `--queries`, `--lang`, etc.) live in `subcommandExempts` and are skipped. Closes the feature-mention class for CLI flags specifically.
+- **`runAudit()`** now composes all three sub-audits (numeric + deferred-claim + cli-flag-coverage). OIA Check 8 picks up the extended results automatically.
+### Tests added (+5)
+`tests/scope-completeness-invariant.test.ts` extended:
+- POSITIVE: `runDeferredClaimAudit returns zero findings on current state` (proves rc.4's CLAUDE.md fix closed overclaim #13)
+- POSITIVE: `runCliFlagCoverageAudit returns zero findings on current state` (proves the new OCR/HNSW flags are in `docs/api.md`)
+- POSITIVE: `runAudit returns union of all three sub-audits` (composition correctness)
+- NEGATIVE: deferred-to regex matches the drift pattern (proves the audit would catch a regression)
+- NEGATIVE: missing-flag-in-docs is structurally detectable (synthetic CLI + doc fixture)
+### CLAUDE.md anti-patterns added
+Two new rules captured (already-existing recurring shapes from this session):
+- **Update forward-looking deferral claims in the same commit that ships the deferred item** — closes overclaim instance #13 class. The `deferred-claim` defense above makes this structural; the rule documents the human-side discipline.
+- **META scope-completeness defenses must cover every drift DIMENSION** — closes recursion-pair shape #7. New rule: every structural defense PR must enumerate covered + uncovered dimensions; uncovered ones become deferred-defense TODOs.
+### Files changed
+- `CLAUDE.md` — overclaim #13 documented; recursion-pair shape #7 documented; header bullet at line 9 corrected; 2 new anti-pattern rules added.
+- `docs/api.md` — `:5` Channels paragraph current; flag table expanded with 12 new rows (3 OCR + 9 previously-paragraph-only stable flags).
+- `README.md` — highlight reel + features-table CI block rendering.
+- `llms.txt` — v3.9.0 features bulleted.
+- `AGENTS.md` — watcher section mentions `setOcrPdfs` + `attachHnsw`.
+- `docs/QUICKSTART.md` — version example refreshed.
+- `docs/benchmarks.md` — footer "still valid as of v3.9.0-rc.3" note.
+- `scripts/scope-completeness-audit.mjs` — `runNumericAudit` (renamed), `runDeferredClaimAudit`, `runCliFlagCoverageAudit`, combined `runAudit` (+200 lines).
+- `tests/scope-completeness-invariant.test.ts` — extended describe block with 5 new tests.
+- `README.md`, `llms.txt`, `AGENTS.md`, `docs/COMPARISON.md`, `package.json` — test count 918 → 923.
+- version bump 3.9.0-rc.3 → 3.9.0-rc.4 (7 surfaces).
+### What's next
+- **v3.9.0-rc.5** — HNSW disk persistence on live update (debounced `saveTo` ~30s after last mutation). Originally planned for rc.4; deferred to make space for this audit-driven docs cascade.
+- **v3.9.0 stable** — promote `@rc → @latest` after rc.5 lands + fresh external audit on v3.9.0-rc.2+ per `docs/audits/AUDIT-REQUEST-v3.9.0-rc.2-2026-05-25.md`.
+- **v3.9.x+** — HNSW filter-during-search (architectural; closes R-10 structurally).
+---
+## [3.9.0-rc.3] — 2026-05-25
+> **TL;DR:** **R-10 adaptive HNSW refill + external audit attribution.** Closes the last open INFO finding from the corrected 2026-05-25 external audit (`docs/audits/v3.8.0-rc.15-external-2026-05-25.md`, 4.85/5). New `adaptiveHnswRefill()` helper in `src/tools/search.ts` doubles k up to maxAttempts=3 times when the post-filter hit count is below `limit`. Closes the ">66% excluded" under-return class that rc.9's static 6× multiplier could not fully solve. Archives the external audit doc in `docs/audits/` + lifts the "External audit blocker per v3.6.1 STILL OPEN" framing in CLAUDE.md (the corrected audit retroactively justifies v3.8.0 stable). Creates `docs/audits/AUDIT-REQUEST-v3.9.0-rc.2-2026-05-25.md` for the next fresh pass. **+7 tests (5 POSITIVE + 2 NEGATIVE controls); 918 unit tests total. No API breaks.**
+**Patch — R-10 + audit attribution.**
+### R-10 adaptive HNSW refill (INFO-2 from corrected external audit)
+**Problem**: The embed-db can contain entries for paths that the privacy filter (`vault.isExcluded`) drops at response-build time. Pre-3.9.0-rc.3 the HNSW path fetched a STATIC multiplier of `max(limit × 6, 50)` entries; for vaults with > 66% excluded entries, filtering left fewer than `limit` results and the response under-returned.
+**Fix**: `adaptiveHnswRefill()` (`src/tools/search.ts`) is a bounded loop:
+```ts
+let k = min(initialK, maxLabels);
+let filtered: T[] = [];
+for (let attempt = 0; attempt < maxAttempts; attempt++) {
+  filtered = filter(searchKnn(k));
+  if (filtered.length >= limit) break;
+  if (k >= maxLabels) break;        // saturated — re-search yields same set
+  k = min(k * 2, maxLabels);
+}
+return filtered;
+```
+`maxAttempts = 3` bounds the worst-case to 3 × HNSW search latency (~30ms). Typical vaults converge on attempt 1 (most have < 20% excluded). The refill engages only for the long-tail privacy-heavy configurations the static multiplier under-served.
+**Residual**: at > 95% excluded the loop still saturates without satisfying `limit`. That's the structural limit of post-filter retrieval; the architectural fix is `HNSW filter-during-search` (pushes the privacy predicate into the graph traversal). Deferred to v3.9.x+.
+### Tests added (+7)
+`tests/hnsw.test.ts` — new describe block `adaptiveHnswRefill (v3.9.0-rc.3 R-10)`:
+- POSITIVE: returns initialK results when no filter drops anything (0% excluded case)
+- POSITIVE: refills when 80% are filtered out (R-10 target case)
+- POSITIVE: doubles k up to MAX_REFILL_ATTEMPTS=3 times when refill needed (assertion on call count + k progression)
+- POSITIVE: stops doubling when k saturates maxLabels (prevents redundant calls when filter rejects everything)
+- POSITIVE: respects custom maxAttempts override
+- NEGATIVE control: exits after attempt 1 when filter satisfies on first try (proves the early-exit optimization fires)
+- NEGATIVE control: maxAttempts=0 makes zero searchKnn calls (proves the loop bound works)
+### External audit attribution (closes v3.8.1 framing)
+- **`docs/audits/v3.8.0-rc.15-external-2026-05-25.md`** archived in-repo. The corrected audit (returned 2026-05-25 after the auditor acknowledged delivering the wrong project's doc to a prior chat — see v3.8.1 retraction): 4.85/5, ship-blockers none, 5 of 6 actionable findings already closed by the rc.18 → v3.8.5 cascade. INFO-2 (R-10 residual) closes in this rc.3.
+- **CLAUDE.md header + v3.8.0/v3.8.1 entries** updated: the "External audit blocker per v3.6.1 STILL OPEN" framing is lifted. v3.8.0 stable promotion is now retroactively justified by the corrected audit. The v3.8.1 retraction was about misdirected delivery, not about the verdict itself being wrong.
+- **`docs/audits/AUDIT-REQUEST-v3.9.0-rc.2-2026-05-25.md`** created — fresh audit request for the next pre-stable promotion (v3.9.0 → @latest). Lists the delta since rc.15, current state snapshot, specific zones of interest (HNSW concurrency, OCR network posture, P2-10/P2-11 wire-level verification, MCP Registry sync, test count drift).
+### Cross-walk: external audit findings vs current state
+| ID | Finding (audit on rc.15) | Current status (a80d491, v3.9.0-rc.2) |
+|---|---|---|
+| **M-REG-1** | server.json version drift; gate doesn't cover registry manifest | ✅ Closed in v3.8.0-rc.18 S-AUDIT-1 (5 → 7 version-consistency surfaces) |
+| **L-HYB-1** | searchHybrid lacks terminal vault.isExcluded() filter | ✅ Closed in v3.8.0-rc.18 S-AUDIT-2 (line 1019 of src/tools/search.ts) |
+| **L-OIA-1** | check:oia Check 6 fails on stale coverage-summary.json | ✅ Closed in v3.8.0-rc.18 S-AUDIT-3 (test:coverage → check:oia order documented) |
+| **INFO-1** | README badge "v3.7.x stable" but @rc = rc.15 | ✅ Closed — README now `v3.8.x stable`, badge `tests-918 passing` |
+| **INFO-2** | R-10 residual: HNSW under-return at > 66% excluded | ✅ **Closed in this rc.3** (adaptive refill loop) |
+| **INFO-3** | T-2..T-5, HTTP P2-10/P2-11, multi-subcommand backlog | ✅ T-2/T-3/T-4 in v3.8.5; HTTP P2-10/P2-11 in v3.8.7; multi-subcommand in v3.8.0-rc.17. T-5 was over-counted placeholder (only 4 named items) |
+### Files changed
+- `src/tools/search.ts` — `adaptiveHnswRefill()` helper + integration in HNSW path of `embeddingsSearch` (+85 lines).
+- `tests/hnsw.test.ts` — adaptiveHnswRefill describe block (+7 tests).
+- `docs/audits/v3.8.0-rc.15-external-2026-05-25.md` — new file (archived audit).
+- `docs/audits/AUDIT-REQUEST-v3.9.0-rc.2-2026-05-25.md` — new file (fresh audit request).
+- `CLAUDE.md` — header note + v3.8.0/v3.8.1 entries + backlog section updated to reflect corrected audit.
+- `README.md`, `llms.txt`, `AGENTS.md`, `docs/COMPARISON.md`, `package.json` — test count 911 → 918.
+- version bump 3.9.0-rc.2 → 3.9.0-rc.3 (7 surfaces).
+### What's next
+- **v3.9.0-rc.4** — HNSW disk persistence on live update (debounced `saveTo` ~30s after last mutation). Currently `applyDiff` only mutates the in-memory index; next serve restart triggers a full rebuild from embed-db.
+- **v3.9.0 stable** — promote `@rc → @latest` after rc.4 lands + the fresh external audit on the v3.9.0-rc.2+ commit completes (per `docs/audits/AUDIT-REQUEST-v3.9.0-rc.2-2026-05-25.md`).
+- **v3.9.x+** — HNSW filter-during-search (architectural; pushes the privacy/exclude filter into the graph traversal itself rather than post-filter, structurally closing the R-10 class).
+---
+## [3.9.0-rc.2] — 2026-05-25
+> **TL;DR:** **HNSW in-memory live update — closes the last named v3.8.0 architectural deferral.** When the watcher updates embed-db rows for an md/pdf file change, the in-memory HNSW index is now updated in lockstep via the new `HnswIndex.applyDiff(removeLabels, addPoints)` method. Pre-3.9.0 the index was rebuilt only at serve startup; long-running sessions slowly drifted as embed-db got upserts but HNSW kept the original vectors. Search results now reflect vault edits within the watcher debounce window (~250ms typical). **+13 tests (10 POSITIVE + 3 NEGATIVE controls); 911 unit tests total. No API breaks (additive — old callers ignoring the new return values + interface methods keep working).**
+**Minor (continued) — architectural watcher feature, paired with rc.1's OCR'd PDF watcher embed-sync.**
+### What changes
+`src/embed-db.ts` — `upsertNote` + `deleteNote` now return the embed-db row ids that were affected:
+```ts
+// Before (v3.8.x):
+db.upsertNote(relPath, mtime, chunks);    // void
+db.deleteNote(relPath);                    // void
+// After (v3.9.0-rc.2):
+const { oldIds, newIds } = db.upsertNote(relPath, mtime, chunks);
+//      ^^^^^^                            //  deleted rows' AUTOINCREMENT ids
+//              ^^^^^^                    //  fresh ids assigned, parallel to chunks[]
+const deletedIds = db.deleteNote(relPath); // ids of rows dropped
+```
+`src/hnsw.ts` — `HnswIndex` interface extended with three new methods:
+- `applyDiff(removeLabels, addPoints)` — markDelete each removeLabel, then addPoint each new vector with `replaceDeleted=true` (reuses deleted slots before growing). Auto-resizes when capacity would be exceeded. Returns `{ removed, added }` counts. Silently skips removeLabels that were never added (the watcher's view can lag behind reality after a sweep eviction).
+- `resize(newMaxElements)` — grow the index in place. No-op if already large enough.
+- `capacity()` — returns `{ currentCount, maxElements }` for capacity planning.
+`buildHnsw` now passes `allowReplaceDeleted=true` to `HierarchicalNSW.initIndex` so the live-update path can reuse deleted slots without a per-call constructor flag.
+`src/watcher.ts` — new `attachHnsw(hnsw, rowsByLabel)` method + internal `syncHnswForFile()` helper. After every embed-db mutation in the md / pdf event handlers, the watcher calls `applyDiff(oldIds, newPoints)` and updates the shared `rowsByLabel` Map (the same map `searchHybrid` reads via reference — mutations are immediately visible to subsequent searches).
+`src/server.ts` — wires `attachHnsw` after both HNSW init paths (built-fresh + loaded-from-disk) so users on `--use-hnsw` automatically get live updates without any new CLI flag.
+### Why the wiring is late-binding
+The watcher boots BEFORE HNSW initializes (so file events from boot-time edits aren't dropped). The `attachHnsw` API mirrors the existing `attachEmbed` and `setOcrPdfs` patterns: the watcher accepts handles after construction via explicit late-binding methods. The handlers check `if (this.hnsw)` at runtime and gracefully skip live-update when HNSW isn't wired.
+### Concurrency model
+JS's single-threaded event loop gives us the property: between an `await` and the next synchronous block, no other code runs. The HNSW mutation block in `syncHnswForFile` is fully synchronous (no awaits inside), so it can't interleave with another file event's HNSW mutations. Concurrent searches read the HNSW graph during the brief windows between mutations; hnswlib-node's `markDelete` is sync and `addPoint(replaceDeleted=true)` reuses slots in place, so the worst case is a search returning the OLD label momentarily (next search after the mutation finishes sees the new label). No torn state.
+The `rowsByLabel` Map is updated AFTER the HNSW mutation completes, so a search that grabbed a label before the mutation but consults the Map after the mutation sees the new metadata (rel_path, chunk_index, text_preview). The same race exists pre-3.9.0 for the initial buildHnsw path; we're not introducing new hazards.
+### Fail-soft posture
+If `applyDiff` throws (e.g. capacity exhausted + auto-resize failed, or hnswlib-node crashed), the watcher logs a stderr line and continues:
+```
+enquire: watcher HNSW live-update failed for foo.md — <message> (search results may be stale until next serve restart)
+```
+The embed-db is already updated, so a serve restart rebuilds HNSW from the correct state. Same posture as the existing v3.8.0-rc.2 watcher embed-db fail-soft.
+### Tests added (13)
+**`tests/hnsw.test.ts` (+8)** — `HnswIndex live-update (v3.9.0-rc.2)`:
+- POSITIVE: `applyDiff removes labels (markDelete) + searchKnn no longer surfaces them`
+- POSITIVE: `applyDiff adds new points + searchKnn returns them`
+- POSITIVE: `applyDiff combined remove + add (typical watcher upsert path)` — proves the most common code path (4 chunks → 4 different chunks)
+- POSITIVE: `applyDiff silently skips removeLabels that were never added (watcher-lag tolerance)`
+- POSITIVE: `applyDiff auto-grows when adding points past maxElements (watcher fail-safe)` — index sized at 5 grows to 11 when 6 new points pushed in
+- POSITIVE: `resize grows the index; no-op when already large enough`
+- POSITIVE: `capacity returns {currentCount, maxElements}`
+- NEGATIVE control: `applyDiff with wrong-dim vector throws` — proves dim validation works before mid-loop crash
+**`tests/embed-db.test.ts` (+4)** — `EmbedDb upsertNote + deleteNote return ids (v3.9.0-rc.2)`:
+- POSITIVE: `upsertNote returns oldIds=[] + newIds for a fresh file`
+- POSITIVE: `upsertNote returns oldIds=existing + newIds=fresh on re-upsert`
+- POSITIVE: `deleteNote returns the ids that were dropped`
+- NEGATIVE control: `deleteNote on absent file returns empty array`
+**`tests/watcher.test.ts` (+1)** — `attachHnsw throws when embedDb has not been attached` — pins the late-binding ordering contract.
+### Files changed
+- `src/embed-db.ts` — upsertNote returns `{oldIds, newIds}`; deleteNote returns `number[]`. Internal `Stmt` interface widened to expose `lastInsertRowid`.
+- `src/hnsw.ts` — `HnswIndex` interface extended with `applyDiff` / `resize` / `capacity`; `wrapNativeIndex` implements them with hasLiveUpdate feature probe + fail-soft on older hnswlib-node builds.
+- `src/watcher.ts` — `HnswIndex` import + `HnswRowMeta` interface; `attachHnsw` method; internal `syncHnswForFile` helper called from md and pdf event handlers (upsert + delete + unlink paths).
+- `src/server.ts` — `watcher.attachHnsw(...)` calls on both built-fresh and loaded-from-disk HNSW paths.
+- `tests/hnsw.test.ts`, `tests/embed-db.test.ts`, `tests/watcher.test.ts` — 13 new tests with POSITIVE+NEGATIVE control siblings per CLAUDE.md rule since v3.6.4.
+- `README.md`, `llms.txt`, `AGENTS.md`, `docs/COMPARISON.md`, `package.json` — test count 898 → 911.
+- `scripts/check-per-file-coverage.mjs` — `src/watcher.ts` floor 64 → 53 with documented rationale (syncHnswForFile + attachHnsw + 6 new branches in event handlers are integration-test-heavy; coverage will lift back when rc.3 adds the chokidar-driven end-to-end test). Also refreshed 2 stale `// current ~X%` inline comments (http-transport 69.39% → 72.85%, embed-pipeline 86.84% → 88.09%).
+- version bump 3.9.0-rc.1 → 3.9.0-rc.2 (7 surfaces).
+### What's next
+- **v3.9.0-rc.3** — debounced HNSW disk persistence. Currently `applyDiff` only mutates the in-memory index; next serve restart triggers a full rebuild from embed-db. Need a debounced `saveTo` call (~30s after last mutation) so the persisted .hnsw.bin tracks live state.
+- **v3.9.0 stable** — promote @rc → @latest once rc.3 lands + 7-day dogfood window passes.
+- **v3.9.x+ deferred**: HNSW filter-during-search (architectural — pushes the privacy/exclude filter into the graph traversal itself rather than post-filter, fixing the search-underfill class).
+---
+## [3.9.0-rc.1] — 2026-05-25
+> **TL;DR:** **OCR'd PDF watcher embed-sync — closes the last deferred v3.8.0 backlog item.** When `--watch` + `--include-pdfs` + `--ocr-pdfs` are all set, the watcher now runs Tesseract OCR on image-only / scanned PDFs that pdfjs can't read text from, then pipes the OCR-derived text through `embedSinglePdf`'s new `preExtractedPages` path so the embed-db stays in sync with edits during a long serve session. Pre-3.9.0 the watcher cleared embed rows for image-only PDFs on change — search recall slowly degraded across sessions for scanned-document vaults. New `VaultWatcher.setOcrPdfs(enabled, langs?, maxPages?)` method wires the OCR fallback after `attachEmbed()` runs. **+5 tests (4 POSITIVE + 1 NEGATIVE control); 898 unit tests total. No API breaks. RC.1 — HNSW in-memory live update deferred to rc.2.**
+**Minor — architectural watcher feature.**
+### What changes
+`src/watcher.ts` — new OCR-on-watch path in the PDF event handler:
+```ts
+// PDF change event fires
+const result = await extractPdfText(buf); // cheap pdfjs path
+this.ftsIndex.reindexPdfFile(relPath, mtimeMs, result.pages); // FTS5 update
+if (this.embedDb && this.embedder) {
+  let preExtractedPages;
+  if (this.ocrPdfs && !result.hasText) {
+    // NEW v3.9.0-rc.1: image-only PDF + OCR enabled → run Tesseract
+    const ocrResult = await extractPdfWithOcr(buf, { langs, maxPages });
+    preExtractedPages = ocrResult.pages.filter((p) => !p.isEmpty)
+      .map((p) => ({ pageNumber: p.pageNumber, text: p.text }));
+  }
+  const pdfResult = await embedSinglePdf(vault, embedder, fileMeta, {
+    lateChunkContext,
+    ...(preExtractedPages ? { preExtractedPages } : {})
+  });
+  // embed-db upsert as usual; source label in log says src=OCR or src=pdfjs
+}
+```
+`src/embed-pipeline.ts` — `embedSinglePdf` gains an optional `preExtractedPages` parameter. When supplied, the pdfjs read + extract step is skipped; the supplied pages feed the existing chunk + embed pipeline directly. Same `[page: N]` markers, same chunking logic — only the text source changes. Empty array → returns null (caller drops rows, parity with pdfjs `hasText=false`).
+`src/cli.ts` + `src/server.ts` — three new CLI flags lifted into `addAdvancedRetrievalOptions` (so both `serve` and `serve-http` get them):
+- `--ocr-pdfs` — enable the OCR fallback. Requires `--watch` + `--include-pdfs`.
+- `--ocr-langs <langs>` — Tesseract language pack (default `eng`). Multi-lang via `+`, e.g. `eng+rus`.
+- `--ocr-max-pages <n>` — per-event OCR page cap (default `DEFAULT_OCR_MAX_PAGES = 200`).
+### Why the wiring is two-stage
+The watcher constructs BEFORE `attachEmbed()` runs in `startServer` (so file events from boot-time edits are captured). OCR validation can't happen at construction time because embed-db isn't open yet. Instead:
+1. Watcher constructor accepts `ocrPdfs` flag but defers validation.
+2. PDF event handler checks `ocrPdfs && embedDb && includePdfs` at runtime — skips OCR silently if any leg is missing.
+3. `setOcrPdfs(enabled, langs?, maxPages?)` is the explicit late-binding API. server.ts calls it AFTER `attachEmbed` succeeds; throws loud if `includePdfs=false` or `embedDb` is null (caught + logged + watcher continues without OCR).
+This matches the v3.8.0-rc.2 R-7 pattern for `attachEmbed` itself: the watcher boots minimal, gets feature-attachments later, gracefully degrades when a leg is missing.
+### Fail-soft posture
+OCR can fail for legitimate reasons: optional deps not installed (`tesseract.js` / `@napi-rs/canvas`), language pack not pre-downloaded, page count > maxPages cap, encrypted PDF. All of these:
+- Log a single stderr line (when not in silent mode)
+- Continue with the default pdfjs path (which will return empty pages for image-only PDFs and drop the embed rows)
+- FTS5 reindex still runs (it doesn't depend on OCR)
+This matches the existing v3.8.0-rc.3 fail-soft posture for the non-OCR PDF embed sync.
+### Tests added
+`tests/embed-pipeline.test.ts` — 2 new tests for the `preExtractedPages` branch:
+- POSITIVE: supplies 2 synthetic pages; asserts chunks + `[page: N]` markers in preview blob; verifies pdfjs is bypassed (the on-disk PDF is intentionally a TEXT file that pdfjs would reject — proves the bypass works).
+- NEGATIVE control: empty `preExtractedPages: []` returns null (parity with `hasText=false`).
+`tests/watcher.test.ts` — 3 new tests for `setOcrPdfs`:
+- Throws when `includePdfs=false` (validation)
+- Throws when `embedDb` not attached (validation)
+- NEGATIVE control: `setOcrPdfs(false)` is a no-op regardless of other state
+End-to-end OCR with real Tesseract is gated by env var (TODO rc.2 — same pattern as `ENQUIRE_LOAD_HYDE_E2E`).
+### What's in scope vs deferred for v3.9.0
+- ✅ **rc.1 (this patch)**: OCR'd PDF watcher embed-sync.
+- 🔜 **rc.2**: HNSW in-memory live update (when an embed-db row changes, mark-delete the old HNSW labels + add new ones with `replaceDeleted=true`, persist on a debounced timer). Requires extending `HnswIndex` interface + careful concurrency handling with serve-time search queries.
+- 🔜 **rc.3+ → stable**: HNSW filter-during-search architectural (the longest-deferred item — currently we post-filter HNSW top-K against the privacy/exclude list, which can leave the response under-filled. Filter-during-search would push the filter into the HNSW graph traversal itself).
+### Files changed
+- `src/watcher.ts` — `ocrPdfs/ocrLangs/ocrMaxPages` options + `setOcrPdfs` method + PDF event handler OCR branch (+90 lines)
+- `src/embed-pipeline.ts` — `preExtractedPages` option in `embedSinglePdf` (+20 lines)
+- `src/server.ts` — `ServeOptions` extensions + `setOcrPdfs` wiring after `attachEmbed` (+30 lines)
+- `src/cli.ts` — 3 new `addAdvancedRetrievalOptions` flags (+15 lines)
+- `tests/embed-pipeline.test.ts` — 2 new tests (+50 lines)
+- `tests/watcher.test.ts` — 3 new tests (+30 lines)
+- `tests/cli-parity.test.ts` — sanity-cap bumped 8 → 11 to match grown helper
+- `scripts/check-per-file-coverage.mjs` — `src/watcher.ts` branches floor lowered 69 → 64 with documented rationale: OCR-on-watch added a try/catch around dynamic `extractPdfWithOcr` import + 3 new option fields. The OCR branches require `tesseract.js` + `@napi-rs/canvas` optional deps that aren't installed in CI; mocking them would defeat the fail-soft posture the codepath is testing. Floor will lift back when v3.9.0-rc.2+ adds an env-gated E2E test (`ENQUIRE_LOAD_OCR_E2E=1` mirroring `ENQUIRE_LOAD_HYDE_E2E`).
+- `docs/api.md` — Channels paragraph acknowledges v3.9.0-rc.1
+- `README.md`, `llms.txt`, `AGENTS.md`, `docs/COMPARISON.md`, `package.json` — test count 893 → 898
+- version bump 3.8.8 → 3.9.0-rc.1 (7 surfaces)
+---
 ## [3.8.8] — 2026-05-25
 > **TL;DR:** **META structural-defense scope completeness audit — closes the recurring "recursion-pair shape" class (6 documented instances since v3.6.x).** New `scripts/scope-completeness-audit.mjs` enumerates every numeric-claim defense's scope (which files it covers + which it exempts) and sweeps the entire repo for matching patterns; any file containing a tracked pattern that's NOT in the defense's scope or exempts list fails CI. Wired into both `tests/scope-completeness-invariant.test.ts` (change-driven gate) and `scripts/oia-walk.mjs` Check 8 (state-driven sweep). Discovered + fixed one immediate gap: STABILITY.md "44 tools" reference was already covered by `docs-consistency.test.ts` line 183 but missing from the audit manifest. **+5 tests (3 POSITIVE + 2 NEGATIVE controls); 893 unit tests total.**