npm - @psiclawops/hypermem - Versions diffs - 0.8.1 → 0.8.2 - Mend

@psiclawops/hypermem 0.8.1 → 0.8.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (7) hide show

package/CHANGELOG.md ADDED Viewed

@@ -0,0 +1,101 @@
+# Changelog
+All notable changes to hypermem are documented here.
+## 0.8.2 — npm packaging & install path clarity
+- **INSTALL.md and CHANGELOG.md now ship in the npm package.** Previously missing from `package.json` `files` array, so `npm pack` / `npm publish` excluded them. Users installing from the registry now get the full install guide without cloning the repo.
+- **Library vs plugin install paths split in README.** The Installation section now clearly separates standalone library usage (no OpenClaw required) from OpenClaw plugin wiring. Library consumers no longer have to read through plugin-specific steps.
+- **Cold-start prerequisites expanded.** INSTALL.md prerequisites now include explicit verification commands (`openclaw gateway status`, `node --version`), distinguish first-time OpenClaw setup from existing installs, and explain when to use `gateway start` vs `gateway restart`.
+- **Verification steps handle auth failures.** Step 4 verification now includes diagnostic callouts for `gateway restart` failures (not onboarded, missing config) and `openclaw logs` failures (gateway not running, path issues).
+- **Troubleshooting quick-reference added to README.** A 7-row common-issues table covers plugin-not-found, legacy fallback, no semantic results, build errors, empty facts, context pressure, and missing messages.db -- with symptoms, causes, and one-line fixes.
+## 0.8.1 — Documentation fixes
+- **Install docs rewritten for clean first-run:** README and INSTALL.md installation sections restructured so config comes before restart, `$HOME` replaces `~` in shell-interpolated JSON strings, clone path is no longer hardcoded, and health-check instructions note the repo-dir requirement and data-dir timing.
+- **install:runtime output:** `npm run install:runtime` now prints the next-step commands (config creation + plugin wiring) so the user doesn't have to hunt through docs.
+- **Lightweight mode clarified:** Step 3 no longer says "skip for Lightweight" — a `config.json` with `provider: "none"` is required to suppress the Ollama fallback warning.
+## 0.8.0 — Phase C correctness, tool-artifact store, tiered contradiction resolution
+- **Reranker credentials via environment variable:** `ZEROENTROPY_API_KEY` and `OPENROUTER_API_KEY` are now read from the environment. Config file `zeroEntropyApiKey` / `openrouterApiKey` still works as fallback. Recommended setup puts the key in a shell env file so it never lands in config-under-version-control.
+- **Recommended operator tuning refreshed:** suggested full-deployment config lowered `warmHistoryBudgetFraction` to 0.27 and `budgetFraction` to 0.55, with corresponding fact/history/keystone trims. Produces steadier turn-over-turn pressure on long-running agent fleets and avoids the tight warm→trim→compact cycling that the previous richer defaults encouraged on 200k+ models. Existing deployments with custom configs are unaffected.
+- **`contextWindowSize` retired from recommended config:** runtime autodetection from the active model identifier is stable; manual override is no longer suggested. The field is still honored if present for back-compat.
+- **Custom/local model window guidance:** INSTALL.md and `docs/TUNING.md` now document `contextWindowOverrides` — how to detect autodetect failure (`verboseLogging` + `budget source:` log line), the two failure signatures (undersized = continuous trim cycling; oversized = mid-turn overflow), and how warming/trimming/compaction all scale off the detected window. Operators running finetuned, local, or unusually-named models should set overrides before tuning any other budget dial.
+- **Phase C correctness cluster:** tool result guards (C1), budget cluster drop telemetry, oversized artifact degradation with canonical reference format, unified pressure accounting, replay recovery isolation. Prompt-path verification harness proves Phase C behavior in real gateway flow.
+- **Tool-artifact store:** durable tool result storage for wave-guard (Sprint 2.1 active-turn hydration, Sprint 2.2 retention sweep + sensitive-artifact flag). Tool results survive compaction cycles.
+- **Fleet registry seeding on startup:** fleet agents seeded from known identity files on every gateway start. No cold-start gaps in registry state after deploy.
+- **Tiered contradiction resolution (V19):** auto-supersede at ≥0.80 confidence, `invalidateFact` at 0.60–0.80, log-only below 0.60. Background indexer records contradiction audits. Schema v19.
+- **Dreaming promoter temporal-marker screen:** blocks durable promotion of time-bound facts without `validFrom`/`invalidAt` metadata. Prevents stale temporally-anchored facts from polluting the durable fact store.
+- **MEMORY.md authoring guide:** `docs/MEMORY_MD_AUTHORING.md` documents the static-vs-dynamic fact contract for operators and agents.
+- **B4 model-aware budgeting:** compositor resolves token budget from active model string when the runtime does not pass `tokenBudget` explicitly. Budget recomputes on mid-session model swap.
+- **BLAKE3 content dedup + RRF fusion:** O(1) fingerprint dedup across all retrieval paths. RRF merges FTS5 and KNN results. Zigzag ordering balances recency and relevance in composed output.
+## 0.7.0 — Temporal validity, expertise storage, contradiction detection
+- **Temporal validity engine:** facts expire, get superseded, and decay over time. Stale knowledge auto-deprioritized in retrieval.
+- **Expertise store:** per-agent skill/domain tracking. Agents accumulate domain proficiency through indexed interactions.
+- **Contradiction detector:** flags conflicting facts at ingest time. Newer evidence supersedes older, with audit trail.
+- **Maintenance APIs:** programmatic access to compaction stats, index health, and storage diagnostics.
+- **CI pipeline:** Redis removed, memory-plugin build stage added, monorepo file-ref wiring.
+## 0.6.2 — Turn DAG, identity scrub, fleet customization
+- Turn DAG phases 1–3: DAG-native reads, context-scoped recall, fence downgrade.
+- Fleet name substitution for public distribution.
+- Fleet customization guide and single-agent install docs.
+## 0.6.0 — Redis removal complete
+- SQLite-only cache layer replaces Redis entirely. Zero external service dependencies.
+- Simplified deployment: single binary + SQLite files.
+## 0.5.6 — Content fingerprint dedup and hardening
+- O(1) fingerprint dedup across all retrieval paths (temporal, open-domain, semantic, cross-session). Catches rephrased near-duplicates that substring matching missed.
+- Identity bootstrap pre-fingerprinting: SOUL.md, USER.md, IDENTITY.md content already in the prompt is never double-injected by retrieval.
+- Indexer circuit breaker with startup integrity check for library.db corruption. Graceful degradation, not cascading failure.
+- SQL parameterization hardening on datetime and FTS5 paths.
+## 0.5.5 — Tuning collapse and config schema
+- Plugin config schema: all tuning knobs declarable in `openclaw.json`. No more manual config.json edits.
+- Tuning simplified to 4 primary knobs: `budgetFraction`, `reserveFraction`, `historyFraction`, `memoryFraction`.
+- Identity and doc chunk dedup against OpenClaw bootstrap injection.
+- Window cache with freshness diagnostics.
+## 0.5.0 — SQLite hot-cache transition and context engine
+- SQLite `:memory:` hot cache introduced for the runtime hot layer. Redis compatibility artifacts still existed at this stage, full removal completed in 0.6.0.
+- Context engine plugin: runs as an OpenClaw `contextEngine` slot, composing prompts per-turn.
+- Transform-first assembly: tool results compressed before budget allocation, not after.
+- Cluster-aware budget shaping: related tool turns grouped and trimmed together.
+- Hybrid FTS5 + KNN retrieval with Reciprocal Rank Fusion.
+- Workspace seeding: agents auto-ingest their workspace docs on bootstrap.
+- Runtime profiles: `light`, `standard`, `full`.
+- Obsidian import and export.
+- Metrics dashboard primitives.
+## 0.4.0 — Eviction and migration
+- Image and heavy-content eviction pre-pass in assembly. Old screenshots and large tool outputs aged out before they compete for budget.
+- Engine version stamps in library.db. Schema migration runs automatically on version bump.
+- Migration guides and scripts for Cognee, QMD, Mem0, Zep, Honcho, and raw MEMORY.md files.
+## 0.3.0 — Subagent context and retrieval
+- Subagent context inheritance: spawned subagents get bounded parent context, session-scoped docs, and relevant facts.
+- Tool Gradient v2: turn-age tiers with head+tail truncation on tool results.
+- Cursor-aware indexer with ghost message suppression.
+## 0.2.0 — Retrieval access control
+- Trigger registry ownership and auditability.
+- Retrieval access control, trigger fallback paths, and history rebalance.
+## 0.1.0 — Core architecture
+- Four-layer memory: in-memory cache, message history, vector search, structured library.
+- 8-level priority compositor with slot-based prompt assembly.
+- Cross-agent memory access with visibility-scoped permissions.
+- Knowledge graph with DAG traversal.

package/INSTALL.md ADDED Viewed

@@ -0,0 +1,800 @@
+# hypermem — Installation Guide
+## Prerequisites
+- **Node.js 22+** (uses built-in `node:sqlite`)
+- **OpenClaw** must already be installed, onboarded, and running. The plugin install assumes a working OpenClaw home with a valid `openclaw.json` and a gateway that can restart.
+- **Disk space:** allow at least 2 GB free. Plugin builds pull OpenClaw as a dev dependency.
+**Verify before starting:**
+```bash
+openclaw gateway status    # should show "running" or "ready"
+openclaw config get gateway # should return gateway config, not an error
+```
+If `gateway status` shows "disabled" or "not configured", complete OpenClaw onboarding first. `openclaw gateway restart` only works when the gateway service is already set up. On a brand-new OpenClaw install that has never been started, you need `openclaw gateway start` (or the full onboarding flow) before installing plugins.
+## Quick Start
+> **Disk space:** plugin installs pull OpenClaw as a dev dependency. Allow at least 2 GB free before starting.
+>
+> **Prerequisites:** OpenClaw must be installed and onboarded before this step. Run `openclaw gateway status` to confirm. If the gateway is not configured, complete OpenClaw setup first.
+>
+> **Production runtime path:** install the built runtime payload into `~/.openclaw/plugins/hypermem`. Do not point production at `/tmp` or at your development repo clone.
+>
+> **Config merge warning:** if you already have values in `plugins.load.paths` or `plugins.allow`, merge them instead of overwriting them blindly.
+```bash
+git clone https://github.com/PsiClawOps/hypermem.git
+cd hypermem
+npm install && npm run build
+npm --prefix plugin install && npm --prefix plugin run build   # ~1 min on a clean machine
+npm --prefix memory-plugin install && npm --prefix memory-plugin run build
+npm run install:runtime
+mkdir -p ~/.openclaw/hypermem
+cat > ~/.openclaw/hypermem/config.json <<'JSON'
+{
+  "embedding": {
+    "provider": "none"
+  }
+}
+JSON
+```
+`install:runtime` stages the built plugin files into `~/.openclaw/plugins/hypermem`. It does **not** modify your OpenClaw config. The commands below wire the plugins manually.
+Wire both plugins into OpenClaw:
+```bash
+openclaw config set plugins.load.paths "[\"$HOME/.openclaw/plugins/hypermem/plugin\",\"$HOME/.openclaw/plugins/hypermem/memory-plugin\"]" --strict-json
+openclaw config set plugins.slots.contextEngine hypercompositor
+openclaw config set plugins.slots.memory hypermem
+openclaw config set plugins.allow '["hypercompositor","hypermem"]' --strict-json
+openclaw gateway restart
+```
+The repo clone is for build and release work. OpenClaw should load the installed runtime payload from `~/.openclaw/plugins/hypermem/`.
+### Verification checkpoints
+1. **Build verified**
+   - root build succeeds
+   - `plugin` build succeeds
+   - `memory-plugin` build succeeds
+2. **Wiring verified**
+   - OpenClaw accepts `plugins.load.paths`
+   - slots are set to `hypercompositor` and `hypermem`
+   - gateway restart succeeds
+3. **Runtime verified active**
+Send a message to any agent, then verify:
+```bash
+openclaw logs --limit 100 | grep -E 'hypermem|context-engine'
+```
+Expected lightweight-mode lines:
+- `[hypermem] hypermem initialized`
+- `[hypermem] Embedding provider: none — semantic search disabled, using FTS5 fallback`
+- `[hypermem:compose]`
+If you see a fallback like `falling back to default engine "legacy"`, the install is **not** fully active yet even if the build and wiring steps succeeded.
+---
+## What hypermem Does
+hypermem replaces OpenClaw's default context assembly with a four-layer SQLite-backed memory system. Every turn, it queries all layers in parallel and composes context within a fixed token budget. No transcript accumulates. No lossy summarization. Content that doesn't fit this turn stays in storage instead of being destroyed.
+| Layer | Storage | What it holds | Speed |
+|---|---|---|---|
+| **L1** | SQLite in-memory | Session cache: identity, recent history, active state | 0.08ms |
+| **L2** | Per-agent SQLite | Conversation history, survives restarts, rotates at 100MB | 0.13ms |
+| **L3** | Per-agent SQLite + sqlite-vec | Semantic search via embeddings | 0.29ms |
+| **L4** | Shared SQLite | Structured knowledge: facts, episodes, preferences, fleet registry | 0.09ms |
+Everything runs in-process on SQLite memory databases. No external database services required.
+---
+## Requirements
+| Dependency | Required | Notes |
+|---|---|---|
+| Node.js 22+ | **Yes** | Uses built-in `node:sqlite`. No standalone SQLite install needed. |
+| OpenClaw | **Yes** | Any version with context engine plugin support |
+| Ollama | Local embeddings only | [ollama.ai](https://ollama.ai) — pull `nomic-embed-text` |
+| OpenRouter API key | Hosted embeddings only | Alternative to local: [openrouter.ai](https://openrouter.ai) |
+| Gemini API key | Gemini embeddings only | Alternative: [aistudio.google.com](https://aistudio.google.com/apikey) |
+`sqlite-vec` is the only native dependency and installs automatically via npm.
+> **Package versions:** the root package (`hypermem`) and the two plugins (`hypercompositor`, `hypermem-memory`) are versioned independently. Plugin versions trail the core by one minor version when no plugin-facing API changes ship in a release — this is expected.
+The **embedding layer** (L3 semantic search) requires a configured provider. Without one, hypermem falls back to FTS5 keyword matching. This is functional but degrades recall quality. See [Setup Styles](#setup-styles) below.
+---
+## Setup Styles
+Pick a style based on your hardware and cost tolerance. All styles support full history, fact recall, and session continuity — the differences are in semantic search quality and local resource requirements.
+| Style | Embedding | Reranker | Semantic recall | Cost | Hardware |
+|---|---|---|---|---|---|
+| **Lightweight** | None (FTS5 only) | None | Keyword match only | Free | Any |
+| **Local** | Ollama nomic-embed-text | None (RRF) or Ollama Qwen3-Reranker (GPU only) | Good | Free | ~1GB RAM + GPU for reranker |
+| **High** | OpenRouter Qwen3-8B | OpenRouter Cohere Rerank 4 | Best (MTEB #1) | ~pennies/day | API key |
+The **reranker is optional at every tier.** Without one, results are ordered by RRF fusion score (FTS5 + vector) — a solid default. The reranker improves precision but requires a GPU for the local option; CPU-only systems should leave it as None.
+---
+## Embedding Providers
+Pick a tier based on your hardware:
+| Tier | Provider | Quality | Cost | Setup |
+|---|---|---|---|---|
+| **Minimal** | None (FTS5 keyword only) | Keyword-only, no semantic recall | Free | None |
+| **Local** | Ollama + nomic-embed-text (768d) | Good | Free | Ollama required |
+| **Hosted** | OpenRouter + Qwen3 Embedding 8B (4096d) | Best (MTEB #1) | ~pennies/day | API key |
+| **Gemini** | Google Gemini Embedding (768d) | Good | Free tier available | API key |
+### Minimal (no embedder)
+No Ollama, no API key. This config must exist **before gateway restart and runtime verification** so the clean install validates the intended lightweight behavior. Set `provider: 'none'` explicitly in `~/.openclaw/hypermem/config.json` to disable embedding entirely:
+```json
+{
+  "embedding": {
+    "provider": "none"
+  }
+}
+```
+Without a config file, the default provider is `ollama` — if Ollama isn't running, the vector store initialization fails non-fatally and hypermem falls back to FTS5. Using `provider: 'none'` makes the intent explicit and avoids the init attempt.
+You'll see in the logs:
+```
+[hypermem] Embedding provider: none — semantic search disabled, using FTS5 fallback
+```
+Upgrade to a higher tier later without losing stored data.
+### Troubleshooting clean installs
+**Symptom:** `Context engine "hypercompositor" ... falling back to default engine "legacy"`
+- The plugin was found, but the context engine did not activate correctly.
+- Treat the install as failed at runtime, not successful.
+- Check for release artifact mismatch, stale plugin build output, or config collisions with existing plugin paths.
+**Symptom:** HyperMem logs never appear after restart
+- Re-check `plugins.load.paths` for exact absolute paths.
+- Confirm the clone directory still exists and was not created in a temp location.
+- Confirm existing `plugins.allow` and `plugins.load.paths` values were merged correctly instead of overwritten incorrectly.
+**Symptom:** build succeeds, but behavior is not lightweight mode
+- Confirm `~/.openclaw/hypermem/config.json` existed before restart.
+- Confirm it contains:
+  ```json
+  {
+    "embedding": {
+      "provider": "none"
+    }
+  }
+  ```
+### Local — Ollama + nomic-embed-text
+```bash
+ollama pull nomic-embed-text
+```
+No config file needed. Ollama on `localhost:11434` with `nomic-embed-text` is the default. Requires ~1GB RAM for the model.
+### Hosted — OpenRouter + Qwen3 Embedding 8B (Recommended)
+Best quality, no local compute. Embedding is async and doesn't affect agent response time.
+Create or edit `~/.openclaw/hypermem/config.json`:
+```json
+{
+  "embedding": {
+    "provider": "openai",
+    "openaiApiKey": "sk-or-YOUR_OPENROUTER_KEY",
+    "openaiBaseUrl": "https://openrouter.ai/api/v1",
+    "model": "qwen/qwen3-embedding-8b",
+    "dimensions": 4096,
+    "batchSize": 128
+  }
+}
+```
+Get a key at [openrouter.ai](https://openrouter.ai). Cost at typical agent volumes: under a cent per day.
+### Gemini
+Create or edit `~/.openclaw/hypermem/config.json`:
+```json
+{
+  "embedding": {
+    "provider": "gemini",
+    "geminiApiKey": "YOUR_GEMINI_API_KEY",
+    "model": "text-embedding-004",
+    "dimensions": 768,
+    "batchSize": 128
+  }
+}
+```
+Get a key at [aistudio.google.com](https://aistudio.google.com/apikey). The free tier covers typical agent usage.
+Optional Gemini-specific settings:
+- `geminiBaseUrl`: defaults to `https://generativelanguage.googleapis.com`
+- `geminiIndexTaskType`: defaults to `RETRIEVAL_DOCUMENT`
+- `geminiQueryTaskType`: defaults to `RETRIEVAL_QUERY`
+### Switching providers
+Changing providers after vectors are built requires a full re-index (dimensions are incompatible):
+```bash
+node scripts/embed-existing.mjs
+```
+Fresh installs don't need this.
+---
+## Reranker (Optional)
+The reranker re-orders semantic search candidates by relevance before injection. Without it, results are ordered by RRF fusion score (FTS5 + KNN). The reranker is optional — the system degrades gracefully to original order on any failure.
+| Provider | Model | Cost | Hardware | Notes |
+|---|---|---|---|---|
+| **None** | — | Free | Any | Default — RRF fusion ordering |
+| **Ollama (local)** | Qwen3-Reranker-0.6B | Free | GPU recommended | CPU-only: too slow for >5 candidates |
+| **OpenRouter** | cohere/rerank-4-pro | ~pennies/day | Any | Best quality, uses existing key |
+| **ZeroEntropy** | zerank-2 | ~pennies/day | Any | Dedicated reranking service |
+**CPU-only systems:** skip the local reranker. Sequential inference makes it 2-10 seconds per document on CPU — unusable at any reasonable candidate depth. RRF fusion (`provider: "none"`) is the right default for CPU-only setups and is meaningfully better than raw vector ordering alone.
+### No reranker (default)
+No config needed. RRF fusion of FTS5 + vector results is the default ordering. For most conversational memory workloads, this is sufficient and runs on any hardware.
+### Local — Ollama Qwen3-Reranker-0.6B
+Best option for air-gapped or GPU-equipped setups. Slower than hosted due to sequential inference (one model call per candidate document) — requires a GPU for practical use.
+```bash
+ollama pull dengcao/Qwen3-Reranker-0.6B:Q5_K_M
+```
+Add to `~/.openclaw/hypermem/config.json`:
+```json
+{
+  "reranker": {
+    "provider": "local",
+    "ollamaUrl": "http://localhost:11434",
+    "ollamaModel": "dengcao/Qwen3-Reranker-0.6B:Q5_K_M",
+    "topK": 10,
+    "minCandidates": 5
+  }
+}
+```
+### Hosted — OpenRouter (Cohere Rerank 4)
+Fastest, highest quality. Uses the same OpenRouter key as hosted embeddings if you already have one.
+Put the key in your environment, not the config file:
+```bash
+export OPENROUTER_API_KEY="sk-or-YOUR_OPENROUTER_KEY"
+```
+Then in `~/.openclaw/hypermem/config.json`:
+```json
+{
+  "reranker": {
+    "provider": "openrouter",
+    "openrouterModel": "cohere/rerank-4-pro",
+    "topK": 10,
+    "minCandidates": 5
+  }
+}
+```
+`openrouterApiKey` in the config file is still honored as a fallback for compatibility, but env-var-first keeps credentials out of any config-under-version-control.
+### Hosted — ZeroEntropy (zerank-2)
+Alternative hosted option, specialized reranking service.
+```bash
+export ZEROENTROPY_API_KEY="YOUR_ZEROENTROPY_KEY"
+```
+Then:
+```json
+{
+  "reranker": {
+    "provider": "zeroentropy",
+    "zeroEntropyModel": "zerank-2",
+    "topK": 10,
+    "minCandidates": 5
+  }
+}
+```
+`zeroEntropyApiKey` in the config file is still honored as a fallback. Get a key at [zeroentropy.dev](https://zeroentropy.dev).
+---
+## Installation Steps
+### Step 1 — Clone and build
+```bash
+git clone https://github.com/PsiClawOps/hypermem.git
+cd hypermem
+npm install
+npm run build
+```
+Build both plugins, then install the runtime payload into OpenClaw's durable plugin directory:
+```bash
+npm --prefix plugin install && npm --prefix plugin run build
+npm --prefix memory-plugin install && npm --prefix memory-plugin run build
+npm run install:runtime
+```
+Verify:
+```bash
+npm test
+```
+The full suite takes 30–60 seconds. When complete, output ends with `ALL N TESTS PASSED ✅`. If you see `ENOSPC`, free up disk space and retry.
+### Step 2 — Wire the plugins
+Use the OpenClaw CLI. **Do not edit `openclaw.json` directly.**
+```bash
+# Add plugin load paths
+openclaw config set plugins.load.paths "[\"$HOME/.openclaw/plugins/hypermem/plugin\",\"$HOME/.openclaw/plugins/hypermem/memory-plugin\"]" --strict-json
+# Set the context engine slot
+openclaw config set plugins.slots.contextEngine hypercompositor
+# Set the memory slot
+openclaw config set plugins.slots.memory hypermem
+# Allow both plugins
+openclaw config set plugins.allow '["hypercompositor","hypermem"]' --strict-json
+```
+If you already have entries in `plugins.allow` or `plugins.load.paths`, merge rather than replace. Check current values:
+```bash
+openclaw config get plugins.allow
+openclaw config get plugins.load.paths
+```
+### Step 3 — Choose embedding provider
+See [Embedding Providers](#embedding-providers) above.
+- **Lightweight (no embedder):** create `~/.openclaw/hypermem/config.json` with `{"embedding":{"provider":"none"}}`. The Quick Start block above already does this. Without this file, the default provider is `ollama` and you'll see a non-fatal init warning if Ollama isn't running.
+- **Local:** `ollama pull nomic-embed-text`. No config file needed (Ollama is the default).
+- **Hosted/Gemini:** create `~/.openclaw/hypermem/config.json` with the provider config block from the relevant section above.
+### Step 4 — Restart and verify
+```bash
+openclaw gateway restart
+```
+> **If restart reports the gateway is disabled or not configured:** you need to complete OpenClaw onboarding before this step. See [Prerequisites](#prerequisites). `gateway restart` only works on an already-running gateway.
+Send a message to any agent, then check:
+```bash
+openclaw logs --limit 50 | grep hypermem
+```
+> **If `openclaw logs` fails with an auth or token error:** the gateway API requires authentication. Run `openclaw gateway status` to confirm the gateway is running and accessible. If the gateway is running but logs fail, check `openclaw config get gateway.token` and ensure your shell session has the correct auth context.
+Expected:
+```
+[hypermem] hypermem initialized — dataDir=/home/.../.openclaw/hypermem
+[hypermem:compose] agent=main triggers=0 fallback=true facts=3 semantic=2 ...
+```
+Full health check (run from the repo clone directory):
+```bash
+node bin/hypermem-status.mjs              # full dashboard
+node bin/hypermem-status.mjs --health     # health checks only (exit 1 on failure)
+```
+> **Note:** The health check requires the data directory to exist. It is created on first gateway restart with the plugin active. Run the `openclaw logs` check first to confirm initialization, then run the health check.
+If the plugin didn't load:
+```bash
+openclaw config get plugins.slots.contextEngine   # should be: hypercompositor
+openclaw config get plugins.slots.memory           # should be: hypermem
+openclaw status                                     # look for hypermem in plugins
+```
+### Step 5 — Configure your fleet
+hypermem works out of the box for both single-agent and multi-agent installs. The source ships with generic placeholder agent names (`agent1`, `agent2`, `director1`, etc.) in two files that define fleet topology:
+| File | What it defines |
+|---|---|
+| `src/cross-agent.ts` | Org membership, agent tiers, visibility scoping |
+| `src/background-indexer.ts` | Agent-to-domain mapping for fact classification |
+#### Single-agent installs
+No code changes needed. hypermem resolves your agent ID from your OpenClaw config at runtime. The placeholder names are never used.
+Verify it's working after Step 4:
+```bash
+openclaw logs --limit 20 | grep hypermem
+```
+You should see your agent ID (not a placeholder) in the compose logs:
+```
+[hypermem:compose] agent=my-agent triggers=0 fallback=true facts=3 semantic=2 ...
+```
+Facts, episodes, and topics are all scoped to your agent ID automatically. Cross-agent features (org visibility, shared facts) are dormant with a single agent and activate only when additional agents are configured.
+#### Multi-agent installs
+hypermem ships with generic placeholder agent names (`agent1`, `agent2`, `director1`, etc.) in the two fleet topology files listed above.
+Replace the placeholder names with your fleet:
+**1. Edit `src/cross-agent.ts`** — replace the `agents` map and `orgs` map in `defaultOrgRegistry()` with your fleet:
+```typescript
+// Before (placeholder):
+agent1: { agentId: 'agent1', tier: 'council' },
+// After (your fleet):
+architect: { agentId: 'architect', tier: 'council' },
+```
+**2. Edit `src/background-indexer.ts`** — update `AGENT_DOMAIN_MAP` with your agent IDs and their domains:
+```typescript
+// Before (placeholder):
+agent1: 'infrastructure',
+// After (your fleet):
+architect: 'infrastructure',
+```
+**3. Rebuild and restart:**
+```bash
+npm run build
+npm --prefix plugin run build
+openclaw gateway restart
+```
+Agents not listed in `AGENT_DOMAIN_MAP` default to domain `'general'`, which is fine for most setups. The org registry only matters if you use cross-agent memory visibility (org-scoped or council-scoped facts). If all your facts are agent-private or fleet-wide, you can skip the org structure entirely.
+**Test fixtures use the placeholder names by design.** Don't rename them in `test/` — the tests validate the cross-agent logic, not your specific fleet topology.
+---
+## Upgrading from 0.5.x
+```bash
+cd <path-to-hypermem>
+git pull
+npm install
+npm run build
+npm --prefix plugin install && npm --prefix plugin run build
+npm --prefix memory-plugin install && npm --prefix memory-plugin run build
+openclaw gateway restart
+```
+What changed on the path from 0.5.x to current:
+- **0.6.0**: SQLite `:memory:` became the only hot layer. Redis was fully removed and the runtime no longer depends on any external cache service.
+- **0.7.0**: Temporal validity, expertise storage, contradiction detection, and maintenance APIs landed.
+- **0.8.1**: Documentation fixes — install instructions rewritten for clean first-run, `$HOME` replaces `~` in shell-interpolated paths, Lightweight mode config clarified.
+- **0.8.0**: Phase C correctness guards, tool-artifact store, schema v10/v19, BLAKE3 dedup, RRF fusion, and fleet registry seeding shipped.
+- **Upgrade impact**: current releases use `messages.db` schema v10 and `library.db` schema v19. If you are upgrading from older 0.5.x installs, expect both schema and runtime-behavior changes.
+What changed in 0.5.x releases:
+- **0.5.5**: Plugin config schema, tuning knobs moved into `openclaw.json`. Manual `config.json` edits for compositor settings may be superseded by the plugin schema.
+- **0.5.6**: Content fingerprint dedup, indexer circuit breaker, SQL parameterization hardening.
+- **Exports field** added to `package.json`: if you import from `@psiclawops/hypermem`, verify your import paths still resolve.
+If you switch embedding providers during the upgrade, re-index:
+```bash
+node scripts/embed-existing.mjs
+```
+Check [CHANGELOG.md](CHANGELOG.md) for the full list of changes per version.
+**Build errors after upgrade:** Clean `dist/` directories and rebuild:
+```bash
+rm -rf dist plugin/dist memory-plugin/dist
+npm run build
+npm --prefix plugin run build
+npm --prefix memory-plugin run build
+```
+---
+## OpenClaw Settings (Optional Tuning)
+These are optional. hypermem works with OpenClaw defaults, but these changes reduce unnecessary overhead.
+### Lower OpenClaw's compaction threshold
+hypermem owns compaction. OpenClaw's default fires at 24K reserved tokens, which races hypermem's budget management:
+```bash
+openclaw config set agents.defaults.compaction.reserveTokens 1000 --strict-json
+```
+This makes OpenClaw's compaction a last-resort safety net that never fires in normal operation.
+### Tighter session store retention
+With hypermem active, SQLite is the durable record. JSONL transcripts provide no memory benefit:
+```bash
+openclaw config set sessions.maintenance.pruneAfter "14d"
+openclaw config set sessions.maintenance.maxEntries 200 --strict-json
+```
+OpenClaw defaults: `pruneAfter: 30d`, `maxEntries: 500`. If you browse conversation history older than 14 days via the session list, keep the higher value.
+### Session max-age (fleet installs only)
+Prevents idle sessions from accumulating indefinitely:
+```bash
+openclaw config set sessions.maxAgeHours 168 --strict-json  # 7 days
+```
+Solo installs can skip this.
+---
+## Token Budget Tuning
+These settings live in `~/.openclaw/hypermem/config.json` under the `compositor` key. All fields are optional — omit any knob to get the code-level default. Gateway restart required after changes.
+The recommended starting config for a standard single-agent deployment is intentionally lean on turn-1 warming. Semantic recall and fact triggers fire against each incoming message, so topic-relevant context surfaces as the conversation takes shape. This produces a steadier pressure profile than aggressive pre-loading and avoids the warm→trim→compact cycling you see when every session starts near the top of the budget.
+```json
+{
+  "compositor": {
+    "budgetFraction": 0.55,
+    "contextWindowReserve": 0.25,
+    "targetBudgetFraction": 0.50,
+    "warmHistoryBudgetFraction": 0.27,
+    "maxFacts": 25,
+    "maxHistoryMessages": 500,
+    "maxCrossSessionContext": 4000,
+    "maxRecentToolPairs": 3,
+    "maxProseToolPairs": 10,
+    "keystoneHistoryFraction": 0.15,
+    "keystoneMaxMessages": 12,
+    "wikiTokenCap": 500
+  }
+}
+```
+| Knob | Recommended | What it controls | Notes |
+|---|---|---|---|
+| `budgetFraction` | 0.55 | Fraction of the detected context window used as input budget | Raise to 0.65 for agents that aggressively tool-use. Autodetect only handles known model families — see *Context window overrides* below for custom/local/finetuned models |
+| `contextWindowReserve` | 0.25 | Reserve left for output and tool results | Below 0.20 on large-context models invites late-turn overflow |
+| `targetBudgetFraction` | 0.50 | Split between context assembly and history | Higher = richer facts/wiki; lower = more conversation headroom |
+| `warmHistoryBudgetFraction` | 0.27 | History's share of first-turn warming | The key lever against tight trim cycles; don't push below 0.20 |
+| `maxFacts` | 25 | Structured facts injected per turn | Recall surfaces more as topics emerge; 35 is fine for long-memory seats |
+| `maxHistoryMessages` | 500 | Candidate pool for history ranking | Pool size, not load size. 300 is fine for short-session agents |
+| `maxCrossSessionContext` | 4000 | Cross-session context tokens | Solo agents with one session: set to 0 |
+| `maxRecentToolPairs` | 3 | Verbatim tool pairs kept | Raise to 5 for code agents with heavy tool output |
+| `maxProseToolPairs` | 10 | Compressed tool pairs before stubbing | |
+| `keystoneHistoryFraction` | 0.15 | Older significant turns reserved within history slot | |
+| `keystoneMaxMessages` | 12 | Max keystone candidates per turn | Raise to 18 if the agent loses track of older decisions |
+| `wikiTokenCap` | 500 | Cap on wiki/knowledge injection | Raise if your agent uses heavy doc content |
+**Lean profile** (~35–45% fewer tokens per turn) — for constrained hosts, small models, or cost-sensitive deployments:
+```json
+{
+  "compositor": {
+    "budgetFraction": 0.55,
+    "contextWindowReserve": 0.30,
+    "warmHistoryBudgetFraction": 0.20,
+    "maxFacts": 10,
+    "maxHistoryMessages": 150,
+    "maxCrossSessionContext": 0,
+    "maxRecentToolPairs": 2,
+    "maxProseToolPairs": 6,
+    "keystoneHistoryFraction": 0.10,
+    "keystoneMaxMessages": 5,
+    "wikiTokenCap": 300,
+    "hyperformProfile": "light"
+  }
+}
+```
+---
+### Context window overrides (custom, local, or finetuned models)
+HyperMem sizes the token budget from the model string using an internal pattern table covering known families (Claude, GPT, Gemini, GLM, Qwen, DeepSeek). If your model string doesn't match a known pattern, resolution silently falls through to `defaultTokenBudget` (90k), and **every downstream dial in this section becomes wrong**, because they're all fractions of the context window:
+- `budgetFraction` × *wrong window* → wrong input budget
+- `warmHistoryBudgetFraction` × *wrong budget* → wrong warm load on first turn
+- Trim tiers and compaction thresholds fire against the wrong ceiling
+The two symptoms that indicate window-detection failure:
+1. **Undersized window detected** (you have a 200k model, HyperMem thinks it's 90k): every turn warms near the top of the misdetected budget, trim fires constantly, semantic recall and facts get starved. You see continuous `warm→trim→compact` cycling even on short sessions.
+2. **Oversized window detected** (you have a 32k local model, HyperMem thinks it's larger): warm loads overshoot the real context window, turns land mid-response with truncated output or provider-side 400s on token overflow.
+**Check what HyperMem is using.** Enable `verboseLogging: true` in the compositor config and look for the `budget source:` log line on each turn:
+```
+[hypermem-plugin] budget source: runtime tokenBudget=163840 model=provider/my-model
+[hypermem-plugin] budget source: contextWindowOverrides[provider/my-model]=131072, reserve=0.25, effective=98304
+[hypermem-plugin] budget source: fallback contextWindowSize=90000, reserve=0.25, effective=67500 model=provider/my-model
+```
+If you see `fallback contextWindowSize` for your model, detection failed and you need an override.
+**Apply an override.** Add a `contextWindowOverrides` block to `~/.openclaw/hypermem/config.json`. The key is `"provider/model"` as it appears in your agent's model string (lowercase, exact match):
+```json
+{
+  "compositor": {
+    "budgetFraction": 0.55,
+    "contextWindowReserve": 0.25,
+    "warmHistoryBudgetFraction": 0.27,
+    "contextWindowOverrides": {
+      "ollama/llama-3.3-70b":      { "contextTokens": 131072 },
+      "copilot-local/custom-sft":  { "contextTokens": 32768 },
+      "vllm/qwen3-coder-ft":       { "contextTokens": 262144 }
+    }
+  }
+}
+```
+Resolution order, highest-to-lowest priority:
+1. Runtime `tokenBudget` passed by OpenClaw (always wins if present)
+2. `contextWindowOverrides["provider/model"]` from this config
+3. Internal pattern-table match against the model string
+4. `defaultTokenBudget` fallback (90k) — **you do not want to end up here**
+Gateway restart required after editing overrides. Invalid override entries (malformed keys, impossible ranges, empty values) are dropped on load with a warning; the sanitizer will not let a bad override poison the resolver.
+**Interaction with warming and trimming.** Once the correct window is in place:
+- First-turn warm load = `detectedWindow × budgetFraction × (1 - contextWindowReserve) × warmHistoryBudgetFraction`
+- Trim pressure zones are computed from the same `detectedWindow × budgetFraction × (1 - reserve)` effective budget, so trim fires at the right proportions of the real window, not a wrong one
+- Compaction thresholds (85% nuclear, 80% afterTurn trim) are also against the effective budget, not the raw window
+TL;DR for operators running custom/local/finetuned models: **set `contextWindowOverrides` before tuning anything else in this section**. Every other knob here assumes the detected window is right.
+---
+## Data Directory
+Created automatically on first run at `~/.openclaw/hypermem/`:
+```
+~/.openclaw/hypermem/
+├── config.json             ← embedding and compositor tuning (user-created)
+├── library.db              ← L4: facts, episodes, knowledge, fleet registry (shared)
+└── agents/
+    └── {agentId}/
+        ├── messages.db     ← L2: conversation history (per-agent)
+        └── vectors.db      ← L3: semantic search index (per-agent)
+```
+**Backup:** `library.db` and per-agent `messages.db` files are persistent memory. Back them up before major upgrades.
+**Rotation:** `messages.db` rotates at 100MB or 90 days. Archives (`messages_2026Q1.db` etc.) remain searchable.
+---
+## Troubleshooting
+**Semantic search not working / no vector results**
+Check your embedding tier:
+- **Local (Ollama):** Confirm Ollama is running with `ollama list`. If `nomic-embed-text` is missing, `ollama pull nomic-embed-text` and restart the gateway.
+- **Hosted (OpenRouter):** Verify `openaiApiKey` and `openaiBaseUrl` in `~/.openclaw/hypermem/config.json`.
+- **Gemini:** Verify `geminiApiKey` in config.
+- **Minimal:** Semantic search is intentionally disabled. FTS5 keyword fallback is active.
+The background indexer runs on a 5-minute interval. After the first cycle, check `openclaw logs | grep embed`.
+**`facts=0 semantic=0` every turn**
+Expected on fresh installs. Facts and episodes accumulate over real conversations. After a few sessions these numbers grow. Workspace files can be seeded manually via the seeder API.
+**Plugin not found**
+Confirm the installed runtime artifacts exist:
+```bash
+ls ~/.openclaw/plugins/hypermem/plugin/dist/index.js
+ls ~/.openclaw/plugins/hypermem/memory-plugin/dist/index.js
+ls ~/.openclaw/plugins/hypermem/dist/index.js
+```
+If missing, rebuild and reinstall the runtime payload:
+```bash
+cd <path-to-hypermem>
+npm run build
+npm --prefix plugin run build
+npm --prefix memory-plugin run build
+npm run install:runtime
+```
+Then restart the gateway.
+**Build errors after upgrade**
+Clean `dist/` and rebuild:
+```bash
+rm -rf dist plugin/dist memory-plugin/dist
+npm run build
+npm --prefix plugin run build
+npm --prefix memory-plugin run build
+```
+**Agent not resuming context after restart**
+Check that `~/.openclaw/hypermem/agents/{agentId}/messages.db` exists. If missing, the agent hasn't bootstrapped yet and will create it on first session.
+---
+## Uninstalling
+To return to OpenClaw's default context engine:
+```bash
+openclaw config set plugins.slots.contextEngine legacy
+openclaw config set plugins.slots.memory none
+openclaw gateway restart
+```
+Data in `~/.openclaw/hypermem/` is untouched. Re-enable by switching back.
+---
+_Questions or issues: file against [the repo](https://github.com/PsiClawOps/hypermem) or ask in `#hypermem`._

package/README.md CHANGED Viewed

@@ -405,7 +405,39 @@ Schema versions are stamped into each database on startup and checked on open. A
 **Requirements:** Node.js 22+, OpenClaw with context engine plugin support. No standalone SQLite install needed (uses Node 22 built-in `node:sqlite`). Embedding provider is optional for first install.
-### From source
+hypermem works two ways:
+- **As a library** — import directly into your own Node.js code. No OpenClaw required.
+- **As an OpenClaw plugin** — replaces the default context engine. Requires a running OpenClaw gateway.
+### Library usage (no OpenClaw required)
+```bash
+npm install @psiclawops/hypermem
+```
+```typescript
+import { HyperMem } from '@psiclawops/hypermem';
+import { join } from 'node:path';
+import { homedir } from 'node:os';
+const hm = await HyperMem.create({
+  dataDir: join(homedir(), '.openclaw', 'hypermem'),
+  embedding: { provider: 'none' },
+});
+await hm.recordUserMessage('my-agent', 'session-1', 'Hello');
+const composed = await hm.compose({
+  agentId: 'my-agent',
+  sessionKey: 'session-1',
+  prompt: 'Hello',
+  tokenBudget: 4000,
+  provider: 'anthropic',
+});
+```
+That's it. No gateway, no plugins, no config files. See [API](#api) for the full interface.
+### OpenClaw plugin install (from source)
 ```bash
 git clone https://github.com/PsiClawOps/hypermem.git
@@ -530,9 +562,11 @@ Full reference: **[docs/TUNING.md](./docs/TUNING.md)**
 ```typescript
 import { HyperMem } from '@psiclawops/hypermem';
+import { join } from 'node:path';
+import { homedir } from 'node:os';
 const hm = await HyperMem.create({
-  dataDir: '~/.openclaw/hypermem',
+  dataDir: join(homedir(), '.openclaw', 'hypermem'),
   cache: { maxEntries: 10000 },
   // Local (Ollama):
   embedding: { ollamaUrl: 'http://localhost:11434', model: 'nomic-embed-text' },
@@ -581,6 +615,14 @@ node bin/hypermem-status.mjs --json          # machine-readable output
 node bin/hypermem-status.mjs --health        # health checks only (exit 1 on failure)
 ```
+By default, `hypermem-status` looks for data in `~/.openclaw/hypermem`. If your data directory is elsewhere (e.g. testing in an isolated environment), set:
+```bash
+HYPERMEM_DATA_DIR=/path/to/data node bin/hypermem-status.mjs --health
+```
+> **Fresh install note:** If no agent has run a session yet, `--health` will report "no sessions ingested" rather than a database error. This is expected. Send a test message to any agent, then re-run the health check.
 ---
 ## Pressure management
@@ -612,6 +654,22 @@ hypermem composes context fresh on every turn, but a long-running session still
 ---
+## Common issues
+| Symptom | Cause | Fix |
+|---|---|---|
+| `falling back to default engine "legacy"` in logs | Plugin not loaded or slot misconfigured | Check `openclaw config get plugins.slots.contextEngine` is `hypercompositor`, paths are correct, and both plugins are in `plugins.allow` |
+| `openclaw gateway restart` says disabled/not configured | OpenClaw not fully onboarded | Complete OpenClaw setup first. `gateway restart` requires a running gateway. |
+| `openclaw logs` fails with auth/token error | Gateway auth not set up for CLI | Run `openclaw gateway status` to confirm the gateway is accessible |
+| `facts=0 semantic=0` every turn | Fresh install, no data yet | Expected. Facts accumulate over real conversations. |
+| Health check says "no sessions ingested" | No agent has run a session yet | Send a test message, then re-run |
+| JS code creates `./~/.openclaw/` directory | Used literal `~` in JS instead of `homedir()` | Use `join(homedir(), '.openclaw', 'hypermem')` from `node:path` and `node:os` |
+| `INSTALL.md` not found in npm package | Older published version | Update to latest or read INSTALL.md on [GitHub](https://github.com/PsiClawOps/hypermem/blob/main/INSTALL.md) |
+Full troubleshooting: **[INSTALL.md § Troubleshooting](./INSTALL.md#troubleshooting)**
+---
 ## Migration
 hypermem doesn't touch your existing memory data. Install it, switch the context engine, and migrate historical data on your own timeline.

package/bin/hypermem-status.mjs CHANGED Viewed

@@ -104,6 +104,16 @@ if (flags.agent) {
 }
 if (!mainDbPath || !existsSync(mainDbPath)) {
+  if (args.includes('--health') || args.includes('--json')) {
+    const result = { status: 'no_sessions', message: 'Installed but no agent sessions ingested yet. Send a message to any agent, then re-run.' };
+    if (args.includes('--json')) {
+      console.log(JSON.stringify(result, null, 2));
+    } else {
+      console.log('Status: installed, no sessions ingested yet.');
+      console.log('Send a message to any agent, then re-run this health check.');
+    }
+    process.exit(0);
+  }
   console.error('Error: no agent messages.db found. Has HyperMem ingested any sessions?');
   process.exit(1);
 }

package/dist/version.d.ts CHANGED Viewed

@@ -1,5 +1,5 @@
 /** Release version — matches package.json and is stamped into library.db on every startup. */
-export declare const ENGINE_VERSION = "0.8.1";
+export declare const ENGINE_VERSION = "0.8.2";
 /** Minimum Node.js version required — matches package.json engines field. */
 export declare const MIN_NODE_VERSION = "22.0.0";
 /** @deprecated No longer used — Redis was replaced with SQLite :memory: CacheLayer. */
@@ -19,15 +19,15 @@ export declare const LIBRARY_SCHEMA_VERSION_EXPORT = 19;
 /**
  * Compatibility version — the single number operators and consumers check.
  * Maps to: messages.db schema v10, library schema v19.
- * Matches ENGINE_VERSION for the 0.8.1 release.
+ * Matches ENGINE_VERSION for the 0.8.2 release.
  */
-export declare const HYPERMEM_COMPAT_VERSION = "0.8.1";
+export declare const HYPERMEM_COMPAT_VERSION = "0.8.2";
 /**
  * Schema compatibility map — machine-readable version requirements.
  * Use this to verify DB schemas match the running engine.
  */
 export declare const SCHEMA_COMPAT: {
-    readonly compatVersion: "0.8.1";
+    readonly compatVersion: "0.8.2";
     readonly mainSchema: 10;
     readonly librarySchema: 19;
 };

package/dist/version.js CHANGED Viewed

@@ -1,5 +1,5 @@
 /** Release version — matches package.json and is stamped into library.db on every startup. */
-export const ENGINE_VERSION = '0.8.1';
+export const ENGINE_VERSION = '0.8.2';
 /** Minimum Node.js version required — matches package.json engines field. */
 export const MIN_NODE_VERSION = '22.0.0';
 /** @deprecated No longer used — Redis was replaced with SQLite :memory: CacheLayer. */
@@ -19,15 +19,15 @@ export const LIBRARY_SCHEMA_VERSION_EXPORT = 19;
 /**
  * Compatibility version — the single number operators and consumers check.
  * Maps to: messages.db schema v10, library schema v19.
- * Matches ENGINE_VERSION for the 0.8.1 release.
+ * Matches ENGINE_VERSION for the 0.8.2 release.
  */
-export const HYPERMEM_COMPAT_VERSION = '0.8.1';
+export const HYPERMEM_COMPAT_VERSION = '0.8.2';
 /**
  * Schema compatibility map — machine-readable version requirements.
  * Use this to verify DB schemas match the running engine.
  */
 export const SCHEMA_COMPAT = {
-    compatVersion: '0.8.1',
+    compatVersion: '0.8.2',
     mainSchema: 10,
     librarySchema: 19,
 };

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@psiclawops/hypermem",
-  "version": "0.8.1",
+  "version": "0.8.2",
   "description": "Agent-centric memory and context composition engine for OpenClaw",
   "type": "module",
   "main": "dist/index.js",
@@ -48,6 +48,8 @@
     "dist/**/*.d.ts",
     "dist/**/*.d.ts.map",
     "README.md",
+    "INSTALL.md",
+    "CHANGELOG.md",
     "ARCHITECTURE.md",
     "LICENSE"
   ],