npm - @simbimbo/memory-ocmemog - Versions diffs - 0.1.7 → 0.1.8 - Mend

@simbimbo/memory-ocmemog 0.1.7 → 0.1.8

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (5) hide show

package/CHANGELOG.md +9 -0
package/README.md +3 -0
package/docs/architecture/local-runtime-2026-03-19.md +33 -0
package/ocmemog/sidecar/app.py +1 -1
package/package.json +1 -1

package/CHANGELOG.md CHANGED Viewed

@@ -1,5 +1,14 @@
 # Changelog
+## 0.1.8 — 2026-03-19
+Documentation and release follow-through after the llama.cpp migration and repo grooming pass.
+### Highlights
+- documented the stable local runtime architecture (gateway/sidecar/text/embed split)
+- published the repo in a llama.cpp-first state with fixed ports and cleaned installers/scripts
+- kept compatibility hooks only where still useful instead of leaving Ollama as the implied primary path
 ## 0.1.7 — 2026-03-19
 llama.cpp-first cleanup after the 0.1.6 runtime cutover.

package/README.md CHANGED Viewed

@@ -14,6 +14,9 @@ Architecture at a glance:
 - **FastAPI sidecar (`ocmemog/sidecar/`)** exposes memory and continuity APIs
 - **SQLite-backed runtime (`brain/runtime/memory/`)** powers storage, hydration, checkpoints, salience ranking, and pondering
+Current local runtime architecture note:
+- `docs/architecture/local-runtime-2026-03-19.md`
 ## Repo layout
 - `openclaw.plugin.json`, `index.ts`, `package.json`: OpenClaw plugin package and manifest.

package/docs/architecture/local-runtime-2026-03-19.md ADDED Viewed

@@ -0,0 +1,33 @@
+# Local Runtime Architecture — 2026-03-19
+This repo is now documented and operated with a **llama.cpp-first** local runtime architecture.
+## Stable loopback-only service split
+- OpenClaw gateway/dashboard: `127.0.0.1:17890`
+- ocmemog sidecar/dashboard: `127.0.0.1:17891`
+- llama.cpp text inference: `127.0.0.1:18080`
+- llama.cpp embeddings: `127.0.0.1:18081`
+## Active local models
+- Text: `Qwen2.5-7B-Instruct-Q4_K_M.gguf`
+- Embeddings: `nomic-embed-text-v1.5.Q4_K_M.gguf`
+## Configuration direction
+Primary local envs:
+- `OCMEMOG_LOCAL_LLM_BASE_URL=http://127.0.0.1:18080/v1`
+- `OCMEMOG_LOCAL_LLM_MODEL=qwen2.5-7b-instruct`
+- `OCMEMOG_LOCAL_EMBED_BASE_URL=http://127.0.0.1:18081/v1`
+- `OCMEMOG_LOCAL_EMBED_MODEL=nomic-embed-text-v1.5`
+Legacy Ollama knobs may remain in code for compatibility/rollback, but they are **not the primary runtime path**.
+## Operational notes
+- The sidecar should remain loopback-only by default.
+- The old plain dashboard lives at `http://127.0.0.1:17891/dashboard`.
+- Memory search and pondering should target the sidecar, not the OpenClaw gateway port.
+- Avoid reusing `17890` for the sidecar; that previously caused a routing collision with the OpenClaw dashboard/gateway.

package/ocmemog/sidecar/app.py CHANGED Viewed

@@ -19,7 +19,7 @@ from ocmemog.sidecar.transcript_watcher import watch_forever
 DEFAULT_CATEGORIES = ("knowledge", "reflections", "directives", "tasks", "runbooks", "lessons")
-app = FastAPI(title="ocmemog sidecar", version="0.1.7")
+app = FastAPI(title="ocmemog sidecar", version="0.1.8")
 API_TOKEN = os.environ.get("OCMEMOG_API_TOKEN")

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@simbimbo/memory-ocmemog",
-  "version": "0.1.7",
+  "version": "0.1.8",
   "description": "Advanced OpenClaw memory plugin with durable recall, transcript-backed continuity, and sidecar APIs",
   "license": "MIT",
   "repository": {