npm - loki-mode - Versions diffs - 7.7.22 → 7.7.25 - Mend

loki-mode 7.7.22 → 7.7.25

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (18) hide show

package/README.md +7 -5
package/SKILL.md +4 -4
package/VERSION +1 -1
package/autonomy/loki +24 -0
package/dashboard/__init__.py +1 -1
package/dashboard/server.py +51 -4
package/docs/INSTALLATION.md +2 -2
package/loki-ts/dist/loki.js +2 -2
package/mcp/__init__.py +1 -1
package/mcp/server.py +6 -5
package/memory/ingest.py +30 -9
package/memory/knowledge_graph.py +40 -16
package/package.json +3 -2
package/tools/bench_cross_project_lift.py +218 -0
package/tools/bench_memory_retrieval.py +157 -0
package/tools/index-codebase.py +474 -0
package/tools/probe-model-catalog.py +159 -0
package/tools/regen-state-machine-refs.py +188 -0

package/README.md CHANGED Viewed

@@ -24,15 +24,15 @@
 ## Why Loki Mode?
-- **Truly autonomous** -- Describe what you want, walk away, come back to working code with tests
+- **Spec to product, autonomously** -- Describe what you want, walk away, come back to working code with tests. Loki runs the full RARV-C closure loop (Reason - Act - Reflect - Verify - Close) until the work is actually done, not just attempted.
 - **Production quality built in** -- 11 quality gates (`skills/quality-gates.md`), blind 3-reviewer code review (`run.sh:run_code_review()`), anti-sycophancy checks
+- **Cross-project memory** -- Episodic/semantic/procedural memory with vector search; knowledge learned on one project surfaces on the next (v5.15.0+, see `memory/engine.py`)
 - **Self-hosted and private** -- Your keys, your infrastructure, no data leaves your network
-- **4 active AI providers** -- Claude, Codex, Cline, Aider with automatic failover (`loki-ts/src/runner/providers.ts`). Gemini CLI deprecated v7.5.18; Antigravity CLI coming soon.
 - **Legacy system healing** -- `loki heal` archaeology/stabilize/isolate/modernize/validate phases (v6.67.0, see `skills/healing.md`)
-- **Memory system** -- Episodic/semantic/procedural with vector search (v5.15.0, see `memory/engine.py`)
 - **MCP server** -- 15 tools including ChromaDB code search (`mcp/server.py`)
 - **Full-stack output** -- Source code, tests, Docker configs, CI/CD pipelines, audit logs
-- **Open source** -- Free for personal, internal, and academic use. No vendor lock-in.
+- **Provider-agnostic** -- runs on Claude, Codex, Cline, or Aider with automatic failover (`loki-ts/src/runner/providers.ts`); no vendor lock-in. Gemini CLI deprecated v7.5.18; Antigravity CLI coming soon.
+- **Open source** -- Free for personal, internal, and academic use.
 ---
@@ -302,7 +302,9 @@ Loki Mode is the only platform that is fully self-hosted, open source, and inclu
 ---
-## Multi-Provider Support
+## Provider-Agnostic Runtime
+Loki's autonomy and quality loop are the product; the underlying coding CLI is swappable. Loki runs on any of the providers below so you are never locked to one vendor.
 | Provider | Status | Autonomous Flag | Parallel Agents | Install |
 |----------|--------|:-:|:-:|---------|

package/SKILL.md CHANGED Viewed

@@ -1,15 +1,15 @@
 ---
 name: loki-mode
-description: Multi-agent autonomous startup system. Triggers on "Loki Mode". Takes a spec (PRD, GitHub issue, OpenAPI doc, etc.) to deployed product with minimal human intervention. Requires --dangerously-skip-permissions flag.
+description: Autonomous spec-to-product system. Triggers on "Loki Mode". Takes a spec (PRD, GitHub issue, OpenAPI doc, etc.) to deployed product via the RARV-C closure loop, with minimal human intervention. Provider-agnostic. Requires --dangerously-skip-permissions flag.
 ---
-# Loki Mode v7.7.22
+# Loki Mode v7.7.25
 **You are an autonomous agent. You make decisions. You do not ask questions. You do not stop.**
 **Spec in, product out.** A "spec" is whatever describes the work: a Markdown PRD, a GitHub issue, an OpenAPI doc, a Jira ticket -- a PRD is one form of spec.
-**Multi-provider (stable since v5.0.0):** Claude/Codex/Cline/Aider with abstract model tiers and degraded mode for non-Claude providers. Gemini deprecated v7.5.18. See `skills/providers.md`. **Current track (v7.7.x):** LSP grounding as first-class agent tool (v7.7.0-v7.7.9; lsp_get_diagnostics actually-returns-diagnostics regression fix v7.7.14), provider_source cli (v7.7.11-v7.7.12 bash/bun parity), Docker/bash-3.2 robustness (v7.7.13), audit chain cross-file verification fix (v7.7.15), Phase 1 RARV-C closure (real provider judges, gate-failure flock, synthetic PRD e2e, status `--json`).
+**Provider-agnostic (stable since v5.0.0):** runs on Claude/Codex/Cline/Aider with abstract model tiers and degraded mode for non-Claude providers; no vendor lock-in. Gemini deprecated v7.5.18. See `skills/providers.md`. **Current track (v7.7.x):** LSP grounding as first-class agent tool (v7.7.0-v7.7.9; lsp_get_diagnostics actually-returns-diagnostics regression fix v7.7.14), provider_source cli (v7.7.11-v7.7.12 bash/bun parity), Docker/bash-3.2 robustness (v7.7.13), audit chain cross-file verification fix (v7.7.15), Phase 1 RARV-C closure (real provider judges, gate-failure flock, synthetic PRD e2e, status `--json`).
 **Runtime migration:** Bash-to-Bun migration. Read-only commands (`version`, `status`, `stats`, `doctor`, `provider show/list`, `memory list/index`) flow through Bun runtime via `bin/loki` since v7.3.0. Every other command remains on the Bash runtime (`autonomy/loki`). Rollback: `LOKI_LEGACY_BASH=1`. See `UPGRADING.md` and `docs/architecture/ADR-001-runtime-migration.md`.
@@ -381,4 +381,4 @@ See `CHANGELOG.md` entries [7.5.7], [7.5.8], [7.5.13] for the per-fix list and r
 ---
-**v7.7.22 | [Autonomi](https://www.autonomi.dev/) flagship product | ~260 lines core**
+**v7.7.25 | [Autonomi](https://www.autonomi.dev/) flagship product | ~260 lines core**

package/VERSION CHANGED Viewed

	@@ -1 +1 @@
1	- 7.7.22
1	+ 7.7.25

package/autonomy/loki CHANGED Viewed

@@ -299,6 +299,30 @@ load_memory_context() {
         return 0
     fi
+    # v7.7.23 privacy opt-out (excellence bar 6): honor a per-project
+    # .loki/config.json {"memory": {"disabled": true}} flag. Lets a user
+    # disable ALL memory capture/retrieval for a sensitive project without
+    # setting an env var on every invocation.
+    if [ -f "$LOKI_DIR/config.json" ]; then
+        # v7.7.23 council fix (Opus 2): FAIL-CLOSED. A config.json that
+        # exists but cannot be parsed prints 'true' (suppress memory) so
+        # a JSON typo on a sensitive project does not silently re-enable
+        # retrieval. Only the no-config case (outer -f guard false)
+        # proceeds with memory on.
+        local _mem_disabled
+        _mem_disabled=$(python3 -c "
+import json, sys
+try:
+    d = json.load(open('$LOKI_DIR/config.json'))
+    print('true' if d.get('memory', {}).get('disabled') is True else 'false')
+except Exception:
+    print('true')  # malformed config -> fail closed (suppress memory)
+" 2>/dev/null || echo "true")
+        if [ "$_mem_disabled" = "true" ]; then
+            return 0
+        fi
+    fi
     # Check if python3 is available (required for memory system)
     if ! command -v python3 &> /dev/null; then
         echo -e "${YELLOW}Warning: python3 not found - memory context loading disabled${NC}" >&2

package/dashboard/__init__.py CHANGED Viewed

@@ -7,7 +7,7 @@ Modules:
     control: Session control API (start/stop/pause/resume)
 """
-__version__ = "7.7.22"
+__version__ = "7.7.25"
 # Expose the control app for easy import
 try:

package/dashboard/server.py CHANGED Viewed

@@ -2751,14 +2751,61 @@ async def get_token_economics():
 @app.post("/api/memory/consolidate", dependencies=[Depends(auth.require_scope("control"))])
 async def consolidate_memory(hours: int = 24):
-    """Trigger memory consolidation (stub - returns current state)."""
-    return {"status": "ok", "message": f"Consolidation for last {hours}h", "consolidated": 0, "patternsCreated": 0, "patternsMerged": 0, "episodesProcessed": 0}
+    """Run the real episodic-to-semantic consolidation pipeline."""
+    memory_dir = _get_loki_dir() / "memory"
+    try:
+        import sys as _sys
+        project_root = str(_Path(__file__).resolve().parent.parent)
+        if project_root not in _sys.path:
+            _sys.path.insert(0, project_root)
+        from memory.storage import MemoryStorage
+        from memory.consolidation import ConsolidationPipeline
+        storage = MemoryStorage(str(memory_dir))
+        pipeline = ConsolidationPipeline(storage=storage, base_path=str(memory_dir))
+        result = pipeline.consolidate(since_hours=hours)
+        d = result.to_dict()
+        return {
+            "status": "ok",
+            "message": f"Consolidated episodes from the last {hours}h",
+            "consolidated": d.get("patterns_created", 0) + d.get("patterns_merged", 0),
+            "patternsCreated": d.get("patterns_created", 0),
+            "patternsMerged": d.get("patterns_merged", 0),
+            "antiPatternsCreated": d.get("anti_patterns_created", 0),
+            "episodesProcessed": d.get("episodes_processed", 0),
+            "durationSeconds": round(d.get("duration_seconds", 0.0), 3),
+        }
+    except Exception as e:
+        raise HTTPException(status_code=503, detail=f"Consolidation unavailable: {e}")
 @app.post("/api/memory/retrieve", dependencies=[Depends(auth.require_scope("control"))])
 async def retrieve_memory(query: dict = None):
-    """Search memories by query."""
-    return {"results": [], "query": query}
+    """Task-aware retrieval against the real memory engine.
+    Body: {"goal": str, "phase"?: str, "task_type"?: str, "top_k"?: int}.
+    """
+    query = query or {}
+    goal = (query.get("goal") or query.get("q") or "").strip()
+    if not goal:
+        return {"results": [], "query": query, "message": "provide a 'goal' to retrieve against"}
+    top_k = int(query.get("top_k", 5))
+    top_k = max(1, min(top_k, 50))
+    memory_dir = _get_loki_dir() / "memory"
+    try:
+        import sys as _sys
+        project_root = str(_Path(__file__).resolve().parent.parent)
+        if project_root not in _sys.path:
+            _sys.path.insert(0, project_root)
+        from memory.storage import MemoryStorage
+        from memory.retrieval import MemoryRetrieval
+        retriever = MemoryRetrieval(MemoryStorage(str(memory_dir)))
+        context = {"goal": goal, "phase": query.get("phase", "development")}
+        if query.get("task_type"):
+            context["task_type"] = query["task_type"]
+        results = retriever.retrieve_task_aware(context, top_k=top_k, token_budget=query.get("token_budget"))
+        return {"results": results, "query": {"goal": goal, "top_k": top_k}, "count": len(results)}
+    except Exception as e:
+        raise HTTPException(status_code=503, detail=f"Retrieval unavailable: {e}")
 @app.get("/api/memory/index")

package/docs/INSTALLATION.md CHANGED Viewed

@@ -2,7 +2,7 @@
 The flagship product of [Autonomi](https://www.autonomi.dev/). Complete installation instructions for all platforms and use cases.
-**Version:** v7.7.22
+**Version:** v7.7.25
 ---
@@ -32,7 +32,7 @@ setting any flag to `0`.
 ### Earlier highlights still in scope
 - Bash-to-Bun runtime migration in progress (see `UPGRADING.md`)
-- 4-provider support: Claude (full), Codex, Cline, Aider
+- Provider-agnostic runtime: Claude (full), Codex, Cline, Aider (no vendor lock-in)
 - Memory system (episodic / semantic / procedural)
 - ChromaDB semantic code search via MCP

package/loki-ts/dist/loki.js CHANGED Viewed

@@ -1,5 +1,5 @@
 // @bun
-var _7=Object.defineProperty;var I7=(K)=>K;function P7(K,$){this[K]=I7.bind(null,$)}var v=(K,$)=>{for(var Q in $)_7(K,Q,{get:$[Q],enumerable:!0,configurable:!0,set:P7.bind($,Q)})};var R=(K,$)=>()=>(K&&($=K(K=0)),$);var t=import.meta.require;var e1={};v(e1,{lokiDir:()=>P,homeLokiDir:()=>k1,findRepoRootForVersion:()=>N1,REPO_ROOT:()=>p});import{resolve as u,dirname as S1}from"path";import{fileURLToPath as L7}from"url";import{existsSync as J1}from"fs";import{homedir as R7}from"os";function E7(){let K=i1;for(let $=0;$<6;$++){if(J1(u(K,"VERSION"))&&J1(u(K,"autonomy/run.sh")))return K;let Q=S1(K);if(Q===K)break;K=Q}return u(i1,"..","..","..")}function N1(K){let $=K;for(let Q=0;Q<6;Q++){if(J1(u($,"VERSION"))&&J1(u($,"autonomy/run.sh")))return $;let X=S1($);if(X===$)break;$=X}return u(K,"..","..","..")}function P(){return process.env.LOKI_DIR??u(process.cwd(),".loki")}function k1(){return u(R7(),".loki")}var i1,p;var g=R(()=>{i1=S1(L7(import.meta.url));p=E7()});import{readFileSync as F7}from"fs";import{resolve as w7,dirname as x7}from"path";import{fileURLToPath as S7}from"url";function G1(){if(o!==null)return o;let K="7.7.22";if(typeof K==="string"&&K.length>0)return o=K,o;try{let $=x7(S7(import.meta.url)),Q=N1($);o=F7(w7(Q,"VERSION"),"utf-8").trim()}catch{o="unknown"}return o}var o=null;var D1=R(()=>{g()});var $0={};v($0,{runOrThrow:()=>N7,run:()=>k,commandVersion:()=>D7,commandExists:()=>h,ShellError:()=>C1});async function k(K,$={}){let Q=Bun.spawn({cmd:[...K],stdout:"pipe",stderr:"pipe",env:$.env?{...process.env,...$.env}:process.env,cwd:$.cwd}),X,Z;if($.timeoutMs&&$.timeoutMs>0)X=setTimeout(()=>{try{Q.kill("SIGTERM")}catch{}Z=setTimeout(()=>{try{Q.kill("SIGKILL")}catch{}},2000)},$.timeoutMs);try{let[W,z,q]=await Promise.all([new Response(Q.stdout).text(),new Response(Q.stderr).text(),Q.exited]);return{stdout:W,stderr:z,exitCode:q}}finally{if(X)clearTimeout(X);if(Z)clearTimeout(Z)}}async function N7(K,$={}){let Q=await k(K,$);if(Q.exitCode!==0)throw new C1(`command failed (${Q.exitCode}): ${K.join(" ")}`,Q.exitCode,Q.stdout,Q.stderr);return Q}async function h(K){let $=k7(K),Q=await k(["sh","-c",`command -v ${$}`],{timeoutMs:5000});if(Q.exitCode===0)return Q.stdout.trim()||null;return null}function k7(K){if(!/^[A-Za-z0-9._/-]+$/.test(K))throw Error(`refused to shell-escape suspect token: ${K}`);return K}async function D7(K,$="--version"){if(!await h(K))return null;let X=await k([K,$],{timeoutMs:5000});if(X.exitCode!==0)return null;return((X.stdout||X.stderr).split(/\r?\n/)[0]?.trim()??"")||null}var C1;var n=R(()=>{C1=class C1 extends Error{message;exitCode;stdout;stderr;constructor(K,$,Q,X){super(K);this.message=K;this.exitCode=$;this.stdout=Q;this.stderr=X;this.name="ShellError"}}});function c(K){return C7?"":K}var C7,E,b,F,T6,O,D,w,H;var a=R(()=>{C7=(process.env.NO_COLOR??"").length>0;E=c("\x1B[0;31m"),b=c("\x1B[0;32m"),F=c("\x1B[1;33m"),T6=c("\x1B[0;34m"),O=c("\x1B[0;36m"),D=c("\x1B[1m"),w=c("\x1B[2m"),H=c("\x1B[0m")});import{existsSync as c7}from"fs";async function i(){if(X1!==void 0)return X1;let K="/opt/homebrew/bin/python3.12";if(c7(K))return X1=K,K;let $=await h("python3.12");if($)return X1=$,$;let Q=await h("python3");return X1=Q,Q}async function s(K,$={}){let Q=await i();if(!Q)return{stdout:"",stderr:"python3 not found",exitCode:127};return k([Q,"-c",K],$)}var X1;var Z1=R(()=>{n()});var G0={};v(G0,{runStatus:()=>Q5});import{existsSync as N,readFileSync as W1,readdirSync as W0,statSync as H0}from"fs";import{resolve as x,basename as a7}from"path";async function r7(){if(await h("jq"))return!0;return process.stdout.write(`${E}Error: jq is required but not installed.${H}
+var _7=Object.defineProperty;var I7=(K)=>K;function P7(K,$){this[K]=I7.bind(null,$)}var v=(K,$)=>{for(var Q in $)_7(K,Q,{get:$[Q],enumerable:!0,configurable:!0,set:P7.bind($,Q)})};var R=(K,$)=>()=>(K&&($=K(K=0)),$);var t=import.meta.require;var e1={};v(e1,{lokiDir:()=>P,homeLokiDir:()=>k1,findRepoRootForVersion:()=>N1,REPO_ROOT:()=>p});import{resolve as u,dirname as S1}from"path";import{fileURLToPath as L7}from"url";import{existsSync as J1}from"fs";import{homedir as R7}from"os";function E7(){let K=i1;for(let $=0;$<6;$++){if(J1(u(K,"VERSION"))&&J1(u(K,"autonomy/run.sh")))return K;let Q=S1(K);if(Q===K)break;K=Q}return u(i1,"..","..","..")}function N1(K){let $=K;for(let Q=0;Q<6;Q++){if(J1(u($,"VERSION"))&&J1(u($,"autonomy/run.sh")))return $;let X=S1($);if(X===$)break;$=X}return u(K,"..","..","..")}function P(){return process.env.LOKI_DIR??u(process.cwd(),".loki")}function k1(){return u(R7(),".loki")}var i1,p;var g=R(()=>{i1=S1(L7(import.meta.url));p=E7()});import{readFileSync as F7}from"fs";import{resolve as w7,dirname as x7}from"path";import{fileURLToPath as S7}from"url";function G1(){if(o!==null)return o;let K="7.7.25";if(typeof K==="string"&&K.length>0)return o=K,o;try{let $=x7(S7(import.meta.url)),Q=N1($);o=F7(w7(Q,"VERSION"),"utf-8").trim()}catch{o="unknown"}return o}var o=null;var D1=R(()=>{g()});var $0={};v($0,{runOrThrow:()=>N7,run:()=>k,commandVersion:()=>D7,commandExists:()=>h,ShellError:()=>C1});async function k(K,$={}){let Q=Bun.spawn({cmd:[...K],stdout:"pipe",stderr:"pipe",env:$.env?{...process.env,...$.env}:process.env,cwd:$.cwd}),X,Z;if($.timeoutMs&&$.timeoutMs>0)X=setTimeout(()=>{try{Q.kill("SIGTERM")}catch{}Z=setTimeout(()=>{try{Q.kill("SIGKILL")}catch{}},2000)},$.timeoutMs);try{let[W,z,q]=await Promise.all([new Response(Q.stdout).text(),new Response(Q.stderr).text(),Q.exited]);return{stdout:W,stderr:z,exitCode:q}}finally{if(X)clearTimeout(X);if(Z)clearTimeout(Z)}}async function N7(K,$={}){let Q=await k(K,$);if(Q.exitCode!==0)throw new C1(`command failed (${Q.exitCode}): ${K.join(" ")}`,Q.exitCode,Q.stdout,Q.stderr);return Q}async function h(K){let $=k7(K),Q=await k(["sh","-c",`command -v ${$}`],{timeoutMs:5000});if(Q.exitCode===0)return Q.stdout.trim()||null;return null}function k7(K){if(!/^[A-Za-z0-9._/-]+$/.test(K))throw Error(`refused to shell-escape suspect token: ${K}`);return K}async function D7(K,$="--version"){if(!await h(K))return null;let X=await k([K,$],{timeoutMs:5000});if(X.exitCode!==0)return null;return((X.stdout||X.stderr).split(/\r?\n/)[0]?.trim()??"")||null}var C1;var n=R(()=>{C1=class C1 extends Error{message;exitCode;stdout;stderr;constructor(K,$,Q,X){super(K);this.message=K;this.exitCode=$;this.stdout=Q;this.stderr=X;this.name="ShellError"}}});function c(K){return C7?"":K}var C7,E,b,F,T6,O,D,w,H;var a=R(()=>{C7=(process.env.NO_COLOR??"").length>0;E=c("\x1B[0;31m"),b=c("\x1B[0;32m"),F=c("\x1B[1;33m"),T6=c("\x1B[0;34m"),O=c("\x1B[0;36m"),D=c("\x1B[1m"),w=c("\x1B[2m"),H=c("\x1B[0m")});import{existsSync as c7}from"fs";async function i(){if(X1!==void 0)return X1;let K="/opt/homebrew/bin/python3.12";if(c7(K))return X1=K,K;let $=await h("python3.12");if($)return X1=$,$;let Q=await h("python3");return X1=Q,Q}async function s(K,$={}){let Q=await i();if(!Q)return{stdout:"",stderr:"python3 not found",exitCode:127};return k([Q,"-c",K],$)}var X1;var Z1=R(()=>{n()});var G0={};v(G0,{runStatus:()=>Q5});import{existsSync as N,readFileSync as W1,readdirSync as W0,statSync as H0}from"fs";import{resolve as x,basename as a7}from"path";async function r7(){if(await h("jq"))return!0;return process.stdout.write(`${E}Error: jq is required but not installed.${H}
 `),process.stdout.write(`Install with:
 `),process.stdout.write(`  brew install jq    (macOS)
 `),process.stdout.write(`  apt install jq     (Debian/Ubuntu)
@@ -585,4 +585,4 @@ Set LOKI_LEGACY_BASH=1 to force the bash CLI for every command.
 `),2}default:return process.stderr.write(`Unknown command: ${$}
 `),process.stderr.write(j7),2}}process.on("SIGINT",()=>process.exit(130));process.on("SIGTERM",()=>process.exit(143));var X6=await Q6(Bun.argv.slice(2));process.exit(X6);
-//# debugId=67ACBDA1E9391E4564756E2164756E21
+//# debugId=3DA3905BB400BADE64756E2164756E21

package/mcp/__init__.py CHANGED Viewed

@@ -57,4 +57,4 @@ try:
 except ImportError:
     __all__ = ['mcp']
-__version__ = '7.7.22'
+__version__ = '7.7.25'

package/mcp/server.py CHANGED Viewed

@@ -1010,17 +1010,18 @@ async def loki_memory_capture_session_summary(
     try:
         from memory.ingest import ingest_from_summary, _capture_disabled
-        if _capture_disabled():
+        base_path = safe_path_join('.loki', 'memory')
+        # v7.7.23: pass base_path so the .loki/config.json memory.disabled
+        # opt-out is honored in addition to the env escape hatch.
+        if _capture_disabled(base_path):
             _emit_tool_event_async(
                 'loki_memory_capture_session_summary', 'complete',
-                result_status='skipped', error='disabled via env'
+                result_status='skipped', error='disabled via env or config'
             )
             return json.dumps({
-                "error": "memory capture disabled via LOKI_MEMORY_CAPTURE_DISABLED",
+                "error": "memory capture disabled (LOKI_MEMORY_CAPTURE_DISABLED or .loki/config.json memory.disabled)",
                 "disabled": True,
             })
-        base_path = safe_path_join('.loki', 'memory')
         path = ingest_from_summary(
             base_path,
             goal=goal,

package/memory/ingest.py CHANGED Viewed

@@ -125,13 +125,34 @@ def _scrub_path(path: str) -> str:
     return path
-def _capture_disabled() -> bool:
-    """Honor `LOKI_MEMORY_CAPTURE_DISABLED=true` escape hatch."""
-    return os.environ.get("LOKI_MEMORY_CAPTURE_DISABLED", "").lower() in (
-        "true",
-        "1",
-        "yes",
-    )
+def _capture_disabled(memory_base: Optional[str] = None) -> bool:
+    """True when capture should be skipped.
+    Honors the `LOKI_MEMORY_CAPTURE_DISABLED=true` env escape hatch AND
+    (v7.7.23 privacy opt-out, excellence bar 6) a per-project
+    `.loki/config.json` `{"memory": {"disabled": true}}` flag. When
+    `memory_base` is provided (e.g. `<root>/.loki/memory`), the config
+    is resolved as its sibling `<root>/.loki/config.json`.
+    """
+    if os.environ.get("LOKI_MEMORY_CAPTURE_DISABLED", "").lower() in ("true", "1", "yes"):
+        return True
+    if memory_base:
+        # v7.7.23 council fix (Opus 2): FAIL-CLOSED. If a config.json
+        # EXISTS but cannot be parsed, the user's intent is ambiguous and
+        # the safe privacy default is to SUPPRESS capture (leaked data is
+        # irreversible; lost memory is recoverable). Only the no-config
+        # case fails open (capture proceeds = default behavior).
+        config_path = Path(memory_base).parent / "config.json"
+        if config_path.is_file():
+            try:
+                with open(config_path) as f:
+                    cfg = json.load(f)
+            except Exception:
+                # Config present but unreadable/malformed -> fail closed.
+                return True
+            if isinstance(cfg, dict) and cfg.get("memory", {}).get("disabled") is True:
+                return True
+    return False
 def _log_to_errors(memory_base: str, function_name: str, exc: BaseException) -> None:
@@ -301,7 +322,7 @@ def ingest_from_claude_transcript(
         Path to the written episode JSON on success; None on failure
         (silent fail, error logged to `.errors.log`).
     """
-    if _capture_disabled():
+    if _capture_disabled(memory_base):
         return None
     try:
         path = Path(transcript_path)
@@ -409,7 +430,7 @@ def ingest_from_summary(
     Returns episode path on success, None on failure.
     """
-    if _capture_disabled():
+    if _capture_disabled(memory_base):
         return None
     try:
         from memory.engine import MemoryEngine, create_storage

package/memory/knowledge_graph.py CHANGED Viewed

@@ -167,30 +167,54 @@ class OrganizationKnowledgeGraph:
             self._graph = json.load(f)
         return self._graph
+    _STOPWORDS = {
+        'the', 'a', 'an', 'to', 'for', 'of', 'and', 'or', 'with', 'without',
+        'is', 'are', 'be', 'up', 'on', 'in', 'by', 'not', 'make', 'choose',
+        'store', 'how', 'do', 'i', 'we', 'my', 'our', 'this', 'that',
+    }
+    @classmethod
+    def _tokenize(cls, text):
+        """Lowercase, split on non-alphanumerics, drop stopwords + 1-2 char tokens."""
+        import re
+        toks = re.split(r'[^a-z0-9]+', str(text or '').lower())
+        return {t for t in toks if len(t) > 2 and t not in cls._STOPWORDS}
     def query_patterns(self, query, max_results=10):
-        """Simple keyword search across stored patterns.
+        """Keyword search across stored patterns.
+        Scores by token overlap between the query and each pattern's
+        name/pattern/description/category fields. Token overlap (not
+        whole-string substring) lets natural-language goals like "make
+        the charge endpoint safe to retry" match a pattern named
+        "idempotency-key-on-charge". A full-string substring hit still
+        gets a bonus, preserving prior behavior for exact queries.
-        Searches across 'name', 'pattern', 'description', and 'category'
-        fields for compatibility with both simple dicts and SemanticPattern
-        schema.
+        Searches 'name', 'pattern', 'description', and 'category' for
+        compatibility with both simple dicts and the SemanticPattern schema.
         """
         patterns = self.load_patterns(limit=1000)
         query_lower = query.lower()
+        query_tokens = self._tokenize(query)
         scored = []
+        # Per-field weight applied to each overlapping token.
+        fields = (('name', 3), ('pattern', 3), ('category', 2), ('description', 1))
         for p in patterns:
             score = 0
-            name = p.get('name', '').lower()
-            pattern_text = p.get('pattern', '').lower()
-            desc = p.get('description', '').lower()
-            category = p.get('category', '').lower()
-            if query_lower in name:
-                score += 3
-            if query_lower in pattern_text:
-                score += 3
-            if query_lower in desc:
-                score += 1
-            if query_lower in category:
-                score += 2
+            for field, weight in fields:
+                value = p.get(field, '')
+                if not value:
+                    continue
+                # Coerce non-string field values (a hand-edited or
+                # future-schema JSONL row could store a list/int) so we
+                # never crash on .lower() in the live retrieval path.
+                value_lower = str(value).lower()
+                # Whole-query substring bonus (preserves exact-match behavior).
+                if query_lower and query_lower in value_lower:
+                    score += weight
+                # Token-overlap scoring (enables NL-goal retrieval).
+                overlap = query_tokens & self._tokenize(value)
+                score += weight * len(overlap)
             if score > 0:
                 scored.append((score, p))
         scored.sort(key=lambda x: -x[0])

package/package.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
   "name": "loki-mode",
-  "version": "7.7.22",
-  "description": "Loki Mode by Autonomi. Multi-agent autonomous SDLC framework. Spec to deployed app: PRD, GitHub issue, OpenAPI/JSON/YAML, or one-line brief. 4 AI providers (Claude Code, OpenAI Codex, Cline, Aider). 11 quality gates.",
+  "version": "7.7.25",
+  "description": "Loki Mode by Autonomi. Autonomous spec-to-product system: takes a PRD, GitHub issue, OpenAPI/JSON/YAML, or one-line brief to a deployed app via the RARV-C closure loop with 11 quality gates. Provider-agnostic (Claude Code, OpenAI Codex, Cline, Aider).",
   "keywords": [
     "agent",
     "agent-orchestration",
@@ -64,6 +64,7 @@
   "files": [
     "SKILL.md",
     "VERSION",
+    "tools/",
     "autonomy/",
     "providers/",
     "agents/",

package/tools/bench_cross_project_lift.py ADDED Viewed

@@ -0,0 +1,218 @@
+#!/usr/bin/env python3
+"""v7.7.24: cross-project knowledge "lift" report (the memory moat proof).
+WHAT THIS MEASURES (honestly):
+    Loki's moat claim is that knowledge learned on one project helps a
+    DIFFERENT project. The transfer mechanism is real and already in the
+    codebase: each project's semantic patterns (.loki/memory/semantic/)
+    are extracted into an org-wide knowledge graph
+    (memory/knowledge_graph.py -> ~/.loki/knowledge/patterns.jsonl), and
+    any other project can query that graph (query_patterns).
+    "Lift" here is a RETRIEVAL-COVERAGE metric, not a task-success metric.
+    For a target project's set of task goals we count how many RELEVANT
+    patterns are retrievable in two conditions:
+        baseline: only the target project's own patterns are in the graph
+        cross:    the target's patterns PLUS sibling projects' patterns
+    Lift = (relevant retrieved in cross) - (relevant retrieved in baseline),
+    and net-new = relevant patterns that ONLY the sibling projects could
+    supply (the target could never have surfaced them alone).
+WHAT THIS DOES NOT CLAIM:
+    - It does NOT claim downstream task success / fewer iterations / lower
+      cost. That requires running real LLM tasks end-to-end, which this
+      offline harness does not do. Measuring that is a separate, larger
+      benchmark.
+    - "Relevant" is keyword-overlap against the goal, not semantic ground
+      truth. It is a proxy. The number is a coverage signal, not a
+      correctness guarantee.
+The harness is fully self-contained: it seeds synthetic projects in a
+temp dir, points the knowledge graph at a temp knowledge dir, runs both
+conditions, prints a report, and self-cleans. It never touches a real
+~/.loki/knowledge or any real .loki/memory.
+"""
+from __future__ import annotations
+import argparse
+import json
+import os
+import shutil
+import sys
+import tempfile
+from pathlib import Path
+_HERE = os.path.dirname(os.path.abspath(__file__))
+_REPO_ROOT = os.path.dirname(_HERE)
+if _REPO_ROOT not in sys.path:
+    sys.path.insert(0, _REPO_ROOT)
+# Synthetic patterns per source project. Each is a semantic pattern dict
+# matching what memory/knowledge_graph.py reads (name/category/description).
+SOURCE_PROJECTS = {
+    "payments-api": [
+        {"name": "idempotency-key-on-charge", "category": "reliability",
+         "description": "retry-safe charge endpoints require an idempotency key header"},
+        {"name": "stripe-webhook-signature-verify", "category": "security",
+         "description": "verify stripe webhook signatures before processing payment events"},
+        {"name": "decimal-money-never-float", "category": "correctness",
+         "description": "represent money as integer cents or Decimal, never float"},
+    ],
+    "auth-service": [
+        {"name": "jwt-short-ttl-refresh-rotation", "category": "security",
+         "description": "access tokens short ttl with rotating refresh tokens"},
+        {"name": "rate-limit-login-by-ip-and-account", "category": "security",
+         "description": "rate limit login attempts per ip and per account to stop credential stuffing"},
+        {"name": "argon2-password-hash", "category": "security",
+         "description": "hash passwords with argon2id not bcrypt for new services"},
+    ],
+}
+# Patterns the TARGET project already knows on its own (so they are NOT
+# net-new from siblings).
+TARGET_OWN_PATTERNS = [
+    {"name": "openapi-spec-first", "category": "design",
+     "description": "write the openapi spec before implementing the api"},
+]
+# The target project's task goals. Each goal SHOULD be served by a
+# sibling pattern (that the target lacks). These are the realistic
+# overlaps a new billing+login service would hit.
+TARGET_GOALS = [
+    "make the charge endpoint safe to retry",
+    "verify incoming payment webhooks are authentic",
+    "store monetary amounts without rounding errors",
+    "secure login against credential stuffing attacks",
+    "choose a password hashing algorithm",
+    "design the api contract up front",  # served by target's OWN pattern
+]
+def _seed_project(root: Path, name: str, patterns: list) -> None:
+    semantic = root / name / ".loki" / "memory" / "semantic"
+    semantic.mkdir(parents=True, exist_ok=True)
+    for i, p in enumerate(patterns):
+        with open(semantic / f"pattern_{i}.json", "w") as f:
+            json.dump(p, f)
+def _relevant(pattern: dict, goal: str) -> bool:
+    """Keyword-overlap relevance proxy: any meaningful token from the
+    pattern name/description appears in the goal, or vice versa."""
+    stop = {"the", "a", "an", "to", "for", "of", "and", "or", "with",
+            "without", "is", "are", "be", "up", "on", "in", "by", "not",
+            "make", "choose", "store"}
+    def toks(s):
+        return {t for t in s.lower().replace("-", " ").split() if t not in stop and len(t) > 2}
+    goal_t = toks(goal)
+    pat_t = toks(pattern.get("name", "")) | toks(pattern.get("description", ""))
+    return len(goal_t & pat_t) >= 2
+def _coverage(graph, goals, top_k):
+    """For each goal, query the graph and count goals that retrieved at
+    least one relevant pattern. Returns (covered_goals, served_by_sibling)."""
+    covered = 0
+    sibling_served = 0
+    details = []
+    for goal in goals:
+        results = graph.query_patterns(goal, max_results=top_k)
+        relevant = [r for r in results if _relevant(r, goal)]
+        is_covered = len(relevant) > 0
+        # served_by_sibling: at least one relevant result came from a
+        # non-target source project.
+        from_sibling = any(
+            r.get("_source_project", "").rsplit("/", 1)[-1] != "target-billing-login"
+            for r in relevant
+        )
+        if is_covered:
+            covered += 1
+        if is_covered and from_sibling:
+            sibling_served += 1
+        details.append({
+            "goal": goal,
+            "covered": is_covered,
+            "relevant_count": len(relevant),
+            "served_by_sibling": is_covered and from_sibling,
+        })
+    return covered, sibling_served, details
+def run(top_k: int, as_json: bool) -> int:
+    tmp = tempfile.mkdtemp(prefix="loki-xproj-lift-")
+    try:
+        from memory.knowledge_graph import OrganizationKnowledgeGraph
+        projects_root = Path(tmp) / "git"
+        projects_root.mkdir(parents=True)
+        # Seed sibling source projects + the target project.
+        for name, pats in SOURCE_PROJECTS.items():
+            _seed_project(projects_root, name, pats)
+        _seed_project(projects_root, "target-billing-login", TARGET_OWN_PATTERNS)
+        target_dir = projects_root / "target-billing-login"
+        sibling_dirs = [projects_root / n for n in SOURCE_PROJECTS]
+        # BASELINE: knowledge graph built from the target alone.
+        base_kg = OrganizationKnowledgeGraph(
+            knowledge_dir=str(Path(tmp) / "knowledge-baseline"))
+        base_pats = base_kg.extract_patterns([target_dir])
+        base_kg.save_patterns(base_kg.deduplicate_patterns(base_pats))
+        base_covered, base_sibling, base_detail = _coverage(base_kg, TARGET_GOALS, top_k)
+        # CROSS: knowledge graph built from target + siblings.
+        cross_kg = OrganizationKnowledgeGraph(
+            knowledge_dir=str(Path(tmp) / "knowledge-cross"))
+        cross_pats = cross_kg.extract_patterns([target_dir] + sibling_dirs)
+        cross_kg.save_patterns(cross_kg.deduplicate_patterns(cross_pats))
+        cross_covered, cross_sibling, cross_detail = _coverage(cross_kg, TARGET_GOALS, top_k)
+        n = len(TARGET_GOALS)
+        lift = cross_covered - base_covered
+        report = {
+            "goals": n,
+            "baseline_covered": base_covered,
+            "cross_covered": cross_covered,
+            "lift_absolute": lift,
+            "lift_pct_points": round(100.0 * lift / n, 1),
+            "net_new_from_siblings": cross_sibling - base_sibling,
+            "top_k": top_k,
+            "method": "retrieval-coverage (keyword-overlap relevance proxy), NOT task-success",
+            "per_goal": cross_detail,
+        }
+        if as_json:
+            print(json.dumps(report, indent=2))
+        else:
+            print("Cross-project knowledge LIFT report (memory moat proof)")
+            print(f"  target goals:               {n}")
+            print(f"  covered (target alone):     {base_covered}/{n}")
+            print(f"  covered (target + siblings): {cross_covered}/{n}")
+            print(f"  LIFT:                       +{lift} goals "
+                  f"(+{report['lift_pct_points']} pts)")
+            print(f"  net-new served by siblings: {report['net_new_from_siblings']}")
+            print(f"  method: {report['method']}")
+            print("  per-goal:")
+            for d in cross_detail:
+                tag = "sibling" if d["served_by_sibling"] else ("self" if d["covered"] else "MISS")
+                print(f"    [{tag:7}] {d['goal']}")
+        # Exit non-zero if there is no measurable lift (so it can gate CI:
+        # a regression that breaks cross-project transfer would fail here).
+        return 0 if lift > 0 else 1
+    finally:
+        shutil.rmtree(tmp, ignore_errors=True)
+def main():
+    ap = argparse.ArgumentParser(description="Cross-project knowledge lift report")
+    ap.add_argument("--top-k", type=int, default=5, help="patterns retrieved per goal")
+    ap.add_argument("--json", action="store_true", help="emit JSON")
+    args = ap.parse_args()
+    sys.exit(run(args.top_k, args.json))
+if __name__ == "__main__":
+    main()