npm - loki-mode - Versions diffs - 7.45.0 → 7.46.0 - Mend

loki-mode 7.45.0 → 7.46.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (45) hide show

package/README.md +16 -12
package/SKILL.md +5 -5
package/VERSION +1 -1
package/autonomy/CONSTITUTION.md +9 -2
package/autonomy/lib/sentrux-gate.sh +1 -1
package/autonomy/loki +2 -2
package/autonomy/run.sh +355 -92
package/dashboard/__init__.py +1 -1
package/dashboard/registry.py +156 -62
package/dashboard/server.py +9 -10
package/docs/COMPARISON.md +10 -10
package/docs/COMPETITIVE-ANALYSIS.md +1 -1
package/docs/INSTALLATION.md +2 -2
package/docs/P0-SWEEP-PLAN.md +163 -0
package/docs/architecture/STATE-MACHINES.md +18 -19
package/docs/architecture/bmad-loki-voice-agent-council-analysis.md +1 -1
package/docs/auto-claude-comparison.md +14 -11
package/docs/certification/01-core-concepts/lesson.md +12 -11
package/docs/certification/01-core-concepts/quiz.md +6 -6
package/docs/certification/05-troubleshooting/lesson.md +23 -13
package/docs/certification/05-troubleshooting/quiz.md +3 -3
package/docs/certification/answer-key.md +2 -2
package/docs/certification/certification-exam.md +9 -9
package/docs/competitive/bolt-new-analysis.md +1 -1
package/docs/competitive/emergence-others-analysis.md +9 -9
package/docs/competitive/replit-lovable-analysis.md +3 -3
package/docs/cursor-comparison.md +15 -12
package/docs/dashboard-guide.md +9 -7
package/docs/prd-purple-lab-platform-v2.md +1 -1
package/docs/prd-purple-lab-platform.md +3 -3
package/docs/show-hn-post.md +2 -2
package/loki-ts/dist/loki.js +2 -2
package/mcp/__init__.py +1 -1
package/package.json +2 -2
package/plugins/loki-mode/.claude-plugin/plugin.json +2 -2
package/plugins/loki-mode/README.md +1 -1
package/references/magic-rarv-integration.md +1 -1
package/references/quality-control.md +5 -5
package/references/sdlc-phases.md +1 -2
package/skills/00-index.md +1 -1
package/skills/artifacts.md +1 -1
package/skills/healing.md +1 -1
package/skills/magic-modules.md +3 -3
package/skills/quality-gates.md +52 -39
package/skills/testing.md +1 -1

package/README.md CHANGED Viewed

@@ -19,14 +19,14 @@ _The free, source-available autonomous coding agent by [Autonomi](https://www.au
 ---
-> **How it works:** Drop a spec -- a PRD, GitHub issue, OpenAPI/JSON/YAML, or one-line brief. Loki Mode classifies complexity (`run.sh:detect_complexity()`), assembles an agent team from 41 specialized agent roles across 8 domains - prompt-defined specifications the orchestrator adopts per phase, with parallel review (blind council) and optional worktree streams on Claude Code, sequential on other providers - and runs autonomous RARV cycles (Reason - Act - Reflect - Verify, see `run.sh:run_autonomous()`) with 11 quality gates (see `skills/quality-gates.md`). Code is not "done" until it passes automated verification. Output is a Git repo with source, tests, configs, and audit logs.
+> **How it works:** Drop a spec -- a PRD, GitHub issue, OpenAPI/JSON/YAML, or one-line brief. Loki Mode classifies complexity (`run.sh:detect_complexity()`), assembles an agent team from 41 specialized agent roles across 8 domains - prompt-defined specifications the orchestrator adopts per phase, with parallel review (blind council) and optional worktree streams on Claude Code, sequential on other providers - and runs autonomous RARV cycles (Reason - Act - Reflect - Verify, see `run.sh:run_autonomous()`) with 8 quality gates (see `skills/quality-gates.md`). Code is not "done" until it passes automated verification. Output is a Git repo with source, tests, configs, and audit logs.
 ---
 ## Why Loki Mode?
 - **Spec-driven, autonomous, with a built-in trust layer** -- Hand Loki a spec, walk away, come back to working code with tests. The full RARV-C closure loop (Reason - Act - Reflect - Verify - Close) runs until the work is actually done, not just attempted. The verified-completion evidence gate (`skills/quality-gates.md`) refuses any "done" claim on an empty git diff against the run-start commit, and blocks completion when tests run red, so "complete" means proven, not promised.
-- **Production quality built in** -- 11 quality gates (`skills/quality-gates.md`), blind 3-reviewer code review (`run.sh:run_code_review()`), anti-sycophancy checks
+- **Production quality built in** -- 8 quality gates (`skills/quality-gates.md`), blind 3-reviewer code review (`run.sh:run_code_review()`), anti-sycophancy checks
 - **Standalone verification: `loki verify`** -- Run Loki's deterministic gates (build, tests, static analysis, secret scan, dependency audit) against any branch or PR diff, including code written by other agents or humans. CI-ready exit codes (0 VERIFIED, 1 CONCERNS, 2 BLOCKED), machine-readable evidence at `.loki/verify/evidence.json`. Inconclusive evidence is never reported as VERIFIED (v7.27.0).
 - **Living spec and pre-build interrogation** -- `loki spec` locks a spec and detects drift deterministically (`spec.lock`, `drift-report.json`, and a `SPEC_DRIFT` finding in `loki verify` with CI exit codes), so you can tell when the build diverges from what was agreed. `loki grill` runs a Devil's-Advocate interrogation of the spec before you build, surfacing gaps and contradictions early (v7.28.0).
 - **Mid-flight model switching** -- switch the model a live run uses from the dashboard (applies at the next iteration, current run only). A Fable tier lever exists in the CLI, dashboard, and override paths, but Claude Fable 5 is not yet available at the API, so selecting Fable currently collapses to Opus at every dispatch chokepoint and the `loki plan` quote reflects Opus accordingly. For every model lever (session pin, mid-flight override, architect pass) and every `LOKI_MAX_TIER` path, the `loki plan` quote, the dashboard's reported model, and the actual dispatched model agree, with the ceiling enforced (v7.31.0; Fable-to-Opus collapse v7.39.1).
@@ -41,7 +41,7 @@ _The free, source-available autonomous coding agent by [Autonomi](https://www.au
 - **MCP server** -- 34 tools (including ChromaDB code search) plus 3 resources and 2 prompts (`mcp/server.py`, with magic tools registered from `mcp/magic_tools.py` and the managed-memory tool from `mcp/managed_tools.py`). Of the 34, 33 are always available; `loki_memory_redact` is registered but only succeeds when `LOKI_MANAGED_AGENTS=true` and `LOKI_MANAGED_MEMORY=true`. Launch with `loki mcp` (bootstraps the Python MCP SDK on first run).
 - **Full-stack output** -- Source code, tests, Docker Compose stacks (multi-service with healthchecks), CI/CD pipelines, audit logs
 - **Provider-agnostic** -- runs on Claude, Codex, Cline, or Aider with automatic failover (`loki-ts/src/runner/providers.ts`); no vendor lock-in. Gemini CLI deprecated v7.5.18; Antigravity CLI coming soon.
-- **Open source** -- Free for personal, internal, and academic use.
+- **Source-available (BUSL-1.1)** -- Free for personal, internal, and academic use.
 ---
@@ -193,7 +193,7 @@ A "spec" is whatever you hand `loki start`. Loki auto-detects the format and nor
 | OpenSpec change directory | `loki start --openspec ./openspec/change-001` | Reads OpenSpec change manifest + delta files. |
 | Auto-detect (no input) | `loki start` | Picks up `./prd.md`, `./spec.{json,yaml,yml}`, or `./SPEC.md` from cwd. |
-All formats land in the same RARV pipeline and pass the same 11 quality gates (`skills/quality-gates.md`).
+All formats land in the same RARV pipeline and pass the same 8 quality gates (`skills/quality-gates.md`).
 </details>
@@ -252,8 +252,8 @@ Every iteration: **Reason** (read state) - **Act** (execute, commit) - **Reflect
 </td>
 <td width="33%" valign="top">
-### 11 Quality Gates
-Blind review, anti-sycophancy, severity blocking, mock/mutation detection, backward compatibility (gate 10, v6.67.0), documentation coverage (gate 11, v7.5.0). Code does not ship until all gates pass.
+### 8 Quality Gates
+Static analysis, test suite (pass/fail), blind 3-reviewer code review with severity blocking, anti-sycophancy Devil's Advocate, mock-integrity detection, test-mutation detection, documentation coverage, and Magic Modules debate. Backward-compatibility is a conditional healing-mode auditor (not one of the 8). Code does not ship until all gates pass.
 [Quality Gates](skills/quality-gates.md)
@@ -303,15 +303,15 @@ The historical feature set (platform pages, Monaco IDE workspace, AI chat panel)
 |---------|:---------:|:--------:|:------:|:-------:|
 | Self-hosted / your keys | Yes | No | No | No |
 | 5 AI provider failover | Yes | No | No | No |
-| 11 quality gates | Yes | No | No | No |
+| 8 quality gates | Yes | No | No | No |
 | Blind code review | Yes | No | No | No |
 | Enterprise auth (SSO/RBAC) | Yes | No | Yes | No |
 | Air-gapped deployment | Yes | No | No | No |
 | Docker + CI/CD generation | Yes | No | Yes | No |
-| Open source | Yes | No | No | No |
-| Free tier | Open source | Yes | Yes | Yes |
+| Source-available (BUSL-1.1) | Yes | No | No | No |
+| Free tier | Source-available | Yes | Yes | Yes |
-Loki Mode is the only platform that is fully self-hosted, open source, and includes automated quality verification. Your code, your keys, your infrastructure.
+Loki Mode is the only platform that is fully self-hosted, source-available (BUSL-1.1), and includes automated quality verification. Your code, your keys, your infrastructure.
 ---
@@ -432,7 +432,7 @@ See [benchmarks/](benchmarks/) for methodology.
 ![Loki Mode Presentation](docs/loki-mode-presentation.gif)
-*11 slides: Problem, Solution, 41 Agents, RARV Cycle, 9 Quality Gates (HumanEval 98.78%), Multi-Provider, Enterprise Hardening (Live App Preview), Full Lifecycle*
+*11 slides: Problem, Solution, 41 Agents, RARV Cycle, 8 Quality Gates (HumanEval 98.78%), Multi-Provider, Enterprise Hardening (Live App Preview), Full Lifecycle*
 **[Download PPTX](docs/loki-mode-presentation.pptx)**
@@ -446,7 +446,7 @@ See [benchmarks/](benchmarks/) for methodology.
 |------|-----------|---------------------|
 | **Code Gen** | Full-stack apps from PRDs | Complex domain logic may need human review |
 | **Deploy** | Generates configs, Dockerfiles, CI/CD | Does not deploy -- human runs deploy commands |
-| **Testing** | 11 automated quality gates | Test quality depends on AI assertions |
+| **Testing** | 8 automated quality gates | Test quality depends on AI assertions |
 | **Providers** | 5 providers with auto-failover | Non-Claude providers lack parallel agents |
 | **Dashboard** | Real-time single-machine monitoring | No multi-node clustering |
@@ -499,3 +499,7 @@ See [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines.
 **[Autonomi](https://www.autonomi.dev/)** | **[Documentation](wiki/Home.md)** | **[Changelog](CHANGELOG.md)** | **[Comparisons](references/competitive-analysis.md)**
 </div>
+## Star History
+[![Star History Chart](https://api.star-history.com/chart?repos=asklokesh/loki-mode&type=timeline&logscale&legend=bottom-right)](https://www.star-history.com/?repos=asklokesh%2Floki-mode&type=timeline&logscale=&legend=bottom-right)

package/SKILL.md CHANGED Viewed

@@ -1,13 +1,13 @@
 ---
 name: loki-mode
-description: Autonomous spec-driven build system with a built-in trust layer. It does not call work done until it is verified (RARV-C closure loop, 11 quality gates, completion council, verified-completion evidence gate). Triggers on "Loki Mode". Takes a spec (PRD, GitHub issue, OpenAPI doc, etc.) to deployed product with minimal human intervention. Provider-agnostic. Requires --dangerously-skip-permissions flag.
+description: Autonomous spec-driven build system with a built-in trust layer. It does not call work done until it is verified (RARV-C closure loop, 8 quality gates, completion council, verified-completion evidence gate). Triggers on "Loki Mode". Takes a spec (PRD, GitHub issue, OpenAPI doc, etc.) to deployed product with minimal human intervention. Provider-agnostic. Requires --dangerously-skip-permissions flag.
 ---
-# Loki Mode v7.45.0
+# Loki Mode v7.46.0
 **You are an autonomous agent. You make decisions. You do not ask questions. You do not stop.**
-**Spec in, verified product out.** Spec-driven: a "spec" is whatever describes the work -- a Markdown PRD, a GitHub issue, an OpenAPI doc, a Jira ticket (a PRD is one form of spec). The differentiator is the trust layer: Loki does not call work done until it is verified. The RARV-C closure loop, 11 quality gates, the completion council, and the verified-completion evidence gate must all clear before completion is accepted.
+**Spec in, verified product out.** Spec-driven: a "spec" is whatever describes the work -- a Markdown PRD, a GitHub issue, an OpenAPI doc, a Jira ticket (a PRD is one form of spec). The differentiator is the trust layer: Loki does not call work done until it is verified. The RARV-C closure loop, 8 quality gates, the completion council, and the verified-completion evidence gate must all clear before completion is accepted.
 **Provider-agnostic (stable since v5.0.0):** runs on Claude/Codex/Cline/Aider with abstract model tiers and degraded mode for non-Claude providers; no vendor lock-in. Gemini deprecated v7.5.18. See `skills/providers.md`. **Current track (v7.7.x):** LSP grounding as first-class agent tool (v7.7.0-v7.7.9; lsp_get_diagnostics actually-returns-diagnostics regression fix v7.7.14), provider_source cli (v7.7.11-v7.7.12 bash/bun parity), Docker/bash-3.2 robustness (v7.7.13), audit chain cross-file verification fix (v7.7.15), Phase 1 RARV-C closure (real provider judges, gate-failure flock, synthetic PRD e2e, status `--json`).
@@ -125,7 +125,7 @@ DEPLOYMENT ──[production live, monitoring active]──> GROWTH
 GROWTH ──[continuous improvement loop]──> GROWTH
 ```
-**Transition requires:** All phase quality gates passed. No Critical/High/Medium issues.
+**Transition requires:** All phase quality gates passed. No Critical/High issues (Medium/Low advisory).
 ---
@@ -407,4 +407,4 @@ See `CHANGELOG.md` entries [7.5.7], [7.5.8], [7.5.13] for the per-fix list and r
 ---
-**v7.45.0 | [Autonomi](https://www.autonomi.dev/) flagship product | ~260 lines core**
+**v7.46.0 | [Autonomi](https://www.autonomi.dev/) flagship product | ~260 lines core**

package/VERSION CHANGED Viewed

	@@ -1 +1 @@
1	- 7.45.0
1	+ 7.46.0

package/autonomy/CONSTITUTION.md CHANGED Viewed

@@ -155,7 +155,7 @@ SKILL.md (~190 lines)         # Always loaded: RARV cycle, autonomy rules
 skills/
   00-index.md                  # Module routing table
   model-selection.md           # Task tool, parallelization
-  quality-gates.md             # 9-gate system, anti-sycophancy
+  quality-gates.md             # 8-gate system, anti-sycophancy
   testing.md                   # Playwright, E2E, property-based
   production.md                # CI/CD, batch processing
   agents.md                    # 41 agent types, A2A patterns
@@ -196,7 +196,14 @@ Main Worktree (orchestrator)
 ---
-## Quality Gates (9-Gate System)
+## Quality Gates
+Note: the canonical, implemented quality-gate set is the 8-gate system defined in
+`skills/quality-gates.md` (static analysis, test suite pass/fail, blind 3-reviewer
+code review with severity blocking, anti-sycophancy Devil's Advocate, mock
+integrity, test mutation, documentation coverage, Magic Modules debate). The
+list below is an aspirational SDLC taxonomy and may name phases that are not
+deterministic loop gates; treat `skills/quality-gates.md` as the source of truth.
 ### Gate 1: Static Analysis
 ```yaml

package/autonomy/lib/sentrux-gate.sh CHANGED Viewed

@@ -2,7 +2,7 @@
 # Loki Mode -- sentrux architectural-drift helper (v7.5.14).
 #
 # Why this exists:
-#   Loki's existing 11 quality gates and 3-reviewer council catch correctness
+#   Loki's existing 8 quality gates and 3-reviewer council catch correctness
 #   and behavioral regressions, but no current gate emits a deterministic,
 #   per-iteration architecture-drift signal. sentrux (https://github.com/sentrux/sentrux)
 #   is a Rust CLI that scores codebase structure (modularity, acyclicity,

package/autonomy/loki CHANGED Viewed

@@ -16135,7 +16135,7 @@ print()
                 echo -e "${RED}Test coverage gate FAILED${NC}"
                 return 1
             else
-                echo -e "${GREEN}Tests passed (min coverage: ${min_coverage}%)${NC}"
+                echo -e "${GREEN}Tests passed (min coverage TARGET, not measured: ${min_coverage}%)${NC}"
                 return 0
             fi
             ;;
@@ -27527,7 +27527,7 @@ if test_results:
     test_lines.append(f"Passed: {test_results.get('pass', 'unknown')}")
     test_lines.append(f"Timestamp: {test_results.get('timestamp', 'unknown')}")
     if test_results.get("min_coverage"):
-        test_lines.append(f"Min coverage target: {test_results['min_coverage']}%")
+        test_lines.append(f"Min coverage TARGET (not measured): {test_results['min_coverage']}%")
 else:
     test_lines.append("No test result data recorded")
 sections.append(("Tests", test_lines))