loki-mode 7.45.1 → 7.46.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (44) hide show
  1. package/README.md +16 -12
  2. package/SKILL.md +5 -5
  3. package/VERSION +1 -1
  4. package/autonomy/CONSTITUTION.md +9 -2
  5. package/autonomy/lib/sentrux-gate.sh +1 -1
  6. package/autonomy/loki +2 -2
  7. package/autonomy/run.sh +355 -92
  8. package/dashboard/__init__.py +1 -1
  9. package/dashboard/server.py +9 -10
  10. package/docs/COMPARISON.md +10 -10
  11. package/docs/COMPETITIVE-ANALYSIS.md +1 -1
  12. package/docs/INSTALLATION.md +2 -2
  13. package/docs/P0-SWEEP-PLAN.md +163 -0
  14. package/docs/architecture/STATE-MACHINES.md +18 -19
  15. package/docs/architecture/bmad-loki-voice-agent-council-analysis.md +1 -1
  16. package/docs/auto-claude-comparison.md +14 -11
  17. package/docs/certification/01-core-concepts/lesson.md +12 -11
  18. package/docs/certification/01-core-concepts/quiz.md +6 -6
  19. package/docs/certification/05-troubleshooting/lesson.md +23 -13
  20. package/docs/certification/05-troubleshooting/quiz.md +3 -3
  21. package/docs/certification/answer-key.md +2 -2
  22. package/docs/certification/certification-exam.md +9 -9
  23. package/docs/competitive/bolt-new-analysis.md +1 -1
  24. package/docs/competitive/emergence-others-analysis.md +9 -9
  25. package/docs/competitive/replit-lovable-analysis.md +3 -3
  26. package/docs/cursor-comparison.md +15 -12
  27. package/docs/dashboard-guide.md +9 -7
  28. package/docs/prd-purple-lab-platform-v2.md +1 -1
  29. package/docs/prd-purple-lab-platform.md +3 -3
  30. package/docs/show-hn-post.md +2 -2
  31. package/loki-ts/dist/loki.js +2 -2
  32. package/mcp/__init__.py +1 -1
  33. package/package.json +2 -2
  34. package/plugins/loki-mode/.claude-plugin/plugin.json +2 -2
  35. package/plugins/loki-mode/README.md +1 -1
  36. package/references/magic-rarv-integration.md +1 -1
  37. package/references/quality-control.md +5 -5
  38. package/references/sdlc-phases.md +1 -2
  39. package/skills/00-index.md +1 -1
  40. package/skills/artifacts.md +1 -1
  41. package/skills/healing.md +1 -1
  42. package/skills/magic-modules.md +3 -3
  43. package/skills/quality-gates.md +52 -39
  44. package/skills/testing.md +1 -1
package/README.md CHANGED
@@ -19,14 +19,14 @@ _The free, source-available autonomous coding agent by [Autonomi](https://www.au
19
19
 
20
20
  ---
21
21
 
22
- > **How it works:** Drop a spec -- a PRD, GitHub issue, OpenAPI/JSON/YAML, or one-line brief. Loki Mode classifies complexity (`run.sh:detect_complexity()`), assembles an agent team from 41 specialized agent roles across 8 domains - prompt-defined specifications the orchestrator adopts per phase, with parallel review (blind council) and optional worktree streams on Claude Code, sequential on other providers - and runs autonomous RARV cycles (Reason - Act - Reflect - Verify, see `run.sh:run_autonomous()`) with 11 quality gates (see `skills/quality-gates.md`). Code is not "done" until it passes automated verification. Output is a Git repo with source, tests, configs, and audit logs.
22
+ > **How it works:** Drop a spec -- a PRD, GitHub issue, OpenAPI/JSON/YAML, or one-line brief. Loki Mode classifies complexity (`run.sh:detect_complexity()`), assembles an agent team from 41 specialized agent roles across 8 domains - prompt-defined specifications the orchestrator adopts per phase, with parallel review (blind council) and optional worktree streams on Claude Code, sequential on other providers - and runs autonomous RARV cycles (Reason - Act - Reflect - Verify, see `run.sh:run_autonomous()`) with 8 quality gates (see `skills/quality-gates.md`). Code is not "done" until it passes automated verification. Output is a Git repo with source, tests, configs, and audit logs.
23
23
 
24
24
  ---
25
25
 
26
26
  ## Why Loki Mode?
27
27
 
28
28
  - **Spec-driven, autonomous, with a built-in trust layer** -- Hand Loki a spec, walk away, come back to working code with tests. The full RARV-C closure loop (Reason - Act - Reflect - Verify - Close) runs until the work is actually done, not just attempted. The verified-completion evidence gate (`skills/quality-gates.md`) refuses any "done" claim on an empty git diff against the run-start commit, and blocks completion when tests run red, so "complete" means proven, not promised.
29
- - **Production quality built in** -- 11 quality gates (`skills/quality-gates.md`), blind 3-reviewer code review (`run.sh:run_code_review()`), anti-sycophancy checks
29
+ - **Production quality built in** -- 8 quality gates (`skills/quality-gates.md`), blind 3-reviewer code review (`run.sh:run_code_review()`), anti-sycophancy checks
30
30
  - **Standalone verification: `loki verify`** -- Run Loki's deterministic gates (build, tests, static analysis, secret scan, dependency audit) against any branch or PR diff, including code written by other agents or humans. CI-ready exit codes (0 VERIFIED, 1 CONCERNS, 2 BLOCKED), machine-readable evidence at `.loki/verify/evidence.json`. Inconclusive evidence is never reported as VERIFIED (v7.27.0).
31
31
  - **Living spec and pre-build interrogation** -- `loki spec` locks a spec and detects drift deterministically (`spec.lock`, `drift-report.json`, and a `SPEC_DRIFT` finding in `loki verify` with CI exit codes), so you can tell when the build diverges from what was agreed. `loki grill` runs a Devil's-Advocate interrogation of the spec before you build, surfacing gaps and contradictions early (v7.28.0).
32
32
  - **Mid-flight model switching** -- switch the model a live run uses from the dashboard (applies at the next iteration, current run only). A Fable tier lever exists in the CLI, dashboard, and override paths, but Claude Fable 5 is not yet available at the API, so selecting Fable currently collapses to Opus at every dispatch chokepoint and the `loki plan` quote reflects Opus accordingly. For every model lever (session pin, mid-flight override, architect pass) and every `LOKI_MAX_TIER` path, the `loki plan` quote, the dashboard's reported model, and the actual dispatched model agree, with the ceiling enforced (v7.31.0; Fable-to-Opus collapse v7.39.1).
@@ -41,7 +41,7 @@ _The free, source-available autonomous coding agent by [Autonomi](https://www.au
41
41
  - **MCP server** -- 34 tools (including ChromaDB code search) plus 3 resources and 2 prompts (`mcp/server.py`, with magic tools registered from `mcp/magic_tools.py` and the managed-memory tool from `mcp/managed_tools.py`). Of the 34, 33 are always available; `loki_memory_redact` is registered but only succeeds when `LOKI_MANAGED_AGENTS=true` and `LOKI_MANAGED_MEMORY=true`. Launch with `loki mcp` (bootstraps the Python MCP SDK on first run).
42
42
  - **Full-stack output** -- Source code, tests, Docker Compose stacks (multi-service with healthchecks), CI/CD pipelines, audit logs
43
43
  - **Provider-agnostic** -- runs on Claude, Codex, Cline, or Aider with automatic failover (`loki-ts/src/runner/providers.ts`); no vendor lock-in. Gemini CLI deprecated v7.5.18; Antigravity CLI coming soon.
44
- - **Open source** -- Free for personal, internal, and academic use.
44
+ - **Source-available (BUSL-1.1)** -- Free for personal, internal, and academic use.
45
45
 
46
46
  ---
47
47
 
@@ -193,7 +193,7 @@ A "spec" is whatever you hand `loki start`. Loki auto-detects the format and nor
193
193
  | OpenSpec change directory | `loki start --openspec ./openspec/change-001` | Reads OpenSpec change manifest + delta files. |
194
194
  | Auto-detect (no input) | `loki start` | Picks up `./prd.md`, `./spec.{json,yaml,yml}`, or `./SPEC.md` from cwd. |
195
195
 
196
- All formats land in the same RARV pipeline and pass the same 11 quality gates (`skills/quality-gates.md`).
196
+ All formats land in the same RARV pipeline and pass the same 8 quality gates (`skills/quality-gates.md`).
197
197
 
198
198
  </details>
199
199
 
@@ -252,8 +252,8 @@ Every iteration: **Reason** (read state) - **Act** (execute, commit) - **Reflect
252
252
  </td>
253
253
  <td width="33%" valign="top">
254
254
 
255
- ### 11 Quality Gates
256
- Blind review, anti-sycophancy, severity blocking, mock/mutation detection, backward compatibility (gate 10, v6.67.0), documentation coverage (gate 11, v7.5.0). Code does not ship until all gates pass.
255
+ ### 8 Quality Gates
256
+ Static analysis, test suite (pass/fail), blind 3-reviewer code review with severity blocking, anti-sycophancy Devil's Advocate, mock-integrity detection, test-mutation detection, documentation coverage, and Magic Modules debate. Backward-compatibility is a conditional healing-mode auditor (not one of the 8). Code does not ship until all gates pass.
257
257
 
258
258
  [Quality Gates](skills/quality-gates.md)
259
259
 
@@ -303,15 +303,15 @@ The historical feature set (platform pages, Monaco IDE workspace, AI chat panel)
303
303
  |---------|:---------:|:--------:|:------:|:-------:|
304
304
  | Self-hosted / your keys | Yes | No | No | No |
305
305
  | 5 AI provider failover | Yes | No | No | No |
306
- | 11 quality gates | Yes | No | No | No |
306
+ | 8 quality gates | Yes | No | No | No |
307
307
  | Blind code review | Yes | No | No | No |
308
308
  | Enterprise auth (SSO/RBAC) | Yes | No | Yes | No |
309
309
  | Air-gapped deployment | Yes | No | No | No |
310
310
  | Docker + CI/CD generation | Yes | No | Yes | No |
311
- | Open source | Yes | No | No | No |
312
- | Free tier | Open source | Yes | Yes | Yes |
311
+ | Source-available (BUSL-1.1) | Yes | No | No | No |
312
+ | Free tier | Source-available | Yes | Yes | Yes |
313
313
 
314
- Loki Mode is the only platform that is fully self-hosted, open source, and includes automated quality verification. Your code, your keys, your infrastructure.
314
+ Loki Mode is the only platform that is fully self-hosted, source-available (BUSL-1.1), and includes automated quality verification. Your code, your keys, your infrastructure.
315
315
 
316
316
  ---
317
317
 
@@ -432,7 +432,7 @@ See [benchmarks/](benchmarks/) for methodology.
432
432
 
433
433
  ![Loki Mode Presentation](docs/loki-mode-presentation.gif)
434
434
 
435
- *11 slides: Problem, Solution, 41 Agents, RARV Cycle, 9 Quality Gates (HumanEval 98.78%), Multi-Provider, Enterprise Hardening (Live App Preview), Full Lifecycle*
435
+ *11 slides: Problem, Solution, 41 Agents, RARV Cycle, 8 Quality Gates (HumanEval 98.78%), Multi-Provider, Enterprise Hardening (Live App Preview), Full Lifecycle*
436
436
 
437
437
  **[Download PPTX](docs/loki-mode-presentation.pptx)**
438
438
 
@@ -446,7 +446,7 @@ See [benchmarks/](benchmarks/) for methodology.
446
446
  |------|-----------|---------------------|
447
447
  | **Code Gen** | Full-stack apps from PRDs | Complex domain logic may need human review |
448
448
  | **Deploy** | Generates configs, Dockerfiles, CI/CD | Does not deploy -- human runs deploy commands |
449
- | **Testing** | 11 automated quality gates | Test quality depends on AI assertions |
449
+ | **Testing** | 8 automated quality gates | Test quality depends on AI assertions |
450
450
  | **Providers** | 5 providers with auto-failover | Non-Claude providers lack parallel agents |
451
451
  | **Dashboard** | Real-time single-machine monitoring | No multi-node clustering |
452
452
 
@@ -499,3 +499,7 @@ See [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines.
499
499
  **[Autonomi](https://www.autonomi.dev/)** | **[Documentation](wiki/Home.md)** | **[Changelog](CHANGELOG.md)** | **[Comparisons](references/competitive-analysis.md)**
500
500
 
501
501
  </div>
502
+
503
+ ## Star History
504
+
505
+ [![Star History Chart](https://api.star-history.com/chart?repos=asklokesh/loki-mode&type=timeline&logscale&legend=bottom-right)](https://www.star-history.com/?repos=asklokesh%2Floki-mode&type=timeline&logscale=&legend=bottom-right)
package/SKILL.md CHANGED
@@ -1,13 +1,13 @@
1
1
  ---
2
2
  name: loki-mode
3
- description: Autonomous spec-driven build system with a built-in trust layer. It does not call work done until it is verified (RARV-C closure loop, 11 quality gates, completion council, verified-completion evidence gate). Triggers on "Loki Mode". Takes a spec (PRD, GitHub issue, OpenAPI doc, etc.) to deployed product with minimal human intervention. Provider-agnostic. Requires --dangerously-skip-permissions flag.
3
+ description: Autonomous spec-driven build system with a built-in trust layer. It does not call work done until it is verified (RARV-C closure loop, 8 quality gates, completion council, verified-completion evidence gate). Triggers on "Loki Mode". Takes a spec (PRD, GitHub issue, OpenAPI doc, etc.) to deployed product with minimal human intervention. Provider-agnostic. Requires --dangerously-skip-permissions flag.
4
4
  ---
5
5
 
6
- # Loki Mode v7.45.1
6
+ # Loki Mode v7.46.0
7
7
 
8
8
  **You are an autonomous agent. You make decisions. You do not ask questions. You do not stop.**
9
9
 
10
- **Spec in, verified product out.** Spec-driven: a "spec" is whatever describes the work -- a Markdown PRD, a GitHub issue, an OpenAPI doc, a Jira ticket (a PRD is one form of spec). The differentiator is the trust layer: Loki does not call work done until it is verified. The RARV-C closure loop, 11 quality gates, the completion council, and the verified-completion evidence gate must all clear before completion is accepted.
10
+ **Spec in, verified product out.** Spec-driven: a "spec" is whatever describes the work -- a Markdown PRD, a GitHub issue, an OpenAPI doc, a Jira ticket (a PRD is one form of spec). The differentiator is the trust layer: Loki does not call work done until it is verified. The RARV-C closure loop, 8 quality gates, the completion council, and the verified-completion evidence gate must all clear before completion is accepted.
11
11
 
12
12
  **Provider-agnostic (stable since v5.0.0):** runs on Claude/Codex/Cline/Aider with abstract model tiers and degraded mode for non-Claude providers; no vendor lock-in. Gemini deprecated v7.5.18. See `skills/providers.md`. **Current track (v7.7.x):** LSP grounding as first-class agent tool (v7.7.0-v7.7.9; lsp_get_diagnostics actually-returns-diagnostics regression fix v7.7.14), provider_source cli (v7.7.11-v7.7.12 bash/bun parity), Docker/bash-3.2 robustness (v7.7.13), audit chain cross-file verification fix (v7.7.15), Phase 1 RARV-C closure (real provider judges, gate-failure flock, synthetic PRD e2e, status `--json`).
13
13
 
@@ -125,7 +125,7 @@ DEPLOYMENT ──[production live, monitoring active]──> GROWTH
125
125
  GROWTH ──[continuous improvement loop]──> GROWTH
126
126
  ```
127
127
 
128
- **Transition requires:** All phase quality gates passed. No Critical/High/Medium issues.
128
+ **Transition requires:** All phase quality gates passed. No Critical/High issues (Medium/Low advisory).
129
129
 
130
130
  ---
131
131
 
@@ -407,4 +407,4 @@ See `CHANGELOG.md` entries [7.5.7], [7.5.8], [7.5.13] for the per-fix list and r
407
407
 
408
408
  ---
409
409
 
410
- **v7.45.1 | [Autonomi](https://www.autonomi.dev/) flagship product | ~260 lines core**
410
+ **v7.46.0 | [Autonomi](https://www.autonomi.dev/) flagship product | ~260 lines core**
package/VERSION CHANGED
@@ -1 +1 @@
1
- 7.45.1
1
+ 7.46.0
@@ -155,7 +155,7 @@ SKILL.md (~190 lines) # Always loaded: RARV cycle, autonomy rules
155
155
  skills/
156
156
  00-index.md # Module routing table
157
157
  model-selection.md # Task tool, parallelization
158
- quality-gates.md # 9-gate system, anti-sycophancy
158
+ quality-gates.md # 8-gate system, anti-sycophancy
159
159
  testing.md # Playwright, E2E, property-based
160
160
  production.md # CI/CD, batch processing
161
161
  agents.md # 41 agent types, A2A patterns
@@ -196,7 +196,14 @@ Main Worktree (orchestrator)
196
196
 
197
197
  ---
198
198
 
199
- ## Quality Gates (9-Gate System)
199
+ ## Quality Gates
200
+
201
+ Note: the canonical, implemented quality-gate set is the 8-gate system defined in
202
+ `skills/quality-gates.md` (static analysis, test suite pass/fail, blind 3-reviewer
203
+ code review with severity blocking, anti-sycophancy Devil's Advocate, mock
204
+ integrity, test mutation, documentation coverage, Magic Modules debate). The
205
+ list below is an aspirational SDLC taxonomy and may name phases that are not
206
+ deterministic loop gates; treat `skills/quality-gates.md` as the source of truth.
200
207
 
201
208
  ### Gate 1: Static Analysis
202
209
  ```yaml
@@ -2,7 +2,7 @@
2
2
  # Loki Mode -- sentrux architectural-drift helper (v7.5.14).
3
3
  #
4
4
  # Why this exists:
5
- # Loki's existing 11 quality gates and 3-reviewer council catch correctness
5
+ # Loki's existing 8 quality gates and 3-reviewer council catch correctness
6
6
  # and behavioral regressions, but no current gate emits a deterministic,
7
7
  # per-iteration architecture-drift signal. sentrux (https://github.com/sentrux/sentrux)
8
8
  # is a Rust CLI that scores codebase structure (modularity, acyclicity,
package/autonomy/loki CHANGED
@@ -16135,7 +16135,7 @@ print()
16135
16135
  echo -e "${RED}Test coverage gate FAILED${NC}"
16136
16136
  return 1
16137
16137
  else
16138
- echo -e "${GREEN}Tests passed (min coverage: ${min_coverage}%)${NC}"
16138
+ echo -e "${GREEN}Tests passed (min coverage TARGET, not measured: ${min_coverage}%)${NC}"
16139
16139
  return 0
16140
16140
  fi
16141
16141
  ;;
@@ -27527,7 +27527,7 @@ if test_results:
27527
27527
  test_lines.append(f"Passed: {test_results.get('pass', 'unknown')}")
27528
27528
  test_lines.append(f"Timestamp: {test_results.get('timestamp', 'unknown')}")
27529
27529
  if test_results.get("min_coverage"):
27530
- test_lines.append(f"Min coverage target: {test_results['min_coverage']}%")
27530
+ test_lines.append(f"Min coverage TARGET (not measured): {test_results['min_coverage']}%")
27531
27531
  else:
27532
27532
  test_lines.append("No test result data recorded")
27533
27533
  sections.append(("Tests", test_lines))