loki-mode 7.45.0 → 7.46.0
- package/README.md +16 -12
- package/SKILL.md +5 -5
- package/VERSION +1 -1
- package/autonomy/CONSTITUTION.md +9 -2
- package/autonomy/lib/sentrux-gate.sh +1 -1
- package/autonomy/loki +2 -2
- package/autonomy/run.sh +355 -92
- package/dashboard/__init__.py +1 -1
- package/dashboard/registry.py +156 -62
- package/dashboard/server.py +9 -10
- package/docs/COMPARISON.md +10 -10
- package/docs/COMPETITIVE-ANALYSIS.md +1 -1
- package/docs/INSTALLATION.md +2 -2
- package/docs/P0-SWEEP-PLAN.md +163 -0
- package/docs/architecture/STATE-MACHINES.md +18 -19
- package/docs/architecture/bmad-loki-voice-agent-council-analysis.md +1 -1
- package/docs/auto-claude-comparison.md +14 -11
- package/docs/certification/01-core-concepts/lesson.md +12 -11
- package/docs/certification/01-core-concepts/quiz.md +6 -6
- package/docs/certification/05-troubleshooting/lesson.md +23 -13
- package/docs/certification/05-troubleshooting/quiz.md +3 -3
- package/docs/certification/answer-key.md +2 -2
- package/docs/certification/certification-exam.md +9 -9
- package/docs/competitive/bolt-new-analysis.md +1 -1
- package/docs/competitive/emergence-others-analysis.md +9 -9
- package/docs/competitive/replit-lovable-analysis.md +3 -3
- package/docs/cursor-comparison.md +15 -12
- package/docs/dashboard-guide.md +9 -7
- package/docs/prd-purple-lab-platform-v2.md +1 -1
- package/docs/prd-purple-lab-platform.md +3 -3
- package/docs/show-hn-post.md +2 -2
- package/loki-ts/dist/loki.js +2 -2
- package/mcp/__init__.py +1 -1
- package/package.json +2 -2
- package/plugins/loki-mode/.claude-plugin/plugin.json +2 -2
- package/plugins/loki-mode/README.md +1 -1
- package/references/magic-rarv-integration.md +1 -1
- package/references/quality-control.md +5 -5
- package/references/sdlc-phases.md +1 -2
- package/skills/00-index.md +1 -1
- package/skills/artifacts.md +1 -1
- package/skills/healing.md +1 -1
- package/skills/magic-modules.md +3 -3
- package/skills/quality-gates.md +52 -39
- package/skills/testing.md +1 -1
package/README.md
CHANGED
@@ -19,14 +19,14 @@ _The free, source-available autonomous coding agent by [Autonomi](https://www.au
 
 ---
 
-> **How it works:** Drop a spec -- a PRD, GitHub issue, OpenAPI/JSON/YAML, or one-line brief. Loki Mode classifies complexity (`run.sh:detect_complexity()`), assembles an agent team from 41 specialized agent roles across 8 domains - prompt-defined specifications the orchestrator adopts per phase, with parallel review (blind council) and optional worktree streams on Claude Code, sequential on other providers - and runs autonomous RARV cycles (Reason - Act - Reflect - Verify, see `run.sh:run_autonomous()`) with
+> **How it works:** Drop a spec -- a PRD, GitHub issue, OpenAPI/JSON/YAML, or one-line brief. Loki Mode classifies complexity (`run.sh:detect_complexity()`), assembles an agent team from 41 specialized agent roles across 8 domains - prompt-defined specifications the orchestrator adopts per phase, with parallel review (blind council) and optional worktree streams on Claude Code, sequential on other providers - and runs autonomous RARV cycles (Reason - Act - Reflect - Verify, see `run.sh:run_autonomous()`) with 8 quality gates (see `skills/quality-gates.md`). Code is not "done" until it passes automated verification. Output is a Git repo with source, tests, configs, and audit logs.
 
 ---
 
 ## Why Loki Mode?
 
 - **Spec-driven, autonomous, with a built-in trust layer** -- Hand Loki a spec, walk away, come back to working code with tests. The full RARV-C closure loop (Reason - Act - Reflect - Verify - Close) runs until the work is actually done, not just attempted. The verified-completion evidence gate (`skills/quality-gates.md`) refuses any "done" claim on an empty git diff against the run-start commit, and blocks completion when tests run red, so "complete" means proven, not promised.
-- **Production quality built in** --
+- **Production quality built in** -- 8 quality gates (`skills/quality-gates.md`), blind 3-reviewer code review (`run.sh:run_code_review()`), anti-sycophancy checks
 - **Standalone verification: `loki verify`** -- Run Loki's deterministic gates (build, tests, static analysis, secret scan, dependency audit) against any branch or PR diff, including code written by other agents or humans. CI-ready exit codes (0 VERIFIED, 1 CONCERNS, 2 BLOCKED), machine-readable evidence at `.loki/verify/evidence.json`. Inconclusive evidence is never reported as VERIFIED (v7.27.0).
 - **Living spec and pre-build interrogation** -- `loki spec` locks a spec and detects drift deterministically (`spec.lock`, `drift-report.json`, and a `SPEC_DRIFT` finding in `loki verify` with CI exit codes), so you can tell when the build diverges from what was agreed. `loki grill` runs a Devil's-Advocate interrogation of the spec before you build, surfacing gaps and contradictions early (v7.28.0).
 - **Mid-flight model switching** -- switch the model a live run uses from the dashboard (applies at the next iteration, current run only). A Fable tier lever exists in the CLI, dashboard, and override paths, but Claude Fable 5 is not yet available at the API, so selecting Fable currently collapses to Opus at every dispatch chokepoint and the `loki plan` quote reflects Opus accordingly. For every model lever (session pin, mid-flight override, architect pass) and every `LOKI_MAX_TIER` path, the `loki plan` quote, the dashboard's reported model, and the actual dispatched model agree, with the ceiling enforced (v7.31.0; Fable-to-Opus collapse v7.39.1).
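The README text in this hunk lists three exit codes for `loki verify` (0 VERIFIED, 1 CONCERNS, 2 BLOCKED). A CI wrapper might interpret them roughly as below. This is a minimal sketch, not Loki's own code: `VerifyVerdict`, `should_fail_ci`, and the `fail_on_concerns` policy switch are hypothetical names introduced for illustration, and treating unknown codes as failures is an assumption in the spirit of "inconclusive is never VERIFIED".

```python
from enum import IntEnum


class VerifyVerdict(IntEnum):
    """Exit codes the README lists for `loki verify`."""
    VERIFIED = 0
    CONCERNS = 1
    BLOCKED = 2


def should_fail_ci(exit_code: int, fail_on_concerns: bool = False) -> bool:
    """Decide whether a CI job should fail for a given exit code.

    `fail_on_concerns` is a hypothetical policy knob, not a Loki option.
    Exit codes outside the documented set are treated as failures
    (an assumption), so nothing inconclusive is waved through.
    """
    try:
        verdict = VerifyVerdict(exit_code)
    except ValueError:
        return True  # undocumented exit code: fail closed
    if verdict is VerifyVerdict.BLOCKED:
        return True
    if verdict is VerifyVerdict.CONCERNS:
        return fail_on_concerns
    return False  # VERIFIED
```

A pipeline would pass the process return code of `loki verify` to `should_fail_ci` and choose per branch whether CONCERNS should block a merge.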
@@ -41,7 +41,7 @@ _The free, source-available autonomous coding agent by [Autonomi](https://www.au
 - **MCP server** -- 34 tools (including ChromaDB code search) plus 3 resources and 2 prompts (`mcp/server.py`, with magic tools registered from `mcp/magic_tools.py` and the managed-memory tool from `mcp/managed_tools.py`). Of the 34, 33 are always available; `loki_memory_redact` is registered but only succeeds when `LOKI_MANAGED_AGENTS=true` and `LOKI_MANAGED_MEMORY=true`. Launch with `loki mcp` (bootstraps the Python MCP SDK on first run).
 - **Full-stack output** -- Source code, tests, Docker Compose stacks (multi-service with healthchecks), CI/CD pipelines, audit logs
 - **Provider-agnostic** -- runs on Claude, Codex, Cline, or Aider with automatic failover (`loki-ts/src/runner/providers.ts`); no vendor lock-in. Gemini CLI deprecated v7.5.18; Antigravity CLI coming soon.
-- **
+- **Source-available (BUSL-1.1)** -- Free for personal, internal, and academic use.
 
 ---
 
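The MCP bullet in this hunk says `loki_memory_redact` only succeeds when both `LOKI_MANAGED_AGENTS=true` and `LOKI_MANAGED_MEMORY=true`. The gating condition could look something like the sketch below. The function name `managed_memory_enabled` is hypothetical, and the exact, case-sensitive comparison against the string `true` is an assumption; the real check in `mcp/managed_tools.py` may differ.

```python
import os
from typing import Mapping, Optional


def managed_memory_enabled(env: Optional[Mapping[str, str]] = None) -> bool:
    """Return True only when both flags named in the README are set to "true".

    Assumption: the comparison is an exact match on the lowercase string
    "true"; the actual implementation may accept other spellings.
    """
    env = os.environ if env is None else env
    return (
        env.get("LOKI_MANAGED_AGENTS") == "true"
        and env.get("LOKI_MANAGED_MEMORY") == "true"
    )
```

Passing the environment in as a mapping keeps the check easy to exercise without mutating the real process environment.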
@@ -193,7 +193,7 @@ A "spec" is whatever you hand `loki start`. Loki auto-detects the format and nor
 | OpenSpec change directory | `loki start --openspec ./openspec/change-001` | Reads OpenSpec change manifest + delta files. |
 | Auto-detect (no input) | `loki start` | Picks up `./prd.md`, `./spec.{json,yaml,yml}`, or `./SPEC.md` from cwd. |
 
-All formats land in the same RARV pipeline and pass the same
+All formats land in the same RARV pipeline and pass the same 8 quality gates (`skills/quality-gates.md`).
 
 </details>
 
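The "Auto-detect (no input)" row in this hunk names the files `loki start` picks up from the current directory. A lookup along those lines might be sketched as follows. The name `find_spec` is hypothetical, and the probe order (the order the README lists the files) is an assumption; the README does not state which file wins when several are present.

```python
from pathlib import Path
from typing import Optional

# File names taken from the README row; the priority order is assumed.
SPEC_CANDIDATES = ("prd.md", "spec.json", "spec.yaml", "spec.yml", "SPEC.md")


def find_spec(cwd: Path) -> Optional[Path]:
    """Return the first candidate spec file that exists in `cwd`, else None."""
    for name in SPEC_CANDIDATES:
        candidate = cwd / name
        if candidate.is_file():
            return candidate
    return None
```

Calling `find_spec(Path.cwd())` returns a path to hand to the rest of the pipeline, or `None` when no spec is present.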
@@ -252,8 +252,8 @@ Every iteration: **Reason** (read state) - **Act** (execute, commit) - **Reflect
 </td>
 <td width="33%" valign="top">
 
-###
-
+### 8 Quality Gates
+Static analysis, test suite (pass/fail), blind 3-reviewer code review with severity blocking, anti-sycophancy Devil's Advocate, mock-integrity detection, test-mutation detection, documentation coverage, and Magic Modules debate. Backward-compatibility is a conditional healing-mode auditor (not one of the 8). Code does not ship until all gates pass.
 
 [Quality Gates](skills/quality-gates.md)
 
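The added README text enumerates eight gates and states that code does not ship until all of them pass. That "all must pass" rule can be illustrated with the sketch below. The identifiers in `GATE_NAMES` are hypothetical labels derived from the README wording, `evaluate_gates` is not a Loki function, and counting a missing check as a failure is an assumption.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical identifiers for the eight gates the README names.
GATE_NAMES = (
    "static_analysis",
    "test_suite",
    "blind_code_review",
    "anti_sycophancy",
    "mock_integrity",
    "test_mutation",
    "documentation_coverage",
    "magic_modules_debate",
)


def evaluate_gates(checks: Dict[str, Callable[[], bool]]) -> Tuple[bool, List[str]]:
    """Run every gate and report (all_passed, names_of_failed_gates).

    A gate with no registered check counts as failed (an assumption),
    so an incomplete configuration cannot pass by omission.
    """
    failed: List[str] = []
    for name in GATE_NAMES:
        check = checks.get(name)
        if check is None or not check():
            failed.append(name)
    return (not failed, failed)
```

Every gate runs even after an earlier one fails, so the returned list names all failing gates instead of only the first.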
@@ -303,15 +303,15 @@ The historical feature set (platform pages, Monaco IDE workspace, AI chat panel)
 |---------|:---------:|:--------:|:------:|:-------:|
 | Self-hosted / your keys | Yes | No | No | No |
 | 5 AI provider failover | Yes | No | No | No |
-|
+| 8 quality gates | Yes | No | No | No |
 | Blind code review | Yes | No | No | No |
 | Enterprise auth (SSO/RBAC) | Yes | No | Yes | No |
 | Air-gapped deployment | Yes | No | No | No |
 | Docker + CI/CD generation | Yes | No | Yes | No |
-|
-| Free tier |
+| Source-available (BUSL-1.1) | Yes | No | No | No |
+| Free tier | Source-available | Yes | Yes | Yes |
 
-Loki Mode is the only platform that is fully self-hosted,
+Loki Mode is the only platform that is fully self-hosted, source-available (BUSL-1.1), and includes automated quality verification. Your code, your keys, your infrastructure.
 
 ---
 
@@ -432,7 +432,7 @@ See [benchmarks/](benchmarks/) for methodology.
 
 
 
-*11 slides: Problem, Solution, 41 Agents, RARV Cycle,
+*11 slides: Problem, Solution, 41 Agents, RARV Cycle, 8 Quality Gates (HumanEval 98.78%), Multi-Provider, Enterprise Hardening (Live App Preview), Full Lifecycle*
 
 **[Download PPTX](docs/loki-mode-presentation.pptx)**
 
@@ -446,7 +446,7 @@ See [benchmarks/](benchmarks/) for methodology.
 |------|-----------|---------------------|
 | **Code Gen** | Full-stack apps from PRDs | Complex domain logic may need human review |
 | **Deploy** | Generates configs, Dockerfiles, CI/CD | Does not deploy -- human runs deploy commands |
-| **Testing** |
+| **Testing** | 8 automated quality gates | Test quality depends on AI assertions |
 | **Providers** | 5 providers with auto-failover | Non-Claude providers lack parallel agents |
 | **Dashboard** | Real-time single-machine monitoring | No multi-node clustering |
 
@@ -499,3 +499,7 @@ See [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines.
 **[Autonomi](https://www.autonomi.dev/)** | **[Documentation](wiki/Home.md)** | **[Changelog](CHANGELOG.md)** | **[Comparisons](references/competitive-analysis.md)**
 
 </div>
+
+## Star History
+
+[](https://www.star-history.com/?repos=asklokesh%2Floki-mode&type=timeline&logscale=&legend=bottom-right)
package/SKILL.md
CHANGED
@@ -1,13 +1,13 @@
 ---
 name: loki-mode
-description: Autonomous spec-driven build system with a built-in trust layer. It does not call work done until it is verified (RARV-C closure loop,
+description: Autonomous spec-driven build system with a built-in trust layer. It does not call work done until it is verified (RARV-C closure loop, 8 quality gates, completion council, verified-completion evidence gate). Triggers on "Loki Mode". Takes a spec (PRD, GitHub issue, OpenAPI doc, etc.) to deployed product with minimal human intervention. Provider-agnostic. Requires --dangerously-skip-permissions flag.
 ---
 
-# Loki Mode v7.
+# Loki Mode v7.46.0
 
 **You are an autonomous agent. You make decisions. You do not ask questions. You do not stop.**
 
-**Spec in, verified product out.** Spec-driven: a "spec" is whatever describes the work -- a Markdown PRD, a GitHub issue, an OpenAPI doc, a Jira ticket (a PRD is one form of spec). The differentiator is the trust layer: Loki does not call work done until it is verified. The RARV-C closure loop,
+**Spec in, verified product out.** Spec-driven: a "spec" is whatever describes the work -- a Markdown PRD, a GitHub issue, an OpenAPI doc, a Jira ticket (a PRD is one form of spec). The differentiator is the trust layer: Loki does not call work done until it is verified. The RARV-C closure loop, 8 quality gates, the completion council, and the verified-completion evidence gate must all clear before completion is accepted.
 
 **Provider-agnostic (stable since v5.0.0):** runs on Claude/Codex/Cline/Aider with abstract model tiers and degraded mode for non-Claude providers; no vendor lock-in. Gemini deprecated v7.5.18. See `skills/providers.md`. **Current track (v7.7.x):** LSP grounding as first-class agent tool (v7.7.0-v7.7.9; lsp_get_diagnostics actually-returns-diagnostics regression fix v7.7.14), provider_source cli (v7.7.11-v7.7.12 bash/bun parity), Docker/bash-3.2 robustness (v7.7.13), audit chain cross-file verification fix (v7.7.15), Phase 1 RARV-C closure (real provider judges, gate-failure flock, synthetic PRD e2e, status `--json`).
 
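The verified-completion evidence gate named in this hunk is described in the README as refusing a "done" claim on an empty git diff against the run-start commit and blocking completion when tests run red. The decision rule could be sketched as below. Both function names are hypothetical, and treating an unknown test result (`None`) as not complete is an assumption; `git diff --name-only <commit>` is standard git and only reports tracked files.

```python
import subprocess
from typing import List, Optional


def changed_files_since(start_commit: str, repo_dir: str = ".") -> List[str]:
    """List tracked files that differ from `start_commit`.

    Uses `git diff --name-only`, which compares the working tree with the
    given commit; untracked files are not reported.
    """
    result = subprocess.run(
        ["git", "diff", "--name-only", start_commit],
        cwd=repo_dir,
        capture_output=True,
        text=True,
        check=True,
    )
    return [line for line in result.stdout.splitlines() if line.strip()]


def completion_allowed(changed_files: List[str], tests_passed: Optional[bool]) -> bool:
    """Accept a "done" claim only with a non-empty diff and green tests.

    `tests_passed` is None when no test result is available; that case is
    rejected here (an assumption), so missing evidence never counts as proof.
    """
    if not changed_files:
        return False  # empty diff against the run-start commit
    return tests_passed is True
```

Separating the git query from the decision keeps the rule itself testable without a repository.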
@@ -125,7 +125,7 @@ DEPLOYMENT ──[production live, monitoring active]──> GROWTH
 GROWTH ──[continuous improvement loop]──> GROWTH
 ```
 
-**Transition requires:** All phase quality gates passed. No Critical/High/
+**Transition requires:** All phase quality gates passed. No Critical/High issues (Medium/Low advisory).
 
 ---
 
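The revised transition rule in this hunk makes Critical and High issues blocking and Medium and Low issues advisory. A check of that shape might be written as follows. The `(severity, message)` tuple layout and the names `partition_findings` and `can_transition` are hypothetical; treating an unrecognized severity as blocking is an assumption.

```python
from typing import Iterable, List, Tuple

Finding = Tuple[str, str]  # (severity, message); a hypothetical shape

ADVISORY_SEVERITIES = {"medium", "low"}


def partition_findings(findings: Iterable[Finding]) -> Tuple[List[Finding], List[Finding]]:
    """Split findings into (blocking, advisory) lists.

    Critical and High land in `blocking`. Any severity that is not
    Medium or Low is also treated as blocking (an assumption).
    """
    blocking: List[Finding] = []
    advisory: List[Finding] = []
    for severity, message in findings:
        if severity.strip().lower() in ADVISORY_SEVERITIES:
            advisory.append((severity, message))
        else:
            blocking.append((severity, message))
    return blocking, advisory


def can_transition(findings: Iterable[Finding]) -> bool:
    """A phase transition is allowed only when nothing is blocking."""
    blocking, _ = partition_findings(findings)
    return not blocking
```

Advisory findings are still returned so a report can show them without stopping the transition.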
@@ -407,4 +407,4 @@ See `CHANGELOG.md` entries [7.5.7], [7.5.8], [7.5.13] for the per-fix list and r
 
 ---
 
-**v7.
+**v7.46.0 | [Autonomi](https://www.autonomi.dev/) flagship product | ~260 lines core**
package/VERSION
CHANGED
@@ -1 +1 @@
-7.
+7.46.0
package/autonomy/CONSTITUTION.md
CHANGED
@@ -155,7 +155,7 @@ SKILL.md (~190 lines) # Always loaded: RARV cycle, autonomy rules
 skills/
 00-index.md # Module routing table
 model-selection.md # Task tool, parallelization
-quality-gates.md #
+quality-gates.md # 8-gate system, anti-sycophancy
 testing.md # Playwright, E2E, property-based
 production.md # CI/CD, batch processing
 agents.md # 41 agent types, A2A patterns
@@ -196,7 +196,14 @@ Main Worktree (orchestrator)
 
 ---
 
-## Quality Gates
+## Quality Gates
+
+Note: the canonical, implemented quality-gate set is the 8-gate system defined in
+`skills/quality-gates.md` (static analysis, test suite pass/fail, blind 3-reviewer
+code review with severity blocking, anti-sycophancy Devil's Advocate, mock
+integrity, test mutation, documentation coverage, Magic Modules debate). The
+list below is an aspirational SDLC taxonomy and may name phases that are not
+deterministic loop gates; treat `skills/quality-gates.md` as the source of truth.
 
 ### Gate 1: Static Analysis
 ```yaml
package/autonomy/lib/sentrux-gate.sh
CHANGED
@@ -2,7 +2,7 @@
 # Loki Mode -- sentrux architectural-drift helper (v7.5.14).
 #
 # Why this exists:
-# Loki's existing
+# Loki's existing 8 quality gates and 3-reviewer council catch correctness
 # and behavioral regressions, but no current gate emits a deterministic,
 # per-iteration architecture-drift signal. sentrux (https://github.com/sentrux/sentrux)
 # is a Rust CLI that scores codebase structure (modularity, acyclicity,
package/autonomy/loki
CHANGED
@@ -16135,7 +16135,7 @@ print()
     echo -e "${RED}Test coverage gate FAILED${NC}"
     return 1
 else
-    echo -e "${GREEN}Tests passed (min coverage: ${min_coverage}%)${NC}"
+    echo -e "${GREEN}Tests passed (min coverage TARGET, not measured: ${min_coverage}%)${NC}"
     return 0
 fi
 ;;
@@ -27527,7 +27527,7 @@ if test_results:
     test_lines.append(f"Passed: {test_results.get('pass', 'unknown')}")
     test_lines.append(f"Timestamp: {test_results.get('timestamp', 'unknown')}")
     if test_results.get("min_coverage"):
-        test_lines.append(f"Min coverage
+        test_lines.append(f"Min coverage TARGET (not measured): {test_results['min_coverage']}%")
 else:
     test_lines.append("No test result data recorded")
 sections.append(("Tests", test_lines))
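The two `package/autonomy/loki` hunks relabel the coverage figure as a target that is not measured. The reporting logic in the second hunk can be reproduced as a standalone function to show the resulting lines. This is a sketch based only on the lines visible in the diff: the wrapper name `format_test_section` is hypothetical, and the surrounding code in the real file may differ.

```python
from typing import Any, Dict, List, Optional


def format_test_section(test_results: Optional[Dict[str, Any]]) -> List[str]:
    """Build the report lines for the "Tests" section.

    Mirrors the lines shown in the hunk: the coverage number is reported
    as a configured target, not as a measured value.
    """
    test_lines: List[str] = []
    if test_results:
        test_lines.append(f"Passed: {test_results.get('pass', 'unknown')}")
        test_lines.append(f"Timestamp: {test_results.get('timestamp', 'unknown')}")
        if test_results.get("min_coverage"):
            test_lines.append(
                f"Min coverage TARGET (not measured): {test_results['min_coverage']}%"
            )
    else:
        test_lines.append("No test result data recorded")
    return test_lines
```

With the relabelled string, a reader of the report can no longer mistake the configured threshold for an observed coverage percentage.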