loki-mode 5.49.2 → 5.49.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -11,7 +11,7 @@
11
11
  [![Agent Types](https://img.shields.io/badge/Agent%20Types-41-blue)]()
12
12
  [![Benchmarks](https://img.shields.io/badge/Benchmarks-Infrastructure%20Ready-blue)](benchmarks/)
13
13
 
14
- **Current Version: v5.49.2**
14
+ **Current Version: v5.49.3**
15
15
 
16
16
  **[Autonomi](https://www.autonomi.dev/)** | **[Documentation](https://www.autonomi.dev/docs)** | **[GitHub](https://github.com/asklokesh/loki-mode)**
17
17
 
@@ -150,6 +150,16 @@ Loki Mode is powerful but not magic. Be aware of these honest limitations:
150
150
  - Human oversight is expected for: deployment credentials, domain setup, API keys, and critical business decisions
151
151
  - The system is as good as the underlying AI model -- it can make mistakes, especially on novel or complex problems
152
152
 
153
+ ## What To Expect
154
+
155
+ | Project Type | Examples | Autonomy Level | Typical Experience |
156
+ |---|---|---|---|
157
+ | Simple | Landing page, todo app, static site, single API | High | Completes with minimal retries. Human reviews output. |
158
+ | Standard | CRUD app with auth, REST API + React frontend | Medium | Completes most features. Complex components may need guidance. |
159
+ | Complex | Microservices, real-time systems, ML pipelines | Guided | Use as accelerator. Human reviews between phases. |
160
+
161
+ "Autonomous" means the system runs RARV cycles without prompting. It does NOT mean zero oversight.
162
+
153
163
  ---
154
164
 
155
165
  ## Why Loki Mode?
package/SKILL.md CHANGED
@@ -3,7 +3,7 @@ name: loki-mode
3
3
  description: Multi-agent autonomous startup system. Triggers on "Loki Mode". Takes PRD to deployed product with minimal human intervention. Requires --dangerously-skip-permissions flag.
4
4
  ---
5
5
 
6
- # Loki Mode v5.49.2
6
+ # Loki Mode v5.49.3
7
7
 
8
8
  **You are an autonomous agent. You make decisions. You do not ask questions. You do not stop.**
9
9
 
@@ -263,4 +263,4 @@ The following features are documented in skill modules but not yet fully automat
263
263
  | Quality gates 3-reviewer system | Implemented (v5.35.0) | 5 specialist reviewers in `skills/quality-gates.md`; execution in run.sh |
264
264
  | Benchmarks (HumanEval, SWE-bench) | Infrastructure only | Runner scripts and datasets exist in `benchmarks/`; no published results |
265
265
 
266
- **v5.49.2 | [Autonomi](https://www.autonomi.dev/) flagship product | ~260 lines core**
266
+ **v5.49.3 | [Autonomi](https://www.autonomi.dev/) flagship product | ~260 lines core**
package/VERSION CHANGED
@@ -1 +1 @@
1
- 5.49.2
1
+ 5.49.3
@@ -7,7 +7,7 @@ Modules:
7
7
  control: Session control API (start/stop/pause/resume)
8
8
  """
9
9
 
10
- __version__ = "5.49.2"
10
+ __version__ = "5.49.3"
11
11
 
12
12
  # Expose the control app for easy import
13
13
  try:
@@ -85,7 +85,7 @@ GSD is the closest competitor -- a context engineering system that spawns fresh
85
85
 
86
86
  **Strengths:**
87
87
  - 85.9-87.7% Pass@1 on HumanEval
88
- - 100% task completion rate in evaluations
88
+ - High task completion rate in evaluations (100% reported by MetaGPT authors; not independently verified)
89
89
  - Standard Operating Procedures (SOPs) reduce hallucinations
90
90
  - Assembly line paradigm with role specialization
91
91
  - Low cost: ~$1.09 per project completion
@@ -2,7 +2,7 @@
2
2
 
3
3
  The flagship product of [Autonomi](https://www.autonomi.dev/). Complete installation instructions for all platforms and use cases.
4
4
 
5
- **Version:** v5.49.2
5
+ **Version:** v5.49.3
6
6
 
7
7
  ---
8
8
 
@@ -36,9 +36,7 @@ The flagship product of [Autonomi](https://www.autonomi.dev/). Complete installa
36
36
 
37
37
  - [Quick Install (Recommended)](#quick-install-recommended)
38
38
  - [VS Code Extension](#vs-code-extension)
39
- - [npm (Node.js)](#npm-nodejs)
40
- - [Homebrew (macOS/Linux)](#homebrew-macoslinux)
41
- - [Docker](#docker)
39
+ - [Alternative Methods](#alternative-methods)
42
40
  - [Sandbox Mode](#sandbox-mode)
43
41
  - [Multi-Provider Support](#multi-provider-support)
44
42
  - [Claude Code (CLI)](#claude-code-cli)
@@ -53,23 +51,19 @@ The flagship product of [Autonomi](https://www.autonomi.dev/). Complete installa
53
51
 
54
52
  ## Quick Install (Recommended)
55
53
 
56
- Choose your preferred method:
57
-
58
54
  ```bash
59
- # Option A: npm (easiest)
60
- npm install -g loki-mode
55
+ git clone https://github.com/asklokesh/loki-mode.git ~/.claude/skills/loki-mode
56
+ ```
61
57
 
62
- # Option B: Homebrew (macOS/Linux)
63
- brew tap asklokesh/tap && brew install loki-mode
58
+ That's it. Claude Code auto-discovers skills in `~/.claude/skills/`.
64
59
 
65
- # Option C: Docker
66
- docker pull asklokesh/loki-mode:latest
60
+ **Update:** `cd ~/.claude/skills/loki-mode && git pull`
67
61
 
68
- # Option D: Git clone
69
- git clone https://github.com/asklokesh/loki-mode.git ~/.claude/skills/loki-mode
70
- ```
62
+ Skip to [Verify Installation](#verify-installation) to confirm it's working.
71
63
 
72
- **Done!** Skip to [Verify Installation](#verify-installation).
64
+ ### Alternative Installation Methods
65
+
66
+ Also available via npm, Homebrew, Docker, VS Code Extension, and GitHub Action. Each has trade-offs -- see [docs/alternative-installations.md](alternative-installations.md) for details, limitations, and current status of each method.
73
67
 
74
68
  ---
75
69
 
@@ -145,153 +139,42 @@ The extension will automatically connect when it detects the server is running a
145
139
 
146
140
  ---
147
141
 
148
- ## npm (Node.js)
142
+ ## Alternative Methods
149
143
 
150
- Install via npm for the easiest setup with automatic PATH configuration.
144
+ The following installation methods are available but each has limitations. Git clone (above) is the recommended primary method.
151
145
 
152
- ### Prerequisites
146
+ For full details, troubleshooting, and current status of each method, see [alternative-installations.md](alternative-installations.md).
153
147
 
154
- - Node.js 16.0.0 or later
148
+ ### npm
155
149
 
156
- ### Installation
150
+ **Status:** Published to npm registry. Verify current version: `npm view loki-mode version`
157
151
 
158
152
  ```bash
159
- # Global installation
160
153
  npm install -g loki-mode
161
-
162
- # The skill is automatically installed to ~/.claude/skills/loki-mode
163
-
164
- # Opt out of anonymous install telemetry:
165
- # LOKI_TELEMETRY_DISABLED=true npm install -g loki-mode
166
- # Or set DO_NOT_TRACK=1
167
- ```
168
-
169
- ### Usage
170
-
171
- ```bash
172
- # Use the CLI
173
- loki start ./my-prd.md
174
- loki status
175
- loki dashboard
176
-
177
- # Or invoke in Claude Code
178
- claude --dangerously-skip-permissions
179
- > Loki Mode with PRD at ./my-prd.md
180
- ```
181
-
182
- ### Updating
183
-
184
- ```bash
185
- npm update -g loki-mode
186
- ```
187
-
188
- ### Uninstalling
189
-
190
- ```bash
191
- npm uninstall -g loki-mode
192
- rm -rf ~/.claude/skills/loki-mode
193
154
  ```
194
155
 
195
- ---
196
-
197
- ## Homebrew (macOS/Linux)
156
+ Requires Node.js 16+. Provides the `loki` CLI and auto-installs the skill to `~/.claude/skills/loki-mode`.
198
157
 
199
- Install via Homebrew with automatic dependency management.
158
+ ### Homebrew
200
159
 
201
- ### Prerequisites
202
-
203
- - Homebrew (https://brew.sh)
204
-
205
- ### Installation
160
+ **Status:** Available via tap. Verify formula: `brew info asklokesh/tap/loki-mode`
206
161
 
207
162
  ```bash
208
- # Add the tap
209
- brew tap asklokesh/tap
210
-
211
- # Install Loki Mode
212
- brew install loki-mode
213
-
214
- # Set up Claude Code skill integration (manual symlink required)
163
+ brew tap asklokesh/tap && brew install loki-mode
164
+ # Manual symlink required for Claude Code:
215
165
  ln -sf "$(brew --prefix)/opt/loki-mode/libexec" ~/.claude/skills/loki-mode
216
166
  ```
217
167
 
218
- ### Dependencies
219
-
220
- Homebrew automatically installs:
221
- - bash 4.0+ (for associative arrays)
222
- - jq (JSON processing)
223
- - gh (GitHub CLI for integration)
224
-
225
- ### Usage
226
-
227
- ```bash
228
- # Use the CLI
229
- loki start ./my-prd.md
230
- loki status
231
- loki --help
232
- ```
233
-
234
- ### Updating
235
-
236
- ```bash
237
- brew upgrade loki-mode
238
- ```
239
-
240
- ### Uninstalling
168
+ ### Docker
241
169
 
242
- ```bash
243
- brew uninstall loki-mode
244
- rm -rf ~/.claude/skills/loki-mode
245
- ```
246
-
247
- ---
248
-
249
- ## Docker
250
-
251
- Run Loki Mode in a container for isolated execution.
252
-
253
- ### Prerequisites
254
-
255
- - Docker installed and running
256
-
257
- ### Installation
170
+ **Status:** Published to Docker Hub.
258
171
 
259
172
  ```bash
260
- # Pull the image
261
173
  docker pull asklokesh/loki-mode:latest
262
-
263
- # Or use docker-compose
264
- curl -o docker-compose.yml https://raw.githubusercontent.com/asklokesh/loki-mode/main/docker-compose.yml
265
- ```
266
-
267
- ### Usage
268
-
269
- ```bash
270
- # Run with a PRD file
271
174
  docker run -v $(pwd):/workspace -w /workspace asklokesh/loki-mode:latest start ./my-prd.md
272
-
273
- # Interactive mode
274
- docker run -it -v $(pwd):/workspace -w /workspace asklokesh/loki-mode:latest
275
-
276
- # Using docker-compose
277
- docker-compose run loki start ./my-prd.md
278
175
  ```
279
176
 
280
- ### Environment Variables
281
-
282
- Pass your configuration via environment variables:
283
-
284
- ```bash
285
- docker run -e LOKI_MAX_RETRIES=100 -e LOKI_BASE_WAIT=120 \
286
- -v $(pwd):/workspace -w /workspace \
287
- asklokesh/loki-mode:latest start ./my-prd.md
288
- ```
289
-
290
- ### Updating
291
-
292
- ```bash
293
- docker pull asklokesh/loki-mode:latest
294
- ```
177
+ **Limitation:** Docker cannot run Claude Code interactively (Claude Code is a terminal-based CLI requiring TTY access). Docker is suitable for CI/CD pipelines, API-only modes, and sandbox execution -- not for the primary interactive workflow.
295
178
 
296
179
  ---
297
180
 
package/mcp/__init__.py CHANGED
@@ -21,4 +21,4 @@ try:
21
21
  except ImportError:
22
22
  __all__ = ['mcp']
23
23
 
24
- __version__ = '5.49.2'
24
+ __version__ = '5.49.3'
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "loki-mode",
3
- "version": "5.49.2",
3
+ "version": "5.49.3",
4
4
  "description": "Loki Mode by Autonomi - Multi-agent autonomous startup system for Claude Code, Codex CLI, and Gemini CLI",
5
5
  "keywords": [
6
6
  "autonomi",
@@ -165,6 +165,16 @@ IMPLEMENT -> BLIND REVIEW (parallel) -> DEBATE (if disagreement) -> AGGREGATE ->
165
165
  - NEVER dispatch reviewers sequentially (always parallel - 3x faster)
166
166
  - NEVER aggregate before all 3 reviewers complete
167
167
 
168
+ ### Test Quality Review (Apply to Every Review)
169
+
170
+ Before approving, verify:
171
+ - Are tests using real implementations or excessive mocks of internal code?
172
+ - Were any assertion expected values changed in the same commit as implementation? (This is the top sign an agent cheated.)
173
+ - Do tests verify meaningful behavior or just "runs without throwing"?
174
+ - Could all tests pass while the feature is completely broken?
175
+
176
+ Assertion manipulation in the same commit as implementation = CRITICAL finding = automatic REJECT.
177
+
168
178
  ### Anti-Sycophancy Protocol (CONSENSAGENT Research)
169
179
 
170
180
  **Problem:** Reviewers may reinforce each other's findings instead of critically engaging.
@@ -14,6 +14,24 @@
14
14
  8. **Mock Detector** - Classifies internal vs external mocks; flags tests that never import source code, tautological assertions, and high internal mock ratios
15
15
  9. **Test Mutation Detector** - Detects assertion value changes alongside implementation changes (test fitting), low assertion density, and missing pass/fail tracking
16
16
 
17
+ ## Gate 8 and 9: Automated Test Integrity
18
+
19
+ Gates 8 (Mock Detector) and 9 (Test Mutation Detector) run during the VERIFY phase and are enabled by default.
20
+
21
+ **How they run:**
22
+ - Gate 8 runs `tests/detect-mock-problems.sh` against all test files in the project
23
+ - Gate 9 runs `tests/detect-test-mutations.sh` against recent commits (default: last 5, or use `--commit HASH` for targeted checks)
24
+ - Both produce findings at HIGH/MEDIUM/LOW severity levels
25
+ - HIGH findings = automatic FAIL (same as other blocking gates)
26
+
27
+ **Disabling (not recommended):**
28
+ ```bash
29
+ LOKI_GATE_MOCK_DETECTOR=false # Disable gate 8
30
+ LOKI_GATE_MUTATION_DETECTOR=false # Disable gate 9
31
+ ```
32
+
33
+ ---
34
+
17
35
  ## Guardrails Execution Modes
18
36
 
19
37
  - **Blocking**: Guardrail completes before agent starts (use for expensive operations)
package/skills/testing.md CHANGED
@@ -1,5 +1,20 @@
1
1
  # Testing
2
2
 
3
+ ## Mandatory Testing Rules
4
+
5
+ 1. Write tests FIRST. Commit the test before writing implementation.
6
+ 2. Tests must call REAL functions with REAL inputs and assert REAL outputs.
7
+ 3. Mock ONLY external dependencies: HTTP APIs, databases, file system, third-party services.
8
+ 4. NEVER mock internal modules, utility functions, or any code that is part of this project.
9
+ 5. NEVER change a test's expected value to make it pass. If a test fails, the implementation is wrong. Fix the code, not the test.
10
+ 6. If you believe a test expectation is incorrect, document WHY and flag for council review. Do not silently change it.
11
+ 7. Every test file must have at least one assertion per tested function.
12
+
13
+ Gate 8 (mock detector) and Gate 9 (mutation detector) enforce rules 3-5 automatically.
14
+ Violations result in automatic FAIL during VERIFY phase.
15
+
16
+ ---
17
+
3
18
  ## E2E Testing with Playwright MCP
4
19
 
5
20
  **Use Playwright MCP for browser-based testing.**