loki-mode 5.49.0 → 5.49.2

@@ -12,7 +12,7 @@
12
12
  | Feature | **Loki Mode** | **Zencoder** | **Devin** | **OpenAI Codex** | **Cursor** | **Claude Code** | **Kiro** | **Antigravity** | **Amazon Q** | **OpenCode** |
13
13
  |---------|--------------|--------------|-----------|-----------------|------------|-----------------|----------|-----------------|--------------|--------------|
14
14
  | **Type** | Skill/Framework | Enterprise Platform | Standalone Agent | Cloud Agent | AI IDE | CLI Agent | AI IDE | AI IDE | Cloud Agent | AI IDE (OSS) |
15
- | **Autonomy Level** | Full (zero human) | High | Full | High | Medium-High | High | High | High | High | High |
15
+ | **Autonomy Level** | High (minimal human) | High | Full | High | Medium-High | High | High | High | High | High |
16
16
  | **Max Runtime** | Unlimited | Async/Scheduled | Hours | Per-task | Session | Session | Days | Async | Per-task | Session |
17
17
  | **Pricing** | Free (OSS) | Enterprise | $20/mo | ChatGPT Plus | $20/mo | API costs | Free preview | Free preview | $19/mo | Free (OSS) |
18
18
  | **Open Source** | Yes | No | No | No | No | No | No | No | No | Yes |
@@ -24,7 +24,7 @@
24
24
 
25
25
  | Feature | **Loki Mode** | **Devin** | **Codex** | **Cursor** | **Kiro** | **Antigravity** | **Amazon Q** | **OpenCode** |
26
26
  |---------|--------------|-----------|-----------|------------|----------|-----------------|--------------|--------------|
27
- | **Multi-Agent** | 41 agents in 7 swarms | Single | Single | Up to 8 parallel | Background | Manager Surface | Multiple types | 4 built-in |
27
+ | **Multi-Agent** | 41 agents in 8 swarms | Single | Single | Up to 8 parallel | Background | Manager Surface | Multiple types | 4 built-in |
28
28
  | **Orchestration** | Full orchestrator | N/A | N/A | Git worktree | Hooks | Manager view | Workflow | Subagents |
29
29
  | **Parallel Exec** | 10+ Haiku, 4 impl (worktree) | No | No | 8 max | Yes | Yes | Yes | Yes |
30
30
  | **Agent Swarms** | Eng, Ops, Business, Data, Product, Growth, Review | N/A | N/A | N/A | N/A | N/A | 3 types | N/A |
@@ -37,7 +37,7 @@
37
37
  |---------|--------------|-----------|-----------|------------|----------|-----------------|--------------|--------------|
38
38
  | **Code Review** | 3 blind reviewers + devil's advocate | Basic | Basic | BugBot PR | Property-based | Artifacts | Doc/Review | Basic |
39
39
  | **Anti-Sycophancy** | Yes (CONSENSAGENT) | No | No | No | No | No | No | No |
40
- | **Quality Gates** | 7 gates + PBT | Basic | Sandbox | Tests | Spec validation | Artifact checks | Tests | Permissions |
40
+ | **Quality Gates** | 9 gates + PBT | Basic | Sandbox | Tests | Spec validation | Artifact checks | Tests | Permissions |
41
41
  | **Constitutional AI** | Yes (principles) | No | Refusal training | No | No | No | No | No |
42
42
 
43
43
  ---
@@ -146,7 +146,7 @@
146
146
 
147
147
  | Feature | **Zencoder** | **Loki Mode** | **Assessment** |
148
148
  |---------|-------------|---------------|----------------|
149
- | **Four Pillars** | Structured Workflows, SDD, Multi-Agent Verification, Parallel Execution | SDLC + RARV + 7 Gates + Worktrees | TIE |
149
+ | **Four Pillars** | Structured Workflows, SDD, Multi-Agent Verification, Parallel Execution | SDLC + RARV + 9 Gates + Worktrees | TIE |
150
150
  | **Spec-Driven Dev** | Specs as first-class objects | OpenAPI-first | TIE |
151
151
  | **Multi-Agent Verification** | Model diversity (Claude vs OpenAI, 54% improvement) | 3 blind reviewers + devil's advocate | Different approach (N/A for Claude Code - only Claude models) |
152
152
  | **Quality Gates** | Built-in verification loops | 7 explicit gates + anti-sycophancy | **Loki Mode** |
@@ -180,9 +180,9 @@
180
180
 
181
181
  1. **Quality Control**: 7 explicit gates + blind review + devil's advocate vs built-in loops
182
182
  2. **Memory System**: 3-tier (episodic/semantic/procedural) with cross-project learning
183
- 3. **Agent Specialization**: 41 pre-defined specialized agents across 7 swarms
183
+ 3. **Agent Specialization**: 41 pre-defined specialized agents across 8 swarms
184
184
  4. **Anti-Sycophancy**: CONSENSAGENT patterns prevent reviewer groupthink
185
- 5. **Autonomy Design**: Zero human intervention from PRD to production
185
+ 5. **Autonomy Design**: Minimal human intervention from PRD to production
186
186
  6. **Research Foundation**: 10+ academic papers integrated vs proprietary
187
187
 
188
188
  ### Where Zencoder EXCEEDS Loki Mode
@@ -203,13 +203,13 @@
203
203
  |---------|--------------|---------|-----------------|------------|-----------------|---------------------|----------------|
204
204
  | **Stars** | 594 | 11,903 | 35K+ | 26K+ | 13.7K | N/A | N/A |
205
205
  | **npm/wk** | 6.1K | 21.4K | N/A | N/A | N/A | N/A | N/A |
206
- | **Agents** | 41 in 7 swarms | 11 agents | Fresh per task | 108 agents | Swarm-based | 32 agents | N/A |
206
+ | **Agents** | 41 in 8 swarms | 11 agents | Fresh per task | 108 agents | Swarm-based | 32 agents | N/A |
207
207
  | **Skills** | Progressive disclosure | 6 slash commands | N/A | 129 skills | N/A | 35 skills | Memory focus |
208
208
  | **Multi-Provider** | Yes (Claude/Codex/Gemini) | 3 CLIs (separate) | No | No | No | No | No |
209
209
  | **Memory System** | 3-tier (episodic/semantic/procedural) | None | N/A | N/A | Hybrid | N/A | SQLite+FTS5 |
210
- | **Quality Gates** | 7 gates + Completion Council | User verify only | Two-Stage Review | N/A | Consensus | Tiered | N/A |
210
+ | **Quality Gates** | 9 gates + Completion Council | User verify only | Two-Stage Review | N/A | Consensus | Tiered | N/A |
211
211
  | **Context Mgmt** | Standard | Fresh per task (core innovation) | Fresh per task | N/A | N/A | N/A | Progressive |
212
- | **Autonomy** | Full (zero human) | Semi (checkpoints) | Human-guided | Human-guided | Orchestrated | Human-guided | N/A |
212
+ | **Autonomy** | High (minimal human) | Semi (checkpoints) | Human-guided | Human-guided | Orchestrated | Human-guided | N/A |
213
213
 
214
214
  ### What Loki Mode LACKS (Honest Assessment)
215
215
 
@@ -232,11 +232,11 @@ These are patterns from competing projects that are **practically and scientific
232
232
  |----------|---------|-------------------------|
233
233
  | **Multi-Provider Support** | Only skill supporting Claude, Codex, and Gemini with graceful degradation | All 8 competitors are Claude-only |
234
234
  | **RARV Cycle** | Reason-Act-Reflect-Verify is more rigorous than Plan-Execute | Most use simple Plan-Execute |
235
- | **7-Gate Quality System** | Static analysis + 3 reviewers + devil's advocate + anti-sycophancy + severity blocking + coverage + debate | Superpowers has 2-stage, others have less |
235
+ | **9-Gate Quality System** | Static analysis + 3 reviewers + devil's advocate + anti-sycophancy + severity blocking + coverage + debate | Superpowers has 2-stage, others have less |
236
236
  | **Constitutional AI Integration** | Principles-based self-critique from Anthropic research | None have this |
237
237
  | **Anti-Sycophancy (CONSENSAGENT)** | Blind review + devil's advocate prevents groupthink | None have this |
238
238
  | **Provider Abstraction Layer** | Clean degradation from full-featured to sequential-only | Claude-only projects can't degrade |
239
- | **41 Specialized Agents** | Purpose-built agents in 7 swarms vs generic | agents (108) has more but less organized |
239
+ | **41 Specialized Agents** | Purpose-built agents in 8 swarms vs generic | agents (108) has more but less organized |
240
240
  | **Research Foundation** | 10+ academic papers integrated with citations | Most have no research backing |
241
241
 
242
242
  ### Superpowers Deep-Dive (35K+ Stars)
@@ -342,7 +342,7 @@ Tiered agent architecture with explicit escalation:
342
342
 
343
343
  | Agent | Killer Feature |
344
344
  |-------|---------------|
345
- | **Loki Mode** | Zero-human-intervention full SDLC, 41 agents in 7 swarms, Constitutional AI, anti-sycophancy, cross-project learning, code transformation, property-based testing |
345
+ | **Loki Mode** | Minimal-human-intervention full SDLC, 41 agents in 8 swarms, Constitutional AI, anti-sycophancy, cross-project learning, code transformation, property-based testing |
346
346
  | **Devin** | Full software engineer persona, Slack integration, 67% PR merge rate |
347
347
  | **OpenAI Codex** | Skills marketplace, $skill-creator, GPT-5.2-Codex, secure sandbox |
348
348
  | **Cursor** | 8 parallel agents, BugBot, Memories, $10B valuation, Composer model (250 tok/s) |
@@ -357,9 +357,9 @@ Tiered agent architecture with explicit escalation:
357
357
 
358
358
  | Dimension | Loki Mode Advantage |
359
359
  |-----------|-------------------|
360
- | **Autonomy** | Only agent designed for TRUE zero human intervention |
361
- | **Multi-Agent** | 41 specialized agents in 7 swarms vs 1-8 in competitors |
362
- | **Quality** | 7 gates + blind review + devil's advocate + property-based testing |
360
+ | **Autonomy** | Designed for high autonomy with minimal human intervention |
361
+ | **Multi-Agent** | 41 specialized agents in 8 swarms vs 1-8 in competitors |
362
+ | **Quality** | 9 gates + blind review + devil's advocate + property-based testing |
363
363
  | **Research** | 10+ academic papers integrated vs proprietary/undisclosed |
364
364
  | **Anti-Sycophancy** | Only agent with CONSENSAGENT-based blind review |
365
365
  | **Memory** | 3-tier memory (episodic/semantic/procedural) + review learning + cross-project |
@@ -20,7 +20,7 @@ GSD is the closest competitor -- a context engineering system that spawns fresh
20
20
  | Adoption | 594 stars, 6K/wk npm | 11,903 stars, 21K/wk npm | GSD (20x) |
21
21
  | Simplicity | Complex (5.4K-line run.sh, 12 Python modules) | Simple (markdown agents + slash commands) | GSD |
22
22
  | Full autonomy | Walk away, come back to deployed product | Human checkpoints at discuss/verify/milestone | Loki |
23
- | Quality gates | 7-gate + Completion Council + anti-sycophancy | User verification only | Loki |
23
+ | Quality gates | 9-gate + Completion Council + anti-sycophancy | User verification only | Loki |
24
24
  | Memory system | Episodic/semantic/procedural + vector search | None | Loki |
25
25
  | Context management | Standard | Fresh subagent contexts per task (core innovation) | GSD |
26
26
  | Time to value | Learn architecture, understand CLI flags | `npx get-shit-done-cc` and go | GSD |
@@ -37,9 +37,9 @@ GSD is the closest competitor -- a context engineering system that spawns fresh
37
37
  |---------|-----------|-------------|---------|--------|--------------|-------|
38
38
  | **GitHub Stars** | 594 | 13,700 | 62,400 | 25,000+ | N/A (Commercial) | N/A (Commercial) |
39
39
  | **Agent Count** | 41 types | 64+ agents | 5 roles | Unlimited | 8 parallel | 1 autonomous |
40
- | **Parallel Execution** | Yes (100+) | Yes (swarms) | Sequential | Yes (crews) | Yes (8 worktrees) | Yes (fleet) |
41
- | **Published Benchmarks** | **98.78% HumanEval (multi-agent)** | None | 85.9-87.7% HumanEval | None | ~250 tok/s | 15% complex tasks |
42
- | **SWE-bench Score** | **99.67% patch gen (299/300)** | Unknown | Unknown | Unknown | Unknown | 15% complex |
40
+ | **Parallel Execution** | Yes (multi-agent) | Yes (swarms) | Sequential | Yes (crews) | Yes (8 worktrees) | Yes (fleet) |
41
+ | **Published Benchmarks** | 98.78% HumanEval (self-reported, max 3 retries) | None | 85.9-87.7% HumanEval | None | ~250 tok/s | 15% complex tasks |
42
+ | **SWE-bench Score** | 99.67% patch gen (unevaluated, 299/300) | Unknown | Unknown | Unknown | Unknown | 15% complex |
43
43
  | **Full SDLC** | Yes (8 phases) | Yes | Partial | Partial | No | Partial |
44
44
  | **Business Ops** | **Yes (8 agents)** | No | No | No | No | No |
45
45
  | **Enterprise Security** | `--dangerously-skip-permissions` | MCP sandboxed | Sandboxed | Audit logs, RBAC | Staged autonomy | Sandboxed |
@@ -2,11 +2,11 @@
2
2
 
3
3
  The flagship product of [Autonomi](https://www.autonomi.dev/). Complete installation instructions for all platforms and use cases.
4
4
 
5
- **Version:** v5.49.0
5
+ **Version:** v5.49.2
6
6
 
7
7
  ---
8
8
 
9
- ## What's New in v5.39.0
9
+ ## What's New in v5.49.1
10
10
 
11
11
  ### Enterprise Security (v5.36.0-v5.37.1)
12
12
  - TLS/HTTPS support for dashboard connections
@@ -63,7 +63,7 @@ npm install -g loki-mode
63
63
  brew tap asklokesh/tap && brew install loki-mode
64
64
 
65
65
  # Option C: Docker
66
- docker pull asklokesh/loki-mode:5.32.0
66
+ docker pull asklokesh/loki-mode:latest
67
67
 
68
68
  # Option D: Git clone
69
69
  git clone https://github.com/asklokesh/loki-mode.git ~/.claude/skills/loki-mode
@@ -160,6 +160,10 @@ Install via npm for the easiest setup with automatic PATH configuration.
160
160
  npm install -g loki-mode
161
161
 
162
162
  # The skill is automatically installed to ~/.claude/skills/loki-mode
163
+
164
+ # Opt out of anonymous install telemetry:
165
+ # LOKI_TELEMETRY_DISABLED=true npm install -g loki-mode
166
+ # Or set DO_NOT_TRACK=1
163
167
  ```
164
168
 
165
169
  ### Usage
@@ -207,8 +211,8 @@ brew tap asklokesh/tap
207
211
  # Install Loki Mode
208
212
  brew install loki-mode
209
213
 
210
- # Set up Claude Code skill integration
211
- loki-mode-install-skill
214
+ # Set up Claude Code skill integration (manual symlink required)
215
+ ln -sf "$(brew --prefix)/opt/loki-mode/libexec" ~/.claude/skills/loki-mode
212
216
  ```
213
217
 
214
218
  ### Dependencies
@@ -254,7 +258,7 @@ Run Loki Mode in a container for isolated execution.
254
258
 
255
259
  ```bash
256
260
  # Pull the image
257
- docker pull asklokesh/loki-mode:5.32.0
261
+ docker pull asklokesh/loki-mode:latest
258
262
 
259
263
  # Or use docker-compose
260
264
  curl -o docker-compose.yml https://raw.githubusercontent.com/asklokesh/loki-mode/main/docker-compose.yml
@@ -264,10 +268,10 @@ curl -o docker-compose.yml https://raw.githubusercontent.com/asklokesh/loki-mode
264
268
 
265
269
  ```bash
266
270
  # Run with a PRD file
267
- docker run -v $(pwd):/workspace -w /workspace asklokesh/loki-mode:5.32.0 start ./my-prd.md
271
+ docker run -v $(pwd):/workspace -w /workspace asklokesh/loki-mode:latest start ./my-prd.md
268
272
 
269
273
  # Interactive mode
270
- docker run -it -v $(pwd):/workspace -w /workspace asklokesh/loki-mode:5.32.0
274
+ docker run -it -v $(pwd):/workspace -w /workspace asklokesh/loki-mode:latest
271
275
 
272
276
  # Using docker-compose
273
277
  docker-compose run loki start ./my-prd.md
@@ -280,7 +284,7 @@ Pass your configuration via environment variables:
280
284
  ```bash
281
285
  docker run -e LOKI_MAX_RETRIES=100 -e LOKI_BASE_WAIT=120 \
282
286
  -v $(pwd):/workspace -w /workspace \
283
- asklokesh/loki-mode:5.32.0 start ./my-prd.md
287
+ asklokesh/loki-mode:latest start ./my-prd.md
284
288
  ```
285
289
 
286
290
  ### Updating
@@ -396,12 +400,12 @@ Pass the provider as an environment variable:
396
400
  # Use Codex with Docker
397
401
  docker run -e LOKI_PROVIDER=codex \
398
402
  -v $(pwd):/workspace -w /workspace \
399
- asklokesh/loki-mode:5.32.0 start ./my-prd.md
403
+ asklokesh/loki-mode:latest start ./my-prd.md
400
404
 
401
405
  # Use Gemini with Docker
402
406
  docker run -e LOKI_PROVIDER=gemini \
403
407
  -v $(pwd):/workspace -w /workspace \
404
- asklokesh/loki-mode:5.32.0 start ./my-prd.md
408
+ asklokesh/loki-mode:latest start ./my-prd.md
405
409
  ```
406
410
 
407
411
  ### Degraded Mode
@@ -652,7 +656,11 @@ Add the source command to your startup file so completions load every time you o
652
656
  Add this line to your `~/.bashrc` (Linux) or `~/.bash_profile` (macOS):
653
657
 
654
658
  ```bash
655
- source /path/to/loki/completions/loki.bash
659
+ # npm install: use the npm package path
660
+ source "$(npm root -g)/loki-mode/completions/loki.bash"
661
+
662
+ # git clone: use the skills directory
663
+ source ~/.claude/skills/loki-mode/completions/loki.bash
656
664
  ```
657
665
 
658
666
  ---
@@ -0,0 +1,145 @@
1
+ # Alternative Installation Methods
2
+
3
+ The primary installation method is git clone (see [README](../README.md#installation)). These alternatives serve specific use cases.
4
+
5
+ ---
6
+
7
+ ## npm (Secondary)
8
+
9
+ **Status**: Working. Version tracks releases automatically.
10
+
11
+ ```bash
12
+ npm install -g loki-mode
13
+ ```
14
+
15
+ **Limitation**: Installs to `node_modules`, not `~/.claude/skills/`. To use as a Claude Code skill, you must symlink:
16
+
17
+ ```bash
18
+ npm install -g loki-mode
19
+ ln -sf "$(npm root -g)/loki-mode" ~/.claude/skills/loki-mode
20
+ ```
21
+
22
+ **Best for**: CI/CD pipelines, programmatic access via `loki` CLI.
23
+
24
+ ---
25
+
26
+ ## Homebrew (Secondary)
27
+
28
+ **Status**: Working. Tap and formula exist, version current.
29
+
30
+ ```bash
31
+ brew tap asklokesh/tap
32
+ brew install loki-mode
33
+ ```
34
+
35
+ **Limitation**: Installs the `loki` CLI binary only. Does NOT install the Claude Code skill. To use with Claude Code, also run:
36
+
37
+ ```bash
38
+ git clone https://github.com/asklokesh/loki-mode.git ~/.claude/skills/loki-mode
39
+ ```
40
+
41
+ **Best for**: Users who want the `loki` CLI wrapper for autonomous mode (`loki start`, `loki stop`, `loki cleanup`).
42
+
43
+ ---
44
+
45
+ ## Docker (Secondary)
46
+
47
+ **Status**: Image exists on Docker Hub. Tags: `latest`, version-specific (e.g., `5.49.1`).
48
+
49
+ ```bash
50
+ docker pull asklokesh/loki-mode:latest
51
+ ```
52
+
53
+ **Limitation**: Claude Code is an interactive CLI that requires API keys and terminal access. Running it inside a Docker container is not the standard workflow. Docker is useful for:
54
+
55
+ - CI/CD sandbox execution (running `loki` in isolated environments)
56
+ - Testing Loki Mode without modifying your local system
57
+ - Air-gapped environments with pre-built images
58
+
59
+ **Not recommended for**: Interactive Claude Code sessions. Use the git clone method instead.
60
+
61
+ See [DOCKER_README.md](../DOCKER_README.md) for Docker-specific usage instructions.
62
+
63
+ ---
64
+
65
+ ## GitHub Action (Secondary)
66
+
67
+ **Status**: Working. Adds automated AI code review to pull requests.
68
+
69
+ ```yaml
70
+ # .github/workflows/loki-review.yml
71
+ name: Loki Code Review
72
+ on:
73
+   pull_request:
74
+     types: [opened, synchronize]
75
+ permissions:
76
+   contents: read
77
+   pull-requests: write
78
+ jobs:
79
+   review:
80
+     runs-on: ubuntu-latest
81
+     steps:
82
+       - uses: actions/checkout@v4
83
+       - uses: asklokesh/loki-mode@v5
84
+         with:
85
+           github_token: ${{ secrets.GITHUB_TOKEN }}
86
+           mode: review
87
+           provider: claude
88
+           max_iterations: 3
89
+           budget_limit: '5.00'
90
+         env:
91
+           ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
92
+ ```
93
+
94
+ **Prerequisites:**
95
+ - API key for your provider (set as repository secret): `ANTHROPIC_API_KEY`, `OPENAI_API_KEY`, or `GOOGLE_API_KEY`
96
+ - The action auto-installs `loki-mode` and `@anthropic-ai/claude-code`
97
+
98
+ **Action Inputs:**
99
+
100
+ | Input | Default | Description |
101
+ |-------|---------|-------------|
102
+ | `mode` | `review` | `review`, `fix`, or `test` |
103
+ | `provider` | `claude` | `claude`, `codex`, or `gemini` |
104
+ | `budget_limit` | `5.00` | Max cost in USD |
105
+ | `max_iterations` | `3` | Max RARV cycles |
106
+ | `github_token` | (required) | GitHub token for PR comments |
107
+ | `prd_file` | | Path to PRD file (for fix/test modes) |
108
+
109
+ **Modes:**
110
+
111
+ | Mode | Description |
112
+ |------|-------------|
113
+ | `review` | Analyze PR diff, post structured review as PR comment |
114
+ | `fix` | Automatically fix issues found in the codebase |
115
+ | `test` | Run autonomous test generation and validation |
116
+
117
+ **Best for**: Automated PR review and CI/CD integration.
118
+
119
+ ---
120
+
121
+ ## GitHub Release Download (Secondary)
122
+
123
+ **Status**: Working. Release assets available for each version.
124
+
125
+ ```bash
126
+ # Download and extract to skills directory
127
+ curl -sL https://github.com/asklokesh/loki-mode/archive/refs/tags/v5.49.1.tar.gz | tar xz
128
+ mv loki-mode-5.49.1 ~/.claude/skills/loki-mode
129
+ ```
130
+
131
+ **Best for**: Offline or air-gapped environments, pinned version deployments.
132
+
133
+ ---
134
+
135
+ ## VS Code Extension (Secondary)
136
+
137
+ **Status**: Available on VS Code Marketplace.
138
+
139
+ Search for "Loki Mode" in VS Code Extensions, or:
140
+
141
+ ```bash
142
+ code --install-extension asklokesh.loki-mode
143
+ ```
144
+
145
+ **Best for**: VS Code users who want dashboard integration within their editor.
@@ -169,7 +169,7 @@ Loki Mode:
169
169
  **Verdict: Loki Mode wins** - Simpler, lighter footprint.
170
170
 
171
171
  ### 10. Cursor Scale Patterns (v3.3.0)
172
- Loki Mode now incorporates proven patterns from Cursor's 100+ agent deployments:
172
+ Loki Mode now incorporates proven patterns from Cursor's large-scale agent deployments:
173
173
  - Recursive sub-planners
174
174
  - Judge agents for cycle decisions
175
175
  - Optimistic concurrency control
@@ -9,9 +9,9 @@
9
9
 
10
10
  | Dimension | Cursor | Loki Mode | Winner |
11
11
  |-----------|--------|-----------|--------|
12
- | **Proven Scale** | 1M+ LoC, 100+ agents | Benchmarks only | Cursor |
12
+ | **Proven Scale** | 1M+ LoC, large agent count | Benchmarks only | Cursor |
13
13
  | **Research Foundation** | Empirical iteration | 25+ academic citations | Loki Mode |
14
- | **Quality Assurance** | Workers self-manage | 7-gate system + anti-sycophancy | Loki Mode |
14
+ | **Quality Assurance** | Workers self-manage | 9-gate system + anti-sycophancy | Loki Mode |
15
15
  | **Anti-Sycophancy** | Not mentioned | CONSENSAGENT blind review | Loki Mode |
16
16
  | **Velocity-Quality Balance** | Not mentioned | arXiv-backed metrics | Loki Mode |
17
17
  | **Full SDLC Coverage** | Code generation focus | PRD to production + growth | Loki Mode |
@@ -66,7 +66,7 @@ velocity_quality_balance:
66
66
 
67
67
  ---
68
68
 
69
- ### 3. 7-Gate Quality System
69
+ ### 3. 9-Gate Quality System
70
70
 
71
71
  **Loki Mode's Gates:**
72
72
  1. Input Guardrails - Validate scope, detect injection (OpenAI SDK pattern)
@@ -122,7 +122,7 @@ BOOTSTRAP -> DISCOVERY -> ARCHITECTURE -> INFRASTRUCTURE
122
122
  -> DEVELOPMENT -> QA -> DEPLOYMENT -> GROWTH (continuous)
123
123
  ```
124
124
 
125
- **41 Specialized Agent Types across 7 swarms:**
125
+ **41 Specialized Agent Types across 8 swarms:**
126
126
  - Engineering (8 types)
127
127
  - Operations (8 types)
128
128
  - Business (8 types)
@@ -174,7 +174,7 @@ Cursor learned through failure:
174
174
  ### 3. Simplicity Principle
175
175
  > "A surprising amount of the system's behavior comes down to how we prompt the agents. The harness and models matter, but the prompts matter more."
176
176
 
177
- **Loki Mode:** More complex infrastructure (7 gates, 41 agent types, memory systems). May be over-engineered for some use cases.
177
+ **Loki Mode:** More complex infrastructure (9 gates, 41 agent types, memory systems). May be over-engineered for some use cases.
178
178
 
179
179
  ---
180
180
 
@@ -184,7 +184,7 @@ We incorporated Cursor's proven patterns:
184
184
 
185
185
  1. **Recursive Sub-Planners** - Planning scales horizontally
186
186
  2. **Judge Agents** - Explicit CONTINUE/COMPLETE/ESCALATE/PIVOT decisions
187
- 3. **Optimistic Concurrency** - No locks, scales to 100+ agents
187
+ 3. **Optimistic Concurrency** - No locks, scales horizontally
188
188
  4. **Scale-Aware Review** - Full review for high-risk only at scale
189
189
 
190
190
  ---
@@ -192,7 +192,7 @@ We incorporated Cursor's proven patterns:
192
192
  ## Conclusion
193
193
 
194
194
  **Loki Mode is scientifically better in:**
195
- - Quality assurance (research-backed 7-gate system)
195
+ - Quality assurance (research-backed 9-gate system)
196
196
  - Anti-sycophancy (CONSENSAGENT blind review)
197
197
  - Velocity-quality balance (arXiv metrics)
198
198
  - Full SDLC coverage (PRD to growth)
@@ -49,7 +49,7 @@ skills/
49
49
 
50
50
  references/ (unchanged)
51
51
  +-- 18 detailed reference files
52
- +-- agents.md (23KB) - Full 37 agent specs
52
+ +-- agents.md (23KB) - Full 41 agent specs
53
53
  +-- openai-patterns.md, lab-research-patterns.md, etc.
54
54
  ```
55
55
 
@@ -117,7 +117,7 @@ Content that didn't exist in v2.38.0:
117
117
  | Handoff message format | A2A specification | skills/agents.md |
118
118
  | Agentic patterns table | awesome-agentic-patterns | skills/agents.md |
119
119
  | "Ralph Wiggum Mode" insight | moridinamael | skills/agents.md |
120
- | Full 37 agent reference | references/agent-types.md | skills/agents.md (pointer) |
120
+ | Full 41 agent reference | references/agent-types.md | skills/agents.md (pointer) |
121
121
  | References directory listing | New | skills/00-index.md |
122
122
 
123
123
  ---
package/mcp/__init__.py CHANGED
@@ -21,4 +21,4 @@ try:
21
21
  except ImportError:
22
22
      __all__ = ['mcp']
23
23
 
24
- __version__ = '5.49.0'
24
+ __version__ = '5.49.2'
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "loki-mode",
3
- "version": "5.49.0",
3
+ "version": "5.49.2",
4
4
  "description": "Loki Mode by Autonomi - Multi-agent autonomous startup system for Claude Code, Codex CLI, and Gemini CLI",
5
5
  "keywords": [
6
6
  "autonomi",
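
The version bump in this release touches both `package/mcp/__init__.py` and `package/package.json`, and the two strings have to stay in sync. A minimal sketch of a consistency check is shown here. The helper names (`python_version`, `package_json_version`, `versions_agree`) are hypothetical and not part of loki-mode, and the sketch assumes the file contents have already been read into strings.

```python
import json
import re
from typing import Optional


def python_version(source: str) -> Optional[str]:
    """Return the string assigned to __version__ in Python source text, if any."""
    match = re.search(
        r"^__version__\s*=\s*['\"]([^'\"]+)['\"]", source, re.MULTILINE
    )
    return match.group(1) if match else None


def package_json_version(text: str) -> Optional[str]:
    """Return the top-level "version" field of a package.json document."""
    return json.loads(text).get("version")


def versions_agree(py_source: str, package_json_text: str) -> bool:
    """True only when both files declare a version and the two are identical."""
    py = python_version(py_source)
    return py is not None and py == package_json_version(package_json_text)
```

A release script could call `versions_agree` on the two files and refuse to publish when it returns `False`.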
@@ -6,7 +6,7 @@ Complete definitions and capabilities for all 41 specialized agent types.
6
6
 
7
7
  ## Overview
8
8
 
9
- Loki Mode has 41 predefined agent types organized into 7 specialized swarms (37 domain agents + 4 orchestration agents). The orchestrator spawns only the agents needed for your project - a simple app might use 5-10 agents, while a complex startup could spawn 100+ agents working in parallel.
9
+ Loki Mode has 41 predefined agent types organized into 8 specialized swarms (37 domain agents + 4 orchestration agents). The orchestrator spawns only the agents needed for your project -- typically 5-10 for simple projects, more for complex ones.
10
10
 
11
11
  ---
12
12
 
@@ -98,7 +98,7 @@ Loki Mode has 41 predefined agent types organized into 7 specialized swarms (37
98
98
 
99
99
  ## Orchestration Swarm (4 types)
100
100
 
101
- > **Source:** [Cursor Scaling Learnings](./cursor-learnings.md) - patterns proven at 100+ agent scale
101
+ > **Source:** [Cursor Scaling Learnings](./cursor-learnings.md) - patterns proven at large agent scale
102
102
 
103
103
  | Agent | Capabilities |
104
104
  |-------|-------------|
@@ -2,7 +2,7 @@
2
2
 
3
3
  Complete specifications for all 41 specialized agent types in the Loki Mode multi-agent system (37 domain agents + 4 orchestration agents).
4
4
 
5
- **Note:** These are agent TYPE definitions, not a fixed count. Loki Mode dynamically spawns agents based on project needs - a simple todo app might use 5-10 agents, while a complex startup could spawn 100+ agents working in parallel.
5
+ **Note:** These are agent TYPE definitions, not a fixed count. Loki Mode dynamically spawns agents based on project needs - a simple todo app might use 5-10 agents, while a complex startup spawns more as needed.
6
6
 
7
7
  ## Agent Role Prompt Template
8
8
 
@@ -182,7 +182,7 @@ Dexter shows value of domain specialization. Our 41 agent types follow this patt
182
182
  - Most haven't scaled across enterprise
183
183
 
184
184
  ### Loki Mode Alignment
185
- - Multi-agent architecture (41 types, 7 swarms)
185
+ - Multi-agent architecture (41 types, 8 swarms)
186
186
  - Plan Agents (orchestrator, planner)
187
187
  - Execution Agents (eng-*, ops-*, biz-*)
188
188
  - Security controls (LOKI_SANDBOX_MODE, LOKI_BLOCKED_COMMANDS)
@@ -6,7 +6,7 @@ Full RARV cycle, CONTINUITY.md template, and autonomy rules.
6
6
 
7
7
  ## Autonomy Rules
8
8
 
9
- **This system runs with ZERO human intervention.**
9
+ **This system runs with minimal human intervention.** Human oversight is expected for deployment credentials, domain setup, API keys, and critical business decisions.
10
10
 
11
11
  ### Core Rules
12
12
  1. **NEVER ask questions** - Do not say "Would you like me to...", "Should I...", or "What would you prefer?"
@@ -41,7 +41,7 @@
41
41
 
42
42
  ### quality-gates.md
43
43
  **When:** Code review, pre-commit checks, quality assurance
44
- - 7-gate quality system
44
+ - 9-gate quality system
45
45
  - Blind review + anti-sycophancy
46
46
  - Velocity-quality feedback loop (arXiv research)
47
47
  - Mandatory quality checks per task
package/skills/agents.md CHANGED
@@ -1,6 +1,6 @@
1
1
  # Agent Dispatch & Structured Prompting
2
2
 
3
- > **Full agent type definitions:** See `references/agent-types.md` for complete 41 agent role specifications across 7 swarms (Engineering, Operations, Business, Data, Product, Growth, Review, Orchestration).
3
+ > **Full agent type definitions:** See `references/agent-types.md` for complete 41 agent role specifications across 8 swarms (Engineering, Operations, Business, Data, Product, Growth, Review, Orchestration).
4
4
 
5
5
  ---
6
6
 
@@ -245,7 +245,7 @@ Priority order for context:
245
245
 
246
246
  ---
247
247
 
248
- ## The 37 Agent Roles
248
+ ## The 41 Agent Roles (37 Domain + 4 Orchestration)
249
249
 
250
250
  See `references/agent-types.md` for complete specifications. Summary:
251
251
 
@@ -259,4 +259,4 @@ See `references/agent-types.md` for complete specifications. Summary:
259
259
  | Growth | hacker, community, success, lifecycle | 4 |
260
260
  | Review | code, business, security | 3 |
261
261
 
262
- **Spawn only what you need.** Simple project: 5-10 agents. Complex startup: 100+.
262
+ **Spawn only what you need.** Simple project: 5-10 agents. Complex startup: more as needed.
@@ -36,7 +36,7 @@ format: "markdown"
36
36
  contents:
37
37
  - Phase name and duration
38
38
  - Tasks completed (from queue)
39
- - Quality gate results (7 gates)
39
+ - Quality gate results (9 gates)
40
40
  - Coverage metrics
41
41
  - Known issues / TODOs
42
42
  ```
@@ -427,7 +427,7 @@ optimistic_write:
427
427
  - No waiting for locks
428
428
  - No deadlock risk
429
429
  - Failed writes are cheap (just retry)
430
- - Scales to 100+ agents
430
+ - Scales horizontally with agent count
431
431
  ```
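
The properties listed for `optimistic_write` (no lock waits, cheap retries on failed writes) can be sketched as a versioned compare-and-set loop. This is an illustration only, not Loki Mode's implementation: `VersionedStore` and `optimistic_update` are hypothetical names, and the sketch is single-threaded, so it assumes a real backing store would provide the compare-and-set step atomically.

```python
from typing import Any, Callable, Tuple


class VersionedStore:
    """Holds one value plus a version counter that advances on every write."""

    def __init__(self, value: Any) -> None:
        self.value = value
        self.version = 0

    def read(self) -> Tuple[Any, int]:
        return self.value, self.version

    def compare_and_set(self, expected_version: int, new_value: Any) -> bool:
        # Reject the write if another writer advanced the version first.
        if self.version != expected_version:
            return False
        self.value = new_value
        self.version += 1
        return True


def optimistic_update(
    store: VersionedStore, update: Callable[[Any], Any], max_attempts: int = 5
) -> int:
    """Apply `update` without holding a lock; return the number of attempts used."""
    for attempt in range(1, max_attempts + 1):
        value, version = store.read()
        if store.compare_and_set(version, update(value)):
            return attempt
        # Conflict: another writer won the race, so re-read and retry.
    raise RuntimeError("optimistic update failed after %d attempts" % max_attempts)
```

A writer never blocks here: a conflicting write costs one extra read and one recomputation, which is why the retry stays cheap as the number of agents grows.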
432
432
 
433
433
  ### Implementation
@@ -2,7 +2,7 @@
2
2
 
3
3
  **Never ship code without passing all quality gates.**
4
4
 
5
- ## The 7 Quality Gates
5
+ ## The 9 Quality Gates
6
6
 
7
7
  1. **Input Guardrails** - Validate scope, detect injection, check constraints (OpenAI SDK)
8
8
  2. **Static Analysis** - CodeQL, ESLint/Pylint, type checking
@@ -11,6 +11,8 @@
11
11
  5. **Output Guardrails** - Validate code quality, spec compliance, no secrets (tripwire on fail)
12
12
  6. **Severity-Based Blocking** - Critical/High/Medium = BLOCK; Low/Cosmetic = TODO comment
13
13
  7. **Test Coverage Gates** - Unit: 100% pass, >80% coverage; Integration: 100% pass
14
+ 8. **Mock Detector** - Classifies internal vs external mocks; flags tests that never import source code, tautological assertions, and high internal mock ratios
15
+ 9. **Test Mutation Detector** - Detects assertion value changes alongside implementation changes (test fitting), low assertion density, and missing pass/fail tracking
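
Gates 6 and 7 in the list above state explicit thresholds, so they can be sketched as two small predicates. This is a hedged illustration rather than the package's code: the function names, the lowercase severity labels, and the argument shapes (failure counts and a coverage percentage) are assumptions made for the example.

```python
BLOCKING_SEVERITIES = {"critical", "high", "medium"}
NON_BLOCKING_SEVERITIES = {"low", "cosmetic"}


def severity_action(severity: str) -> str:
    """Gate 6: Critical/High/Medium findings block; Low/Cosmetic become TODOs."""
    level = severity.strip().lower()
    if level in BLOCKING_SEVERITIES:
        return "BLOCK"
    if level in NON_BLOCKING_SEVERITIES:
        return "TODO"
    raise ValueError("unknown severity: %r" % severity)


def coverage_gate_passes(
    unit_failures: int, unit_coverage_pct: float, integration_failures: int
) -> bool:
    """Gate 7: every unit and integration test passes and unit coverage exceeds 80%."""
    return (
        unit_failures == 0
        and integration_failures == 0
        and unit_coverage_pct > 80.0
    )
```

Under these assumptions, `severity_action("High")` returns `"BLOCK"`, and `coverage_gate_passes(0, 80.0, 0)` returns `False` because the stated threshold is strictly greater than 80%.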
14
16
 
15
17
  ## Guardrails Execution Modes
16
18
 
@@ -472,7 +474,7 @@ See `references/quality-control.md` for complete details.
472
474
 
473
475
  ## Scale Considerations
474
476
 
475
- > **Source:** [Cursor Scaling Learnings](../references/cursor-learnings.md) - integrators became bottlenecks at 100+ agents
477
+ > **Source:** [Cursor Scaling Learnings](../references/cursor-learnings.md) - integrators became bottlenecks at high agent counts
476
478
 
477
479
  ### Review Intensity Scaling
478
480