loki-mode 5.49.0 → 5.49.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +70 -121
- package/SKILL.md +3 -3
- package/VERSION +1 -1
- package/autonomy/CONSTITUTION.md +4 -4
- package/autonomy/app-runner.sh +9 -0
- package/autonomy/loki +107 -0
- package/autonomy/run.sh +170 -4
- package/dashboard/__init__.py +1 -1
- package/dashboard/server.py +172 -20
- package/dashboard/static/index.html +1 -1
- package/docs/COMPARISON.md +15 -15
- package/docs/COMPETITIVE-ANALYSIS.md +4 -4
- package/docs/INSTALLATION.md +20 -12
- package/docs/alternative-installations.md +145 -0
- package/docs/auto-claude-comparison.md +1 -1
- package/docs/cursor-comparison.md +7 -7
- package/docs/thick2thin.md +2 -2
- package/mcp/__init__.py +1 -1
- package/package.json +1 -1
- package/references/agent-types.md +2 -2
- package/references/agents.md +1 -1
- package/references/competitive-analysis.md +1 -1
- package/references/core-workflow.md +1 -1
- package/skills/00-index.md +1 -1
- package/skills/agents.md +3 -3
- package/skills/artifacts.md +1 -1
- package/skills/parallel-workflows.md +1 -1
- package/skills/quality-gates.md +4 -2
package/docs/COMPARISON.md
CHANGED
|
@@ -12,7 +12,7 @@
|
|
|
12
12
|
| Feature | **Loki Mode** | **Zencoder** | **Devin** | **OpenAI Codex** | **Cursor** | **Claude Code** | **Kiro** | **Antigravity** | **Amazon Q** | **OpenCode** |
|
|
13
13
|
|---------|--------------|--------------|-----------|-----------------|------------|-----------------|----------|-----------------|--------------|--------------|
|
|
14
14
|
| **Type** | Skill/Framework | Enterprise Platform | Standalone Agent | Cloud Agent | AI IDE | CLI Agent | AI IDE | AI IDE | Cloud Agent | AI IDE (OSS) |
|
|
15
|
-
| **Autonomy Level** |
|
|
15
|
+
| **Autonomy Level** | High (minimal human) | High | Full | High | Medium-High | High | High | High | High | High |
|
|
16
16
|
| **Max Runtime** | Unlimited | Async/Scheduled | Hours | Per-task | Session | Session | Days | Async | Per-task | Session |
|
|
17
17
|
| **Pricing** | Free (OSS) | Enterprise | $20/mo | ChatGPT Plus | $20/mo | API costs | Free preview | Free preview | $19/mo | Free (OSS) |
|
|
18
18
|
| **Open Source** | Yes | No | No | No | No | No | No | No | No | Yes |
|
|
@@ -24,7 +24,7 @@
|
|
|
24
24
|
|
|
25
25
|
| Feature | **Loki Mode** | **Devin** | **Codex** | **Cursor** | **Kiro** | **Antigravity** | **Amazon Q** | **OpenCode** |
|
|
26
26
|
|---------|--------------|-----------|-----------|------------|----------|-----------------|--------------|--------------|
|
|
27
|
-
| **Multi-Agent** | 41 agents in
|
|
27
|
+
| **Multi-Agent** | 41 agents in 8 swarms | Single | Single | Up to 8 parallel | Background | Manager Surface | Multiple types | 4 built-in |
|
|
28
28
|
| **Orchestration** | Full orchestrator | N/A | N/A | Git worktree | Hooks | Manager view | Workflow | Subagents |
|
|
29
29
|
| **Parallel Exec** | 10+ Haiku, 4 impl (worktree) | No | No | 8 max | Yes | Yes | Yes | Yes |
|
|
30
30
|
| **Agent Swarms** | Eng, Ops, Business, Data, Product, Growth, Review | N/A | N/A | N/A | N/A | N/A | 3 types | N/A |
|
|
@@ -37,7 +37,7 @@
|
|
|
37
37
|
|---------|--------------|-----------|-----------|------------|----------|-----------------|--------------|--------------|
|
|
38
38
|
| **Code Review** | 3 blind reviewers + devil's advocate | Basic | Basic | BugBot PR | Property-based | Artifacts | Doc/Review | Basic |
|
|
39
39
|
| **Anti-Sycophancy** | Yes (CONSENSAGENT) | No | No | No | No | No | No | No |
|
|
40
|
-
| **Quality Gates** |
|
|
40
|
+
| **Quality Gates** | 9 gates + PBT | Basic | Sandbox | Tests | Spec validation | Artifact checks | Tests | Permissions |
|
|
41
41
|
| **Constitutional AI** | Yes (principles) | No | Refusal training | No | No | No | No | No |
|
|
42
42
|
|
|
43
43
|
---
|
|
@@ -146,7 +146,7 @@
|
|
|
146
146
|
|
|
147
147
|
| Feature | **Zencoder** | **Loki Mode** | **Assessment** |
|
|
148
148
|
|---------|-------------|---------------|----------------|
|
|
149
|
-
| **Four Pillars** | Structured Workflows, SDD, Multi-Agent Verification, Parallel Execution | SDLC + RARV +
|
|
149
|
+
| **Four Pillars** | Structured Workflows, SDD, Multi-Agent Verification, Parallel Execution | SDLC + RARV + 9 Gates + Worktrees | TIE |
|
|
150
150
|
| **Spec-Driven Dev** | Specs as first-class objects | OpenAPI-first | TIE |
|
|
151
151
|
| **Multi-Agent Verification** | Model diversity (Claude vs OpenAI, 54% improvement) | 3 blind reviewers + devil's advocate | Different approach (N/A for Claude Code - only Claude models) |
|
|
152
152
|
| **Quality Gates** | Built-in verification loops | 7 explicit gates + anti-sycophancy | **Loki Mode** |
|
|
@@ -180,9 +180,9 @@
|
|
|
180
180
|
|
|
181
181
|
1. **Quality Control**: 7 explicit gates + blind review + devil's advocate vs built-in loops
|
|
182
182
|
2. **Memory System**: 3-tier (episodic/semantic/procedural) with cross-project learning
|
|
183
|
-
3. **Agent Specialization**: 41 pre-defined specialized agents across
|
|
183
|
+
3. **Agent Specialization**: 41 pre-defined specialized agents across 8 swarms
|
|
184
184
|
4. **Anti-Sycophancy**: CONSENSAGENT patterns prevent reviewer groupthink
|
|
185
|
-
5. **Autonomy Design**:
|
|
185
|
+
5. **Autonomy Design**: Minimal human intervention from PRD to production
|
|
186
186
|
6. **Research Foundation**: 10+ academic papers integrated vs proprietary
|
|
187
187
|
|
|
188
188
|
### Where Zencoder EXCEEDS Loki Mode
|
|
@@ -203,13 +203,13 @@
|
|
|
203
203
|
|---------|--------------|---------|-----------------|------------|-----------------|---------------------|----------------|
|
|
204
204
|
| **Stars** | 594 | 11,903 | 35K+ | 26K+ | 13.7K | N/A | N/A |
|
|
205
205
|
| **npm/wk** | 6.1K | 21.4K | N/A | N/A | N/A | N/A | N/A |
|
|
206
|
-
| **Agents** | 41 in
|
|
206
|
+
| **Agents** | 41 in 8 swarms | 11 agents | Fresh per task | 108 agents | Swarm-based | 32 agents | N/A |
|
|
207
207
|
| **Skills** | Progressive disclosure | 6 slash commands | N/A | 129 skills | N/A | 35 skills | Memory focus |
|
|
208
208
|
| **Multi-Provider** | Yes (Claude/Codex/Gemini) | 3 CLIs (separate) | No | No | No | No | No |
|
|
209
209
|
| **Memory System** | 3-tier (episodic/semantic/procedural) | None | N/A | N/A | Hybrid | N/A | SQLite+FTS5 |
|
|
210
|
-
| **Quality Gates** |
|
|
210
|
+
| **Quality Gates** | 9 gates + Completion Council | User verify only | Two-Stage Review | N/A | Consensus | Tiered | N/A |
|
|
211
211
|
| **Context Mgmt** | Standard | Fresh per task (core innovation) | Fresh per task | N/A | N/A | N/A | Progressive |
|
|
212
|
-
| **Autonomy** |
|
|
212
|
+
| **Autonomy** | High (minimal human) | Semi (checkpoints) | Human-guided | Human-guided | Orchestrated | Human-guided | N/A |
|
|
213
213
|
|
|
214
214
|
### What Loki Mode LACKS (Honest Assessment)
|
|
215
215
|
|
|
@@ -232,11 +232,11 @@ These are patterns from competing projects that are **practically and scientific
|
|
|
232
232
|
|----------|---------|-------------------------|
|
|
233
233
|
| **Multi-Provider Support** | Only skill supporting Claude, Codex, and Gemini with graceful degradation | All 8 competitors are Claude-only |
|
|
234
234
|
| **RARV Cycle** | Reason-Act-Reflect-Verify is more rigorous than Plan-Execute | Most use simple Plan-Execute |
|
|
235
|
-
| **
|
|
235
|
+
| **9-Gate Quality System** | Static analysis + 3 reviewers + devil's advocate + anti-sycophancy + severity blocking + coverage + debate | Superpowers has 2-stage, others have less |
|
|
236
236
|
| **Constitutional AI Integration** | Principles-based self-critique from Anthropic research | None have this |
|
|
237
237
|
| **Anti-Sycophancy (CONSENSAGENT)** | Blind review + devil's advocate prevents groupthink | None have this |
|
|
238
238
|
| **Provider Abstraction Layer** | Clean degradation from full-featured to sequential-only | Claude-only projects can't degrade |
|
|
239
|
-
| **41 Specialized Agents** | Purpose-built agents in
|
|
239
|
+
| **41 Specialized Agents** | Purpose-built agents in 8 swarms vs generic | agents (108) has more but less organized |
|
|
240
240
|
| **Research Foundation** | 10+ academic papers integrated with citations | Most have no research backing |
|
|
241
241
|
|
|
242
242
|
### Superpowers Deep-Dive (35K+ Stars)
|
|
@@ -342,7 +342,7 @@ Tiered agent architecture with explicit escalation:
|
|
|
342
342
|
|
|
343
343
|
| Agent | Killer Feature |
|
|
344
344
|
|-------|---------------|
|
|
345
|
-
| **Loki Mode** |
|
|
345
|
+
| **Loki Mode** | Minimal-human-intervention full SDLC, 41 agents in 8 swarms, Constitutional AI, anti-sycophancy, cross-project learning, code transformation, property-based testing |
|
|
346
346
|
| **Devin** | Full software engineer persona, Slack integration, 67% PR merge rate |
|
|
347
347
|
| **OpenAI Codex** | Skills marketplace, $skill-creator, GPT-5.2-Codex, secure sandbox |
|
|
348
348
|
| **Cursor** | 8 parallel agents, BugBot, Memories, $10B valuation, Composer model (250 tok/s) |
|
|
@@ -357,9 +357,9 @@ Tiered agent architecture with explicit escalation:
|
|
|
357
357
|
|
|
358
358
|
| Dimension | Loki Mode Advantage |
|
|
359
359
|
|-----------|-------------------|
|
|
360
|
-
| **Autonomy** |
|
|
361
|
-
| **Multi-Agent** | 41 specialized agents in
|
|
362
|
-
| **Quality** |
|
|
360
|
+
| **Autonomy** | Designed for high autonomy with minimal human intervention |
|
|
361
|
+
| **Multi-Agent** | 41 specialized agents in 8 swarms vs 1-8 in competitors |
|
|
362
|
+
| **Quality** | 9 gates + blind review + devil's advocate + property-based testing |
|
|
363
363
|
| **Research** | 10+ academic papers integrated vs proprietary/undisclosed |
|
|
364
364
|
| **Anti-Sycophancy** | Only agent with CONSENSAGENT-based blind review |
|
|
365
365
|
| **Memory** | 3-tier memory (episodic/semantic/procedural) + review learning + cross-project |
|
|
@@ -20,7 +20,7 @@ GSD is the closest competitor -- a context engineering system that spawns fresh
|
|
|
20
20
|
| Adoption | 594 stars, 6K/wk npm | 11,903 stars, 21K/wk npm | GSD (20x) |
|
|
21
21
|
| Simplicity | Complex (5.4K-line run.sh, 12 Python modules) | Simple (markdown agents + slash commands) | GSD |
|
|
22
22
|
| Full autonomy | Walk away, come back to deployed product | Human checkpoints at discuss/verify/milestone | Loki |
|
|
23
|
-
| Quality gates |
|
|
23
|
+
| Quality gates | 9-gate + Completion Council + anti-sycophancy | User verification only | Loki |
|
|
24
24
|
| Memory system | Episodic/semantic/procedural + vector search | None | Loki |
|
|
25
25
|
| Context management | Standard | Fresh subagent contexts per task (core innovation) | GSD |
|
|
26
26
|
| Time to value | Learn architecture, understand CLI flags | `npx get-shit-done-cc` and go | GSD |
|
|
@@ -37,9 +37,9 @@ GSD is the closest competitor -- a context engineering system that spawns fresh
|
|
|
37
37
|
|---------|-----------|-------------|---------|--------|--------------|-------|
|
|
38
38
|
| **GitHub Stars** | 594 | 13,700 | 62,400 | 25,000+ | N/A (Commercial) | N/A (Commercial) |
|
|
39
39
|
| **Agent Count** | 41 types | 64+ agents | 5 roles | Unlimited | 8 parallel | 1 autonomous |
|
|
40
|
-
| **Parallel Execution** | Yes (
|
|
41
|
-
| **Published Benchmarks** |
|
|
42
|
-
| **SWE-bench Score** |
|
|
40
|
+
| **Parallel Execution** | Yes (multi-agent) | Yes (swarms) | Sequential | Yes (crews) | Yes (8 worktrees) | Yes (fleet) |
|
|
41
|
+
| **Published Benchmarks** | 98.78% HumanEval (self-reported, max 3 retries) | None | 85.9-87.7% HumanEval | None | ~250 tok/s | 15% complex tasks |
|
|
42
|
+
| **SWE-bench Score** | 99.67% patch gen (unevaluated, 299/300) | Unknown | Unknown | Unknown | Unknown | 15% complex |
|
|
43
43
|
| **Full SDLC** | Yes (8 phases) | Yes | Partial | Partial | No | Partial |
|
|
44
44
|
| **Business Ops** | **Yes (8 agents)** | No | No | No | No | No |
|
|
45
45
|
| **Enterprise Security** | `--dangerously-skip-permissions` | MCP sandboxed | Sandboxed | Audit logs, RBAC | Staged autonomy | Sandboxed |
|
package/docs/INSTALLATION.md
CHANGED
|
@@ -2,11 +2,11 @@
|
|
|
2
2
|
|
|
3
3
|
The flagship product of [Autonomi](https://www.autonomi.dev/). Complete installation instructions for all platforms and use cases.
|
|
4
4
|
|
|
5
|
-
**Version:** v5.49.
|
|
5
|
+
**Version:** v5.49.2
|
|
6
6
|
|
|
7
7
|
---
|
|
8
8
|
|
|
9
|
-
## What's New in v5.
|
|
9
|
+
## What's New in v5.49.1
|
|
10
10
|
|
|
11
11
|
### Enterprise Security (v5.36.0-v5.37.1)
|
|
12
12
|
- TLS/HTTPS support for dashboard connections
|
|
@@ -63,7 +63,7 @@ npm install -g loki-mode
|
|
|
63
63
|
brew tap asklokesh/tap && brew install loki-mode
|
|
64
64
|
|
|
65
65
|
# Option C: Docker
|
|
66
|
-
docker pull asklokesh/loki-mode:
|
|
66
|
+
docker pull asklokesh/loki-mode:latest
|
|
67
67
|
|
|
68
68
|
# Option D: Git clone
|
|
69
69
|
git clone https://github.com/asklokesh/loki-mode.git ~/.claude/skills/loki-mode
|
|
@@ -160,6 +160,10 @@ Install via npm for the easiest setup with automatic PATH configuration.
|
|
|
160
160
|
npm install -g loki-mode
|
|
161
161
|
|
|
162
162
|
# The skill is automatically installed to ~/.claude/skills/loki-mode
|
|
163
|
+
|
|
164
|
+
# Opt out of anonymous install telemetry:
|
|
165
|
+
# LOKI_TELEMETRY_DISABLED=true npm install -g loki-mode
|
|
166
|
+
# Or set DO_NOT_TRACK=1
|
|
163
167
|
```
|
|
164
168
|
|
|
165
169
|
### Usage
|
|
@@ -207,8 +211,8 @@ brew tap asklokesh/tap
|
|
|
207
211
|
# Install Loki Mode
|
|
208
212
|
brew install loki-mode
|
|
209
213
|
|
|
210
|
-
# Set up Claude Code skill integration
|
|
211
|
-
loki-mode-
|
|
214
|
+
# Set up Claude Code skill integration (manual symlink required)
|
|
215
|
+
ln -sf "$(brew --prefix)/opt/loki-mode/libexec" ~/.claude/skills/loki-mode
|
|
212
216
|
```
|
|
213
217
|
|
|
214
218
|
### Dependencies
|
|
@@ -254,7 +258,7 @@ Run Loki Mode in a container for isolated execution.
|
|
|
254
258
|
|
|
255
259
|
```bash
|
|
256
260
|
# Pull the image
|
|
257
|
-
docker pull asklokesh/loki-mode:
|
|
261
|
+
docker pull asklokesh/loki-mode:latest
|
|
258
262
|
|
|
259
263
|
# Or use docker-compose
|
|
260
264
|
curl -o docker-compose.yml https://raw.githubusercontent.com/asklokesh/loki-mode/main/docker-compose.yml
|
|
@@ -264,10 +268,10 @@ curl -o docker-compose.yml https://raw.githubusercontent.com/asklokesh/loki-mode
|
|
|
264
268
|
|
|
265
269
|
```bash
|
|
266
270
|
# Run with a PRD file
|
|
267
|
-
docker run -v $(pwd):/workspace -w /workspace asklokesh/loki-mode:
|
|
271
|
+
docker run -v $(pwd):/workspace -w /workspace asklokesh/loki-mode:latest start ./my-prd.md
|
|
268
272
|
|
|
269
273
|
# Interactive mode
|
|
270
|
-
docker run -it -v $(pwd):/workspace -w /workspace asklokesh/loki-mode:
|
|
274
|
+
docker run -it -v $(pwd):/workspace -w /workspace asklokesh/loki-mode:latest
|
|
271
275
|
|
|
272
276
|
# Using docker-compose
|
|
273
277
|
docker-compose run loki start ./my-prd.md
|
|
@@ -280,7 +284,7 @@ Pass your configuration via environment variables:
|
|
|
280
284
|
```bash
|
|
281
285
|
docker run -e LOKI_MAX_RETRIES=100 -e LOKI_BASE_WAIT=120 \
|
|
282
286
|
-v $(pwd):/workspace -w /workspace \
|
|
283
|
-
asklokesh/loki-mode:
|
|
287
|
+
asklokesh/loki-mode:latest start ./my-prd.md
|
|
284
288
|
```
|
|
285
289
|
|
|
286
290
|
### Updating
|
|
@@ -396,12 +400,12 @@ Pass the provider as an environment variable:
|
|
|
396
400
|
# Use Codex with Docker
|
|
397
401
|
docker run -e LOKI_PROVIDER=codex \
|
|
398
402
|
-v $(pwd):/workspace -w /workspace \
|
|
399
|
-
asklokesh/loki-mode:
|
|
403
|
+
asklokesh/loki-mode:latest start ./my-prd.md
|
|
400
404
|
|
|
401
405
|
# Use Gemini with Docker
|
|
402
406
|
docker run -e LOKI_PROVIDER=gemini \
|
|
403
407
|
-v $(pwd):/workspace -w /workspace \
|
|
404
|
-
asklokesh/loki-mode:
|
|
408
|
+
asklokesh/loki-mode:latest start ./my-prd.md
|
|
405
409
|
```
|
|
406
410
|
|
|
407
411
|
### Degraded Mode
|
|
@@ -652,7 +656,11 @@ Add the source command to your startup file so completions load every time you o
|
|
|
652
656
|
Add this line to your `~/.bashrc` (Linux) or `~/.bash_profile` (macOS):
|
|
653
657
|
|
|
654
658
|
```bash
|
|
655
|
-
|
|
659
|
+
# npm install: use the npm package path
|
|
660
|
+
source "$(npm root -g)/loki-mode/completions/loki.bash"
|
|
661
|
+
|
|
662
|
+
# git clone: use the skills directory
|
|
663
|
+
source ~/.claude/skills/loki-mode/completions/loki.bash
|
|
656
664
|
```
|
|
657
665
|
|
|
658
666
|
---
|
|
@@ -0,0 +1,145 @@
|
|
|
1
|
+
# Alternative Installation Methods
|
|
2
|
+
|
|
3
|
+
The primary installation method is git clone (see [README](../README.md#installation)). These alternatives serve specific use cases.
|
|
4
|
+
|
|
5
|
+
---
|
|
6
|
+
|
|
7
|
+
## npm (Secondary)
|
|
8
|
+
|
|
9
|
+
**Status**: Working. Version tracks releases automatically.
|
|
10
|
+
|
|
11
|
+
```bash
|
|
12
|
+
npm install -g loki-mode
|
|
13
|
+
```
|
|
14
|
+
|
|
15
|
+
**Limitation**: Installs to `node_modules`, not `~/.claude/skills/`. To use as a Claude Code skill, you must symlink:
|
|
16
|
+
|
|
17
|
+
```bash
|
|
18
|
+
npm install -g loki-mode
|
|
19
|
+
ln -sf "$(npm root -g)/loki-mode" ~/.claude/skills/loki-mode
|
|
20
|
+
```
|
|
21
|
+
|
|
22
|
+
**Best for**: CI/CD pipelines, programmatic access via `loki` CLI.
|
|
23
|
+
|
|
24
|
+
---
|
|
25
|
+
|
|
26
|
+
## Homebrew (Secondary)
|
|
27
|
+
|
|
28
|
+
**Status**: Working. Tap and formula exist, version current.
|
|
29
|
+
|
|
30
|
+
```bash
|
|
31
|
+
brew tap asklokesh/tap
|
|
32
|
+
brew install loki-mode
|
|
33
|
+
```
|
|
34
|
+
|
|
35
|
+
**Limitation**: Installs the `loki` CLI binary only. Does NOT install the Claude Code skill. To use with Claude Code, also run:
|
|
36
|
+
|
|
37
|
+
```bash
|
|
38
|
+
git clone https://github.com/asklokesh/loki-mode.git ~/.claude/skills/loki-mode
|
|
39
|
+
```
|
|
40
|
+
|
|
41
|
+
**Best for**: Users who want the `loki` CLI wrapper for autonomous mode (`loki start`, `loki stop`, `loki cleanup`).
|
|
42
|
+
|
|
43
|
+
---
|
|
44
|
+
|
|
45
|
+
## Docker (Secondary)
|
|
46
|
+
|
|
47
|
+
**Status**: Image exists on Docker Hub. Tags: `latest`, version-specific (e.g., `5.49.1`).
|
|
48
|
+
|
|
49
|
+
```bash
|
|
50
|
+
docker pull asklokesh/loki-mode:latest
|
|
51
|
+
```
|
|
52
|
+
|
|
53
|
+
**Limitation**: Claude Code is an interactive CLI that requires API keys and terminal access. Running it inside a Docker container is not the standard workflow. Docker is useful for:
|
|
54
|
+
|
|
55
|
+
- CI/CD sandbox execution (running `loki` in isolated environments)
|
|
56
|
+
- Testing Loki Mode without modifying your local system
|
|
57
|
+
- Air-gapped environments with pre-built images
|
|
58
|
+
|
|
59
|
+
**Not recommended for**: Interactive Claude Code sessions. Use the git clone method instead.
|
|
60
|
+
|
|
61
|
+
See [DOCKER_README.md](../DOCKER_README.md) for Docker-specific usage instructions.
|
|
62
|
+
|
|
63
|
+
---
|
|
64
|
+
|
|
65
|
+
## GitHub Action (Secondary)
|
|
66
|
+
|
|
67
|
+
**Status**: Working. Adds automated AI code review to pull requests.
|
|
68
|
+
|
|
69
|
+
```yaml
|
|
70
|
+
# .github/workflows/loki-review.yml
|
|
71
|
+
name: Loki Code Review
|
|
72
|
+
on:
|
|
73
|
+
pull_request:
|
|
74
|
+
types: [opened, synchronize]
|
|
75
|
+
permissions:
|
|
76
|
+
contents: read
|
|
77
|
+
pull-requests: write
|
|
78
|
+
jobs:
|
|
79
|
+
review:
|
|
80
|
+
runs-on: ubuntu-latest
|
|
81
|
+
steps:
|
|
82
|
+
- uses: actions/checkout@v4
|
|
83
|
+
- uses: asklokesh/loki-mode@v5
|
|
84
|
+
with:
|
|
85
|
+
github_token: ${{ secrets.GITHUB_TOKEN }}
|
|
86
|
+
mode: review
|
|
87
|
+
provider: claude
|
|
88
|
+
max_iterations: 3
|
|
89
|
+
budget_limit: '5.00'
|
|
90
|
+
env:
|
|
91
|
+
ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
|
|
92
|
+
```
|
|
93
|
+
|
|
94
|
+
**Prerequisites:**
|
|
95
|
+
- API key for your provider (set as repository secret): `ANTHROPIC_API_KEY`, `OPENAI_API_KEY`, or `GOOGLE_API_KEY`
|
|
96
|
+
- The action auto-installs `loki-mode` and `@anthropic-ai/claude-code`
|
|
97
|
+
|
|
98
|
+
**Action Inputs:**
|
|
99
|
+
|
|
100
|
+
| Input | Default | Description |
|
|
101
|
+
|-------|---------|-------------|
|
|
102
|
+
| `mode` | `review` | `review`, `fix`, or `test` |
|
|
103
|
+
| `provider` | `claude` | `claude`, `codex`, or `gemini` |
|
|
104
|
+
| `budget_limit` | `5.00` | Max cost in USD |
|
|
105
|
+
| `max_iterations` | `3` | Max RARV cycles |
|
|
106
|
+
| `github_token` | (required) | GitHub token for PR comments |
|
|
107
|
+
| `prd_file` | | Path to PRD file (for fix/test modes) |
|
|
108
|
+
|
|
109
|
+
**Modes:**
|
|
110
|
+
|
|
111
|
+
| Mode | Description |
|
|
112
|
+
|------|-------------|
|
|
113
|
+
| `review` | Analyze PR diff, post structured review as PR comment |
|
|
114
|
+
| `fix` | Automatically fix issues found in the codebase |
|
|
115
|
+
| `test` | Run autonomous test generation and validation |
|
|
116
|
+
|
|
117
|
+
**Best for**: Automated PR review and CI/CD integration.
|
|
118
|
+
|
|
119
|
+
---
|
|
120
|
+
|
|
121
|
+
## GitHub Release Download (Secondary)
|
|
122
|
+
|
|
123
|
+
**Status**: Working. Release assets available for each version.
|
|
124
|
+
|
|
125
|
+
```bash
|
|
126
|
+
# Download and extract to skills directory
|
|
127
|
+
curl -sL https://github.com/asklokesh/loki-mode/archive/refs/tags/v5.49.1.tar.gz | tar xz
|
|
128
|
+
mv loki-mode-5.49.1 ~/.claude/skills/loki-mode
|
|
129
|
+
```
|
|
130
|
+
|
|
131
|
+
**Best for**: Offline or air-gapped environments, pinned version deployments.
|
|
132
|
+
|
|
133
|
+
---
|
|
134
|
+
|
|
135
|
+
## VS Code Extension (Secondary)
|
|
136
|
+
|
|
137
|
+
**Status**: Available on VS Code Marketplace.
|
|
138
|
+
|
|
139
|
+
Search for "Loki Mode" in VS Code Extensions, or:
|
|
140
|
+
|
|
141
|
+
```bash
|
|
142
|
+
code --install-extension asklokesh.loki-mode
|
|
143
|
+
```
|
|
144
|
+
|
|
145
|
+
**Best for**: VS Code users who want dashboard integration within their editor.
|
|
@@ -169,7 +169,7 @@ Loki Mode:
|
|
|
169
169
|
**Verdict: Loki Mode wins** - Simpler, lighter footprint.
|
|
170
170
|
|
|
171
171
|
### 10. Cursor Scale Patterns (v3.3.0)
|
|
172
|
-
Loki Mode now incorporates proven patterns from Cursor's
|
|
172
|
+
Loki Mode now incorporates proven patterns from Cursor's large-scale agent deployments:
|
|
173
173
|
- Recursive sub-planners
|
|
174
174
|
- Judge agents for cycle decisions
|
|
175
175
|
- Optimistic concurrency control
|
|
@@ -9,9 +9,9 @@
|
|
|
9
9
|
|
|
10
10
|
| Dimension | Cursor | Loki Mode | Winner |
|
|
11
11
|
|-----------|--------|-----------|--------|
|
|
12
|
-
| **Proven Scale** | 1M+ LoC,
|
|
12
|
+
| **Proven Scale** | 1M+ LoC, large agent count | Benchmarks only | Cursor |
|
|
13
13
|
| **Research Foundation** | Empirical iteration | 25+ academic citations | Loki Mode |
|
|
14
|
-
| **Quality Assurance** | Workers self-manage |
|
|
14
|
+
| **Quality Assurance** | Workers self-manage | 9-gate system + anti-sycophancy | Loki Mode |
|
|
15
15
|
| **Anti-Sycophancy** | Not mentioned | CONSENSAGENT blind review | Loki Mode |
|
|
16
16
|
| **Velocity-Quality Balance** | Not mentioned | arXiv-backed metrics | Loki Mode |
|
|
17
17
|
| **Full SDLC Coverage** | Code generation focus | PRD to production + growth | Loki Mode |
|
|
@@ -66,7 +66,7 @@ velocity_quality_balance:
|
|
|
66
66
|
|
|
67
67
|
---
|
|
68
68
|
|
|
69
|
-
### 3.
|
|
69
|
+
### 3. 9-Gate Quality System
|
|
70
70
|
|
|
71
71
|
**Loki Mode's Gates:**
|
|
72
72
|
1. Input Guardrails - Validate scope, detect injection (OpenAI SDK pattern)
|
|
@@ -122,7 +122,7 @@ BOOTSTRAP -> DISCOVERY -> ARCHITECTURE -> INFRASTRUCTURE
|
|
|
122
122
|
-> DEVELOPMENT -> QA -> DEPLOYMENT -> GROWTH (continuous)
|
|
123
123
|
```
|
|
124
124
|
|
|
125
|
-
**41 Specialized Agent Types across
|
|
125
|
+
**41 Specialized Agent Types across 8 swarms:**
|
|
126
126
|
- Engineering (8 types)
|
|
127
127
|
- Operations (8 types)
|
|
128
128
|
- Business (8 types)
|
|
@@ -174,7 +174,7 @@ Cursor learned through failure:
|
|
|
174
174
|
### 3. Simplicity Principle
|
|
175
175
|
> "A surprising amount of the system's behavior comes down to how we prompt the agents. The harness and models matter, but the prompts matter more."
|
|
176
176
|
|
|
177
|
-
**Loki Mode:** More complex infrastructure (
|
|
177
|
+
**Loki Mode:** More complex infrastructure (9 gates, 41 agent types, memory systems). May be over-engineered for some use cases.
|
|
178
178
|
|
|
179
179
|
---
|
|
180
180
|
|
|
@@ -184,7 +184,7 @@ We incorporated Cursor's proven patterns:
|
|
|
184
184
|
|
|
185
185
|
1. **Recursive Sub-Planners** - Planning scales horizontally
|
|
186
186
|
2. **Judge Agents** - Explicit CONTINUE/COMPLETE/ESCALATE/PIVOT decisions
|
|
187
|
-
3. **Optimistic Concurrency** - No locks, scales
|
|
187
|
+
3. **Optimistic Concurrency** - No locks, scales horizontally
|
|
188
188
|
4. **Scale-Aware Review** - Full review for high-risk only at scale
|
|
189
189
|
|
|
190
190
|
---
|
|
@@ -192,7 +192,7 @@ We incorporated Cursor's proven patterns:
|
|
|
192
192
|
## Conclusion
|
|
193
193
|
|
|
194
194
|
**Loki Mode is scientifically better in:**
|
|
195
|
-
- Quality assurance (research-backed
|
|
195
|
+
- Quality assurance (research-backed 9-gate system)
|
|
196
196
|
- Anti-sycophancy (CONSENSAGENT blind review)
|
|
197
197
|
- Velocity-quality balance (arXiv metrics)
|
|
198
198
|
- Full SDLC coverage (PRD to growth)
|
package/docs/thick2thin.md
CHANGED
|
@@ -49,7 +49,7 @@ skills/
|
|
|
49
49
|
|
|
50
50
|
references/ (unchanged)
|
|
51
51
|
+-- 18 detailed reference files
|
|
52
|
-
+-- agents.md (23KB) - Full
|
|
52
|
+
+-- agents.md (23KB) - Full 41 agent specs
|
|
53
53
|
+-- openai-patterns.md, lab-research-patterns.md, etc.
|
|
54
54
|
```
|
|
55
55
|
|
|
@@ -117,7 +117,7 @@ Content that didn't exist in v2.38.0:
|
|
|
117
117
|
| Handoff message format | A2A specification | skills/agents.md |
|
|
118
118
|
| Agentic patterns table | awesome-agentic-patterns | skills/agents.md |
|
|
119
119
|
| "Ralph Wiggum Mode" insight | moridinamael | skills/agents.md |
|
|
120
|
-
| Full
|
|
120
|
+
| Full 41 agent reference | references/agent-types.md | skills/agents.md (pointer) |
|
|
121
121
|
| References directory listing | New | skills/00-index.md |
|
|
122
122
|
|
|
123
123
|
---
|
package/mcp/__init__.py
CHANGED
package/package.json
CHANGED
|
@@ -6,7 +6,7 @@ Complete definitions and capabilities for all 41 specialized agent types.
|
|
|
6
6
|
|
|
7
7
|
## Overview
|
|
8
8
|
|
|
9
|
-
Loki Mode has 41 predefined agent types organized into
|
|
9
|
+
Loki Mode has 41 predefined agent types organized into 8 specialized swarms (37 domain agents + 4 orchestration agents). The orchestrator spawns only the agents needed for your project -- typically 5-10 for simple projects, more for complex ones.
|
|
10
10
|
|
|
11
11
|
---
|
|
12
12
|
|
|
@@ -98,7 +98,7 @@ Loki Mode has 41 predefined agent types organized into 7 specialized swarms (37
|
|
|
98
98
|
|
|
99
99
|
## Orchestration Swarm (4 types)
|
|
100
100
|
|
|
101
|
-
> **Source:** [Cursor Scaling Learnings](./cursor-learnings.md) - patterns proven at
|
|
101
|
+
> **Source:** [Cursor Scaling Learnings](./cursor-learnings.md) - patterns proven at large agent scale
|
|
102
102
|
|
|
103
103
|
| Agent | Capabilities |
|
|
104
104
|
|-------|-------------|
|
package/references/agents.md
CHANGED
|
@@ -2,7 +2,7 @@
|
|
|
2
2
|
|
|
3
3
|
Complete specifications for all 41 specialized agent types in the Loki Mode multi-agent system (37 domain agents + 4 orchestration agents).
|
|
4
4
|
|
|
5
|
-
**Note:** These are agent TYPE definitions, not a fixed count. Loki Mode dynamically spawns agents based on project needs - a simple todo app might use 5-10 agents, while a complex startup
|
|
5
|
+
**Note:** These are agent TYPE definitions, not a fixed count. Loki Mode dynamically spawns agents based on project needs - a simple todo app might use 5-10 agents, while a complex startup spawns more as needed.
|
|
6
6
|
|
|
7
7
|
## Agent Role Prompt Template
|
|
8
8
|
|
|
@@ -182,7 +182,7 @@ Dexter shows value of domain specialization. Our 41 agent types follow this patt
|
|
|
182
182
|
- Most haven't scaled across enterprise
|
|
183
183
|
|
|
184
184
|
### Loki Mode Alignment
|
|
185
|
-
- Multi-agent architecture (41 types,
|
|
185
|
+
- Multi-agent architecture (41 types, 8 swarms)
|
|
186
186
|
- Plan Agents (orchestrator, planner)
|
|
187
187
|
- Execution Agents (eng-*, ops-*, biz-*)
|
|
188
188
|
- Security controls (LOKI_SANDBOX_MODE, LOKI_BLOCKED_COMMANDS)
|
|
@@ -6,7 +6,7 @@ Full RARV cycle, CONTINUITY.md template, and autonomy rules.
|
|
|
6
6
|
|
|
7
7
|
## Autonomy Rules
|
|
8
8
|
|
|
9
|
-
**This system runs with
|
|
9
|
+
**This system runs with minimal human intervention.** Human oversight is expected for deployment credentials, domain setup, API keys, and critical business decisions.
|
|
10
10
|
|
|
11
11
|
### Core Rules
|
|
12
12
|
1. **NEVER ask questions** - Do not say "Would you like me to...", "Should I...", or "What would you prefer?"
|
package/skills/00-index.md
CHANGED
|
@@ -41,7 +41,7 @@
|
|
|
41
41
|
|
|
42
42
|
### quality-gates.md
|
|
43
43
|
**When:** Code review, pre-commit checks, quality assurance
|
|
44
|
-
-
|
|
44
|
+
- 9-gate quality system
|
|
45
45
|
- Blind review + anti-sycophancy
|
|
46
46
|
- Velocity-quality feedback loop (arXiv research)
|
|
47
47
|
- Mandatory quality checks per task
|
package/skills/agents.md
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
# Agent Dispatch & Structured Prompting
|
|
2
2
|
|
|
3
|
-
> **Full agent type definitions:** See `references/agent-types.md` for complete 41 agent role specifications across
|
|
3
|
+
> **Full agent type definitions:** See `references/agent-types.md` for complete 41 agent role specifications across 8 swarms (Engineering, Operations, Business, Data, Product, Growth, Review, Orchestration).
|
|
4
4
|
|
|
5
5
|
---
|
|
6
6
|
|
|
@@ -245,7 +245,7 @@ Priority order for context:
|
|
|
245
245
|
|
|
246
246
|
---
|
|
247
247
|
|
|
248
|
-
## The
|
|
248
|
+
## The 41 Agent Roles (37 Domain + 4 Orchestration)
|
|
249
249
|
|
|
250
250
|
See `references/agent-types.md` for complete specifications. Summary:
|
|
251
251
|
|
|
@@ -259,4 +259,4 @@ See `references/agent-types.md` for complete specifications. Summary:
|
|
|
259
259
|
| Growth | hacker, community, success, lifecycle | 4 |
|
|
260
260
|
| Review | code, business, security | 3 |
|
|
261
261
|
|
|
262
|
-
**Spawn only what you need.** Simple project: 5-10 agents. Complex startup:
|
|
262
|
+
**Spawn only what you need.** Simple project: 5-10 agents. Complex startup: more as needed.
|
package/skills/artifacts.md
CHANGED
package/skills/quality-gates.md
CHANGED
|
@@ -2,7 +2,7 @@
|
|
|
2
2
|
|
|
3
3
|
**Never ship code without passing all quality gates.**
|
|
4
4
|
|
|
5
|
-
## The
|
|
5
|
+
## The 9 Quality Gates
|
|
6
6
|
|
|
7
7
|
1. **Input Guardrails** - Validate scope, detect injection, check constraints (OpenAI SDK)
|
|
8
8
|
2. **Static Analysis** - CodeQL, ESLint/Pylint, type checking
|
|
@@ -11,6 +11,8 @@
|
|
|
11
11
|
5. **Output Guardrails** - Validate code quality, spec compliance, no secrets (tripwire on fail)
|
|
12
12
|
6. **Severity-Based Blocking** - Critical/High/Medium = BLOCK; Low/Cosmetic = TODO comment
|
|
13
13
|
7. **Test Coverage Gates** - Unit: 100% pass, >80% coverage; Integration: 100% pass
|
|
14
|
+
8. **Mock Detector** - Classifies internal vs external mocks; flags tests that never import source code, tautological assertions, and high internal mock ratios
|
|
15
|
+
9. **Test Mutation Detector** - Detects assertion value changes alongside implementation changes (test fitting), low assertion density, and missing pass/fail tracking
|
|
14
16
|
|
|
15
17
|
## Guardrails Execution Modes
|
|
16
18
|
|
|
@@ -472,7 +474,7 @@ See `references/quality-control.md` for complete details.
|
|
|
472
474
|
|
|
473
475
|
## Scale Considerations
|
|
474
476
|
|
|
475
|
-
> **Source:** [Cursor Scaling Learnings](../references/cursor-learnings.md) - integrators became bottlenecks at
|
|
477
|
+
> **Source:** [Cursor Scaling Learnings](../references/cursor-learnings.md) - integrators became bottlenecks at high agent counts
|
|
476
478
|
|
|
477
479
|
### Review Intensity Scaling
|
|
478
480
|
|