npm - loki-mode - Versions diffs - 5.49.0 → 5.49.2 - Mend

loki-mode 5.49.0 → 5.49.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (28) hide show

package/README.md +70 -121
package/SKILL.md +3 -3
package/VERSION +1 -1
package/autonomy/CONSTITUTION.md +4 -4
package/autonomy/app-runner.sh +9 -0
package/autonomy/loki +107 -0
package/autonomy/run.sh +170 -4
package/dashboard/__init__.py +1 -1
package/dashboard/server.py +172 -20
package/dashboard/static/index.html +1 -1
package/docs/COMPARISON.md +15 -15
package/docs/COMPETITIVE-ANALYSIS.md +4 -4
package/docs/INSTALLATION.md +20 -12
package/docs/alternative-installations.md +145 -0
package/docs/auto-claude-comparison.md +1 -1
package/docs/cursor-comparison.md +7 -7
package/docs/thick2thin.md +2 -2
package/mcp/__init__.py +1 -1
package/package.json +1 -1
package/references/agent-types.md +2 -2
package/references/agents.md +1 -1
package/references/competitive-analysis.md +1 -1
package/references/core-workflow.md +1 -1
package/skills/00-index.md +1 -1
package/skills/agents.md +3 -3
package/skills/artifacts.md +1 -1
package/skills/parallel-workflows.md +1 -1
package/skills/quality-gates.md +4 -2

package/docs/COMPARISON.md CHANGED Viewed

@@ -12,7 +12,7 @@
 | Feature | **Loki Mode** | **Zencoder** | **Devin** | **OpenAI Codex** | **Cursor** | **Claude Code** | **Kiro** | **Antigravity** | **Amazon Q** | **OpenCode** |
 |---------|--------------|--------------|-----------|-----------------|------------|-----------------|----------|-----------------|--------------|--------------|
 | **Type** | Skill/Framework | Enterprise Platform | Standalone Agent | Cloud Agent | AI IDE | CLI Agent | AI IDE | AI IDE | Cloud Agent | AI IDE (OSS) |
-| **Autonomy Level** | Full (zero human) | High | Full | High | Medium-High | High | High | High | High | High |
+| **Autonomy Level** | High (minimal human) | High | Full | High | Medium-High | High | High | High | High | High |
 | **Max Runtime** | Unlimited | Async/Scheduled | Hours | Per-task | Session | Session | Days | Async | Per-task | Session |
 | **Pricing** | Free (OSS) | Enterprise | $20/mo | ChatGPT Plus | $20/mo | API costs | Free preview | Free preview | $19/mo | Free (OSS) |
 | **Open Source** | Yes | No | No | No | No | No | No | No | No | Yes |
@@ -24,7 +24,7 @@
 | Feature | **Loki Mode** | **Devin** | **Codex** | **Cursor** | **Kiro** | **Antigravity** | **Amazon Q** | **OpenCode** |
 |---------|--------------|-----------|-----------|------------|----------|-----------------|--------------|--------------|
-| **Multi-Agent** | 41 agents in 7 swarms | Single | Single | Up to 8 parallel | Background | Manager Surface | Multiple types | 4 built-in |
+| **Multi-Agent** | 41 agents in 8 swarms | Single | Single | Up to 8 parallel | Background | Manager Surface | Multiple types | 4 built-in |
 | **Orchestration** | Full orchestrator | N/A | N/A | Git worktree | Hooks | Manager view | Workflow | Subagents |
 | **Parallel Exec** | 10+ Haiku, 4 impl (worktree) | No | No | 8 max | Yes | Yes | Yes | Yes |
 | **Agent Swarms** | Eng, Ops, Business, Data, Product, Growth, Review | N/A | N/A | N/A | N/A | N/A | 3 types | N/A |
@@ -37,7 +37,7 @@
 |---------|--------------|-----------|-----------|------------|----------|-----------------|--------------|--------------|
 | **Code Review** | 3 blind reviewers + devil's advocate | Basic | Basic | BugBot PR | Property-based | Artifacts | Doc/Review | Basic |
 | **Anti-Sycophancy** | Yes (CONSENSAGENT) | No | No | No | No | No | No | No |
-| **Quality Gates** | 7 gates + PBT | Basic | Sandbox | Tests | Spec validation | Artifact checks | Tests | Permissions |
+| **Quality Gates** | 9 gates + PBT | Basic | Sandbox | Tests | Spec validation | Artifact checks | Tests | Permissions |
 | **Constitutional AI** | Yes (principles) | No | Refusal training | No | No | No | No | No |
 ---
@@ -146,7 +146,7 @@
 | Feature | **Zencoder** | **Loki Mode** | **Assessment** |
 |---------|-------------|---------------|----------------|
-| **Four Pillars** | Structured Workflows, SDD, Multi-Agent Verification, Parallel Execution | SDLC + RARV + 7 Gates + Worktrees | TIE |
+| **Four Pillars** | Structured Workflows, SDD, Multi-Agent Verification, Parallel Execution | SDLC + RARV + 9 Gates + Worktrees | TIE |
 | **Spec-Driven Dev** | Specs as first-class objects | OpenAPI-first | TIE |
 | **Multi-Agent Verification** | Model diversity (Claude vs OpenAI, 54% improvement) | 3 blind reviewers + devil's advocate | Different approach (N/A for Claude Code - only Claude models) |
 | **Quality Gates** | Built-in verification loops | 7 explicit gates + anti-sycophancy | **Loki Mode** |
@@ -180,9 +180,9 @@
 1. **Quality Control**: 7 explicit gates + blind review + devil's advocate vs built-in loops
 2. **Memory System**: 3-tier (episodic/semantic/procedural) with cross-project learning
-3. **Agent Specialization**: 41 pre-defined specialized agents across 7 swarms
+3. **Agent Specialization**: 41 pre-defined specialized agents across 8 swarms
 4. **Anti-Sycophancy**: CONSENSAGENT patterns prevent reviewer groupthink
-5. **Autonomy Design**: Zero human intervention from PRD to production
+5. **Autonomy Design**: Minimal human intervention from PRD to production
 6. **Research Foundation**: 10+ academic papers integrated vs proprietary
 ### Where Zencoder EXCEEDS Loki Mode
@@ -203,13 +203,13 @@
 |---------|--------------|---------|-----------------|------------|-----------------|---------------------|----------------|
 | **Stars** | 594 | 11,903 | 35K+ | 26K+ | 13.7K | N/A | N/A |
 | **npm/wk** | 6.1K | 21.4K | N/A | N/A | N/A | N/A | N/A |
-| **Agents** | 41 in 7 swarms | 11 agents | Fresh per task | 108 agents | Swarm-based | 32 agents | N/A |
+| **Agents** | 41 in 8 swarms | 11 agents | Fresh per task | 108 agents | Swarm-based | 32 agents | N/A |
 | **Skills** | Progressive disclosure | 6 slash commands | N/A | 129 skills | N/A | 35 skills | Memory focus |
 | **Multi-Provider** | Yes (Claude/Codex/Gemini) | 3 CLIs (separate) | No | No | No | No | No |
 | **Memory System** | 3-tier (episodic/semantic/procedural) | None | N/A | N/A | Hybrid | N/A | SQLite+FTS5 |
-| **Quality Gates** | 7 gates + Completion Council | User verify only | Two-Stage Review | N/A | Consensus | Tiered | N/A |
+| **Quality Gates** | 9 gates + Completion Council | User verify only | Two-Stage Review | N/A | Consensus | Tiered | N/A |
 | **Context Mgmt** | Standard | Fresh per task (core innovation) | Fresh per task | N/A | N/A | N/A | Progressive |
-| **Autonomy** | Full (zero human) | Semi (checkpoints) | Human-guided | Human-guided | Orchestrated | Human-guided | N/A |
+| **Autonomy** | High (minimal human) | Semi (checkpoints) | Human-guided | Human-guided | Orchestrated | Human-guided | N/A |
 ### What Loki Mode LACKS (Honest Assessment)
@@ -232,11 +232,11 @@ These are patterns from competing projects that are **practically and scientific
 |----------|---------|-------------------------|
 | **Multi-Provider Support** | Only skill supporting Claude, Codex, and Gemini with graceful degradation | All 8 competitors are Claude-only |
 | **RARV Cycle** | Reason-Act-Reflect-Verify is more rigorous than Plan-Execute | Most use simple Plan-Execute |
-| **7-Gate Quality System** | Static analysis + 3 reviewers + devil's advocate + anti-sycophancy + severity blocking + coverage + debate | Superpowers has 2-stage, others have less |
+| **9-Gate Quality System** | Static analysis + 3 reviewers + devil's advocate + anti-sycophancy + severity blocking + coverage + debate | Superpowers has 2-stage, others have less |
 | **Constitutional AI Integration** | Principles-based self-critique from Anthropic research | None have this |
 | **Anti-Sycophancy (CONSENSAGENT)** | Blind review + devil's advocate prevents groupthink | None have this |
 | **Provider Abstraction Layer** | Clean degradation from full-featured to sequential-only | Claude-only projects can't degrade |
-| **41 Specialized Agents** | Purpose-built agents in 7 swarms vs generic | agents (108) has more but less organized |
+| **41 Specialized Agents** | Purpose-built agents in 8 swarms vs generic | agents (108) has more but less organized |
 | **Research Foundation** | 10+ academic papers integrated with citations | Most have no research backing |
 ### Superpowers Deep-Dive (35K+ Stars)
@@ -342,7 +342,7 @@ Tiered agent architecture with explicit escalation:
 | Agent | Killer Feature |
 |-------|---------------|
-| **Loki Mode** | Zero-human-intervention full SDLC, 41 agents in 7 swarms, Constitutional AI, anti-sycophancy, cross-project learning, code transformation, property-based testing |
+| **Loki Mode** | Minimal-human-intervention full SDLC, 41 agents in 8 swarms, Constitutional AI, anti-sycophancy, cross-project learning, code transformation, property-based testing |
 | **Devin** | Full software engineer persona, Slack integration, 67% PR merge rate |
 | **OpenAI Codex** | Skills marketplace, $skill-creator, GPT-5.2-Codex, secure sandbox |
 | **Cursor** | 8 parallel agents, BugBot, Memories, $10B valuation, Composer model (250 tok/s) |
@@ -357,9 +357,9 @@ Tiered agent architecture with explicit escalation:
 | Dimension | Loki Mode Advantage |
 |-----------|-------------------|
-| **Autonomy** | Only agent designed for TRUE zero human intervention |
-| **Multi-Agent** | 41 specialized agents in 7 swarms vs 1-8 in competitors |
-| **Quality** | 7 gates + blind review + devil's advocate + property-based testing |
+| **Autonomy** | Designed for high autonomy with minimal human intervention |
+| **Multi-Agent** | 41 specialized agents in 8 swarms vs 1-8 in competitors |
+| **Quality** | 9 gates + blind review + devil's advocate + property-based testing |
 | **Research** | 10+ academic papers integrated vs proprietary/undisclosed |
 | **Anti-Sycophancy** | Only agent with CONSENSAGENT-based blind review |
 | **Memory** | 3-tier memory (episodic/semantic/procedural) + review learning + cross-project |

package/docs/COMPETITIVE-ANALYSIS.md CHANGED Viewed

@@ -20,7 +20,7 @@ GSD is the closest competitor -- a context engineering system that spawns fresh
 | Adoption | 594 stars, 6K/wk npm | 11,903 stars, 21K/wk npm | GSD (20x) |
 | Simplicity | Complex (5.4K-line run.sh, 12 Python modules) | Simple (markdown agents + slash commands) | GSD |
 | Full autonomy | Walk away, come back to deployed product | Human checkpoints at discuss/verify/milestone | Loki |
-| Quality gates | 7-gate + Completion Council + anti-sycophancy | User verification only | Loki |
+| Quality gates | 9-gate + Completion Council + anti-sycophancy | User verification only | Loki |
 | Memory system | Episodic/semantic/procedural + vector search | None | Loki |
 | Context management | Standard | Fresh subagent contexts per task (core innovation) | GSD |
 | Time to value | Learn architecture, understand CLI flags | `npx get-shit-done-cc` and go | GSD |
@@ -37,9 +37,9 @@ GSD is the closest competitor -- a context engineering system that spawns fresh
 |---------|-----------|-------------|---------|--------|--------------|-------|
 | **GitHub Stars** | 594 | 13,700 | 62,400 | 25,000+ | N/A (Commercial) | N/A (Commercial) |
 | **Agent Count** | 41 types | 64+ agents | 5 roles | Unlimited | 8 parallel | 1 autonomous |
-| **Parallel Execution** | Yes (100+) | Yes (swarms) | Sequential | Yes (crews) | Yes (8 worktrees) | Yes (fleet) |
-| **Published Benchmarks** | **98.78% HumanEval (multi-agent)** | None | 85.9-87.7% HumanEval | None | ~250 tok/s | 15% complex tasks |
-| **SWE-bench Score** | **99.67% patch gen (299/300)** | Unknown | Unknown | Unknown | Unknown | 15% complex |
+| **Parallel Execution** | Yes (multi-agent) | Yes (swarms) | Sequential | Yes (crews) | Yes (8 worktrees) | Yes (fleet) |
+| **Published Benchmarks** | 98.78% HumanEval (self-reported, max 3 retries) | None | 85.9-87.7% HumanEval | None | ~250 tok/s | 15% complex tasks |
+| **SWE-bench Score** | 99.67% patch gen (unevaluated, 299/300) | Unknown | Unknown | Unknown | Unknown | 15% complex |
 | **Full SDLC** | Yes (8 phases) | Yes | Partial | Partial | No | Partial |
 | **Business Ops** | **Yes (8 agents)** | No | No | No | No | No |
 | **Enterprise Security** | `--dangerously-skip-permissions` | MCP sandboxed | Sandboxed | Audit logs, RBAC | Staged autonomy | Sandboxed |

package/docs/INSTALLATION.md CHANGED Viewed

@@ -2,11 +2,11 @@
 The flagship product of [Autonomi](https://www.autonomi.dev/). Complete installation instructions for all platforms and use cases.
-**Version:** v5.49.0
+**Version:** v5.49.2
 ---
-## What's New in v5.39.0
+## What's New in v5.49.1
 ### Enterprise Security (v5.36.0-v5.37.1)
 - TLS/HTTPS support for dashboard connections
@@ -63,7 +63,7 @@ npm install -g loki-mode
 brew tap asklokesh/tap && brew install loki-mode
 # Option C: Docker
-docker pull asklokesh/loki-mode:5.32.0
+docker pull asklokesh/loki-mode:latest
 # Option D: Git clone
 git clone https://github.com/asklokesh/loki-mode.git ~/.claude/skills/loki-mode
@@ -160,6 +160,10 @@ Install via npm for the easiest setup with automatic PATH configuration.
 npm install -g loki-mode
 # The skill is automatically installed to ~/.claude/skills/loki-mode
+# Opt out of anonymous install telemetry:
+# LOKI_TELEMETRY_DISABLED=true npm install -g loki-mode
+# Or set DO_NOT_TRACK=1
 ```
 ### Usage
@@ -207,8 +211,8 @@ brew tap asklokesh/tap
 # Install Loki Mode
 brew install loki-mode
-# Set up Claude Code skill integration
-loki-mode-install-skill
+# Set up Claude Code skill integration (manual symlink required)
+ln -sf "$(brew --prefix)/opt/loki-mode/libexec" ~/.claude/skills/loki-mode
 ```
 ### Dependencies
@@ -254,7 +258,7 @@ Run Loki Mode in a container for isolated execution.
 ```bash
 # Pull the image
-docker pull asklokesh/loki-mode:5.32.0
+docker pull asklokesh/loki-mode:latest
 # Or use docker-compose
 curl -o docker-compose.yml https://raw.githubusercontent.com/asklokesh/loki-mode/main/docker-compose.yml
@@ -264,10 +268,10 @@ curl -o docker-compose.yml https://raw.githubusercontent.com/asklokesh/loki-mode
 ```bash
 # Run with a PRD file
-docker run -v $(pwd):/workspace -w /workspace asklokesh/loki-mode:5.32.0 start ./my-prd.md
+docker run -v $(pwd):/workspace -w /workspace asklokesh/loki-mode:latest start ./my-prd.md
 # Interactive mode
-docker run -it -v $(pwd):/workspace -w /workspace asklokesh/loki-mode:5.32.0
+docker run -it -v $(pwd):/workspace -w /workspace asklokesh/loki-mode:latest
 # Using docker-compose
 docker-compose run loki start ./my-prd.md
@@ -280,7 +284,7 @@ Pass your configuration via environment variables:
 ```bash
 docker run -e LOKI_MAX_RETRIES=100 -e LOKI_BASE_WAIT=120 \
   -v $(pwd):/workspace -w /workspace \
-  asklokesh/loki-mode:5.32.0 start ./my-prd.md
+  asklokesh/loki-mode:latest start ./my-prd.md
 ```
 ### Updating
@@ -396,12 +400,12 @@ Pass the provider as an environment variable:
 # Use Codex with Docker
 docker run -e LOKI_PROVIDER=codex \
   -v $(pwd):/workspace -w /workspace \
-  asklokesh/loki-mode:5.32.0 start ./my-prd.md
+  asklokesh/loki-mode:latest start ./my-prd.md
 # Use Gemini with Docker
 docker run -e LOKI_PROVIDER=gemini \
   -v $(pwd):/workspace -w /workspace \
-  asklokesh/loki-mode:5.32.0 start ./my-prd.md
+  asklokesh/loki-mode:latest start ./my-prd.md
 ```
 ### Degraded Mode
@@ -652,7 +656,11 @@ Add the source command to your startup file so completions load every time you o
 Add this line to your `~/.bashrc` (Linux) or `~/.bash_profile` (macOS):
 ```bash
-source /path/to/loki/completions/loki.bash
+# npm install: use the npm package path
+source "$(npm root -g)/loki-mode/completions/loki.bash"
+# git clone: use the skills directory
+source ~/.claude/skills/loki-mode/completions/loki.bash
 ```
 ---

package/docs/alternative-installations.md ADDED Viewed

@@ -0,0 +1,145 @@
+# Alternative Installation Methods
+The primary installation method is git clone (see [README](../README.md#installation)). These alternatives serve specific use cases.
+---
+## npm (Secondary)
+**Status**: Working. Version tracks releases automatically.
+```bash
+npm install -g loki-mode
+```
+**Limitation**: Installs to `node_modules`, not `~/.claude/skills/`. To use as a Claude Code skill, you must symlink:
+```bash
+npm install -g loki-mode
+ln -sf "$(npm root -g)/loki-mode" ~/.claude/skills/loki-mode
+```
+**Best for**: CI/CD pipelines, programmatic access via `loki` CLI.
+---
+## Homebrew (Secondary)
+**Status**: Working. Tap and formula exist, version current.
+```bash
+brew tap asklokesh/tap
+brew install loki-mode
+```
+**Limitation**: Installs the `loki` CLI binary only. Does NOT install the Claude Code skill. To use with Claude Code, also run:
+```bash
+git clone https://github.com/asklokesh/loki-mode.git ~/.claude/skills/loki-mode
+```
+**Best for**: Users who want the `loki` CLI wrapper for autonomous mode (`loki start`, `loki stop`, `loki cleanup`).
+---
+## Docker (Secondary)
+**Status**: Image exists on Docker Hub. Tags: `latest`, version-specific (e.g., `5.49.1`).
+```bash
+docker pull asklokesh/loki-mode:latest
+```
+**Limitation**: Claude Code is an interactive CLI that requires API keys and terminal access. Running it inside a Docker container is not the standard workflow. Docker is useful for:
+- CI/CD sandbox execution (running `loki` in isolated environments)
+- Testing Loki Mode without modifying your local system
+- Air-gapped environments with pre-built images
+**Not recommended for**: Interactive Claude Code sessions. Use the git clone method instead.
+See [DOCKER_README.md](../DOCKER_README.md) for Docker-specific usage instructions.
+---
+## GitHub Action (Secondary)
+**Status**: Working. Adds automated AI code review to pull requests.
+```yaml
+# .github/workflows/loki-review.yml
+name: Loki Code Review
+on:
+  pull_request:
+    types: [opened, synchronize]
+permissions:
+  contents: read
+  pull-requests: write
+jobs:
+  review:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v4
+      - uses: asklokesh/loki-mode@v5
+        with:
+          github_token: ${{ secrets.GITHUB_TOKEN }}
+          mode: review
+          provider: claude
+          max_iterations: 3
+          budget_limit: '5.00'
+        env:
+          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
+```
+**Prerequisites:**
+- API key for your provider (set as repository secret): `ANTHROPIC_API_KEY`, `OPENAI_API_KEY`, or `GOOGLE_API_KEY`
+- The action auto-installs `loki-mode` and `@anthropic-ai/claude-code`
+**Action Inputs:**
+| Input | Default | Description |
+|-------|---------|-------------|
+| `mode` | `review` | `review`, `fix`, or `test` |
+| `provider` | `claude` | `claude`, `codex`, or `gemini` |
+| `budget_limit` | `5.00` | Max cost in USD |
+| `max_iterations` | `3` | Max RARV cycles |
+| `github_token` | (required) | GitHub token for PR comments |
+| `prd_file` | | Path to PRD file (for fix/test modes) |
+**Modes:**
+| Mode | Description |
+|------|-------------|
+| `review` | Analyze PR diff, post structured review as PR comment |
+| `fix` | Automatically fix issues found in the codebase |
+| `test` | Run autonomous test generation and validation |
+**Best for**: Automated PR review and CI/CD integration.
+---
+## GitHub Release Download (Secondary)
+**Status**: Working. Release assets available for each version.
+```bash
+# Download and extract to skills directory
+curl -sL https://github.com/asklokesh/loki-mode/archive/refs/tags/v5.49.1.tar.gz | tar xz
+mv loki-mode-5.49.1 ~/.claude/skills/loki-mode
+```
+**Best for**: Offline or air-gapped environments, pinned version deployments.
+---
+## VS Code Extension (Secondary)
+**Status**: Available on VS Code Marketplace.
+Search for "Loki Mode" in VS Code Extensions, or:
+```bash
+code --install-extension asklokesh.loki-mode
+```
+**Best for**: VS Code users who want dashboard integration within their editor.

package/docs/auto-claude-comparison.md CHANGED Viewed

@@ -169,7 +169,7 @@ Loki Mode:
 **Verdict: Loki Mode wins** - Simpler, lighter footprint.
 ### 10. Cursor Scale Patterns (v3.3.0)
-Loki Mode now incorporates proven patterns from Cursor's 100+ agent deployments:
+Loki Mode now incorporates proven patterns from Cursor's large-scale agent deployments:
 - Recursive sub-planners
 - Judge agents for cycle decisions
 - Optimistic concurrency control

package/docs/cursor-comparison.md CHANGED Viewed

@@ -9,9 +9,9 @@
 | Dimension | Cursor | Loki Mode | Winner |
 |-----------|--------|-----------|--------|
-| **Proven Scale** | 1M+ LoC, 100+ agents | Benchmarks only | Cursor |
+| **Proven Scale** | 1M+ LoC, large agent count | Benchmarks only | Cursor |
 | **Research Foundation** | Empirical iteration | 25+ academic citations | Loki Mode |
-| **Quality Assurance** | Workers self-manage | 7-gate system + anti-sycophancy | Loki Mode |
+| **Quality Assurance** | Workers self-manage | 9-gate system + anti-sycophancy | Loki Mode |
 | **Anti-Sycophancy** | Not mentioned | CONSENSAGENT blind review | Loki Mode |
 | **Velocity-Quality Balance** | Not mentioned | arXiv-backed metrics | Loki Mode |
 | **Full SDLC Coverage** | Code generation focus | PRD to production + growth | Loki Mode |
@@ -66,7 +66,7 @@ velocity_quality_balance:
 ---
-### 3. 7-Gate Quality System
+### 3. 9-Gate Quality System
 **Loki Mode's Gates:**
 1. Input Guardrails - Validate scope, detect injection (OpenAI SDK pattern)
@@ -122,7 +122,7 @@ BOOTSTRAP -> DISCOVERY -> ARCHITECTURE -> INFRASTRUCTURE
      -> DEVELOPMENT -> QA -> DEPLOYMENT -> GROWTH (continuous)
 ```
-**41 Specialized Agent Types across 7 swarms:**
+**41 Specialized Agent Types across 8 swarms:**
 - Engineering (8 types)
 - Operations (8 types)
 - Business (8 types)
@@ -174,7 +174,7 @@ Cursor learned through failure:
 ### 3. Simplicity Principle
 > "A surprising amount of the system's behavior comes down to how we prompt the agents. The harness and models matter, but the prompts matter more."
-**Loki Mode:** More complex infrastructure (7 gates, 41 agent types, memory systems). May be over-engineered for some use cases.
+**Loki Mode:** More complex infrastructure (9 gates, 41 agent types, memory systems). May be over-engineered for some use cases.
 ---
@@ -184,7 +184,7 @@ We incorporated Cursor's proven patterns:
 1. **Recursive Sub-Planners** - Planning scales horizontally
 2. **Judge Agents** - Explicit CONTINUE/COMPLETE/ESCALATE/PIVOT decisions
-3. **Optimistic Concurrency** - No locks, scales to 100+ agents
+3. **Optimistic Concurrency** - No locks, scales horizontally
 4. **Scale-Aware Review** - Full review for high-risk only at scale
 ---
@@ -192,7 +192,7 @@ We incorporated Cursor's proven patterns:
 ## Conclusion
 **Loki Mode is scientifically better in:**
-- Quality assurance (research-backed 7-gate system)
+- Quality assurance (research-backed 9-gate system)
 - Anti-sycophancy (CONSENSAGENT blind review)
 - Velocity-quality balance (arXiv metrics)
 - Full SDLC coverage (PRD to growth)

package/docs/thick2thin.md CHANGED Viewed

@@ -49,7 +49,7 @@ skills/
 references/ (unchanged)
   +-- 18 detailed reference files
-  +-- agents.md (23KB) - Full 37 agent specs
+  +-- agents.md (23KB) - Full 41 agent specs
   +-- openai-patterns.md, lab-research-patterns.md, etc.
 ```
@@ -117,7 +117,7 @@ Content that didn't exist in v2.38.0:
 | Handoff message format | A2A specification | skills/agents.md |
 | Agentic patterns table | awesome-agentic-patterns | skills/agents.md |
 | "Ralph Wiggum Mode" insight | moridinamael | skills/agents.md |
-| Full 37 agent reference | references/agent-types.md | skills/agents.md (pointer) |
+| Full 41 agent reference | references/agent-types.md | skills/agents.md (pointer) |
 | References directory listing | New | skills/00-index.md |
 ---

package/mcp/__init__.py CHANGED Viewed

@@ -21,4 +21,4 @@ try:
 except ImportError:
     __all__ = ['mcp']
-__version__ = '5.49.0'
+__version__ = '5.49.2'

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "loki-mode",
-  "version": "5.49.0",
+  "version": "5.49.2",
   "description": "Loki Mode by Autonomi - Multi-agent autonomous startup system for Claude Code, Codex CLI, and Gemini CLI",
   "keywords": [
     "autonomi",

package/references/agent-types.md CHANGED Viewed

@@ -6,7 +6,7 @@ Complete definitions and capabilities for all 41 specialized agent types.
 ## Overview
-Loki Mode has 41 predefined agent types organized into 7 specialized swarms (37 domain agents + 4 orchestration agents). The orchestrator spawns only the agents needed for your project - a simple app might use 5-10 agents, while a complex startup could spawn 100+ agents working in parallel.
+Loki Mode has 41 predefined agent types organized into 8 specialized swarms (37 domain agents + 4 orchestration agents). The orchestrator spawns only the agents needed for your project -- typically 5-10 for simple projects, more for complex ones.
 ---
@@ -98,7 +98,7 @@ Loki Mode has 41 predefined agent types organized into 7 specialized swarms (37
 ## Orchestration Swarm (4 types)
-> **Source:** [Cursor Scaling Learnings](./cursor-learnings.md) - patterns proven at 100+ agent scale
+> **Source:** [Cursor Scaling Learnings](./cursor-learnings.md) - patterns proven at large agent scale
 | Agent | Capabilities |
 |-------|-------------|

package/references/agents.md CHANGED Viewed

@@ -2,7 +2,7 @@
 Complete specifications for all 41 specialized agent types in the Loki Mode multi-agent system (37 domain agents + 4 orchestration agents).
-**Note:** These are agent TYPE definitions, not a fixed count. Loki Mode dynamically spawns agents based on project needs - a simple todo app might use 5-10 agents, while a complex startup could spawn 100+ agents working in parallel.
+**Note:** These are agent TYPE definitions, not a fixed count. Loki Mode dynamically spawns agents based on project needs - a simple todo app might use 5-10 agents, while a complex startup spawns more as needed.
 ## Agent Role Prompt Template

package/references/competitive-analysis.md CHANGED Viewed

@@ -182,7 +182,7 @@ Dexter shows value of domain specialization. Our 41 agent types follow this patt
    - Most haven't scaled across enterprise
 ### Loki Mode Alignment
-- Multi-agent architecture (41 types, 7 swarms)
+- Multi-agent architecture (41 types, 8 swarms)
 - Plan Agents (orchestrator, planner)
 - Execution Agents (eng-*, ops-*, biz-*)
 - Security controls (LOKI_SANDBOX_MODE, LOKI_BLOCKED_COMMANDS)

package/references/core-workflow.md CHANGED Viewed

@@ -6,7 +6,7 @@ Full RARV cycle, CONTINUITY.md template, and autonomy rules.
 ## Autonomy Rules
-**This system runs with ZERO human intervention.**
+**This system runs with minimal human intervention.** Human oversight is expected for deployment credentials, domain setup, API keys, and critical business decisions.
 ### Core Rules
 1. **NEVER ask questions** - Do not say "Would you like me to...", "Should I...", or "What would you prefer?"

package/skills/00-index.md CHANGED Viewed

@@ -41,7 +41,7 @@
 ### quality-gates.md
 **When:** Code review, pre-commit checks, quality assurance
-- 7-gate quality system
+- 9-gate quality system
 - Blind review + anti-sycophancy
 - Velocity-quality feedback loop (arXiv research)
 - Mandatory quality checks per task

package/skills/agents.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # Agent Dispatch & Structured Prompting
-> **Full agent type definitions:** See `references/agent-types.md` for complete 41 agent role specifications across 7 swarms (Engineering, Operations, Business, Data, Product, Growth, Review, Orchestration).
+> **Full agent type definitions:** See `references/agent-types.md` for complete 41 agent role specifications across 8 swarms (Engineering, Operations, Business, Data, Product, Growth, Review, Orchestration).
 ---
@@ -245,7 +245,7 @@ Priority order for context:
 ---
-## The 37 Agent Roles
+## The 41 Agent Roles (37 Domain + 4 Orchestration)
 See `references/agent-types.md` for complete specifications. Summary:
@@ -259,4 +259,4 @@ See `references/agent-types.md` for complete specifications. Summary:
 | Growth | hacker, community, success, lifecycle | 4 |
 | Review | code, business, security | 3 |
-**Spawn only what you need.** Simple project: 5-10 agents. Complex startup: 100+.
+**Spawn only what you need.** Simple project: 5-10 agents. Complex startup: more as needed.

package/skills/artifacts.md CHANGED Viewed

@@ -36,7 +36,7 @@ format: "markdown"
 contents:
   - Phase name and duration
   - Tasks completed (from queue)
-  - Quality gate results (7 gates)
+  - Quality gate results (9 gates)
   - Coverage metrics
   - Known issues / TODOs
 ```

package/skills/parallel-workflows.md CHANGED Viewed

@@ -427,7 +427,7 @@ optimistic_write:
     - No waiting for locks
     - No deadlock risk
     - Failed writes are cheap (just retry)
-    - Scales to 100+ agents
+    - Scales horizontally with agent count
 ```
 ### Implementation

package/skills/quality-gates.md CHANGED Viewed

@@ -2,7 +2,7 @@
 **Never ship code without passing all quality gates.**
-## The 7 Quality Gates
+## The 9 Quality Gates
 1. **Input Guardrails** - Validate scope, detect injection, check constraints (OpenAI SDK)
 2. **Static Analysis** - CodeQL, ESLint/Pylint, type checking
@@ -11,6 +11,8 @@
 5. **Output Guardrails** - Validate code quality, spec compliance, no secrets (tripwire on fail)
 6. **Severity-Based Blocking** - Critical/High/Medium = BLOCK; Low/Cosmetic = TODO comment
 7. **Test Coverage Gates** - Unit: 100% pass, >80% coverage; Integration: 100% pass
+8. **Mock Detector** - Classifies internal vs external mocks; flags tests that never import source code, tautological assertions, and high internal mock ratios
+9. **Test Mutation Detector** - Detects assertion value changes alongside implementation changes (test fitting), low assertion density, and missing pass/fail tracking
 ## Guardrails Execution Modes
@@ -472,7 +474,7 @@ See `references/quality-control.md` for complete details.
 ## Scale Considerations
-> **Source:** [Cursor Scaling Learnings](../references/cursor-learnings.md) - integrators became bottlenecks at 100+ agents
+> **Source:** [Cursor Scaling Learnings](../references/cursor-learnings.md) - integrators became bottlenecks at high agent counts
 ### Review Intensity Scaling