npm - loki-mode - Versions diffs - 7.45.0 → 7.46.0 - Mend

loki-mode 7.45.0 → 7.46.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (45) hide show

package/README.md +16 -12
package/SKILL.md +5 -5
package/VERSION +1 -1
package/autonomy/CONSTITUTION.md +9 -2
package/autonomy/lib/sentrux-gate.sh +1 -1
package/autonomy/loki +2 -2
package/autonomy/run.sh +355 -92
package/dashboard/__init__.py +1 -1
package/dashboard/registry.py +156 -62
package/dashboard/server.py +9 -10
package/docs/COMPARISON.md +10 -10
package/docs/COMPETITIVE-ANALYSIS.md +1 -1
package/docs/INSTALLATION.md +2 -2
package/docs/P0-SWEEP-PLAN.md +163 -0
package/docs/architecture/STATE-MACHINES.md +18 -19
package/docs/architecture/bmad-loki-voice-agent-council-analysis.md +1 -1
package/docs/auto-claude-comparison.md +14 -11
package/docs/certification/01-core-concepts/lesson.md +12 -11
package/docs/certification/01-core-concepts/quiz.md +6 -6
package/docs/certification/05-troubleshooting/lesson.md +23 -13
package/docs/certification/05-troubleshooting/quiz.md +3 -3
package/docs/certification/answer-key.md +2 -2
package/docs/certification/certification-exam.md +9 -9
package/docs/competitive/bolt-new-analysis.md +1 -1
package/docs/competitive/emergence-others-analysis.md +9 -9
package/docs/competitive/replit-lovable-analysis.md +3 -3
package/docs/cursor-comparison.md +15 -12
package/docs/dashboard-guide.md +9 -7
package/docs/prd-purple-lab-platform-v2.md +1 -1
package/docs/prd-purple-lab-platform.md +3 -3
package/docs/show-hn-post.md +2 -2
package/loki-ts/dist/loki.js +2 -2
package/mcp/__init__.py +1 -1
package/package.json +2 -2
package/plugins/loki-mode/.claude-plugin/plugin.json +2 -2
package/plugins/loki-mode/README.md +1 -1
package/references/magic-rarv-integration.md +1 -1
package/references/quality-control.md +5 -5
package/references/sdlc-phases.md +1 -2
package/skills/00-index.md +1 -1
package/skills/artifacts.md +1 -1
package/skills/healing.md +1 -1
package/skills/magic-modules.md +3 -3
package/skills/quality-gates.md +52 -39
package/skills/testing.md +1 -1

package/docs/auto-claude-comparison.md CHANGED Viewed

@@ -120,21 +120,24 @@ Loki Mode implements CONSENSAGENT (ACL 2025):
 **Verdict: Loki Mode wins** - Research-backed quality assurance.
 ### 5. Quality Gates
-Loki Mode has 14 quality gates:
+Loki Mode runs 8 deterministic quality gates plus full SDLC phase coverage.
+The 8 deterministic quality gates: static analysis (CodeQL, ESLint), test suite (pass/fail), blind 3-reviewer review with severity blocking, anti-sycophancy Devil's Advocate, mock-integrity, test-mutation, documentation coverage, and Magic Modules debate. (Backward-compatibility is a conditional healing-mode auditor, not a numbered gate.)
+Beyond the gates, the SDLC pipeline covers these phases:
 1. Static analysis (CodeQL, ESLint)
-2. Unit tests (>80% coverage)
+2. Unit tests (test suite passes; coverage % not measured this release)
 3. API/Integration tests
 4. E2E tests (Playwright)
 5. Security scanning (OWASP)
-6. SAML/OIDC/SSO integration
-7. Parallel code review (3 reviewers)
-8. Performance/load testing
-9. Accessibility (WCAG)
-10. Regression testing
-11. UAT simulation
-12. Anti-sycophancy check
-13. Scale-aware review intensity
-14. Continuous monitoring
+6. Parallel code review (3 reviewers)
+7. Performance/load testing
+8. Accessibility (WCAG)
+9. Regression testing
+10. UAT simulation
+11. Anti-sycophancy check
+12. Scale-aware review intensity
+13. Continuous monitoring
 **Auto-Claude:** Single QA validation loop (up to 50 iterations).

package/docs/certification/01-core-concepts/lesson.md CHANGED Viewed

@@ -105,19 +105,20 @@ Full agent type definitions are in `references/agent-types.md`.
 ## Quality Gates
-Loki Mode enforces a 9-gate quality system. Code must pass all applicable gates before moving forward:
+Loki Mode enforces an 8-gate quality system. Code must pass all applicable gates before moving forward:
 | Gate | Name | Purpose |
 |------|------|---------|
-| 1 | Input Guardrails | Validate scope, detect injection, check constraints |
-| 2 | Static Analysis | CodeQL, ESLint/Pylint, type checking |
-| 3 | Blind Review System | 3 specialist reviewers in parallel, blind to each other |
-| 4 | Anti-Sycophancy Check | If reviewers unanimously approve, run a Devil's Advocate reviewer |
-| 5 | Output Guardrails | Validate code quality, spec compliance, no secrets |
-| 6 | Severity-Based Blocking | Critical/High/Medium = BLOCK; Low/Cosmetic = TODO |
-| 7 | Test Coverage Gates | Unit: 100% pass, >80% coverage; Integration: 100% pass |
-| 8 | Mock Detector | Flags tests that mock internal modules instead of real code |
-| 9 | Test Mutation Detector | Detects assertion value changes alongside implementation changes |
+| 1 | Static Analysis | CodeQL, ESLint/Pylint, type checking |
+| 2 | Test Suite (pass/fail) | Red blocks; coverage % not measured this release |
+| 3 | Blind Code Review (3-reviewer council + severity blocking) | 3 specialist reviewers in parallel, blind to each other; Critical/High = BLOCK; Medium/Low advisory |
+| 4 | Anti-Sycophancy / Devil's Advocate | If reviewers unanimously approve, run a Devil's Advocate reviewer |
+| 5 | Mock Integrity Detector | Flags tests that mock internal modules instead of real code |
+| 6 | Test Mutation Detector | Detects assertion value changes alongside implementation changes |
+| 7 | Documentation Coverage | README exists, docs freshness, API docs for packages |
+| 8 | Magic Modules Debate | Spec-vs-implementation debate on generated Magic Modules |
+A conditional backward-compatibility / legacy-healing auditor also runs in healing mode (not one of the 8 numbered gates).
 The blind review system (Gate 3) selects 3 reviewers from a pool of 5 named specialists:
@@ -179,4 +180,4 @@ Every Loki Mode project uses these files in the `.loki/` directory:
 ## Summary
-Loki Mode is an autonomous multi-agent system that follows the RARV cycle to build software from PRDs. It uses 41 agent types organized into 8 domains, enforces quality through 9 gates with blind peer review, and maintains episodic/semantic/procedural memory for continuous learning. Projects are classified into simple, standard, or complex tiers that determine the number of phases executed.
+Loki Mode is an autonomous multi-agent system that follows the RARV cycle to build software from PRDs. It uses 41 agent types organized into 8 domains, enforces quality through 8 gates with blind peer review, and maintains episodic/semantic/procedural memory for continuous learning. Projects are classified into simple, standard, or complex tiers that determine the number of phases executed.

package/docs/certification/01-core-concepts/quiz.md CHANGED Viewed

@@ -45,7 +45,7 @@ D) test-coverage-auditor
 A) 3
 B) 5
 C) 7
-D) 9
+D) 8
 ---
@@ -67,12 +67,12 @@ D) complex
 ---
-**Question 8:** What is the minimum test coverage required by Gate 7 (Test Coverage Gates)?
+**Question 8:** What does Gate 7 (Documentation Coverage) check?
-A) 50%
-B) 60%
-C) 80%
-D) 100%
+A) That unit test coverage is at least 80%
+B) That every function has an inline comment
+C) That a README exists, docs are fresh within 10 commits, and packages have API docs
+D) That cyclomatic complexity stays under 10
 ---

package/docs/certification/05-troubleshooting/lesson.md CHANGED Viewed

@@ -17,30 +17,40 @@ This module covers diagnosing and resolving common issues in Loki Mode: gate fai
 ## Quality Gate Failures
-When a quality gate fails, identify which gate triggered the failure:
+When a quality gate fails, identify which gate triggered the failure (the 8-gate
+system is detailed in `skills/quality-gates.md`):
-**Gates 1-6 (Review gates):**
+**Gates 1-2 (Static analysis and test suite):**
+- Gate 1 (Static Analysis): fix CodeQL/ESLint/Pylint/type-checker findings
+- Gate 2 (Test Suite): the test runner must pass; red blocks. Coverage % is not
+  measured this release. Fix failing tests before proceeding (never delete or
+  skip tests)
+**Gates 3-4 (Review gates):**
 - Check the review output for severity levels
-- Critical/High/Medium = BLOCK (must fix)
+- Critical/High = BLOCK; Medium/Low advisory (recommended to fix)
 - Low/Cosmetic = TODO (informational)
 - If all 3 reviewers pass unanimously, Gate 4 runs Devil's Advocate
-**Gate 7 (Test coverage):**
-- Unit tests must have 100% pass rate and >80% coverage
-- Integration tests must have 100% pass rate
-- Fix failing tests before proceeding (never delete or skip tests)
-**Gate 8 (Mock detector):**
+**Gate 5 (Mock integrity detector):**
 - Runs `tests/detect-mock-problems.sh`
 - Flags tests that mock internal modules instead of using real code
 - Flags tautological assertions and high internal mock ratios
-- Disable with `LOKI_GATE_MOCK_DETECTOR=false` (not recommended)
+- Disable with `LOKI_GATE_MOCK=false` (not recommended)
-**Gate 9 (Test mutation detector):**
+**Gate 6 (Test mutation detector):**
 - Runs `tests/detect-test-mutations.sh`
 - Detects assertion values changed alongside implementation (test fitting)
-- Detects low assertion density and missing pass/fail tracking
-- Disable with `LOKI_GATE_MUTATION_DETECTOR=false` (not recommended)
+- Detects low assertion density
+- Disable with `LOKI_GATE_MUTATION=false` (not recommended)
+**Gate 7 (Documentation coverage):**
+- Checks README presence, docs freshness within 10 commits, and API docs for packages
+- Disable with `LOKI_GATE_DOC_COVERAGE=false` (not recommended for packages)
+**Gate 8 (Magic Modules debate):**
+- Runs the spec-vs-implementation debate on generated Magic Modules
+- BLOCK-severity findings block; disable with `LOKI_GATE_MAGIC_DEBATE=false`
 ## Circuit Breaker System

package/docs/certification/05-troubleshooting/quiz.md CHANGED Viewed

@@ -67,11 +67,11 @@ D) Removes the entire `.loki/` directory
 ---
-**Question 8:** Which environment variable disables Gate 8 (Mock Detector)?
+**Question 8:** Which environment variable disables Gate 5 (Mock Integrity Detector)?
-A) `LOKI_SKIP_MOCK_CHECK=true`
+A) `LOKI_GATE_MOCK=false`
 B) `LOKI_GATE_MOCK_DETECTOR=false`
-C) `LOKI_DISABLE_GATE_8=true`
+C) `LOKI_DISABLE_GATE_5=true`
 D) `LOKI_NO_MOCK_DETECTION=true`
 ---

package/docs/certification/answer-key.md CHANGED Viewed

@@ -12,10 +12,10 @@ This file contains answers for all module quizzes and the final certification ex
 | 2 | C | 41 agent types: 37 domain + 4 orchestration |
 | 3 | B | After 5 failures, the task moves to `.loki/queue/dead-letter.json` |
 | 4 | C | architecture-strategist is always one of the 3 selected reviewers |
-| 5 | D | 9 quality gates (Input Guardrails through Test Mutation Detector) |
+| 5 | D | 8 quality gates (Static Analysis through Magic Modules Debate); backward-compatibility is a conditional healing-mode auditor, not one of the 8 |
 | 6 | B | Episodic, semantic, and procedural memory |
 | 7 | B | Simple tier uses 3 phases |
-| 8 | C | Gate 7 requires >80% unit test coverage |
+| 8 | C | Gate 7 (Documentation Coverage) checks README presence, docs freshness within 10 commits, and API docs for packages; coverage % is not measured this release |
 | 9 | C | Claude Code supports full features; Codex and Gemini run in degraded mode |
 | 10 | B | If all 3 reviewers unanimously approve, a Devil's Advocate reviewer runs |

package/docs/certification/certification-exam.md CHANGED Viewed

@@ -49,7 +49,7 @@ D) test-coverage-auditor
 A) 3
 B) 5
 C) 7
-D) 9
+D) 8
 ---
@@ -71,12 +71,12 @@ D) complex
 ---
-**Question 8:** What is the minimum test coverage required by Gate 7?
+**Question 8:** What does Gate 7 (Documentation Coverage) check?
-A) 50%
-B) 60%
-C) 80%
-D) 100%
+A) That unit test coverage is at least 80%
+B) That every function has an inline comment
+C) That a README exists, docs are fresh within 10 commits, and packages have API docs
+D) That cyclomatic complexity stays under 10
 ---
@@ -439,11 +439,11 @@ D) Removes the entire `.loki/` directory
 ---
-**Question 48:** Which environment variable disables Gate 8 (Mock Detector)?
+**Question 48:** Which environment variable disables Gate 5 (Mock Integrity Detector)?
-A) `LOKI_GATE_MOCK_DETECTOR=false`
+A) `LOKI_GATE_MOCK=false`
 B) `LOKI_SKIP_MOCK_CHECK=true`
-C) `LOKI_DISABLE_GATE_8=true`
+C) `LOKI_DISABLE_GATE_5=true`
 D) `LOKI_NO_MOCK_DETECTION=true`
 ---

package/docs/competitive/bolt-new-analysis.md CHANGED Viewed

@@ -409,7 +409,7 @@ These are bolt.new weaknesses that Loki Mode already solves or can emphasize:
 #### R5: Advertise Production Readiness as Key Differentiator
 - **bolt.new's gap**: 70% done code, no tests, no review, $5-20K remediation
-- **Loki Mode's advantage**: RARV cycle, 10 quality gates, 3-reviewer system, automated testing
+- **Loki Mode's advantage**: RARV cycle, 8 quality gates, 3-reviewer system, automated testing
 - **Action**: Create comparison content showing: "bolt.new gives you a prototype. Loki Mode gives you a product."
 - **Messaging**: "From PRD to production, not PRD to prototype"

package/docs/competitive/emergence-others-analysis.md CHANGED Viewed

@@ -286,7 +286,7 @@ Developers who value open-source tooling, speed, and terminal-native workflows.
 | **Autonomous Iteration** | No (task-level) | No | Partial (/loop, /schedule) | No (requires prompting) | Yes (RARV loop + completion council) |
 | **SDLC Pipeline** | No | No | No | No | Yes (9 phases) |
 | **Code Review** | No | No | Yes (single-pass) | Yes (single-pass) | Yes (3-reviewer blind) |
-| **Quality Gates** | No | No | No | No | Yes (10 gates) |
+| **Quality Gates** | No | No | No | No | Yes (8 gates) |
 | **Anti-Sycophancy** | No | No | No | No | Yes (devil's advocate) |
 | **Memory System** | Enterprise only | No | CLAUDE.md + auto-memory | Session resumption | Episodic/semantic/procedural |
 | **Self-Hosted** | Partial (Agent-E) | No | Partial (CLI local, but subscription or API required) | Yes (with API key) | Yes (fully, any provider API key) |
@@ -383,10 +383,10 @@ Claude Code and Codex CLI offer single-pass code review. Neither provides:
 - 3-reviewer blind parallel review
 - Anti-sycophancy checks (devil's advocate on unanimous approval)
 - Severity-based blocking gates
-- Test coverage enforcement (>80% unit, 100% pass)
+- Test suite enforcement (100% pass; coverage % not measured this release)
 - Static analysis integration (CodeQL, ESLint)
-**Opportunity:** Loki Mode's 10-gate quality system provides enterprise-grade assurance that no competitor matches.
+**Opportunity:** Loki Mode's 8-gate quality system provides enterprise-grade assurance that no competitor matches.
 ### Gap 4: No Persistent Cross-Project Learning
@@ -433,7 +433,7 @@ Rork generates mobile apps but cannot handle backends, APIs, or infrastructure.
 This positioning highlights three unique capabilities no competitor offers together:
 1. **Autonomous SDLC** (not just coding assistance)
 2. **Multi-provider** (not locked to one vendor)
-3. **Quality-assured** (10-gate system, 3-reviewer blind review)
+3. **Quality-assured** (8-gate system, 3-reviewer blind review)
 ### Differentiation by Competitor
@@ -443,7 +443,7 @@ This positioning highlights three unique capabilities no competitor offers toget
 | Autonomy | Semi-autonomous (/loop, /schedule, but no SDLC orchestration) | Fully autonomous (RARV loop + completion council) |
 | Scope | Individual coding tasks, PR reviews | Full SDLC pipeline (9 phases) |
 | Providers | Claude models only (multi-cloud hosting) | 5 providers, 3+ model families |
-| Quality | Single-pass review, GitHub Action | 10-gate, 3-reviewer blind system, anti-sycophancy |
+| Quality | Single-pass review, GitHub Action | 8-gate, 3-reviewer blind system, anti-sycophancy |
 | Memory | CLAUDE.md + auto-memory (session-scoped) | Episodic/semantic/procedural (cross-project) |
 | Cost model | Subscription with rate limits or API | Self-hosted, pay-per-token, any provider |
 | IDE/surface | Terminal, VS Code, JetBrains, Desktop, Web | Terminal, VS Code (via extension) |
@@ -458,7 +458,7 @@ This positioning highlights three unique capabilities no competitor offers toget
 | Speed | 240+ tokens/sec | Depends on provider |
 | Providers | OpenAI only | 5 providers |
 | Multi-agent | Experimental (isolated) | 41 agent types, 8 domains |
-| Quality | Single-pass review | 10-gate system |
+| Quality | Single-pass review | 8-gate system |
 | **Loki Mode advantage:** | Autonomous pipeline, multi-provider, mature multi-agent |
 #### vs. Emergence AI
@@ -475,7 +475,7 @@ This positioning highlights three unique capabilities no competitor offers toget
 |-----------|------|-----------|
 | Focus | Mobile apps (no-code) | Full-stack software |
 | Target user | Non-technical | Developers + technical teams |
-| Quality | No testing/review | 10-gate quality system |
+| Quality | No testing/review | 8-gate quality system |
 | Output | Mobile app only | Any software type |
 | **Loki Mode advantage:** | Developer-grade, full-stack, quality-assured |
@@ -485,7 +485,7 @@ This positioning highlights three unique capabilities no competitor offers toget
 "You already use AI for coding. Loki Mode makes it autonomous -- give it a PRD, and it handles planning, implementation, testing, code review, and deployment. Keep using Claude or Codex under the hood."
 **For engineering leaders evaluating AI tooling:**
-"Loki Mode is the only open-source system with enterprise-grade quality gates (10 gates, 3-reviewer blind review, anti-sycophancy checks) that runs autonomously on any AI provider. Self-hosted, no vendor lock-in."
+"Loki Mode is the only open-source system with enterprise-grade quality gates (8 gates, 3-reviewer blind review, anti-sycophancy checks) that runs autonomously on any AI provider. Self-hosted, no vendor lock-in."
 **For startups and solo developers:**
 "Go from idea to deployed product overnight. Write a PRD, invoke Loki Mode, and let it build, test, and deploy while you sleep. Works with your existing Claude or OpenAI API key."
@@ -557,7 +557,7 @@ The most significant near-term competitive threat is Anthropic's Agent SDK (http
 **However, Loki Mode's structural advantages remain:**
 1. **Multi-provider:** Agent SDK is Claude-only. Loki Mode works with any provider.
-2. **Battle-tested pipeline:** 10 quality gates, completion council, healing, memory -- these took months to build and validate. A new Agent SDK project starts from zero.
+2. **Battle-tested pipeline:** 8 quality gates, completion council, healing, memory -- these took months to build and validate. A new Agent SDK project starts from zero.
 3. **Open source and self-hosted:** No dependency on Anthropic's platform decisions.
 4. **Research foundation:** Built on patterns from OpenAI, DeepMind, Anthropic, and academic research. Not just engineering, but applied AI safety research (Constitutional AI, anti-sycophancy, alignment faking detection).

package/docs/competitive/replit-lovable-analysis.md CHANGED Viewed

@@ -316,7 +316,7 @@ Replit Agent has evolved rapidly through four major versions:
 | Self-testing loop | Yes | No | Yes (RARV cycle) |
 | Code review | No | No | Yes (3-reviewer blind review) |
 | Anti-sycophancy | No | No | Yes (devil's advocate) |
-| Quality gates | No | Security scan only | 10 gates |
+| Quality gates | No | Security scan only | 8 gates |
 | Memory system | No | Project knowledge | Episodic/semantic/procedural |
 | Model selection | Platform-chosen | Platform-chosen | Task-aware (Opus/Sonnet/Haiku) |
 | Multi-provider support | No (Replit only) | No (Lovable only) | Yes (Claude/Codex/Gemini/Cline/Aider) |
@@ -393,7 +393,7 @@ Loki Mode operates as a true autonomous engineering system, not a prompt-respons
 ### 2. Quality Assurance
-Loki Mode's 10-gate quality system (static analysis, 3-reviewer blind review, anti-sycophancy, severity-based blocking, test coverage, backward compatibility) has no equivalent in either platform. Replit and Lovable have zero code review, zero anti-sycophancy, and minimal quality gates. This is Loki Mode's strongest differentiator.
+Loki Mode's 8-gate quality system (static analysis, test suite (pass/fail), blind 3-reviewer review with severity blocking, anti-sycophancy Devil's Advocate, mock-integrity, test-mutation, documentation coverage, Magic Modules debate; backward-compatibility is a conditional healing-mode auditor) has no equivalent in either platform. Replit and Lovable have zero code review, zero anti-sycophancy, and minimal quality gates. This is Loki Mode's strongest differentiator.
 ### 3. Multi-Provider and Multi-Model Intelligence
@@ -531,7 +531,7 @@ Replit now supports React Native/Expo with full backend and RevenueCat monetizat
 When positioning Loki Mode against Replit and Lovable, emphasize:
 1. **"No credit anxiety"** -- You pay your API provider directly. No surprise bills. No credits burned on AI mistakes.
-2. **"Production quality, not prototype quality"** -- 10 quality gates, 3-reviewer blind review, anti-sycophancy. Your code ships to production, not to a rewrite backlog.
+2. **"Production quality, not prototype quality"** -- 8 quality gates, 3-reviewer blind review, anti-sycophancy. Your code ships to production, not to a rewrite backlog.
 3. **"No lock-in"** -- Your code. Your infrastructure. Your choice of AI provider. Export is not a feature -- it is the default.
 4. **"Autonomous, not assistive"** -- Loki Mode does not wait for your next prompt. It plans, builds, tests, reviews, and deploys. You review the output, not babysit the process.
 5. **"Works with your codebase"** -- Legacy systems, brownfield projects, enterprise code. Not just greenfield MVPs.

package/docs/cursor-comparison.md CHANGED Viewed

@@ -11,7 +11,7 @@
 |-----------|--------|-----------|--------|
 | **Proven Scale** | 1M+ LoC, large agent count | Benchmarks only | Cursor |
 | **Research Foundation** | Empirical iteration | 25+ academic citations | Loki Mode |
-| **Quality Assurance** | Workers self-manage | 11-gate system + anti-sycophancy | Loki Mode |
+| **Quality Assurance** | Workers self-manage | 8-gate system + anti-sycophancy | Loki Mode |
 | **Anti-Sycophancy** | Not mentioned | CONSENSAGENT blind review | Loki Mode |
 | **Velocity-Quality Balance** | Not mentioned | arXiv-backed metrics | Loki Mode |
 | **Full SDLC Coverage** | Code generation focus | Spec (PRD/issue/YAML) to production + growth | Loki Mode |
@@ -57,7 +57,7 @@ velocity_quality_balance:
   thresholds:
     max_new_warnings: 0  # Zero tolerance
-    min_coverage: 80%
+    coverage_target: 80%  # Target only; coverage % not measured this release
 ```
 **Research Basis:** [arXiv 2511.04427v2](https://arxiv.org/abs/2511.04427) - Empirical study of 807 repositories
@@ -66,16 +66,19 @@ velocity_quality_balance:
 ---
-### 3. 11-Gate Quality System
+### 3. 8-Gate Quality System
 **Loki Mode's Gates:**
-1. Input Guardrails - Validate scope, detect injection (OpenAI SDK pattern)
-2. Static Analysis - CodeQL, ESLint, type checking
-3. Blind Review System - 3 parallel reviewers
-4. Anti-Sycophancy Check - Devil's advocate on unanimous approval
-5. Output Guardrails - Code quality, spec compliance, no secrets
-6. Severity-Based Blocking - Critical/High/Medium = BLOCK
-7. Test Coverage Gates - 100% pass, >80% coverage
+1. Static Analysis - CodeQL, ESLint, type checking
+2. Test Suite (pass/fail) - red blocks; coverage % not measured this release
+3. Blind Code Review - 3 parallel reviewers + severity blocking (Critical/High = BLOCK; Medium/Low advisory)
+4. Anti-Sycophancy / Devil's Advocate - on unanimous PASS
+5. Mock Integrity Detector - HIGH blocks
+6. Test Mutation Detector - HIGH blocks
+7. Documentation Coverage
+8. Magic Modules Debate - BLOCK severity
+Conditional auditor (not numbered): backward-compatibility / legacy-healing-auditor (healing mode only).
 **Cursor:** Removed dedicated quality roles. Quote: "Dedicated integrator roles created more bottlenecks than they solved."
@@ -174,7 +177,7 @@ Cursor learned through failure:
 ### 3. Simplicity Principle
 > "A surprising amount of the system's behavior comes down to how we prompt the agents. The harness and models matter, but the prompts matter more."
-**Loki Mode:** More elaborate infrastructure (11 gates, 41 agent types, memory systems). May be over-engineered for some use cases.
+**Loki Mode:** More elaborate infrastructure (8 gates, 41 agent types, memory systems). May be over-engineered for some use cases.
 ---
@@ -192,7 +195,7 @@ We incorporated Cursor's proven patterns:
 ## Conclusion
 **Loki Mode is scientifically better in:**
-- Quality assurance (research-backed 11-gate system)
+- Quality assurance (research-backed 8-gate system)
 - Anti-sycophancy (CONSENSAGENT blind review)
 - Velocity-quality balance (arXiv metrics)
 - Full SDLC coverage (spec to growth -- PRD, GitHub issue, or YAML)

package/docs/dashboard-guide.md CHANGED Viewed

@@ -151,13 +151,15 @@ Progress bars for three memory types:
 Shows count and visual progress bar for each.
 #### Quality Gates
-6 quality gates with status icons:
-- **Static Analysis**: CodeQL/ESLint checks
-- **3-Reviewer**: Parallel blind review system
-- **Anti-Sycophancy**: Devil's advocate validation
-- **Test Coverage**: Unit test requirements
-- **Security Scan**: OWASP vulnerability check
-- **Performance**: Performance regression tests
+8 quality gates with status icons:
+- **Static Analysis**: CodeQL/ESLint/type-checker findings on the diff
+- **Test Suite**: Project test runner pass/fail (red blocks)
+- **Blind Code Review**: 3-reviewer council with severity blocking (Critical/High block, Medium/Low advisory)
+- **Anti-Sycophancy**: Devil's Advocate re-review on unanimous PASS
+- **Mock Integrity**: Tautological-assertion and mock-ratio detection
+- **Test Mutation**: Assertion-churn (test-fitting) detection
+- **Documentation Coverage**: README presence, docs freshness, API docs
+- **Magic Modules Debate**: Spec-vs-implementation debate on generated modules
 Status icons:
 - Checkmark (green): Passed

package/docs/prd-purple-lab-platform-v2.md CHANGED Viewed

@@ -86,7 +86,7 @@ Three review loops identified these critical gaps in v1 of this PRD:
 Tabbed panel below the editor (collapsible, resizable height):
 - **Build Log:** Real-time loki output (WebSocket, already works on HomePage)
 - **Agents:** Active agent cards with status (already built as AgentDashboard)
-- **Quality Gates:** 9-gate display (already built as QualityGatesPanel)
+- **Quality Gates:** 8-gate display (already built as QualityGatesPanel)
 - **AI Chat:** NEW -- text input that sends prompts to iterate on the project
 **2b. AI Chat (key differentiator)**

package/docs/prd-purple-lab-platform.md CHANGED Viewed

@@ -97,7 +97,7 @@ Loki Mode already has 75+ commands and 120+ API endpoints. The platform doesn't
 - **Terminal tab:** Real-time loki session output (already streamed via WebSocket)
 - **Agent Log tab:** Shows which agents are active, what they're working on (uses `/api/agents` and `/api/session/agents`)
 - **Build Output tab:** Structured build phases -- RARV cycle visualization, iteration count, current phase
-- **Quality Gates tab:** 9-gate status display (uses existing checklist/quality endpoints)
+- **Quality Gates tab:** 8-gate status display (uses existing checklist/quality endpoints)
 - **AI Chat tab:** Send messages to iterate on the project ("fix the login page", "add dark mode") -- triggers `loki start` with the prompt as PRD amendment
 **Header toolbar:**
@@ -182,12 +182,12 @@ This is the "Loki way" -- instead of just editing code, users can talk to the AI
 ### Quality Gates Panel
-Shows the 9 Loki quality gates in real-time:
+Shows the Loki quality gates and checks in real-time:
 1. Static Analysis (CodeQL/ESLint)
 2. 3-Reviewer Blind Review
 3. Anti-Sycophancy Check
 4. Severity Blocking (Critical/High)
-5. Test Coverage (>80%)
+5. Test Suite (pass/fail; coverage % not measured this release)
 6. Security Scan (OWASP)
 7. Performance Check
 8. Mock Detector

package/docs/show-hn-post.md CHANGED Viewed

@@ -2,7 +2,7 @@
 ## Title
-Show HN: Loki Mode - PRD in, tested code out (41 agent roles, 9 quality gates, RARV self-verification)
+Show HN: Loki Mode - PRD in, tested code out (41 agent roles, 8 quality gates, RARV self-verification)
 ## Body
@@ -40,7 +40,7 @@ Integrations: Jira, Slack, Teams, GitHub Actions.
 ## Feedback wanted
-- Is the 9-gate quality system overkill, or does it actually help for your use cases?
+- Is the 8-gate quality system overkill, or does it actually help for your use cases?
 - How do you handle the tension between autonomous agent speed and code review thoroughness?
 - What PRD complexity level breaks this approach? I have hit walls with highly coupled distributed systems.

package/loki-ts/dist/loki.js CHANGED Viewed

@@ -1,5 +1,5 @@
 // @bun
-var n6=Object.defineProperty;var a6=($)=>$;function s6($,Q){this[$]=a6.bind(null,Q)}var h=($,Q)=>{for(var Z in Q)n6($,Z,{get:Q[Z],enumerable:!0,configurable:!0,set:s6.bind(Q,Z)})};var L=($,Q)=>()=>($&&(Q=$($=0)),Q);var K$=import.meta.require;var S1={};h(S1,{lokiDir:()=>P,homeLokiDir:()=>o$,findRepoRootForVersion:()=>d$,REPO_ROOT:()=>m});import{resolve as n,dirname as l$}from"path";import{fileURLToPath as t6}from"url";import{existsSync as P$}from"fs";import{homedir as r6}from"os";function i6(){let $=N1;for(let Q=0;Q<6;Q++){if(P$(n($,"VERSION"))&&P$(n($,"autonomy/run.sh")))return $;let Z=l$($);if(Z===$)break;$=Z}return n(N1,"..","..","..")}function d$($){let Q=$;for(let Z=0;Z<6;Z++){if(P$(n(Q,"VERSION"))&&P$(n(Q,"autonomy/run.sh")))return Q;let z=l$(Q);if(z===Q)break;Q=z}return n($,"..","..","..")}function P(){return process.env.LOKI_DIR??n(process.cwd(),".loki")}function o$(){return n(r6(),".loki")}var N1,m;var C=L(()=>{N1=l$(t6(import.meta.url));m=i6()});import{readFileSync as e6}from"fs";import{resolve as $Q,dirname as QQ}from"path";import{fileURLToPath as ZQ}from"url";function F$(){if($$!==null)return $$;let $="7.45.0";if(typeof $==="string"&&$.length>0)return $$=$,$$;try{let Q=QQ(ZQ(import.meta.url)),Z=d$(Q);$$=e6($Q(Z,"VERSION"),"utf-8").trim()}catch{$$="unknown"}return $$}var $$=null;var n$=L(()=>{C()});var C1={};h(C1,{runOrThrow:()=>zQ,run:()=>j,commandVersion:()=>KQ,commandExists:()=>f,ShellError:()=>a$});async function j($,Q={}){let Z=Bun.spawn({cmd:[...$],stdout:"pipe",stderr:"pipe",env:Q.env?{...process.env,...Q.env}:process.env,cwd:Q.cwd}),z,X;if(Q.timeoutMs&&Q.timeoutMs>0)z=setTimeout(()=>{try{Z.kill("SIGTERM")}catch{}X=setTimeout(()=>{try{Z.kill("SIGKILL")}catch{}},2000)},Q.timeoutMs);try{let[W,K,U]=await Promise.all([new Response(Z.stdout).text(),new Response(Z.stderr).text(),Z.exited]);return{stdout:W,stderr:K,exitCode:U}}finally{if(z)clearTimeout(z);if(X)clearTimeout(X)}}async function zQ($,Q={}){let Z=await j($,Q);if(Z.exitCode!==0)throw new a$(`command failed (${Z.exitCode}): ${$.join(" ")}`,Z.exitCode,Z.stdout,Z.stderr);return Z}async function f($){let Q=XQ($),Z=await j(["sh","-c",`command -v ${Q}`],{timeoutMs:5000});if(Z.exitCode===0)return Z.stdout.trim()||null;return null}function XQ($){if(!/^[A-Za-z0-9._/-]+$/.test($))throw Error(`refused to shell-escape suspect token: ${$}`);return $}async function KQ($,Q="--version"){if(!await f($))return null;let z=await j([$,Q],{timeoutMs:5000});if(z.exitCode!==0)return null;return((z.stdout||z.stderr).split(/\r?\n/)[0]?.trim()??"")||null}var a$;var d=L(()=>{a$=class a$ extends Error{message;exitCode;stdout;stderr;constructor($,Q,Z,z){super($);this.message=$;this.exitCode=Q;this.stdout=Z;this.stderr=z;this.name="ShellError"}}});function a($){return WQ?"":$}var WQ,T,S,I,TZ,w,R,y,q;var c=L(()=>{WQ=(process.env.NO_COLOR??"").length>0;T=a("\x1B[0;31m"),S=a("\x1B[0;32m"),I=a("\x1B[1;33m"),TZ=a("\x1B[0;34m"),w=a("\x1B[0;36m"),R=a("\x1B[1m"),y=a("\x1B[2m"),q=a("\x1B[0m")});import{existsSync as TQ}from"fs";async function Q$(){if(B$!==void 0)return B$;let $="/opt/homebrew/bin/python3.12";if(TQ($))return B$=$,$;let Q=await f("python3.12");if(Q)return B$=Q,Q;let Z=await f("python3");return B$=Z,Z}async function Z$($,Q={}){let Z=await Q$();if(!Z)return{stdout:"",stderr:"python3 not found",exitCode:127};return j([Z,"-c",$],Q)}var B$;var W$=L(()=>{d()});var t1={};h(t1,{runStatus:()=>gQ});import{existsSync as v,readFileSync as U$,readdirSync as l1,statSync as d1}from"fs";import{resolve as D,basename as xQ}from"path";import{homedir as NQ}from"os";async function DQ(){if(await f("jq"))return!0;return process.stdout.write(`${T}Error: jq is required but not installed.${q}
+var n6=Object.defineProperty;var a6=($)=>$;function s6($,Q){this[$]=a6.bind(null,Q)}var h=($,Q)=>{for(var Z in Q)n6($,Z,{get:Q[Z],enumerable:!0,configurable:!0,set:s6.bind(Q,Z)})};var L=($,Q)=>()=>($&&(Q=$($=0)),Q);var K$=import.meta.require;var S1={};h(S1,{lokiDir:()=>P,homeLokiDir:()=>o$,findRepoRootForVersion:()=>d$,REPO_ROOT:()=>m});import{resolve as n,dirname as l$}from"path";import{fileURLToPath as t6}from"url";import{existsSync as P$}from"fs";import{homedir as r6}from"os";function i6(){let $=N1;for(let Q=0;Q<6;Q++){if(P$(n($,"VERSION"))&&P$(n($,"autonomy/run.sh")))return $;let Z=l$($);if(Z===$)break;$=Z}return n(N1,"..","..","..")}function d$($){let Q=$;for(let Z=0;Z<6;Z++){if(P$(n(Q,"VERSION"))&&P$(n(Q,"autonomy/run.sh")))return Q;let z=l$(Q);if(z===Q)break;Q=z}return n($,"..","..","..")}function P(){return process.env.LOKI_DIR??n(process.cwd(),".loki")}function o$(){return n(r6(),".loki")}var N1,m;var C=L(()=>{N1=l$(t6(import.meta.url));m=i6()});import{readFileSync as e6}from"fs";import{resolve as $Q,dirname as QQ}from"path";import{fileURLToPath as ZQ}from"url";function F$(){if($$!==null)return $$;let $="7.46.0";if(typeof $==="string"&&$.length>0)return $$=$,$$;try{let Q=QQ(ZQ(import.meta.url)),Z=d$(Q);$$=e6($Q(Z,"VERSION"),"utf-8").trim()}catch{$$="unknown"}return $$}var $$=null;var n$=L(()=>{C()});var C1={};h(C1,{runOrThrow:()=>zQ,run:()=>j,commandVersion:()=>KQ,commandExists:()=>f,ShellError:()=>a$});async function j($,Q={}){let Z=Bun.spawn({cmd:[...$],stdout:"pipe",stderr:"pipe",env:Q.env?{...process.env,...Q.env}:process.env,cwd:Q.cwd}),z,X;if(Q.timeoutMs&&Q.timeoutMs>0)z=setTimeout(()=>{try{Z.kill("SIGTERM")}catch{}X=setTimeout(()=>{try{Z.kill("SIGKILL")}catch{}},2000)},Q.timeoutMs);try{let[W,K,U]=await Promise.all([new Response(Z.stdout).text(),new Response(Z.stderr).text(),Z.exited]);return{stdout:W,stderr:K,exitCode:U}}finally{if(z)clearTimeout(z);if(X)clearTimeout(X)}}async function zQ($,Q={}){let Z=await j($,Q);if(Z.exitCode!==0)throw new a$(`command failed (${Z.exitCode}): ${$.join(" ")}`,Z.exitCode,Z.stdout,Z.stderr);return Z}async function f($){let Q=XQ($),Z=await j(["sh","-c",`command -v ${Q}`],{timeoutMs:5000});if(Z.exitCode===0)return Z.stdout.trim()||null;return null}function XQ($){if(!/^[A-Za-z0-9._/-]+$/.test($))throw Error(`refused to shell-escape suspect token: ${$}`);return $}async function KQ($,Q="--version"){if(!await f($))return null;let z=await j([$,Q],{timeoutMs:5000});if(z.exitCode!==0)return null;return((z.stdout||z.stderr).split(/\r?\n/)[0]?.trim()??"")||null}var a$;var d=L(()=>{a$=class a$ extends Error{message;exitCode;stdout;stderr;constructor($,Q,Z,z){super($);this.message=$;this.exitCode=Q;this.stdout=Z;this.stderr=z;this.name="ShellError"}}});function a($){return WQ?"":$}var WQ,T,S,I,TZ,w,R,y,q;var c=L(()=>{WQ=(process.env.NO_COLOR??"").length>0;T=a("\x1B[0;31m"),S=a("\x1B[0;32m"),I=a("\x1B[1;33m"),TZ=a("\x1B[0;34m"),w=a("\x1B[0;36m"),R=a("\x1B[1m"),y=a("\x1B[2m"),q=a("\x1B[0m")});import{existsSync as TQ}from"fs";async function Q$(){if(B$!==void 0)return B$;let $="/opt/homebrew/bin/python3.12";if(TQ($))return B$=$,$;let Q=await f("python3.12");if(Q)return B$=Q,Q;let Z=await f("python3");return B$=Z,Z}async function Z$($,Q={}){let Z=await Q$();if(!Z)return{stdout:"",stderr:"python3 not found",exitCode:127};return j([Z,"-c",$],Q)}var B$;var W$=L(()=>{d()});var t1={};h(t1,{runStatus:()=>gQ});import{existsSync as v,readFileSync as U$,readdirSync as l1,statSync as d1}from"fs";import{resolve as D,basename as xQ}from"path";import{homedir as NQ}from"os";async function DQ(){if(await f("jq"))return!0;return process.stdout.write(`${T}Error: jq is required but not installed.${q}
 `),process.stdout.write(`Install with:
 `),process.stdout.write(`  brew install jq    (macOS)
 `),process.stdout.write(`  apt install jq     (Debian/Ubuntu)
@@ -789,4 +789,4 @@ Set LOKI_LEGACY_BASH=1 to force the bash CLI for every command.
 `),2}default:return process.stderr.write(`Unknown command: ${Q}
 `),process.stderr.write(o6),2}}p1();process.on("SIGINT",()=>process.exit(130));process.on("SIGTERM",()=>process.exit(143));var ZZ=await QZ(Bun.argv.slice(2));process.exit(ZZ);
-//# debugId=7FCBCE9F1C748AE964756E2164756E21
+//# debugId=7B01911F8947B6CD64756E2164756E21

package/mcp/__init__.py CHANGED Viewed

@@ -57,4 +57,4 @@ try:
 except ImportError:
     __all__ = ['mcp']
-__version__ = '7.45.0'
+__version__ = '7.46.0'

package/package.json CHANGED Viewed

@@ -1,8 +1,8 @@
 {
   "name": "loki-mode",
   "mcpName": "io.github.asklokesh/loki-mode",
-  "version": "7.45.0",
-  "description": "Loki Mode by Autonomi. Autonomous spec-to-product system: takes a PRD, GitHub issue, OpenAPI/JSON/YAML, or one-line brief to a deployed app via the RARV-C closure loop with 11 quality gates. Provider-agnostic (Claude Code, OpenAI Codex, Cline, Aider).",
+  "version": "7.46.0",
+  "description": "Loki Mode by Autonomi. Autonomous spec-to-product system: takes a PRD, GitHub issue, OpenAPI/JSON/YAML, or one-line brief to a deployed app via the RARV-C closure loop with 8 quality gates. Provider-agnostic (Claude Code, OpenAI Codex, Cline, Aider).",
   "keywords": [
     "agent",
     "agent-orchestration",

package/plugins/loki-mode/.claude-plugin/plugin.json CHANGED Viewed

@@ -2,8 +2,8 @@
   "$schema": "https://json.schemastore.org/claude-code-plugin-manifest.json",
   "name": "loki-mode",
   "displayName": "Loki Mode",
-  "version": "7.45.0",
-  "description": "Autonomous spec-to-product build system with a built-in trust layer (RARV-C closure loop, 11 quality gates, completion council). Ships Loki's spec-hardening, drift-detection, and deterministic PR verification commands plus the Loki MCP server.",
+  "version": "7.46.0",
+  "description": "Autonomous spec-to-product build system with a built-in trust layer (RARV-C closure loop, 8 quality gates, completion council). Ships Loki's spec-hardening, drift-detection, and deterministic PR verification commands plus the Loki MCP server.",
   "author": {
     "name": "Autonomi",
     "url": "https://www.autonomi.dev/"

package/plugins/loki-mode/README.md CHANGED Viewed

@@ -1,7 +1,7 @@
 # Loki Mode plugin for Claude Code
 Loki Mode is the autonomous spec-to-product build system with a built-in trust
-layer (RARV-C closure loop, 11 quality gates, completion council). This plugin
+layer (RARV-C closure loop, 8 quality gates, completion council). This plugin
 brings Loki's spec-hardening, drift-detection, and deterministic PR verification
 into Claude Code as slash commands, and wires up the Loki MCP server.

package/references/magic-rarv-integration.md CHANGED Viewed

@@ -84,4 +84,4 @@ PRD says: "Add a login form with email, password, and submit button."
 - `skills/magic-modules.md` -- skill module for agents
 - `references/magic-modules-patterns.md` -- full API and pattern reference
 - `references/memory-system.md` -- memory engine details
-- `skills/quality-gates.md` -- all 12 gates documented
+- `skills/quality-gates.md` -- all 8 deterministic quality gates documented

package/references/quality-control.md CHANGED Viewed

@@ -240,15 +240,15 @@ This diversity prevents groupthink and catches more issues.
 |----------|--------|-----------|
 | **Critical** | BLOCK - Fix immediately | NO |
 | **High** | BLOCK - Fix immediately | NO |
-| **Medium** | BLOCK - Fix before proceeding | NO |
+| **Medium** | Advisory - Add `// TODO(review): ...` comment | YES |
 | **Low** | Add `// TODO(review): ...` comment | YES |
 | **Cosmetic** | Add `// FIXME(nitpick): ...` comment | YES |
-**Critical/High/Medium = BLOCK and fix before proceeding**
-**Low/Cosmetic = Add TODO/FIXME comment, continue**
+**Critical/High = BLOCK and fix before proceeding**
+**Medium/Low/Cosmetic = Add TODO/FIXME comment, continue (advisory)**
 ### 4. Test Coverage Gates
-- Unit tests: 100% pass, >80% coverage
+- Unit tests: 100% pass (coverage % not measured in this release)
 - Integration tests: 100% pass
 - E2E tests: critical flows pass
@@ -445,7 +445,7 @@ Quality gates are enforced by `autonomy/CONSTITUTION.md`:
 **Pre-Commit (BLOCKING):**
 - Linting (auto-fix enabled)
 - Type checking (strict mode)
-- Contract tests (80% coverage minimum)
+- Contract tests (coverage % not enforced as a gate)
 - Spec validation (Spectral)
 **Post-Implementation (AUTO-FIX):**

package/references/sdlc-phases.md CHANGED Viewed

@@ -233,8 +233,7 @@ npm run test:unit
 # or
 pytest tests/unit/
 ```
-- Coverage: >80% required
-- All tests must pass
+- All tests must pass (coverage % not measured in this release)
 **INTEGRATION Phase:**
 ```bash

package/skills/00-index.md CHANGED Viewed

@@ -48,7 +48,7 @@
 ### quality-gates.md
 **When:** Code review, pre-commit checks, quality assurance
-- 11-gate quality system (Gate 10: backward compatibility for healing; Gate 11: documentation coverage, v6.75.0)
+- 8-gate quality system (gate 7: documentation coverage, v6.75.0; backward compatibility is a conditional healing-mode auditor, not numbered)
 - Blind review + anti-sycophancy
 - Velocity-quality feedback loop (arXiv research)
 - Mandatory quality checks per task