npm - loki-mode - Versions diffs - 6.71.1 → 6.72.0 - Mend

loki-mode 6.71.1 → 6.72.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (91) hide show

package/README.md +9 -1
package/SKILL.md +2 -2
package/VERSION +1 -1
package/autonomy/hooks/migration-hooks.sh +26 -0
package/autonomy/loki +429 -92
package/autonomy/run.sh +219 -38
package/dashboard/__init__.py +1 -1
package/dashboard/server.py +101 -19
package/docs/INSTALLATION.md +20 -11
package/docs/bug-fixes/agent-01-cli-fixes.md +101 -0
package/docs/bug-fixes/agent-02-purplelab-fixes.md +88 -0
package/docs/bug-fixes/agent-03-dashboard-fixes.md +119 -0
package/docs/bug-fixes/agent-04-memory-fixes.md +105 -0
package/docs/bug-fixes/agent-05-provider-fixes.md +86 -0
package/docs/bug-fixes/agent-06-integration-fixes.md +101 -0
package/docs/bug-fixes/agent-07-dash-run-fixes.md +101 -0
package/docs/bug-fixes/agent-08-docker-fixes.md +164 -0
package/docs/bug-fixes/agent-09-e2e-build-fixes.md +69 -0
package/docs/bug-fixes/agent-10-e2e-fullstack-fixes.md +102 -0
package/docs/bug-fixes/agent-11-e2e-session-fixes.md +70 -0
package/docs/bug-fixes/agent-12-scenario-fixes.md +120 -0
package/docs/bug-fixes/agent-13-enterprise-fixes.md +143 -0
package/docs/bug-fixes/agent-14-uat-newuser-fixes.md +88 -0
package/docs/bug-fixes/agent-15-uat-poweruser-fixes.md +132 -0
package/docs/bug-fixes/agent-19-code-review.md +316 -0
package/docs/bug-fixes/agent-20-architecture-review.md +331 -0
package/docs/competitive/bolt-new-analysis.md +579 -0
package/docs/competitive/emergence-others-analysis.md +605 -0
package/docs/competitive/replit-lovable-analysis.md +622 -0
package/docs/test-scenarios/edge-cases.md +813 -0
package/docs/test-scenarios/enterprise-scenarios.md +732 -0
package/mcp/__init__.py +1 -1
package/mcp/server.py +49 -5
package/memory/consolidation.py +33 -0
package/memory/embeddings.py +10 -1
package/memory/engine.py +83 -38
package/memory/retrieval.py +36 -0
package/memory/storage.py +56 -4
package/memory/token_economics.py +14 -2
package/memory/vector_index.py +36 -7
package/package.json +1 -1
package/providers/gemini.sh +89 -2
package/templates/README.md +1 -1
package/templates/cli-tool.md +30 -0
package/templates/dashboard.md +4 -0
package/templates/data-pipeline.md +4 -0
package/templates/discord-bot.md +47 -0
package/templates/game.md +4 -0
package/templates/microservice.md +4 -0
package/templates/npm-library.md +4 -0
package/templates/rest-api-auth.md +50 -20
package/templates/rest-api.md +15 -0
package/templates/saas-starter.md +1 -1
package/templates/slack-bot.md +36 -0
package/templates/static-landing-page.md +9 -1
package/templates/web-scraper.md +4 -0
package/web-app/dist/assets/Badge-CeBkFjo6.js +1 -0
package/web-app/dist/assets/Button-yuhqo8Fq.js +1 -0
package/web-app/dist/assets/{Card-B1bV4syB.js → Card-BG17vsX0.js} +1 -1
package/web-app/dist/assets/{HomePage-CZTV6Nea.js → HomePage-BMSQ7Apj.js} +3 -3
package/web-app/dist/assets/{LoginPage-D4UdURJc.js → LoginPage-aH_6iolg.js} +1 -1
package/web-app/dist/assets/{NotFoundPage-CCLSeL6j.js → NotFoundPage-Di8cNtB1.js} +1 -1
package/web-app/dist/assets/ProjectPage-BtRssmw9.js +285 -0
package/web-app/dist/assets/ProjectsPage-B-FTFagc.js +6 -0
package/web-app/dist/assets/{SettingsPage-Xuv8EfAg.js → SettingsPage-DIJPBla4.js} +1 -1
package/web-app/dist/assets/TeamsPage--19fNX7w.js +36 -0
package/web-app/dist/assets/TemplatesPage-ChUQNOOv.js +11 -0
package/web-app/dist/assets/TerminalOutput-Dwrzecyl.js +31 -0
package/web-app/dist/assets/activity-BNRWeu9N.js +6 -0
package/web-app/dist/assets/{arrow-left-CaGtolHc.js → arrow-left-Ce6g1_YE.js} +1 -1
package/web-app/dist/assets/circle-alert-LIndawHL.js +11 -0
package/web-app/dist/assets/clock-Bpj4VPlP.js +6 -0
package/web-app/dist/assets/{external-link-CazyUyav.js → external-link-BhhdF0iQ.js} +1 -1
package/web-app/dist/assets/folder-open-CM2LgfxI.js +11 -0
package/web-app/dist/assets/index-8-KpWWq7.css +1 -0
package/web-app/dist/assets/index-kPDW4e_b.js +236 -0
package/web-app/dist/assets/lock-sAk3Xe54.js +16 -0
package/web-app/dist/assets/search-CR-2i9by.js +6 -0
package/web-app/dist/assets/server-DuFh4ymA.js +26 -0
package/web-app/dist/assets/trash-2-BmkkT8V_.js +11 -0
package/web-app/dist/index.html +2 -2
package/web-app/server.py +1321 -53
package/web-app/dist/assets/Badge-CBUx2PjL.js +0 -6
package/web-app/dist/assets/Button-DsRiznlh.js +0 -21
package/web-app/dist/assets/ProjectPage-D0w_X9tG.js +0 -237
package/web-app/dist/assets/ProjectsPage-ByYxDlKC.js +0 -16
package/web-app/dist/assets/TemplatesPage-BKWN07mc.js +0 -1
package/web-app/dist/assets/TerminalOutput-Dj98V8Z-.js +0 -51
package/web-app/dist/assets/clock-C_CDmobx.js +0 -11
package/web-app/dist/assets/index-D452pFGl.css +0 -1
package/web-app/dist/assets/index-Df4_kgLY.js +0 -196

package/docs/competitive/emergence-others-analysis.md ADDED Viewed

@@ -0,0 +1,605 @@
+# Competitive Analysis: Emergence AI, Rork, Claude Code CLI, Codex CLI (March 2026)
+**Agent 18 -- Loki Mode Competitive Intelligence**
+**Version:** 1.3 (3 feedback loops completed) | **Date:** 2026-03-24 | **Loki Mode:** v6.71.1
+---
+## Table of Contents
+1. [Executive Summary](#executive-summary)
+2. [Platform Deep Dives](#platform-deep-dives)
+   - [Emergence AI (Agent-E)](#1-emergence-ai-agent-e)
+   - [Rork](#2-rork)
+   - [Anthropic Claude Code CLI](#3-anthropic-claude-code-cli)
+   - [OpenAI Codex CLI](#4-openai-codex-cli)
+3. [Feature Comparison Matrix](#feature-comparison-matrix)
+4. [Market Map](#market-map)
+5. [Gaps and Opportunities](#gaps-and-opportunities)
+6. [Differentiation Strategy for Loki Mode](#differentiation-strategy-for-loki-mode)
+7. [Strategic Recommendations](#strategic-recommendations)
+8. [Key Risk: Claude Code Agent SDK](#key-risk-claude-code-agent-sdk)
+9. [Sources](#sources)
+---
+## Executive Summary
+The AI coding agent market has fragmented into four distinct segments: enterprise workflow automation (Emergence AI), no-code mobile app generation (Rork), developer-centric coding assistants (Claude Code CLI, Codex CLI), and autonomous PRD-to-deployment systems (Loki Mode). While Claude Code and Codex CLI dominate the developer coding assistant segment with strong model capabilities and broad distribution, none of the analyzed platforms deliver the full pipeline that Loki Mode targets -- taking a PRD through planning, implementation, testing, code review, and deployment with multi-provider flexibility and self-hosted operation.
+**Key finding:** The market has strong tools for individual coding tasks but lacks integrated autonomous systems that handle the complete software development lifecycle. This is Loki Mode's primary differentiation opportunity.
+---
+## Platform Deep Dives
+### 1. Emergence AI (Agent-E)
+**Website:** https://www.emergence.ai
+**GitHub:** https://github.com/EmergenceAI/Agent-E (~1,200 stars)
+**License:** Open source (Agent-E); proprietary (Emergence Platform)
+#### What It Does
+Emergence AI operates in two modes:
+**Agent-E (Open Source):** A web automation agent built on the AG2 framework (formerly AutoGen). It translates natural language commands into browser actions using a skills-based architecture. Core capabilities:
+- Web form automation (not PDF forms)
+- E-commerce navigation and product search
+- Content location across websites
+- Media interaction and playback control
+- JIRA task automation
+- Web search and information gathering
+**Emergence Platform (Enterprise):** An enterprise agentic platform with broader ambitions:
+- Goal-to-Agent Translation (ACA) -- automatically creates agents from business goals
+- Multi-agent orchestration engine with async execution, retries, and dependency management
+- Persistent shared memory and state across tasks
+- Skills and integrations with automatic connector code generation
+- Policy-aware governance with enterprise access controls
+- VPC and on-premises deployment options
+#### Architecture
+Agent-E uses a two-agent model:
+- **User Proxy Agent:** Executes skills on behalf of the user
+- **Browser Navigation Agent:** Contains all web interaction skills
+Skills are divided into:
+- **Sensing Skills:** `geturl`, `get_dom_with_content_type` -- understand webpage state
+- **Action Skills:** `click`, `enter text`, `open url` -- interact with the web
+The system uses configured skills that map to human browser interactions rather than allowing LLMs to write arbitrary code. This approach prioritizes safety and predictability. On the WebVoyager benchmark (643 tasks across 15 websites), Agent-E achieves a 73.2% task completion rate, significantly outperforming WILBUR (52.6%) and the multimodal Web Voyager Agent (57.1%).
+The Emergence Platform adds an orchestration layer (the "Emergence Orchestrator") that acts as an autonomous meta-agent, planning, executing, verifying, and iterating workflows by routing tasks to optimal AI models.
+#### Pricing
+- **Agent-E:** Free (open source, bring your own LLM API keys)
+- **Emergence Platform:** Custom enterprise pricing (annual contracts based on solution complexity, number of solutions, and usage volume). Typically starts in the thousands per month.
+#### Limitations
+- Agent-E is specifically a **web automation** tool, not a general-purpose coding agent
+- Local LLM support exists but is not thoroughly tested
+- Requires active LLM API connection for cloud models
+- Constrained by AG2 framework limitations
+- No code generation, refactoring, or software development capabilities in the open-source Agent-E
+- Enterprise platform pricing is opaque and requires sales engagement
+- Not designed for software development lifecycle management
+#### Target User
+Enterprise teams needing web workflow automation and data pipeline orchestration. Not developers building software.
+---
+### 2. Rork
+**Website:** https://rork.com
+**License:** Proprietary (closed-source platform; generated code is user-owned)
+#### What It Does
+Rork is an AI-powered no-code platform specifically for building mobile applications. Users describe app ideas in natural language, and Rork generates production-ready mobile apps using React Native and Expo.
+Core capabilities:
+- Natural language to mobile app generation (iOS + Android + Web)
+- Live preview via Expo Go with QR code scanning
+- Full code export (React Native source code)
+- App Store and Play Store deployment support
+- Over-the-air updates via Expo Application Services
+- **Rork Max** (Feb 2026): Native Swift code generation for the entire Apple ecosystem (iPhone, iPad, Apple Watch, Apple TV, Vision Pro, iMessage), unlocking AR/LiDAR, Metal graphics, Home Screen widgets, Dynamic Island, Live Activities, Siri Intents, HealthKit, and Core ML
+#### Architecture
+- **AI Engine:** Multi-provider setup using GeminiProvider and ClaudeProvider with system prompts for Expo SDK 54. A RorkAgent runs an 11-tool agentic loop.
+- **Frontend Infrastructure:** Built with Next.js, Zustand state management, Supabase backend
+- **Live Preview:** Serverless esbuild API compiles apps to React Native Web in a phone-sized iframe, updating in real-time
+- **Deployment:** Expo Application Services handles build, distribution, and OTA updates
+#### Pricing
+Credit-based subscription model (credits reset monthly, no rollover):
+| Plan | Price | Credits/Month | Key Features |
+|------|-------|--------------|--------------|
+| Free | $0 | 35 (5/day) | Basic prototyping |
+| Junior | $25/mo | ~250 | Idea validation, demos |
+| Senior | $100/mo | ~750 | Full MVP building and iteration |
+Each AI interaction consumes one credit. Complex or iterative builds can consume credits rapidly.
+#### Limitations
+- **Stability issues:** Users report crashes, black screens, persistent errors, slow loading. Trustpilot average: 2.9/5
+- **Publishing difficulties:** Many users report the "Publish" button failing, apps stuck in deployment limbo
+- **Iteration fragility:** Generated React Native codebases accumulate state and navigation logic; each change risks breaking upstream code
+- **Credit burn:** Fixing small UI issues consumes credits at the same rate as major features
+- **Customer support:** Described as nearly non-existent by multiple reviewers
+- **Mobile-only scope:** Cannot build web apps, backends, APIs, CLI tools, or anything outside mobile
+- **No SDLC integration:** No code review, no testing framework, no CI/CD pipeline
+#### Target User
+Non-technical entrepreneurs and product managers who want to prototype or build mobile apps without coding. Not developer-centric.
+---
+### 3. Anthropic Claude Code CLI
+**Website:** https://code.claude.com
+**GitHub:** https://github.com/anthropics/claude-code
+**License:** Proprietary (source-available, 51 contributors)
+#### What It Does
+Claude Code is Anthropic's agentic coding tool -- the most comprehensive developer coding assistant in the market as of March 2026. It operates across terminal, VS Code, JetBrains, a desktop app, and the web.
+Core capabilities:
+- Codebase understanding and multi-file editing
+- Command execution and shell integration
+- Git workflows (commits, branches, PRs)
+- Multi-agent teams with coordinated sub-agents in isolated git worktrees
+- Code review (standalone feature launched March 9, 2026)
+- MCP (Model Context Protocol) for external tool integration
+- CLAUDE.md project memory files
+- Auto-memory that persists learnings across sessions
+- Custom skills/commands (e.g., `/review-pr`, `/deploy-staging`)
+- Hooks for pre/post action automation
+- Scheduled/recurring tasks (cloud and local) via `/loop` and `/schedule`
+- Third-party provider support (Amazon Bedrock, Google Vertex AI, Microsoft Foundry) -- still Claude models only
+- GitHub Actions and GitLab CI/CD integration
+- Slack integration (bug reports to PRs)
+- Chrome extension for debugging live web apps
+- Remote Control (phone/browser access to sessions)
+- Agent SDK for building custom agents
+#### Architecture
+- **Runtime:** Built with Bun for compilation, CommanderJS for CLI structure, React Ink for terminal rendering (ANSI escape codes)
+- **Models:** Claude Opus 4.5 (80.9% SWE-bench), Claude Sonnet 4.5, Claude Haiku 4.5
+- **Context:** 200K token context window
+- **Multi-agent:** Coordinated sub-agents with shared task lists, dependency tracking, and direct agent-to-agent communication via git worktrees
+- **Optimization:** 92% prefix caching reuse rate for cost reduction
+- **Token usage:** Consumes 3-4x more tokens than Codex CLI for equivalent tasks but produces more thorough output
+#### Pricing
+| Plan | Price | Usage Limit | Notes |
+|------|-------|-------------|-------|
+| Free | $0 | Basic | Limited |
+| Pro | $20/mo | 5x Free | 5-hour rolling window + 7-day cap |
+| Max 5x | $100/mo | 5x Pro | For moderate power users |
+| Max 20x | $200/mo | 20x Pro | For heavy usage |
+| API (BYOK) | Pay-per-token | Unlimited | Haiku: $1/$5, Sonnet: $3/$15, Opus: $5/$25 per 1M tokens |
+Prompt caching provides 90% input cost reduction. Batch API offers 50% discount for async processing.
+#### Limitations
+- **Rate limits are the primary complaint:** Users report hitting caps within 10-15 minutes of heavy Opus usage. Rolling windows (5-hour and 7-day) create unpredictable availability.
+- **Cost at scale:** Heavy users can face $200+/month or unpredictable API costs with BYOK
+- **Proprietary:** Not fully open source (source-available but not Apache-2.0)
+- **Token-hungry:** 3-4x more tokens than Codex for equivalent tasks
+- **Model family lock-in:** Supports third-party hosting (Amazon Bedrock, Google Vertex AI, Microsoft Foundry) but only runs Claude models -- no GPT, Gemini, or open-source model support
+- **No PRD-to-deployment pipeline:** Excels at individual coding tasks but lacks orchestrated SDLC workflow
+- **Semi-autonomous at best:** `/loop` and `/schedule` provide repeating task execution, but lack PRD parsing, SDLC phasing, completion council, or quality gate orchestration. These are task-level automation, not end-to-end SDLC autonomy.
+- **OAuth token controversy:** Anthropic shut down OAuth token extraction in January 2026, frustrating power users
+#### Target User
+Professional developers and engineering teams who want an AI pair programmer for daily coding work. The most popular choice for complex codebase work.
+---
+### 4. OpenAI Codex CLI
+**Website:** https://developers.openai.com/codex/cli
+**GitHub:** https://github.com/openai/codex (Apache-2.0, 365 contributors)
+**License:** Apache-2.0 (fully open source)
+#### What It Does
+Codex CLI is OpenAI's open-source coding agent that runs locally in the terminal. Built in Rust for performance, it provides an interactive coding experience with strong safety controls.
+Core capabilities:
+- Interactive terminal UI with real-time action review
+- Non-interactive `exec` mode for scripting and automation
+- Conversation resumption (persistent sessions)
+- Web search (enabled by default, cached results)
+- Code review (dedicated `/review` preset)
+- MCP support for third-party tool integration
+- Image input (screenshots, design specs)
+- Multi-agent support (experimental, isolated git worktrees)
+- Codex Cloud for remote task execution
+- GitHub Action for CI/CD integration
+- Shell completions (bash, zsh, fish)
+- Slash command framework for custom workflows
+#### Architecture
+- **Runtime:** Built in Rust for speed and efficiency
+- **Models:** GPT-5.4 (default), GPT-5.3-Codex, codex-mini
+- **Multi-agent:** Isolated cloud sandboxes per task, independent threads, no inter-agent messaging
+- **Speed:** 240+ tokens/second (2.5x faster than Opus)
+- **Token efficiency:** 3-4x fewer tokens than Claude Code for equivalent tasks
+- **Safety:** Three approval modes -- Auto (default), Read-only, Full Access
+- **Benchmarks:** SWE-bench Pro 56.8%, Terminal-Bench 2.0 77.3%
+#### Pricing
+| Access Method | Price | Notes |
+|--------------|-------|-------|
+| ChatGPT Go | $8/mo | Budget option, basic Codex access |
+| ChatGPT Plus | $20/mo | 30-150 messages per 5-hour window |
+| ChatGPT Pro | $200/mo | Highest limits |
+| API (BYOK) | $1.50/$6 per 1M tokens | codex-mini-latest, 75% cache discount |
+| Open Source (free) | $0 + API costs | Self-host, bring your own keys |
+Open-source maintainers (1,000+ star projects) qualify for 6 months of free ChatGPT Pro access.
+#### Limitations
+- **Model lock-in:** Only works with OpenAI models (GPT family)
+- **Multi-agent is experimental:** No inter-agent communication, isolated sandboxes only
+- **Windows support is experimental:** WSL recommended
+- **No autonomous SDLC pipeline:** Like Claude Code, it handles individual tasks, not end-to-end workflows
+- **MCP overhead:** Each added MCP tool increases context consumption and reduces message limits
+- **Less thorough than Claude Code:** Faster but produces less complete solutions on complex problems
+- **No built-in code review quality gates:** Single-pass review only
+- **No memory system beyond session resumption:** No episodic/semantic memory, no learning across projects
+#### Target User
+Developers who value open-source tooling, speed, and terminal-native workflows. Particularly strong for rapid prototyping and high-volume edits.
+---
+## Feature Comparison Matrix
+| Feature | Emergence AI (Agent-E) | Rork | Claude Code CLI | Codex CLI | Loki Mode |
+|---------|:---------------------:|:----:|:--------------:|:---------:|:---------:|
+| **Primary Focus** | Web automation | Mobile apps | Coding assistant | Coding assistant | PRD-to-deploy |
+| **Open Source** | Partial (Agent-E only) | No | Source-available | Yes (Apache-2.0) | Yes |
+| **Multi-Provider** | Yes (OpenAI, Azure, Ollama) | Yes (Gemini, Claude) | Partial (Claude models via Bedrock/Vertex/Foundry) | No (GPT only) | Yes (5 providers, 3+ model families) |
+| **Multi-Agent** | Yes (2-agent model) | No | Yes (coordinated teams) | Yes (experimental) | Yes (41 agent types) |
+| **Autonomous Iteration** | No (task-level) | No | Partial (/loop, /schedule) | No (requires prompting) | Yes (RARV loop + completion council) |
+| **SDLC Pipeline** | No | No | No | No | Yes (9 phases) |
+| **Code Review** | No | No | Yes (single-pass) | Yes (single-pass) | Yes (3-reviewer blind) |
+| **Quality Gates** | No | No | No | No | Yes (10 gates) |
+| **Anti-Sycophancy** | No | No | No | No | Yes (devil's advocate) |
+| **Memory System** | Enterprise only | No | CLAUDE.md + auto-memory | Session resumption | Episodic/semantic/procedural |
+| **Self-Hosted** | Partial (Agent-E) | No | Partial (CLI local, but subscription or API required) | Yes (with API key) | Yes (fully, any provider API key) |
+| **CI/CD Integration** | No | No | Yes (GH Actions, GitLab) | Yes (GH Action) | Yes (built-in) |
+| **Complexity Detection** | No | No | No | No | Yes (auto-tier) |
+| **Budget Controls** | No | Credit system | Rate limits | Rate limits | Yes (circuit breaker) |
+| **Legacy System Healing** | No | No | No | No | Yes |
+| **Benchmark (SWE-bench)** | N/A | N/A | 80.9% (Opus 4.5) | 56.8% (Pro) | Configurable (uses any model) |
+| **CLI Interface** | No (Python API) | No (web UI) | Yes | Yes | Yes |
+| **IDE Integration** | No | No | Yes (VS Code, JetBrains) | No | Yes (VS Code extension) |
+| **MCP Support** | No | No | Yes | Yes | Yes (15 tools) |
+| **Cost (Heavy Use)** | Enterprise contract | $100/mo | $200/mo or API | $20-200/mo or API | $0 + API costs |
+| **Context Window** | Model-dependent | N/A | 200K tokens | Model-dependent | Model-dependent |
+---
+## Market Map
+```
+                    AUTONOMOUS <----------------------------> ASSISTED
+                         |                                       |
+    E   Loki Mode        |                                       |
+    N   (PRD-to-deploy,  |                                       |
+    T   multi-provider,  |                                       |
+    E   self-hosted)     |                                       |
+    R                    |                                       |
+    P   Emergence AI     |                                       |
+    R   Platform         |                                       |
+    I   (enterprise      |                                       |
+    S   workflow          |                                       |
+    E   automation)      |                                       |
+        -----------------+---------------------------------------+
+    P                    |                    Claude Code CLI     |
+    R                    |                    (complex codebase   |
+    O                    |                     editing, reviews)  |
+    S                    |                                       |
+    U                    |                    Codex CLI           |
+    M                    |                    (fast prototyping,  |
+    E                    |                     open source)       |
+    R                    |                                       |
+                         |              Rork                     |
+        -----------------+-------(mobile app------gen)----------+
+                         |                                       |
+                    AUTONOMOUS                              ASSISTED
+```
+### Segment Breakdown
+| Segment | Players | Target User | Price Range |
+|---------|---------|-------------|-------------|
+| **Autonomous SDLC** | Loki Mode | Engineering teams, solo developers, startups | $0 + API costs |
+| **Enterprise Automation** | Emergence AI Platform | Enterprise data/ops teams | $$$$ (custom) |
+| **Developer Coding Assistant** | Claude Code CLI, Codex CLI | Professional developers | $20-200/mo |
+| **No-Code App Builder** | Rork | Non-technical founders, PMs | $0-100/mo |
+| **Web Automation** | Agent-E (open source) | Automation engineers | $0 + API costs |
+### Who Targets Whom
+- **Claude Code CLI** and **Codex CLI** compete head-to-head for professional developers. Claude Code leads on reasoning depth; Codex leads on speed and open-source ethos.
+- **Rork** targets a completely different audience (non-coders wanting mobile apps) and does not compete with developer tools.
+- **Emergence AI** targets enterprise workflow automation, not software development.
+- **Loki Mode** sits in an underserved category -- autonomous end-to-end software development -- that none of the others fully address.
+---
+## Gaps and Opportunities
+### Gap 1: No Competitor Offers End-to-End Autonomous SDLC
+Claude Code and Codex CLI are powerful coding assistants, but they operate at the task level. Claude Code's `/loop` and `/schedule` commands provide task-level automation (repeat a prompt, run periodic checks), and its Agent Teams can coordinate sub-agents -- but these are building blocks, not a complete SDLC pipeline. No competitor provides:
+- Autonomous PRD parsing and task decomposition into phased work
+- Multi-phase SDLC execution (plan, implement, test, review, deploy) with phase gates
+- Continuous iteration loops with RARV cycle and model tier selection
+- Completion detection via council voting (not just "did the task finish")
+- Budget circuit breakers, stagnation detection, and complexity auto-tiering
+**Opportunity:** Loki Mode is the only tool that takes a PRD and autonomously produces a deployed, tested, reviewed product. Claude Code could theoretically be scripted to do this externally, but Loki Mode has it built in as a first-class capability. This is a category-defining advantage, though the window may narrow as Claude Code's Agent SDK matures.
+### Gap 2: No True Multi-Model-Family Flexibility
+Every competitor is locked to a single model family (even if hostable on multiple clouds):
+- Claude Code: Claude models only (hostable on Bedrock, Vertex AI, Foundry -- but still Claude)
+- Codex CLI: OpenAI models only
+- Emergence AI: Primarily OpenAI, some alternatives via LiteLLM
+- Rork: Internal (Gemini + Claude, not user-selectable)
+**Nuance:** Claude Code's third-party provider support (Bedrock, Vertex AI) gives enterprises deployment flexibility, but not model diversity. You cannot run GPT or Gemini models through Claude Code.
+**Opportunity:** Loki Mode supports 5 providers spanning 3+ model families (Claude, GPT, Gemini) with automatic degraded-mode handling. This enables cost optimization (use cheaper models for simple tasks), redundancy (failover between providers), and leveraging provider-specific strengths (Claude for reasoning, GPT for speed).
+### Gap 3: No Structured Quality Assurance Pipeline
+Claude Code and Codex CLI offer single-pass code review. Neither provides:
+- 3-reviewer blind parallel review
+- Anti-sycophancy checks (devil's advocate on unanimous approval)
+- Severity-based blocking gates
+- Test coverage enforcement (>80% unit, 100% pass)
+- Static analysis integration (CodeQL, ESLint)
+**Opportunity:** Loki Mode's 10-gate quality system provides enterprise-grade assurance that no competitor matches.
+### Gap 4: No Persistent Cross-Project Learning
+- Claude Code has CLAUDE.md and auto-memory (session-scoped)
+- Codex CLI has session resumption only
+- Neither has episodic-to-semantic memory consolidation
+- Neither learns patterns across projects
+**Opportunity:** Loki Mode's memory system (episodic, semantic, procedural) with consolidation pipeline and vector search enables compound learning. Solutions discovered in one project are available in the next.
+### Gap 5: No Legacy System Healing
+No competitor addresses the challenge of modernizing legacy codebases:
+- No friction-as-semantics analysis
+- No failure-first learning
+- No institutional knowledge preservation
+- No incremental healing phases
+**Opportunity:** Loki Mode's healing capability (inspired by Amazon AGI Lab research) is unique in the market.
+### Gap 6: Self-Hosted Operation Without Vendor Lock-in
+- Claude Code requires Anthropic subscription or API
+- Codex CLI requires OpenAI subscription or API
+- Rork is fully cloud-hosted (no self-hosting)
+- Emergence Platform requires enterprise contract
+**Opportunity:** Loki Mode is fully self-hosted, works with any supported provider's API key, and has no cloud dependency. This matters for regulated industries, air-gapped environments, and cost-conscious teams.
+### Gap 7: No Mobile-to-Enterprise Bridging
+Rork generates mobile apps but cannot handle backends, APIs, or infrastructure. Claude Code and Codex CLI cannot generate mobile apps end-to-end from a description. No tool bridges the full stack autonomously.
+**Opportunity:** Loki Mode's PRD templates (13 types including SaaS, CLI, Discord bot) could expand to include mobile app generation, combining Rork's accessibility with developer-grade quality gates.
+---
+## Differentiation Strategy for Loki Mode
+### Primary Positioning
+**"The only autonomous system that takes a PRD to a deployed, tested, reviewed product -- using any AI provider you choose."**
+This positioning highlights three unique capabilities no competitor offers together:
+1. **Autonomous SDLC** (not just coding assistance)
+2. **Multi-provider** (not locked to one vendor)
+3. **Quality-assured** (10-gate system, 3-reviewer blind review)
+### Differentiation by Competitor
+#### vs. Claude Code CLI
+| Dimension | Claude Code | Loki Mode |
+|-----------|-------------|-----------|
+| Autonomy | Semi-autonomous (/loop, /schedule, but no SDLC orchestration) | Fully autonomous (RARV loop + completion council) |
+| Scope | Individual coding tasks, PR reviews | Full SDLC pipeline (9 phases) |
+| Providers | Claude models only (multi-cloud hosting) | 5 providers, 3+ model families |
+| Quality | Single-pass review, GitHub Action | 10-gate, 3-reviewer blind system, anti-sycophancy |
+| Memory | CLAUDE.md + auto-memory (session-scoped) | Episodic/semantic/procedural (cross-project) |
+| Cost model | Subscription with rate limits or API | Self-hosted, pay-per-token, any provider |
+| IDE/surface | Terminal, VS Code, JetBrains, Desktop, Web | Terminal, VS Code (via extension) |
+| **Loki Mode advantage:** | End-to-end SDLC autonomy, multi-model flexibility, structured quality |
+| **Claude Code advantage:** | Broader surface coverage (IDE, desktop, web, mobile), stronger single-task capabilities |
+#### vs. Codex CLI
+| Dimension | Codex CLI | Loki Mode |
+|-----------|-----------|-----------|
+| Autonomy | Assisted (human prompts each task) | Fully autonomous |
+| Open source | Yes (Apache-2.0) | Yes |
+| Speed | 240+ tokens/sec | Depends on provider |
+| Providers | OpenAI only | 5 providers |
+| Multi-agent | Experimental (isolated) | 41 agent types, 8 swarms |
+| Quality | Single-pass review | 10-gate system |
+| **Loki Mode advantage:** | Autonomous pipeline, multi-provider, mature multi-agent |
+#### vs. Emergence AI
+| Dimension | Emergence AI | Loki Mode |
+|-----------|-------------|-----------|
+| Focus | Web/workflow automation | Software development |
+| Pricing | Enterprise contracts | Free + API costs |
+| Self-hosted | VPC option | Fully self-hosted |
+| Open source | Partial | Yes |
+| **Loki Mode advantage:** | Purpose-built for software, open source, accessible pricing |
+#### vs. Rork
+| Dimension | Rork | Loki Mode |
+|-----------|------|-----------|
+| Focus | Mobile apps (no-code) | Full-stack software |
+| Target user | Non-technical | Developers + technical teams |
+| Quality | No testing/review | 10-gate quality system |
+| Output | Mobile app only | Any software type |
+| **Loki Mode advantage:** | Developer-grade, full-stack, quality-assured |
+### Messaging Framework
+**For developers currently using Claude Code or Codex CLI:**
+"You already use AI for coding. Loki Mode makes it autonomous -- give it a PRD, and it handles planning, implementation, testing, code review, and deployment. Keep using Claude or Codex under the hood."
+**For engineering leaders evaluating AI tooling:**
+"Loki Mode is the only open-source system with enterprise-grade quality gates (10 gates, 3-reviewer blind review, anti-sycophancy checks) that runs autonomously on any AI provider. Self-hosted, no vendor lock-in."
+**For startups and solo developers:**
+"Go from idea to deployed product overnight. Write a PRD, invoke Loki Mode, and let it build, test, and deploy while you sleep. Works with your existing Claude or OpenAI API key."
+---
+## Strategic Recommendations
+### 1. Position Against Claude Code and Codex CLI as the "Orchestration Layer"
+Rather than competing as another coding assistant, position Loki Mode as the layer above Claude Code and Codex CLI. Frame it as: "Claude Code and Codex are excellent coding agents. Loki Mode orchestrates them into an autonomous development pipeline."
+This avoids direct feature-by-feature competition on model quality (where Anthropic and OpenAI will always have advantages) and instead competes on workflow orchestration (where Loki Mode is uniquely strong).
+### 2. Publish Autonomous Benchmark Results
+Claude Code publishes SWE-bench scores. Codex CLI publishes Terminal-Bench scores. Loki Mode should publish "PRD-to-Deploy" benchmark results:
+- Time from PRD to working deployed product
+- Lines of code generated per PRD
+- Test coverage achieved autonomously
+- Code review gate pass rates
+- Comparison of multi-provider results (same PRD, different providers)
+This creates a new benchmark category that competitors cannot match because they do not offer end-to-end SDLC.
+### 3. Build a "Bring Your Own Agent" Ecosystem
+Loki Mode already supports 5 providers. Extend this to support any CLI agent as a provider through a plugin interface. This turns every new coding agent (Aider, Cline, future entrants) into a Loki Mode capability rather than a competitor.
+### 4. Target the "AI-Native Startup" Segment
+Solo founders and small teams using AI to build products overnight represent a growing segment that no competitor specifically targets:
+- Claude Code is for professional developers doing daily work
+- Codex CLI is for terminal-native developers
+- Rork is for non-technical mobile app builders
+- Emergence AI is for enterprise operations
+Loki Mode can own the "AI-native startup" segment with PRD templates, automated deployment, and cost-efficient multi-provider operation.
+### 5. Strengthen the Mobile/Frontend Story
+Rork demonstrates demand for AI-generated mobile apps, but with serious quality problems (2.9/5 Trustpilot). Loki Mode could add a mobile PRD template that generates React Native/Expo apps through the same quality-gated pipeline, offering Rork's convenience with developer-grade reliability.
+### 6. Enterprise Self-Hosted Narrative
+Emergence AI charges enterprise contract rates for VPC deployment. Loki Mode is already fully self-hosted and free. For regulated industries (finance, healthcare, government) that cannot use cloud-hosted AI tools, Loki Mode with BYOK API access is compelling:
+- No data leaves your infrastructure (beyond API calls)
+- Audit trail via `.loki/` state files
+- Configurable security controls (LOKI_SANDBOX_MODE, LOKI_BLOCKED_COMMANDS)
+### 7. Watch List -- Emerging Threats
+| Threat | Severity | Timeline | Mitigation |
+|--------|----------|----------|------------|
+| Claude Code Agent SDK enables custom SDLC pipelines | HIGH | 3-6 months | Deepen multi-provider advantage (Agent SDK is Claude-only); publish PRD-to-Deploy benchmarks that prove the integrated pipeline |
+| Claude Code adds autonomous mode natively | HIGH | 6-12 months | Quality gates, memory system, and multi-provider flexibility are structural advantages that cannot be replicated by adding a single feature |
+| Codex CLI adds orchestration layer | MEDIUM | 6-12 months | Codex is OpenAI-only; emphasize 41 agent types, cross-project memory, healing |
+| New entrant builds "Rork for full-stack" | MEDIUM | 6-12 months | Add mobile PRD template; Loki Mode's quality gates differentiate from naive generation |
+| Open-source Auto-Claude gains traction | LOW | Ongoing | Already adopted key patterns (v3.4.0); maintain feature lead |
+| Enterprise CI/CD platforms (GitHub, GitLab) add native AI SDLC | HIGH | 12-18 months | Self-hosted, provider-agnostic positioning; these will be vendor-locked |
+### 8. Key Risk: Claude Code Agent SDK
+The most significant near-term competitive threat is Anthropic's Agent SDK (https://platform.claude.com/docs/en/agent-sdk/overview). It allows developers to build custom agents with Claude Code's tools and capabilities, with full control over orchestration, tool access, and permissions. This means:
+- A motivated team could build a Loki-Mode-like SDLC pipeline on top of Claude Code's Agent SDK
+- It would have native access to Claude Code's broader surface area (VS Code, JetBrains, Desktop, Web)
+- It would benefit from Anthropic's ongoing model improvements
+**However, Loki Mode's structural advantages remain:**
+1. **Multi-provider:** Agent SDK is Claude-only. Loki Mode works with any provider.
+2. **Battle-tested pipeline:** 10 quality gates, completion council, healing, memory -- these took months to build and validate. A new Agent SDK project starts from zero.
+3. **Open source and self-hosted:** No dependency on Anthropic's platform decisions.
+4. **Research foundation:** Built on patterns from OpenAI, DeepMind, Anthropic, and academic research. Not just engineering, but applied AI safety research (Constitutional AI, anti-sycophancy, alignment faking detection).
+---
+## Sources
+### Emergence AI
+- [Emergence AI Platform](https://www.emergence.ai/platform)
+- [Agent-E GitHub Repository](https://github.com/EmergenceAI/Agent-E)
+- [Emergence AI SOTA Results on WebVoyager Benchmark](https://www.emergence.ai/blog/agent-e-sota)
+- [Emergence AI Multi-Agent Orchestrator](https://www.emergence.ai/blog/introducing-the-emergence-multi-agent-orchestrator)
+### Rork
+- [Rork Official Site](https://rork.com)
+- [Rork AI Review 2026 -- Rapid Dev](https://www.rapidevelopers.com/blog/rork-ai-review)
+- [Rork AI Review 2026: Pricing, Features & Honest Verdict -- No Code MBA](https://www.nocode.mba/articles/rork-ai-review-2026)
+- [Rork Review -- Medium](https://medium.com/@e2larsen/rork-com-review-can-this-no-code-platform-really-build-your-mobile-app-d17f32bd2870)
+- [Rork on Product Hunt](https://www.producthunt.com/products/rork-app-for-ios)
+### Claude Code CLI
+- [Claude Code Overview](https://code.claude.com/docs/en/overview)
+- [Claude Code GitHub](https://github.com/anthropics/claude-code)
+- [Claude Code on Amazon Bedrock](https://code.claude.com/docs/en/amazon-bedrock)
+- [Claude Code Scheduled Tasks](https://code.claude.com/docs/en/scheduled-tasks)
+- [Claude Code Agent SDK](https://platform.claude.com/docs/en/agent-sdk/overview)
+- [Claude Code Pricing -- ClaudeLog](https://claudelog.com/claude-code-pricing/)
+- [Claude Code Rate Limits -- The Register](https://www.theregister.com/2026/01/05/claude_devs_usage_limits/)
+- [Claude Code vs Codex -- Builder.io](https://www.builder.io/blog/codex-vs-claude-code)
+- [Claude Pricing Plans](https://claude.com/pricing)
+- [Claude Code Loops and Scheduling -- Medium](https://medium.com/@richardhightower/put-claude-on-autopilot-scheduled-tasks-with-loop-and-schedule-built-in-skills-43f3be5ac1ec)
+### Codex CLI
+- [Codex CLI Documentation](https://developers.openai.com/codex/cli)
+- [Codex CLI Features](https://developers.openai.com/codex/cli/features)
+- [Codex CLI GitHub](https://github.com/openai/codex)
+- [Codex Pricing](https://developers.openai.com/codex/pricing)
+- [Codex for Open Source](https://developers.openai.com/codex/open-source)
+### Comparative Analysis
+- [Codex vs Claude Code Benchmarks -- MorphLLM](https://www.morphllm.com/comparisons/codex-vs-claude-code)
+- [State of AI Coding Agents 2026 -- Medium](https://medium.com/@dave-patten/the-state-of-ai-coding-agents-2026-from-pair-programming-to-autonomous-ai-teams-b11f2b39232a)
+- [Best AI Coding Agents 2026 -- Codegen](https://codegen.com/blog/best-ai-coding-agents/)
+- [15 AI Coding Agents Tested -- MorphLLM](https://www.morphllm.com/ai-coding-agent)
+- [Agentic CLI Tools Compared -- AiMultiple](https://aimultiple.com/agentic-cli)